A rule-based approach to form mathematical symbols in printed mathematical expressions

dc.contributor.author Kumar, P. Pavan
dc.contributor.author Agarwal, Arun
dc.contributor.author Bhagvati, Chakravarthy
dc.date.accessioned 2022-03-27T05:52:19Z
dc.date.available 2022-03-27T05:52:19Z
dc.date.issued 2011-12-26
dc.description.abstract Automated understanding of mathematical expressions (MEs) is currently a challenging task due to their complex two- dimensional (2D) structure. Recognition of MEs can be online or offline and in either case, the process involves symbol recognition and analysis of 2D structure. This process is more complex for offline or printed MEs as they do not have temporal information. In our present work, we focus on the recognition of printed MEs and assume connected components (ccs) of a given ME image are labelled. Our approach to ME recognition comprises three stages,namely symbol formation, structural analysis and generation of encoding form like LATEX. In this paper, we present symbol formation process, where multi-cc symbols (like =, ≡ etc.) are formed, identity of context-dependent symbols (like a horizontal line can be MINUS, OVERBAR, FRACTION etc.) are resolved using spatial relations. Multi-line MEs like matrices and enumerated functions are also handled in this stage. A rule-based approach is proposed for the purpose, where the heuristics based on spatial relations are represented in the form of rules (knowledge) and those rules are fired depending on input data (labelled ccs). As knowledge is isolated from data like an expert system in our approach, it allows for easy adaptability and extensibility of the process. Proposed approach also handles both single-line and multi-line MEs in an unified manner. Our approach has been tested on around 800 MEs collected from various mathematical documents and experimental results are reported on them. © 2011 Springer-Verlag.
dc.identifier.citation Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). v.7080 LNAI
dc.identifier.issn 03029743
dc.identifier.uri 10.1007/978-3-642-25725-4_16
dc.identifier.uri http://link.springer.com/10.1007/978-3-642-25725-4_16
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8512
dc.subject connected components
dc.subject Mathematical expressions
dc.subject rule-based approach
dc.subject symbol formation
dc.title A rule-based approach to form mathematical symbols in printed mathematical expressions
dc.type Book Series. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: