Home
/
Authors
/
Francisco Álvaro

Author

Francisco Álvaro

Bio: Francisco Álvaro is an academic researcher from Polytechnic University of Valencia. The author has contributed to research in topics: Handwriting recognition & Parsing. The author has an hindex of 8, co-authored 15 publications receiving 314 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Recognition of on-line handwritten mathematical expressions using 2D stochastic context-free grammars and hidden Markov models

[...]

Francisco Álvaro¹, Joan Andreu Sánchez¹, José-Miguel Benedí¹•Institutions (1)

Polytechnic University of Valencia¹

01 Jan 2014-Pattern Recognition Letters

TL;DR: In this article, a formal model for the recognition of on-line handwritten mathematical expressions using 2D stochastic context-free grammars and hidden Markov models is described.

...read moreread less

107 citations

Journal Article•DOI•

An integrated grammar-based approach for mathematical expression recognition

[...]

Francisco Álvaro¹, Joan Andreu Sánchez¹, José-Miguel Benedí¹•Institutions (1)

Polytechnic University of Valencia¹

01 Mar 2016-Pattern Recognition

TL;DR: The statistical framework of a model based on two-dimensional grammars and its associated parsing algorithm is defined and a system that implements this approach is developed and results on the large public dataset of the CROHME international competition are reported.

...read moreread less

69 citations

Proceedings Article•DOI•

Recognition of Printed Mathematical Expressions Using Two-Dimensional Stochastic Context-Free Grammars

[...]

Francisco Álvaro, Joan-Andreu S´nchez, José-Miguel Benedí

18 Sep 2011

TL;DR: A statistical framework based on two-dimensional stochastic context-free grammars has been defined that allows to jointly tackle the segmentation, symbol recognition and structural analysis of a mathematical expression by computing its most probable parsing.

...read moreread less

Abstract: In this work, a system for recognition of printed mathematical expressions has been developed. Hence, a statistical framework based on two-dimensional stochastic context-free grammars has been defined. This formal framework allows to jointly tackle the segmentation, symbol recognition and structural analysis of a mathematical expression by computing its most probable parsing. In order to test this approach a reproducible and comparable experiment has been carried out over a large publicly available (InftyCDB-1) database. Results are reported using a well-defined global dissimilitude measure. Experimental results show that this technique is able to properly recognize mathematical expressions, and that the structural information improves the symbol recognition step.

...read moreread less

48 citations

Proceedings Article•DOI•

A shape-based layout descriptor for classifying spatial relationships in handwritten math

[...]

Francisco Álvaro¹, Richard Zanibbi²•Institutions (2)

Polytechnic University of Valencia¹, Rochester Institute of Technology²

10 Sep 2013

TL;DR: This work improves existing geometric features based on bounding boxes and center points, and proposes a novel feature set for layout classification, using polar histograms computed over points in handwritten strokes, which provide a natural representation with results comparable to those for the geometric features.

...read moreread less

Abstract: We consider the difficult problem of classifying spatial relationships between symbols and subexpressions in handwritten mathematical expressions. We first improve existing geometric features based on bounding boxes and center points, normalizing them using the distance between the centers of the two symbols or subexpressions in question. We then propose a novel feature set for layout classification, using polar histograms computed over points in handwritten strokes. A series of experiments are presented in which a Support Vector Machine is used with these new features to classify spatial relationships of five types in the MathBrush corpus (horizontal, superscript, subscript, below, and inside (e.g. in a square root)). The normalized geometric features provide an improvement over previously published results, while the shape-based features provide a natural representation with results comparable to those for the geometric features. Combining the features produced a very small improvement in accuracy.

...read moreread less

35 citations

Proceedings Article•DOI•

Offline Features for Classifying Handwritten Math Symbols with Recurrent Neural Networks

[...]

Francisco Álvaro, Joan Andreu Sánchez, José-Miguel Benedí

24 Aug 2014

TL;DR: The performance of several well-known offline features for handwritten math symbol recognition are assessed and a novel set of features based on polar histograms and the vertical repositioning method for feature extraction is tested.

...read moreread less

Abstract: In mathematical expression recognition, symbol classification is a crucial step. Numerous approaches for recognizing handwritten math symbols have been published, but most of them are either an online approach or a hybrid approach. There is an absence of a study focused on offline features for handwritten math symbol recognition. Furthermore, many papers provide results difficult to compare. In this paper we assess the performance of several well-known offline features for this task. We also test a novel set of features based on polar histograms and the vertical repositioning method for feature extraction. Finally, we report and analyze the results of several experiments using recurrent neural networks on a large public database of online handwritten math expressions. The combination of online and offline features significantly improved the recognition rate.

...read moreread less

34 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

A review on the long short-term memory model

[...]

Greg Van Houdt¹, Carlos Mosquera², Gonzalo Nápoles³, Gonzalo Nápoles¹•Institutions (3)

University of Hasselt¹, Vrije Universiteit Brussel², Tilburg University³

01 Dec 2020-Artificial Intelligence Review

TL;DR: A comprehensive review of LSTM’s formulation and training, relevant applications reported in the literature and code resources implementing this model for a toy example are presented.

...read moreread less

Abstract: Long short-term memory (LSTM) has transformed both machine learning and neurocomputing fields. According to several online sources, this model has improved Google’s speech recognition, greatly improved machine translations on Google Translate, and the answers of Amazon’s Alexa. This neural system is also employed by Facebook, reaching over 4 billion LSTM-based translations per day as of 2017. Interestingly, recurrent neural networks had shown a rather discrete performance until LSTM showed up. One reason for the success of this recurrent network lies in its ability to handle the exploding/vanishing gradient problem, which stands as a difficult issue to be circumvented when training recurrent or very deep neural networks. In this paper, we present a comprehensive review that covers LSTM’s formulation and training, relevant applications reported in the literature and code resources implementing this model for a toy example.

...read moreread less

412 citations

Journal Article•DOI•

Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition

[...]

Jianshu Zhang¹, Jun Du¹, Shiliang Zhang¹, Dan Liu, Hu Yulong, Jin-Shui Hu, Si Wei, Lirong Dai¹ - Show less +4 more•Institutions (1)

University of Science and Technology of China¹

01 Nov 2017-Pattern Recognition

TL;DR: Watch, Attend and Parse (WAP), a novel end-to-end approach based on neural network that learns to recognize HMEs in a two-dimensional layout and outputs them as one-dimensional character sequences in LaTeX format, significantly outperformed the state-of-the-art method.

...read moreread less

182 citations

Journal Article•DOI•

Handwritten Optical Character Recognition (OCR): A Comprehensive Systematic Literature Review (SLR)

[...]

Jamshed Memon¹, Maira Sami², Rizwan Ahmad Khan, Mueen Uddin•Institutions (2)

International University, Cambodia¹, Shaheed Zulfiqar Ali Bhutto Institute of Science and Technology²

28 Jul 2020-IEEE Access

TL;DR: This review article serves the purpose of presenting state of the art results and techniques on OCR and also provide research directions by highlighting research gaps.

...read moreread less

Abstract: Given the ubiquity of handwritten documents in human transactions, Optical Character Recognition (OCR) of documents have invaluable practical worth. Optical character recognition is a science that enables to translate various types of documents or images into analyzable, editable and searchable data. During last decade, researchers have used artificial intelligence/machine learning tools to automatically analyze handwritten and printed documents in order to convert them into electronic format. The objective of this review paper is to summarize research that has been conducted on character recognition of handwritten documents and to provide research directions. In this Systematic Literature Review (SLR) we collected, synthesized and analyzed research articles on the topic of handwritten OCR (and closely related topics) which were published between year 2000 to 2019. We followed widely used electronic databases by following pre-defined review protocol. Articles were searched using keywords, forward reference searching and backward reference searching in order to search all the articles related to the topic. After carefully following study selection process 176 articles were selected for this SLR. This review article serves the purpose of presenting state of the art results and techniques on OCR and also provide research directions by highlighting research gaps.

...read moreread less

139 citations

Exploring Artificial Intelligence In The New Millennium

[...]

Sebastian Fischer

01 Jan 2016

TL;DR: Thank you for downloading exploring artificial intelligence in the new millennium, instead of reading a good book with a cup of coffee in the afternoon, instead they are facing with some infectious virus inside their computer.

...read moreread less

Abstract: Thank you for downloading exploring artificial intelligence in the new millennium. Maybe you have knowledge that, people have search hundreds times for their chosen novels like this exploring artificial intelligence in the new millennium, but end up in infectious downloads. Rather than reading a good book with a cup of coffee in the afternoon, instead they are facing with some infectious virus inside their computer.

...read moreread less

137 citations

Proceedings Article•DOI•

Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition

[...]

Jianshu Zhang¹, Jun Du¹, Lirong Dai¹•Institutions (1)

University of Science and Technology of China¹

01 Aug 2018

TL;DR: In this paper, the encoder-decoder model was improved by employing densely connected convolutional networks as they can strengthen feature extraction and facilitate gradient propagation especially on a small training set, and a multi-scale attention model was employed to deal with the recognition of math symbols in different scales and restore the fine-grained details dropped by pooling operations.

...read moreread less

Abstract: Handwritten mathematical expression recognition is a challenging problem due to the complicated two-dimensional structures, ambiguous handwriting input and variant scales of handwritten math symbols. To settle this problem, recently we propose the attention based encoder-decoder model that recognizes mathematical expression images from two-dimensional layouts to one-dimensional LaTeX strings. In this study, we improve the encoder by employing densely connected convolutional networks as they can strengthen feature extraction and facilitate gradient propagation especially on a small training set. We also present a novel multi-scale attention model which is employed to deal with the recognition of math symbols in different scales and restore the fine-grained details dropped by pooling operations. Validated on the CROHME competition task, the proposed method significantly outperforms the state-of-the-art methods with an expression recognition accuracy of 52.8% on CROHME 2014 and 50.1% on CROHME 2016, by only using the official training dataset.

...read moreread less

115 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53

Collapse