Textual Description for Mathematical Equations
Ajoy Mondal, C. V. Jawahar, +1 more
- pp 1300-1307
TLDR
In this article, a mathematical equation description (MED) model is proposed, consisting of a convolution neural network as an encoder that extracts features of input mathematical equation images and a recurrent neural network with an attention mechanism that generates the description.

Abstract
Reading mathematical expressions or equations in document images is very challenging due to the large variability of mathematical symbols and expressions. In this paper, we pose reading a mathematical equation as the task of generating a textual description that interprets the internal meaning of the equation. Inspired by the natural image captioning problem in computer vision, we present a mathematical equation description (MED) model, a novel end-to-end trainable deep-neural-network-based approach that learns to generate a textual description for reading mathematical equation images. Our MED model consists of a convolution neural network as an encoder that extracts features of input mathematical equation images and a recurrent neural network with an attention mechanism that generates descriptions related to the input mathematical equation images. Due to the unavailability of mathematical equation image data sets with textual descriptions, we generate two data sets for experimental purposes. To validate the effectiveness of our MED model, we conduct a real-world experiment to see whether students are able to write equations by only reading or listening to their textual descriptions. Experiments conclude that the students are able to write most of the equations correctly by reading their textual descriptions alone.
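The paper does not include code on this page, but the decoder's attention step it describes follows the additive (Bahdanau-style) attention cited under References. A minimal sketch in numpy, with all names and dimensions illustrative rather than taken from the paper:

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax."""
    e = np.exp(x - np.max(x))
    return e / e.sum()

def attention_step(encoder_feats, decoder_state, W_enc, W_dec, v):
    """One additive attention step, as in a MED-style decoder:
    score each CNN feature vector against the current RNN hidden
    state, then form a weighted context vector.

    encoder_feats: (L, D) grid of CNN features from the equation image
    decoder_state: (H,)  current RNN hidden state
    W_enc: (A, D), W_dec: (A, H), v: (A,)  learned projections
    """
    # Alignment scores e_i = v^T tanh(W_enc f_i + W_dec h)
    scores = np.tanh(encoder_feats @ W_enc.T + decoder_state @ W_dec.T) @ v
    alpha = softmax(scores)            # attention weights over image regions
    context = alpha @ encoder_feats    # (D,) weighted sum of features
    return context, alpha

# Toy example: 4 feature vectors of dim 3, hidden size 2, attention dim 5
rng = np.random.default_rng(0)
feats = rng.normal(size=(4, 3))
h = rng.normal(size=2)
ctx, alpha = attention_step(feats, h,
                            rng.normal(size=(5, 3)),
                            rng.normal(size=(5, 2)),
                            rng.normal(size=5))
```

At each decoding step the context vector is fed to the RNN alongside the previously emitted word, so the model attends to different regions of the equation image as it generates each token of the description.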
Citations
Journal Article (DOI)
Automatic adaptation of open educational resources: an approach from a multilevel methodology based on students’ preferences, educational special needs, artificial intelligence, and accessibility metadata
Paola Ingavélez-Guerra, Vladimir Robles-Bykbaev, Angel Perez-Munoz, José Ramón Hilera, Salvador Otón Tortosa, +4 more
TL;DR: The research aims to contribute an automated support tool for generating accessible educational resources that are correctly labeled for search and reuse, and to support researchers in artificial intelligence applications addressing challenges and opportunities in the field of virtual education.
Book Chapter (DOI)
Classroom Slide Narration System
TL;DR: In this article, the authors proposed a Classroom Slide Segmentation Network (CSSN) that generates audio descriptions corresponding to the slide content, posed as an image-to-markup-language generation task.
References
Proceedings Article (DOI)
Deep Residual Learning for Image Recognition
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won 1st place in the ILSVRC 2015 classification task.
Journal Article (DOI)
Long short-term memory
TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
Proceedings Article (DOI)
ImageNet: A large-scale hierarchical image database
TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings Article (DOI)
Bleu: a Method for Automatic Evaluation of Machine Translation
TL;DR: This paper proposed a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, correlates highly with human evaluation, and has little marginal cost per run.
Proceedings Article
Neural Machine Translation by Jointly Learning to Align and Translate
TL;DR: It is conjectured that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and it is proposed to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.