M
Milos Raskovic
Researcher at Microsoft
Publications - 15
Citations - 140
Milos Raskovic is an academic researcher from Microsoft. The author has contributed to research in topics: Document layout analysis & Paragraph. The author has an hindex of 8, co-authored 15 publications receiving 140 citations.
Papers
More filters
Patent
Borderless table detection engine
TL;DR: A borderless table detection engine and associated method for identifying borderless tables appearing in data extracted from a fixed format document is presented in this paper. But, due to the lack of visible borders, reliable automated detection of a borderless Table is difficult.
Patent
Detection and Reconstruction of East Asian Layout Features in a Fixed Format Document
TL;DR: In this article, a fixed format document is detected and rotated for layout analysis, and the rotated text is rotated back and restructured in a flow format document, which is used to detect East Asian layout features.
Patent
Fixed format document conversion engine
Milos Lazarevic,Milos Raskovic,Aljosa Obuljen,Milan Sesum,Dusan Radovanovic,Drazen Zaric,Aleksandar Tomic,Dragan Slaveski +7 more
TL;DR: In this paper, a fixed format document conversion engine and associated method for converting a fixed-format document into a flow format document is presented. Butler et al. use a sequence of layout analysis engines and semantic analysis engines to analyze the base physical layout information obtained from the fixed-formatted document to enrich, modify, and classify the layout information into progressively more advanced physical layout and semantic layout information.
Patent
Vector Graphics Classification Engine
TL;DR: A vector graphics classification engine and associated method for classifying vector graphics in a fixed format document is described in this paper, and illustrated in the accompanying figures, which defines a pipeline for categorizing vector graphics parsed from the fixed format documents as font, text, paragraph, table, and page effects.
Patent
Formula Detection Engine
TL;DR: In this article, the authors propose a formula detection engine that locates formulas within a fixed format document portion by identifying formula seeds and creating and expanding a boundary around the formula seed to define a formula area.