scispace - formally typeset
M

Milos Raskovic

Researcher at Microsoft

Publications -  15
Citations -  140

Milos Raskovic is an academic researcher from Microsoft. The author has contributed to research in topics: Document layout analysis & Paragraph. The author has an hindex of 8, co-authored 15 publications receiving 140 citations.

Papers
More filters
Patent

Borderless table detection engine

TL;DR: A borderless table detection engine and associated method for identifying borderless tables appearing in data extracted from a fixed format document is presented in this paper. But, due to the lack of visible borders, reliable automated detection of a borderless Table is difficult.
Patent

Detection and Reconstruction of East Asian Layout Features in a Fixed Format Document

TL;DR: In this article, a fixed format document is detected and rotated for layout analysis, and the rotated text is rotated back and restructured in a flow format document, which is used to detect East Asian layout features.
Patent

Fixed format document conversion engine

TL;DR: In this paper, a fixed format document conversion engine and associated method for converting a fixed-format document into a flow format document is presented. Butler et al. use a sequence of layout analysis engines and semantic analysis engines to analyze the base physical layout information obtained from the fixed-formatted document to enrich, modify, and classify the layout information into progressively more advanced physical layout and semantic layout information.
Patent

Vector Graphics Classification Engine

TL;DR: A vector graphics classification engine and associated method for classifying vector graphics in a fixed format document is described in this paper, and illustrated in the accompanying figures, which defines a pipeline for categorizing vector graphics parsed from the fixed format documents as font, text, paragraph, table, and page effects.
Patent

Formula Detection Engine

TL;DR: In this article, the authors propose a formula detection engine that locates formulas within a fixed format document portion by identifying formula seeds and creating and expanding a boundary around the formula seed to define a formula area.