The hOCR Microformat for OCR Workflow and Results
Citations
239 citations
145 citations
84 citations
Cites methods from "The hOCR Microformat for OCR Workfl..."
...As part of his work on OCRopus, Breuel also developed the very interesting hOCR microformat, designed to describe OCR workflow and results in a flexible and open manner [4]....
[...]
75 citations
22 citations
References
37,183 citations
238 citations
"The hOCR Microformat for OCR Workfl..." refers methods in this paper
...The combination of logical markup and typesetting markup permits us to use hOCR as an intermediate format for performing OCR as model-based reverse typesetting, an approach advocated, for example by Kopec and Chou [7]....
[...]
154 citations
"The hOCR Microformat for OCR Workfl..." refers methods in this paper
...the basis format, together with CSS (cascading style sheets) [4, 8] for representing typographic markup, and to enhance this format by embedding additional information using facilities of standard HTML....
[...]
105 citations
"The hOCR Microformat for OCR Workfl..." refers background in this paper
...We can distinguish three major classes of OCR output formats: logical formats, suitable for direct use of OCR results by end users (RTF, HTML, LaTeX, and Microsoft Word), OCR engine-specific formats [5, 9], and benchmarking formats [11, 10] proposed for benchmarking various aspects of OCR systems....
[...]