Searching OCR'ed Text: An LDA Based Approach
Citations
10 citations
Cites methods from "Searching OCR'ed Text: An LDA Based..."
...Second Column Shows the Retrieved Results in decreasing order of Rank. across the dataset....
[...]
...To improve recall, we have also indexed word images and used query expansion as explained in Section 2.3 to formulate query histogram based on initial results of text query index....
[...]
...Second category does it mostly in the image domain by matching them in some appropriate feature space....
[...]
...Learn to improve from annotated dataset In Section 2.5, we explained the use of confusion matrix to deal with the errors given by OCR....
[...]
...Since OCRs are not an immediate feasibility (see Section 3 for quantitative performance of OCR on DLI pages) for building search engines, we naturally move towards the image based search and retrieval techniques....
[...]
5 citations
Cites background from "Searching OCR'ed Text: An LDA Based..."
...In [22], topic model based indexing and retrieval framework is proposed for text documents....
[...]
4 citations
Cites methods from "Searching OCR'ed Text: An LDA Based..."
...The general LDA approach is very similar to a Principal Component Analysis (PCA) [13], [15]....
[...]
...LDA is one of the methods used in statistics, pattern recognition [11] in general to find a linear combination of features that characterize or separating two or more classes of objects or events [12-15]....
[...]
...Hassan E [13] had also utilizing the SDA to make OCR (Optical Character Recognition)....
[...]
...ISSN: 2502-4752 IJEECS Vol. 4, No. 2, November 2016 : 479 – 485 482 [ ] (1) Where i = 1,2,3 of the class Now, compute the two 4x4-dimensional matrices: between class scatter matrix SB and within class scatter matrix Sw. if in the PCA is computed the average a whole images only, then in the LDA we should compute the average image contained in one class....
[...]
3 citations
References
30,570 citations
"Searching OCR'ed Text: An LDA Based..." refers background in this paper
...The topic model based indexing groups different terms occurring in the text document based on their semantic relationship [1][2][3]....
[...]
...Latent Dirichlet Allocation (LDA) defines a generative probabilistic model over the document collection [3]....
[...]
25,546 citations
12,443 citations
4,577 citations
175 citations
"Searching OCR'ed Text: An LDA Based..." refers background in this paper
...Topic models have been extensively applied for document summarization, and indexing applications [5][6][7][8][9]....
[...]