Text Document Analysis Using Map-Reduce Framework

doi:10.1007/978-981-10-8237-5_57

Book Chapter

K. V. Kanimozhi, +2 more

pp. 585-594

TLDR

The results show the advantage of the proposed map-reduce algorithm: it detects clusters of document features in less computation time and offers a strong solution for increasing retrieval precision in information extraction.

Abstract:

Owing to the advance of the Internet and increasing globalization, electronic forms of information are growing rapidly. Extracting useful hidden information from these many documents is a current challenge. Hence, an efficient, automated clustering algorithm that effectively identifies topics plays a central role in information retrieval. In this paper, we analyze a large corpus of unstructured text documents using our proposed map-reduce algorithm. The results show the advantage of the proposed method: it detects clusters of document features in less computation time and offers a strong solution for increasing retrieval precision in information extraction.

Citations

Prediction of disease and suggestion of specialist using big data techniques

References

Latent dirichlet allocation

Latent Dirichlet Allocation

A Survey of Text Clustering Algorithms

A framework for understanding Latent Semantic Indexing (LSI) performance

TopCat: data mining for topic identification in a text corpus

Related Papers (5)

A Novel Map-Reduce Based Augmented Clustering Algorithm for Big Text Datasets

Document Clustering based on Topic Maps

Text clustering using statistical and semantic data

Incremental document clustering using cluster similarity histograms

A Survey on optimization approaches to text document clustering