Book ChapterDOI
Techniques, Applications, and Issues in Mining Large-Scale Text Databases
Sandhya Avasthi,Ritu Chauhan,D. P. Acharjya +2 more
- pp 385-396
TLDR
The main objective is to review text mining techniques, application areas, and existing issues.Abstract:
The discovery of knowledge from large-scale text data or semi-structured data is very difficult. In text mining, useful information is extracted out of such large text corpus which fulfills a user current information need. This process is being exploited by various organizations for quality improvement, business need, and understanding user behavior. The text available in unstructured and semi-structured form can come through sources such as medical, financial, market, scientific, and others documents. Text mining applies quantitative approach to analyze massive amount of textual data and tries to solve information overload problem. The main objective is to review text mining techniques, application areas, and existing issues.read more
Citations
More filters
Application of the R-Tree Clustering Model in Medical Information Retrieval
TL;DR: Experiments and empirical evaluation show that the proposed R-tree clustering model index improves data retrieval eciency and the superiority of the method is proved by simulations.
Journal ArticleDOI
A hybrid approach of Poisson distribution LDA with deep Siamese Bi-LSTM and GRU model for semantic similarity prediction for text data
D. Viji,S.R. Revathy +1 more
Proceedings ArticleDOI
A Transfer Learning Framework For Annotating Implementation-Specific Corpus
TL;DR: In this article , a transfer learning-based approach trained on a corpus of annotated domain-level text and semantic tags is presented to annotate implementation-level process information, and the use of state-of-the-art Skip-gram, GloVe, ELMO, and BERT-based learning models is compared.
Journal ArticleDOI
A Secure Decentralized E-Voting with Blockchain & Smart Contracts
TL;DR: In this article , the authors proposed a blockchain-powered e-voting system using smart contracts and OTP Verification and face verification, which is a MERN-based web application with upgraded authentication and permission techniques.
Proceedings ArticleDOI
Tourist reviews summarization and sentiment analysis based on aspects
TL;DR: In this article , a framework is proposed based on extracting coherent aspects from the reviews and applying the extractive summarization method to generate summaries, providing insights into the reviews of tourist attractions by using aspect-based sentiment analysis.
References
More filters
Book
Foundations of Statistical Natural Language Processing
TL;DR: This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear and provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations.
Journal ArticleDOI
Data-intensive applications, challenges, techniques and technologies: A survey on Big Data
TL;DR: This paper is aimed to demonstrate a close-up view about Big Data, including Big Data applications, Big Data opportunities and challenges, as well as the state-of-the-art techniques and technologies currently adopt to deal with the Big Data problems.
Journal ArticleDOI
Minimum redundancy feature selection from microarray gene expression data.
Chris Ding,Hanchuan Peng +1 more
TL;DR: How to selecting a small subset out of the thousands of genes in microarray data is important for accurate classification of phenotypes.
Journal ArticleDOI
A Survey of Recent Advances in Hierarchical Clustering Algorithms
TL;DR: A general framework for hierarchical, agglomerative clustering algorithms is discussed in this article, which opens up the prospect of much improvement on current, widely-used clustering methods.
Journal ArticleDOI
A survey of current work in biomedical text mining
Aaron Cohen,William R. Hersh +1 more
TL;DR: The major challenge of biomedical text mining over the next 5-10 years will require enhanced access to full text, better understanding of the feature space of biomedical literature, better methods for measuring the usefulness of systems to users, and continued cooperation with the biomedical research community to ensure that their needs are addressed.