scispace - formally typeset
Journal ArticleDOI

An advanced review on text mining in medicine

TLDR
In this review, more than 90 relevant research studies have been analyzed, describing the most important practical applications, terminological resources, tools, and open challenges of TM in medicine.
Abstract
Health care professionals produce abundant textual information in their daily clinical practice and this information is stored in many diverse sources and, generally, in textual form. The extraction of insights from all the gathered information, mainly unstructured and lacking normalization, is one of the major challenges in computational medicine. In this respect, text mining (TM) assembles different techniques to derive valuable insights from unstructured textual data so it has led to be especially relevant in medicine. The aim of this paper is therefore to provide an extensive review of existing techniques and resources to perform TM tasks in medicine. In this review, more than 90 relevant research studies have been analyzed, describing the most important practical applications, terminological resources, tools, and open challenges of TM in medicine.

read more

Citations
More filters
Journal ArticleDOI

Text mining in education

TL;DR: This work presents a systematic overview of the current status of the Educational Text Mining field, answering three main research questions: which are the text mining techniques most used in educational environments?
Journal ArticleDOI

A critical review of text-based research in construction: Data source, analysis method, and implications

TL;DR: A comprehensive review of text analytics finds that the ontology- and rule-based approach has been dominant, at the same time, recent research has attempted to apply the state-of-the-art machine learning methods.
Journal ArticleDOI

Evolving Role and Future Directions of Natural Language Processing in Gastroenterology.

TL;DR: This manuscript provides a clinically focused review of NLP systems in GI practice, presenting state-of-the-art methods and typical use cases within four prominent areas of gastroenterology including endoscopy, inflammatory bowel disease, pancreaticobiliary, and liver diseases.
Journal ArticleDOI

The Secondary Use of Electronic Health Records for Data Mining: Data Characteristics and Challenges

TL;DR: An overview of information found in EHR systems and their characteristics that could be utilized for secondary applications is provided and can serve as a primer for researchers to understand the use of EHRs for data mining and analytics purposes.
Book ChapterDOI

Designing Explainable Text Classification Pipelines: Insights from IT Ticket Complexity Prediction Case Study

TL;DR: In this article, the authors investigate the core design elements of a typical text classification pipeline and their contribution to the overall performance of the system, in particular, text representation techniques and classification algorithms, in the context of their explainability, providing ultimate insights from their IT ticket complexity prediction case study.
References
More filters
Book

C4.5: Programs for Machine Learning

TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and over hitting.
Posted Content

Efficient Estimation of Word Representations in Vector Space

TL;DR: This paper proposed two novel model architectures for computing continuous vector representations of words from very large data sets, and the quality of these representations is measured in a word similarity task and the results are compared to the previously best performing techniques based on different types of neural networks.
Proceedings Article

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

TL;DR: This work presents iterative parameter estimation algorithms for conditional random fields and compares the performance of the resulting models to HMMs and MEMMs on synthetic and natural-language data.
Proceedings ArticleDOI

Convolutional Neural Networks for Sentence Classification

TL;DR: The CNN models discussed herein improve upon the state of the art on 4 out of 7 tasks, which include sentiment analysis and question classification, and are proposed to allow for the use of both task-specific and static vectors.