Open Access · Journal Article

Correction: Extracting Family History Information From Electronic Health Records: Natural Language Processing Analysis

TL;DR
The authors used transformers to extract disease mentions from clinical notes, rule-based methods and coreference resolution techniques to extract family member (FM) information, and transfer learning strategies to improve the annotation of diseases.
Abstract
Background: The prognosis, diagnosis, and treatment of many genetic disorders and familial diseases significantly improve if the family history (FH) of a patient is known. Such information is often written in the free text of clinical notes.

Objective: The aim of this study is to develop automated methods that enable access to FH data through natural language processing.

Methods: We performed information extraction by using transformers to extract disease mentions from notes. We also experimented with rule-based methods for extracting family member (FM) information from text and coreference resolution techniques. We evaluated different transfer learning strategies to improve the annotation of diseases. We provided a thorough error analysis of the contributing factors that affect such information extraction systems.

Results: Our experiments showed that the combination of domain-adaptive pretraining and intermediate-task pretraining achieved an F1 score of 81.63% for the extraction of diseases and FMs from notes when it was tested on a public shared task data set from the National Natural Language Processing Clinical Challenges (N2C2), providing a statistically significant improvement over the baseline (P<.001). In comparison, in the 2019 N2C2/Open Health Natural Language Processing Shared Task, the median F1 score of all 17 participating teams was 76.59%.

Conclusions: Our approach, which leverages a state-of-the-art named entity recognition model for disease mention detection coupled with a hybrid method for FM mention detection, achieved an effectiveness that was close to that of the top 3 systems participating in the 2019 N2C2 FH extraction challenge, with only the top system convincingly outperforming our approach in terms of precision.
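The Methods pair a transformer-based disease tagger with rule-based detection of family member (FM) mentions. Below is a minimal sketch of what such FM rules can look like; the lexicon, the side-of-family cues, and the example note are illustrative assumptions, not the authors' actual rule set.

```python
# A minimal sketch of rule-based family-member (FM) mention detection.
# The lexicon and side-of-family cues are hypothetical examples.
import re

FAMILY_MEMBERS = ["mother", "father", "sister", "brother", "daughter", "son",
                  "aunt", "uncle", "grandmother", "grandfather", "cousin"]
SIDE_CUES = ["maternal", "paternal"]

FM_PATTERN = re.compile(
    r"\b(?:(?P<side>" + "|".join(SIDE_CUES) + r")\s+)?"
    r"(?P<member>" + "|".join(FAMILY_MEMBERS) + r")s?\b",
    re.IGNORECASE,
)

def extract_family_members(text):
    """Return (family member, side of family) mentions found by the rules."""
    mentions = []
    for m in FM_PATTERN.finditer(text):
        side = m.group("side").lower() if m.group("side") else "NA"
        mentions.append((m.group("member").lower(), side))
    return mentions

note = "Her maternal grandmother had diabetes; father died of colon cancer."
print(extract_family_members(note))
# [('grandmother', 'maternal'), ('father', 'NA')]
```

A fuller system would also cover in-law and step relations and would need to link each FM mention to nearby disease mentions, which is roughly where the coreference resolution techniques mentioned above come in.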



Citations
Journal Article

Clinician documentation of patient centered care in the electronic health record

TL;DR: In this paper, the authors explored the feasibility of using patient-centered care (PCC) documentation as a measure of the delivery of PCC in a health system, that is, of the degree to which the care delivered is patient centered.
Proceedings Article

CSIRO Data61 Team at BioLaySumm Task 1: Lay Summarisation of Biomedical Research Articles Using Generative Models

TL;DR: This paper presented a comprehensive set of experiments and analyses investigating the effectiveness of existing pre-trained language models in generating lay summaries; the team's submission ranked second on the relevance criterion and third overall among 21 competing teams.
Book Chapter

Investigating the Impact of Query Representation on Medical Information Retrieval

TL;DR: In this article, the authors investigated the effect that various types of patient-related information extracted from unstructured clinical notes have on two different tasks: patient allocation in clinical trials and medical literature retrieval.
Journal Article

Detecting Entities in the Astrophysics Literature: A Comparison of Word-based and Span-based Entity Recognition Methods

Xiang Dai et al.
TL;DR: The DEAL (Detecting Entities in the Astrophysics Literature) shared task presented in this paper was the first attempt to build systems that identify entities in a dataset composed of scholarly articles from the astrophysics literature.
References
More filters
Proceedings Article

Deep contextualized word representations

TL;DR: This paper introduced a new type of deep contextualized word representation that models both complex characteristics of word use (e.g., syntax and semantics), and how these uses vary across linguistic contexts (i.e., to model polysemy).
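The key property described here is that a word's representation depends on its sentence. The sketch below illustrates that idea with a BERT-family checkpoint from Hugging Face rather than ELMo itself, purely to keep the example self-contained; the model name and sentences are assumptions.

```python
# A minimal illustration of contextual embeddings: the same surface form
# receives different vectors in different contexts (the polysemy property).
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

def embedding_of(sentence, word):
    """Return the contextual vector of `word` within `sentence`."""
    enc = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**enc).last_hidden_state[0]      # (seq_len, hidden)
    tokens = tokenizer.convert_ids_to_tokens(enc["input_ids"][0])
    return hidden[tokens.index(word)]

v1 = embedding_of("she sat by the river bank", "bank")
v2 = embedding_of("he deposited cash at the bank", "bank")
print(torch.cosine_similarity(v1, v2, dim=0))           # noticeably below 1.0
```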
Proceedings Article

The Stanford CoreNLP Natural Language Processing Toolkit

TL;DR: This paper describes the design and use of the Stanford CoreNLP toolkit, an extensible pipeline that provides core natural language analysis, and attributes its adoption to a simple, approachable design, straightforward interfaces, the inclusion of robust, high-quality analysis components, and minimal associated baggage.
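The toolkit exposes its annotators through a pipeline and an HTTP server. Below is a minimal usage sketch that assumes a CoreNLP server is already running locally on port 9000; the example text and the chosen annotators are illustrative.

```python
# A minimal sketch of querying a locally running CoreNLP server over HTTP.
import json
import requests

text = "Her father was diagnosed with colon cancer in Rochester."
props = {"annotators": "tokenize,ssplit,pos,lemma,ner", "outputFormat": "json"}

resp = requests.post("http://localhost:9000/",
                     params={"properties": json.dumps(props)},
                     data=text.encode("utf-8"))
doc = resp.json()

# Print each token with its part-of-speech and named-entity tag.
for sentence in doc["sentences"]:
    for token in sentence["tokens"]:
        print(token["word"], token["pos"], token["ner"])
```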
Proceedings Article

Neural Machine Translation of Rare Words with Subword Units

TL;DR: This paper introduces a simpler and more effective approach, making the NMT model capable of open-vocabulary translation by encoding rare and unknown words as sequences of subword units, and empirically shows that subword models improve over a back-off dictionary baseline for the WMT 15 English-German and English-Russian translation tasks by up to 1.1 and 1.3 BLEU, respectively.
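The core idea is that a rare word is segmented into subword units by replaying merge operations learned from training data. A toy sketch of that encoding step follows; the merge table below is invented for illustration and is not a learned vocabulary.

```python
# A minimal sketch of applying learned byte-pair-encoding (BPE) merges to a word.

def bpe_encode(word, merges):
    """Split `word` into subword units by greedily applying learned merges.

    `merges` maps a symbol pair (a, b) to its priority (lower = learned earlier).
    """
    symbols = list(word) + ["</w>"]          # characters plus end-of-word marker
    while len(symbols) > 1:
        # Find the adjacent pair with the best (lowest) merge priority.
        pairs = [(merges.get((a, b), float("inf")), i)
                 for i, (a, b) in enumerate(zip(symbols, symbols[1:]))]
        best_rank, best_i = min(pairs)
        if best_rank == float("inf"):        # no learned merge applies
            break
        symbols[best_i:best_i + 2] = [symbols[best_i] + symbols[best_i + 1]]
    return symbols

# Toy merge table: the word "lowest" is split into subwords seen in training.
toy_merges = {("l", "o"): 0, ("lo", "w"): 1, ("e", "s"): 2,
              ("es", "t"): 3, ("est", "</w>"): 4}
print(bpe_encode("lowest", toy_merges))      # ['low', 'est</w>']
```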
Proceedings Article

Neural Architectures for Named Entity Recognition

TL;DR: Paper presented at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics, held in San Diego (CA, USA), June 12-17, 2016.
Journal Article

BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

TL;DR: This article proposed BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining), which is a domain-specific language representation model pre-trained on large-scale biomedical corpora.
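In the context of this work, a checkpoint such as BioBERT is a natural starting point for the disease-mention tagger described in the abstract. The sketch below shows the general token-classification setup with the Hugging Face transformers library; the model identifier, label set, and example sentence are assumptions, and real fine-tuning would align BIO labels to word pieces and run a training loop.

```python
# A minimal sketch of setting up a BioBERT-style model for disease-mention
# tagging; "dmis-lab/biobert-v1.1" is an assumed Hub checkpoint name.
from transformers import AutoTokenizer, AutoModelForTokenClassification

labels = ["O", "B-Disease", "I-Disease"]     # BIO scheme for disease mentions
tokenizer = AutoTokenizer.from_pretrained("dmis-lab/biobert-v1.1")
model = AutoModelForTokenClassification.from_pretrained(
    "dmis-lab/biobert-v1.1", num_labels=len(labels)
)

# Tokenise one note fragment and take per-token label predictions (untrained
# head, so the output is only meaningful after fine-tuning).
enc = tokenizer("Mother was diagnosed with breast cancer at age 52.",
                return_tensors="pt")
logits = model(**enc).logits                 # shape: (1, seq_len, num_labels)
pred = logits.argmax(-1)
print(pred)
```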