scispace - formally typeset
Open AccessJournal ArticleDOI

Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining

Reads0
Chats0
TLDR
These experiments show that the proposed Chinese clinical entity recognition model based on deep learning pretraining can effectively improve the recognition performance.
Abstract
Background Clinical named entity recognition is the basic task of mining electronic medical records text, which are with some challenges containing the language features of Chinese electronic medical records text with many compound entities, serious missing sentence components, and unclear entity boundary. Moreover, the corpus of Chinese electronic medical records is difficult to obtain. Methods Aiming at these characteristics of Chinese electronic medical records, this study proposed a Chinese clinical entity recognition model based on deep learning pretraining. The model used word embedding from domain corpus and fine-tuning of entity recognition model pretrained by relevant corpus. Then BiLSTM and Transformer are, respectively, used as feature extractors to identify four types of clinical entities including diseases, symptoms, drugs, and operations from the text of Chinese electronic medical records. Results 75.06% Macro-P, 76.40% Macro-R, and 75.72% Macro-F1 aiming at test dataset could be achieved. These experiments show that the Chinese clinical entity recognition model based on deep learning pretraining can effectively improve the recognition effect. Conclusions These experiments show that the proposed Chinese clinical entity recognition model based on deep learning pretraining can effectively improve the recognition performance.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Deep learning-based methods for natural hazard named entity recognition

TL;DR: In this article , a natural hazard named entity recognition method based on deep learning is proposed, namely XLNet-BiLSTM-CRF model, which can automatically mine text features and reduce the dependence on manual rules.
Journal ArticleDOI

Establishment of a Chinese critical care database from electronic healthcare records in a tertiary care medical center

TL;DR: Wang et al. as discussed by the authors reported the establishment of an openly accessible critical care database generated from the hospital information system, which can provide insights into the pathophysiology of underlying diseases and healthcare practices.
Journal ArticleDOI

Named entity recognition of Chinese electronic medical records based on a hybrid neural network and medical MC-BERT

TL;DR: Wang et al. as discussed by the authors proposed a hybrid neural network model based on medical MC-BERT, namely, the MCBERT + BiLSTM + CNN + Multi-Head Self-Attention (MHA) + CRF model.
Journal ArticleDOI

A multi-layer soft lattice based model for Chinese clinical named entity recognition

TL;DR: In this paper , the authors combined Transformer with Soft Term Position Lattice to form soft lattice structure Transformer, which models long-distance dependencies similarly to LSTM and achieved 91.6% f-measure in recognizing long medical terms, abbreviations, and numbers.
Journal ArticleDOI

TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition.

TL;DR: In this article, a novel word-character integrated self-attention module was proposed to improve the performance of the TCM named entity recognition model by using the character-level representation and tagging.
References
More filters
Posted Content

Bidirectional LSTM-CRF Models for Sequence Tagging

TL;DR: This work is the first to apply a bidirectional LSTM CRF model to NLP benchmark sequence tagging data sets and it is shown that the BI-LSTM-CRF model can efficiently use both past and future input features thanks to a biddirectional L STM component.
Journal ArticleDOI

2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

TL;DR: The 2010 i2b2/VA Workshop on Natural Language Processing Challenges for Clinical Records presented three tasks, which showed that machine learning approaches could be augmented with rule-based systems to determine concepts, assertions, and relations.
Proceedings Article

Recognizing Named Entities in Tweets

TL;DR: This work proposes to combine a K-Nearest Neighbors classifier with a linear Conditional Random Fields model under a semi-supervised learning framework to tackle the challenges of Named Entities Recognition for tweets.
Journal ArticleDOI

A comprehensive study of named entity recognition in Chinese clinical text

TL;DR: The authors' evaluation on the independent test set showed that most types of feature were beneficial to Chinese NER systems, although the improvements were limited, and the system achieved the highest performance by combining word segmentation and section information, indicating that these two types offeature complement each other.
Journal ArticleDOI

Entity recognition from clinical texts via recurrent neural network

TL;DR: This paper comprehensively investigates the performance of LSTM (long-short term memory), a representative variant of RNN, on clinical entity recognition and protected health information recognition, and shows that L STM outperforms traditional machine learning methods that suffer from fussy feature engineering.
Related Papers (5)