Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining
TLDR
These experiments show that the proposed Chinese clinical entity recognition model based on deep learning pretraining can effectively improve the recognition performance.
Abstract:
Background: Clinical named entity recognition is a basic task in mining the text of electronic medical records. Chinese electronic medical records pose particular challenges: the text contains many compound entities, sentence components are frequently missing, and entity boundaries are unclear. Moreover, corpora of Chinese electronic medical records are difficult to obtain. Methods: To address these characteristics, this study proposes a Chinese clinical entity recognition model based on deep learning pretraining. The model uses word embeddings trained on a domain corpus and fine-tunes an entity recognition model pretrained on a related corpus. BiLSTM and Transformer are each used as feature extractors to identify four types of clinical entities (diseases, symptoms, drugs, and operations) in the text of Chinese electronic medical records. Results: On the test dataset, the model achieved 75.06% Macro-P, 76.40% Macro-R, and 75.72% Macro-F1. Conclusions: These experiments show that the proposed Chinese clinical entity recognition model based on deep learning pretraining can effectively improve recognition performance.
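The macro-averaged scores reported in the abstract can be illustrated with a minimal sketch. The abstract does not say how Macro-F1 was computed; the sketch assumes it is the harmonic mean of Macro-P and Macro-R, which is consistent with the reported figures (the other common definition, the mean of per-class F1 scores, is not shown). The function names and the per-type counts are hypothetical and are not taken from the paper.

```python
from typing import Dict, Tuple


def harmonic_f1(precision: float, recall: float) -> float:
    """F1 as the harmonic mean of precision and recall (0.0 if both are 0)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def macro_scores(counts: Dict[str, Tuple[int, int, int]]) -> Tuple[float, float, float]:
    """Macro-averaged precision, recall, and F1.

    counts maps each entity type to (true positives, false positives,
    false negatives). Every type contributes equally to the average,
    regardless of how many entities of that type occur.
    """
    precisions, recalls = [], []
    for tp, fp, fn in counts.values():
        precisions.append(tp / (tp + fp) if tp + fp else 0.0)
        recalls.append(tp / (tp + fn) if tp + fn else 0.0)
    macro_p = sum(precisions) / len(precisions)
    macro_r = sum(recalls) / len(recalls)
    # Assumption: Macro-F1 is the harmonic mean of Macro-P and Macro-R.
    return macro_p, macro_r, harmonic_f1(macro_p, macro_r)


# Hypothetical counts for the four entity types named in the abstract.
example_counts = {
    "disease": (8, 2, 2),
    "symptom": (6, 4, 2),
    "drug": (9, 1, 3),
    "operation": (5, 5, 5),
}

if __name__ == "__main__":
    p, r, f1 = macro_scores(example_counts)
    print(f"Macro-P={p:.4f} Macro-R={r:.4f} Macro-F1={f1:.4f}")
    # Check the abstract's figures against the harmonic-mean assumption.
    print(f"F1 from reported P and R: {harmonic_f1(75.06, 76.40):.2f}")
```

Applying `harmonic_f1` to the reported 75.06% Macro-P and 76.40% Macro-R gives 75.72%, matching the reported Macro-F1.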
Citations
Journal Article
Deep learning-based methods for natural hazard named entity recognition
TL;DR: This article proposes a deep learning method for natural hazard named entity recognition, the XLNet-BiLSTM-CRF model, which can automatically mine text features and reduce the dependence on manual rules.
Journal Article
Establishment of a Chinese critical care database from electronic healthcare records in a tertiary care medical center
TL;DR: Wang et al. report the establishment of an openly accessible critical care database generated from the hospital information system, which can provide insights into the pathophysiology of underlying diseases and healthcare practices.
Journal Article
Named entity recognition of Chinese electronic medical records based on a hybrid neural network and medical MC-BERT
TL;DR: Wang et al. propose a hybrid neural network model based on medical MC-BERT, namely the MC-BERT + BiLSTM + CNN + Multi-Head Self-Attention (MHA) + CRF model.
Journal Article
A multi-layer soft lattice based model for Chinese clinical named entity recognition
TL;DR: In this paper, the authors combine the Transformer with a Soft Term Position Lattice to form a soft-lattice-structure Transformer, which models long-distance dependencies similarly to an LSTM and achieves a 91.6% F-measure in recognizing long medical terms, abbreviations, and numbers.
Journal Article
TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition.
TL;DR: This article proposes a novel word-character integrated self-attention module that improves the performance of a TCM named entity recognition model by using character-level representation and tagging.
References
Posted Content
Bidirectional LSTM-CRF Models for Sequence Tagging
Zhiheng Huang, Wei Xu, Kai Yu
TL;DR: This work is the first to apply a bidirectional LSTM-CRF model to NLP benchmark sequence tagging data sets, and it shows that the BI-LSTM-CRF model can efficiently use both past and future input features thanks to its bidirectional LSTM component.
Journal Article
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.
TL;DR: The 2010 i2b2/VA Workshop on Natural Language Processing Challenges for Clinical Records presented three tasks, which showed that machine learning approaches could be augmented with rule-based systems to determine concepts, assertions, and relations.
Proceedings Article
Recognizing Named Entities in Tweets
TL;DR: This work proposes to combine a K-Nearest Neighbors classifier with a linear Conditional Random Fields model under a semi-supervised learning framework to tackle the challenges of Named Entities Recognition for tweets.
Journal Article
A comprehensive study of named entity recognition in Chinese clinical text
TL;DR: The authors' evaluation on the independent test set showed that most types of features were beneficial to Chinese NER systems, although the improvements were limited. The system achieved its highest performance by combining word segmentation and section information, indicating that these two types of features complement each other.
Journal Article
Entity recognition from clinical texts via recurrent neural network
TL;DR: This paper comprehensively investigates the performance of LSTM (long short-term memory), a representative variant of RNN, on clinical entity recognition and protected health information recognition, and shows that LSTM outperforms traditional machine learning methods that suffer from fussy feature engineering.