Open Access · Posted Content

Clinical Concept Extraction with Contextual Word Embedding

TLDR
The authors propose a clinical concept extraction model that automatically annotates clinical problems, treatments, and tests in clinical notes using domain-specific contextual word embeddings; the model achieved the best performance among reported baselines and outperformed the state-of-the-art models by 3.4% in F1-score.
Abstract
Automatic extraction of clinical concepts is an essential step for turning the unstructured data within a clinical note into structured, actionable information. In this work, we propose a clinical concept extraction model for automatic annotation of clinical problems, treatments, and tests in clinical notes, utilizing domain-specific contextual word embeddings. A contextual word embedding model is first trained on a corpus that mixes clinical reports with relevant Wikipedia pages from the clinical domain. A bidirectional LSTM-CRF model is then trained for clinical concept extraction on top of these contextual embeddings. We tested our proposed model on the i2b2 2010 challenge dataset. It achieved the best performance among reported baseline models and outperformed the state-of-the-art models by 3.4% in terms of F1-score.
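The tagging stage of this pipeline can be sketched compactly. Below is a minimal, hypothetical PyTorch implementation, not the authors' code: it assumes pre-computed contextual embeddings (e.g., from an ELMo-style model) and the third-party `pytorch-crf` package; all names and dimensions are illustrative.

```python
# Minimal sketch (not the authors' code) of a BiLSTM-CRF tagger over
# pre-computed contextual word embeddings. Assumes the third-party
# `pytorch-crf` package (pip install pytorch-crf).
import torch
import torch.nn as nn
from torchcrf import CRF

class BiLSTMCRFTagger(nn.Module):
    def __init__(self, emb_dim=1024, hidden_dim=256, num_tags=7):
        # num_tags=7: BIO tags for problem/treatment/test plus O (i2b2 2010)
        super().__init__()
        self.lstm = nn.LSTM(emb_dim, hidden_dim // 2,
                            batch_first=True, bidirectional=True)
        self.emit = nn.Linear(hidden_dim, num_tags)
        self.crf = CRF(num_tags, batch_first=True)

    def loss(self, embeddings, tags, mask):
        # embeddings: (batch, seq_len, emb_dim) contextual vectors per token
        h, _ = self.lstm(embeddings)
        return -self.crf(self.emit(h), tags, mask=mask)  # NLL of gold tags

    def predict(self, embeddings, mask):
        h, _ = self.lstm(embeddings)
        return self.crf.decode(self.emit(h), mask=mask)  # Viterbi-best tags
```

At training time the CRF layer is optimized with the negative log-likelihood of the gold tag sequence; at test time Viterbi decoding returns the highest-scoring BIO sequence.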


Citations
Journal Article

BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

TL;DR: This article proposed BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining), a domain-specific language representation model pre-trained on large-scale biomedical corpora.
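As an illustration only, a domain-specific checkpoint like this can be loaded for token-classification fine-tuning with the Hugging Face `transformers` library; the hub ID below is an assumption (the BioBERT authors' published checkpoint), not something stated in this entry.

```python
# Hypothetical sketch: loading a biomedical BERT checkpoint for NER-style
# fine-tuning. The hub ID is an assumption; substitute the checkpoint you use.
from transformers import AutoTokenizer, AutoModelForTokenClassification

tokenizer = AutoTokenizer.from_pretrained("dmis-lab/biobert-base-cased-v1.1")
model = AutoModelForTokenClassification.from_pretrained(
    "dmis-lab/biobert-base-cased-v1.1", num_labels=7)  # e.g. BIO tags + O

inputs = tokenizer("Patient denies chest pain.", return_tensors="pt")
logits = model(**inputs).logits  # (1, seq_len, num_labels)
```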
Proceedings Article

Publicly Available Clinical BERT Embeddings

TL;DR: This paper explored and released two BERT models for clinical text: one for generic clinical text and another specifically for discharge summaries, and demonstrated that using a domain-specific model yields performance improvements on 3/5 clinical NLP tasks, establishing a new state of the art on the MedNLI dataset.
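For illustration, contextual token embeddings can be pulled from such a clinical checkpoint as follows; the hub ID is an assumption (a widely used publicly released clinical BERT), not stated by this entry.

```python
# Hypothetical sketch: extracting contextual token embeddings from a clinical
# BERT checkpoint. The hub ID is an assumption.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("emilyalsentzer/Bio_ClinicalBERT")
model = AutoModel.from_pretrained("emilyalsentzer/Bio_ClinicalBERT")

with torch.no_grad():
    inputs = tokenizer("Start metformin 500 mg twice daily.",
                       return_tensors="pt")
    hidden = model(**inputs).last_hidden_state  # (1, seq_len, 768)
```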
Posted Content

Publicly Available Clinical BERT Embeddings

TL;DR: This work explores and releases two BERT models for clinical text: one for generic clinical text and another specifically for discharge summaries, and demonstrates that using a domain-specific model yields performance improvements on 3/5 clinical NLP tasks, establishing a new state of the art on the MedNLI dataset.
Journal Article

Enhancing clinical concept extraction with contextual embeddings.

TL;DR: This article explored the space of possible options for utilizing contextual embedding models in clinical concept extraction, including comparisons against traditional word embedding methods (word2vec, GloVe, fastText).
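To make the contrast concrete: the traditional methods named here produce one static vector per word type, trainable with e.g. gensim. A sketch under the assumption of a pre-tokenized toy corpus; parameter values are illustrative.

```python
# Sketch: training static (non-contextual) embeddings with gensim. Unlike
# contextual models, each word gets a single vector regardless of context.
from gensim.models import Word2Vec

corpus = [["patient", "denies", "chest", "pain"],
          ["chest", "x-ray", "was", "normal"]]  # toy tokenized sentences

w2v = Word2Vec(corpus, vector_size=100, window=5, min_count=1, sg=1)
vec = w2v.wv["chest"]  # same vector in both sentences, by construction
```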
Journal Article

Building a PubMed knowledge graph.

TL;DR: Wang et al. constructed a PubMed knowledge graph (PKG) by extracting bio-entities from 29 million PubMed abstracts, disambiguating author names, integrating funding data through the National Institutes of Health (NIH) ExPORTER, collecting authors' affiliation history and educational background from ORCID, and identifying fine-grained affiliation data from MapAffil.
References
Proceedings Article

GloVe: Global Vectors for Word Representation

TL;DR: A new global log-bilinear regression model that combines the advantages of the two major model families in the literature (global matrix factorization and local context window methods) and produces a vector space with meaningful substructure.
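The "global log-bilinear regression" refers to GloVe's weighted least-squares objective over the word co-occurrence matrix; a minimal sketch of the per-pair loss, with variable names following the paper's notation:

```python
# Sketch of GloVe's per-pair loss: f(X_ij) * (w_i.w~_j + b_i + b~_j - log X_ij)^2
import numpy as np

def glove_pair_loss(w_i, w_j, b_i, b_j, x_ij, x_max=100.0, alpha=0.75):
    weight = (x_ij / x_max) ** alpha if x_ij < x_max else 1.0  # f(X_ij)
    residual = w_i @ w_j + b_i + b_j - np.log(x_ij)
    return weight * residual ** 2
```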
Posted Content

Adam: A Method for Stochastic Optimization

TL;DR: This article introduces Adam, a method for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments.
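Those "adaptive estimates of lower-order moments" are exponential moving averages of the gradient and its square; a sketch of a single Adam step using the paper's default hyperparameters:

```python
# Sketch of one Adam update (defaults from the paper; t is the step count).
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    m = b1 * m + (1 - b1) * grad            # first-moment (mean) estimate
    v = b2 * v + (1 - b2) * grad ** 2       # second-moment estimate
    m_hat = m / (1 - b1 ** t)               # bias correction
    v_hat = v / (1 - b2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```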
Posted Content

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships, together with extensions (subsampling of frequent words, negative sampling) that improve both the quality of the vectors and the training speed.
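One of those extensions, negative sampling, replaces the full softmax with a handful of binary classifications; a sketch of the objective for one (word, context) pair:

```python
# Sketch of the skip-gram negative-sampling objective for one pair:
# maximize log s(v_c.v_w) + sum_k log s(-v_nk.v_w) over k negative samples.
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sgns_loss(v_word, v_context, v_negatives):
    pos = np.log(sigmoid(v_context @ v_word))
    neg = sum(np.log(sigmoid(-v_n @ v_word)) for v_n in v_negatives)
    return -(pos + neg)  # minimize the negative log-likelihood
```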
Posted Content

Bidirectional LSTM-CRF Models for Sequence Tagging

TL;DR: This work is the first to apply a bidirectional LSTM-CRF model to NLP benchmark sequence tagging data sets, showing that the BiLSTM-CRF model can efficiently use both past and future input features thanks to its bidirectional LSTM component.
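The CRF layer on top of the BiLSTM scores an entire tag sequence as the sum of per-token emission scores and tag-to-tag transition scores; a sketch (names are illustrative):

```python
# Sketch: linear-chain CRF score of one tag sequence. Training pushes the
# gold sequence's score up relative to all alternatives (the log-partition).
def sequence_score(emissions, transitions, tags):
    # emissions[t][y]: BiLSTM score for tag y at position t
    # transitions[y][z]: score of moving from tag y to tag z
    score = emissions[0][tags[0]]
    for t in range(1, len(tags)):
        score += transitions[tags[t - 1]][tags[t]] + emissions[t][tags[t]]
    return score
```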
Posted Content

Deep contextualized word representations

TL;DR: This article introduced a new type of deep contextualized word representation that models both complex characteristics of word use (e.g., syntax and semantics), and how these uses vary across linguistic contexts (i.e., to model polysemy).
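ELMo's contextual representation is a learned, softmax-normalized mixture of the biLM's layer outputs, scaled by a task-specific weight gamma; a sketch of that combination (array shapes are illustrative):

```python
# Sketch of ELMo's layer mixing: ELMo_k = gamma * sum_j s_j * h_{k,j},
# where s = softmax(w) are learned scalar weights over biLM layers.
import numpy as np

def elmo_combine(layer_states, w, gamma):
    # layer_states: (num_layers, seq_len, dim) biLM activations per layer
    s = np.exp(w) / np.exp(w).sum()                # softmax over layer weights
    mixed = np.tensordot(s, layer_states, axes=1)  # (seq_len, dim)
    return gamma * mixed
```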