Proceedings ArticleDOI
Active Learning with Deep Pre-trained Models for Sequence Tagging of Clinical and Biomedical Texts
Artem Shelmanov, Vadim Liventsev, Danil Kireev, Nikita Khromov, Alexander Panchenko, Irina Fedulova, Dmitry V. Dylov +6 more
pp. 482–489
TLDR
An annotation tool empowered with active learning and deep pre-trained models, usable for entity annotation directly from the Jupyter IDE, is proposed, along with a modification to a standard uncertainty sampling strategy that is shown to be beneficial for annotating very skewed datasets.
Abstract
Active learning is a technique that helps to minimize the annotation budget required for the creation of a labeled dataset while maximizing the performance of a model trained on this dataset. It has been shown that active learning can be successfully applied to sequence tagging tasks of text processing in conjunction with deep learning models even when a limited amount of labeled data is available. Recent advances in transfer learning methods for natural language processing based on deep pre-trained models such as ELMo and BERT offer a much better ability to generalize on small annotated datasets compared to their shallow counterparts. The combination of deep pre-trained models and active learning leads to a powerful approach to dealing with annotation scarcity. In this work, we investigate the potential of this approach on clinical and biomedical data. The experimental evaluation shows that the combination of active learning and deep pre-trained models outperforms the standard methods of active learning. We also suggest a modification to a standard uncertainty sampling strategy and empirically show that it could be beneficial for annotation of very skewed datasets. Finally, we propose an annotation tool empowered with active learning and deep pre-trained models that can be used for entity annotation directly from the Jupyter IDE.
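The uncertainty sampling loop the abstract refers to can be sketched as follows. This is an illustrative, minimal implementation of length-normalized least-confidence scoring (the MNLP-style heuristic commonly used for sequence-tagging active learning), not the paper's exact modified strategy; the function names and the shape of the probability inputs are assumptions for the sketch.

```python
import math

def least_confidence_scores(token_probs):
    """Score each unlabeled sequence by model uncertainty.

    token_probs: a list of sequences, where each sequence is a list of
    per-token probability distributions over the tag set.
    Returns one score per sequence (higher = less confident), computed as
    the length-normalized negative log-probability of the most likely tag
    at each token.
    """
    scores = []
    for seq in token_probs:
        # Negative log of the top tag probability for every token.
        nll = [-math.log(max(dist)) for dist in seq]
        # Normalize by sequence length so long sentences are not
        # automatically ranked as "most uncertain".
        scores.append(sum(nll) / len(nll))
    return scores

def select_batch(token_probs, k):
    """Indices of the k most uncertain sequences to send for annotation."""
    scores = least_confidence_scores(token_probs)
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
```

In an active learning round, the tagger is trained on the labeled pool, `select_batch` picks the hardest unlabeled sequences, those are annotated, and the cycle repeats.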
Citations
Posted Content
A Survey of Deep Active Learning
TL;DR: A formal classification method for the existing work in deep active learning is provided, along with a comprehensive and systematic overview, to investigate whether AL can be used to reduce the cost of sample annotation while retaining the powerful learning capabilities of DL.
Proceedings ArticleDOI
Active Learning for BERT: An Empirical Study
Liat Ein-Dor, Alon Halfon, Ariel Gera, Eyal Shnarch, Lena Dankin, Leshem Choshen, Marina Danilevsky, Ranit Aharonov, Yoav Katz, Noam Slonim +9 more
TL;DR: The results demonstrate that AL can boost BERT performance, especially in the most realistic scenario in which the initial set of labeled examples is created using keyword-based queries, resulting in a biased sample of the minority class.
Journal ArticleDOI
Annotating social determinants of health using active learning, and characterizing determinants using neural event extraction
TL;DR: The Social History Annotation Corpus (SHAC) as discussed by the authors includes 4480 social history sections with detailed annotation for 12 social determinants of health (SDOH) characterizing the status, extent, and temporal information of 18K distinct events.
Proceedings ArticleDOI
Unsupervised non-parametric change point detection in electrocardiography
TL;DR: A new unsupervised and non-parametric method to detect change points in electrocardiography that reaches the level of performance of the supervised state-of-the-art techniques.
Journal ArticleDOI
LETS: A Label-Efficient Training Scheme for Aspect-Based Sentiment Analysis by Using a Pre-Trained Language Model
TL;DR: The authors proposed task-specific pre-training to exploit unlabeled task-specific corpus data, label augmentation to maximize the utility of labeled data, and active learning to label data strategically.
References
Journal ArticleDOI
Long short-term memory
TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete time steps by enforcing constant error flow through constant error carousels within special units.
Proceedings Article
Attention is All you Need
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin +7 more
TL;DR: This paper proposed the Transformer, a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely, and achieved state-of-the-art performance on English-to-French translation.
Proceedings ArticleDOI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
TL;DR: BERT as mentioned in this paper pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
Proceedings Article
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
TL;DR: This work presents iterative parameter estimation algorithms for conditional random fields and compares the performance of the resulting models to HMMs and MEMMs on synthetic and natural-language data.