Journal ArticleDOI
Named Entity Recognition by Using XLNet-BiLSTM-CRF
Rongen Yan,Xue Jiang,Depeng Dang +2 more
TLDR
A new neural network model is proposed to improve the effectiveness of NER by combining a pre-trained XLNet, a bi-directional long short-term memory (BiLSTM) network, and a conditional random field (CRF), and the superiority of XLNet in NER tasks is demonstrated.
Abstract
Named entity recognition (NER) is the basis for many natural language processing (NLP) tasks such as information extraction and question answering, and its accuracy directly affects the results of downstream tasks. Most relevant methods are implemented with neural networks; however, the word vectors obtained from a small data set cannot describe unusual, previously unseen entities accurately, so the results are not sufficiently accurate. Recently, XLNet, a new pre-trained model, has yielded satisfactory results in many NLP tasks, but integrating XLNet embeddings into existing NLP tasks is not straightforward. In this paper, a new neural network model is proposed to improve the effectiveness of NER by using a pre-trained XLNet, a bi-directional long short-term memory (BiLSTM) network, and a conditional random field (CRF). The pre-trained XLNet model is used to extract sentence features, which are then fed into the classic NER neural network model. In addition, the superiority of XLNet in NER tasks is demonstrated. We evaluate our model on the CoNLL-2003 English and WNUT-2017 datasets and show that XLNet-BiLSTM-CRF obtains state-of-the-art results.
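To illustrate the final stage of the pipeline described above, the sketch below shows how a CRF layer decodes the best tag sequence from per-token emission scores (the kind a BiLSTM over XLNet features would produce). This is a minimal, self-contained illustration, not the paper's implementation; the tag set, scores, and transition values are hypothetical toy data.

```python
def viterbi_decode(emissions, transitions, tags):
    """Return the highest-scoring tag sequence for one sentence.

    emissions: list of dicts, emissions[t][tag] = emission score for tag at step t
    transitions: dict, transitions[(prev, cur)] = CRF transition score
    """
    # Initialize with the emission scores of the first token.
    scores = {tag: emissions[0][tag] for tag in tags}
    backpointers = []
    for t in range(1, len(emissions)):
        new_scores, bp = {}, {}
        for cur in tags:
            # Best previous tag, accounting for the transition score.
            prev_best = max(tags, key=lambda p: scores[p] + transitions[(p, cur)])
            new_scores[cur] = (scores[prev_best]
                               + transitions[(prev_best, cur)]
                               + emissions[t][cur])
            bp[cur] = prev_best
        backpointers.append(bp)
        scores = new_scores
    # Trace the best path backwards from the highest-scoring final tag.
    best = max(tags, key=lambda tag: scores[tag])
    path = [best]
    for bp in reversed(backpointers):
        path.append(bp[path[-1]])
    return list(reversed(path))

tags = ["O", "B-PER", "I-PER"]
# Toy emission scores for a three-token sentence such as "john smith lives".
emissions = [
    {"O": 0.1, "B-PER": 2.0, "I-PER": 0.2},
    {"O": 0.3, "B-PER": 0.4, "I-PER": 1.8},
    {"O": 2.5, "B-PER": 0.1, "I-PER": 0.1},
]
# Transition scores penalize illegal moves such as O -> I-PER.
transitions = {(p, c): 0.0 for p in tags for c in tags}
transitions[("O", "I-PER")] = -10.0
transitions[("B-PER", "I-PER")] = 1.0

print(viterbi_decode(emissions, transitions, tags))  # ['B-PER', 'I-PER', 'O']
```

The transition scores are what distinguish a CRF head from independent per-token softmax classification: even if a token's emission slightly favors `I-PER`, a strongly negative `O -> I-PER` transition prevents the decoder from emitting an entity continuation without an entity start.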
Citations
Journal ArticleDOI
ChatGPT: A comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope
TL;DR: A comprehensive review of the background, applications, key challenges, and future directions of ChatGPT can be found in this article, highlighting the importance of striking a balance between AI-assisted innovation and human expertise.
Journal ArticleDOI
Deep learning-based methods for natural hazard named entity recognition
TL;DR: In this article, a natural hazard named entity recognition method based on deep learning is proposed, namely the XLNet-BiLSTM-CRF model, which can automatically mine text features and reduce the dependence on manual rules.
ChemNLP: A Natural Language Processing based Library for Materials Chemistry Text Data
Kamal Choudhary,Mathew L. Kelley +1 more
TL;DR: The ChemNLP library and an accompanying web app that can be used to analyze important materials chemistry information are presented, and the overlap between density functional theory and text-based databases for superconductors is determined.
Journal ArticleDOI
TFM: A Triple Fusion Module for Integrating Lexicon Information in Chinese Named Entity Recognition
Journal ArticleDOI
Transformer-Based Named Entity Recognition for French Using Adversarial Adaptation to Similar Domain Corpora
Arjun Choudhry,Pankaj Gupta,Inder Khatri,Aaryan Gupta,Maxime Nicol,Marie-Jean Meurs,Dinesh Kumar Vishwakarma +6 more
TL;DR: The authors proposed a transformer-based NER approach for French using adversarial adaptation to similar domain or general corpora for improved feature extraction and better generalization, which outperforms the corresponding non-adaptive models.
References
Proceedings ArticleDOI
Glove: Global Vectors for Word Representation
TL;DR: A new global log-bilinear regression model that combines the advantages of the two major model families in the literature, global matrix factorization and local context window methods, and produces a vector space with meaningful substructure.
Proceedings ArticleDOI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
TL;DR: BERT pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, and can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
Posted Content
Efficient Estimation of Word Representations in Vector Space
TL;DR: This paper proposes two novel model architectures for computing continuous vector representations of words from very large data sets; the quality of these representations is measured in a word similarity task, and the results are compared to the previously best-performing techniques based on different types of neural networks.
Proceedings Article
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
TL;DR: This work presents iterative parameter estimation algorithms for conditional random fields and compares the performance of the resulting models to HMMs and MEMMs on synthetic and natural-language data.
Related Papers (5)
Effect of Character and Word Features in Bidirectional LSTM-CRF for NER
Chirawan Ronran,Seungwoo Lee +1 more