Open Access Posted Content

Neural Entity Linking: A Survey of Models Based on Deep Learning

TLDR
This work distills a generic architecture of a neural EL system and discusses its components, such as candidate generation, mention-context encoding, and entity ranking, summarizing prominent methods for each of them.
Abstract
In this survey, we provide a comprehensive description of recent neural entity linking (EL) systems developed since 2015 as a result of the "deep learning revolution" in NLP. Our goal is to systematize the design features of neural entity linking systems and compare their performance to prominent classic methods on common benchmarks. We distill the generic architectural components of a neural EL system, such as candidate generation and entity ranking, and summarize prominent methods for each of them. The vast variety of modifications of this general neural entity linking architecture is grouped by several common themes: joint entity recognition and linking, models for global linking, domain-independent techniques including zero-shot and distant supervision methods, and cross-lingual approaches. Since many neural models take advantage of entity and mention/context embeddings to capture their semantic meaning, we provide an overview of popular embedding techniques. Finally, we briefly discuss applications of entity linking, focusing on the recently emerged use case of enhancing deep pre-trained masked language models based on the transformer architecture.
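To make the generic architecture concrete, here is a minimal sketch of the two distilled components, candidate generation and entity ranking. The alias table, toy entities, and random "embeddings" are illustrative stand-ins, not taken from any system in the survey.

```python
import zlib
import numpy as np

rng = np.random.default_rng(0)

# Candidate generation: an alias table maps surface forms to candidate entities.
alias_table = {
    "paris": ["Paris_France", "Paris_Texas", "Paris_mythology"],
}

# Entity embeddings (random stand-ins for pretrained vectors).
entity_emb = {e: rng.normal(size=8)
              for cands in alias_table.values() for e in cands}

def encode_mention_context(mention, context):
    """Stand-in for a neural mention-context encoder (e.g. an LSTM or BERT)."""
    seed = zlib.crc32((mention + " " + context).encode("utf-8"))
    return np.random.default_rng(seed).normal(size=8)

def link(mention, context):
    candidates = alias_table.get(mention.lower(), [])
    if not candidates:
        return None                      # NIL: no candidate was generated
    m = encode_mention_context(mention, context)
    # Entity ranking: score each candidate by embedding similarity.
    scores = {e: float(m @ entity_emb[e]) for e in candidates}
    return max(scores, key=scores.get)
```

Real systems replace the alias table with richer candidate generators and the random encoder with a trained neural model, but the lookup-then-rank shape is the same.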


Citations
Posted Content

Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases

TL;DR: In this article, the authors survey fundamental concepts and practical methods for creating and curating large-scale knowledge bases, including methods for discovering and canonicalizing entities and their semantic types and organizing them into clean taxonomies.
Proceedings ArticleDOI

Extended Overview of CLEF HIPE 2020: Named Entity Processing on Historical Newspapers

TL;DR: This paper presents an extended overview of the first edition of HIPE (Identifying Historical People, Places and other Entities), a pioneering shared task dedicated to the evaluation of named entity processing on historical newspapers in French, German and English.
Journal ArticleDOI

Reddit entity linking dataset

TL;DR: An entity linking dataset from Reddit is introduced that contains 17,316 linked entities, each annotated by three human annotators and then grouped into Gold, Silver, and Bronze tiers according to inter-annotator agreement; the results highlight the need for better entity linking models that can be applied to the enormous amount of social media text.
Journal ArticleDOI

Medical concept normalization in French using multilingual terminologies and contextual embeddings

TL;DR: In this paper, a system for concept normalization in French is presented, which takes advantage of the multilingual nature of available terminologies and embedding models to improve concept normalization in French without translation or direct supervision.
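A hedged sketch of the general idea behind such embedding-based normalization: mentions and terminology entries are encoded into one shared multilingual space, and a mention is mapped to the concept of its nearest entry, without translation. The encoder, terminology, and concept ids below are illustrative stand-ins, not the paper's actual resources.

```python
import numpy as np

terminology = ["myocardial infarction", "infarctus du myocarde", "migraine"]
concept_ids = ["C0027051", "C0027051", "C0149931"]   # UMLS-style ids, for illustration

def embed(text):
    """Stand-in for a multilingual contextual encoder: deterministic
    pseudo-embeddings keyed on the input text."""
    seed = sum(text.encode("utf-8"))
    return np.random.default_rng(seed).normal(size=16)

term_embs = np.stack([embed(t) for t in terminology])

def normalize(mention):
    """Map a mention to the concept of its nearest terminology entry
    by cosine similarity in the shared embedding space."""
    m = embed(mention)
    sims = term_embs @ m / (np.linalg.norm(term_embs, axis=1) * np.linalg.norm(m))
    return concept_ids[int(np.argmax(sims))]
```

Because French and English synonyms share one embedding space, a French mention can resolve to a concept whose canonical label is in another language.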
References
Journal ArticleDOI

Long short-term memory

TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
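For readers unfamiliar with the mechanism, the forward pass of one LSTM step can be sketched as follows. This is a minimal NumPy illustration; the stacked weight layout is one common convention, not the paper's exact formulation.

```python
import numpy as np

def lstm_step(x, h, c, W, U, b):
    """One LSTM step. The additive update of the memory cell `c` is the
    'constant error carousel' that lets gradients survive long time lags."""
    z = W @ x + U @ h + b                                # stacked pre-activations
    i, f, o, g = np.split(z, 4)
    sig = lambda v: 1.0 / (1.0 + np.exp(-v))
    i, f, o = sig(i), sig(f), sig(o)                     # input/forget/output gates
    c_new = f * c + i * np.tanh(g)                       # additive cell update
    h_new = o * np.tanh(c_new)
    return h_new, c_new

# Toy usage: input size 3, hidden size 4, a sequence of 5 steps.
rng = np.random.default_rng(0)
d_in, d_h = 3, 4
W = rng.normal(scale=0.1, size=(4 * d_h, d_in))
U = rng.normal(scale=0.1, size=(4 * d_h, d_h))
b = np.zeros(4 * d_h)
h = np.zeros(d_h)
c = np.zeros(d_h)
for x in rng.normal(size=(5, d_in)):
    h, c = lstm_step(x, h, c, W, U, b)
```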
Proceedings Article

Attention is All you Need

TL;DR: This paper proposes the Transformer, a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely, and achieves state-of-the-art performance on English-to-French translation.
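The attention operation the paper builds on can be sketched in a few lines. This is a minimal NumPy illustration of scaled dot-product attention with toy inputs; real Transformers apply it across multiple heads with learned projections.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Core Transformer operation: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    # Numerically stable row-wise softmax over the keys.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights

# Toy example: two queries, each aligned with one of two keys.
Q = np.array([[1.0, 0.0], [0.0, 1.0]])
K = np.array([[1.0, 0.0], [0.0, 1.0]])
V = np.array([[10.0, 0.0], [0.0, 10.0]])
out, weights = scaled_dot_product_attention(Q, K, V)
# Each row of `weights` sums to 1; each query attends most to its matching key.
```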
Proceedings ArticleDOI

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

TL;DR: BERT pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers; the pretrained model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
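The "one additional output layer" idea can be illustrated with a minimal sketch: a single linear head placed on top of the encoder's [CLS] representation, with everything below the head being the pretrained bidirectional encoder. The [CLS] vector here is a random stand-in rather than the output of a real encoder.

```python
import numpy as np

rng = np.random.default_rng(0)
hidden, n_labels = 8, 3

cls_vec = rng.normal(size=hidden)                        # stand-in for BERT's [CLS] output
W_head = rng.normal(scale=0.1, size=(n_labels, hidden))  # the one added layer
b_head = np.zeros(n_labels)

logits = W_head @ cls_vec + b_head
probs = np.exp(logits - logits.max())
probs = probs / probs.sum()                              # softmax over task labels
pred = int(np.argmax(probs))
```

During fine-tuning, both the head and the encoder weights are updated end-to-end on the downstream task.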
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: This paper presents a simple method for finding phrases in text, shows that learning good vector representations for millions of phrases is possible, and describes a simple alternative to the hierarchical softmax called negative sampling.
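The negative-sampling objective can be sketched for a single training pair. This is a minimal NumPy illustration with random stand-in vectors; in practice the vectors are rows of the word and context embedding matrices being trained.

```python
import numpy as np

def sgns_loss(v_c, u_o, u_negs):
    """Negative-sampling objective for one (center, context) pair:
    raise the score of the true pair and lower the scores of k sampled
    negatives, avoiding a softmax over the whole vocabulary."""
    sig = lambda x: 1.0 / (1.0 + np.exp(-x))
    pos = np.log(sig(u_o @ v_c))                 # true context word
    neg = np.sum(np.log(sig(-(u_negs @ v_c))))   # k negative samples
    return -(pos + neg)

rng = np.random.default_rng(0)
v_c = rng.normal(size=5)                         # center-word vector
u_o = rng.normal(size=5)                         # true context-word vector
u_negs = rng.normal(size=(3, 5))                 # k = 3 sampled negatives
loss = sgns_loss(v_c, u_o, u_negs)               # a positive scalar to minimize
```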