Proceedings ArticleDOI

Character-based feature extraction with LSTM networks for POS-tagging task

Aibek Makazhanov, +1 more
- pp 7991654
TLDR
An LSTM-based feature extraction layer that reads in a sequence of characters corresponding to a word and outputs a single fixed-length real-valued vector, offering a solution to the out-of-vocabulary words problem.
Abstract
In this paper we describe work in progress on designing continuous vector-space word representations that can map unseen data adequately. We propose an LSTM-based feature extraction layer that reads in a sequence of characters corresponding to a word and outputs a single fixed-length real-valued vector. We then test our model on a POS tagging task on four typologically different languages. The results of the experiments suggest that the model can offer a solution to the out-of-vocabulary words problem: in a comparable setting, its OOV accuracy improves over that of a state-of-the-art tagger.
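The idea described in the abstract can be sketched as follows: a single LSTM cell reads the characters of a word one by one, and its final hidden state serves as the fixed-length word vector, so unseen words still receive a representation. This is a minimal illustration, not the authors' code; the character encoding (raw code points), the dimensions, and the random initialization are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class CharLSTMEncoder:
    def __init__(self, vocab_size, embed_dim, hidden_dim):
        self.embed = rng.normal(0, 0.1, (vocab_size, embed_dim))
        # One stacked weight matrix for the input, forget, output gates
        # and the cell candidate.
        self.W = rng.normal(0, 0.1, (4 * hidden_dim, embed_dim + hidden_dim))
        self.b = np.zeros(4 * hidden_dim)
        self.h_dim = hidden_dim

    def encode(self, char_ids):
        h = np.zeros(self.h_dim)
        c = np.zeros(self.h_dim)
        for idx in char_ids:
            x = self.embed[idx]
            z = self.W @ np.concatenate([x, h]) + self.b
            i, f, o, g = np.split(z, 4)
            i, f, o, g = sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)
            c = f * c + i * g   # additive cell update (constant error carousel)
            h = o * np.tanh(c)
        return h                # fixed-length vector for the whole word

# Any character sequence, seen or unseen, maps to a vector of the same size.
enc = CharLSTMEncoder(vocab_size=128, embed_dim=16, hidden_dim=32)
vec_seen = enc.encode([ord(ch) for ch in "tag"])
vec_oov = enc.encode([ord(ch) for ch in "untaggable"])
```

In a full tagger, this layer would replace a word-embedding lookup table and be trained jointly with the POS classifier on top.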


Citations
Book ChapterDOI

Portuguese POS Tagging Using BLSTM Without Handcrafted Features

TL;DR: This paper proposes a neural network architecture for the POS tagging task for both contemporary and historical Portuguese texts, and applies it to three Portuguese corpora, improving the tagging accuracy for out-of-vocabulary words in the Mac-Morpho corpus and in the revised Mac-Morpho.
References
Journal ArticleDOI

Long short-term memory

TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete time steps by enforcing constant error flow through constant error carousels within special units.
Journal ArticleDOI

Deep learning

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
Book

Deep Learning

TL;DR: Deep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts; it is used in many applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and video games.
Proceedings ArticleDOI

Glove: Global Vectors for Word Representation

TL;DR: A new global log-bilinear regression model that combines the advantages of the two major model families in the literature, global matrix factorization and local context window methods, and produces a vector space with meaningful substructure.
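The global log-bilinear objective summarized above can be illustrated with a toy sketch (not the reference GloVe implementation): minimize the weighted squared error between w_i · w̃_j + b_i + b̃_j and log X_ij over nonzero co-occurrence counts. The tiny co-occurrence matrix, learning rate, and iteration count are assumptions; x_max and alpha follow the paper's defaults.

```python
import numpy as np

rng = np.random.default_rng(1)

def weight(x, x_max=100.0, alpha=0.75):
    # f(x) caps the influence of very frequent co-occurrences.
    return np.where(x < x_max, (x / x_max) ** alpha, 1.0)

V, d = 4, 8                      # toy vocabulary and embedding size (assumed)
X = np.array([[0, 10, 3, 0],
              [10, 0, 5, 1],
              [3, 5, 0, 8],
              [0, 1, 8, 0]], dtype=float)  # toy co-occurrence counts

W = rng.normal(0, 0.1, (V, d))   # word vectors
Wt = rng.normal(0, 0.1, (V, d))  # context vectors
b = np.zeros(V)                  # word biases
bt = np.zeros(V)                 # context biases

def loss():
    # J = sum over X_ij > 0 of f(X_ij) * (w_i . w~_j + b_i + b~_j - log X_ij)^2
    total = 0.0
    for i, j in zip(*np.nonzero(X)):
        err = W[i] @ Wt[j] + b[i] + bt[j] - np.log(X[i, j])
        total += weight(X[i, j]) * err ** 2
    return total

lr = 0.05
before = loss()
for _ in range(200):             # plain SGD over the nonzero entries
    for i, j in zip(*np.nonzero(X)):
        err = W[i] @ Wt[j] + b[i] + bt[j] - np.log(X[i, j])
        g = 2 * weight(X[i, j]) * err
        W[i], Wt[j] = W[i] - lr * g * Wt[j], Wt[j] - lr * g * W[i]
        b[i] -= lr * g
        bt[j] -= lr * g
after = loss()
```

After training, `W + Wt` would serve as the final word vectors, mirroring the paper's practice of summing the two sets of embeddings.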
Posted Content

Efficient Estimation of Word Representations in Vector Space

TL;DR: Two novel model architectures for computing continuous vector representations of words from very large data sets are proposed; the quality of these representations is measured in a word similarity task, and the results are compared to the previously best performing techniques based on different types of neural networks.