Open Access Proceedings Article

Better Word Representations with Recursive Neural Networks for Morphology

TLDR
This paper combines recursive neural networks, where each morpheme is a basic unit, with neural language models to consider contextual information in learning morphologically-aware word representations, and proposes a novel model capable of building representations for morphologically complex words from their morphemes.
Abstract
Vector-space word representations have been very successful in recent years at improving performance across a variety of NLP tasks. However, common to most existing work, words are regarded as independent entities without any explicit relationship among morphologically related words being modeled. As a result, rare and complex words are often poorly estimated, and all unknown words are represented in a rather crude way using only one or a few vectors. This paper addresses this shortcoming by proposing a novel model that is capable of building representations for morphologically complex words from their morphemes. We combine recursive neural networks (RNNs), where each morpheme is a basic unit, with neural language models (NLMs) to consider contextual information in learning morphologically-aware word representations. Our learned models outperform existing word representations by a good margin on word similarity tasks across many datasets, including a new dataset we introduce focused on rare words to complement existing ones in an interesting way.
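The core idea in the abstract — building a vector for a morphologically complex word by recursively merging morpheme vectors — can be illustrated with a minimal sketch. All names here (the morpheme embeddings, the segmentation of "unfortunately", the parameter values) are illustrative assumptions; the actual model learns the embeddings and composition parameters jointly with a neural language model.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 5  # embedding dimension (kept tiny for illustration)

# Hypothetical morpheme embeddings; in the real model these are learned.
morpheme_vecs = {
    "un": rng.standard_normal(d),
    "fortunate": rng.standard_normal(d),
    "ly": rng.standard_normal(d),
}

# A single affine composition function shared across all merges.
W = rng.standard_normal((d, 2 * d)) * 0.1
b = np.zeros(d)

def compose(stem_vec, affix_vec):
    """Merge a stem vector with an affix vector: f(W [stem; affix] + b)."""
    return np.tanh(W @ np.concatenate([stem_vec, affix_vec]) + b)

# "unfortunately" ~ compose(compose(fortunate, un), ly):
# the morpheme tree is folded one merge at a time, each merge producing
# a vector of the same dimension d, so composition can recurse.
stem = compose(morpheme_vecs["fortunate"], morpheme_vecs["un"])
word_vec = compose(stem, morpheme_vecs["ly"])

print(word_vec.shape)  # (5,)
```

Because each merge returns a vector of the same dimension as its inputs, the same `compose` function applies at every level of the morpheme tree, which is what makes the network "recursive"; the tanh nonlinearity keeps every component of the resulting word vector in (-1, 1).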



Citations
Journal ArticleDOI

Language with Vision: a Study on Grounded Word and Sentence Embeddings

TL;DR: A series of evaluations on word similarity benchmarks shows that visual grounding is beneficial not only for concrete words, but also for abstract words, as well as for contextualized embeddings trained on corpora of relatively modest size.
Journal ArticleDOI

Learning Fair Representations via Rate-Distortion Maximization

TL;DR: A novel debiasing technique, Fairness-aware Rate Maximization (FaRM), that removes protected information by making representations of instances belonging to the same protected attribute class uncorrelated, using the rate-distortion function.
Proceedings ArticleDOI

Text Classification Algorithm Based on TF-IDF and BERT

TL;DR: The results suggest that the BERT-based method for automatic categorization of technology information text significantly improves accuracy, recall, and F1 score, and achieves good performance on Chinese text classification.
Posted Content

Empirical Study of Diachronic Word Embeddings for Scarce Data.

TL;DR: The authors compare three models to learn diachronic word embeddings on scarce data: incremental updating of a Skip-gram from Kim et al. (2014), dynamic filtering from Bamler and Mandt (2017), and dynamic Bernoulli embedding from Rudolph and Blei (2018).
Book ChapterDOI

Learning Distributed Representations of Uyghur Words and Morphemes

TL;DR: An approach to learn distributed representations of Uyghur words and morphemes from unlabeled data is proposed and it is shown that this approach achieves significant improvements over CBOW, a state-of-the-art model for computing vector representations of words.
References
Journal ArticleDOI

WordNet: a lexical database for English

TL;DR: WordNet is an online lexical database designed for use under program control, providing a more effective combination of traditional lexicographic information and modern computing.
Journal ArticleDOI

A neural probabilistic language model

TL;DR: The authors propose learning a distributed representation for words that allows each training sentence to inform the model about an exponential number of semantically neighboring sentences, which can be expressed in terms of these representations.
Journal Article

Natural Language Processing (Almost) from Scratch

TL;DR: A unified neural network architecture and learning algorithm that can be applied to various natural language processing tasks including part-of-speech tagging, chunking, named entity recognition, and semantic role labeling is proposed.
Proceedings ArticleDOI

A unified architecture for natural language processing: deep neural networks with multitask learning

TL;DR: This work describes a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity tags, semantic roles, semantically similar words and the likelihood that the sentence makes sense using a language model.
Proceedings Article

Recurrent neural network based language model

TL;DR: Results indicate that it is possible to obtain around a 50% reduction in perplexity by using a mixture of several RNN LMs, compared to a state-of-the-art backoff language model.