•Journal•ISSN: 2307-387X

Transactions of the Association for Computational Linguistics

Association for Computational Linguistics

About: Transactions of the Association for Computational Linguistics is an academic journal published by Association for Computational Linguistics. The journal publishes majorly in the area(s): Computer science & Artificial intelligence. It has an ISSN identifier of 2307-387X. It is also open access. Over the lifetime, 541 publications have been published receiving 49710 citations. The journal is also known as: TACL.

...read moreread less

Topics: Computer science, Artificial intelligence, Parsing, Natural language processing, Language model ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Enriching Word Vectors with Subword Information

[...]

Piotr Bojanowski¹, Edouard Grave¹, Armand Joulin¹, Tomas Mikolov¹•Institutions (1)

Facebook¹

12 Jun 2017-Transactions of the Association for Computational Linguistics

TL;DR: This paper proposed a new approach based on skip-gram model, where each word is represented as a bag of character n-grams, words being represented as the sum of these representations, allowing to train models on large corpora quickly and allowing to compute word representations for words that did not appear in the training data.

...read moreread less

Abstract: Continuous word representations, trained on large unlabeled corpora are useful for many natural language processing tasks. Popular models to learn such representations ignore the morphology of words, by assigning a distinct vector to each word. This is a limitation, especially for languages with large vocabularies and many rare words. In this paper, we propose a new approach based on the skipgram model, where each word is represented as a bag of character n-grams. A vector representation is associated to each character n-gram, words being represented as the sum of these representations. Our method is fast, allowing to train models on large corpora quickly and allows to compute word representations for words that did not appear in the training data. We evaluate our word representations on nine different languages, both on word similarity and analogy tasks. By comparing to recently proposed morphological word representations, we show that our vectors achieve state-of-the-art performance on these tasks.

...read moreread less

7,537 citations

Journal Article•DOI•

From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions

[...]

Peter Young¹, Alice Lai¹, Micah Hodosh¹, Julia Hockenmaier¹•Institutions (1)

University of Illinois at Urbana–Champaign¹

28 Feb 2014-Transactions of the Association for Computational Linguistics

TL;DR: This work proposes to use the visual denotations of linguistic expressions to define novel denotational similarity metrics, which are shown to be at least as beneficial as distributional similarities for two tasks that require semantic inference.

...read moreread less

Abstract: We propose to use the visual denotations of linguistic expressions (i.e. the set of images they describe) to define novel denotational similarity metrics, which we show to be at least as beneficial as distributional similarities for two tasks that require semantic inference. To compute these denotational similarities, we construct a denotation graph, i.e. a subsumption hierarchy over constituents and their denotations, based on a large corpus of 30K images and 150K descriptive captions.

...read moreread less

2,026 citations

Journal Article•DOI•

Natural Questions: A Benchmark for Question Answering Research

[...]

Tom Kwiatkowski¹, Jennimaria Palomaki¹, Olivia Redfield¹, Michael Collins², Ankur P. Parikh¹, Chris Alberti¹, Danielle Epstein¹, Illia Polosukhin¹, Jacob Devlin¹, Kenton Lee¹, Kristina Toutanova¹, Llion Jones¹, Matthew Kelcey¹, Ming-Wei Chang¹, Andrew M. Dai¹, Jakob Uszkoreit¹, Quoc V. Le¹, Slav Petrov¹ - Show less +14 more•Institutions (2)

Google¹, Columbia University²

02 Aug 2019-Transactions of the Association for Computational Linguistics

TL;DR: The Natural Questions corpus, a question answering data set, is presented, introducing robust metrics for the purposes of evaluating question answering systems; demonstrating high human upper bounds on these metrics; and establishing baseline results using competitive methods drawn from related literature.

...read moreread less

Abstract: We present the Natural Questions corpus, a question answering data set. Questions consist of real anonymized, aggregated queries issued to the Google search engine. An annotator is presented with a...

...read moreread less

1,618 citations

Journal Article•DOI•

Improving Distributional Similarity with Lessons Learned from Word Embeddings

[...]

Omer Levy¹, Yoav Goldberg¹, Ido Dagan¹•Institutions (1)

Bar-Ilan University¹

04 May 2015-Transactions of the Association for Computational Linguistics

TL;DR: It is revealed that much of the performance gains of word embeddings are due to certain system design choices and hyperparameter optimizations, rather than the embedding algorithms themselves, and these modifications can be transferred to traditional distributional models, yielding similar gains.

...read moreread less

Abstract: Recent trends suggest that neural-network-inspired word embedding models outperform traditional count-based distributional models on word similarity and analogy detection tasks. We reveal that much of the performance gains of word embeddings are due to certain system design choices and hyperparameter optimizations, rather than the embedding algorithms themselves. Furthermore, we show that these modifications can be transferred to traditional distributional models, yielding similar gains. In contrast to prior reports, we observe mostly local or insignificant performance differences between the methods, with no global advantage to any single approach over the others.

...read moreread less

1,374 citations

Journal Article•DOI•

Named Entity Recognition with Bidirectional LSTM-CNNs

[...]

Jason P.C. Chiu¹, Eric Nichols²•Institutions (2)

University of British Columbia¹, Honda²

21 Jul 2016-Transactions of the Association for Computational Linguistics

TL;DR: In this article, a hybrid bidirectional LSTM and CNN architecture was proposed to automatically detect word and character-level features, eliminating the need for feature engineering and lexicons to achieve high performance.

...read moreread less

Abstract: Named entity recognition is a challenging task that has traditionally required large amounts of knowledge in the form of feature engineering and lexicons to achieve high performance. In this paper, we present a novel neural network architecture that automatically detects word- and character-level features using a hybrid bidirectional LSTM and CNN architecture, eliminating the need for most feature engineering. We also propose a novel method of encoding partial lexicon matches in neural networks and compare it to existing approaches. Extensive evaluation shows that, given only tokenized text and publicly available word embeddings, our system is competitive on the CoNLL-2003 dataset and surpasses the previously reported state of the art performance on the OntoNotes 5.0 dataset by 2.13 F1 points. By using two lexicons constructed from publicly-available sources, we establish new state of the art performance with an F1 score of 91.62 on CoNLL-2003 and 86.28 on OntoNotes, surpassing systems that employ heavy feature engineering, proprietary lexicons, and rich entity linking information.

...read moreread less

1,321 citations

Collapse

Performance

Metrics

593

Papers

49,924

Citations

No. of papers from the Journal in previous years
Year	Papers
2023	18
2022	111
2021	76
2020	55
2019	45
2018	48