Conference on Computational Natural Language Learning 

About: The Conference on Computational Natural Language Learning (CoNLL) is an academic conference. It publishes mainly in the areas of parsing and language modeling. Over its lifetime, the conference has published 933 papers, which have received 40,608 citations.


Papers
Proceedings ArticleDOI
01 Jan 2016
TL;DR: This work introduces and studies an RNN-based variational autoencoder generative model that incorporates distributed latent representations of entire sentences, allowing it to explicitly model holistic properties of sentences such as style, topic, and high-level syntactic features.
Abstract: The standard recurrent neural network language model (RNNLM) generates sentences one word at a time and does not work from an explicit global sentence representation. In this work, we introduce and study an RNN-based variational autoencoder generative model that incorporates distributed latent representations of entire sentences. This factorization allows it to explicitly model holistic properties of sentences such as style, topic, and high-level syntactic features. Samples from the prior over these sentence representations remarkably produce diverse and well-formed sentences through simple deterministic decoding. By examining paths through this latent space, we are able to generate coherent novel sentences that interpolate between known sentences. We present techniques for solving the difficult learning problem presented by this model, demonstrate its effectiveness in imputing missing words, explore many interesting properties of the model's latent sentence space, and present negative results on the use of the model in language modeling.

1,690 citations
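A minimal PyTorch sketch of the sentence-VAE idea described in the abstract above: an LSTM encoder produces a Gaussian posterior over a latent sentence vector z, and an LSTM decoder reconstructs the sentence conditioned on z. The class name, layer sizes, and hyperparameters are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class SentenceVAE(nn.Module):
    """Toy RNN-based variational autoencoder over token sequences (illustrative only)."""
    def __init__(self, vocab_size=10000, emb_dim=128, hidden_dim=256, latent_dim=32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.encoder = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.to_mu = nn.Linear(hidden_dim, latent_dim)
        self.to_logvar = nn.Linear(hidden_dim, latent_dim)
        self.z_to_h0 = nn.Linear(latent_dim, hidden_dim)
        self.decoder = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, tokens):
        emb = self.embed(tokens)                          # (batch, seq, emb_dim)
        _, (h, _) = self.encoder(emb)                     # final hidden state summarizes the sentence
        mu, logvar = self.to_mu(h[-1]), self.to_logvar(h[-1])
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)     # reparameterization trick
        h0 = self.z_to_h0(z).unsqueeze(0)                 # decoder state initialized from z
        dec_out, _ = self.decoder(emb, (h0, torch.zeros_like(h0)))  # teacher-forced reconstruction
        logits = self.out(dec_out)
        # KL divergence to the standard-normal prior keeps the latent space smooth enough to sample from
        kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=-1).mean()
        return logits, kl

model = SentenceVAE()
logits, kl = model(torch.randint(0, 10000, (4, 12)))      # batch of 4 sentences, 12 tokens each
print(logits.shape, kl.item())
```

Sampling z from the prior and decoding, or interpolating between two encoded z vectors, corresponds to the sentence generation and interpolation behaviour described in the abstract.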

Proceedings ArticleDOI
04 Jun 2009
TL;DR: Some of the fundamental design challenges and misconceptions that underlie the development of an efficient and robust NER system are analyzed, and several solutions to these challenges are developed.
Abstract: We analyze some of the fundamental design challenges and misconceptions that underlie the development of an efficient and robust NER system. In particular, we address issues such as the representation of text chunks, the inference approach needed to combine local NER decisions, the sources of prior knowledge and how to use them within an NER system. In the process of comparing several solutions to these challenges we reach some surprising conclusions, as well as develop an NER system that achieves 90.8 F1 score on the CoNLL-2003 NER shared task, the best reported result for this dataset.

1,539 citations
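One design choice the abstract refers to is how text chunks (entity spans) are represented as per-token tags. A small illustrative sketch, with a hypothetical encode_spans helper, contrasting the common BIO and BILOU schemes:

```python
def encode_spans(tokens, spans, scheme="BILOU"):
    """Tag tokens for the given entity spans; spans are (start, end_exclusive, label)."""
    tags = ["O"] * len(tokens)
    for start, end, label in spans:
        if scheme == "BIO":
            tags[start] = "B-" + label
            for i in range(start + 1, end):
                tags[i] = "I-" + label
        else:  # BILOU: Unit tag for single-token entities, Last tag for the final token
            if end - start == 1:
                tags[start] = "U-" + label
            else:
                tags[start] = "B-" + label
                for i in range(start + 1, end - 1):
                    tags[i] = "I-" + label
                tags[end - 1] = "L-" + label
    return tags

tokens = ["Barack", "Obama", "visited", "Paris", "."]
spans = [(0, 2, "PER"), (3, 4, "LOC")]
print(encode_spans(tokens, spans, "BIO"))    # ['B-PER', 'I-PER', 'O', 'B-LOC', 'O']
print(encode_spans(tokens, spans, "BILOU"))  # ['B-PER', 'L-PER', 'O', 'U-LOC', 'O']
```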

Proceedings ArticleDOI
19 Feb 2016
TL;DR: This paper proposes several novel models that address critical problems in summarization that are not adequately modeled by the basic architecture, such as modeling keywords, capturing the hierarchy of sentence-to-word structure, and emitting words that are rare or unseen at training time.
Abstract: In this work, we model abstractive text summarization using Attentional Encoder-Decoder Recurrent Neural Networks, and show that they achieve state-of-the-art performance on two different corpora. We propose several novel models that address critical problems in summarization that are not adequately modeled by the basic architecture, such as modeling keywords, capturing the hierarchy of sentence-to-word structure, and emitting words that are rare or unseen at training time. Our work shows that many of our proposed models contribute to further improvement in performance. We also propose a new dataset consisting of multi-sentence summaries, and establish performance benchmarks for further research.

1,405 citations
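A minimal numpy sketch of the attention step at the core of an attentional encoder-decoder like the one described above: the current decoder state scores every encoder state, and a softmax-weighted context vector is formed. The dimensions and the dot-product scoring function are illustrative assumptions.

```python
import numpy as np

def attention_step(decoder_state, encoder_states):
    """decoder_state: (hidden,); encoder_states: (src_len, hidden)."""
    scores = encoder_states @ decoder_state        # dot-product attention scores
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                       # softmax over source positions
    context = weights @ encoder_states             # weighted sum of encoder states
    return context, weights

rng = np.random.default_rng(0)
encoder_states = rng.normal(size=(6, 8))           # 6 source tokens, hidden size 8
decoder_state = rng.normal(size=(8,))
context, weights = attention_step(decoder_state, encoder_states)
print(weights.round(3), context.shape)
```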

Proceedings ArticleDOI
08 Jun 2006
TL;DR: This paper describes how treebanks for 13 languages were converted into the same dependency format and how parsing performance was measured, and draws general conclusions about multilingual parsing.
Abstract: Each year the Conference on Computational Natural Language Learning (CoNLL) features a shared task, in which participants train and test their systems on exactly the same data sets, in order to better compare systems. The tenth CoNLL (CoNLL-X) saw a shared task on Multilingual Dependency Parsing. In this paper, we describe how treebanks for 13 languages were converted into the same dependency format and how parsing performance was measured. We also give an overview of the parsing approaches that participants took and the results that they achieved. Finally, we try to draw general conclusions about multi-lingual parsing: What makes a particular language, treebank or annotation scheme easier or harder to parse and which phenomena are challenging for any dependency parser?

1,011 citations
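Parsing performance in dependency-parsing evaluations of this kind is typically reported as unlabeled and labeled attachment scores (UAS/LAS). A small illustrative scorer, not the shared task's official evaluation script:

```python
def attachment_scores(gold, predicted):
    """Each argument is a list of (head_index, deprel) per token."""
    assert len(gold) == len(predicted)
    uas = sum(g[0] == p[0] for g, p in zip(gold, predicted)) / len(gold)   # correct head only
    las = sum(g == p for g, p in zip(gold, predicted)) / len(gold)         # correct head and label
    return uas, las

gold = [(2, "nsubj"), (0, "root"), (2, "obj")]
pred = [(2, "nsubj"), (0, "root"), (1, "obj")]
uas, las = attachment_scores(gold, pred)
print(f"UAS={uas:.2f} LAS={las:.2f}")   # UAS=0.67 LAS=0.67
```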

Proceedings Article
01 Aug 2013
TL;DR: This paper combines recursive neural networks, where each morpheme is a basic unit, with neural language models to consider contextual information in learning morphologically aware word representations, and proposes a novel model capable of building representations for morphologically complex words from their morphemes.
Abstract: Vector-space word representations have been very successful in recent years at improving performance across a variety of NLP tasks. However, common to most existing work, words are regarded as independent entities without any explicit relationship among morphologically related words being modeled. As a result, rare and complex words are often poorly estimated, and all unknown words are represented in a rather crude way using only one or a few vectors. This paper addresses this shortcoming by proposing a novel model that is capable of building representations for morphologically complex words from their morphemes. We combine recursive neural networks (RNNs), where each morpheme is a basic unit, with neural language models (NLMs) to consider contextual information in learning morphologically aware word representations. Our learned models outperform existing word representations by a good margin on word similarity tasks across many datasets, including a new dataset we introduce focused on rare words to complement existing ones in an interesting way.

917 citations
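A toy numpy sketch of the recursive-composition idea described above: a word vector is built by repeatedly combining a stem vector with a morpheme vector through one shared neural layer. The morpheme segmentation, dimensions, and random vectors are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
dim = 5
W = rng.normal(scale=0.1, size=(dim, 2 * dim))   # shared composition matrix
b = np.zeros(dim)
morpheme_vec = {m: rng.normal(size=dim) for m in ["un", "fortunate", "ly"]}

def compose(left, right):
    """Combine two child vectors into a single parent vector."""
    return np.tanh(W @ np.concatenate([left, right]) + b)

# "unfortunately" composed left to right over its morphemes: ((un + fortunate) + ly)
stem = compose(morpheme_vec["un"], morpheme_vec["fortunate"])
word = compose(stem, morpheme_vec["ly"])
print(word.round(3))
```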

Performance
Metrics
No. of papers from the Conference in previous years
Year    Papers
2023    1
2022    24
2021    50
2020    55
2019    97
2018    58