
Showing papers by "Ashish Vaswani published in 2016"


Proceedings ArticleDOI
01 Jun 2016
TL;DR: This paper presents new state-of-the-art performance on CCG supertagging and parsing and demonstrates that while feed-forward architectures can compete with bidirectional LSTMs on POS tagging, models that encode the complete sentence are necessary for the long-range syntactic information encoded in supertags.

Abstract: In this paper we present new state-of-the-art performance on CCG supertagging and parsing. Our model outperforms existing approaches by an absolute gain of 1.5%. We analyze the performance of several neural models and demonstrate that while feed-forward architectures can compete with bidirectional LSTMs on POS tagging, models that encode the complete sentence are necessary for the long-range syntactic information encoded in supertags.

92 citations
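
For a concrete picture of the bidirectional-LSTM taggers compared in this abstract, here is a minimal sketch in PyTorch. The framework choice, layer sizes, and tag-set size (400 supertags) are illustrative assumptions, not the authors' implementation.

```python
# Minimal bidirectional-LSTM sequence tagger sketch (illustrative, not the authors' code).
import torch
import torch.nn as nn

class BiLSTMTagger(nn.Module):
    def __init__(self, vocab_size, embed_dim, hidden_dim, num_tags):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        # The bidirectional LSTM encodes the complete sentence in both directions,
        # giving each position access to long-range context.
        self.lstm = nn.LSTM(embed_dim, hidden_dim, bidirectional=True, batch_first=True)
        self.out = nn.Linear(2 * hidden_dim, num_tags)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) -> tag scores: (batch, seq_len, num_tags)
        states, _ = self.lstm(self.embed(token_ids))
        return self.out(states)

# Toy usage: 3 sentences of length 7, vocabulary of 1000 words, 400 supertags.
model = BiLSTMTagger(vocab_size=1000, embed_dim=50, hidden_dim=128, num_tags=400)
scores = model(torch.randint(0, 1000, (3, 7)))
print(scores.shape)  # torch.Size([3, 7, 400])
```

A feed-forward baseline of the kind discussed in the abstract would replace the LSTM with a classifier over a fixed window of neighboring words, which is why it lacks the sentence-level context that supertags require.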


Proceedings ArticleDOI
28 Sep 2016
TL;DR: This paper presents the first results for neuralizing an unsupervised Hidden Markov Model (HMM) and evaluates the approach on tag induction, where it outperforms existing generative models and is competitive with the state of the art.

Abstract: In this work, we present the first results for neuralizing an Unsupervised Hidden Markov Model. We evaluate our approach on tag induction. Our approach outperforms existing generative models and is competitive with the state of the art, though with a simpler model that is easily extended to include additional context.

49 citations
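
As a rough illustration of what "neuralizing" an HMM can mean, the sketch below parameterizes the emission distribution of an unsupervised HMM with word and tag embeddings and trains it by maximizing the marginal likelihood computed with the forward algorithm. The embedding sizes and the specific parameterization are assumptions for illustration, not the paper's exact architecture.

```python
# Sketch of a neuralized HMM for unsupervised tag induction (illustrative assumptions only).
import torch
import torch.nn as nn

class NeuralHMM(nn.Module):
    def __init__(self, vocab_size, num_tags, embed_dim=64):
        super().__init__()
        # Transition scores between hidden tags, plus embeddings that produce
        # emission scores for every (tag, word) pair.
        self.transition = nn.Parameter(torch.randn(num_tags, num_tags))
        self.start = nn.Parameter(torch.randn(num_tags))
        self.tag_embed = nn.Parameter(torch.randn(num_tags, embed_dim))
        self.word_embed = nn.Embedding(vocab_size, embed_dim)

    def log_emissions(self):
        # (num_tags, vocab_size) log p(word | tag) from tag and word embeddings.
        scores = self.tag_embed @ self.word_embed.weight.t()
        return torch.log_softmax(scores, dim=-1)

    def log_likelihood(self, sentence):
        # Forward algorithm: marginalize over all tag sequences for one sentence.
        emit = self.log_emissions()
        trans = torch.log_softmax(self.transition, dim=-1)
        alpha = torch.log_softmax(self.start, dim=-1) + emit[:, sentence[0]]
        for word in sentence[1:]:
            alpha = torch.logsumexp(alpha.unsqueeze(1) + trans, dim=0) + emit[:, word]
        return torch.logsumexp(alpha, dim=0)

# Unsupervised training maximizes the marginal likelihood of the corpus.
model = NeuralHMM(vocab_size=1000, num_tags=45)
loss = -model.log_likelihood(torch.tensor([3, 17, 42, 7]))
loss.backward()
```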


Proceedings ArticleDOI
01 Jun 2016
TL;DR: The authors' NCE-trained language models achieve significantly lower perplexity on the One Billion Word Benchmark language modeling challenge, and contain one sixth of the parameters in the best single model in Chelba et al. (2013).
Abstract: We present a simple algorithm to efficiently train language models with noise-contrastive estimation (NCE) on graphics processing units (GPUs). Our NCE-trained language models achieve significantly lower perplexity on the One Billion Word Benchmark language modeling challenge, and contain one sixth of the parameters in the best single model in Chelba et al. (2013). When incorporated into a strong Arabic-English machine translation system, they give a strong boost in translation quality. We release a toolkit so that others may also train large-scale, large-vocabulary LSTM language models with NCE, parallelizing computation across multiple GPUs.

48 citations
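
The core idea of NCE training, as described in the abstract, is to avoid the full softmax over a large vocabulary by discriminating each observed word from k samples drawn from a noise distribution. The sketch below shows one common form of that objective in PyTorch; the function name, arguments, and shapes are illustrative assumptions, not the released toolkit's API.

```python
# Hedged sketch of the noise-contrastive estimation (NCE) objective for language modeling.
import torch
import torch.nn.functional as F

def nce_loss(true_scores, noise_scores, true_noise_logprob, noise_noise_logprob, k):
    """Discriminate the observed next word from k noise samples, avoiding the full softmax.

    true_scores:         (batch,)    unnormalized model log-scores of the data words
    noise_scores:        (batch, k)  unnormalized model log-scores of the noise samples
    true_noise_logprob:  (batch,)    log q(data word) under the noise distribution
    noise_noise_logprob: (batch, k)  log q(noise word) under the noise distribution
    """
    log_k = torch.log(torch.tensor(float(k)))
    # Logit of P(word came from data) = model score minus log(k * q(word)).
    data_logits = true_scores - (true_noise_logprob + log_k)
    noise_logits = noise_scores - (noise_noise_logprob + log_k)
    loss_data = F.binary_cross_entropy_with_logits(data_logits, torch.ones_like(data_logits))
    loss_noise = F.binary_cross_entropy_with_logits(noise_logits, torch.zeros_like(noise_logits))
    return loss_data + k * loss_noise

# Toy usage: batch of 4 positions, k = 10 noise samples, uniform noise over ~1000 words.
true_s = torch.randn(4, requires_grad=True)
noise_s = torch.randn(4, 10, requires_grad=True)
loss = nce_loss(true_s, noise_s, torch.full((4,), -6.9), torch.full((4, 10), -6.9), k=10)
loss.backward()
```

Because the objective only touches the data word and the k noise samples, its cost is independent of the vocabulary size, which is what makes large-vocabulary training on GPUs practical.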


Posted Content
TL;DR: The first results for neuralizing an Unsupervised Hidden Markov Model are presented; the approach outperforms existing generative models and is competitive with the state of the art, though with a simpler model easily extended to include additional context.

Abstract: In this work, we present the first results for neuralizing an Unsupervised Hidden Markov Model. We evaluate our approach on tag induction. Our approach outperforms existing generative models and is competitive with the state of the art, though with a simpler model that is easily extended to include additional context.

41 citations


Proceedings ArticleDOI
01 Jun 2016
TL;DR: This paper tackles a challenging name tagging problem in an emergent setting, where the tagger must be completed within a few hours for a new incident language (IL) using very few resources, and proposes a new expectation-driven learning framework that rapidly acquires, categorizes, structures, and zooms in on IL-specific expectations.

Abstract: In this paper we tackle a challenging name tagging problem in an emergent setting: the tagger needs to be completed within a few hours for a new incident language (IL) using very few resources. Inspired by observing how human annotators attack this challenge, we propose a new expectation-driven learning framework. In this framework we rapidly acquire, categorize, structure and zoom in on IL-specific expectations (rules, features, patterns, gazetteers, etc.) from various non-traditional sources: consulting and encoding linguistic knowledge from native speakers, mining and projecting patterns from both mono-lingual and cross-lingual corpora, and typing based on cross-lingual entity linking. We also propose a cost-aware combination approach to compose expectations. Experiments on seven low-resource languages demonstrate the effectiveness and generality of this framework: we are able to set up a name tagger for a new IL within two hours and achieve 33.8%-65.1% F-score.

32 citations
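
To make the notion of "expectations" slightly more concrete, the toy sketch below composes two such resources (a gazetteer and a surface pattern) into a rule-based tagger. The entries, pattern, and combination logic are invented for illustration and do not reproduce the paper's resources or its cost-aware combination approach.

```python
# Toy sketch: composing simple "expectations" (gazetteer + pattern) into a name tagger.
import re

GAZETTEER = {"kathmandu": "LOCATION", "unicef": "ORGANIZATION"}   # hypothetical entries
TITLE_PATTERN = re.compile(r"^(mr|mrs|dr)\.?$", re.IGNORECASE)    # a title precedes a PERSON

def tag_names(tokens):
    tags = ["O"] * len(tokens)
    for i, token in enumerate(tokens):
        if token.lower() in GAZETTEER:                       # expectation from a gazetteer
            tags[i] = GAZETTEER[token.lower()]
        elif i > 0 and TITLE_PATTERN.match(tokens[i - 1]):   # expectation from a pattern
            tags[i] = "PERSON"
    return list(zip(tokens, tags))

print(tag_names("Dr. Rana visited Kathmandu with UNICEF staff".split()))
```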


Journal ArticleDOI
TL;DR: This paper proposes a new approach to approximate structured inference for transition-based parsing that produces scores suitable for global scoring using local models, via the introduction of error states in local training.

Abstract: Transition-based approaches based on local classification are attractive for dependency parsing due to their simplicity and speed, despite producing results slightly below the state of the art. In this paper, we propose a new approach to approximate structured inference for transition-based parsing that produces scores suitable for global scoring using local models. This is accomplished by introducing error states in local training, which add information about incorrect derivation paths that is typically left out entirely in locally trained models. Using neural networks for our local classifiers, our approach produces the highest accuracy for transition-based dependency parsing in English.

15 citations
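
For context on what a transition-based parser with local classification looks like, here is a minimal arc-standard sketch with a placeholder local scorer. It is illustrative only: the error-state training and global scoring described in the abstract are noted in comments but not implemented.

```python
# Minimal arc-standard transition system for dependency parsing (greedy decoding,
# placeholder scorer; not the paper's model).
import random

SHIFT, LEFT_ARC, RIGHT_ARC = "SHIFT", "LEFT_ARC", "RIGHT_ARC"

def legal_actions(stack, buffer):
    actions = []
    if buffer:
        actions.append(SHIFT)
    if len(stack) >= 2:
        actions.extend([LEFT_ARC, RIGHT_ARC])
    return actions

def score(action, stack, buffer):
    # Stand-in for a neural local classifier. A locally trained model with error
    # states would also learn to score configurations reached via incorrect actions,
    # making its scores usable for global scoring over whole derivations.
    return random.random()

def parse(words):
    stack, buffer, arcs = [], list(range(len(words))), []
    while buffer or len(stack) > 1:
        action = max(legal_actions(stack, buffer), key=lambda a: score(a, stack, buffer))
        if action == SHIFT:
            stack.append(buffer.pop(0))
        elif action == LEFT_ARC:            # word below the top takes the top as its head
            dep = stack.pop(-2)
            arcs.append((stack[-1], dep))
        else:                               # RIGHT_ARC: top takes the word below as its head
            dep = stack.pop()
            arcs.append((stack[-1], dep))
    return arcs  # list of (head index, dependent index) pairs

print(parse("economic news had little effect".split()))
```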