scispace - formally typeset
M

Michael Collins

Researcher at Google

Publications -  221
Citations -  30486

Michael Collins is an academic researcher from Google. The author has contributed to research in topics: Parsing & Dependency grammar. The author has an hindex of 72, co-authored 201 publications receiving 27871 citations. Previous affiliations of Michael Collins include AT&T Labs & Columbia University.

Papers
More filters
Proceedings ArticleDOI

A New Statistical Parser Based on Bigram Lexical Dependencies

TL;DR: This paper proposed a new statistical parser which is based on probabilities of dependencies between head-words in the parse tree and extended bigram probability estimation techniques to calculate probabilities of dependency between pairs of words.
Proceedings Article

A Generalization of Principal Components Analysis to the Exponential Family

TL;DR: This paper draws on ideas from the Exponential family, Generalized linear models, and Bregman distances to give a generalization of PCA to loss functions that it is argued are better suited to other data types.
Proceedings Article

Discriminative Reranking for Natural Language Parsing

TL;DR: The boosting approach to ranking problems described in Freund et al. (1998) is applied to parsing the Wall Street Journal treebank, and it is argued that the method is an appealing alternative-in terms of both simplicity and efficiency-to work on feature selection methods within log-linear (maximum-entropy) models.
Proceedings Article

Simple Semi-supervised Dependency Parsing

TL;DR: This work focuses on the problem of lexical representation, introducing features that incorporate word clusters derived from a large unannotated corpus, and shows that the cluster-based features yield substantial gains in performance across a wide range of conditions.
Proceedings Article

Online Learning of Relaxed CCG Grammars for Parsing to Logical Form

TL;DR: A key idea is to introduce non-standard CCG combinators that relax certain parts of the grammar—for example allowing flexible word order, or insertion of lexical items— with learned costs.