scispace - formally typeset
S

Sharon Goldwater

Researcher at University of Edinburgh

Publications -  170
Citations -  7082

Sharon Goldwater is an academic researcher from University of Edinburgh. The author has contributed to research in topics: Text segmentation & Context (language use). The author has an hindex of 41, co-authored 162 publications receiving 6482 citations. Previous affiliations of Sharon Goldwater include Stanford University & Brown University.

Papers
More filters
Journal ArticleDOI

A Bayesian framework for word segmentation: Exploring the effects of context

TL;DR: The results indicate that taking context into account is important for a statistical word segmentation strategy to be successful, and raise the possibility that even young infants may be able to exploit more subtle statistical patterns than have usually been considered.
Proceedings Article

A fully Bayesian approach to unsupervised part-of-speech tagging

TL;DR: This model has the structure of a standard trigram HMM, yet its accuracy is closer to that of a state-of-the-art discriminative model (Smith and Eisner, 2005), up to 14 percentage points better than MLE.
Proceedings ArticleDOI

Contextual Dependencies in Unsupervised Word Segmentation

TL;DR: Two new Bayesian word segmentation methods are proposed that assume unigram and bigram models of word dependencies respectively, and the bigram model greatly outperforms the unigrams model (and previous probabilistic models), demonstrating the importance of such dependencies forword segmentation.
Proceedings Article

Inducing Probabilistic CCG Grammars from Logical Form with Higher-Order Unification

TL;DR: This paper uses higher-order unification to define a hypothesis space containing all grammars consistent with the training data, and develops an online learning algorithm that efficiently searches this space while simultaneously estimating the parameters of a log-linear parsing model.

Learning OT constraint rankings using a maximum entropy model

TL;DR: The Maximum Entropy model is described, a general statistical model, and it is shown how it can be applied in a constraint-based linguistic framework to model and learn grammars with free variation, as well as categorical Grammars.