S
Sharon Goldwater
Researcher at University of Edinburgh
Publications - 170
Citations - 7082
Sharon Goldwater is an academic researcher from University of Edinburgh. The author has contributed to research in topics: Text segmentation & Context (language use). The author has an hindex of 41, co-authored 162 publications receiving 6482 citations. Previous affiliations of Sharon Goldwater include Stanford University & Brown University.
Papers
More filters
Journal ArticleDOI
A Bayesian framework for word segmentation: Exploring the effects of context
TL;DR: The results indicate that taking context into account is important for a statistical word segmentation strategy to be successful, and raise the possibility that even young infants may be able to exploit more subtle statistical patterns than have usually been considered.
Proceedings Article
A fully Bayesian approach to unsupervised part-of-speech tagging
TL;DR: This model has the structure of a standard trigram HMM, yet its accuracy is closer to that of a state-of-the-art discriminative model (Smith and Eisner, 2005), up to 14 percentage points better than MLE.
Proceedings ArticleDOI
Contextual Dependencies in Unsupervised Word Segmentation
TL;DR: Two new Bayesian word segmentation methods are proposed that assume unigram and bigram models of word dependencies respectively, and the bigram model greatly outperforms the unigrams model (and previous probabilistic models), demonstrating the importance of such dependencies forword segmentation.
Proceedings Article
Inducing Probabilistic CCG Grammars from Logical Form with Higher-Order Unification
TL;DR: This paper uses higher-order unification to define a hypothesis space containing all grammars consistent with the training data, and develops an online learning algorithm that efficiently searches this space while simultaneously estimating the parameters of a log-linear parsing model.
Learning OT constraint rankings using a maximum entropy model
Sharon Goldwater,Mark Johnson +1 more
TL;DR: The Maximum Entropy model is described, a general statistical model, and it is shown how it can be applied in a constraint-based linguistic framework to model and learn grammars with free variation, as well as categorical Grammars.