scispace - formally typeset
C

Clara I. Cabezas

Researcher at University of Maryland, College Park

Publications -  8
Citations -  562

Clara I. Cabezas is an academic researcher from University of Maryland, College Park. The author has contributed to research in topics: Machine translation & Clef. The author has an hindex of 6, co-authored 8 publications receiving 549 citations.

Papers
More filters
Journal ArticleDOI

Bootstrapping parsers via syntactic projection across parallel texts

TL;DR: Using parallel text to help solving the problem of creating syntactic annotation in more languages by annotating the English side of a parallel corpus, project the analysis to the second language, and train a stochastic analyzer on the resulting noisy annotations.
Book ChapterDOI

CLEF Experiments at Maryland: Statistical Stemming and Backoff Translation

TL;DR: Results indicate that competitive performance can be achieved using four-stage backoff translation in conjunction with freely available bilingual dictionaries, but that the the usefulness of the statistical stemming algorithms that were tried varies considerably across the three languages to which they were applied.
ReportDOI

Using WSD Techniques for Lexical Selection in Statistical Machine Translation

TL;DR: This technical report describes the initial efforts to employ the power of WSD techniques in helping to guide a state of the art statistical MT system toward better word choices, and describes the approach, which leads to a small improvement in translation performance over a state-of-the art phrase-based statisticalMT system.
Proceedings Article

Supervised Sense Tagging using Support Vector Machines

TL;DR: A highly modular combination of language-independent feature extraction and supervised learning using support vector machines in order to permit rapid ramp-up, language independence, and capability for future expansion.
ReportDOI

Spanish Language Processing at University of Maryland: Building Infrastructure for Multilingual Applications

TL;DR: This work describes the construction of lexical resources, tool creation, building of an aligned parallel corpus, and an approach to automatic treebank creation, which has been developing using Spanish data, based on projection of English syntactic dependency information across a parallel corpus.