C
Chinatsu Aone
Researcher at SRA International
Publications - 18
Citations - 3338
Chinatsu Aone is an academic researcher from SRA International. The author has contributed to research in topics: Information extraction & Machine translation. The author has an hindex of 12, co-authored 18 publications receiving 3204 citations.
Papers
More filters
Proceedings ArticleDOI
Fast and effective text mining using linear-time document clustering
Bjornar Larsen,Chinatsu Aone +1 more
TL;DR: An unsupervised, near-linear time text clustering system that offers a number of algorithm choices for each phase, and a refinement to center adjustment, “vector average damping,” that further improves cluster quality.
Journal ArticleDOI
Kernel methods for relation extraction
TL;DR: This work introduces kernels defined over shallow parse representations of text, and design efficient algorithms for computing the kernels, and uses the devised kernels in conjunction with Support Vector Machine and Voted Perceptron learning algorithms for the task of extracting person-affiliation and organization-location relations from text.
Proceedings ArticleDOI
Kernel Methods for Relation Extraction
TL;DR: This work introduces kernels defined over shallow parse representations of text, and design efficient algorithms for computing the kernels, and uses the devised kernels in conjunction with Support Vector Machine and Voted Perceptron learning algorithms for the task of extracting person-affiliation and organization-location relations from text.
Proceedings ArticleDOI
REES: A Large-Scale Relation and Event Extraction System
TL;DR: This paper reports on a large-scale, end-to-end relation and event extraction system that consists of three specialized pattern-based tagging modules, a high-precision coreference resolution module, and a configurable template generation module.
Patent
Content distribution system and method
TL;DR: In this paper, a system and method for automatically identifying information in unstructured text and extracting data representing certain types of information from the text to produce a structured set of templates with the extracted data is provided.