scispace - formally typeset
Y

Yunyao Li

Researcher at IBM

Publications -  161
Citations -  3356

Yunyao Li is an academic researcher from IBM. The author has contributed to research in topics: Information extraction & Computer science. The author has an hindex of 24, co-authored 122 publications receiving 2888 citations. Previous affiliations of Yunyao Li include University of Michigan & Airbnb.

Papers
More filters
Book ChapterDOI

Schema-free XQuery

TL;DR: This work introduces the notion of Meaningful Lowest Common Ancestor Structure (MLCAS) for finding related nodes within an XML document and adds new functionality to XQuery to enable users to take full advantage of XQuery in querying XML data precisely and efficiently without requiring (perfect) knowledge of the document structure.
Proceedings ArticleDOI

Making database systems usable

TL;DR: A presentation data model is introduced and direct data manipulation with a schema later approach is recommended and the importance of provenance and of consistency across presentation models is stressed.
Proceedings Article

Rule-Based Information Extraction is Dead! Long Live Rule-Based Information Extraction Systems!

TL;DR: A case is made for the importance of rule-based IE to industry practitioners and a research agenda is laid out in advancing the state-of-theart in rule- based IE systems which has the potential to bridge the gap between academic research and industry practice.
Proceedings Article

SystemT: An Algebraic Approach to Declarative Information Extraction

TL;DR: A rule-based IE system whose basic design removes the expressivity and performance limitations of current systems based on cascading grammars, SystemT uses a declarative rule language, AQL, and an optimizer that generates high-performance algebraic execution plans for AQL rules.
Proceedings ArticleDOI

NaLIX: an interactive natural language interface for querying XML

TL;DR: It is shown that NaLIX, while far from being able to pass the Turing test, is perfectly usable in practice, and able to handle even quite complex queries in a variety of application domains.