scispace - formally typeset
Z

Zhe Chen

Researcher at Google

Publications -  14
Citations -  462

Zhe Chen is an academic researcher from Google. The author has contributed to research in topics: Information extraction & Computer science. The author has an hindex of 8, co-authored 13 publications receiving 360 citations. Previous affiliations of Zhe Chen include Renmin University of China & University of Michigan.

Papers
More filters
Proceedings ArticleDOI

Automatic web spreadsheet data extraction

TL;DR: A system that automatically extracts relational data from spreadsheets, thereby enabling relational spreadsheet integration and a novel view of how users organize their data in spreadsheets is presented.
Proceedings ArticleDOI

Integrating spreadsheet data via accurate and low-effort extraction

TL;DR: A two-phase semiautomatic system that extracts accurate relational metadata while minimizing user effort, based on an undirected graphical model, that enables downstream spreadsheet integration applications.
Proceedings ArticleDOI

EgoSet: Exploiting Word Ego-networks and User-generated Ontology for Multifaceted Set Expansion

TL;DR: This paper presents a novel solution to handling multifaceted seeds by combining existing user-generated ontologies with a novel word-similarity metric based on skip-grams that is able to produce sparse word ego-networks that are centered on the seed terms and are able to capture semantic equivalence among words.
Proceedings ArticleDOI

Long-tail Vocabulary Dictionary Extraction from the Web

TL;DR: This paper develops a novel method to construct high-quality dictionaries, especially for long-tail vocabularies, using just a few user-provided seeds for each topic, and achieves a 17.3% improvement on mean average precision for the dictionary generation process, and a 30.7% improvement for the page-specific extraction, when compared to previous state-of-the-art methods.
Journal ArticleDOI

Senbazuru: a prototype spreadsheet database management system

TL;DR: It is demonstrated that Senbazuru, a prototype spreadsheet database management system (SSDBMS), is able to extract relational information from spreadsheets, which opens up opportunities for integration among spreadsheets and with other relational sources.