scispace - formally typeset
F

Fuchun Peng

Researcher at Facebook

Publications -  117
Citations -  4878

Fuchun Peng is an academic researcher from Facebook. The author has contributed to research in topics: Web search query & Web query classification. The author has an hindex of 30, co-authored 113 publications receiving 4647 citations. Previous affiliations of Fuchun Peng include Google & BBN Technologies.

Papers
More filters
Proceedings ArticleDOI

Chinese segmentation and new word detection using conditional random fields

TL;DR: The ability of linear-chain conditional random fields (CRFs) to perform robust and accurate Chinese word segmentation by providing a principled framework that easily supports the integration of domain knowledge in the form of multiple lexicons of characters and words is demonstrated.

N-gram-based author profiles for authorship attribution

TL;DR: This work presents a novel method for computer-assisted authorship attribution based on characterlevel n-gram author proles, which is motivated by an almost-forgotten, pioneering method in 1976.
Proceedings Article

Accurate Information Extraction from Research Papers using Conditional Random Fields

TL;DR: New state-of-the-art performance is achieved on a standard benchmark data set, reducing error in average F1 by 36%, and word error rate by 78% in comparison with the previous best SVM results.
Proceedings ArticleDOI

Event threading within news topics

TL;DR: This work attempts to capture the rich structure of events and their dependencies in a news topic through the authors' event models, and takes into account novel features such as temporal locality of stories for event recognition and time-ordering for capturing dependencies.
Journal ArticleDOI

Augmenting Naive Bayes Classifiers with Statistical Language Models

TL;DR: This paper introduces CAN models, a generalized naive Bayes classifier which allows for a local Markov dependence among observations, and systematically study the key factors in the CAN model that can influence the classification performance, and analyze the strengths and weaknesses.