scispace - formally typeset
W

Wei Li

Researcher at University of Massachusetts Amherst

Publications -  22
Citations -  3012

Wei Li is an academic researcher from University of Massachusetts Amherst. The author has contributed to research in topics: Pachinko allocation & Topic model. The author has an hindex of 14, co-authored 22 publications receiving 2807 citations. Previous affiliations of Wei Li include Yahoo!.

Papers
More filters
Proceedings ArticleDOI

Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons

TL;DR: This work has shown that conditionally-trained models, such as conditional maximum entropy models, handle inter-dependent features of greedy sequence modeling in NLP well.
Proceedings ArticleDOI

Pachinko allocation: DAG-structured mixture models of topic correlations

TL;DR: Improved performance of PAM is shown in document classification, likelihood of held-out data, the ability to support finer-grained topics, and topical keyword coherence.
Proceedings ArticleDOI

Mixtures of hierarchical topics with Pachinko allocation

TL;DR: H hierarchical PAM is presented---an enhancement that explicitly represents a topic hierarchy that can be seen as combining the advantages of hLDA's topical hierarchy representation with PAM's ability to mix multiple leaves of the topic hierarchy.
Journal ArticleDOI

Rapid development of Hindi named entity recognition using conditional random fields and feature induction

Abstract: This paper describes our application of conditional random fields with feature induction to a Hindi named entity recognition task. With only five days development time and little knowledge of this language, we automatically discover relevant features by providing a large array of lexical tests and using feature induction to automatically construct the features that most increase conditional likelihood. In an effort to reduce overfitting, we use a combination of a Gaussian prior and early stopping based on the results of 10-fold cross validation.
Proceedings ArticleDOI

Exploitation and exploration in a performance based contextual advertising system

TL;DR: This paper develops two novel EE strategies for online advertising that can adaptively balance the two aspects of EE by automatically learning the optimal tradeoff and incorporating confidence metrics of historical performance.