W
Wei Li
Researcher at University of Massachusetts Amherst
Publications - 22
Citations - 3012
Wei Li is an academic researcher from University of Massachusetts Amherst. The author has contributed to research in topics: Pachinko allocation & Topic model. The author has an hindex of 14, co-authored 22 publications receiving 2807 citations. Previous affiliations of Wei Li include Yahoo!.
Papers
More filters
Proceedings ArticleDOI
Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons
Andrew McCallum,Wei Li +1 more
TL;DR: This work has shown that conditionally-trained models, such as conditional maximum entropy models, handle inter-dependent features of greedy sequence modeling in NLP well.
Proceedings ArticleDOI
Pachinko allocation: DAG-structured mixture models of topic correlations
Wei Li,Andrew McCallum +1 more
TL;DR: Improved performance of PAM is shown in document classification, likelihood of held-out data, the ability to support finer-grained topics, and topical keyword coherence.
Proceedings ArticleDOI
Mixtures of hierarchical topics with Pachinko allocation
TL;DR: H hierarchical PAM is presented---an enhancement that explicitly represents a topic hierarchy that can be seen as combining the advantages of hLDA's topical hierarchy representation with PAM's ability to mix multiple leaves of the topic hierarchy.
Journal ArticleDOI
Rapid development of Hindi named entity recognition using conditional random fields and feature induction
Wei Li,Andrew McCallum +1 more
Abstract: This paper describes our application of conditional random fields with feature induction to a Hindi named entity recognition task. With only five days development time and little knowledge of this language, we automatically discover relevant features by providing a large array of lexical tests and using feature induction to automatically construct the features that most increase conditional likelihood. In an effort to reduce overfitting, we use a combination of a Gaussian prior and early stopping based on the results of 10-fold cross validation.
Proceedings ArticleDOI
Exploitation and exploration in a performance based contextual advertising system
TL;DR: This paper develops two novel EE strategies for online advertising that can adaptively balance the two aspects of EE by automatically learning the optimal tradeoff and incorporating confidence metrics of historical performance.