scispace - formally typeset
M

Man Lan

Researcher at East China Normal University

Publications -  116
Citations -  2950

Man Lan is an academic researcher from East China Normal University. The author has contributed to research in topics: SemEval & Sentiment analysis. The author has an hindex of 24, co-authored 107 publications receiving 2356 citations. Previous affiliations of Man Lan include Institute for Infocomm Research Singapore & National University of Singapore.

Papers
More filters
Journal ArticleDOI

Supervised and Traditional Term Weighting Methods for Automatic Text Categorization

TL;DR: This study investigates several widely-used unsupervised and supervised term weighting methods on benchmark data collections in combination with SVM and kNN algorithms and proposes a new simple supervisedterm weighting method, tf.rf, to improve the terms' discriminating power for text categorization task.
Proceedings ArticleDOI

A comprehensive comparative study on term weighting schemes for text categorization with support vector machines

TL;DR: Controlled experimental results showed that this newly proposed tf-rf scheme is significantly better than other widely-used term weighting schemes, and the idf factor does not improve or even decrease the term's discriminating power for text categorization.
Proceedings Article

Predicting Discourse Connectives for Implicit Discourse Relation Recognition

TL;DR: This paper attempts to overcome difficulty for implicit relation recognition by automatically inserting discourse connectives between arguments with the use of a language model by proposing two algorithms to leverage the information of these predicted connectives.
Proceedings ArticleDOI

Initialization of cluster refinement algorithms: a review and comparative study

TL;DR: A controlled benchmark identifies two distance optimization methods, namely SCS and KKZ, as complements of the k-means learning characteristics towards a better cluster separation in the output solution.
Proceedings ArticleDOI

ECNU: One Stone Two Birds: Ensemble of Heterogenous Measures for Semantic Relatedness and Textual Entailment

TL;DR: This paper extracted seven types of features including text difference measures proposed in entailment judgement subtask, as well as common text similarity measures used in both subtasks to solve the both subtasking by considering them as a regression and a classification task respectively.