scispace - formally typeset
L

Lipika Dey

Researcher at Tata Consultancy Services

Publications -  131
Citations -  2337

Lipika Dey is an academic researcher from Tata Consultancy Services. The author has contributed to research in topics: Ontology (information science) & Information extraction. The author has an hindex of 20, co-authored 121 publications receiving 2040 citations. Previous affiliations of Lipika Dey include Harvard University & Indian Institutes of Technology.

Papers
More filters
Journal ArticleDOI

A k-mean clustering algorithm for mixed numeric and categorical data

TL;DR: A clustering algorithm based on k-mean paradigm that works well for data with mixed numeric and categorical features is presented and a new cost function and distance measure based on co-occurrence of values is proposed.
Journal ArticleDOI

Devnagari numeral recognition by combining decision of multiple connectionist classifiers

TL;DR: Experimental results show that the technique for recognition of handwritten Devnagari numerals is effective and reliable and a multi-classifier connectionist architecture has been proposed for increasing reliability of the recognition results.
Journal ArticleDOI

A feature selection technique for classificatory analysis

TL;DR: A conditional probability based, efficient method to extract the significant attributes from a database and shows how the classification methodology can be used for cost-sensitive learning where both accuracy and precision of prediction are important.
Journal ArticleDOI

A method to compute distance between two categorical values of same attribute in unsupervised learning for categorical data set

TL;DR: This work proposes a method to compute distance between two attribute values of same attribute for unsupervised learning and uses proposed distance measure with K-mode clustering algorithm to cluster various categorical data sets.
Proceedings ArticleDOI

Opinion mining from noisy text data

TL;DR: This paper has developed a system, which provides the user with a platform to analyze opinion expressions extracted from a repository, aimed at extracting and consolidating opinions of customers from blogs and feedbacks, at multiple levels of granularity.