scispace - formally typeset
S

Sudeshna Sarkar

Researcher at Indian Institute of Technology Kharagpur

Publications -  111
Citations -  1848

Sudeshna Sarkar is an academic researcher from Indian Institute of Technology Kharagpur. The author has contributed to research in topics: Bengali & Named-entity recognition. The author has an hindex of 20, co-authored 104 publications receiving 1598 citations. Previous affiliations of Sudeshna Sarkar include Indian Institute of Technology Guwahati & Indian Institutes of Technology.

Papers
More filters
Journal ArticleDOI

Investigation and modeling of the structure of texting language

TL;DR: The nature and type of compressions used in SMS texts are investigated, and a Hidden Markov Model based word-model for TL is developed, which results in a 35% reduction of the relative word level error rates.
Proceedings Article

Stylometric Analysis of Bloggers’ Age and Gender

TL;DR: Experimental results show that gender determination is more accurate than age group detection over a data spread across all ages but the accuracy of age prediction increases if the authors sample data with remarkable age difference.
Journal ArticleDOI

Feature selection techniques for maximum entropy based biomedical named entity recognition

TL;DR: This paper provides a study on word clustering and selection based feature reduction approaches for named entity recognition using a maximum entropy classifier and the performance is found to be superior to existing systems which do not use domain knowledge.
Proceedings ArticleDOI

Automatic Part-of-Speech Tagging for Bengali: An Approach for Morphologically Rich Languages in a Poor Resource Scenario

TL;DR: It is found that the use of morphology helps improve the accuracy of the tagger especially when less amount of tagged corpora are available.
Proceedings Article

A Hybrid Feature Set based Maximum Entropy Hindi Named Entity Recognition

TL;DR: The effort in developing a Named Entity Recognition (NER) system for Hindi using Maximum Entropy (MaxEnt) approach is described and a NER annotated corpora for the purpose is developed.