scispace - formally typeset
S

Soumen Chakrabarti

Researcher at Indian Institute of Technology Bombay

Publications -  208
Citations -  16289

Soumen Chakrabarti is an academic researcher from Indian Institute of Technology Bombay. The author has contributed to research in topics: Ranking (information retrieval) & Web page. The author has an hindex of 55, co-authored 208 publications receiving 15481 citations. Previous affiliations of Soumen Chakrabarti include University of California & Indian Institutes of Technology.

Papers
More filters
Proceedings ArticleDOI

Is question answering an acquired skill

TL;DR: A question answering (QA) system which learns how to detect and rank answer passages by analyzing questions and their answers by analyzing QA pairs provided as training data, built in only a few person-months using off-the-shelf components.
Proceedings Article

Distributed Hypertext Resource Discovery Through Examples

TL;DR: It is argued that that a keywordbased “find similar” search based on a giant all-purpose crawler is neither necessary nor adequate for resource discovery, and instead the properties that pages tend to cite pages with related topics are exploited.
Proceedings ArticleDOI

Optimizing scoring functions and indexes for proximity search in type-annotated corpora

TL;DR: The architecture of a next-generation information retrieval engine for such applications is described, and the key technical problems faced in building it are investigated, and a new algorithm that estimates a scoring function from past logs of queries and answer spans is proposed.
Proceedings ArticleDOI

Global communication analysis and optimization

TL;DR: A new compiler algorithm for global analysis and optimization of communication in data-parallel programs that exploits the flexibility resulting from this advanced analysis to eliminate redundancy, reduce the number of messages, and reduce contention for cache and communication buffers, all in a unified framework.
Book ChapterDOI

Document classification through interactive supervision of document and term labels

TL;DR: HIClass is presented, an interactive and exploratory labeling package that actively collects user opinion on feature representations and choices, as well as whole-document labels, while minimizing redundancy in the input sought.