scispace - formally typeset
S

Sameep Mehta

Researcher at IBM

Publications -  167
Citations -  2826

Sameep Mehta is an academic researcher from IBM. The author has contributed to research in topics: Context (language use) & Service (business). The author has an hindex of 22, co-authored 160 publications receiving 2093 citations. Previous affiliations of Sameep Mehta include Lady Hardinge Medical College & All India Institute of Medical Sciences.

Papers
More filters
Posted Content

Bollywood Movie Corpus for Text, Images and Videos

TL;DR: The Bollywood Movie corpus contains 4000 movies extracted from Wikipedia and 880 trailers extracted from YouTube which were released from 1970-2017 and suggests that the data-set is quite useful for performing such tasks.
Patent

Generating a recommended shaping function to integrate data within a data repository

TL;DR: In this article, the authors propose a method for determining, by a controller, a portion of data that is selected by a user, which is to be transformed by at least one shaping function.
Journal ArticleDOI

CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation

TL;DR: The authors proposed a Causally Fair Language (CFL) architecture for text generation, which is based on a Structural Causal Model (SCM) that is mathematically transparent and computationally efficient as compared with many existing detoxification techniques.
Proceedings ArticleDOI

Tutorial on Semantic Automation for Data Discovery

TL;DR: In this paper, the authors present state-of-the-art semantic technologies that enable automation of various tasks in data discovery, focusing on data enrichment, datasets search and recommendations, and explorations within a dataset.
Proceedings Article

An Empirical Assessment of Contemporary Online Media in Ad-Hoc Corpus Creation for Social Events

TL;DR: Using social propensity of evolution of topical discussions on Twitter to assess the goodness of the creation, online news media is found to be most effective in creating adhoc external corpus.