Home
/
Authors
/
Sameep Mehta

Author

Sameep Mehta

Other affiliations: Lady Hardinge Medical College, All India Institute of Medical Sciences, Ohio State University

Bio: Sameep Mehta is an academic researcher from IBM. The author has contributed to research in topics: Service (business) & Resource (project management). The author has an hindex of 22, co-authored 160 publications receiving 2093 citations. Previous affiliations of Sameep Mehta include Lady Hardinge Medical College & All India Institute of Medical Sciences.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Fair Transfer of Multiple Style Attributes in Text

[...]

Karan Dabas¹, Nishtha Madaan², Vijay Arya², Sameep Mehta², Tanmoy Chakraborty¹, Gautam Singh³ - Show less +2 more•Institutions (3)

Indian Institutes of Information Technology¹, IBM², Rutgers University³

01 Nov 2019

TL;DR: This work proposes a neural network architecture for fairly transferring multiple style attributes in a given text and demonstrates that the transfer of multiple styles cannot be achieved by sequentially performing multiple single-style transfers.

...read moreread less

Abstract: To preserve anonymity and obfuscate their identity on online platforms users may morph their text and portray themselves as a different gender or demographic. Similarly, a chatbot may need to customize its communication style to improve engagement with its audience. This manner of changing the style of written text has gained significant attention in recent years. Yet these past research works largely cater to the transfer of single style attributes. The disadvantage of focusing on a single style alone is that this often results in target text where other existing style attributes behave unpredictably or are unfairly dominated by the new style. To counteract this behavior, it would be nice to have a style transfer mechanism that can transfer or control multiple styles simultaneously and fairly. Through such an approach, one could obtain obfuscated or written text incorporated with a desired degree of multiple soft styles such as female-quality, politeness, or formalness. To the best of our knowledge this work is the first that shows and attempt to solve the issues related to multiple style transfer. We also demonstrate that the transfer of multiple styles cannot be achieved by sequentially performing multiple single-style transfers. This is because each single style-transfer step often reverses or dominates over the style incorporated by a previous transfer step. We then propose a neural network architecture for fairly transferring multiple style attributes in a given text. We test our architecture on the Yelp dataset to demonstrate our superior performance as compared to existing one-style transfer steps performed in a sequence.

...read moreread less

1 citations

Patent•

Recommending machine learning models and source codes for input datasets

[...]

Samiulla Shaikh, Sameep Mehta, Manish A. Bhide, Lobig William B

26 Mar 2020

TL;DR: In this article, candidate data analysis assets having a corresponding relatedness score associated with the particular input dataset greater than a defined relatedness measure threshold value are selected and ranked by score.

...read moreread less

Abstract: Asset recommendation for a particular input dataset is provided. Candidate data analysis assets having a corresponding relatedness score associated with the particular input dataset greater than a defined relatedness score threshold value are selected. Those candidate data analysis assets having a corresponding relatedness score greater than the defined relatedness score threshold value are ranked by score. Those candidate data analysis assets having a corresponding relatedness score greater than the defined relatedness score threshold value are listed by rank from highest to lowest. A justification for each candidate data analysis asset is inserted in the ranked list of candidate data analysis assets. The ranked list of candidate data analysis assets along with each respective justification is outputted on a display device.

...read moreread less

1 citations

Posted Content•

Hardening Deep Neural Networks via Adversarial Model Cascades

[...]

Deepak Vijaykeerthy¹, Anshuman Suri², Sameep Mehta¹, Ponnurangam Kumaraguru²•Institutions (2)

IBM¹, Indraprastha Institute of Information Technology²

02 Feb 2018-arXiv: Learning

TL;DR: The proposed Adversarial Model Cascades (AMC) trains a cascade of models sequentially where each model is optimized to be robust towards a mixture of multiple attacks, which yields a single model which is secure against a wide range of attacks.

...read moreread less

Abstract: Deep neural networks (DNNs) are vulnerable to malicious inputs crafted by an adversary to produce erroneous outputs Works on securing neural networks against adversarial examples achieve high empirical robustness on simple datasets such as MNIST However, these techniques are inadequate when empirically tested on complex data sets such as CIFAR-10 and SVHN Further, existing techniques are designed to target specific attacks and fail to generalize across attacks We propose the Adversarial Model Cascades (AMC) as a way to tackle the above inadequacies Our approach trains a cascade of models sequentially where each model is optimized to be robust towards a mixture of multiple attacks Ultimately, it yields a single model which is secure against a wide range of attacks; namely FGSM, Elastic, Virtual Adversarial Perturbations and Madry On an average, AMC increases the model's empirical robustness against various attacks simultaneously, by a significant margin (of 6225% for MNIST, 5075% for SVHN and 265% for CIFAR10) At the same time, the model's performance on non-adversarial inputs is comparable to the state-of-the-art models

...read moreread less

1 citations

Patent•

Methods and arrangements for prioritizing service restoration activities in the event of a catastrophic failure

[...]

Om D. Deshmukh¹, Sameep Mehta¹, Vinayaka Pandit¹•Institutions (1)

IBM¹

23 Nov 2010

TL;DR: In this paper, a service restoration order is implemented responsive to a service disruption and based on the assimilated input data, which includes determining bufferable and non-bufferable services, postponing restoration of the bufferable services and determining an order of priority of the non-buffered services.

...read moreread less

Abstract: Methods and arrangements for prioritizing customer service restoration, in the event of service failure or compromise such that any adverse effect of the service disruption on the customer is minimized, the perceived drop in quality of service, if any, is minimized and timely and efficient resource reallocation for service restoration is achieved. Input data relating to customer service protocols is assimilated. A service restoration order is implemented responsive to a service disruption and based on the assimilated input data. This implementing includes determining bufferable and non-bufferable services, postponing restoration of the bufferable services, and determining an order of priority of the non-bufferable services.

...read moreread less

1 citations

Proceedings Article•DOI•

Feedback-Based Keyphrase Extraction from Unstructured Text Documents

[...]

Nishtha Madaan¹, Mudit Saxena², Hima Patel¹, Sameep Mehta¹•Institutions (2)

IBM¹, Shiv Nadar University²

01 Jan 2020

TL;DR: This paper brings a human in the loop, and enable a human teacher to give feedback to a key-tags extraction framework in the form of natural language, in which the quality of the output can easily be judged by non-experts.

...read moreread less

Abstract: Machine Learning experts use classification and tagging algorithms considering the black box nature of these algorithms. These algorithms, primarily key-tags extraction from unstructured text documents are meant to capture key concepts in a document. With increasing amount of data, size and complexity of the data, this problem is key in industrial setup. Different possible use cases being in IT support, conversational systems/ chatbots and financial domains, this problem is important as shown in [1], [2]. In this paper, we bring a human in the loop, and enable a human teacher to give feedback to a key-tags extraction framework in the form of natural language. We focus on the problem of key-tags extraction in which the quality of the output can easily be judged by non-experts. Our system automatically reads natural language documents, extracts key concepts and presents an interactive information exploration user interface for analysing these documents.

...read moreread less

1 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
…
23
24
25
26
27
28
29
…
30
31
32
33
34

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

The spread of true and false news online

[...]

Soroush Vosoughi¹, Deb Roy¹, Sinan Aral¹•Institutions (1)

Massachusetts Institute of Technology¹

09 Mar 2018-Science

TL;DR: A large-scale analysis of tweets reveals that false rumors spread further and faster than the truth, and false news was more novel than true news, which suggests that people were more likely to share novel information.

...read moreread less

Abstract: We investigated the differential diffusion of all of the verified true and false news stories distributed on Twitter from 2006 to 2017. The data comprise ~126,000 stories tweeted by ~3 million people more than 4.5 million times. We classified news as true or false using information from six independent fact-checking organizations that exhibited 95 to 98% agreement on the classifications. Falsehood diffused significantly farther, faster, deeper, and more broadly than the truth in all categories of information, and the effects were more pronounced for false political news than for false news about terrorism, natural disasters, science, urban legends, or financial information. We found that false news was more novel than true news, which suggests that people were more likely to share novel information. Whereas false stories inspired fear, disgust, and surprise in replies, true stories inspired anticipation, sadness, joy, and trust. Contrary to conventional wisdom, robots accelerated the spread of true and false news at the same rate, implying that false news spreads more than the truth because humans, not robots, are more likely to spread it.

...read moreread less

4,241 citations

Journal Article•

“Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告

[...]

杉山拓海

12 Sep 2017-Computers & Graphics

3,940 citations

Social Network Analysis

[...]

Tom A. B. Snijders

01 Jan 2012

3,692 citations

Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification

[...]

Joy Buolamwini, Timnit Gebru

21 Jan 2018

TL;DR: It is shown that the highest error involves images of dark-skinned women, while the most accurate result is for light-skinned men, in commercial API-based classifiers of gender from facial images, including IBM Watson Visual Recognition.

...read moreread less

Abstract: The paper “Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification” by Joy Buolamwini and Timnit Gebru, that will be presented at the Conference on Fairness, Accountability, and Transparency (FAT*) in February 2018, evaluates three commercial API-based classifiers of gender from facial images, including IBM Watson Visual Recognition. The study finds these services to have recognition capabilities that are not balanced over genders and skin tones [1]. In particular, the authors show that the highest error involves images of dark-skinned women, while the most accurate result is for light-skinned men.

...read moreread less

2,528 citations

Posted Content•

A Survey on Bias and Fairness in Machine Learning

[...]

Ninareh Mehrabi¹, Fred Morstatter¹, Nripsuta Saxena¹, Kristina Lerman¹, Aram Galstyan¹ - Show less +1 more•Institutions (1)

Information Sciences Institute¹

23 Aug 2019-arXiv: Learning

TL;DR: This survey investigated different real-world applications that have shown biases in various ways, and created a taxonomy for fairness definitions that machine learning researchers have defined to avoid the existing bias in AI systems.

...read moreread less

Abstract: With the widespread use of AI systems and applications in our everyday lives, it is important to take fairness issues into consideration while designing and engineering these types of systems. Such systems can be used in many sensitive environments to make important and life-changing decisions; thus, it is crucial to ensure that the decisions do not reflect discriminatory behavior toward certain groups or populations. We have recently seen work in machine learning, natural language processing, and deep learning that addresses such challenges in different subdomains. With the commercialization of these systems, researchers are becoming aware of the biases that these applications can contain and have attempted to address them. In this survey we investigated different real-world applications that have shown biases in various ways, and we listed different sources of biases that can affect AI applications. We then created a taxonomy for fairness definitions that machine learning researchers have defined in order to avoid the existing bias in AI systems. In addition to that, we examined different domains and subdomains in AI showing what researchers have observed with regard to unfair outcomes in the state-of-the-art methods and how they have tried to address them. There are still many future directions and solutions that can be taken to mitigate the problem of bias in AI systems. We are hoping that this survey will motivate researchers to tackle these issues in the near future by observing existing work in their respective fields.

...read moreread less

1,571 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse