Home
/
Authors
/
Sahidullah

Author

Sahidullah

Other affiliations: French Institute for Research in Computer Science and Automation, Indian Institute of Technology Kharagpur, University of Eastern Finland

Bio: Sahidullah is an academic researcher from University of Lorraine. The author has contributed to research in topics: Spoofing attack & Speaker recognition. The author has an hindex of 23, co-authored 80 publications receiving 1589 citations. Previous affiliations of Sahidullah include French Institute for Research in Computer Science and Automation & Indian Institute of Technology Kharagpur.

Papers published on a yearly basis

2021
2020
2019
2018
2017
2016
2015
2014

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

ASVspoof 2019: Future horizons in spoofed and fake audio detection

[...]

Massimiliano Todisco¹, Xin Wang², Ville Vestman³, Sahidullah, Héctor Delgado¹, Andreas Nautsch¹, Junichi Yamagishi, Nicholas Evans¹, Tomi Kinnunen³, Kong Aik Lee⁴ - Show less +6 more•Institutions (4)

Institut Eurécom¹, National Institute of Informatics², University of Eastern Finland³, Institute for Infocomm Research Singapore⁴

15 Sep 2019

TL;DR: The 2019 database, protocols and challenge results are described, and major findings which demonstrate the real progress made in protecting against the threat of spoofing and fake audio are outlined.

...read moreread less

Abstract: ASVspoof, now in its third edition, is a series of community-led challenges which promote the development of countermeasures to protect automatic speaker verification (ASV) from the threat of spoofing. Advances in the 2019 edition include: (i) a consideration of both logical access (LA) and physical access (PA) scenarios and the three major forms of spoofing attack, namely synthetic, converted and replayed speech; (ii) spoofing attacks generated with state-of-the-art neu-ral acoustic and waveform models; (iii) an improved, controlled simulation of replay attacks; (iv) use of the tandem detection cost function (t-DCF) that reflects the impact of both spoofing and countermeasures upon ASV reliability. Even if ASV remains the core focus, in retaining the equal error rate (EER) as a secondary metric, ASVspoof also embraces the growing importance of fake audio detection. ASVspoof 2019 attracted the participation of 63 research teams, with more than half of these reporting systems that improve upon the performance of two baseline spoofing countermeasures. This paper describes the 2019 database, protocols and challenge results. It also outlines major findings which demonstrate the real progress made in protecting against the threat of spoofing and fake audio.

...read moreread less

341 citations

Journal Article•DOI•

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

[...]

Xin Wang¹, Junichi Yamagishi², Junichi Yamagishi¹, Massimiliano Todisco³, Héctor Delgado³, Andreas Nautsch³, Nicholas Evans³, Sahidullah⁴, Ville Vestman⁵, Tomi Kinnunen⁵, Kong Aik Lee⁶, Lauri Juvela⁷, Paavo Alku⁷, Yu-Huai Peng⁸, Hsin-Te Hwang⁸, Yu Tsao⁸, Hsin-Min Wang⁸, Sébastien Le Maguer⁹, Markus Becker¹⁰, Fergus Henderson¹⁰, Robert A. J. Clark¹⁰, Yu Zhang¹⁰, Quan Wang¹⁰, Ye Jia¹⁰, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu¹¹, Wen-Chin Huang¹¹, Tomoki Toda¹¹, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf¹², Jean-François Bonastre¹², Avashna Govender², Srikanth Ronanki², Jing-Xuan Zhang¹³, Zhen-Hua Ling¹³ - Show less +37 more•Institutions (13)

National Institute of Informatics¹, University of Edinburgh², Institut Eurécom³, University of Lorraine⁴, University of Eastern Finland⁵, NEC⁶, Aalto University⁷, Academia Sinica⁸, Trinity College, Dublin⁹, Google¹⁰, Nagoya University¹¹, University of Avignon¹², University of Science and Technology of China¹³

01 Nov 2020-Computer Speech & Language

TL;DR: The ASVspoof challenge as mentioned in this paper was created to foster research on anti-spoofing and to provide common platforms for the assessment and comparison of spoofing countermeasures, and the first edition focused on replay spoofing attacks and countermeasures.

...read moreread less

211 citations

Proceedings Article•DOI•

ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements

[...]

Héctor Delgado¹, Massimiliano Todisco, Sahidullah², Nicholas Evans¹, Tomi Kinnunen², Kong Aik Lee³, Junichi Yamagishi⁴ - Show less +3 more•Institutions (4)

Institut Eurécom¹, University of Eastern Finland², NEC³, National Institute of Informatics⁴

26 Jun 2018

TL;DR: This paper describes Version 2.0 of the ASVspoof 2017 database which was released to correct data anomalies detected post-evaluation and contains as-yet unpublished meta-data which describes recording and playback devices and acoustic environments which support the analysis of replay detection performance and limits.

...read moreread less

Abstract: The now-acknowledged vulnerabilities of automatic speaker verification (ASV) technology to spoofing attacks have spawned interests to develop so-called spoofing countermeasures. By providing common databases, protocols and metrics for their assessment, the ASVspoof initiative was born to spear-head research in this area. The first competitive ASVspoof challenge held in 2015 focused on the assessment of countermeasures to protect ASV technology from voice conversion and speech synthesis spoofing attacks. The second challenge switched focus to the consideration of replay spoofing attacks and countermeasures. This paper describes Version 2.0 of the ASVspoof 2017 database which was released to correct data anomalies detected post-evaluation. The paper contains as-yet unpublished meta-data which describes recording and playback devices and acoustic environments. These support the analysis of replay detection performance and limits. Also described are new results for the official ASVspoof baseline system which is based upon a constant Q cesptral coefficient frontend and a Gaussian mixture model backend. Reported are enhancements to the baseline system in the form of log-energy coefficients and cepstral mean and variance normalisation in addition to an alternative i-vector backend. The best results correspond to a 48% relative reduction in equal error rate when compared to the original baseline system.

...read moreread less

153 citations

Proceedings Article•DOI•

t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification

[...]

Tomi Kinnunen¹, Kong Aik Lee², Héctor Delgado, Nicholas Evans³, Massimiliano Todisco³, Sahidullah, Junichi Yamagishi⁴, Douglas A. Reynolds⁵ - Show less +4 more•Institutions (5)

University of Eastern Finland¹, NEC², Institut Eurécom³, National Institute of Informatics⁴, Massachusetts Institute of Technology⁵

26 Jun 2018

TL;DR: In this article, the authors proposed a tandem detection cost function (t-DCF) metric to compare the performance of different anti-spoofing countermeasures in isolation from automatic speaker verification (ASV).

...read moreread less

Abstract: The ASVspoof challenge series was born to spearhead research in anti-spoofing for automatic speaker verification (ASV). The two challenge editions in 2015 and 2017 involved the assessment of spoofing countermeasures (CMs) in isolation from ASV using an equal error rate (EER) metric. While a strategic approach to assessment at the time, it has certain shortcomings. First, the CM EER is not necessarily a reliable predic-tor of performance when ASV and CMs are combined. Second, the EER operating point is ill-suited to user authentication applications , e.g. telephone banking, characterised by a high target user prior but a low spoofing attack prior. We aim to migrate from CM-to ASV-centric assessment with the aid of a new tandem detection cost function (t-DCF) metric. It extends the conventional DCF used in ASV research to scenarios involving spoofing attacks. The t-DCF metric has 6 parameters: (i) false alarm and miss costs for both systems, and (ii) prior probabilities of target and spoof trials (with an implied third, nontar-get prior). The study is intended to serve as a self-contained, tutorial-like presentation. We analyse with the t-DCF a selection of top-performing CM submissions to the 2015 and 2017 editions of ASVspoof, with a focus on the spoofing attack prior. Whereas there is little to choose between countermeasure systems for lower priors, system rankings derived with the EER and t-DCF show differences for higher priors. We observe some ranking changes. Findings support the adoption of the DCF-based metric into the roadmap for future ASVspoof challenges, and possibly for other biometric anti-spoofing evaluations.

...read moreread less

147 citations

Journal Article•DOI•

Lung sound classification using cepstral-based statistical features

[...]

Nandini Sengupta¹, Sahidullah², Goutam Saha¹•Institutions (2)

Indian Institute of Technology Kharagpur¹, University of Eastern Finland²

01 Aug 2016-Computers in Biology and Medicine

TL;DR: It is found that the newly investigated features are more robust than existing features and show better recognition accuracy even in low signal-to-noise ratios (SNRs).

...read moreread less

142 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•

다중혈관 관상동맥 환자에서 y-문합을 이용하여 양쪽 내흉동맥만을 사용한 우회술의 조기 성적

[...]

성기익, 이영탁, 박계현, 전태국, 박표원, 한일용, 장윤희 - Show less +3 more

01 Mar 2003-The Korean Journal of Thoracic and Cardiovascular Surgery

28,685 citations

Pattern Recognition and Machine Learning

[...]

Christopher M. Bishop¹•Institutions (1)

Microsoft¹

01 Jan 2006

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

...read moreread less

10,141 citations

Journal Article•DOI•

Analyzing linguistic data: a practical introduction to statistics using R

[...]

Elisabeth Dévière¹•Institutions (1)

Katholieke Universiteit Leuven¹

16 Apr 2009-Journal of Applied Statistics

TL;DR: The author guides the reader in about 350 pages from descriptive and basic statistical methods over classification and clustering to (generalised) linear and mixed models to enable researchers and students alike to reproduce the analyses and learn by doing.

...read moreread less

Abstract: The complete title of this book runs ‘Analyzing Linguistic Data: A Practical Introduction to Statistics using R’ and as such it very well reflects the purpose and spirit of the book. The author guides the reader in about 350 pages from descriptive and basic statistical methods over classification and clustering to (generalised) linear and mixed models. Each of the methods is introduced in the context of concrete linguistic problems and demonstrated on exciting datasets from current research in the language sciences. In line with its practical orientation, the book focuses primarily on using the methods and interpreting the results. This implies that the mathematical treatment of the techniques is held at a minimum if not absent from the book. In return, the reader is provided with very detailed explanations on how to conduct the analyses using R [1]. The first chapter sets the tone being a 20-page introduction to R. For this and all subsequent chapters, the R code is intertwined with the chapter text and the datasets and functions used are conveniently packaged in the languageR package that is available on the Comprehensive R Archive Network (CRAN). With this approach, the author has done an excellent job in enabling researchers and students alike to reproduce the analyses and learn by doing. Another quality as a textbook is the fact that every chapter ends with Workbook sections where the user is invited to exercise his or her analysis skills on supplemental datasets. Full solutions including code, results and comments are given in Appendix A (30 pages). Instructors are therefore very well served by this text, although they might want to balance the book with some more mathematical treatment depending on the target audience. After the introductory chapter on R, the book opens on graphical data exploration. Chapter 3 treats probability distributions and common sampling distributions. Under basic statistical methods (Chapter 4), distribution tests and tests on means and variances are covered. Chapter 5 deals with clustering and classification. Strangely enough, the clustering section has material on PCA, factor analysis, correspondence analysis and includes only one subsection on clustering, devoted notably to hierarchical partitioning methods. The classification part deals with decision trees, discriminant analysis and support vector machines. The regression chapter (Chapter 6) treats linear models, generalised linear models, piecewise linear models and a substantial section on models for lexical richness. The final chapter on mixed models is particularly interesting as it is one of the few text book accounts that introduce the reader to using the (innovative) lme4 package of Douglas Bates which implements linear mixed-effects models. Moreover, the case studies included in this

...read moreread less

1,679 citations

Proceedings Article•DOI•

The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection

[...]

Tomi Kinnunen¹, Md. Sahidullah¹, Héctor Delgado², Massimiliano Todisco², Nicholas Evans², Junichi Yamagishi³, Kong Aik Lee⁴ - Show less +3 more•Institutions (4)

University of Eastern Finland¹, Institut Eurécom², National Institute of Informatics³, Institute for Infocomm Research Singapore⁴

20 Aug 2017

TL;DR: ASVspoof 2017, the second in the series, focused on the development of replay attack countermeasures and indicates that the quest for countermeasures which are resilient in the face of variable replay attacks remains very much alive.

...read moreread less

Abstract: The ASVspoof initiative was created to promote the development of countermeasures which aim to protect automatic speaker verification (ASV) from spoofing attacks. The first community-led, common evaluation held in 2015 focused on countermeasures for speech synthesis and voice conversion spoofing attacks. Arguably, however, it is replay attacks which pose the greatest threat. Such attacks involve the replay of recordings collected from enrolled speakers in order to provoke false alarms and can be mounted with greater ease using everyday consumer devices. ASVspoof 2017, the second in the series, hence focused on the development of replay attack countermeasures. This paper describes the database, protocols and initial findings. The evaluation entailed highly heterogeneous acoustic recording and replay conditions which increased the equal error rate (EER) of a baseline ASV system from 1.76% to 31.46%. Submissions were received from 49 research teams, 20 of which improved upon a baseline replay spoofing detector EER of 24.77%, in terms of replay/non-replay discrimination. While largely successful, the evaluation indicates that the quest for countermeasures which are resilient in the face of variable replay attacks remains very much alive.

...read moreread less

435 citations

Proceedings Article•

Likelihood Ratios for Out-of-Distribution Detection

[...]

Jie Ren¹, Peter J. Liu¹, Emily Fertig¹, Jasper Snoek¹, Ryan Poplin¹, Mark A. DePristo¹, Joshua V. Dillon¹, Balaji Lakshminarayanan¹ - Show less +4 more•Institutions (1)

Google¹

07 Jun 2019

TL;DR: This paper proposed a likelihood ratio method for deep generative models which effectively corrects for these confounding background statistics and achieved state-of-the-art performance on the genomics dataset.

...read moreread less

Abstract: Discriminative neural networks offer little or no performance guarantees when deployed on data not generated by the same process as the training distribution. On such out-of-distribution (OOD) inputs, the prediction may not only be erroneous, but confidently so, limiting the safe deployment of classifiers in real-world applications. One such challenging application is bacteria identification based on genomic sequences, which holds the promise of early detection of diseases, but requires a model that can output low confidence predictions on OOD genomic sequences from new bacteria that were not present in the training data. We introduce a genomics dataset for OOD detection that allows other researchers to benchmark progress on this important problem. We investigate deep generative model based approaches for OOD detection and observe that the likelihood score is heavily affected by population level background statistics. We propose a likelihood ratio method for deep generative models which effectively corrects for these confounding background statistics. We benchmark the OOD detection performance of the proposed method against existing approaches on the genomics dataset and show that our method achieves state-of-the-art performance. Finally, we demonstrate the generality of the proposed method by showing that it significantly improves OOD detection when applied to deep generative models of images.

...read moreread less

425 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse