T
Tanvina B. Patel
Researcher at Dhirubhai Ambani Institute of Information and Communication Technology
Publications - 26
Citations - 495
Tanvina B. Patel is an academic researcher from Dhirubhai Ambani Institute of Information and Communication Technology. The author has contributed to research in topics: Speech synthesis & Computer science. The author has an hindex of 9, co-authored 21 publications receiving 401 citations.
Papers
More filters
Proceedings ArticleDOI
Combining evidences from mel cepstral, cochlear filter cepstral and instantaneous frequency features for detection of natural vs. spoofed speech.
Tanvina B. Patel,Hemant A. Patil +1 more
TL;DR: A detector based on combination of cochlear filter cepstral coefficients (CFCC) and change in instantaneous frequency (IF), (i.e., CFCCIF) to detect natural vs. spoofed speech for ASVspoof 2015 challenge is proposed.
Proceedings ArticleDOI
Novel Variable Length Teager Energy Separation Based Instantaneous Frequency Features for Replay Detection.
TL;DR: A novel replay detector based on Variable length Teager Energy OperatorEnergy Separation Algorithm-Instantaneous Frequency Cosine Coefficients (VESA-IFCC) for the ASV spoof 2017 challenge is proposed and the performance of the proposed feature set is compared with the features developed for detecting synthetic and voice converted speech.
Proceedings ArticleDOI
A syllable-based framework for unit selection synthesis in 13 Indian languages
Hemant A. Patil,Tanvina B. Patel,Nirmesh J. Shah,Hardik B. Sailor,Raghava Krishnan,G. R. Kasthuri,T. Nagarajan,Lilly Christina,Naresh Kumar,Veera Raghavendra,Surekha Kishore,S. R. M. Prasanna,Nagaraj Adiga,Sanasam Ranbir Singh,Konjengbam Anand,Pranaw Kumar,Bira Chandra Singh,S. L. Binil Kumar,T. G. Bhadran,T. Sajini,Arup Saha,T. K. Basu,K. Sreenivasa Rao,N. P. Narendra,Anil Kumar Sao,Rajesh Kumar,Pranhari Talukdar,Purnendu Acharyaa,Somnath Chandra,Swaran Lata,Hema A. Murthy +30 more
TL;DR: A consortium effort on building text to speech (TTS) systems for 13 Indian languages using the same common framework and the TTS systems are evaluated using Mean Opinion Score (DMOS) and Word Error Rate (WER).
Journal ArticleDOI
Cochlear Filter and Instantaneous Frequency Based Features for Spoofed Speech Detection
Tanvina B. Patel,Hemant A. Patil +1 more
TL;DR: It was observed that subband energy variations across CFCCIF when estimated by symmetric difference (CFCCIFS) gave better discriminative properties than CFCC IF, and VC speech is relatively difficult to detect than SS by the SSD system.
Proceedings ArticleDOI
Effectiveness of fundamental frequency (F0) and strength of excitation (SOE) for spoofed speech detection
Tanvina B. Patel,Hemant A. Patil +1 more
TL;DR: This work explores excitation source-based features, i.e., fundamental frequency (F0) contour and strength of excitation (SoE) at the glottis as discriminative features using GMM-based classification system and finds that source features captures complementary information than MFCC and CFCCIF used alone.