
Showing papers in "Computer Speech & Language in 2014"


Journal ArticleDOI
TL;DR: The results show that using either lemma or lexeme information is helpful, as is using the two part-of-speech tagsets (RTS and ERTS), and that lemmatization and the ERTS POS tagset are present in a majority of the settings.

227 citations


Journal ArticleDOI
TL;DR: Extensive evaluation scenarios show that machine translation systems are approaching a good level of maturity and that, in combination with appropriate machine learning algorithms and carefully chosen features, they can be used to build sentiment analysis systems whose performance is comparable to that obtained for English.

180 citations


Journal ArticleDOI
TL;DR: A novel approach to Sentiment Polarity Classification in Twitter posts is presented, based on extracting a vector of weighted nodes from the WordNet graph; this vector is then used with SentiWordNet to compute a final estimate of the polarity.

140 citations
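A minimal sketch of the lexicon-weighted polarity idea from the entry above, assuming a toy hand-made lexicon in place of SentiWordNet and optional per-word weights standing in for the weighted WordNet-graph nodes (all names and scores here are invented for illustration):

```python
# Toy stand-in for SentiWordNet: word polarity scores, invented for this sketch.
TOY_SENTI_LEXICON = {
    "good": 0.6, "great": 0.8, "bad": -0.7, "awful": -0.9, "ok": 0.1,
}

def polarity(tokens, node_weights=None):
    """Weighted average of per-token polarity scores.

    `node_weights` plays the role of the weighted graph nodes in the paper;
    it defaults to uniform weights here.
    """
    scored = [(t, TOY_SENTI_LEXICON[t]) for t in tokens if t in TOY_SENTI_LEXICON]
    if not scored:
        return 0.0
    w = node_weights or {}
    num = sum(w.get(t, 1.0) * s for t, s in scored)
    den = sum(w.get(t, 1.0) for t, _ in scored)
    return num / den
```

Passing a non-uniform `node_weights` dict would let graph-derived importance shift the estimate, which is the core of the paper's approach.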


Journal ArticleDOI
TL;DR: This review article summarises the extensive set of behavioural findings related to human speech modification, identifies which factors appear to be beneficial, and goes on to examine previous computational attempts to improve intelligibility in noise.

107 citations


Journal ArticleDOI
TL;DR: Experimental results based on the NIST 2010 SRE dataset suggest that the proposed VAD outperforms conventional ones whenever interview-style speech is involved, and it is demonstrated that noise reduction is vital for energy-based VAD under low SNR.

98 citations
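The energy-based VAD referenced above can be sketched as follows; the frame length, decision margin, and the assumption that the first frames are speech-free (used for the noise-floor estimate) are illustrative choices, not the paper's configuration:

```python
import numpy as np

# Illustrative energy-based VAD: a noise floor is estimated first, echoing
# the point that noise estimation/reduction matters before thresholding
# frame energy at low SNR.

def energy_vad(signal, frame_len=160, noise_frames=10, margin_db=6.0):
    """Per-frame speech/non-speech decisions from log energy."""
    n = len(signal) // frame_len
    frames = signal[: n * frame_len].reshape(n, frame_len)
    energy_db = 10.0 * np.log10(np.mean(frames ** 2, axis=1) + 1e-12)
    noise_floor = np.mean(energy_db[:noise_frames])  # assumed speech-free
    return energy_db > noise_floor + margin_db

# Demo: 1 s of weak noise with a tone burst in the middle.
rng = np.random.default_rng(0)
signal = 0.01 * rng.standard_normal(16000)
signal[8000:12000] += np.sin(2 * np.pi * 440 * np.arange(4000) / 16000.0)
decisions = energy_vad(signal)
```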


Journal ArticleDOI
TL;DR: Speakers showed two additional modifications as compared to shouted speech, which cannot be interpreted in terms of vocal effort only: they enhanced the modulation of their speech in f0 and vocal intensity, and they boosted their speech spectrum specifically around 3 kHz, in the region of maximum ear sensitivity associated with the actor's or singer's formant.

94 citations


Journal ArticleDOI
Man-Hung Siu1, Herbert Gish1, Arthur Chan1, William Belfield1, Steve Lowe1 
TL;DR: This work proposes building HMM-based speech recognizers without transcribed data by formulating the HMM training as an optimization over both the parameter and transcription sequence space, and describes how self-organizing unit (SOU) training can be easily implemented using existing HMM recognition tools.

90 citations


Journal ArticleDOI
TL;DR: An overview of the latest trends in the subjectivity and sentiment analysis fields is presented and the manner in which the articles contained in the special issue contribute to the advancement of the area is described.

90 citations


Journal ArticleDOI
TL;DR: This paper proposes an unsupervised signal-derived approach within a principal component analysis framework for quantifying one aspect of entrainment in communication, namely vocal entrainment; the approach involves measuring the similarity of specific vocal characteristics between the interlocutors in a dialog.

83 citations
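A rough sketch of the signal-derived idea in the entry above: project each speaker's turn-level vocal features (e.g. pitch and energy statistics) onto principal components learned from the pooled dialog, then score how close the two speakers sit in that space. The distance-to-similarity mapping and the demo data are illustrative assumptions, not the paper's exact measure:

```python
import numpy as np

def entrainment_similarity(feats_a, feats_b, n_components=2):
    """Similarity of the two speakers' centroids in a shared PCA space.

    Rows of `feats_a`/`feats_b` are per-turn feature vectors. Returns a
    value in (0, 1]; higher means the speakers are closer (an illustrative
    proxy for vocal entrainment).
    """
    X = np.vstack([feats_a, feats_b])
    mu = X.mean(axis=0)
    # PCA via SVD of the pooled, centered feature matrix.
    _, _, vt = np.linalg.svd(X - mu, full_matrices=False)
    pcs = vt[:n_components].T
    a = ((feats_a - mu) @ pcs).mean(axis=0)
    b = ((feats_b - mu) @ pcs).mean(axis=0)
    return float(1.0 / (1.0 + np.linalg.norm(a - b)))

# Demo on synthetic "turn features": identical speakers score higher than
# speakers whose features have drifted apart.
rng = np.random.default_rng(0)
turns_a = rng.standard_normal((20, 5))
same = entrainment_similarity(turns_a, turns_a.copy())
diverged = entrainment_similarity(turns_a, turns_a + 3.0)
```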


Journal ArticleDOI
TL;DR: A comprehensive overview of techniques for glottal source processing can be found in this article, where the authors discuss how these tools and techniques might be properly integrated in various voice technology applications.

81 citations


Journal ArticleDOI
TL;DR: Deep bidirectional LSTM networks processing log Mel filterbank outputs deliver best results with clean models, reaching down to 42% word error rate (WER) at signal-to-noise ratios ranging from −6 to 9 dB.
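The log Mel filterbank front-end mentioned above can be sketched with numpy; 40 filters and a 512-point FFT are typical values, not necessarily the paper's exact configuration:

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def log_mel_energies(frame, sr=16000, n_fft=512, n_mels=40):
    """Log energies of triangular Mel filters applied to one frame."""
    power = np.abs(np.fft.rfft(frame, n_fft)) ** 2
    # Filter edges equally spaced on the Mel scale, mapped back to FFT bins.
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        l, c, r = bins[i], bins[i + 1], bins[i + 2]
        fb[i, l:c] = (np.arange(l, c) - l) / max(c - l, 1)   # rising slope
        fb[i, c:r] = (r - np.arange(c, r)) / max(r - c, 1)   # falling slope
    return np.log(fb @ power + 1e-10)

# Demo: a 1 kHz tone should peak in the band whose centre is nearest 1 kHz.
frame = np.sin(2 * np.pi * 1000 * np.arange(400) / 16000.0)
feats = log_mel_energies(frame)
```

In the paper such per-frame vectors, stacked over time, form the input sequence to the bidirectional LSTM.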

Journal ArticleDOI
TL;DR: A domain-independent statistical methodology for developing dialog managers for spoken dialog systems; it allows rapid development of new dialog managers and exploration of new dialog strategies, making it possible to build enhanced versions of existing systems.

Journal ArticleDOI
TL;DR: The results indicate that the proposed scheme can be effectively employed in real applications to detect emotional speech, and can lead to accuracies as high as 75.8% in binary emotion classification.

Journal ArticleDOI
TL;DR: Traditional dialogue systems use a fixed silence threshold to detect the end of users' turns, but this simplistic model can result in system behaviour that is both interruptive and unresponsive.
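The fixed-silence-threshold model that the paper criticizes reduces to counting consecutive non-speech frames; the 30-frame value below is illustrative:

```python
# Simplistic end-of-turn detection: the turn is declared over once a fixed
# number of consecutive non-speech frames is observed.

def end_of_turn_frame(speech_flags, silence_frames=30):
    """Index of the frame where the turn is declared over, or None."""
    run = 0
    for i, is_speech in enumerate(speech_flags):
        run = 0 if is_speech else run + 1
        if run >= silence_frames:
            return i
    return None
```

The weakness is visible in the parameter itself: a short threshold cuts users off during mid-utterance pauses (interruptive), while a long one delays every system response (unresponsive).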

Journal ArticleDOI
TL;DR: The classification results show that the NPS data and the pedophiles' conversations can be accurately discriminated from each other with character n-grams, while in the more complicated case of cybersex logs there is need for high-level features to reach good accuracy levels.
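Character n-gram classification as used above can be sketched with a nearest-profile scheme; the two language "profiles" below stand in for per-class training data and are invented for illustration:

```python
from collections import Counter

def char_ngrams(text, n=3):
    """Counter of overlapping character n-grams."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))

def cosine(c1, c2):
    num = sum(c1[g] * c2[g] for g in set(c1) & set(c2))
    den = (sum(v * v for v in c1.values()) ** 0.5) * \
          (sum(v * v for v in c2.values()) ** 0.5)
    return num / den if den else 0.0

def nearest_profile(text, profiles):
    """Label whose n-gram profile is most cosine-similar to the text."""
    return max(profiles, key=lambda lbl: cosine(char_ngrams(text), profiles[lbl]))

# Invented demo profiles (one short sentence each, not real training data).
profiles = {
    "english": char_ngrams("the quick brown fox jumps over the lazy dog"),
    "dutch": char_ngrams("de snelle bruine vos springt over de luie hond"),
}
```

The paper's point is that such low-level features suffice for the easier class pair, while the harder case needs higher-level features on top.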

Journal ArticleDOI
TL;DR: A speech pre-processing algorithm is presented that improves the speech intelligibility in noise for the near-end listener by optimally redistributing the speech energy over time and frequency according to a perceptual distortion measure, which is based on a spectro-temporal auditory model.
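A grossly simplified sketch of the energy-redistribution constraint above: boost bands where the speech-to-noise ratio is poor, then rescale so total speech energy is unchanged. The real method optimizes a spectro-temporal perceptual distortion measure; only the redistribution idea is shown, and `alpha` is an invented tuning knob:

```python
import numpy as np

def redistribute(speech_bands, noise_bands, alpha=0.5):
    """Reshape per-band speech energies under an equal-energy constraint."""
    snr = speech_bands / (noise_bands + 1e-12)
    shaped = speech_bands * snr ** (-alpha)          # boost low-SNR bands
    return shaped * speech_bands.sum() / shaped.sum()  # keep total energy

# Demo: two bands, the second one masked by noise.
speech = np.array([4.0, 1.0])
noise = np.array([1.0, 1.0])
shaped = redistribute(speech, noise)
```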

Journal ArticleDOI
TL;DR: By fusing participants' systems, it is shown that binary classification of alcoholisation and sleepiness from short-term observations, i.e., single utterances, can both reach over 72% accuracy on unseen test data; and it is demonstrated that these medium-term states can be recognised more robustly by fusing short-term classifiers along the time axis.

Journal ArticleDOI
TL;DR: The present work investigates the performance of autoregressive (AR) parameter features, including gain and reflection coefficients in addition to the traditional linear prediction coefficients (LPC), for recognizing emotions from speech signals, and finds that the reflection-coefficient features recognize emotions better than the LPC features.
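All three AR features named above fall out of the Levinson-Durbin recursion on a frame's autocorrelation; the sketch below uses the residual energy's square root as the gain term, a common proxy rather than necessarily the paper's definition:

```python
import numpy as np

def levinson_durbin(frame, order=8):
    """Return (lpc_polynomial, reflection_coefficients, gain) for one frame."""
    r = np.array([frame[: len(frame) - k] @ frame[k:] for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    ks = []
    for i in range(1, order + 1):
        prev = a[1:i].copy()
        k = -(r[i] + prev @ r[1:i][::-1]) / err  # i-th reflection coefficient
        ks.append(k)
        a[1:i] = prev + k * prev[::-1]           # update predictor polynomial
        a[i] = k
        err *= 1.0 - k * k                       # shrink residual energy
    return a, np.array(ks), np.sqrt(err)

# Demo: an AR(1) signal x[n] = 0.9 x[n-1] + e[n]; the first reflection
# coefficient should come out near -0.9.
rng = np.random.default_rng(1)
x = np.zeros(4096)
e = rng.standard_normal(4096)
for n in range(1, 4096):
    x[n] = 0.9 * x[n - 1] + e[n]
lpc, ks, gain = levinson_durbin(x, order=4)
```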

Journal ArticleDOI
TL;DR: The latter part of this work focuses mainly on a novel frequency warping technique that is shown to achieve vowel space expansion; this technique is incorporated into an established Lombard-inspired Spectral Shaping method paired with Dynamic Range Compression to maximize speech audibility (SSDRC).

Journal ArticleDOI
TL;DR: The proposed supervised i-vector approach outperforms the i-vector baseline by 12% and 7% relative in terms of EER and old minDCF values, respectively, and the use of Gammatone frequency cepstral coefficients, Mel-frequency cepstral coefficients and spectro-temporal Gabor features in conjunction with shifted-delta-cepstral features improves the overall language identification performance significantly.

Journal ArticleDOI
TL;DR: A set of features is presented which enable us to distinguish automatically between prior and contextual emotion, with a focus on exploring features important in this task, and a promising learning method is shown which significantly outperforms two reasonable baselines.

Journal ArticleDOI
TL;DR: It is shown that using phoneme-level emotion classes can improve classification performance even with comparably low speech recognition performance obtained with scant a priori knowledge about the language.

Journal ArticleDOI
TL;DR: This paper identifies two methods that are able to incorporate subjectivity information originating from different languages, namely co-training and multilingual vector spaces, and shows that for this task the latter method is better suited and obtains superior results.

Journal ArticleDOI
TL;DR: This research presents the first benchmark Arabic data set that contains 610 students’ short answers together with their English translations, and focuses on applying multiple similarity measures separately and in combination.
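Applying similarity measures "separately and in combination", as the entry above describes, can be illustrated with two simple token-overlap measures and an equal-weight fusion; Jaccard, containment, the weights, and the demo sentences are all illustrative choices, not the paper's actual measures or data:

```python
def jaccard(a_tokens, b_tokens):
    a, b = set(a_tokens), set(b_tokens)
    return len(a & b) / len(a | b) if a | b else 0.0

def containment(answer_tokens, model_tokens):
    a, m = set(answer_tokens), set(model_tokens)
    return len(a & m) / len(m) if m else 0.0

def combined_score(answer, model, weights=(0.5, 0.5)):
    """Weighted fusion of two token-overlap measures."""
    a, m = answer.lower().split(), model.lower().split()
    return weights[0] * jaccard(a, m) + weights[1] * containment(a, m)

# Invented demo: grade two student answers against a model answer.
model = "photosynthesis converts light energy into chemical energy"
good = "light energy is converted into chemical energy"
bad = "plants are green"
```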

Journal ArticleDOI
TL;DR: The evaluation results show that the synthesized voices with varying vocal effort are rated similarly to their natural counterparts both in terms of intelligibility and suitability.

Journal ArticleDOI
TL;DR: This paper presents the 2011 and 2012 MediaEval results, and compares the relative merits and weaknesses of approaches developed by participants, providing analysis and directions for future research, in order to improve voice access to spoken information in low resource settings.

Journal ArticleDOI
TL;DR: In this paper, the authors investigated four channel compensation techniques for the purpose of improving i-vector speaker verification performance in the presence of high intersession variability using the NIST 2008 and 2010 SRE corpora.

Journal ArticleDOI
TL;DR: This paper investigates the temporal excitation patterns of creaky voice using a variety of languages and speakers, on both read and conversational data, and includes a mutual information-based assessment of the various acoustic features proposed in the literature for detecting creaky voice.

Journal ArticleDOI
TL;DR: The results indicate that the proposed method, only requiring a few minutes to record and analyze the patient's voice during the visit to the specialist, could help in the development of a non-intrusive, fast and convenient PSG-complementary screening technique for OSA.

Journal ArticleDOI
TL;DR: The first large, annotated, motion-capture corpus of unscripted ASL is built, enabling new animation-synthesis research; the collected data is used to synthesize novel ASL animations, which have been evaluated in experimental studies with native signers.