scispace - formally typeset
Search or ask a question
Author

Paavo Alku

Bio: Paavo Alku is an academic researcher from Aalto University. The author has contributed to research in topics: Speech processing & Linear prediction. The author has an hindex of 59, co-authored 433 publications receiving 13272 citations. Previous affiliations of Paavo Alku include Helsinki University of Technology & Johns Hopkins University Applied Physics Laboratory.


Papers
More filters
Journal ArticleDOI
30 Jan 1997-Nature
TL;DR: It is found that the brain's automatic change-detection response, reflected electrically as the mismatch negativity (MMN) was enhanced when the infrequent, deviant stimulus was a prototype relative to when it was a non-prototype (the Estonian /õ/).
Abstract: There is considerable debate about whether the early processing of sounds depends on whether they form part of speech. Proponents of such speech specificity postulate the existence of language-dependent memory traces, which are activated in the processing of speech1–3 but not when equally complex, acoustic non-speech stimuli are processed. Here we report the existence of these traces in the human brain. We presented to Finnish subjects the Finnish phoneme prototype /e/ as the frequent stimulus, and other Finnish phoneme prototypes or a non-prototype (the Estonian prototype /o/) as the infrequent stimulus. We found that the brain's automatic change-detection response, reflected electrically as the mismatch negativity (MMN)4–10, was enhanced when the infrequent, deviant stimulus was a prototype (the Finnish /o/) relative to when it was a non-prototype (the Estonian /o/). These phonemic traces, revealed by MMN, are language-specific, as /o/ caused enhancement of MMN in Estonians. Whole-head magnetic recordings11,12 located the source of this native-language, phoneme-related response enhancement, and thus the language-specific memory traces, in the auditory cortex of the left hemisphere.

1,154 citations

Journal ArticleDOI
01 Jun 1992
TL;DR: The results show that the PSIAIF-algorithm is able to give a fairly accurate estimate for the glottal flow excluding the analysis of vowels with a low first formant that are produced with a pressed phonation type.
Abstract: A new glottal wave analysis method, Pitch Synchronous Iterative Adaptive Inverse Filtering (PSIAIF) is presented. The algorithm is based on a previously developed method, Iterative Adaptive Inverse Filtering (IAIF). In the IAIF-method the glottal contribution to the speech spectrum is first estimated with an iterative structure. The vocal tract transfer function is modeled after eliminating the average glottal contribution. The glottal excitation is obtained by cancelling the effects of the vocal tract and lip radiation by inverse filtering. In the new PSIAIF-method the glottal pulseform is computed by applying the IAIF-algorithm twice to the same signal. The first IAIF-analysis gives as a result a glottal excitation that spans over several pitch periods. This pulseform is used in order to determine positions and lengths of frames for the pitch synchronous analysis. The final result is obtained by analysing the original speech signal with the IAIF-algorithm one fundamental period at a time. The PSIAIF-algorithm was applied in glottal wave analysis using both synthetic and natural vowels. The results show that the method is able to give a fairly accurate estimate for the glottal flow excluding the analysis of vowels with a low first formant that are produced with a pressed phonation type.

499 citations

Journal ArticleDOI
TL;DR: The results demonstrate that, first, auditory orienting deficits in autism cannot be explained by sensory deficits and, second, that orienting deficit in autism might be speech–sound specific.
Abstract: In autism, severe abnormalities in social behavior coexist with aberrant attention and deficient language. In the attentional domain, attention to people and socially relevant stimuli is impaired the most. Because socially meaningful stimulus events are physically complex, a deficiency in sensory processing of complex stimuli has been suggested to contribute to aberrant attention and language in autism. This study used event-related brain potentials (ERP) to examine the sensory and early attentional processing of sounds of different complexity in high-functioning children with autism. Acoustically matched simple tones, complex tones, and vowels were presented in separate oddball sequences, in which a repetitive “standard” sound was occasionally replaced by an infrequent “deviant” sound differing from the standard in frequency (by 10%). In addition to sensory responses, deviant sounds elicited an ERP index of automatic sound-change discrimination, the mismatch negativity, and an ERP index of attentional orienting, the P3a. The sensory sound processing was intact in the high-functioning children with autism and was not affected by sound complexity or “speechness.” In contrast, their involuntary orienting was affected by stimulus nature. It was normal to both simple- and complex-tone changes but was entirely abolished by vowel changes. These results demonstrate that, first, auditory orienting deficits in autism cannot be explained by sensory deficits and, second, that orienting deficit in autism might be speech–sound specific.

357 citations

Journal ArticleDOI
TL;DR: The present study demonstrates the dynamic nature of cortical memory representations for phonemes in adults by using the mismatch negativity (MMN) event-related potential to study Hungarian and Finnish subjects and finds that the MMN for a contrast between two Finnish phoneme was elicited in the fluent Hungarians but not in the naive Hungarians.
Abstract: Learning to speak a new language requires the formation of recognition patterns for the speech sounds specific to the newly acquired language. The present study demonstrates the dynamic nature of cortical memory representations for phonemes in adults by using the mismatch negativity (MMN) event-related potential. We studied Hungarian and Finnish subjects, dividing the Hungarians into a naive (no knowledge of Finnish) and a fluent (in Finnish) group. We found that the MMN for a contrast between two Finnish phonemes was elicited in the fluent Hungarians but not in the naive Hungarians. This result indicates that the fluent Hungarians developed cortical memory representations for the Finnish phoneme system that enabled them to preattentively categorize phonemes specific to this language.

319 citations

Journal ArticleDOI
TL;DR: Comparison between NAQ and its counterpart among the conventional time-domain parameters, the closing quotient, shows that the proposed parameter is more robust against distortion such as measurement noise that make the extraction of conventionalTime-based parameters of the glottal flow problematic.
Abstract: Normalized amplitude quotient (NAQ) is presented as a method to parametrize the glottal closing phase using two amplitude-domain measurements from waveforms estimated by inverse filtering. In this technique, the ratio between the amplitude of the ac flow and the negative peak amplitude of the flow derivative is first computed using the concept of equivalent rectangular pulse, a hypothetical signal located at the instant of the main excitation of the vocal tract. This ratio is then normalized with respect to the length of the fundamental period. Comparison between NAQ and its counterpart among the conventional time-domain parameters, the closing quotient, shows that the proposed parameter is more robust against distortion such as measurement noise that make the extraction of conventional time-based parameters of the glottal flow problematic. Experiments with breathy, normal, and pressed vowels indicate that NAQ is also able to separate the type of phonation effectively.

311 citations


Cited by
More filters
Christopher M. Bishop1
01 Jan 2006
TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

01 Jan 2016
TL;DR: This is an introduction to the event related potential technique, which can help people facing with some malicious bugs inside their laptop to read a good book with a cup of tea in the afternoon.
Abstract: Thank you for downloading an introduction to the event related potential technique. Maybe you have knowledge that, people have look hundreds times for their favorite readings like this an introduction to the event related potential technique, but end up in malicious downloads. Rather than reading a good book with a cup of tea in the afternoon, instead they are facing with some malicious bugs inside their laptop.

2,445 citations