scispace - formally typeset
Search or ask a question

Showing papers in "Journal of the Acoustical Society of America in 2016"


Journal ArticleDOI
TL;DR: A review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality is provided.
Abstract: As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed.

233 citations


PatentDOI
TL;DR: In this paper, an automated equalizing system including a microphone unit, a memory for storing characteristic data of the loudspeaker units and further for storing the frequency response measurements is described.
Abstract: An automated process for equalizing an audio system and an apparatus for implementing the process. An audio system includes a microphone unit, for receiving the sound waves radiated from a plurality of speakers, acoustic measuring circuitry, for calculating frequency response measurements; a memory, for storing characteristic data of the loudspeaker units and further for storing the frequency response measurements; and equalization calculation circuitry, for calculating an equalization pattern responsive to the digital data and responsive to the characteristic data of the plurality of loudspeaker units. Also described is an automated equalizing system including a acoustic measuring circuitry including a microphone for measuring frequency response at a plurality of locations; a memory, for storing the frequency responses at the plurality of locations; and equalization calculation circuitry, for calculating, from the frequency responses, an optimized equalization pattern.

198 citations


Journal ArticleDOI
Jitong Chen1, Yuxuan Wang1, Sarah E. Yoho1, DeLiang Wang1, Eric W. Healy1 
TL;DR: The results indicate that DNN-based supervised speech segregation with large-scale training is a very promising approach for generalization to new acoustic environments.
Abstract: Supervised speech segregation has been recently shown to improve human speech intelligibility in noise, when trained and tested on similar noises However, a major challenge involves the ability to generalize to entirely novel noises Such generalization would enable hearing aid and cochlear implant users to improve speech intelligibility in unknown noisy environments This challenge is addressed in the current study through large-scale training Specifically, a deep neural network (DNN) was trained on 10 000 noises to estimate the ideal ratio mask, and then employed to separate sentences from completely new noises (cafeteria and babble) at several signal-to-noise ratios (SNRs) Although the DNN was trained at the fixed SNR of − 2 dB, testing using hearing-impaired listeners demonstrated that speech intelligibility increased substantially following speech segregation using the novel noises and unmatched SNR conditions of 0 dB and 5 dB Sentence intelligibility benefit was also observed for normal-hearing listeners in most noisy conditions The results indicate that DNN-based supervised speech segregation with large-scale training is a very promising approach for generalization to new acoustic environments

170 citations


Journal ArticleDOI
TL;DR: A method for the characterization of the speech signals, based on the automatic segmentation of utterances into voiced and unvoiced frames, is addressed here and proves to be more accurate than classical approaches in the automatic classification of speech of people with PD and healthy controls.
Abstract: The aim of this study is the analysis of continuous speech signals of people with Parkinson's disease (PD) considering recordings in different languages (Spanish, German, and Czech). A method for the characterization of the speech signals, based on the automatic segmentation of utterances into voiced and unvoiced frames, is addressed here. The energy content of the unvoiced sounds is modeled using 12 Mel-frequency cepstral coefficients and 25 bands scaled according to the Bark scale. Four speech tasks comprising isolated words, rapid repetition of the syllables /pa/-/ta/-/ka/, sentences, and read texts are evaluated. The method proves to be more accurate than classical approaches in the automatic classification of speech of people with PD and healthy controls. The accuracies range from 85% to 99% depending on the language and the speech task. Cross-language experiments are also performed confirming the robustness and generalization capability of the method, with accuracies ranging from 60% to 99%. This work comprises a step forward for the development of computer aided tools for the automatic assessment of dysarthric speech signals in multiple languages.

149 citations


Journal ArticleDOI
TL;DR: The reflection of sound of frequency below 1 kHz, by a rigid-backed structure that contains sub-wavelength resonators is studied, and a method of analysis of absorption that uses the structure of the reflection coefficient in the complex frequency plane is proposed.
Abstract: The reflection of sound of frequency below 1 kHz, by a rigid-backed structure that contains sub-wavelength resonators is studied in this work. In particular, only single mode reflected waves are considered, an approximation which is accurate in this low frequency regime. A method of analysis of absorption that uses the structure of the reflection coefficient in the complex frequency plane is proposed. In the absence of losses, the reflection coefficient supports pairs of poles and zeros that are complex conjugate and which have imaginary parts linked to the energy leakage by radiation. When losses are introduced and balanced to the leakage, the critical coupling condition is satisfied and total absorption is obtained. Examples of a slot resonator and of multiple Helmholtz resonators are analyzed to obtain both narrow and broadband total absorption.

143 citations


Journal ArticleDOI
TL;DR: Results do not support an association between exposure to WTN up to 46 dBA and the evaluated health-related endpoints and beyond annoyance, results do notsupport an association among participants aged 18-79 years between WTN levels and these endpoints.
Abstract: Health Canada, in collaboration with Statistics Canada, and other external experts, conducted the Community Noise and Health Study to better understand the impacts of wind turbine noise (WTN) on health and well-being. A cross-sectional epidemiological study was carried out between May and September 2013 in southwestern Ontario and Prince Edward Island on 1238 randomly selected participants (606 males, 632 females) aged 18-79 years, living between 0.25 and 11.22 km from operational wind turbines. Calculated outdoor WTN levels at the dwelling reached 46 dBA. Response rate was 78.9% and did not significantly differ across sample strata. Self-reported health effects (e.g., migraines, tinnitus, dizziness, etc.), sleep disturbance, sleep disorders, quality of life, and perceived stress were not related to WTN levels. Visual and auditory perception of wind turbines as reported by respondents increased significantly with increasing WTN levels as did high annoyance toward several wind turbine features, including the following: noise, blinking lights, shadow flicker, visual impacts, and vibrations. Concern for physical safety and closing bedroom windows to reduce WTN during sleep also increased with increasing WTN levels. Other sample characteristics are discussed in relation to WTN levels. Beyond annoyance, results do not support an association between exposure to WTN up to 46 dBA and the evaluated health-related endpoints.

127 citations


Journal ArticleDOI
TL;DR: The underlying mechanism of these types of AMMs for tuning the effective density will be examined first and the opportunities, limitations, and challenges of membrane- or plate-type AMMs will be discussed.
Abstract: Over the past decade there has been a great amount of research effort devoted to the topic of acoustic metamaterials (AMMs). The recent development of AMMs has enlightened the way of manipulating sound waves. Several potential applications such as low-frequency noise reduction, cloaking, angular filtering, subwavelength imaging, and energy tunneling have been proposed and implemented by the so-called membrane- or plate-type AMMs. This paper aims to offer a thorough overview on the recent development of membrane- or plate-type AMMs. The underlying mechanism of these types of AMMs for tuning the effective density will be examined first. Four different groups of membrane- or plate-type AMMs (membranes with masses attached, plates with masses attached, membranes or plates without masses attached, and active AMMs) will be reviewed. The opportunities, limitations, and challenges of membrane- or plate-type AMMs will be also discussed.

125 citations


Journal ArticleDOI
TL;DR: The results show that the medial surface thickness has dominant effects on the vertical phase difference between the upper and lower margins of the medialsurface, closed quotient, H1-H2, and higher-order harmonics excitation.
Abstract: The goal of this study is to better understand the cause-effect relation between vocal fold physiology and the resulting vibration pattern and voice acoustics. Using a three-dimensional continuum model of phonation, the effects of changes in vocal fold stiffness, medial surface thickness in the vertical direction, resting glottal opening, and subglottal pressure on vocal fold vibration and different acoustic measures are investigated. The results show that the medial surface thickness has dominant effects on the vertical phase difference between the upper and lower margins of the medial surface, closed quotient, H1-H2, and higher-order harmonics excitation. The main effects of vocal fold approximation or decreasing resting glottal opening are to lower the phonation threshold pressure, reduce noise production, and increase the fundamental frequency. Increasing subglottal pressure is primarily responsible for vocal intensity increase but also leads to significant increase in noise production and an increased fundamental frequency. Increasing AP stiffness significantly increases the fundamental frequency and slightly reduces noise production. The interaction among vocal fold thickness, stiffness, approximation, and subglottal pressure in the control of F0, vocal intensity, and voice quality is discussed.

97 citations


Journal ArticleDOI
TL;DR: It is suggested that using external calibrated microphones greatly improves the overall accuracy and precision of smartphone sound measurements, and removes much of the variability and limitations associated with the built-in smartphone microphones.
Abstract: This follow-up study examines the accuracy of selected smartphone sound measurement applications (apps) using external calibrated microphones. The initial study examined 192 apps on the iOS and Android platforms and found four iOS apps with mean differences of ±2 dB of a reference sound level measurement system. This study evaluated the same four apps using external microphones. The results showed measurements within ±1 dB of the reference. This study suggests that using external calibrated microphones greatly improves the overall accuracy and precision of smartphone sound measurements, and removes much of the variability and limitations associated with the built-in smartphone microphones.

87 citations


Journal ArticleDOI
TL;DR: It is shown that the width of a large absorption peak in the frequency spectrum can be broadened if several rectangular channels in the unit cell are detuned and the detuning is achieved by varying the length of the side resonators for each channel.
Abstract: Perfect (100%) absorption by thin structures consisting of a periodic arrangement of rectangular quarter-wavelength channels with side detuned quarter-wavelength resonators is demonstrated. The thickness of these structures is 13-17 times thinner than the acoustic wavelength. This low frequency absorption is due to a slow sound wave propagating in the main rectangular channel. A theoretical model is proposed to predict the complex wavenumber in this channel. It is shown that the speed of sound in the channel is much lower than in the air, almost independent of the frequency in the low frequency range, and it is dispersive inside the induced transparency band which is observed. The perfect absorption condition is found to be caused by a critical coupling between the rectangular channel (sub-wavelength resonators) and the incoming wave. It is shown that the width of a large absorption peak in the frequency spectrum can be broadened if several rectangular channels in the unit cell are detuned. The detuning is achieved by varying the length of the side resonators for each channel. The predicted absorption coefficients are validated experimentally. Two resonant cells were produced with stereolithography which enabled the authors to incorporate curved side resonators.

84 citations


Journal ArticleDOI
TL;DR: Despite the few non-musicians who performed as well as musicians, on a group level, there was a strong musician benefit for speech perception in a speech masker, which does not seem to result from better voice processing and could instead be related to better stream segregation or enhanced cognitive functions.
Abstract: Evidence for transfer of musical training to better perception of speech in noise has been mixed. Unlike speech-in-noise, speech-on-speech perception utilizes many of the skills that musical training improves, such as better pitch perception and stream segregation, as well as use of higher-level auditory cognitive functions, such as attention. Indeed, despite the few non-musicians who performed as well as musicians, on a group level, there was a strong musician benefit for speech perception in a speech masker. This benefit does not seem to result from better voice processing and could instead be related to better stream segregation or enhanced cognitive functions.

Journal ArticleDOI
TL;DR: The analysis suggests that communities are between 11 and 26 dB less tolerant ofWTN than of other transportation noise sources, and the role of community tolerance level as a complement and/or an alternative to multiple regression in predicting the prevalence of WTN annoyance is provided.
Abstract: The possibility that wind turbine noise (WTN) affects human health remains controversial. The current analysis presents results related to WTN annoyance reported by randomly selected participants (606 males, 632 females), aged 18-79, living between 0.25 and 11.22 km from wind turbines. WTN levels reached 46 dB, and for each 5 dB increase in WTN levels, the odds of reporting to be either very or extremely (i.e., highly) annoyed increased by 2.60 [95% confidence interval: (1.92, 3.58), p < 0.0001]. Multiple regression models had R(2)'s up to 58%, with approximately 9% attributed to WTN level. Variables associated with WTN annoyance included, but were not limited to, other wind turbine-related annoyances, personal benefit, noise sensitivity, physical safety concerns, property ownership, and province. Annoyance was related to several reported measures of health and well-being, although these associations were statistically weak (R(2 )< 9%), independent of WTN levels, and not retained in multiple regression models. The role of community tolerance level as a complement and/or an alternative to multiple regression in predicting the prevalence of WTN annoyance is also provided. The analysis suggests that communities are between 11 and 26 dB less tolerant of WTN than of other transportation noise sources.

Journal ArticleDOI
TL;DR: Simulations and experimental measurements show that a composite prior is introduced, which simultaneously promotes a piecewise constant profile and sparsity in the solution, provides high-resolution DOA estimation in a general framework, i.e., in the presence of spatially extended sources.
Abstract: Direction-of-arrival (DOA) estimation refers to the localization of sound sources on an angular grid from noisy measurements of the associated wavefield with an array of sensors. For accurate localization, the number of angular look-directions is much larger than the number of sensors, hence, the problem is underdetermined and requires regularization. Traditional methods use an l2-norm regularizer, which promotes minimum-power (smooth) solutions, while regularizing with l1-norm promotes sparsity. Sparse signal reconstruction improves the resolution in DOA estimation in the presence of a few point sources, but cannot capture spatially extended sources. The DOA estimation problem is formulated in a Bayesian framework where regularization is imposed through prior information on the source spatial distribution which is then reconstructed as the maximum a posteriori estimate. A composite prior is introduced, which simultaneously promotes a piecewise constant profile and sparsity in the solution. Simulations and experimental measurements show that this choice of regularization provides high-resolution DOA estimation in a general framework, i.e., in the presence of spatially extended sources.

Journal ArticleDOI
TL;DR: Speech reception thresholds measured using resynthesized ITFS-processed stimuli were the same for the reference and comparison conditions supporting the conclusion that the amount of energetic masking across conditions was the same.
Abstract: Identification of target speech was studied under masked conditions consisting of two or four independent speech maskers. In the reference conditions, the maskers were colocated with the target, the masker talkers were the same sex as the target, and the masker speech was intelligible. The comparison conditions, intended to provide release from masking, included different-sex target and masker talkers, time-reversal of the masker speech, and spatial separation of the maskers from the target. Significant release from masking was found for all comparison conditions. To determine whether these reductions in masking could be attributed to differences in energetic masking, ideal time-frequency segregation (ITFS) processing was applied so that the time-frequency units where the masker energy dominated the target energy were removed. The remaining target-dominated “glimpses” were reassembled as the stimulus. Speech reception thresholds measured using these resynthesized ITFS-processed stimuli were the same for the reference and comparison conditions supporting the conclusion that the amount of energetic masking across conditions was the same. These results indicated that the large release from masking found under all comparison conditions was due primarily to a reduction in informational masking. Furthermore, the large individual differences observed generally were correlated across the three masking release conditions.

Journal ArticleDOI
TL;DR: Factors that influence the speed of sound in wood, including the different types of acoustic waves which propagate in tree stems and lumber, are described.
Abstract: This paper provides a review on the use of acoustics to measure stiffness of standing trees, stems, and logs. An outline is given of the properties of wood and how these are related to stiffness and acoustic velocity throughout the tree. Factors are described that influence the speed of sound in wood, including the different types of acoustic waves which propagate in tree stems and lumber. Acoustic tools and techniques that have been used to measure the stiffness of wood are reviewed. The reasons for a systematic difference between direct and acoustic measurements of stiffness for standing trees, and methods for correction, are discussed. Other techniques, which have been used in addition to acoustics to try to improve stiffness measurements, are also briefly described. Also reviewed are studies which have used acoustic tools to investigate factors that influence the stiffness of trees. These factors include different silvicultural practices, geographic and environmental conditions, and genetics.

Journal ArticleDOI
TL;DR: The experimental results show that Locadiff achieved to detect and locate the cracking zones in the core of the concrete beam at an early stage by mapping the changes in the concrete's micro-structure and monitor the internal stress level in both temporal and spatial domains by mapped the variation in velocity caused by the acousto-elastic effect.
Abstract: This paper describes the use of an ultrasonic imaging technique (Locadiff) for the Non-Destructive Testing & Evaluation of a concrete structure. By combining coda wave interferometry and a sensitivity kernel for diffuse waves, Locadiff can monitor the elastic and structural properties of a heterogeneous material with a high sensitivity, and can map changes of these properties over time when a perturbation occurs in the bulk of the material. The applicability of the technique to life-size concrete structures is demonstrated through the monitoring of a 15-ton reinforced concrete beam subject to a four-point bending test causing cracking. The experimental results show that Locadiff achieved to (1) detect and locate the cracking zones in the core of the concrete beam at an early stage by mapping the changes in the concrete's micro-structure; (2) monitor the internal stress level in both temporal and spatial domains by mapping the variation in velocity caused by the acousto-elastic effect. The mechanical behavior of the concrete structure is also studied using conventional techniques such as acoustic emission, vibrating wire extensometers, and digital image correlation. The performances of the Locadiff technique in the detection of early stage cracking are assessed and discussed.

Journal ArticleDOI
TL;DR: It is shown that it is crucial to include the pressure distribution of the acoustic characterization beam in the modeling to accurately determine shell parameters of non-linearly oscillating bubbles.
Abstract: The sensitivity and efficiency in contrast-enhanced ultrasound imaging and therapy can potentially be increased by the use of resonant monodisperse bubbles. However, bubbles of the same size may respond differently to ultrasound due to differences in their phospholipid shell. In an acoustic bubble sorting chip, resonant bubbles can be separated from the polydisperse agent. Here, a sample of acoustically sorted bubbles is characterized by measuring scattering and attenuation simultaneously using narrowband acoustic pulses at peak negative pressures of 10, 25, and 50 kPa over a 0.7-5.5 MHz frequency range. A second sample is characterized by attenuation measurements at acoustic pressures ranging from 5 to 75 kPa in steps of 2.5 kPa. Scattering and attenuation coefficients were modeled by integration over the pressure and frequency dependent response of all bubbles located within the non-uniform acoustic characterization beam. For all driving pressures and frequencies employed here, the coefficients could be modeled using a single and unique set of shell parameters confirming that acoustically sorted bubbles provide a uniform acoustic response. Moreover, it is shown that it is crucial to include the pressure distribution of the acoustic characterization beam in the modeling to accurately determine shell parameters of non-linearly oscillating bubbles.

Journal ArticleDOI
Jørgen Hald1
TL;DR: The iterative method is shown to provide very similar results in most cases and to be computationally much more efficient.
Abstract: Patch near-field acoustical holography methods like statistically optimized near-field acoustical holography and equivalent source method are limited to relatively low frequencies, where the average array-element spacing is less than half of the acoustic wavelength, while beamforming provides useful resolution only at medium-to-high frequencies. With adequate array design, both methods can be used with the same array. But for holography to provide good low-frequency resolution, a small measurement distance is needed, whereas beamforming requires a larger distance to limit sidelobe issues. The wideband holography method of the present paper was developed to overcome that practical conflict. Only a single measurement is needed at a relatively short distance and a single result is obtained covering the full frequency range. The method uses the principles of compressed sensing: A sparse sound field representation is assumed with a chosen set of basis functions, a measurement is taken with an irregular array, and the inverse problem is solved with a method that enforces sparsity in the coefficient vector. Instead of using regularization based on the 1-norm of the coefficient vector, an iterative solution procedure is used that promotes sparsity. The iterative method is shown to provide very similar results in most cases and to be computationally much more efficient.

Journal ArticleDOI
TL;DR: Only the scale-adaptive simulation and the lattice Boltzmann method simulations which can describe most of the turbulent structures accurately provide the proper spectral shape and levels, and consequently the overall sound power level.
Abstract: A joint experimental and numerical study has been achieved on a low-speed axial ring fan in clean inflow. Experimental evidence shows large periodic broadband humps at lower frequencies than the blade passing frequencies and harmonics even at design conditions. These sub-harmonic humps are also found to be sensitive to the fan process and consequently to its tip geometry. Softer fans yield more intense humps more shifted to lower frequencies with respect to the fan harmonics. Unsteady turbulent flow simulations of this ring fan mounted on a test plenum have been achieved by four different methods that have been validated by comparing with overall performances and detailed hot-wire velocity measurements in the wake. Noise predictions are either obtained directly or are obtained through Ffowcs Williams and Hawkings' analogy, and compared with narrowband and third-octave power spectra. All unsteady simulations correctly capture the low flow rates, the coherent vortex dynamics in the tip clearance and consequently the noise radiation dominated by the tip noise in the low- to mid-frequency range. Yet, only the scale-adaptive simulation and the lattice Boltzmann method simulations which can describe most of the turbulent structures accurately provide the proper spectral shape and levels, and consequently the overall sound power level.

Journal ArticleDOI
TL;DR: Findings are taken as support for a model on which F0 perturbations are fundamentally the result of laryngeal maneuvers initiated to sustain or inhibit phonation, regardless of other language-particular aspects of phonetic realization.
Abstract: This study investigates consonant-related F0 perturbations (“CF0”) in French and Italian by comparing the effects of voiced and voiceless obstruents on F0 to those of voiced sonorants. The voiceless obstruents /p f/ in both languages are found to have F0-raising properties similar to American English voiceless obstruents, while F0 following the (pre)voiced obstruents /b v/ in French and Italian patterns together with /m/, again similar to English [Hanson (2009). J. Acoust. Soc. Am. 125(1), 425–441]. In both languages, F0 is significantly depressed, relative to sonorants, during the closure for voiced obstruents, but cannot be differentiated from sonorants following the release of oral constriction. These findings are taken as support for a model on which F0 perturbations are fundamentally the result of laryngeal maneuvers initiated to sustain or inhibit phonation, regardless of other language-particular aspects of phonetic realization.

Journal ArticleDOI
TL;DR: Good agreement between model predictions and experiment, encouraging from the perspective of low-cost prediction strategies, demonstrates that the models comprise the essential sound generation mechanisms.
Abstract: Installed jet noise is studied by means of a simplified configuration comprising a flat plate in the vicinity of a round jet. The effects of Mach number, jet-plate radial distance, and trailing-edge sweep angle are explored. Acoustic measurements are performed using a traversable 18-microphone azimuthal array, providing pressure data at 360 points on a cylindrical surface surrounding the jet-plate system. Key observations include a decrease, with increasing Mach number, of the relative level of the scattered field in comparison to the uninstalled jet; an exponential dependence of the scattered sound pressure level on the radial jet-plate separation; and considerable sideline noise reductions with increasing sweep angle, with which there is an overall reduction in acoustic efficiency. The measurements are compared with results obtained using a kinematic wavepacket source model, whose radiation is computed in two ways. A TGF for a semi-infinite flat plate is used to provide a low-order approximation of the scattering effect. Use of a more computationally intensive boundary element method provides additional precision. Good agreement between model predictions and experiment, encouraging from the perspective of low-cost prediction strategies, demonstrates that the models comprise the essential sound generation mechanisms.

Journal ArticleDOI
TL;DR: The results show that the masking effects of birdsong are more significant in the road traffic noise environments with lower sound levels, or of distance from traffic, while adding birdsong can indeed increase the naturalness and pleasantness of the traffic noise environment at different distances of the receiver from a road.
Abstract: This study aims to explore how the soundscape quality of traffic noise environments can be improved by the masking effects of birdsong in terms of four soundscape characteristics, i.e., perceived loudness, naturalness, annoyance and pleasantness. Four factors that may influence the masking effects of birdsong (i.e., distance of the receiver from a sound source, loudness of masker, occurrence frequencies of masker, and visibility of sound sources) were examined by listening tests. The results show that the masking effects are more significant in the road traffic noise environments with lower sound levels (e.g., 19 m). Adding birdsong can indeed increase the naturalness and pleasantness of the traffic noise environment at different distances of the receiver from a road. Naturalness, annoyance, and pleasantness, but not perceived loudness, can be altered by increasing the birdsong loudness (i.e., from 37.5 to 52.5 dBA in this study). The pleasantness of traffic noise environments increases significantly from 2.7 to 6.7, when the occurrence of birdsong over a period of 30 s is increased from 2 to 6 times. The visibility of the sound source also influences the masking effects, but its effect is not as significant as the effects of the three other factors.

Journal ArticleDOI
TL;DR: Use of spontaneous head orientations was measured when listeners attended to long speech clips of gradually diminishing speech-to-noise ratio in a sound-deadened room, and head orientation may prove valuable for hearing-impaired listeners.
Abstract: Spatial release from masking is traditionally measured with speech in front. The effect of head-orientation with respect to the speech direction has rarely been studied. Speech-reception thresholds (SRTs) were measured for eight head orientations and four spatial configurations. Benefits of head orientation away from the speech source of up to 8 dB were measured. These correlated with predictions of a model based on better-ear listening and binaural unmasking (r = 0.96). Use of spontaneous head orientations was measured when listeners attended to long speech clips of gradually diminishing speech-to-noise ratio in a sound-deadened room. Speech was presented from the loudspeaker that initially faced the listener and noise from one of four other locations. In an undirected paradigm, listeners spontaneously turned their heads away from the speech in 56% of trials. When instructed to rotate their heads in the diminishing speech-to-noise ratio, all listeners turned away from the speech and reached head orientations associated with lower SRTs. Head orientation may prove valuable for hearing-impaired listeners.

Journal ArticleDOI
TL;DR: The speech rate and pitch characteristics of naturalistic, longitudinally recorded infant- and adult-directed speech are reported for three, genetically diverse languages and the universality of slowed speech rate, raised pitch, and expanded pitch range is discussed in light of individuals' highly variable implementation of these prosodic features in infant- directed speech.
Abstract: The speech rate and pitch (F0) characteristics of naturalistic, longitudinally recorded infant- and adult-directed speech are reported for three, genetically diverse languages. Previous research has suggested that the prosodic characteristics of infant-directed speech are slowed speech rate, raised mean pitch, and expanded pitch range relative to adult-directed speech. Sixteen mothers (5 Sri Lankan Tamil, 5 Tagalog, 6 Korean) were recorded in their homes during natural interactions with their young infants, and adults, over the course of 12 months beginning when the infant was 4 months old. Regression models indicated that the difference between infant- and adult-directed speech rates decreased across the first year of infants' development. Models of pitch revealed predicted differences between infant- and adult-directed speech but did not provide evidence for cross-linguistic or longitudinal effects within the time period investigated for the three languages. The universality of slowed speech rate, raised pitch, and expanded pitch range is discussed in light of individuals' highly variable implementation of these prosodic features in infant-directed speech.

Journal ArticleDOI
TL;DR: To formulate the transmission and reflection coefficients for SV- and P-waves, an analytical model is established using thin plate theory that couples the waveguide modes with the waves in the exterior body.
Abstract: Gradient index (GRIN), refractive, and asymmetric transmission devices for elastic waves are designed using a solid with aligned parallel gaps. The gaps are assumed to be thin so that they can be considered as parallel cracks separating elastic plate waveguides. The plates do not interact with one another directly, only at their ends where they connect to the exterior solid. To formulate the transmission and reflection coefficients for SV- and P-waves, an analytical model is established using thin plate theory that couples the waveguide modes with the waves in the exterior body. The GRIN lens is designed by varying the thickness of the plates to achieve different flexural wave speeds. The refractive effect of SV-waves is achieved by designing the slope of the edge of the plate array, and keeping the ratio between plate length and flexural wavelength fixed. The asymmetric transmission of P-waves is achieved by sending an incident P-wave at a critical angle, at which total conversion to SV-wave occurs. An array of parallel gaps perpendicular to the propagation direction of the reflected waves stop the SV-wave but let P-waves travel through. Examples of focusing, steering, and asymmetric transmission devices are discussed.

Journal ArticleDOI
TL;DR: Investigation of short-term noise annoyance reactions to wind turbines and road traffic in controlled laboratory listening tests discloses a direct link of different acoustic characteristics to annoyance, yet the generalizability to long-term exposure in the field still needs to be verified.
Abstract: Current literature suggests that wind turbine noise is more annoying than transportation noise. To date, however, it is not known which acoustic characteristics of wind turbines alone, i.e., without effect modifiers such as visibility, are associated with annoyance. The objective of this study was therefore to investigate and compare the short-term noise annoyance reactions to wind turbines and road traffic in controlled laboratory listening tests. A set of acoustic scenarios was created which, combined with the factorial design of the listening tests, allowed separating the individual associations of three acoustic characteristics with annoyance, namely, source type (wind turbine, road traffic), A-weighted sound pressure level, and amplitude modulation (without, periodic, random). Sixty participants rated their annoyance to the sounds. At the same A-weighted sound pressure level, wind turbine noise was found to be associated with higher annoyance than road traffic noise, particularly with amplitude modulation. The increased annoyance to amplitude modulation of wind turbines is not related to its periodicity, but seems to depend on the modulation frequency range. The study discloses a direct link of different acoustic characteristics to annoyance, yet the generalizability to long-term exposure in the field still needs to be verified.

Journal ArticleDOI
TL;DR: A perceptual study evaluating state-of-the-art binaural auralizations showed that the methodical calibration procedure employed in combination with attention to control factors led to ecologically/perceptually valid auralization.
Abstract: Recently, auralizations have become more prevalent in architectural acoustics and virtual reality. However, there have been few studies examining the perceptual quality achievable by room acoustic simulations and auralizations. Such studies have highlighted potential problems in creating perceptually equivalent simulations when compared to measured auralizations in terms of parameter estimation. In order to accomplish realistic auralizations, calibration of the geometrical acoustics model can be considered a necessary step. In situations where the studied space exists, well-calibrated auralizations can be employed for multiple purposes, such as multi-modal virtual reality explorations, studies of the acoustical influence of renovations, and historic research. Using this case type as a base, a perceptual study evaluating state-of-the-art binaural auralizations has been carried out. Three test sites of different complexity and acoustics were selected: the abbey church Saint-Germain-des-Pres, the cathedral Notre-Dame de Paris, and the Theâtre de l'Athenee. Models were calibrated according to omni-directional source-receiver measurements for reverberation and clarity parameters. In the subjective listening test, measured and simulated binaural auralizations were compared according to eight acoustic perceptual attributes. Results showed that the methodical calibration procedure employed in combination with attention to control factors led to ecologically/perceptually valid auralizations.

Journal ArticleDOI
TL;DR: Good agreement between experiments and FE predictions using the mixed-mode model for an approximation of the impact damage was found, and a significant influence of the delamination width due to sideways reflection of the guided waves within the delaminations area was found.
Abstract: Carbon fiber laminate composites are increasingly employed for aerospace structures as they offer advantages, such as a good strength to weight ratio. However, impact during the operation and servicing of the aircraft can lead to barely visible and difficult to detect damage. Depending on the severity of the impact, fiber and matrix breakage or delaminations can occur, reducing the load carrying capacity of the structure. Efficient nondestructive testing and structural health monitoring of composite panels can be achieved using guided ultrasonic waves propagating along the structure. The scattering of the A0 Lamb wave mode at delaminations was investigated using a full three-dimensional (3D) finite element (FE) analysis. The influence of the delamination geometry (size and depth) was systematically evaluated. In addition to the depth dependency, a significant influence of the delamination width due to sideways reflection of the guided waves within the delamination area was found. Mixed-mode defects were simulated using a combined model of delamination with localized material degradation. The guided wave scattering at cross-ply composite plates with impact damage was measured experimentally using a non-contact laser interferometer. Good agreement between experiments and FE predictions using the mixed-mode model for an approximation of the impact damage was found.

Journal ArticleDOI
TL;DR: Listeners may possess specific abilities to overcome particular types of acoustic-phonetic deviation, and vocabulary size offers performance benefits for intelligibility of speech that deviates from one's typical dialect norms.
Abstract: Speech perception abilities vary substantially across listeners, particularly in adverse conditions including those stemming from environmental degradation (e.g., noise) or from talker-related challenges (e.g., nonnative or disordered speech). This study examined adult listeners' recognition of words in phrases produced by six talkers representing three speech varieties: a nonnative accent (Spanish-accented English), a regional dialect (Irish English), and a disordered variety (ataxic dysarthria). Semantically anomalous phrases from these talkers were presented in a transcription task and intelligibility scores, percent words correct, were compared across the three speech varieties. Three cognitive-linguistic areas-receptive vocabulary, cognitive flexibility, and inhibitory control of attention-were assessed as possible predictors of individual word recognition performance. Intelligibility scores for the Spanish accent were significantly correlated with scores for the Irish English and ataxic dysarthria. Scores for the Irish English and dysarthric speech, in contrast, were not correlated. Furthermore, receptive vocabulary was the only cognitive-linguistic assessment that significantly predicted intelligibility scores. These results suggest that, rather than a global skill of perceiving speech that deviates from native dialect norms, listeners may possess specific abilities to overcome particular types of acoustic-phonetic deviation. Furthermore, vocabulary size offers performance benefits for intelligibility of speech that deviates from one's typical dialect norms.

Journal ArticleDOI
TL;DR: Data from the Comprehensive Nuclear-Test Ban Treaty Organization International Monitoring System was used to examine the rate and magnitude of change in low frequency sound (5-115 Hz) over the past decade in the South Atlantic and Equatorial Pacific Oceans.
Abstract: Low frequency sound has increased in the Northeast Pacific Ocean over the past 60 yr [Ross (1993) Acoust. Bull. 18, 5–8; (2005) IEEE J. Ocean. Eng. 30, 257–261; Andrew, Howe, Mercer, and Dzieciuch (2002) J. Acoust. Soc. Am. 129, 642–651; McDonald, Hildebrand, and Wiggins (2006) J. Acoust. Soc. Am. 120, 711–717; Chapman and Price (2011) J. Acoust. Soc. Am. 129, EL161–EL165] and in the Indian Ocean over the past decade, [Miksis-Olds, Bradley, and Niu (2013) J. Acoust. Soc. Am. 134, 3464–3475]. More recently, Andrew, Howe, and Mercer's [(2011) J. Acoust. Soc. Am. 129, 642–651] observations in the Northeast Pacific show a level or slightly decreasing trend in low frequency noise. It remains unclear what the low frequency trends are in other regions of the world. In this work, data from the Comprehensive Nuclear-Test Ban Treaty Organization International Monitoring System was used to examine the rate and magnitude of change in low frequency sound (5–115 Hz) over the past decade in the South Atlantic and Equato...