Home
/
Authors
/
S. Boll

Author

S. Boll

Bio: S. Boll is an academic researcher from University of Utah. The author has contributed to research in topics: Noise & Noise measurement. The author has an hindex of 4, co-authored 4 publications receiving 4721 citations.

Topics: Noise, Noise measurement, Critical band, Background noise, Fourier transform ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Suppression of acoustic noise in speech using spectral subtraction

[...]

S. Boll¹•Institutions (1)

University of Utah¹

01 Apr 1979-IEEE Transactions on Acoustics, Speech, and Signal Processing

TL;DR: A stand-alone noise suppression algorithm that resynthesizes a speech waveform and can be used as a pre-processor to narrow-band voice communications systems, speech recognition systems, or speaker authentication systems.

...read moreread less

Abstract: A stand-alone noise suppression algorithm is presented for reducing the spectral effects of acoustically added noise in speech. Effective performance of digital speech processors operating in practical environments may require suppression of noise from the digital wave-form. Spectral subtraction offers a computationally efficient, processor-independent approach to effective digital speech analysis. The method, requiring about the same computation as high-speed convolution, suppresses stationary noise from speech by subtracting the spectral noise bias calculated during nonspeech activity. Secondary procedures are then applied to attenuate the residual noise left after subtraction. Since the algorithm resynthesizes a speech waveform, it can be used as a pre-processor to narrow-band voice communications systems, speech recognition systems, or speaker authentication systems.

...read moreread less

4,862 citations

Journal Article•DOI•

Suppression of acoustic noise in speech using two microphone adaptive noise cancellation

[...]

S. Boll¹, D. Pulsipher²•Institutions (2)

University of Utah¹, Sandia National Laboratories²

01 Dec 1980-IEEE Transactions on Acoustics, Speech, and Signal Processing

TL;DR: Two approaches to adaptive noise cancellation are compared to reduce ambient noise power by at least 20 dB with minimal speech distortion and thus to be potentially powerful as noise suppression preprocessors for voice communication in severe noise environments.

...read moreread less

Abstract: Acoustic noise with energy greater or equal to the speech can be suppressed by adaptively filtering a separately recorded correlated version of the noise signal and subtracting it from the speech waveform. It is shown that for this application of adaptive noise cancellation, large filter lengths are required to account for a highly reverberant recording environment and that there is a direct relation between filter misadjustment and induced echo in the output speech. The second reference noise signal is adaptively filtered using the least mean squares, LMS, and the lattice gradient algorithms. These two approaches are compared in terms of degree of noise power reduction, algorithm convergence time, and degree of speech enhancement. Both methods were shown to reduce ambient noise power by at least 20 dB with minimal speech distortion and thus to be potentially powerful as noise suppression preprocessors for voice communication in severe noise environments.

...read moreread less

151 citations

Journal Article•DOI•

Critical band analysis-synthesis

[...]

T. Petersen¹, S. Boll²•Institutions (2)

New York Institute of Technology¹, University of Utah²

01 Jun 1983-IEEE Transactions on Acoustics, Speech, and Signal Processing

TL;DR: A parameterized family of constant-Q analysis-synthesis transform pairs is developed from a property of homogeneous functions that allows for a wide choice of selections for center frequencies, bandwidths, and filter shapes.

...read moreread less

Abstract: The formal derivation of a transformation which models the frequency selective properties (critical bandwidths) of the auditory system is developed. A parameterized family of constant-Q analysis-synthesis transform pairs is developed from a property of homogeneous functions. This formulation allows for a wide choice of selections for center frequencies, bandwidths, and filter shapes. A particular member of the transform family is implemented to model the frequency selective properties of the peripheral auditory system. With this transform, short-time spectral analysis using critical band filter shapes can be implemented. In the absence of spectral modification, the analysis-synthesis transform can be made arbitrarily close to an identity system. This new approach to analysis-synthesis provides the necessary mathematical support needed to design and optimize both constant-Q and critical band analysis-synthesis transforms.

...read moreread less

15 citations

Proceedings Article•DOI•

Critical band analysis-synthesis

[...]

T. Petersen¹, S. Boll²•Institutions (2)

New York Institute of Technology¹, University of Utah²

01 Apr 1981

TL;DR: A parameterized family of analysis-synthesis transform pairs which behave as identities in the absence of perceptual modification is developed from a property of homogeneous functions to facilitate a flexible choice of analysis frequencies and frequency selective response characteristics.

...read moreread less

Abstract: The formal derivation of an integral transformation which can simulate certain frequency selective (critical bandwidth) properties of the auditory system is given. A parameterized family of analysis-synthesis transform pairs which behave as identities in the absence of perceptual modification is developed from a property of homogeneous functions. The formulation facilitates a flexible choice of analysis frequencies and frequency selective response characteristics. A particular member of the transform famiy is then implemented to simulate frequency selective properties of the peripheral auditory system.

...read moreread less

9 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator

[...]

Yariv Ephraim¹, David Malah²•Institutions (2)

Stanford University¹, Technion – Israel Institute of Technology²

01 Dec 1984-IEEE Transactions on Acoustics, Speech, and Signal Processing

TL;DR: In this article, a system which utilizes a minimum mean square error (MMSE) estimator is proposed and then compared with other widely used systems which are based on Wiener filtering and the "spectral subtraction" algorithm.

...read moreread less

Abstract: This paper focuses on the class of speech enhancement systems which capitalize on the major importance of the short-time spectral amplitude (STSA) of the speech signal in its perception. A system which utilizes a minimum mean-square error (MMSE) STSA estimator is proposed and then compared with other widely used systems which are based on Wiener filtering and the "spectral subtraction" algorithm. In this paper we derive the MMSE STSA estimator, based on modeling speech and noise spectral components as statistically independent Gaussian random variables. We analyze the performance of the proposed STSA estimator and compare it with a STSA estimator derived from the Wiener estimator. We also examine the MMSE STSA estimator under uncertainty of signal presence in the noisy observations. In constructing the enhanced signal, the MMSE STSA estimator is combined with the complex exponential of the noisy phase. It is shown here that the latter is the MMSE estimator of the complex exponential of the original phase, which does not affect the STSA estimation. The proposed approach results in a significant reduction of the noise, and provides enhanced speech with colorless residual noise. The complexity of the proposed algorithm is approximately that of other systems in the discussed class.

...read moreread less

3,905 citations

Artificial neural networks

[...]

Andrea Roli

09 Mar 2012

TL;DR: Artificial neural networks (ANNs) constitute a class of flexible nonlinear models designed to mimic biological neural systems as mentioned in this paper, and they have been widely used in computer vision applications.

...read moreread less

Abstract: Artificial neural networks (ANNs) constitute a class of flexible nonlinear models designed to mimic biological neural systems. In this entry, we introduce ANN using familiar econometric terminology and provide an overview of ANN modeling approach and its implementation methods. † Correspondence: Chung-Ming Kuan, Institute of Economics, Academia Sinica, 128 Academia Road, Sec. 2, Taipei 115, Taiwan; ckuan@econ.sinica.edu.tw. †† I would like to express my sincere gratitude to the editor, Professor Steven Durlauf, for his patience and constructive comments on early drafts of this entry. I also thank Shih-Hsun Hsu and Yu-Lieh Huang for very helpful suggestions. The remaining errors are all mine.

...read moreread less

2,069 citations

Journal Article•DOI•

RASTA processing of speech

[...]

Hynek Hermansky¹, Nelson Morgan²•Institutions (2)

Oregon Health & Science University¹, International Computer Science Institute²

01 Oct 1994-IEEE Transactions on Speech and Audio Processing

TL;DR: The theoretical and experimental foundations of the RASTA method are reviewed, the relationship with human auditory perception is discussed, the original method is extended to combinations of additive noise and convolutional noise, and an application is shown to speech enhancement.

...read moreread less

Abstract: Performance of even the best current stochastic recognizers severely degrades in an unexpected communications environment. In some cases, the environmental effect can be modeled by a set of simple transformations and, in particular, by convolution with an environmental impulse response and the addition of some environmental noise. Often, the temporal properties of these environmental effects are quite different from the temporal properties of speech. We have been experimenting with filtering approaches that attempt to exploit these differences to produce robust representations for speech recognition and enhancement and have called this class of representations relative spectra (RASTA). In this paper, we review the theoretical and experimental foundations of the method, discuss the relationship with human auditory perception, and extend the original method to combinations of additive noise and convolutional noise. We discuss the relationship between RASTA features and the nature of the recognition models that are required and the relationship of these features to delta features and to cepstral mean subtraction. Finally, we show an application of the RASTA technique to speech enhancement. >

...read moreread less

2,002 citations

Proceedings Article•DOI•

Enhancement of speech corrupted by acoustic noise

[...]

M. Berouti¹, Richard Schwartz¹, John Makhoul¹•Institutions (1)

BBN Technologies¹

02 Apr 1979

TL;DR: This paper describes a method for enhancing speech corrupted by broadband noise based on the spectral noise subtraction method, which can automatically adapt to a wide range of signal-to-noise ratios, as long as a reasonable estimate of the noise spectrum can be obtained.

...read moreread less

Abstract: This paper describes a method for enhancing speech corrupted by broadband noise. The method is based on the spectral noise subtraction method. The original method entails subtracting an estimate of the noise power spectrum from the speech power spectrum, setting negative differences to zero, recombining the new power spectrum with the original phase, and then reconstructing the time waveform. While this method reduces the broadband noise, it also usually introduces an annoying "musical noise". We have devised a method that eliminates this "musical noise" while further reducing the background noise. The method consists in subtracting an overestimate of the noise power spectrum, and preventing the resultant spectral components from going below a preset minimum level (spectral floor). The method can automatically adapt to a wide range of signal-to-noise ratios, as long as a reasonable estimate of the noise spectrum can be obtained. Extensive listening tests were performed to determine the quality and intelligibility of speech enhanced by our method. Listeners unanimously preferred the quality of the processed speech. Also, for an input signal-to-noise ratio of 5 dB, there was no loss of intelligibility associated with the enhancement technique.

...read moreread less

1,352 citations

Journal Article•DOI•

Fast Fourier Transform

[...]

Alan R. Jones¹•Institutions (1)

IBM¹

01 Mar 1970-Sigplan Notices

1,349 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse