Home
/
Authors
/
Birger Kollmeier

Author

Birger Kollmeier

Other affiliations: University of Göttingen, Siemens

Bio: Birger Kollmeier is an academic researcher from University of Oldenburg. The author has contributed to research in topics: Binaural recording & Intelligibility (communication). The author has an hindex of 48, co-authored 345 publications receiving 9080 citations. Previous affiliations of Birger Kollmeier include University of Göttingen & Siemens.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1990
1989
1988
1983

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Modeling auditory processing of amplitude modulation I. Detection and masking with narrow-band carriers

[...]

Torsten Dau¹, Birger Kollmeier, AG Armin Kohlrausch•Institutions (1)

University of Oldenburg¹

01 Nov 1997-Journal of the Acoustical Society of America

TL;DR: A quantitative model for describing data from modulation-detection and modulation-masking experiments is presented, which proposes that the typical low-pass characteristic of the temporal modulation transfer function observed with wide-band noise carriers is not due to "sluggishness" in the auditory system, but can instead be understood in terms of the interaction between modulation filters and the inherent fluctuations in the carrier.

...read moreread less

Abstract: This paper presents a quantitative model for describing data from modulation-detection and modulation-masking experiments, which extends the model of the ‘‘effective’’ signal processing of the auditory system described in Dau et al. @J. Acoust. Soc. Am. 99, 3615‐3622 ~1996!#. The new element in the present model is a modulation filterbank, which exhibits two domains with different scaling. In the range 0‐10 Hz, the modulation filters have a constant bandwidth of 5 Hz. Between 10 Hz and 1000 Hz a logarithmic scaling with a constant Q value of 2 was assumed. To preclude spectral effects in temporal processing, measurements and corresponding simulations were performed with stochastic narrow-band noise carriers at a high center frequency ~5 kHz!. For conditions in which the modulation rate ( f mod) was smaller than half the bandwidth of the carrier (D f ), the model accounts for the low-pass characteristic in the threshold functions @e.g., Viemeister, J. Acoust. Soc. Am. 66, 1364‐1380 ~1979!#. In conditions with f mod.D f /2, the model can account for the high-pass characteristic in the threshold function. In a further experiment, a classical masking paradigm for investigating frequency selectivity was adopted and translated to the modulation-frequency domain. Masked thresholds for sinusoidal test modulation in the presence of a competing modulation masker were measured and simulated as a function of the test modulation rate. In all cases, the model describes the experimental data to within a few dB. It is proposed that the typical low-pass characteristic of the temporal modulation transfer function observed with wide-band noise carriers is not due to ‘‘sluggishness’’ in the auditory system, but can instead be understood in terms of the interaction between modulation filters and the inherent fluctuations in the carrier. © 1997 Acoustical Society of America.@S0001-4966~97!05611-7#

...read moreread less

580 citations

Journal Article•DOI•

Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests.

[...]

Thomas Brand¹, Birger Kollmeier•Institutions (1)

University of Oldenburg¹

04 Jun 2002-Journal of the Acoustical Society of America

TL;DR: The minimum standard deviations achievable for concurrent estimates of thresholds and psychometric function slopes as well as the optimal target values for adaptive procedures are calculated as functions of stimulus level and track length on the basis of the binomial theory.

...read moreread less

Abstract: The minimum standard deviations achievable for concurrent estimates of thresholds and psychometric function slopes as well as the optimal target values for adaptive procedures are calculated as functions of stimulus level and track length on the basis of the binomial theory. The optimum pair of targets for a concurrent estimate is found at the correct response probabilities p1 = 0.19 and p2 = 0.81 for the logistic psychometric function. An adaptive procedure that converges at these optimal targets is introduced and tested with Monte Carlo simulations. The efficiency increases rapidly when each subject's response consists of more than one statistically independent Bernoulli trial. Sentence intelligibility tests provide more than one Bernoulli trial per sentence when each word is scored separately. The number of within-sentence trials can be quantified by the j factor [Boothroyd and Nittrouer, J. Acoust. Soc. Am. 84, 101-114 (1988)]. The adaptive procedure was evaluated with 10 normal-hearing and 11 hearing-impaired listeners using two German sentence tests that differ in j factors. The expected advantage of the sentence test with the higher j factor was not observed, possibly due to training effects. Hence, the number of sentences required for a reliable speech reception threshold (approximately 1 dB standard deviation) concurrently with a slope estimate (approximately 20%-30% relative standard deviation) is at least N = 30 if word scoring for short, meaningful sentences (j approximately 2) is performed.

...read moreread less

361 citations

Journal Article•DOI•

Development and analysis of an International Speech Test Signal (ISTS)

[...]

Inga Holube, Stefan Fredelake¹, Marcel S. M. G. Vlaming², Birger Kollmeier¹•Institutions (2)

University of Oldenburg¹, VU University Amsterdam²

11 Nov 2010-International Journal of Audiology

TL;DR: The primary intention is to include this test signal with a new measurement method for a new hearing aid standard (IEC 60118-15) that is based on natural recordings but is largely non-intelligible because of segmentation and remixing.

...read moreread less

Abstract: For analysing the processing of speech by a hearing instrument, a standard test signal is necessary which allows for reproducible measurement conditions, and which features as many of the m...

...read moreread less

323 citations

Journal Article•DOI•

PEMO-Q—A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception

[...]

Rainer Huber, Birger Kollmeier¹•Institutions (1)

University of Oldenburg¹

01 Nov 2006-IEEE Transactions on Audio, Speech, and Language Processing

TL;DR: A new method for the objective assessment and prediction of perceived audio quality is introduced, based on a psychoacoustically validated, quantitative model of the "effective" peripheral auditory processing, which shows a higher prediction accuracy than the ITU-R recommendation BS.1387.

...read moreread less

Abstract: A new method for the objective assessment and prediction of perceived audio quality is introduced. It represents an expansion of the speech quality measure qC, introduced by Hansen and Kollmeier, and is based on a psychoacoustically validated, quantitative model of the "effective" peripheral auditory processing by Dau et al. To evaluate the audio quality of a given distorted signal relative to a corresponding high-quality reference signal, the auditory model is employed to compute "internal representations" of the signals, which are partly assimilated in order to account for assumed cognitive aspects. The linear cross correlation coefficient of the assimilated internal representations represents the perceptual similarity measure (PSM). PSM shows good correlations with subjective quality ratings if different types of audio signals are considered separately, whereas a better accuracy of signal-independent quality prediction is achieved by a second quality measure PSMt represented by the fifth percentile of the sequence of instantaneous audio quality PSM(t). The new measures were evaluated using a large database of subjective listening tests that were originally carried out on behalf of the International Telecommunication Union (ITU) and Moving Pictures Experts Group (MPEG) for the evaluation of various low bit-rate audio codecs. Additional tests with data unknown in the development phase of the model were carried out. Except for linear distortions, the new method shows a higher prediction accuracy than the ITU-R recommendation BS.1387 ("PEAQ") for the tested data

...read moreread less

321 citations

Journal Article•DOI•

Modeling auditory processing of amplitude modulation II. Spectral and temporal integration

[...]

Torsten Dau¹, Birger Kollmeier, AG Armin Kohlrausch•Institutions (1)

University of Oldenburg¹

01 Nov 1997-Journal of the Acoustical Society of America

TL;DR: The combination of the modulation filterbank concept and the optimal decision algorithm proposed here appears to present a powerful strategy for describing modulation-detection phenomena in narrow-band and broadband conditions.

...read moreread less

Abstract: A multi-channel model, describing the effects of spectral and temporal integration in amplitude-modulation detection for a stochastic noise carrier, is proposed and validated. The model is based on the modulation filterbank concept which was established in the accompanying paper [Dau et al., J. Acoust. Soc. Am. 102, 2892–2905 (1997)] for modulation perception in narrow-band conditions (single-channel model). To integrate information across frequency, the detection process of the model linearly combines the channel outputs. To integrate information across time, a kind of “multiple-look” strategy, is realized within the detection stage of the model. Both data from the literature and new data are used to validate the model. The model predictions agree with the results of Eddins [J. Acoust. Soc. Am. 93, 470–479 (1993)] that the “time constants” associated with the temporal modulation transfer functions (TMTF) derived for narrow-band stimuli do not vary with carrier frequency region and that they decrease monotonically with increasing stimulus bandwidth. The model is able to predict masking patterns in the modulation-frequency domain, as observed experimentally by Houtgast [J. Acoust. Soc. Am. 85, 1676–1680 (1989)]. The model also accounts for the finding by Sheft and Yost [J. Acoust. Soc. Am. 88, 796–805 (1990)] that the long “effective” integration time constants derived from the data are two orders of magnitude larger than the time constants derived from the cutoff frequency of the TMTF. Finally, the temporal-summation properties of the model allow the prediction of data in a specific temporal paradigm used earlier by Viemeister and Wakefield [J. Acoust. Soc. Am. 90, 858–865 (1991)]. The combination of the modulation filterbank concept and the optimal decision algorithm proposed here appears to present a powerful strategy for describing modulation-detection phenomena in narrow-band and broadband conditions.

...read moreread less

308 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73

Collapse

Cited by

PDF

Open Access

More filters

Development and Validation of A

[...]

Lao Juan, Kyu-Jin Yoon, Young-Bae Park, Young-Jae Park, Min-Yong Kim - Show less +1 more

01 Jan 2015

TL;DR: The results suggest that the LJQ is a reliable and valid instrument for evaluating LJ.

...read moreread less

Abstract: Objectives: Lao Juan (LJ, 劳倦) is a syndrome described in Chinese medicine (CM) that manifests with : Lao Juan (LJ, 劳倦) is a syndrome described in Chinese medicine (CM) that manifests with fatigue, fever, spontaneous sweating, indigestion, work-induced pain, weakness of the limbs, and shortness of breath. fatigue, fever, spontaneous sweating, indigestion, work-induced pain, weakness of the limbs, and shortness of breath. The present study was conducted to examine the reliability and validity of a Lao Juan Questionnaire (LJQ). The present study was conducted to examine the reliability and validity of a Lao Juan Questionnaire (LJQ). Methods: A total of 151 outpatients and 73 normal subjects were asked to complete the LJQ. Seventy-three normal subjects A total of 151 outpatients and 73 normal subjects were asked to complete the LJQ. Seventy-three normal subjects were additionally asked to complete the Chalder Fatigue Scale (CFS). Twelve clinicians determined whether the were additionally asked to complete the Chalder Fatigue Scale (CFS). Twelve clinicians determined whether the 151 outpatients exhibited LJ or not. The internal consistency and construct validity for the LJQ were estimated using 151 outpatients exhibited LJ or not. The internal consistency and construct validity for the LJQ were estimated using data from the outpatient subjects. The CFS data were used to examine the concurrent validity of the LJQ. Total LJQ data from the outpatient subjects. The CFS data were used to examine the concurrent validity of the LJQ. Total LJQ scores and the clinicians' diagnoses of the outpatients were used to perform receiver operating characteristics (ROC) scores and the clinicians' diagnoses of the outpatients were used to perform receiver operating characteristics (ROC) curve analyses and to defi ne an optimum cut-off score for the LJQ. curve analyses and to defi ne an optimum cut-off score for the LJQ. Results: The 19-item LJQ had satisfactory internal : The 19-item LJQ had satisfactory internal consistency (α=0.828) and concurrent validity, with signifi cant correlations between the LJQ and the CFS subscales. consistency (α=0.828) and concurrent validity, with signifi cant correlations between the LJQ and the CFS subscales. In the test of construct validity using principal component analysis, a total of six factors were extracted, and the overall In the test of construct validity using principal component analysis, a total of six factors were extracted, and the overall variance explained by all factors was 59.5%. In ROC curve analyses, the sensitivity, specifi city, and area under the variance explained by all factors was 59.5%. In ROC curve analyses, the sensitivity, specifi city, and area under the curve were 76.0%, 59.2%, and 0.709, respectively. The optimum cut-off score was defi ned as six points. curve were 76.0%, 59.2%, and 0.709, respectively. The optimum cut-off score was defi ned as six points. Conclusions: Our results suggest that the LJQ is a reliable and valid instrument for evaluating LJ. Our results suggest that the LJQ is a reliable and valid instrument for evaluating LJ. KEYWORDS Chinese medicine, chronic fatigue syndrome, Chinese medicine-pattern Chinese medicine, chronic fatigue syndrome, Chinese medicine-pattern

...read moreread less

3,787 citations

Journal Article•DOI•

EEG source imaging

[...]

Christoph M. Michel, Micah M. Murray, Göran Lantz, Sara L. Gonzalez, Laurent Spinelli, Rolando Grave de Peralta - Show less +2 more

01 Oct 2004-Clinical Neurophysiology

TL;DR: It is shown that modern EEG source imaging simultaneously details the temporal and spatial dimensions of brain activity, making it an important and affordable tool to study the properties of cerebral, neural networks in cognitive and clinical neurosciences.

...read moreread less

1,600 citations

文化と言語の多様性の中のCommon European Framework of Reference for Languages: Learning, teaching, assessment (CEFR)--それは基準か? (第10回明海大学大学院応用言語学研究科セミナー講演)

[...]

茂吉島

01 Mar 2008

TL;DR: It’s time to get used to the idea that there is no such thing as a “magic bullet”.

...read moreread less

Abstract: 中國科技大學通識教育中心英語文證照奬勵金實施要點中華民國 105 年 1 月 8 日通識教育委員會議通過一、中國科技大學(以下簡稱本校)為鼓勵本校學生通過具公信力機構之英語文能力測驗或取得證照,特訂定「中國科技大學通識教育中心英語文證照獎勵金實施要點」(以下簡稱本要點)。二、學生於就讀本校期間,通過歐盟共同架構(CEFR)語言能力參考指標 B1(中級)同等級英語文能力測驗以上(含)者,得依據本要點酌予獎勵。檢測項目請參閱本中心「歐洲語言學習、教學、評量共同參考架構與各英語檢測分級對照表」(參見附表);未列於標準對照表之測驗項目不給予獎助。三、凡本校學生,除應英系外,均得申請。大學部學生通過同等級以申請一次為限,在學期間得重複申請,但該次申請之級別不得低於前次。本獎勵金每學期核發乙次,每次核發全校前 10 名,各名次核發金額如附表。四、申請人應提供在學期間,申請當(學)期參加考試之證明文件及成績證明或證照,以憑辦理。五、獎勵金申請作業:請至通識教育中心網頁下載「英語文證照獎勵金申請表」(附件 1), 填妥後檢附成績單正本及影本(背面簽名並註明與正本無異)各一份、本人金融帳戶存簿(郵局或土地銀行)封面影本送至通識教育中心。通識教育中心得每學期遴選受獎代表,擇期公開頒奬,並辦理後續請款作業。六、奬勵金申請期限:通過相關證照考試半年內應提出申請,逾期視同放棄。七、本要點之獎勵金由學校開設通識教育中心專戶,一切收支專款專用;每年度如有剩餘款,則移至翌年度繼續使用。八、本要點經通識教育中心會議審查通過,陳請校長核定後公告實施,修訂時亦同。

...read moreread less

1,468 citations

Journal Article•DOI•

Ethnologue: Languages of the World

[...]

Sarah L. Nesbeitt

01 Nov 1999-Electronic Resources Review

1,364 citations

Journal Article•DOI•

A regression approach to speech enhancement based on deep neural networks

[...]

Yong Xu¹, Jun Du¹, Li-Rong Dai¹, Chin-Hui Lee²•Institutions (2)

University of Science and Technology of China¹, Georgia Institute of Technology²

01 Jan 2015-IEEE Transactions on Audio, Speech, and Language Processing

TL;DR: The proposed DNN approach can well suppress highly nonstationary noise, which is tough to handle in general, and is effective in dealing with noisy speech data recorded in real-world scenarios without the generation of the annoying musical artifact commonly observed in conventional enhancement methods.

...read moreread less

Abstract: In contrast to the conventional minimum mean square error (MMSE)-based noise reduction techniques, we propose a supervised method to enhance speech by means of finding a mapping function between noisy and clean speech signals based on deep neural networks (DNNs). In order to be able to handle a wide range of additive noises in real-world situations, a large training set that encompasses many possible combinations of speech and noise types, is first designed. A DNN architecture is then employed as a nonlinear regression function to ensure a powerful modeling capability. Several techniques have also been proposed to improve the DNN-based speech enhancement system, including global variance equalization to alleviate the over-smoothing problem of the regression model, and the dropout and noise-aware training strategies to further improve the generalization capability of DNNs to unseen noise conditions. Experimental results demonstrate that the proposed framework can achieve significant improvements in both objective and subjective measures over the conventional MMSE based technique. It is also interesting to observe that the proposed DNN approach can well suppress highly nonstationary noise, which is tough to handle in general. Furthermore, the resulting DNN model, trained with artificial synthesized data, is also effective in dealing with noisy speech data recorded in real-world scenarios without the generation of the annoying musical artifact commonly observed in conventional enhancement methods.

...read moreread less

1,250 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse