Home
/
Authors
/
K. Jarvinen

Author

K. Jarvinen

Bio: K. Jarvinen is an academic researcher from Université de Sherbrooke. The author has contributed to research in topics: Codec & Adaptive Multi-Rate audio codec. The author has an hindex of 2, co-authored 2 publications receiving 307 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The adaptive multirate wideband speech codec (AMR-WB)

[...]

B. Bessette¹, R. Salami¹, Roch Lefebvre¹, Milan Jelinek¹, Jani Rotola-Pukkila², Janne Vainio², H. Mikkola², K. Jarvinen¹ - Show less +4 more•Institutions (2)

Université de Sherbrooke¹, Nokia²

01 Nov 2002-IEEE Transactions on Speech and Audio Processing

TL;DR: In this paper, the adaptive multirate wideband (AMR-WB) speech codec was selected by the Third Generation Partnership Project (3GPP) for GSM and the third generation mobile communication WCDMA system for providing wideband speech services.

...read moreread less

Abstract: This paper describes the adaptive multirate wideband (AMR-WB) speech codec selected by the Third Generation Partnership Project (3GPP) for GSM and the third generation mobile communication WCDMA system for providing wideband speech services. The AMR-WB speech codec algorithm was selected in December 2000 and the corresponding specifications were approved in March 2001. The AMR-WB codec was also selected by the International Telecommunication Union-Telecommunication Sector (ITU-T) in July 2001 in the standardization activity for wideband speech coding around 16 kb/s and was approved in January 2002 as Recommendation G.722.2. The adoption of AMR-WB by ITU-T is of significant importance since for the first time the same codec is adopted for wireless as well as wireline services. AMR-WB uses an extended audio bandwidth from 50 Hz to 7 kHz and gives superior speech quality and voice naturalness compared to existing second- and third-generation mobile communication systems. The wideband speech service provided by the AMR-WB codec will give mobile communication speech quality that also substantially exceeds (narrowband) wireline quality. The paper details AMR-WB standardization history, algorithmic description including novel techniques for efficient ACELP wideband speech coding and subjective quality performance of the codec.

...read moreread less

312 citations

Proceedings Article•DOI•

The adaptive multi-rate wideband codec: history and performance

[...]

R. Salami, B. Bessette, R. Lefebvre, M. Jelinek, Jani Rotola-Pukkila, Janne Vainio, H. Mikkola, K. Jarvinen - Show less +4 more

06 Oct 2002

TL;DR: The history and performance of the adaptive multi-rate wideband (AMR-WB) speech codec recently selected by the Third Generation Partnership Project (3GPP) for GSM and the third generation mobile communication WCDMA system for providing wideband speech services is given.

...read moreread less

Abstract: This paper gives the history and performance of the adaptive multi-rate wideband (AMR-WB) speech codec recently selected by the Third Generation Partnership Project (3GPP) for GSM and the third generation mobile communication WCDMA system for providing wideband speech services. The AMR-WB speech codec algorithm was selected in December 2000, and the corresponding specifications were approved in March 2001. In July 2001, the AMR-WB codec was also selected by ITU-T in the standardization activity for wideband speech coding around 16 kbit/s. The adoption of AMR-WB by ITU-T is of significant importance since for the first time the same codec is adopted for wireless as well as wireline services. AMR-WB uses an extended audio bandwidth from 3.4 kHz to 7 kHz and gives superior speech quality and voice naturalness compared to 2/sup nd/ and 3/sup rd/ generation mobile communication systems.

...read moreread less

14 citations

Cited by

PDF

Open Access

More filters

Patent•DOI•

Methods and devices for low-frequency emphasis during audio compression based on acelp/tcx

[...]

Bruno Bessette

18 Feb 2005-Journal of the Acoustical Society of America

TL;DR: In this paper, a method for low-frequency emphasizing the spectrum of a sound signal transformed in a frequency domain and comprising transform coefficients grouped in a number of blocks, in which a maximum energy for one block is calculated and a position index of the block with maximum energy is determined, a factor is calculated for each block having a position Index smaller than the position Index of the Block with maximum Energy, and for each blocks a gain is determined from the factor and is applied to the transform coefficients of the blocks.

...read moreread less

Abstract: An aspect of the present invention relates to a method for low-frequency emphasizing the spectrum of a sound signal transformed in a frequency domain and comprising transform coefficients grouped in a number of blocks, in which a maximum energy for one block is calculated and a position index of the block with maximum energy is determined, a factor is calculated for each block having a position index smaller than the position index of the block with maximum energy, and for each block a gain is determined from the factor and is applied to the transform coefficients of the block.

...read moreread less

243 citations

Monograph•DOI•

Near-Capacity Multi-Functional MIMO Systems

[...]

Lajos Hanzo, Osamah Alamri, Mohammed El-Hajjar, Nan Wu

22 May 2009

206 citations

Journal Article•DOI•

ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge

[...]

Zhizheng Wu¹, Junichi Yamagishi¹, Tomi Kinnunen², Cemal Hanilci², Mohammed Sahidullah², Aleksandr Sizov², Nicholas Evans³, Massimiliano Todisco³ - Show less +4 more•Institutions (3)

University of Edinburgh¹, University of Eastern Finland², Institut Eurécom³

17 Feb 2017-IEEE Journal of Selected Topics in Signal Processing

TL;DR: A review of postevaluation studies conducted using the same dataset illustrates the rapid progress stemming from ASVspoof and outlines the need for further investigation.

...read moreread less

Abstract: Concerns regarding the vulnerability of automatic speaker verification (ASV) technology against spoofing can undermine confidence in its reliability and form a barrier to exploitation. The absence of competitive evaluations and the lack of common datasets has hampered progress in developing effective spoofing countermeasures. This paper describes the ASV Spoofing and Countermeasures (ASVspoof) initiative, which aims to fill this void. Through the provision of a common dataset, protocols, and metrics, ASVspoof promotes a sound research methodology and fosters technological progress. This paper also describes the ASVspoof 2015 dataset, evaluation, and results with detailed analyses. A review of postevaluation studies conducted using the same dataset illustrates the rapid progress stemming from ASVspoof and outlines the need for further investigation. Priority future research directions are presented in the scope of the next ASVspoof evaluation planned for 2017.

...read moreread less

177 citations

Proceedings Article•DOI•

Unified speech and audio coding scheme for high quality at low bitrates

[...]

Max Neuendorf, Philippe Gournay¹, Markus Multrus, Jeremie Lecomte, B. Bessette¹, R. Geiger, Stefan Bayer, Guillaume Fuchs, Johannes Hilpert, Nikolaus Rettelbach, Redwan Salami, Gerald Schuller, Roch Lefebvre¹, Grill Bernhard - Show less +10 more•Institutions (1)

Université de Sherbrooke¹

19 Apr 2009

TL;DR: This new codec forms the basis of the reference model in the ongoing MPEG standardization activity for Unified Speech and Audio Coding, which results in a codec that exhibits consistently high quality for speech, music and mixed audio content.

...read moreread less

Abstract: Traditionally, speech coding and audio coding were separate worlds. Based on different technical approaches and different assumptions about the source signal, neither of the two coding schemes could efficiently represent both speech and music at low bitrates. This paper presents a unified speech and audio codec, which efficiently combines techniques from both worlds. This results in a codec that exhibits consistently high quality for speech, music and mixed audio content. The paper gives an overview of the codec architecture and presents results of formal listening tests comparing this new codec with HE-AAC(v2) and AMR-WB+. This new codec forms the basis of the reference model in the ongoing MPEG standardization activity for Unified Speech and Audio Coding.

...read moreread less

108 citations

Proceedings Article•DOI•

A harmonic bandwidth extension method for audio codecs

[...]

Frederik Nagel¹, Sascha Disch²•Institutions (2)

Fraunhofer Society¹, Leibniz University of Hanover²

19 Apr 2009

TL;DR: This paper exposes the origin of the roughness and proposes a bandwidth extension method, which does not introduce roughness into the reconstructed audio signal, and demonstrates the advantage of the proposed method compared to a standard bandwidth extension.

...read moreread less

Abstract: Today's efficient audio codecs for low bitrate application scenarios often rely on parametric coding of the upper frequency band portion of a signal while the lower frequency band portion of the same is conveyed by a waveform preserving coding method. At the decoder, the upper frequency signal is approximated from the lower frequency data using the upper frequency band parameters. However, commonly used methods of bandwidth extension almost inevitably suffer from a sensation of unpleasant roughness, which is especially present for tonal music items. In this paper we expose the origin of the roughness and propose a bandwidth extension method, which does not introduce roughness into the reconstructed audio signal. A listening test demonstrates the advantage of the proposed method compared to a standard bandwidth extension.

...read moreread less

106 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65

Collapse