Author

Ville Pulkki

Other affiliations: Technical University of Denmark, Helsinki University of Technology

Bio: Ville Pulkki is an academic researcher at Aalto University who has contributed to research on audio signals and loudspeakers. The author has an h-index of 32 and has co-authored 247 publications that have received 4,923 citations. Previous affiliations include the Technical University of Denmark and Helsinki University of Technology.

Papers published on a yearly basis (chart covering 1995 to 2023; per-year counts not recoverable)

Papers

Journal Article•

Virtual Sound Source Positioning Using Vector Base Amplitude Panning

Ville Pulkki

01 Jun 1997-Journal of The Audio Engineering Society

TL;DR: A vector-based reformulation of amplitude panning is derived, leading to simple and computationally efficient equations for virtual sound source positioning. With this method, two- or three-dimensional sound fields can be created with any number of arbitrarily placed loudspeakers.

Abstract: A vector-based reformulation of amplitude panning is derived, which leads to simple and computationally efficient equations for virtual sound source positioning. Using the method, vector base amplitude panning (VBAP), it is possible to create two- or three-dimensional sound fields where any number of loudspeakers can be placed arbitrarily. The method produces virtual sound sources that are as sharp as is possible with current loudspeaker configuration and amplitude panning methods. A digital tool that implements two- and three-dimensional VBAP with eight inputs and outputs has been realized.
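
The abstract does not reproduce the panning equations, so the following is only a minimal sketch of the two-dimensional case, assuming the commonly cited formulation in which the source direction is written as a weighted sum of the two loudspeaker direction vectors and the weights are then power-normalized. The function name `vbap_2d_gains` and its degree-based arguments are hypothetical.

```python
import math

def vbap_2d_gains(source_deg, spk1_deg, spk2_deg):
    """Return power-normalized gains (g1, g2) for one loudspeaker pair."""
    # Unit vectors pointing at the virtual source and the two loudspeakers.
    p = (math.cos(math.radians(source_deg)), math.sin(math.radians(source_deg)))
    l1 = (math.cos(math.radians(spk1_deg)), math.sin(math.radians(spk1_deg)))
    l2 = (math.cos(math.radians(spk2_deg)), math.sin(math.radians(spk2_deg)))

    # Solve p = g1*l1 + g2*l2 with Cramer's rule (2x2 linear system).
    det = l1[0] * l2[1] - l1[1] * l2[0]
    if abs(det) < 1e-12:
        raise ValueError("loudspeaker directions must not be collinear")
    g1 = (p[0] * l2[1] - p[1] * l2[0]) / det
    g2 = (l1[0] * p[1] - l1[1] * p[0]) / det

    # Scale so that g1^2 + g2^2 = 1 (assumed constant-power normalization).
    norm = math.hypot(g1, g2)
    return g1 / norm, g2 / norm
```

For a pair at +30 and -30 degrees, a source panned to 0 degrees receives equal gains, and a source panned to +30 degrees is fed to the first loudspeaker only. A negative gain indicates that the source lies outside the arc spanned by the pair, in which case another pair would normally be chosen.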

933 citations

Journal Article•

Spatial Sound Reproduction with Directional Audio Coding

Ville Pulkki

15 Jun 2007-Journal of The Audio Engineering Society

TL;DR: Directional audio coding (DirAC) is a method for spatial sound representation, applicable to different sound reproduction systems. In the analysis part, the diffuseness and direction of arrival of sound are estimated in a single location depending on time and frequency.

Abstract: Directional audio coding (DirAC) is a method for spatial sound representation, applicable to different sound reproduction systems. In the analysis part, the diffuseness and direction of arrival of sound are estimated in a single location depending on time and frequency. In the synthesis part, microphone signals are first divided into nondiffuse and diffuse parts, which are then reproduced using different strategies. DirAC is developed from an existing technology for impulse response reproduction, spatial impulse response rendering (SIRR), and implementations of DirAC for different applications are described.
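
The analysis step described in the abstract can be illustrated with a rough sketch. It is not the paper's estimator: it assumes horizontal B-format input with the traditional scaling (W = s/√2, X = s·cos(az), Y = s·sin(az) for a plane wave), works on one broadband frame of real samples instead of separate frequency bands, and uses an intensity-to-energy ratio as the diffuseness measure. The name `analyze_frame` is hypothetical.

```python
import math

def analyze_frame(w, x, y):
    """Estimate (azimuth_deg, diffuseness) from one frame of horizontal B-format.

    Assumed convention: W = s / sqrt(2), X = s * cos(az), Y = s * sin(az)
    for a plane wave s arriving from azimuth az.
    """
    n = len(w)
    # Time-averaged products of the omnidirectional and dipole channels;
    # under the assumed encoding this vector points toward the source.
    ix = sum(wi * xi for wi, xi in zip(w, x)) / n
    iy = sum(wi * yi for wi, yi in zip(w, y)) / n
    # Time-averaged energy term under the same scaling.
    energy = sum(wi * wi + 0.5 * (xi * xi + yi * yi)
                 for wi, xi, yi in zip(w, x, y)) / n
    if energy == 0.0:
        return 0.0, 1.0  # silent frame: direction undefined, treat as diffuse
    azimuth = math.degrees(math.atan2(iy, ix))
    diffuseness = 1.0 - math.sqrt(2.0) * math.hypot(ix, iy) / energy
    # Clamp to [0, 1] to absorb rounding error.
    return azimuth, min(max(diffuseness, 0.0), 1.0)
```

Under these assumptions a single plane wave yields a diffuseness near 0 and its own azimuth, while two uncorrelated, equally loud sources from opposite directions yield a diffuseness near 1.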

408 citations

Spatial sound generation and perception by amplitude panning techniques

Ville Pulkki¹

Helsinki University of Technology¹

03 Aug 2001

TL;DR: This thesis introduces the vector base amplitude panning (VBAP) method for positioning virtual sources in arbitrary 2-D or 3-D loudspeaker setups. In 2-D setups it is a reformulation of the existing pair-wise panning method, and in 3-D setups it generalizes to triplet-wise panning.

Abstract: Spatial audio aims to recreate or synthesize spatial attributes when reproducing audio over loudspeakers or headphones. Such spatial attributes include, for example, locations of perceived sound sources and an auditory sense of space. This thesis focuses on new methods of spatial audio for loudspeaker listening and on measuring the quality of spatial audio by subjective and objective tests. In this thesis the vector base amplitude panning (VBAP) method, which is an amplitude panning method to position virtual sources in arbitrary 2-D or 3-D loudspeaker setups, is introduced. In amplitude panning the same sound signal is applied to a number of loudspeakers with appropriate non-zero amplitudes. With 2-D setups VBAP is a reformulation of the existing pair-wise panning method. However, differing from earlier solutions it can be generalized for 3-D loudspeaker setups as a triplet-wise panning method. A sound signal is then applied to one, two, or three loudspeakers simultaneously. VBAP has certain advantages compared to earlier virtual source positioning methods in arbitrary layouts. Previous methods either used all loudspeakers to produce virtual sources, which results in some artefacts, or they used loudspeaker triplets with a non-generalizable 2-D user interface. The virtual sources generated with VBAP are investigated. The human directional hearing is simulated with a binaural auditory model adapted from the literature. The interaural time difference (ITD) cue and the interaural level difference (ILD) cue which are the main localization cues are simulated for amplitude-panned virtual sources and for real sources. Psychoacoustic listening tests are conducted to study the subjective quality of virtual sources. Statistically significant phenomena found in listening test data are explained by auditory model simulation results. 
To obtain a generic view of directional quality in arbitrary loudspeaker setups, directional cues are simulated for virtual sources with loudspeaker pairs and triplets in various setups. The directional qualities of virtual sources generated with VBAP can be stated as follows. Directional coordinates used for this purpose are the angle between a position vector and the median plane (θcc), and the angle between a projection of a position vector to the median plane and frontal direction (φcc). The perceived θcc direction of a virtual source coincides well with the VBAP panning direction when a loudspeaker set is near the median plane. When the loudspeaker set is moved towards a side of a listener, the perceived θcc direction is biased towards the median plane. The perceived φcc direction of an amplitude-panned virtual source is individual and cannot be predicted with any panning law.
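
The two directional coordinates defined in the abstract can be computed from a Cartesian position vector as in the sketch below. The axis convention (x to the front, y to the left, z up, so that the median plane is the x-z plane) and the function name `lateral_and_median_angles` are assumptions made for illustration and are not stated in the abstract.

```python
import math

def lateral_and_median_angles(x, y, z):
    """Return (lateral_deg, median_deg) for a position vector.

    Assumed axes: x points to the front, y to the left, z up,
    so the median plane is the x-z plane.
    """
    r = math.sqrt(x * x + y * y + z * z)
    if r == 0.0:
        raise ValueError("position vector must be non-zero")
    # Angle between the position vector and the median plane.
    ratio = max(-1.0, min(1.0, y / r))
    lateral = math.degrees(math.asin(ratio))
    # Angle between the projection onto the median plane and the front.
    median = math.degrees(math.atan2(z, x))
    return lateral, median
```

With this convention a source straight ahead maps to (0, 0), a source directly to the left has a lateral angle of 90 degrees, and a source directly behind the listener has a lateral angle of 0 and a median-plane angle of 180 degrees.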

179 citations

Journal Article•

Spatial Impulse Response Rendering I: Analysis and Synthesis

Juha Merimaa, Ville Pulkki

15 Dec 2005-Journal of The Audio Engineering Society

TL;DR: Spatial impulse response rendering (SIRR) analyzes the time-dependent direction of arrival and diffuseness of measured room responses within frequency bands to synthesize a multichannel response suitable for reproduction with any chosen surround loudspeaker setup.

Abstract: Spatial impulse response rendering (SIRR) is a recent technique for the reproduction of room acoustics with a multichannel loudspeaker system. SIRR analyzes the time-dependent direction of arrival and diffuseness of measured room responses within frequency bands. Based on the analysis data, a multichannel response suitable for reproduction with any chosen surround loudspeaker setup is synthesized. When loaded to a convolving reverberator, the synthesized responses create a very natural perception of space corresponding to the measured room. A technical description of the analysis-synthesis method is provided. Results of formal subjective evaluation and further analysis of SIRR are presented in a companion paper to be published in JAES in 2006 Jan./Feb.

166 citations

Contextual Relations of Words in Grimm Tales, Analyzed by Self-Organizing Map

Timo Honkela, Ville Pulkki, Teuvo Kohonen

01 Jan 1995

TL;DR: In the experiments reported in this work the source data consisted of the raw text of Grimm fairy tales without any prior syntactic or semantic categorization of the words, and the algorithm was able to create diagrams that seem to comply reasonably well with the traditional syntactical categorizations and human intuition about the semantics.

Abstract: Semantic roles of words in natural languages are reflected by the contexts in which they occur. These roles can explicitly be visualized by the Self-Organizing Map (SOM). In the experiments reported in this work the source data consisted of the raw text of Grimm fairy tales without any prior syntactic or semantic categorization of the words. The algorithm was able to create diagrams that seem to comply reasonably well with the traditional syntactical categorizations and human intuition about the semantics of the words. It has earlier been shown that the Self-Organizing Map (SOM) can be applied to the visualization of contextual roles of words, i.e., similarities in their usage in short contexts formed of adjacent words [4]. This paper demonstrates that such relations or roles are also statistically reflected in unrestricted, even quaint natural expressions. The source material chosen for this experiment consisted of 200 Grimm tales (English translation). In most practical applications of the SOM, the input to the map algorithm is derived from some measurements, usually after their preprocessing. In such cases, the input vectors are supposed to have metric relations. Interpretation of languages, on the contrary, must be based on the processing of sequences of discrete symbols. If the words were encoded numerically, the ordered sets formed of them could also be compared mutually as well as with reference expressions. However, as no numerical value of the code should imply any order to the words themselves, it will be necessary to use uncorrelated vectors for encoding. The simplest method to introduce uncorrelated codes is to assign a unit vector for each word. When all different words in the input material are listed, a code vector can be defined to have as many components as there are words in the list. This method, however, is only practicable in very small experiments.
If the vocabulary is large as in the present experiments, we may then encode the words by quasi-orthogonal random vectors of a much smaller dimensionality [4]. To create a map of discrete symbols that occur within the sentences, each symbol must be presented in the due context. The context may consist of the immediate surroundings of the word in the text. Application of the self-organizing maps to natural language processing has been described earlier in, e.g., [2], [3], [4], [5], and [6].
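
The encoding idea described above can be sketched as follows. This is an illustrative simplification, not the authors' procedure: each word receives a random unit vector (nearly orthogonal to the others when the dimension is large), and each word's context is summarized as the average of the concatenated codes of its immediate predecessor and successor. The dimension of 64, the seed, and the function names are hypothetical, and the SOM training itself is omitted.

```python
import math
import random

def random_code(dim, rng):
    """Draw a random unit vector; such vectors are nearly orthogonal for large dim."""
    v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]

def context_vectors(words, dim=64, seed=0):
    """Average [code(previous), code(next)] over every occurrence of each word."""
    rng = random.Random(seed)
    codes = {w: random_code(dim, rng) for w in sorted(set(words))}
    sums, counts = {}, {}
    # Skip the first and last tokens, which lack a two-sided context.
    for i in range(1, len(words) - 1):
        ctx = codes[words[i - 1]] + codes[words[i + 1]]  # list concatenation
        acc = sums.setdefault(words[i], [0.0] * (2 * dim))
        for k, c in enumerate(ctx):
            acc[k] += c
        counts[words[i]] = counts.get(words[i], 0) + 1
    averaged = {w: [c / counts[w] for c in acc] for w, acc in sums.items()}
    return codes, averaged
```

The averaged context vectors would then serve as the kind of input a map algorithm could be trained on; words that occur in similar surroundings end up with similar vectors even though their own codes are unrelated.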

131 citations

Cited by

Proceedings Article•

Word Representations: A Simple and General Method for Semi-Supervised Learning

Joseph Turian¹, Lev-Arie Ratinov², Yoshua Bengio¹

Université de Montréal¹, University of Illinois at Urbana–Champaign²

11 Jul 2010

TL;DR: This work evaluates Brown clusters, Collobert and Weston (2008) embeddings, and HLBL (Mnih & Hinton, 2009) embeddings of words on both NER and chunking, and finds that each of the three word representations improves the accuracy of near state-of-the-art supervised baselines.

Abstract: If we take an existing supervised NLP system, a simple and general way to improve accuracy is to use unsupervised word representations as extra word features. We evaluate Brown clusters, Collobert and Weston (2008) embeddings, and HLBL (Mnih & Hinton, 2009) embeddings of words on both NER and chunking. We use near state-of-the-art supervised baselines, and find that each of the three word representations improves the accuracy of these baselines. We find further improvements by combining different word representations. You can download our word features, for off-the-shelf use in existing NLP systems, as well as our code, here: http://metaoptimize.com/projects/wordreprs/
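
The idea of adding unsupervised word representations as extra word features can be illustrated with a small sketch. It is not the authors' code: the function name `token_features`, the baseline features, the prefix lengths, and the toy cluster table are all hypothetical. It only relies on the fact that Brown clusters are hierarchical, so each word has a bit-string path whose prefixes give coarser groupings.

```python
def token_features(token, clusters, prefix_lengths=(4, 6)):
    """Return baseline features plus cluster-prefix features for one token."""
    # Hypothetical baseline features of a supervised tagger.
    feats = {
        "word=" + token.lower(): 1.0,
        "is_capitalized": 1.0 if token[:1].isupper() else 0.0,
    }
    path = clusters.get(token.lower())
    if path is not None:
        # Prefixes of the cluster bit string give coarser-to-finer groupings.
        for n in prefix_lengths:
            feats["cluster_prefix%d=%s" % (n, path[:n])] = 1.0
    return feats
```

With a toy table such as `{"paris": "110100", "london": "110111"}`, both city names share the feature `cluster_prefix4=1101`, which is how a rarely seen word can borrow evidence from a frequent one in the same cluster. Tokens missing from the table simply keep their baseline features.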

2,243 citations

Journal Article•DOI•

Sphere Packings, Lattices and Groups

Werner Fischer, Marburg

01 Feb 1990-Zeitschrift für Kristallographie

1,584 citations

Journal Article•DOI•

Digital processing of speech signals

M.G. Bellanger

01 Oct 1980

1,565 citations

Journal Article•DOI•

Words in the brain's language.

Friedemann Pulvermüller¹

University of Konstanz¹

01 Apr 1999-Behavioral and Brain Sciences

TL;DR: These results support a neurobiological model of language in the Hebbian tradition and provide evidence for processing differences between words and matched meaningless pseudowords, and between word classes, such as concrete content and abstract function words, and words evoking visual or motor associations.

Abstract: If the cortex is an associative memory, strongly connected cell assemblies will form when neurons in different cortical areas are frequently active at the same time. The cortical distributions of these assemblies must be a consequence of where in the cortex correlated neuronal activity occurred during learning. An assembly can be considered a functional unit exhibiting activity states such as full activation ("ignition") after appropriate sensory stimulation (possibly related to perception) and continuous reverberation of excitation within the assembly (a putative memory process). This has implications for cortical topographies and activity dynamics of cell assemblies forming during language acquisition, in particular for those representing words. Cortical topographies of assemblies should be related to aspects of the meaning of the words they represent, and physiological signs of cell assembly ignition should be followed by possible indicators of reverberation. The following postulates are discussed in detail: (1) assemblies representing phonological word forms are strongly lateralized and distributed over perisylvian cortices; (2) assemblies representing highly abstract words such as grammatical function words are also strongly lateralized and restricted to these perisylvian regions; (3) assemblies representing concrete content words include additional neurons in both hemispheres; (4) assemblies representing words referring to visual stimuli include neurons in visual cortices; and (5) assemblies representing words referring to actions include neurons in motor cortices. 
Two main sources of evidence are used to evaluate these proposals: (a) imaging studies focusing on localizing word processing in the brain, based on stimulus-triggered event-related potentials (ERPs), positron emission tomography (PET), and functional magnetic resonance imaging (fMRI), and (b) studies of the temporal dynamics of fast activity changes in the brain, as revealed by high-frequency responses recorded in the electroencephalogram (EEG) and magnetoencephalogram (MEG). These data provide evidence for processing differences between words and matched meaningless pseudowords, and between word classes, such as concrete content and abstract function words, and words evoking visual or motor associations. There is evidence for early word class-specific spreading of neuronal activity and for equally specific high-frequency responses occurring later. These results support a neurobiological model of language in the Hebbian tradition. Competing large-scale neuronal theories of language are discussed in light of the data summarized. Neurobiological perspectives on the problem of serial order of words in syntactic strings are considered in closing.

1,009 citations

Journal Article•

The Organization of the Cochlear Receptor

I. C. Whitfield

01 Jan 1967-Journal of Anatomy

TL;DR: Alk-3-en-1-ols are produced in good yields from isobutylene and formaldehyde in the presence of organic carboxylic acid salts of Group IB metals.

Abstract: The yield of alkenols and cycloalkenols is substantially improved by carrying out the reaction of olefins with formaldehyde in the presence of selected catalysts. In accordance with one embodiment, alk-3-en-1-ols are produced in good yields from isobutylene and formaldehyde in the presence of organic carboxylic acid salts of Group IB metals.

851 citations
