(Open Access) Auris populi: crowdsourced native transcriptions of Dutch vowels spoken by adult Spanish learners (2015) | P. Burgos

Q: What are the contributions in "Auris populi: crowdsourced native transcriptions of dutch vowels spoken by adult spanish learners" ?

In this paper the authors report on a study in which Dutch vowels produced by Spanish adult L2 learners were orthographically transcribed by Dutch lay listeners through crowdsourcing. The aim of the crowdsourcing experiment was to investigate how the auris populi, the crowd 's ear, would deal with possibly deviant L2 vowel realizations. The authors present data on the transcriptions of the non-expert listeners for all fifteen Dutch vowels. The authors discuss these results with regard to previous studies on the speech production of adult Spanish learners of Dutch and outline perspectives for future research.

Q: What are the future works mentioned in the paper "Auris populi: crowdsourced native transcriptions of dutch vowels spoken by adult spanish learners" ?

This is definitely a topic that deserves attention in future research. Four, the auris populi methodology has proven to be a practical and valuable tool for future L2 speech research. The existence of possible biases is an issue that certainly deserves further examination when dealing with crowdsourced native transcriptions ( cf. [ 4 ], [ 5 ] ), but the transcriptions the authors got seem to reflect quite accurately the phonetic variation in the stimuli. Despite the potential drawback of the auris populi methodology, the authors found crowdsourcing to be a valuable tool to collect a large amount of L2 speech transcriptions from an extensive and diverse group of native non-expert listeners.

PDF hosted at the Radboud Repository of the Radboud University

Nijmegen

The following full text is a publisher's version.

For additional information about this publication click this link.

http://hdl.handle.net/2066/145184

Please be advised that this information was generated on 2022-08-09 and may be subject to

change.

Seediscussions,stats,andauthorprofilesforthispublicationat:http://www.researchgate.net/publication/282607788

Aurispopuli:crowdsourcednative

transcriptionsofDutchvowelsspokenbyadult

Spanishlearners

CONFERENCEPAPER·SEPTEMBER2015

READS

5AUTHORS,INCLUDING:

PepiBurgos

RadboudUniversityNijmegen

6PUBLICATIONS9CITATIONS

SEEPROFILE

EricSanders

RadboudUniversityNijmegen

37PUBLICATIONS201CITATIONS

SEEPROFILE

CatiaCucchiarini

RadboudUniversityNijmegen

141PUBLICATIONS1,442CITATIONS

SEEPROFILE

HelmerStrik

RadboudUniversityNijmegen

208PUBLICATIONS1,756CITATIONS

SEEPROFILE

Availablefrom:PepiBurgos

Retrievedon:16October2015

Auris populi: crowdsourced native transcriptions of Dutch vowels spoken by

adult Spanish learners

Pepi Burgos

, Eric Sanders

, Catia Cucchiarini

, Roeland van Hout

, Helmer Strik

1 2

Center for Language Studies, Radboud University Nijmegen, The Netherlands

Center for Language and Speech Technology, Radboud University Nijmegen, The Netherlands

{j.burgos, e.sanders, c.cucchiarini, r.vanhout, h.strik}@let.ru.nl

Abstract

In this paper we report on a study in which Dutch vowels

produced by Spanish adult L2 learners were orthographically

transcribed by Dutch lay listeners through crowdsourcing. The

aim of the crowdsourcing experiment was to investigate how

the auris populi, the crowd's ear, would deal with possibly

deviant L2 vowel realizations. We present data on the

transcriptions of the non-expert listeners for all fifteen Dutch

vowels. The results of our study indicate that Dutch vowels

pronounced by Spanish learners were transcribed differently

from their canonical (target) forms by native listeners. The

listeners’ transcriptions confirm findings of previous research

based on expert annotations of Spanish learners’ vowel

realizations conducted at our lab, namely, that the five Spanish

vowels seem to function as “attractors” for the larger set of the

Dutch vowels. In general, the results are also in line with the

outcomes of acoustic measurements of the same speech

material, but there are some interesting discrepancies. We

discuss these results with regard to previous studies on the

speech production of adult Spanish learners of Dutch and

outline perspectives for future research. Finally, given our

results, we formulate some evaluative remarks on the auris

populi methodology for future L2 speech research.

Index Terms: L2 speech, orthographic transcription,

crowdsourcing

1. Introduction

Studies on second language (L2) acquisition have shown that

adult learners seldom achieve a native-like pronunciation [1],

[2]. Accented speech does not necessarily impede

communication as long as the pronunciation of the L2 learners

is intelligible and native listeners are able to understand the

intended message [3]. How can we determine whether

accented speech is intelligible? Many studies relied on

evaluations of experts. Another approach is to use native lay

listeners to judge non-native speech, sometimes even asking

them to evaluate specific phonetic contrasts. These

approaches, however relevant, cannot answer the question

what native listeners hear and perceive when they listen to

accented speech. What brings the crowd's ear, the auris populi,

when that ear has to listen to accented pronunciations of a

series of separate words, spoken by a group of L2 learners?

A self-evident manner of finding out whether a word

produced by L2 learners has been perceived or understood is

by asking native listeners to orthographically transcribe the

words uttered by L2 learners. A strong reason for doing this is

that learners do not actually communicate with a limited

number of experts, but with a various and extensive group of

native listeners. A promising way of reaching this group is by

crowdsourcing. In doing so, we will not only obtain a large

and various group of native listeners, but at the same time we

will be able to collect a variety of transcriptions on the speech

of many L2 speakers [4], [5].

The aim of the current study is to investigate how the auris

populi, the crowd's ear, would deal with possibly deviant L2

vowel realizations. The listeners' judgments revealing the

“wisdom of the crowd’s ear” [6] will help us understand which

features of the learner vowel productions may cause

confusions in Dutch lay listeners’perception.

In the remainder of this paper, we first present the research

background in Section 2. Section 3 describes the method, the

crowdsourcing experiment and the quality control. The results

are presented in Section 4 and discussed in Section 5. Finally,

we draw the conclusions of our study in Section 6.

2. Research background

There are considerable differences between the Dutch and the

Spanish vowel inventories [7], [8], [9], [10], [11], [12]. First,

Spanish has five vowels (/DHLRX/) [13], whereas Dutch

has fifteen unreduced vowels (seven tense vowels: /L, \XHˑ,

øˑ, Rˑ, Dˑ/; five lax vowels: /,, ɛ, ɔ, ʏ, ɑ/; three diphthongs:

/ɛLœ\ɔX/) and the reduced vowel schwa /ə/ [14]. Second,

Dutch has a tense/lax distinction, including vowel length

(short vowels: /L,ɛ\ʏɑXɔ/, long vowels: /DˑHˑ, Rˑ, øˑ,

ɛLœ\ɔX/), whereas Spanish does not have contrastive vowel

length. Third, Dutch has four front rounded vowels: /ʏ, \, øˑ,

œ\/, whereas in Spanish all rounded vowels (/RX/) are back.

Previous research has investigated the speech production

of adult Spanish learners of Dutch [7], [8], [9], [10]. Studies

conducted by Burgos et al. [7], [8] based on samples of

extemporaneous speech showed that vowel errors were more

frequent and persistent than consonant mispronunciations. For

this reason, follow-up research was conducted on the vowels.

Burgos et al. [9], [10] reported on studies in which elicited

material containing read speech was employed. The use of

read speech containing all speech sounds that are problematic

for Spanish learners, was aimed at obtaining sufficient

mispronunciations to be acoustically analyzed. Burgos et al.

[9] studied the production of three vowel contrasts (/ɑ-Dˑ/, /,-

/, /ʏ-øˑ/), and that of the Spanish learners’ realizations of all

fifteen Dutch vowels [10]. Both studies [9], [10] concentrated

on the acoustic analysis of the vowels produced by the Spanish

learners in comparison to those produced by Dutch native

speakers, and concluded that adult Spanish learners do not

10, 2015, Dresden, Germany

INTERSPEECH 2015

2819

employ duration and spectral properties in a native-like

manner. Moreover, in Burgos et al. [7], [8], [9], [10] it was

found that the L1 phonology influences L2 vowel production

and that the five Spanish vowels appear to function as

“attractors” for the larger set of Dutch vowels. Based on the

results of the studies mentioned above, we can advance the

following predictions. First, we hypothesize that Dutch lay

listeners will transcribe the tokens produced by the Spanish L2

learners differently from their canonical forms. Second, we

expect to find the

“attractor” effect phenomenon in the

listeners' transcriptions. Third, we predict that deviant patterns

found in the acoustic measurements on the same speech

material will be mirrored in the listener's transcriptions.

3. Method

3.1. Speakers

To obtain a representative sample of Spanish L1-Dutch L2

vowel pronunciation errors, speech samples from 28 adult

Spanish learners of Dutch (9 males, 19 females) with varying

degrees of proficiency (A1, n=10; A2, n=7; B1, n=4; B2, n=7,

according to the Common European Framework of Reference

for Languages [15]) were used in the current study. These data

had previously been analyzed in Burgos et al. [10].

3.2. Speech stimuli

The speech stimuli consisted of isolated words in Dutch read

by adult Spanish learners. Every speaker read a set of 29

monosyllabic words in which all fifteen Dutch vowels in

stressed position were presented. The same elicitation material

was previously used in Van der Harst [16] and Van der Harst

et al. [17]. All the words ended either in /V/ or /W/, as it is

known that these consonants scarcely alter the quality of the

preceding vowel [16], [17].

Table 1. Selected -s and -t words used as speech

stimuli from Van der Harst [16]; Vow=Vowel.

Vow

s-word

t-word

Monothongs

/L/

Kies

/NLV/

Riet

/ULW/

/,/

Vis

/Y,V/

Fit

/I,W/

/ɛ/

Zes

]

Vet

/\/

Fuut

/I\W/

/ʏ/

Zus

]

Put

/X/

Poes

/SXV/

Voet

/YXW/

/ɔ/

Vos

Vlot

/ɑ/

Gas

/xɑ

Rat

ˑ/

Aas

Staat

/VWDˑW/

Long mid

vowels

ˑ/

Mees

Beet

/øˑ/

Neus

øˑ

Neut

øˑ

ˑ/

Boos

Boot

/ERˑW/

Diphthongs

/ɛ

Ijs

/ɛ

Spijt

/œ

Huis

Fluit

/ɔ

Kous

Fout

/IɔXW/

Table 1 shows an overview of all fifteen Dutch vowels and

their corresponding orthographic and phonological

representation. No example of the vowel /\/ followed by /s/

was included, as this combination does not appear in Dutch

monosyllabic words, except proper names.

For this experiment we used a set of 29 words produced by

28 Spanish learners. Six speech samples were left out. During

the task transcribers were offered a word they had transcribed

earlier every 30

token. This was done to calculate the intra-

transcriber agreement. The inclusion of repeated items gave a

maximum of 833 speech stimuli used in the transcription task.

3.3. Listeners

Prior to participating in the experiment, listeners read the

instruction of the transcription task. They were told that they

were going to listen to utterances and that they literally had to

transcribe what they heard using orthographic spelling.

Listeners were allowed to transcribe foreign and non-existing

words which might closely represent the heard utterance. An

online questionnaire was administered to obtain background

information about the listeners. The number of questions

presented in the questionnaire was limited to keep the

crowdsourcing experiment as simple and accessible to lay

listeners as possible. The online questionnaire contained

questions concerning mother tongue, gender, age and

completed education. Almost 200 listeners participated in the

transcription task. Part of the participants were filtered out,

resulting in 159 listeners whose data was included in the

current study (see section 3.5). All participants were Dutch

non-expert native listeners.

3.4. The crowdsourcing experiment

A web application was developed in Django, in which

participants could listen to the stimuli and type what they

heard. The application was set up in such a way that it was

easy to use and also fun to do. Each participant received a

score indicating the percentage of “correct” transcriptions.

This score was based on the most frequent transcriptions given

to a word by all (previous) transcribers. The idea behind

providing a score was to motivate the participants and

introduce a game element, as the score could be shared on

Facebook. This helped recruiting new participants.

Participants transcribed 100 tokens on average. See our

companion paper by Sanders et al. [18] for a detailed

description of the application.

3.5. Quality control

Several criteria were used to filter the data. Only listeners who

had Dutch as a native language were included. Secondly,

listeners had to transcribe >10 tokens, to be sure that they

really got started to perform the task. The maximum of 833

transcriptions per listener was included (three listeners

continued to perform a second round).

We used two additional quality control criteria to

ascertain the reliability of the data, a measure of intra-

transcriber agreement and a measure of inter-transcriber

agreement [4], [5]. The intra-transcriber agreement was based

on the transcriptions of the repeated items. The inter-

transcriber agreement criterion was based on the percentage of

shared common transcriptions (see [18]). Listeners failing to

meet both agreement criteria were removed from the database.

Filtering our data resulted in a total of 17.534 tokens

transcribed and 159 listeners.

2820

4. Results

4.1. Listeners’ transcriptions, vowel confusions

The listeners' transcriptions show that both consonants and

vowels were given canonical and non-canonical transcriptions.

We will now focus on the vowels, although consonants also

deserve further investigation. Table 2 displays the most

frequent listeners' transcriptions per vowel. The fifteen target

Dutch vowels are presented in alphabetic order in the columns,

except for the last three vowels, corresponding to the three

diphthongs. The rows show the transcribed vowels, including

both the canonical transcriptions of the target vowels

(indicated by the black squares) and the non-canonical

transcription <ai>. The percentages in the cells indicate how

often a transcription was given to a target vowel. The column

Total shows the sum of all percentages of transcribed vowels

per row. Transcriptions containing percentages of less than 1%

are aggregated in the Rest category (see last row in Table 2).

Overall percentages for canonical and non-canonical

transcriptions were calculated. Our results indicate that

67.44% of all transcriptions are canonical and 32.56% non-

canonical.

The various non-canonical transcriptions (see rows in

Table 2) show that there is variation in the way the vowels

were transcribed by the lay listeners. The highest variation was

found in the long mid vowel <eu> and the diphthong <ui>.

The lowest variation appears in the vowel <aa>.

An interesting confusion pattern is found in the non-

canonical transcriptions for the target vowels and <uu>,

which are often confused with each other, and especially with

the vowel <oe>, as displayed in Table 2.

Table 2 shows that the target long mid vowel <ee>, and

the target diphthongs <ij> and <ui> have non-canonical

transcriptions, such as <ei>, <ai> and <au>, respectively.

These transcriptions seem to point to strong diphthongization,

as observed earlier in Burgos et al. [9], [10].

The column Total in Table 2 shows that some vowels were

more often transcribed by the listeners, namely, <aa>, <e>,

<ie>, <o>, <oo> and <oe>, all of them producing percentages

above 100. These vowels seem to resemble the five Spanish

vowels <a>, <e>, , <o>, , suggesting the idea of the

Spanish vowels functioning as “attractors” for the larger set of

Dutch vowels, as previously observed in Burgos et al. [7], [8],

[9], [10]. A conspicuous case, which needs to be further

examined, is the one of the two Dutch vowels <o> and <oo>,

which appear to be attracted both by the Spanish vowel <o>.

In order to better understand how lay listeners cope with

transcribing specific vowels in a contrast, we decided to study

three Dutch vowel pairs <a>-<aa>, -<ie> and -<eu> in

more detail. These vowels, produced by Spanish L2 learners,

were acoustically analyzed in Burgos et al. [9]. They differ

from each other in the way duration and place of articulation

are used to make a contrast. The contrast <a>-<aa> is based on

duration and place. The distinction between the vowels in the

pair -<ie> hinges on place and not on duration, as both

vowels are short in native Dutch. The contrast -<eu> is

only based on duration, as both vowels have a similar place of

articulation and are both front rounded vowels.

Table 2. Most frequent orthographic representations of all fifteen Dutch vowels transcribed by Dutch lay native listeners;

transcribed vowels <1% are aggregated in the Rest category, >10% in grey, >5% in light grey, canonical transcriptions in

black squares, the orthographic representation of the target Dutch vowels in the columns, the transcribed vowels in the rows;

Vow=Vowel.

Vow

Total

72.21

13.80

0.25

0.00

0.08

0.00

0.17

2.22

0.74

0.08

1.17

0.00

0.08

1.71

0.08

92.59

23.04

79.12

0.00

0.08

0.00

0.17

0.00

1.24

0.00

0.08

0.81

0.17

104.71

1.58

0.67

89.38

18.71

2.18

3.28

1.51

0.08

0.17

2.33

0.00

4.23

0.32

0.67

125.19

0.00

1.40

59.31

1.01

1.34

6.54

0.00

0.08

0.17

4.39

0.00

74.24

0.00

0.08

0.25

65.86

0.08

0.17

0.00

0.08

0.42

4.78

0.24

3.57

3.76

79.29

0.00

4.94

2.09

0.42

51.34

12.66

0.16

0.00

2.25

0.17

0.08

0.17

72.18

0.00

0.25

2.42

0.67

37.98

73.09

0.00

0.42

0.34

0.00

115.17

0.08

0.00

0.07

0.25

0.08

0.00

86.34

16.29

2.27

1.33

0.34

0.08

0.49

0.25

107.87

0.00

5.95

0.00

0.08

4.94

72.70

11.09

1.50

5.29

0.00

15.84

1.25

118.64

0.08

0.00

0.08

0.25

0.34

0.00

2.39

4.38

78.24

34.89

40.96

0.00

3.33

1.75

166.69

0.50

0.17

1.23

0.00

2.85

3.36

0.17

0.91

0.00

1.93

44.96

4.61

0.00

1.22

0.50

62.41

0.00

0.50

1.26

0.17

0.00

0.33

1.66

9.16

41.10

0.08

1.09

55.43

0.08

0.00

0.16

0.08

0.00

1.95

5.10

7.37

0.08

0.42

0.00

0.08

0.59

0.00

0.25

0.00

9.36

0.00

0.08

10.86

0.00

1.07

6.68

0.25

0.08

1.17

0.00

3.17

0.00

0.17

12.59

0.00

0.42

0.33

6.35

0.34

0.00

0.42

0.00

0.21

0.00

66.48

0.00

0.50

75.05

0.17

0.51

0.00

0.84

0.00

0.33

0.58

2.18

0.08

0.17

0.08

59.46

3.59

67.99

0.92

0.42

0.08

0.17

3.27

0.00

0.92

0.74

0.08

0.17

0.51

3.66

7.23

72.10

90.44

Rest

1.26

4.47

0.99

3.79

14.61

0.86

2.51

1.73

3.50

1.92

1.24

1.56

7.99

3.91

8.77

59.11

2821

Auris populi: crowdsourced native transcriptions of Dutch vowels spoken by adult Spanish learners

Figures

Citations

Maturational Constraints on Language Development

Patterns of learner variation in Spanish accented Dutch

Palabras. Crowdsourcing transcriptions of L2 speech

Confusability in L2 vowels. Analyzing the role of different features

Matching Acoustical Properties and Native Perceptual Assessments of L2 Speech

References

Maturational Constraints on Language Development

Second Language Accent and Pronunciation Teaching: A Research- Based Approach.

The Phonology of Dutch

On the evidence for maturational constraints in second-language acquisition

The sounds of Spanish

Related Papers (5)

Experiences from the Spoken Dutch Corpus Project

An acoustic description of the vowels of northern and southern standard Dutch II: regional varieties.

Refusal Strategies of Native Spanish Speakers in Spanish and in English and of Native English Speakers in English.

The PF STAR Children's Speech Corpus

Corpora of spoken Spanish by simultaneous and successive German-Spanish bilingual and Spanish monolingual children

Frequently Asked Questions (2)

Q1. What are the contributions in "Auris populi: crowdsourced native transcriptions of dutch vowels spoken by adult spanish learners" ?

Q2. What are the future works mentioned in the paper "Auris populi: crowdsourced native transcriptions of dutch vowels spoken by adult spanish learners" ?