Author

Alan Cruttenden

Bio: Alan Cruttenden is an academic researcher from the University of Oxford. The author has contributed to research on the topics of speech synthesis and the motor theory of speech perception. The author has an h-index of 2 and has co-authored 2 publications receiving 646 citations.

Papers

Book

Gimson's Pronunciation of English

Alan Cruttenden¹

University of Oxford¹

21 Jul 1994

TL;DR: A three-part text on English pronunciation: Part I, Speech and language (chapters 1-5); Part II, The sounds of English (chapters 6-9); Part III, Words and connected speech (chapters 10-13, ending with teaching the pronunciation of English).

Abstract: Part I, Speech and language: 1. Communication; 2. The production of speech; 3. The sounds of speech; 4. The description and classification of speech sounds; 5. Sounds in language. Part II, The sounds of English: 6. The historical background; 7. Standard and regional accents; 8. The English vowels; 9. The English consonants. Part III, Words and connected speech: 10. Words; 11. Connected speech; 12. Words in connected speech; 13. Teaching the pronunciation of English.

659 citations

Journal Article

Intonational diglossia: a case study of Glasgow

Alan Cruttenden¹

University of Oxford¹

01 Dec 2007, Journal of the International Phonetic Association

TL;DR: Auditory and acoustic data from recordings of a Glaswegian English speaker in conversational and reading modes show that clearly different intonational systems were used in the two modes.

Abstract: Auditory and acoustic data were produced from recordings of a Glaswegian English speaker in conversational and reading modes. Clearly different intonational systems were used in the two modes. The reading style used an intonation similar to that used in standard British intonation (the intonation of ‘Received Pronunciation’ (RPI)). The conversational style was an example of the type of intonation used in a number of cities in the north of the UK (Urban North British Intonation (UNBI)), characterised by a default intonation involving rising or rising-slumping nuclear pitch patterns. This speaker illustrates a clear-cut case of intonational diglossia with a falling default tune in the one mode and a rising(-falling) default tune in the other.

29 citations

Cited by

Journal Article

Spoken word recognition and lexical representation in very young children

Daniel Swingley¹,², Richard N. Aslin¹

University of Rochester¹, Max Planck Society²

14 Aug 2000, Cognition

TL;DR: The results suggest that children's representations of familiar words are phonetically well-specified, and that this specification may not be a consequence of the need to differentiate similar words in production.

429 citations

Book

Listening in the Language Classroom

John Field¹

University of Reading¹

23 Feb 2009

TL;DR: This book argues that a preoccupation with the notion of 'comprehension' has led teachers to focus upon the product of listening, in the form of answers to questions, ignoring the listening process itself.

Abstract: This book challenges the orthodox approach to the teaching of second language listening, which is based upon the asking and answering of comprehension questions. The book's central argument is that a preoccupation with the notion of 'comprehension' has led teachers to focus upon the product of listening, in the form of answers to questions, ignoring the listening process itself. The author provides an informed account of the psychological processes which make up the skill of listening, and analyses the characteristics of the speech signal from which listeners have to construct a message. Drawing upon this information, the book proposes a radical alternative to the comprehension approach and provides for intensive small-scale practice in aspects of listening that are perceptually or cognitively demanding for the learner. Listening in the Language Classroom was winner of the Ben Warren International Trust House Prize in 2008.

348 citations

Posted Content

LipNet: End-to-End Sentence-level Lipreading

Yannis M. Assael, Brendan Shillingford, Shimon Whiteson, Nando de Freitas

04 Nov 2016, arXiv: Learning

TL;DR: This work presents LipNet, a model that maps a variable-length sequence of video frames to text, making use of spatiotemporal convolutions, a recurrent network, and the connectionist temporal classification loss, trained entirely end-to-end.

Abstract: Lipreading is the task of decoding text from the movement of a speaker's mouth. Traditional approaches separated the problem into two stages: designing or learning visual features, and prediction. More recent deep lipreading approaches are end-to-end trainable (Wand et al., 2016; Chung & Zisserman, 2016a). However, existing work on models trained end-to-end performs only word classification, rather than sentence-level sequence prediction. Studies have shown that human lipreading performance increases for longer words (Easton & Basala, 1982), indicating the importance of features capturing temporal context in an ambiguous communication channel. Motivated by this observation, we present LipNet, a model that maps a variable-length sequence of video frames to text, making use of spatiotemporal convolutions, a recurrent network, and the connectionist temporal classification loss, trained entirely end-to-end. To the best of our knowledge, LipNet is the first end-to-end sentence-level lipreading model that simultaneously learns spatiotemporal visual features and a sequence model. On the GRID corpus, LipNet achieves 95.2% accuracy on the sentence-level, overlapped speaker split task, outperforming experienced human lipreaders and the previous 86.4% word-level state-of-the-art accuracy (Gergen et al., 2016).

295 citations

Massive reduction in conversational American English

Keith A. Johnson

01 Jan 2004

TL;DR: "The English are a lazy lot, and will not speak a word as it should be spoken when they can slide through it. Why be bothered to say extraordinary when you can get away with strawdiny? ... Many of the Oxford Cockneys are weaklings too languid or emasculated to speak their noble language with any vigor." [St. John Ervine, quoted by H.L. Mencken (1948, p. 39)]

Abstract: The English are a lazy lot, and will not speak a word as it should be spoken when they can slide through it. Why be bothered to say extraordinary when you can get away with strawdiny? ... Many of the Oxford Cockneys are weaklings too languid or emasculated to speak their noble language with any vigor, but the majority are following a foolish fashion which had better be abandoned. Its ugliness alone should make it unpopular, but it has the additional effect of causing confusion. [Irish playwright St. John Ervine, quoted by H.L. Mencken (1948, p. 39)]

273 citations

Journal Article

The Formants of Monophthong Vowels in Standard Southern British English Pronunciation

David Deterding¹

National Institute of Education¹

01 Jun 1997, Journal of the International Phonetic Association

TL;DR: This article measured the formants of the eleven monophthong vowels of Standard Southern British pronunciation of English using linear-prediction-based formant tracks overlaid on digital spectrograms for an average of ten instances of each vowel for each speaker.

Abstract: The formants of the eleven monophthong vowels of Standard Southern British (SSB) pronunciation of English were measured for five male and five female BBC broadcasters whose speech was included in the MARSEC database. The measurements were made using linear-prediction-based formant tracks overlaid on digital spectrograms for an average of ten instances of each vowel for each speaker. These measurements were taken from connected speech, allowing comparison with previous formant values measured from citation words. It was found that the male vowels were significantly less peripheral in the measurements from connected speech than in measurements from citation words.

220 citations
