Author

Marc Schröder

Bio: Marc Schröder is an academic researcher from RWTH Aachen University. The author has contributed to research in the topics of Speech synthesis & Nash equilibrium. The author has an h-index of 31 and has co-authored 102 publications receiving 4,110 citations. Previous affiliations of Marc Schröder include the University of Hamburg & Karlsruhe Institute of Technology.


Papers
Journal Article
TL;DR: A large audiovisual database is created as a part of an iterative approach to building Sensitive Artificial Listener agents that can engage a person in a sustained, emotionally colored conversation.
Abstract: SEMAINE has created a large audiovisual database as part of an iterative approach to building Sensitive Artificial Listener (SAL) agents that can engage a person in a sustained, emotionally colored conversation. Data used to build the agents came from interactions between users and an "operator" simulating a SAL agent, in different configurations: Solid SAL (designed so that operators displayed appropriate nonverbal behavior) and Semi-automatic SAL (designed so that users' experience approximated interacting with a machine). We then recorded user interactions with the developed system, Automatic SAL, comparing the most communicatively competent version to versions with reduced nonverbal skills. High-quality recording was provided by five high-resolution, high-framerate cameras and four microphones, recorded synchronously. The recordings comprise 150 participants and 959 conversations with individual SAL characters, each lasting approximately 5 minutes. Solid SAL recordings are transcribed and extensively annotated: 6-8 raters per clip traced five affective dimensions and 27 associated categories. Other scenarios are labeled on the same pattern, but less fully. Additional information includes FACS annotation on selected extracts, identification of laughs, nods, and shakes, and measures of user engagement with the automatic system. The material is available through a web-accessible database.

627 citations

Journal Article
TL;DR: The usefulness of the modular and transparent design approach is illustrated with an early prototype of an interface for emotional speech synthesis and examples of how this interface can be put to use in research, development and teaching.
Abstract: This paper introduces the German text-to-speech synthesis system MARY. The system's main features, namely a modular design and an XML-based system-internal data representation, are pointed out, and the properties of the individual modules are briefly presented. An interface allowing the user to access and modify intermediate processing steps without the need for a technical understanding of the system is described, along with examples of how this interface can be put to use in research, development and teaching. The usefulness of the modular and transparent design approach is further illustrated with an early prototype of an interface for emotional speech synthesis.
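The "intermediate processing steps" mentioned above can be requested directly as XML in the open-source MaryTTS successor to this system. A minimal sketch, assuming a locally running MaryTTS server with its standard /process HTTP endpoint on the default port 59125 (this HTTP interface postdates the paper and is an assumption about the later open-source release, not the interface the paper describes):

    # Illustrative only: assumes a MaryTTS server is running locally on the default
    # port 59125 and exposes the /process endpoint of the later open-source MaryTTS,
    # not the system exactly as described in the paper above.
    import urllib.parse
    import urllib.request

    params = urllib.parse.urlencode({
        "INPUT_TEXT": "Willkommen in der Welt der Sprachsynthese!",
        "INPUT_TYPE": "TEXT",
        "OUTPUT_TYPE": "INTONATION",  # request an intermediate MaryXML step instead of audio
        "LOCALE": "de",
    })
    with urllib.request.urlopen("http://localhost:59125/process?" + params) as response:
        print(response.read().decode("utf-8"))  # MaryXML annotated with tokens and intonation

Changing OUTPUT_TYPE (e.g. to AUDIO) selects how far through the modular pipeline the input is processed, which is the kind of user-accessible intermediate access the paper's interface is designed to expose.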

456 citations

Proceedings Article
01 Jan 2001
TL;DR: An overview of what has been done in the field of adding emotion effects to synthesised speech is given, pointing out the inherent properties of the various synthesis techniques used, summarising the prosody rules employed, and taking a look at the evaluation paradigms.
Abstract: Attempts to add emotion effects to synthesised speech have existed for more than a decade now. Several prototypes and fully operational systems have been built based on different synthesis techniques, and quite a number of smaller studies have been conducted. This paper aims to give an overview of what has been done in this field, pointing out the inherent properties of the various synthesis techniques used, summarising the prosody rules employed, and taking a look at the evaluation paradigms. Finally, an attempt is made to discuss interesting directions for future development.

360 citations

Journal Article
S. Chatrchyan, Vardan Khachatryan, Albert M. Sirunyan, Armen Tumasyan, and 2,230 more authors (144 institutions)
TL;DR: The observed (expected) upper limit on the invisible branching fraction, found to be 0.58 (0.44), is interpreted in terms of a Higgs-portal model of dark matter interactions.
Abstract: A search for invisible decays of Higgs bosons is performed using the vector boson fusion and associated ZH production modes. In the ZH mode, the Z boson is required to decay to a pair of charged leptons or a $b\bar{b}$ quark pair. The searches use the 8 TeV pp collision dataset collected by the CMS detector at the LHC, corresponding to an integrated luminosity of up to 19.7 inverse femtobarns. Certain channels include data from 7 TeV collisions corresponding to an integrated luminosity of 4.9 inverse femtobarns. The searches are sensitive to non-standard-model invisible decays of the recently observed Higgs boson, as well as additional Higgs bosons with similar production modes and large invisible branching fractions. In all channels, the observed data are consistent with the expected standard model backgrounds. Limits are set on the production cross section times invisible branching fraction, as a function of the Higgs boson mass, for the vector boson fusion and ZH production modes. By combining all channels, and assuming standard model Higgs boson cross sections and acceptances, the observed (expected) upper limit on the invisible branching fraction at $m_H$=125 GeV is found to be 0.58 (0.44) at 95% confidence level. We interpret this limit in terms of a Higgs-portal model of dark matter interactions.

246 citations

Journal Article
TL;DR: It is shown that affect bursts, presented without context, can convey a clearly identifiable emotional meaning; the influence of the segmental structure on emotion recognition, as opposed to prosody and voice quality, is also investigated.

215 citations


Cited by
01 Jan 2009

7,241 citations

Journal Article
01 Jun 1959

3,442 citations

Proceedings Article
01 Jan 1999

2,010 citations

Journal Article
TL;DR: A review of 104 studies of vocal expression and 41 studies of music performance reveals similarities between the two channels concerning (a) the accuracy with which discrete emotions were communicated to listeners and (b) the emotion-specific patterns of acoustic cues used to communicate each emotion.
Abstract: Many authors have speculated about a close relationship between vocal expression of emotions and musical expression of emotions, but evidence bearing on this relationship has unfortunately been lacking. This review of 104 studies of vocal expression and 41 studies of music performance reveals similarities between the 2 channels concerning (a) the accuracy with which discrete emotions were communicated to listeners and (b) the emotion-specific patterns of acoustic cues used to communicate each emotion. The patterns are generally consistent with K. R. Scherer's (1986) theoretical predictions. The results can explain why music is perceived as expressive of emotion, and they are consistent with an evolutionary perspective on vocal expression of emotions. Discussion focuses on theoretical accounts and directions for future research.

1,474 citations