Topic

Closed captioning

About: Closed captioning is a research topic. Over its lifetime, 3,011 publications have been published within this topic, receiving 64,494 citations. The topic is also known as: CC.


Papers
Journal ArticleDOI
18 Oct 2021
TL;DR: In this paper, a qualitative account of DHH people's real-time captioning experiences during small-group conversation and future design considerations to better support the groups being captioned, both in person and online, is presented.
Abstract: Real-time captioning is a critical accessibility tool for many d/Deaf and hard of hearing (DHH) people. While the vast majority of captioning work has focused on formal settings and technical innovations, in contrast, we investigate captioning for informal, interactive small-group conversations, which have a high degree of spontaneity and foster dynamic social interactions. This paper reports on semi-structured interviews and design probe activities we conducted with 15 DHH participants to understand their use of existing real-time captioning services and future design preferences for both in-person and remote small-group communication. We found that our participants' experiences of captioned small-group conversations are shaped by social, environmental, and technical considerations (e.g., interlocutors' pre-established relationships, the type of captioning displays available, and how far captions lag behind speech). When considering future captioning tools, participants were interested in greater feedback on non-speech elements of conversation (e.g., speaker identity, speech rate, volume), both for their personal use and to guide hearing interlocutors toward more accessible communication. We contribute a qualitative account of DHH people's real-time captioning experiences during small-group conversation and future design considerations to better support the groups being captioned, both in person and online.

18 citations

Proceedings ArticleDOI
01 Dec 2020
TL;DR: In this article, a BiGRU-based encoder-decoder architecture was proposed to extract subject-verb embeddings using the subjects and verbs from the audio captions.
Abstract: Audio captioning is a recently proposed task for automatically generating a textual description of a given audio clip. Most existing approaches use the encoder-decoder model without using semantic information. In this study, we propose a bi-directional Gated Recurrent Unit (BiGRU) model based on an encoder-decoder architecture using audio and semantic embeddings. To obtain semantic embeddings, we extract subject-verb embeddings using the subjects and verbs from the audio captions. For the testing stage, we use a Multilayer Perceptron classifier to predict subject-verb embeddings of test audio clips. To extract audio features, in addition to log Mel energies, we use a pretrained audio neural network (PANN) as a feature extractor, used here for the first time in audio captioning to explore the usability of pretrained audio embeddings for this task. We combine audio embeddings and semantic embeddings to feed the BiGRU-based encoder-decoder model. Following this, we evaluate our model on two audio captioning datasets: Clotho and AudioCaps. Experimental results show that the proposed BiGRU-based deep model significantly outperforms state-of-the-art results across different evaluation metrics, and that the inclusion of semantic information enhances captioning performance.
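The architecture described above can be sketched in PyTorch. This is a minimal illustrative sketch, not the authors' implementation: all dimensions, the fusion-by-concatenation step, and the class and parameter names are assumptions chosen for brevity.

```python
import torch
import torch.nn as nn

class BiGRUCaptioner(nn.Module):
    """Sketch of a BiGRU encoder-decoder for audio captioning.

    Frame-level audio features (e.g., log-Mel or PANN embeddings) are
    fused with a clip-level semantic (subject-verb) embedding, encoded
    by a bidirectional GRU, and decoded token by token. Dimensions are
    illustrative, not taken from the paper.
    """
    def __init__(self, audio_dim=64, sem_dim=16, hidden=32, vocab=100):
        super().__init__()
        self.encoder = nn.GRU(audio_dim + sem_dim, hidden,
                              batch_first=True, bidirectional=True)
        self.embed = nn.Embedding(vocab, hidden)
        self.decoder = nn.GRU(hidden, 2 * hidden, batch_first=True)
        self.out = nn.Linear(2 * hidden, vocab)

    def forward(self, audio, sem, tokens):
        # Broadcast the clip-level semantic embedding over time and
        # concatenate it with the frame-level audio features.
        sem = sem.unsqueeze(1).expand(-1, audio.size(1), -1)
        _, h = self.encoder(torch.cat([audio, sem], dim=-1))
        # Merge the forward and backward final encoder states into the
        # decoder's initial hidden state.
        h0 = torch.cat([h[0], h[1]], dim=-1).unsqueeze(0)
        dec, _ = self.decoder(self.embed(tokens), h0)
        return self.out(dec)  # (batch, seq, vocab) logits

model = BiGRUCaptioner()
logits = model(torch.randn(2, 50, 64),                # 50 audio frames
               torch.randn(2, 16),                    # subject-verb embedding
               torch.zeros(2, 7, dtype=torch.long))   # caption prefix
print(logits.shape)  # torch.Size([2, 7, 100])
```

At inference, the predicted subject-verb embedding (from the paper's MLP classifier) would replace the ground-truth semantic input, and decoding would proceed greedily or with beam search.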

18 citations

Journal ArticleDOI
TL;DR: The pedagogical implication is that captioning support, added or removed based on learner self-reports, may not be inherently beneficial, as learners' perceptions of their reliance on captioning may be inaccurate.
Abstract: Instructional support has been widely discussed as a strategy to optimize student-learning experiences. This study examines instructional support within the context of a multimedia language-learning environment, with the predominant focus on learners’ perceptions of captioning support for listening comprehension. The study seeks to answer two questions: (1) do learners’ perceptions regarding dependence on captions match their actual reliance on captioning for listening comprehension? and (2) which learners’ perceptions are most influenced by proficiency: low-intermediate, intermediate, or high-intermediate? A total of 139 students from a high school English course in northern Taiwan, all accustomed to multimedia instruction that includes full captions, completed an English language proficiency test as well as a caption reliance test (CRT), and also provided their perceived degree of reliance on captions for English listening comprehension. The results show that overall perceived reliance was significantly...

18 citations

Journal ArticleDOI
TL;DR: This method can utilize image and text data scraped from the internet to push past the performance limits of the concepts-decoder framework, and can transfer the knowledge learned from web data to the standard dataset.

18 citations

Patent
30 Aug 2017
TL;DR: In this paper, a raw audio waveform including a non-speech sound is received, and relevant features are extracted from the raw audio waveform using a recurrent neural network (RNN) acoustic model.
Abstract: A method, computer readable medium, and system are disclosed for audio captioning. A raw audio waveform including a non-speech sound is received and relevant features are extracted from the raw audio waveform using a recurrent neural network (RNN) acoustic model. A discrete sequence of characters represented in a natural language is generated based on the relevant features, where the discrete sequence of characters comprises a caption that describes the non-speech sound.
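The pipeline the patent describes (raw waveform in, RNN features, character sequence out) can be sketched as follows. This is a hedged illustration under assumed details: the frame size, hidden width, character set, and per-frame greedy readout are inventions for the example, not specifics from the patent.

```python
import torch
import torch.nn as nn

# Illustrative character inventory for the natural-language caption.
CHARS = list(" abcdefghijklmnopqrstuvwxyz")

class RNNAcousticModel(nn.Module):
    """Sketch of an RNN acoustic model captioning non-speech audio."""
    def __init__(self, frame=160, hidden=32):
        super().__init__()
        self.rnn = nn.GRU(frame, hidden, batch_first=True)
        self.out = nn.Linear(hidden, len(CHARS))

    def forward(self, waveform):
        # Slice the 1-D waveform into fixed-size, non-overlapping frames.
        frames = waveform.unfold(-1, 160, 160)  # (batch, n_frames, 160)
        feats, _ = self.rnn(frames)             # per-frame RNN features
        return self.out(feats)                  # per-frame char logits

model = RNNAcousticModel()
logits = model(torch.randn(1, 16000))           # one second at 16 kHz
# Greedy per-frame readout into a discrete character sequence.
caption = "".join(CHARS[i] for i in logits.argmax(-1)[0].tolist())
print(len(caption))  # one character per frame: 100
```

A production system would replace the per-frame argmax with a sequence-level decoder (e.g., CTC-style collapsing or an attention decoder) so the output reads as a caption such as "dog barking" rather than one symbol per frame.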

18 citations


Network Information
Related Topics (5)
Feature vector
48.8K papers, 954.4K citations
83% related
Object detection
46.1K papers, 1.3M citations
82% related
Convolutional neural network
74.7K papers, 2M citations
82% related
Deep learning
79.8K papers, 2.1M citations
82% related
Unsupervised learning
22.7K papers, 1M citations
81% related
Performance
Metrics
No. of papers in the topic in previous years
Year	Papers
2023	536
2022	1,030
2021	504
2020	530
2019	448
2018	334