Topic
Closed captioning
About: Closed captioning is a research topic. Over its lifetime, 3011 publications have been published within this topic, receiving 64494 citations. The topic is also known as: CC.
Papers published on a yearly basis
Papers
•
17 Sep 2007
TL;DR: This book discusses Digital Television Channel Coding and Modulation; Closed Captioning, Subtitling, and Teletext; and the MPEG-2 Video Compression Standard.
Abstract: Preface. 1. Introduction to Analog and Digital Television. 2. Characteristics of Video Material. 3. Predictive Encoding. 4. Transform Coding. 5. Video Coder Syntax. 6. The MPEG-2 Video Compression Standard. 7. Perceptual Audio Coding. 8. Frequency Analysis and Synthesis. 9. MPEG Audio. 10. Dolby AC-3 Audio. 11. MPEG-2 Systems. 12. DVB Service Information and ATSC Program and System Information Protocol. 13. Digital Television Channel Coding and Modulation. 14. Closed Captioning, Subtitling, and Teletext. Appendix: MPEG Tables. Index.
22 citations
•
TL;DR: The authors generate story-like video captions that convey richer contents by temporally segmenting the video with action localization, generating multiple captions from multiple frames, and connecting them with natural language processing techniques.
Abstract: Recent advances in image captioning task have led to increasing interests in video captioning task. However, most works on video captioning are focused on generating single input of aggregated features, which hardly deviates from image captioning process and does not fully take advantage of dynamic contents present in videos. We attempt to generate video captions that convey richer contents by temporally segmenting the video with action localization, generating multiple captions from multiple frames, and connecting them with natural language processing techniques, in order to generate a story-like caption. We show that our proposed method can generate captions that are richer in contents and can compete with state-of-the-art method without explicitly using video-level features as input.
22 citations
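The "connect segment captions into a story" step described above can be illustrated with a toy sketch (the function and its connectives are hypothetical, not the authors' implementation): given one caption per localized action segment, join them with temporal connectives to form a single story-like caption.

```python
def connect_captions(segment_captions):
    """Toy sketch: join per-segment captions into a story-like caption
    using temporal connectives (hypothetical, for illustration only)."""
    connectives = ["First,", "Then,", "After that,", "Finally,"]
    if len(segment_captions) == 1:
        # A single segment needs no connective; just normalize punctuation.
        return segment_captions[0].capitalize().rstrip(".") + "."
    parts = []
    for i, cap in enumerate(segment_captions):
        if i == 0:
            conn = connectives[0]
        elif i == len(segment_captions) - 1:
            conn = connectives[-1]
        else:
            # Alternate middle connectives to avoid repetition.
            conn = connectives[1 + (i - 1) % 2]
        parts.append(f"{conn} {cap.rstrip('.')}.")
    return " ".join(parts)
```

A real system would use learned language generation rather than fixed templates, but the sketch shows why segment-level captions can carry more of a video's dynamic content than a single aggregated caption.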
•
22 Jul 2007
TL;DR: This paper describes the development of a system that can provide an automatic text transcription of multiple speakers using speech recognition (SR), with the names of speakers identified in the transcription and corrections of SR errors made in real time by a human 'editor'.
Abstract: Text transcriptions of the spoken word can benefit deaf people, and also anyone who needs to review what has been said (e.g. at lectures, presentations, meetings, etc.). Real-time captioning (i.e. creating a live verbatim transcript of what is being spoken) using phonetic keyboards can provide an accurate live transcription for deaf people, but is often not available because of the cost and shortage of highly skilled and trained stenographers. This paper describes the development of a system that can provide an automatic text transcription of multiple speakers using speech recognition (SR), with the names of speakers identified in the transcription and corrections of SR errors made in real time by a human 'editor'.
22 citations
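The editor-in-the-loop idea in the paper above can be sketched minimally (the class, its `window` parameter, and the correction interface are assumptions for illustration, not the paper's system): SR output is appended line by line with speaker names, and a human editor patches misrecognitions within a recent window of lines.

```python
class LiveTranscript:
    """Minimal sketch of real-time SR captioning with a human editor.

    Hypothetical design: SR output arrives as (speaker, text) pairs;
    the editor can only correct the most recent `window` lines, since
    older captions have already been displayed for some time.
    """

    def __init__(self, window=5):
        self.lines = []
        self.window = window

    def add_sr_output(self, speaker, text):
        # Label each transcribed utterance with the identified speaker.
        self.lines.append(f"{speaker}: {text}")

    def correct(self, wrong, right):
        # Apply the editor's fix only within the recent window of lines.
        start = max(0, len(self.lines) - self.window)
        for i in range(start, len(self.lines)):
            self.lines[i] = self.lines[i].replace(wrong, right)

    def render(self):
        return "\n".join(self.lines)
```

Usage: after `add_sr_output("Alice", "it is hard to wreck a nice beach")`, the editor can call `correct("wreck a nice beach", "recognize speech")` to fix the classic SR confusion before the caption scrolls away.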
•
TL;DR: A hierarchical attention-based multimodal fusion model for video captioning is proposed by jointly considering the intrinsic properties of multimodal features; experimental results show that the proposed method achieves competitive performance compared with related video captioning methods.
22 citations
•
12 Sep 2019
TL;DR: VizSeq is presented, a visual analysis toolkit for instance-level and corpus-level system evaluation on a wide variety of text generation tasks. It covers most common n-gram based metrics accelerated with multiprocessing, and also provides the latest embedding-based metrics such as BERTScore.
Abstract: Automatic evaluation of text generation tasks (e.g. machine translation, text summarization, image captioning and video description) usually relies heavily on task-specific metrics, such as BLEU and ROUGE. They, however, are abstract numbers and are not perfectly aligned with human assessment. This suggests inspecting detailed examples as a complement to identify system error patterns. In this paper, we present VizSeq, a visual analysis toolkit for instance-level and corpus-level system evaluation on a wide variety of text generation tasks. It supports multimodal sources and multiple text references, providing visualization in a Jupyter notebook or a web app interface. It can be used locally or deployed onto public servers for centralized data hosting and benchmarking. It covers most common n-gram based metrics accelerated with multiprocessing, and also provides the latest embedding-based metrics such as BERTScore.
21 citations
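The n-gram based metrics that VizSeq accelerates (BLEU and its relatives) are built on clipped n-gram precision. A minimal illustration of that core computation follows; this is a generic sketch, not VizSeq's actual code, and omits BLEU's brevity penalty and multi-reference handling.

```python
from collections import Counter


def ngram_precision(candidate, reference, n=2):
    """Clipped n-gram precision, the building block of BLEU-style metrics.

    Each candidate n-gram's count is clipped by its count in the
    reference, so repeating a correct n-gram cannot inflate the score.
    """
    cand_tokens = candidate.split()
    ref_tokens = reference.split()
    cand_ngrams = Counter(
        tuple(cand_tokens[i:i + n]) for i in range(len(cand_tokens) - n + 1)
    )
    ref_ngrams = Counter(
        tuple(ref_tokens[i:i + n]) for i in range(len(ref_tokens) - n + 1)
    )
    if not cand_ngrams:
        return 0.0
    overlap = sum(min(c, ref_ngrams[g]) for g, c in cand_ngrams.items())
    return overlap / sum(cand_ngrams.values())
```

For example, comparing "the cat sat on the mat" against the reference "the cat is on the mat" gives a bigram precision of 3/5, since only "the cat", "on the", and "the mat" appear in the reference.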