Topic

Closed captioning

About: Closed captioning is a research topic. Over its lifetime, 3,011 publications have been published on this topic, receiving 64,494 citations. The topic is also known as: CC.


Papers
Posted Content
TL;DR: Experimental results show that the proposed method succeeds in using a pre-trained language model for audio captioning, and that the oracle performance of the pre-trained caption generator is clearly better than that of a conventional model trained from scratch.
Abstract: The goal of audio captioning is to translate input audio into a natural-language description. One of the problems in audio captioning is the lack of training data, because audio-caption pairs are difficult to collect by crawling the web. In this study, we propose to overcome this problem by using a pre-trained large-scale language model. Since audio cannot be fed directly into such a language model, we retrieve guidance captions from the training dataset based on audio similarity. The caption of the input audio is then generated by the pre-trained language model while referring to the guidance captions. Experimental results show that (i) the proposed method successfully uses a pre-trained language model for audio captioning, and (ii) the oracle performance of the pre-trained caption generator is clearly better than that of a conventional model trained from scratch.
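The retrieve-then-generate idea above is straightforward to sketch. The following is a minimal illustration, not the paper's implementation: embed_audio stands in for a pretrained audio encoder, generate_caption for the pretrained language model conditioned on the retrieved guidance captions, and all names and the toy data are invented for illustration.

```python
import numpy as np

# Hypothetical stand-in for a pretrained audio encoder; here it just
# returns a fixed-dimensional pseudo-random embedding per clip id.
def embed_audio(clip_id: str, dim: int = 128) -> np.ndarray:
    rng = np.random.default_rng(abs(hash(clip_id)) % (2**32))
    return rng.standard_normal(dim)

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def retrieve_guidance(query_clip: str, train_set: dict[str, str], k: int = 3) -> list[str]:
    """Return captions of the k training clips most similar to the query audio."""
    q = embed_audio(query_clip)
    ranked = sorted(train_set, key=lambda c: cosine(q, embed_audio(c)), reverse=True)
    return [train_set[c] for c in ranked[:k]]

def generate_caption(guidance: list[str]) -> str:
    # Placeholder for the pretrained language model: the paper generates
    # a new caption conditioned on the guidance; here we echo the top hit.
    return guidance[0]

train_set = {
    "clip_001": "a dog barks repeatedly in the distance",
    "clip_002": "rain falls steadily on a metal roof",
    "clip_003": "a crowd applauds after a speech",
}
print(generate_caption(retrieve_guidance("query_clip", train_set)))
```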

20 citations

Patent
11 Jan 2006
TL;DR: In this article, a content detecting device for a digital broadcast signal receiver, or for a recording apparatus that records the digital broadcast signal, is presented; it detects commercials based on the presence or absence of a closed captioning broadcast or a data broadcast.
Abstract: A content detecting device for a digital broadcast signal receiver or for a recording apparatus that records the digital broadcast signal. A program-related-information acquiring unit acquires program specific information and information for creating an electronic program guide, and stores them in a memory. A detecting unit detects a commercial based on the presence or absence of a closed captioning broadcast or a data broadcast, and stores the detection information in the memory. A discriminating unit reads out the detection information and outputs a signal distinguishing the program from the commercial. When the program specific information and the electronic program guide information stored in the memory contradict each other concerning the presence or absence of a closed captioning broadcast or a data broadcast, the detecting unit stores information in the memory indicating detection of a commercial.
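The contradiction rule at the heart of the patent can be sketched in a few lines. This is a hypothetical rendering, since the patent specifies no data model; the field names are invented:

```python
from dataclasses import dataclass

@dataclass
class SegmentInfo:
    has_closed_captions: bool  # presence of a closed captioning broadcast
    has_data_broadcast: bool   # presence of a data broadcast

def is_commercial(psi: SegmentInfo, epg: SegmentInfo) -> bool:
    """Flag the current segment as a commercial when the program specific
    information (PSI) and the electronic program guide (EPG) metadata
    disagree about closed captioning or data-broadcast presence."""
    return (psi.has_closed_captions != epg.has_closed_captions
            or psi.has_data_broadcast != epg.has_data_broadcast)

# Example: the EPG says the program carries captions, but the PSI for the
# current segment reports none, so the segment is treated as a commercial.
psi = SegmentInfo(has_closed_captions=False, has_data_broadcast=False)
epg = SegmentInfo(has_closed_captions=True, has_data_broadcast=False)
print(is_commercial(psi, epg))  # True
```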

20 citations

Patent
25 Jan 2005
TL;DR: In this article, a system for providing caption information to one or more mobile devices is presented; it comprises a communication network and a transcription device that delivers the transcription in near real time, using the network to send text from the caption data to at least one of the mobile devices.
Abstract: A system for providing caption information to one or more mobile devices includes a communication network and one or more mobile devices connected to it. The mobile devices can include a cellular device, a personal digital assistant, or a wireless device. The system includes a captioning device to present caption data on a display and a transcription device to transcribe data; the transcription device provides near-real-time delivery of the transcription. The system uses the communication network to send text from the caption data to at least one of the mobile devices while simultaneously sending the caption data to one or more captioning devices.
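The system's core behavior is a simple fan-out: transcribed text goes to mobile subscribers while the full caption data simultaneously feeds captioning devices. A minimal sketch under assumed interfaces (none of these names come from the patent):

```python
from typing import Callable

# Hypothetical delivery callbacks; in the patent these would be the
# communication network (cellular/wireless) and the caption display path.
MobileSink = Callable[[str], None]
CaptionSink = Callable[[dict], None]

def fan_out(caption_data: dict,
            mobile_sinks: list[MobileSink],
            caption_sinks: list[CaptionSink]) -> None:
    """Send the text portion to mobile devices and the full caption
    payload (text plus timing/formatting) to captioning devices."""
    text = caption_data["text"]
    for send_text in mobile_sinks:
        send_text(text)                # near-real-time text to phones/PDAs
    for send_caption in caption_sinks:
        send_caption(caption_data)     # full caption data to displays

fan_out(
    {"text": "Speaker: Welcome, everyone.", "start_ms": 1200, "end_ms": 2900},
    mobile_sinks=[lambda t: print("mobile:", t)],
    caption_sinks=[lambda c: print("display:", c)],
)
```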

20 citations

Journal ArticleDOI
TL;DR: It is found that Deaf signers share content most often in written English, despite their desire to share in sign language; key areas of difficulty in consuming and producing content on social media platforms are also identified.
Abstract: Social media platforms support the sharing of written text, video, and audio. All of these formats may be inaccessible to people who are deaf or hard of hearing (DHH), particularly those who primarily communicate via sign language, people who we call Deaf signers. We study how Deaf signers engage with social platforms, focusing on how they share content and the barriers they face. We employ a mixed-methods approach involving seven in-depth interviews and a survey of a larger population (n = 60). We find that Deaf signers share the most in written English, despite their desire to share in sign language. We further identify key areas of difficulty in consuming content (e.g., lack of captions for spoken content in videos) and producing content (e.g., captioning signed videos, signing into a phone camera) on social media platforms. Our results both provide novel insights into social media use by Deaf signers and reinforce prior findings on DHH communication more generally, while revealing potential ways to make social media platforms more accessible to Deaf signers.

19 citations

Journal ArticleDOI
TL;DR: This paper investigates multimodal architectures to replace the “someone” tags with proper character names in existing video captions, and presents an improved version of the dataset, namely M-VAD Names, and its semi-automatic annotation procedure.
Abstract: Current movie captioning architectures cannot mention characters by their proper names, replacing them with a generic “someone” tag. The lack of movie description datasets with visual annotations of characters surely contributes to this shortcoming. Recently, we proposed to extend the M-VAD dataset by introducing such information. In this paper, we present an improved version of the dataset, namely M-VAD Names, and its semi-automatic annotation procedure. The resulting dataset contains 63k visual tracks and 34k textual mentions, all associated with character identities. To showcase the features of the dataset and quantify the complexity of the naming task, we investigate multimodal architectures to replace the “someone” tags with proper character names in existing video captions. The evaluation is further extended by testing this application on videos outside the M-VAD Names dataset.
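The naming task itself, substituting predicted character names for “someone” tags in order of appearance, can be sketched as simple caption post-processing. This is an illustrative toy, not the paper's multimodal model; the track ids and the lookup are invented:

```python
import re
from typing import Callable

def name_someone_tags(caption: str, face_tracks: list[str],
                      identify: Callable[[str], str]) -> str:
    """Replace each "someone" tag with the character name predicted
    for the corresponding face track, in order of appearance."""
    tracks = iter(face_tracks)
    def repl(m: re.Match) -> str:
        track = next(tracks, None)
        return identify(track) if track else m.group(0)
    return re.sub(r"\bsomeone\b", repl, caption, flags=re.IGNORECASE)

# Toy identifier: a dict lookup stands in for the multimodal model
# that links each face track to a character identity.
names = {"track_17": "Marty", "track_42": "Doc"}
caption = "Someone opens the door while someone watches."
print(name_someone_tags(caption, ["track_17", "track_42"],
                        lambda t: names.get(t, "someone")))
# -> "Marty opens the door while Doc watches."
```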

19 citations


Network Information
Related Topics (5)
Feature vector: 48.8K papers, 954.4K citations, 83% related
Object detection: 46.1K papers, 1.3M citations, 82% related
Convolutional neural network: 74.7K papers, 2M citations, 82% related
Deep learning: 79.8K papers, 2.1M citations, 82% related
Unsupervised learning: 22.7K papers, 1M citations, 81% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    536
2022    1,030
2021    504
2020    530
2019    448
2018    334