Open Access · Proceedings Article · DOI

Compute to Tell the Tale: Goal-Driven Narrative Generation

TLDR
This paper reviews the problem of computational narrative generation where a goal-driven narrative (in the form of text with or without video) is generated from a single or multiple long videos and outlines a general narrative generation framework.
Abstract
Man is by nature a social animal. One important facet of human evolution is narrative imagination, be it fictional or factual, and the drive to tell the tale to other individuals. Factual narratives, such as news, journalism, and field reports, are based on real-world events and often require extensive human effort to create. In the era of big data, where video capture devices are available everywhere, a massive amount of raw video (including life-logging, dashcam, and surveillance footage) is generated daily. As a result, it is practically impossible for humans to digest and analyze all of this video data. This paper reviews the problem of computational narrative generation, in which a goal-driven narrative (in the form of text, with or without video) is generated from one or more long videos. Importantly, the narrative generation problem is distinguished from the existing literature by its focus on a comprehensive understanding of the user's goal, narrative structure, and open-domain input. We tentatively outline a general narrative generation framework and discuss the potential research problems and challenges in this direction. Informed by the real-world impact of narrative generation, we then illustrate several practical use cases on a Video Logging as a Service platform, which enables users to get more out of their data through a goal-driven intelligent storytelling AI agent.
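The abstract does not spell out the framework's stages, but a minimal, hypothetical sketch of a goal-driven pipeline might look as follows: events are extracted from long videos (e.g. by a captioning model), scored against the user's goal, and the most relevant ones are ordered into a narrative. All names here (`Event`, `generate_narrative`, `goal_scorer`) are illustrative assumptions, not the paper's actual design.

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Event:
    start: float          # seconds into the source video
    end: float
    description: str      # e.g. output of a video captioning model
    relevance: float = 0.0

def generate_narrative(events: List[Event],
                       goal_scorer: Callable[[Event], float],
                       max_events: int = 5) -> str:
    """Score events against the user's goal, keep the most relevant,
    restore chronological order, and join their descriptions."""
    for e in events:
        e.relevance = goal_scorer(e)
    selected = sorted(events, key=lambda e: e.relevance, reverse=True)[:max_events]
    selected.sort(key=lambda e: e.start)   # narrative follows event order
    return " ".join(e.description for e in selected)
```

A real system would replace the keyword-style `goal_scorer` with a learned relevance model and the join with a text generator that respects narrative structure.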


References
Journal ArticleDOI

Long short-term memory

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
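The "constant error carousel" can be illustrated with a minimal, scalar LSTM step in plain Python (a sketch for intuition, not the paper's exact formulation): the cell state is updated additively through gates, which is what lets error flow across long time lags.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def lstm_step(x, h_prev, c_prev, w):
    """One LSTM step for scalar input and state.

    w maps each gate name to (input weight, recurrent weight, bias).
    The cell state c is the 'constant error carousel': it is updated
    additively, so gradients can flow across many time steps.
    """
    i = sigmoid(w["i"][0] * x + w["i"][1] * h_prev + w["i"][2])   # input gate
    f = sigmoid(w["f"][0] * x + w["f"][1] * h_prev + w["f"][2])   # forget gate
    o = sigmoid(w["o"][0] * x + w["o"][1] * h_prev + w["o"][2])   # output gate
    g = math.tanh(w["g"][0] * x + w["g"][1] * h_prev + w["g"][2]) # candidate
    c = f * c_prev + i * g        # additive update: constant error flow
    h = o * math.tanh(c)          # gated output
    return h, c
```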
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
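The key arithmetic behind stacking very small filters can be sketched briefly (an illustrative helper, not from the paper): each stride-1 convolution adds (kernel - 1) pixels to the effective receptive field, so two 3x3 layers see a 5x5 region, and three see 7x7, with fewer parameters than one large filter.

```python
def stacked_receptive_field(num_layers, kernel=3):
    """Receptive field of a stack of stride-1 convolutions: each
    layer adds (kernel - 1) pixels, so n stacked 3x3 layers cover
    a (2n + 1) x (2n + 1) region of the input."""
    rf = 1
    for _ in range(num_layers):
        rf += kernel - 1
    return rf
```

For example, two stacked 3x3 convolutions match the 5x5 receptive field of a single 5x5 filter while using 2 x 9 = 18 weights per channel pair instead of 25.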
Proceedings Article

Attention is All you Need

TL;DR: This paper proposes the Transformer, a simple network architecture based solely on attention mechanisms, dispensing with recurrence and convolutions entirely, and achieves state-of-the-art performance on English-to-French translation.
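The core operation, scaled dot-product attention, can be sketched for a single query in plain Python (a minimal illustration of the mechanism, not the full multi-head architecture): scores are dot products scaled by the square root of the dimension, softmax-normalised, and used to average the values.

```python
import math

def softmax(xs):
    m = max(xs)                         # subtract max for numerical stability
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def attention(query, keys, values):
    """Scaled dot-product attention for one query vector.

    query: list of d floats; keys and values: lists of vectors.
    Returns the attention-weighted average of the values and the weights.
    """
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
              for key in keys]
    weights = softmax(scores)
    out = [sum(w * v[j] for w, v in zip(weights, values))
           for j in range(len(values[0]))]
    return out, weights
```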
Posted Content

Deep Residual Learning for Image Recognition

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.
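The residual idea reduces to one line: instead of learning a target mapping directly, a block learns a residual f(x) and outputs x + f(x), so the identity mapping is trivially representable and very deep stacks remain optimizable. A minimal sketch (illustrative, not the paper's convolutional block):

```python
def residual_block(x, f):
    """Apply a residual connection: output x + f(x) elementwise.
    If f learns to output zeros, the block is an identity mapping."""
    return [xi + fi for xi, fi in zip(x, f(x))]
```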
Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

TL;DR: BERT is a new language representation model designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers; the pre-trained model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
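The "one additional output layer" claim can be made concrete with a hypothetical sketch: fine-tuning adds only a linear classification head on top of the encoder's pooled representation (the function and weight names below are illustrative assumptions, not the actual BERT implementation).

```python
def classification_head(pooled, W, b):
    """A single linear output layer over a pre-trained encoder's pooled
    representation: logits[k] = W[k] . pooled + b[k]. During fine-tuning,
    W and b are trained jointly with the pre-trained encoder weights."""
    return [sum(wi * xi for wi, xi in zip(row, pooled)) + bi
            for row, bi in zip(W, b)]
```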