Learning Latent Personas of Film Characters

Open AccessProceedings Article

Learning Latent Personas of Film Characters

- pp 352-361

TLDR

Two latent variable models for learning character types, or personas, in film, are presented, in which a persona is defined as a set of mixtures over latent lexical classes.

Abstract:

We present two latent variable models for learning character types, or personas, in film, in which a persona is defined as a set of mixtures over latent lexical classes. These lexical classes capture the stereotypical actions of which a character is the agent and patient, as well as attributes by which they are described. As the first attempt to solve this problem explicitly, we also present a new dataset for the text-driven analysis of film, along with a benchmark testbed to help drive future work in this area.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories

Nasrin Mostafazadeh, +7 more

TL;DR: A new framework for evaluating story understanding and script learning: the `Story Cloze Test’, which requires a system to choose the correct ending to a four-sentence story, and a new corpus of 50k five- Sentence commonsense stories, ROCStories, to enable this evaluation.

...read moreread less

Journal ArticleDOI

The NarrativeQA Reading Comprehension Challenge

Tomáš Kočiský, +6 more

- 28 May 2018 -

Transactions of the Association for Comp...

TL;DR: A new dataset and set of tasks in which the reader must answer questions about stories by reading entire books or movie scripts are presented, designed so that successfully answering their questions requires understanding the underlying narrative rather than relying on shallow pattern matching or salience.

...read moreread less

Proceedings ArticleDOI

Looking Beyond the Surface: A Challenge Set for Reading Comprehension over Multiple Sentences

Daniel Khashabi, +4 more

TL;DR: The dataset is the first to study multi-sentence inference at scale, with an open-ended set of question types that requires reasoning skills, and finds human solvers to achieve an F1-score of 88.1%.

...read moreread less

Proceedings ArticleDOI

A Bayesian Mixed Effects Model of Literary Character

David Bamman, +2 more

TL;DR: A model that employs multiple effects to account for the influence of extra-linguistic information (such as author) is introduced and it is found that this method leads to improved agreement with the preregistered judgments of a literary scholar, complementing the results of alternative models.

...read moreread less

Posted Content

Event Representations for Automated Story Generation with Deep Neural Nets

Lara J. Martin, +6 more

- 05 Jun 2017 -

arXiv: Computation and Language

TL;DR: This article explore the question of event representations that provide a mid-level abstraction between words and sentences in order to retain the semantic information of the original data, while minimizing event sparsity.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Latent dirichlet allocation

David M. Blei, +2 more

- 01 Mar 2003 -

Journal of Machine Learning Research

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.

...read moreread less

Proceedings Article

Latent Dirichlet Allocation

David M. Blei, +2 more

TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hof-mann's aspect model, also known as probabilistic latent semantic indexing (pLSI).

...read moreread less

Journal ArticleDOI

Finding scientific topics

Thomas L. Griffiths, +1 more

- 06 Apr 2004 -

Proceedings of the National Academy of S...

TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.

...read moreread less

Book

The Design of Experiments

R. A. Fisher

Book

The Hero with a Thousand Faces

Joseph Campbell

TL;DR: The Power of Myth as discussed by the authors is a seminal work that combines the spiritual and psychological insights of modern psychoanalysis with the archetypes of world mythology and creates a roadmap for navigating the frustrating path of contemporary life.

...read moreread less

Collapse

Learning Latent Personas of Film Characters

Citations

A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories

The NarrativeQA Reading Comprehension Challenge

Looking Beyond the Surface: A Challenge Set for Reading Comprehension over Multiple Sentences

A Bayesian Mixed Effects Model of Literary Character

Event Representations for Automated Story Generation with Deep Neural Nets

References

Latent dirichlet allocation

Latent Dirichlet Allocation

Finding scientific topics

The Design of Experiments

The Hero with a Thousand Faces

Related Papers (5)

Unsupervised Learning of Narrative Schemas and their Participants

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Latent dirichlet allocation

Glove: Global Vectors for Word Representation

Unsupervised Learning of Narrative Event Chains