scispace - formally typeset
Open AccessProceedings Article

Learning Latent Personas of Film Characters

TLDR
Two latent variable models for learning character types, or personas, in film, are presented, in which a persona is defined as a set of mixtures over latent lexical classes.
Abstract
We present two latent variable models for learning character types, or personas, in film, in which a persona is defined as a set of mixtures over latent lexical classes. These lexical classes capture the stereotypical actions of which a character is the agent and patient, as well as attributes by which they are described. As the first attempt to solve this problem explicitly, we also present a new dataset for the text-driven analysis of film, along with a benchmark testbed to help drive future work in this area.

read more

Citations
More filters
Proceedings ArticleDOI

A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories

TL;DR: A new framework for evaluating story understanding and script learning: the `Story Cloze Test’, which requires a system to choose the correct ending to a four-sentence story, and a new corpus of 50k five- Sentence commonsense stories, ROCStories, to enable this evaluation.
Journal ArticleDOI

The NarrativeQA Reading Comprehension Challenge

TL;DR: A new dataset and set of tasks in which the reader must answer questions about stories by reading entire books or movie scripts are presented, designed so that successfully answering their questions requires understanding the underlying narrative rather than relying on shallow pattern matching or salience.
Proceedings ArticleDOI

Looking Beyond the Surface: A Challenge Set for Reading Comprehension over Multiple Sentences

TL;DR: The dataset is the first to study multi-sentence inference at scale, with an open-ended set of question types that requires reasoning skills, and finds human solvers to achieve an F1-score of 88.1%.
Proceedings ArticleDOI

A Bayesian Mixed Effects Model of Literary Character

TL;DR: A model that employs multiple effects to account for the influence of extra-linguistic information (such as author) is introduced and it is found that this method leads to improved agreement with the preregistered judgments of a literary scholar, complementing the results of alternative models.
Posted Content

Event Representations for Automated Story Generation with Deep Neural Nets

TL;DR: This article explore the question of event representations that provide a mid-level abstraction between words and sentences in order to retain the semantic information of the original data, while minimizing event sparsity.
References
More filters
Journal ArticleDOI

Latent dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Proceedings Article

Latent Dirichlet Allocation

TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hof-mann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
Journal ArticleDOI

Finding scientific topics

TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.
Book

The Design of Experiments

R. A. Fisher
Book

The Hero with a Thousand Faces

TL;DR: The Power of Myth as discussed by the authors is a seminal work that combines the spiritual and psychological insights of modern psychoanalysis with the archetypes of world mythology and creates a roadmap for navigating the frustrating path of contemporary life.