Proceedings Article

Image Reconstruction of Tablet Front Camera Recordings in Educational Settings.

TL;DR: The applicability of the setting and processing pipeline to affective state prediction is demonstrated, based on front camera recordings made during math-solving tasks and during exposure to emotional picture stimuli shown on a tablet.
Abstract: Front camera data from tablets used in educational settings offer valuable clues to student behavior, attention, and affective state. Due to the camera’s angle of view, the face of the student is partially occluded and skewed. This hinders the ability of experts to adequately capture the learning process and student states. In this paper, we present a pipeline and techniques for image reconstruction of front camera recordings. Our setting consists of a cheap and unobtrusive mirror construction to improve the visibility of the face. We then process the image and use neural inpainting to reconstruct missing data in the recordings. We demonstrate the applicability of our setting and processing pipeline on affective state prediction based on front camera recordings (i.e., action units, eye gaze, eye blinks, and movement) during math-solving tasks (active) and emotional stimuli from pictures (passive) shown on a tablet. We show that our setup provides comparable performance for affective state prediction to recordings taken with an external and more obtrusive GoPro camera.


Citations
Journal Article
TL;DR: A method of representing audience behavior through facial and body motions from a single video stream, and using these features to predict the rating for feature-length movies is proposed.
Abstract: We propose a method of representing audience behavior through facial and body motions from a single video stream, and use these features to predict the rating for feature-length movies. This is a very challenging problem because: i) the movie viewing environment is dark and contains views of people at different scales and viewpoints; ii) the duration of feature-length movies is long (80-120 mins), so tracking people uninterrupted for this length of time is still an unsolved problem; and iii) expressions and motions of audience members are subtle, short, and sparse, making labeling of activities unreliable. To circumvent these issues, we use an infrared-illuminated test-bed to obtain a visually uniform input. We then utilize motion-history features, which capture the subtle movements of a person within a pre-defined volume, and then form a group representation of the audience by a histogram of pair-wise correlations over a small window of time. Using this group representation, we learn our movie rating classifier from crowd-sourced ratings collected by rottentomatoes.com and show our prediction capability on audiences from 30 movies across 250 subjects (> 50 hrs).

3 citations

References
Journal Article

28,685 citations


"Image Reconstruction of Tablet Fron..." refers methods in this paper

  • ...To extract facial landmarks, eye gaze, and head position from the camera recordings, we rely on OpenFace [4] using static extraction (i.e., per frame without calibrating to a person)....

    [...]

  • ...Reported confidence values by OpenFace are between 0 (not confident) and 1 (fully confident)....

    [...]

  • ...Table 1 presents the average confidence in landmark detection of OpenFace over all frames for the IAPS and math-solving tasks and the full recordings (including also parts not belonging to the IAPS and math tasks)....

    [...]

  • ...Figure 6 shows the facial landmarks detected by OpenFace for three participants from the front camera without inpainting, using neural inpainting, and from the GoPro....

    [...]
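
The per-recording average confidence mentioned in the excerpts above can be illustrated with a minimal sketch. It assumes that OpenFace writes one CSV row per frame with a `confidence` column between 0 and 1, and that header names may be padded with spaces. The function name `mean_confidence` and the three-frame sample are hypothetical and are not taken from the paper.

```python
import csv
import io
from statistics import mean


def mean_confidence(csv_file):
    """Return the mean of the per-frame `confidence` column of an open CSV file."""
    reader = csv.DictReader(csv_file, skipinitialspace=True)
    values = []
    for row in reader:
        # Assumption: header names may carry stray spaces, so normalize the keys.
        cleaned = {key.strip(): value for key, value in row.items()}
        values.append(float(cleaned["confidence"]))
    if not values:
        raise ValueError("no frames found in CSV input")
    return mean(values)


# Hypothetical three-frame excerpt; these numbers are not data from the paper.
sample = io.StringIO(
    "frame, timestamp, confidence, success\n"
    "1, 0.000, 0.98, 1\n"
    "2, 0.033, 0.50, 1\n"
    "3, 0.067, 0.02, 0\n"
)
print(mean_confidence(sample))
```

Averaging over all frames, rather than only over frames flagged as successful, keeps frames with failed detection in the figure, so a heavily occluded recording yields a visibly lower mean. Whether the paper filters frames before averaging is not stated in the excerpts.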

Journal ArticleDOI

12,519 citations


"Image Reconstruction of Tablet Fron..." refers methods in this paper

  • ...Our method assumes that we have access to reports of affective states of users based on the circumplex model of affect [48]....

    [...]

Book
01 Jan 1990

12,284 citations

Journal ArticleDOI
TL;DR: Reports of affective experience obtained using SAM are compared to the Semantic Differential scale devised by Mehrabian and Russell (An approach to environmental psychology, 1974), which requires 18 different ratings.

7,472 citations


"Image Reconstruction of Tablet Fron..." refers methods in this paper

  • ...After each image and math task, participants were asked to fill in the self-assessment manikin (SAM) [7] to judge their current valence and arousal level on a 9-point Likert scale....

    [...]

Book
01 Jan 1974
TL;DR: In this paper, the authors proposed that environmental stimuli are linked to behavioral responses by the primary emotional responses of arousal, pleasure, and dominance, and used information rate to compare the effects of different environments, each with stimulation in many sense modalities.
Abstract: Environmental psychology, though a fast-growing field, is one of the most difficult to fit into the confines of scientific inquiry. Measuring such subjective data as reactions to color, heat, light, and sound would seem to be an almost impossible task; indeed, until now there has been no theory around which the research in this field could be organized. This volume represents a preliminary effort to identify the relevant variables involved and fit them into a systematic framework. Furthermore, it presents extensive sets of measures for investigating the theory and implementing it in a variety of everyday environments. Basically, the framework outlined here proposes that environmental stimuli are linked to behavioral responses by the primary emotional responses of arousal, pleasure, and dominance. By considering the impact of the environment on these basic emotional responses, the effects of diverse stimulus components within or across sense modalities can be readily compared. An additional concept, information rate, is used to compare the effects of different environments, each with stimulation in many sense modalities. In the final chapters the authors present a series of hypotheses which relate the emotional response variables to a diversity of behaviors such as physical approach, performance, affiliation, and verbally or nonverbally expressed preference.

5,419 citations


"Image Reconstruction of Tablet Fron..." refers background in this paper

  • ..., anger, disgust, fear, happiness, sadness, and surprise) or described by the valence and arousal dimensions [40]....

    [...]