Proceedings Article

A Java3D Talking Head for a Chatbot

TLDR
This paper presents a "talking head" for a Chatbot that explores the naturalness of facial animation and provides a real-time interactive interface to the user. The Chatbot, natural language processing, and digital signal processing services are delegated to the server, while the client handles animation and synchronization.
Abstract
Facial animation refers to systems that synchronize speech with an animated face model. Such systems are called "talking heads" or "talking faces". This paper presents a Talking Head designed for building a Chatbot. It takes an input query and generates an answer as text. The answer is converted into a facial animation using a 3D face model whose lip movements are synchronized with the sound produced by a speech synthesis module. Our "talking head" explores the naturalness of the facial animation and provides a real-time interactive interface to the user. The Web infrastructure follows the client-server model: the Chatbot, natural language processing, and digital signal processing services are delegated to the server, while the client handles animation and synchronization. In this way, the server can handle multiple requests from clients.
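
The client-side synchronization step described in the abstract can be illustrated with a minimal Java sketch. It is not the paper's implementation. It assumes, hypothetically, that the server returns the synthesized answer as a list of phonemes with start times and durations, and that the client picks a mouth shape (viseme) for the current audio playback time. The names LipSyncSketch, TimedPhoneme, visemeFor, and visemeAt, as well as the tiny phoneme-to-viseme table, are invented for illustration.

```java
import java.util.List;
import java.util.Map;

class LipSyncSketch {

    /** One phoneme with its timing in milliseconds (assumed server output format). */
    static final class TimedPhoneme {
        final String phoneme;
        final int startMs;
        final int durationMs;

        TimedPhoneme(String phoneme, int startMs, int durationMs) {
            this.phoneme = phoneme;
            this.startMs = startMs;
            this.durationMs = durationMs;
        }
    }

    /** Illustrative, heavily simplified phoneme-to-viseme table (an assumption, not the paper's). */
    static final Map<String, String> VISEMES = Map.of(
        "p", "closed", "b", "closed", "m", "closed",
        "a", "open", "o", "rounded", "u", "rounded",
        "f", "lip-teeth", "v", "lip-teeth");

    /** Maps a phoneme to a viseme, falling back to a neutral mouth shape. */
    static String visemeFor(String phoneme) {
        return VISEMES.getOrDefault(phoneme, "neutral");
    }

    /** Returns the viseme the face model should show at the given audio playback time. */
    static String visemeAt(List<TimedPhoneme> track, int timeMs) {
        for (TimedPhoneme p : track) {
            if (timeMs >= p.startMs && timeMs < p.startMs + p.durationMs) {
                return visemeFor(p.phoneme);
            }
        }
        return "neutral"; // silence before, between, or after phonemes
    }

    public static void main(String[] args) {
        // Hypothetical timing track for a short utterance.
        List<TimedPhoneme> track = List.of(
            new TimedPhoneme("m", 0, 80),
            new TimedPhoneme("a", 80, 120),
            new TimedPhoneme("p", 200, 60));

        // Sample the track at a fixed animation step, as a render loop might.
        for (int t = 0; t <= 280; t += 40) {
            System.out.println(t + " ms -> " + visemeAt(track, t));
        }
    }
}
```

In a real client the returned viseme would drive the 3D face model's mouth geometry on each frame; here it is only printed, so the sketch stays independent of any rendering library.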

Citations
Proceedings Article

Development of Vietnamese Voice Chatbot with Emotion Expression

TL;DR: This paper introduces a Vietnamese voice Chatbot that users can talk to in Vietnamese, that completes tasks from users' requests, and that recognizes emotion from the user's voice.
References
Dataset

TIMIT Acoustic-Phonetic Continuous Speech Corpus

TL;DR: The TIMIT corpus contains broadband recordings of 630 speakers of eight major dialects of American English, each reading ten phonetically rich sentences. It includes time-aligned orthographic, phonetic, and word transcriptions as well as a 16-bit, 16 kHz speech waveform file for each utterance.
Book

Computer Facial Animation, Second Edition

TL;DR: This book integrates all aspects of computer-generated facial animation, including computer-based visualization techniques, three-dimensional character animation, and anatomical and psychological considerations, and discusses them in the framework of promising applications in entertainment, human-computer interface, research, and education.
Journal Article

Audio-visual integration in multimodal communication

TL;DR: This work reviews recent research that examines audio-visual integration in multimodal communication, including bimodality in human speech, human and automated lip reading, facial animation, lip synchronization, joint audio-video coding, and bimodal speaker verification.
Journal Article

Audiovisual speech processing

TL;DR: Audiovisual speech processing results have shown that lip reading can enhance the reliability of audio speech recognition, which may lead to a computer that can truly understand the user via hands-free natural spoken language, even in very noisy environments.