Proceedings ArticleDOI
A Java3D Talking Head for a Chatbot
Salvatore Gaglio,Giovanni Pilato,Roberto Pirrone,Orazio Gambino,A. Augello,A. Caronia +5 more
- pp 709-714
TLDR
This paper's "talking head" explores the naturalness of the facial animation and provides a real-time interactive interface to the user and delegating the Chatbot, the natural language processing and the digital signal processing services to the server while the client is involved in animation, synchronization.Abstract:
Facial animation is referred to all those systems performing the speech synchronization with an animated face model. This kind of systems are called "talking head" or "talking face". In this paper a Talking Head oriented to the creation of a Chatbot is presented. It requires an input query and an answer is generated in form of text. The answer is transduced into a facial animation using a 3D face model whose lips movements are synchronized with the sound produced by a speech synthesis module. Our "talking head" explores the naturalness of the facial animation and provides a real-time interactive interface to the user. The Web infrastructure has been realized using the client-server model delegating the Chatbot, the natural language processing and the digital signal processing services to the server, while the client is involved in animation, synchronization; in this way, the server can handle multiple requests from clients.read more
Citations
More filters
Proceedings ArticleDOI
Development of Vietnamese Voice Chatbot with Emotion Expression
Thanh Vo Nhu,Hideyuki Sawada +1 more
TL;DR: A Vietnamese voice Chatbot is introduced, in which users can use Vietnamese to communicate with Chatbot through voice, which can complete tasks from user’s requests and introduces the emotion recognition from user voice for the chatbot.
References
More filters
Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST
John S. Garofolo,Lori Lamel,W M. Fisher,Jonathan G. Fiscus,David S. Pallett,Nancy L. Dahlgren +5 more
Dataset
TIMIT Acoustic-Phonetic Continuous Speech Corpus
John S. Garofolo,Lori Lamel,William M. Fisher,Jonathan C. Fiscus,David S. Pallett,Nancy L. Dahlgren,Victor W. Zue +6 more
TL;DR: The TIMIT corpus as mentioned in this paper contains broadband recordings of 630 speakers of eight major dialects of American English, each reading ten phonetically rich sentences, including time-aligned orthographic, phonetic and word transcriptions as well as a 16-bit, 16kHz speech waveform file for each utterance.
Book
Computer Facial Animation, Second Edition
Frederic I. Parke,Keith Waters +1 more
TL;DR: This book integrates all aspects of computer-generated facial animation including computer-based visualization techniques, three-dimensional character animation, anatomical, and psychological considerations and discusses them in the framework of promising applications in entertainment, human-computer interface, research, and education.
Journal ArticleDOI
Audio-visual integration in multimodal communication
Tsuhan Chen,R.R. Rao +1 more
TL;DR: This work reviews recent research that examines audio-visual integration in multimodal communication, including bimodality in human speech, human and automated lip reading, facial animation, lip synchronization, joint audio-video coding, andbimodal speaker verification.
Journal ArticleDOI
Audiovisual speech processing
TL;DR: Audiovisual speech processing results have shown that, with lip reading, it is possible to enhance the reliability of audio speech recognition, which may result in a computer that can truly understand the user via hand-free natural spoken language even in a very noisy environments.