M
Melvyn J. Hunt
Researcher at Apple Inc.
Publications - 13
Citations - 392
Melvyn J. Hunt is an academic researcher from Apple Inc.. The author has contributed to research in topics: Frame (networking) & Speaker recognition. The author has an hindex of 7, co-authored 13 publications receiving 371 citations. Previous affiliations of Melvyn J. Hunt include Samsung.
Papers
More filters
Patent
Digital assistant providing whispered speech
TL;DR: In this article, a system and processes for detecting and/or providing a whispered speech response are provided, where speech is received from a user, and based on the speech input, determined that a whispering speech response is to be provided.
Patent
Speech detection method, medium, and system
TL;DR: In this paper, an energy change of each frame making up signals including speech and non-speech signals is detected and a speech segment corresponding to frames that include only speech signals from among the frames based on the detected energy change.
Proceedings ArticleDOI
Siri On-Device Deep Learning-Guided Unit Selection Text-to-Speech System.
Tim Capes,Paul Coles,Alistair Conkie,Ladan Golipour,Abie Hadjitarkhani,Qiong Hu,Nancy Huddleston,Melvyn J. Hunt,Jiangchuan Li,Matthias Neeracher,Kishore Prahallad,Tuomo Raitio,Ramya Rasipuram,Greg Townsend,Becci Williamson,David A. Winarsky,Zhizheng Wu,Hepeng Zhang +17 more
TL;DR: Apple’s hybrid unit selection speech synthesis system, which provides the voices for Siri with the requirement of naturalness, personality and expressivity, is described and various techniques that enable on-device capability such as preselection optimization, caching for low latency, and unit pruning for low footprint are described.
Patent
Speech recognition involving a mobile device
Melvyn J. Hunt,John Bridle +1 more
TL;DR: In this article, a system and method of speech recognition involving a mobile device is described, where a set of phonetic symbols are converted to speech input and data relating to the symbols is transferred from the mobile device over a communications network to a remote processing device, where it is used to identify at least one matching data item.
Proceedings ArticleDOI
Topic and speaker identification via large vocabulary continuous speech recognition
Barbara Peskin,Larry Gillick,Yoshiko Ito,Stephen Lowe,Robert Roth,Francesco Scattone,James K. Baker,Janet M. Baker,John Bridle,Melvyn J. Hunt,Jeremy Orloff +10 more
TL;DR: A novel approach to the problems of topic and speaker identification that makes use of a large vocabulary continuous speech recognizer and describes the symmetric way in which it has implemented their solution.