scispace - formally typeset
M

Melvyn J. Hunt

Researcher at Apple Inc.

Publications -  13
Citations -  392

Melvyn J. Hunt is an academic researcher from Apple Inc.. The author has contributed to research in topics: Frame (networking) & Speaker recognition. The author has an hindex of 7, co-authored 13 publications receiving 371 citations. Previous affiliations of Melvyn J. Hunt include Samsung.

Papers
More filters
Patent

Digital assistant providing whispered speech

TL;DR: In this article, a system and processes for detecting and/or providing a whispered speech response are provided, where speech is received from a user, and based on the speech input, determined that a whispering speech response is to be provided.
Patent

Speech detection method, medium, and system

TL;DR: In this paper, an energy change of each frame making up signals including speech and non-speech signals is detected and a speech segment corresponding to frames that include only speech signals from among the frames based on the detected energy change.
Proceedings ArticleDOI

Siri On-Device Deep Learning-Guided Unit Selection Text-to-Speech System.

TL;DR: Apple’s hybrid unit selection speech synthesis system, which provides the voices for Siri with the requirement of naturalness, personality and expressivity, is described and various techniques that enable on-device capability such as preselection optimization, caching for low latency, and unit pruning for low footprint are described.
Patent

Speech recognition involving a mobile device

TL;DR: In this article, a system and method of speech recognition involving a mobile device is described, where a set of phonetic symbols are converted to speech input and data relating to the symbols is transferred from the mobile device over a communications network to a remote processing device, where it is used to identify at least one matching data item.
Proceedings ArticleDOI

Topic and speaker identification via large vocabulary continuous speech recognition

TL;DR: A novel approach to the problems of topic and speaker identification that makes use of a large vocabulary continuous speech recognizer and describes the symmetric way in which it has implemented their solution.