Author

Kalyan Ganesan

Bio: Kalyan Ganesan is an academic researcher from ExxonMobil. The author has contributed to research in the topics of Code-excited linear prediction and Voice activity detection. The author has an h-index of 5 and has co-authored 6 publications receiving 420 citations.

Papers
PatentDOI
TL;DR: A method for encoding a signal that includes a speech component is described; each frame is classified into one of at least two modes based, for example, on pitch stationarity, short-term level gradient, or zero-crossing rate.
Abstract: A method for encoding a signal that includes a speech component is described. First and second linear prediction windows of a frame are analyzed to generate sets of filter coefficients. First and second pitch analysis windows of the frame are analyzed to generate pitch estimates. The frame is classified in one of at least two modes, e.g., voiced, unvoiced, and noise modes, based, for example, on pitch stationarity, short-term level gradient, or zero-crossing rate. Then the frame is encoded using the filter coefficients and pitch estimates in a particular manner depending upon the mode determination for the frame, preferably employing CELP-based encoding algorithms.
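A minimal sketch of the kind of mode decision the abstract describes, using the features it names (pitch stationarity across the two pitch windows, short-term level gradient, zero-crossing rate); the feature definitions and all thresholds are illustrative assumptions, not the patented algorithm.

```python
import numpy as np

def zero_crossing_rate(frame: np.ndarray) -> float:
    """Fraction of adjacent samples whose signs differ."""
    signs = np.signbit(frame).astype(int)
    return float(np.mean(np.abs(np.diff(signs))))

def classify_frame(frame: np.ndarray,
                   pitch_estimates: tuple,
                   prev_level_db: float) -> str:
    """Very rough voiced / unvoiced-or-transient / noise decision."""
    level_db = 10.0 * np.log10(np.mean(frame ** 2) + 1e-12)   # short-term level
    level_gradient = level_db - prev_level_db                 # change vs. previous frame
    zcr = zero_crossing_rate(frame)
    p1, p2 = pitch_estimates                                  # from the two pitch analysis windows
    pitch_stationary = abs(p1 - p2) / max(p1, p2) < 0.15      # illustrative threshold

    if level_db < -55.0:                                      # very low energy -> background noise
        return "noise"
    if pitch_stationary and zcr < 0.15 and abs(level_gradient) < 6.0:
        return "voiced"
    return "unvoiced_or_transient"
```

In a coder like the one described, the mode returned here would then select which CELP encoding path is applied to the frame.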

282 citations

PatentDOI
TL;DR: In this patent, a bit rate Codebook Excited Linear Predictor (CELP) communication system is proposed, which includes a transmitter that organizes a signal containing speech into frames of 40 millisecond duration and classifies each frame as one of three modes: voiced and stationary, unvoiced or transient, and background noise.
Abstract: A bit rate Codebook Excited Linear Predictor (CELP) communication system which includes a transmitter that organizes a signal containing speech into frames of 40 millisecond duration, and classifies each frame as one of three modes: voiced and stationary, unvoiced or transient, and background noise.

57 citations

PatentDOI
TL;DR: In a speech recognition system, the beginning of speech is distinguished from non-speech (a cough or noise) by reverting to a non-speech decision process whenever the likelihood cost of the template (vocabulary) patterns, including silence, is worse than a predetermined threshold established by a Joker Word, which represents a non-vocabulary word score and path in the grammar graph, as discussed by the authors.
Abstract: In a speech recognition system, the beginning of speech versus non-speech (a cough or noise) is distinguished by reverting to a non-speech decision process whenever the likelihood cost of template (vocabulary) patterns, including silence, is worse than a predetermined threshold, established by a Joker Word which represents a non-vocabulary word score and path in the grammar graph.
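A hedged sketch of the rejection idea described above, assuming the recognizer exposes per-template likelihood costs and a Joker Word (non-vocabulary) cost; the function name and the cost values are made up for illustration and are not the patented implementation.

```python
def is_speech(template_costs: dict, joker_cost: float) -> bool:
    """template_costs maps each vocabulary word (and 'silence') to its
    accumulated likelihood cost; lower is better.  If even the best
    vocabulary/silence hypothesis is costlier than the joker-word path,
    revert to the non-speech decision process."""
    best_vocab_cost = min(template_costs.values())
    return best_vocab_cost <= joker_cost

# Illustrative usage: a cough scores badly against every vocabulary
# template, so the joker path wins and the input is rejected as non-speech.
costs = {"yes": 61.0, "no": 67.5, "silence": 72.0}
print(is_speech(costs, joker_cost=45.0))   # False -> treat as non-speech
```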

29 citations

PatentDOI
TL;DR: In this article, a speech recognition method and apparatus employ a speech processing circuitry for repetitively deriving from a speech input, at a frame repetition rate, a plurality of acoustic parameters.
Abstract: A speech recognition method and apparatus employ a speech processing circuitry for repetitively deriving from a speech input, at a frame repetition rate, a plurality of acoustic parameters. The acoustic parameters represent the speech input signal for a frame time. A plurality of template matching and cost processing circuitries are connected to a system bus, along with the speech processing circuitry, for determining, or identifying, the speech units in the input speech, by comparing the acoustic parameters with stored template patterns. The apparatus can be expanded by adding more template matching and cost processing circuitry to the bus thereby increasing the speech recognition capacity of the apparatus. Template pattern generation is advantageously aided by using a "joker" word to specify the time boundaries of utterances spoken in isolation, by finding the beginning and ending of an utterance surrounded by silence.
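To illustrate the data flow the abstract describes (per-frame acoustic parameters compared against stored template patterns by accumulating a cost), here is a deliberately simplified sketch; the log band-energy features and the frame-by-frame distance are assumptions, and real systems of this kind used dynamic time warping or similar alignment rather than this linear comparison.

```python
import numpy as np

def frame_parameters(frame: np.ndarray, n_bands: int = 8) -> np.ndarray:
    """Crude log band-energy features standing in for the patent's
    'plurality of acoustic parameters' derived each frame."""
    spectrum = np.abs(np.fft.rfft(frame)) ** 2
    bands = np.array_split(spectrum, n_bands)
    return np.log(np.array([b.sum() for b in bands]) + 1e-12)

def template_cost(utterance, template) -> float:
    """Accumulate a Euclidean distance over the overlapping frames
    (a stand-in for the patent's cost processing)."""
    n = min(len(utterance), len(template))
    return float(sum(np.linalg.norm(u - t) for u, t in zip(utterance[:n], template[:n])))

def recognize(utterance, templates: dict) -> str:
    """Return the vocabulary word whose stored template pattern is cheapest."""
    return min(templates, key=lambda word: template_cost(utterance, templates[word]))
```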

25 citations

Patent
17 Apr 1995
TL;DR: In this patent, a method of encoding a signal containing speech is employed in a bit rate Codebook Excited Linear Predictor (CELP) communication system, which includes a transmitter that organizes a signal containing speech into frames of 40 millisecond duration and classifies each frame as one of three modes: voiced and stationary, unvoiced or transient, and background noise.
Abstract: A method of encoding a signal containing speech is employed in a bit rate Codebook Excited Linear Predictor (CELP) communication system. The system includes a transmitter that organizes a signal containing speech into frames of 40 millisecond duration, and classifies each frame as one of three modes: voiced and stationary, unvoiced or transient, and background noise.

22 citations


Cited by
Patent
11 Jan 2011
TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Abstract: An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionality powered by external services with which the system can interact.

1,462 citations

Patent
19 Oct 2007
TL;DR: The methods described in this patent relate to devices which, in at least certain embodiments, may include one or more sensors for providing data relating to user activity and at least one processor for causing the device to respond based on the user activity determined, at least in part, through the sensors.
Abstract: The various methods and devices described herein relate to devices which, in at least certain embodiments, may include one or more sensors for providing data relating to user activity and at least one processor for causing the device to respond based on the user activity which was determined, at least in part, through the sensors. The response by the device may include a change of state of the device, and the response may be automatically performed after the user activity is determined.
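A toy sketch of that sensor-to-response idea; the sensor types (proximity, ambient light) and the display-blanking response are assumptions chosen for illustration, not details taken from the patent.

```python
from dataclasses import dataclass

@dataclass
class SensorReadings:
    proximity_near: bool      # e.g. device held against the ear (assumed sensor)
    ambient_light_lux: float  # assumed sensor

class Device:
    def __init__(self):
        self.display_on = True

    def respond_to_activity(self, readings: SensorReadings) -> None:
        """Determine user activity from the sensor data, then respond
        automatically with a change of device state."""
        if readings.proximity_near:
            self.display_on = False          # blank the display during a call
        elif readings.ambient_light_lux > 10.0:
            self.display_on = True

device = Device()
device.respond_to_activity(SensorReadings(proximity_near=True, ambient_light_lux=300.0))
print(device.display_on)   # False -- state changed without explicit user input
```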

844 citations

Patent
28 Sep 2012
TL;DR: In this article, a virtual assistant uses context information to supplement natural language or gestural input from a user, which helps to clarify the user's intent and reduce the number of candidate interpretations of user's input, and reduces the need for the user to provide excessive clarification input.
Abstract: A virtual assistant uses context information to supplement natural language or gestural input from a user. Context helps to clarify the user's intent and to reduce the number of candidate interpretations of the user's input, and reduces the need for the user to provide excessive clarification input. Context can include any available information that is usable by the assistant to supplement explicit user input to constrain an information-processing problem and/or to personalize results. Context can be used to constrain solutions during various phases of processing, including, for example, speech recognition, natural language processing, task flow processing, and dialog generation.
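A simplified sketch of one of the phases mentioned above, using context to rerank candidate interpretations of a spoken input; the context source (a set of recent contacts) and the scoring bonus are illustrative assumptions, not the patented design.

```python
def rerank(candidates, recent_contacts):
    """candidates: (interpretation, acoustic_score) pairs from speech
    recognition; boost any interpretation mentioning a name present in
    the current context, then return the best one."""
    def combined_score(item):
        text, score = item
        bonus = 0.3 if any(name in text for name in recent_contacts) else 0.0
        return score + bonus
    return max(candidates, key=combined_score)

# "Jon" vs. "John" is acoustically ambiguous; context breaks the tie
# without asking the user for clarification.
candidates = [("call Jon", 0.55), ("call John", 0.52)]
print(rerank(candidates, recent_contacts={"John"}))   # ('call John', 0.52)
```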

593 citations

Patent
08 Sep 2006
TL;DR: In this patent, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontology includes at least one active processing element that models a domain.
Abstract: A method and apparatus are provided for building an intelligent automated assistant. Embodiments of the present invention rely on the concept of “active ontologies” (e.g., execution environments constructed in an ontology-like manner) to build and run applications for use by intelligent automated assistants. In one specific embodiment, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontology includes at least one active processing element that models a domain. At least one of the remote services is then registered for use in the domain.
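A rough sketch of the registration step described above, with invented class and method names (ActiveProcessingElement, register_service); the patent's active processing elements are execution environments far richer than this stand-in.

```python
class ActiveProcessingElement:
    """Stand-in for an active processing element that models one domain
    (e.g. 'restaurants') inside an active ontology."""
    def __init__(self, domain: str):
        self.domain = domain
        self.services = {}          # remote services registered for this domain

    def register_service(self, name: str, handler) -> None:
        """Register a remote service for use in the domain."""
        self.services[name] = handler

    def invoke(self, name: str, **kwargs):
        return self.services[name](**kwargs)

restaurants = ActiveProcessingElement("restaurants")
restaurants.register_service("find_table", lambda city, time: f"table in {city} at {time}")
print(restaurants.invoke("find_table", city="Boston", time="7pm"))
```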

389 citations

Patent
05 Jun 2009
TL;DR: In this patent, techniques and systems for implementing contextual voice commands are described: a physical input selecting a displayed data item in a first context is received, a voice input that relates the selected data item to an operation in a second context is received, and the operation is performed on the selected data item in the second context.
Abstract: Among other things, techniques and systems are disclosed for implementing contextual voice commands. On a device, a data item in a first context is displayed. On the device, a physical input selecting the displayed data item in the first context is received. On the device, a voice input that relates the selected data item to an operation in a second context is received. The operation is performed on the selected data item in the second context.
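A minimal illustrative flow for the select-then-speak pattern described above; the command strings and target contexts are hypothetical, not taken from the patent.

```python
def handle_contextual_command(selected_item: str, voice_input: str) -> str:
    """A data item selected by physical input in a first context plus a
    voice input naming an operation; the operation is applied to the
    selected item in a second context."""
    operations = {                      # hypothetical command -> target context
        "email this": "mail",
        "map this": "maps",
        "translate this": "translator",
    }
    target_context = operations.get(voice_input.lower())
    if target_context is None:
        return f"no operation matched {voice_input!r}"
    return f"performed {voice_input!r} on {selected_item!r} in the {target_context} context"

# Usage: an address is selected on a web page (first context), then spoken to.
print(handle_contextual_command("1 Infinite Loop", "map this"))
```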

385 citations