Kazuhiro Nakadai

Researcher at Tokyo Institute of Technology

Publications - 420
Citations - 7150

Kazuhiro Nakadai is an academic researcher from Tokyo Institute of Technology. The author has contributed to research in topics: Acoustic source localization & Microphone array. The author has an h-index of 42, co-authored 396 publications receiving 6481 citations. Previous affiliations of Kazuhiro Nakadai include Honda & Kyoto University.

Papers
Journal Article

Audio-visual speech recognition using deep learning

TL;DR: A connectionist-hidden Markov model (HMM) system for noise-robust audio-visual speech recognition (AVSR) is introduced, and it is demonstrated that a word recognition rate gain of approximately 65% is attained with denoised MFCCs under 10 dB signal-to-noise ratio (SNR) for the audio signal input.
Proceedings Article

Active Audition for Humanoid

TL;DR: The experimental result demonstrates that active audition, through the integration of audition, vision, and motor control, enables sound source tracking in a variety of conditions.
Patent

Automatic Speech Recognition System

TL;DR: An automatic speech recognition system includes a sound source localization module for localizing the sound direction of a speaker based on acoustic signals detected by a plurality of microphones, a sound source separation module for separating the speaker's speech signal from the acoustic signals according to the sound direction, and an acoustic model memory that stores direction-dependent acoustic models adjusted to a plurality of directions at intervals.
Journal Article

Design and Implementation of Robot Audition System 'HARK' — Open Source Software for Listening to Three Simultaneous Speakers

TL;DR: The design and implementation of HARK are presented: a robot audition software system, consisting of sound source localization modules, sound source separation modules, and automatic speech recognition modules for separated speech signals, that works on any robot with any microphone configuration.
Proceedings Article

Real-time sound source localization and separation for robot audition

TL;DR: The active direction-pass filter (ADPF), which separates sounds originating from a specified direction using a pair of microphones, is presented; it improves the signal-to-noise ratio (SNR) of each sound separated from a mixture of two speech signals of equal loudness.