scispace - formally typeset
B

Bochen Li

Researcher at University of Rochester

Publications -  28
Citations -  647

Bochen Li is an academic researcher from University of Rochester. The author has contributed to research in topics: Music theory & Computer science. The author has an hindex of 10, co-authored 26 publications receiving 355 citations. Previous affiliations of Bochen Li include Chinese Academy of Sciences.

Papers
More filters
Book ChapterDOI

Audio-Visual Event Localization in Unconstrained Videos

TL;DR: An audio-guided visual attention mechanism to explore audio- visual correlations, a dual multimodal residual network (DMRN) to fuse information over the two modalities, and an audio-visual distance learning network to handle the cross-modality localization are developed.
Posted Content

Audio-Visual Event Localization in Unconstrained Videos

TL;DR: In this paper, an audio-visual event is defined as an event that is both visible and audible in a video segment, and a dual multimodal residual network (DMRN) is proposed to fuse information over the two modalities.
Journal ArticleDOI

Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications

TL;DR: A dataset for facilitating audio-visual analysis of music performances that comprises 44 simple multi-instrument classical music pieces assembled from coordinated but separately recorded performances of individual tracks is introduced.
Journal ArticleDOI

Creating A Multi-track Classical Musical Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications

TL;DR: The dataset as mentioned in this paper consists of 44 simple multi-instrument classical music pieces assembled from coordinated but separately recorded performances of individual tracks and provides the musical score in MIDI format, audio recordings of the individual tracks, the audio and video recording of the assembled mixture, and ground truth annotation files including frame-level and note-level transcriptions.
Posted Content

High-resolution Piano Transcription with Pedals by Regressing Onset and Offset Times

TL;DR: This article proposes a high-resolution AMT system trained by regressing precise times of onsets and offsets, and proposes an algorithm to analytically calculate the precise onset and offsets times of note and pedal events.