SoundSense: scalable sound sensing for people-centric applications on mobile phones

doi:10.1145/1555816.1555834

Proceedings ArticleDOI

SoundSense: scalable sound sensing for people-centric applications on mobile phones

- pp 165-178

TLDR

This paper proposes SoundSense, a scalable framework for modeling sound events on mobile phones that represents the first general purpose sound sensing system specifically designed to work on resource limited phones and demonstrates that SoundSense is capable of recognizing meaningful sound events that occur in users' everyday lives.

Abstract:

Top end mobile phones include a number of specialized (e.g., accelerometer, compass, GPS) and general purpose sensors (e.g., microphone, camera) that enable new people-centric sensing applications. Perhaps the most ubiquitous and unexploited sensor on mobile phones is the microphone - a powerful sensor that is capable of making sophisticated inferences about human activity, location, and social events from sound. In this paper, we exploit this untapped sensor not in the context of human communications but as an enabler of new sensing applications. We propose SoundSense, a scalable framework for modeling sound events on mobile phones. SoundSense is implemented on the Apple iPhone and represents the first general purpose sound sensing system specifically designed to work on resource limited phones. The architecture and algorithms are designed for scalability and Soundsense uses a combination of supervised and unsupervised learning techniques to classify both general sound types (e.g., music, voice) and discover novel sound events specific to individual users. The system runs solely on the mobile phone with no back-end interactions. Through implementation and evaluation of two proof of concept people-centric sensing applications, we demostrate that SoundSense is capable of recognizing meaningful sound events that occur in users' everyday lives.

SoundSense: scalable sound sensing for people-centric applications on mobile phones

Citations

Using approximated auditory roughness as a pre-filtering feature for human screaming and affective speech AED

Nonverbal acoustic communication in human-computer interaction

An Empirical Analysis of Perforated Audio Classification

PION: Human mobility-based service provisioning framework for smartphone users

AudioIMU: Enhancing Inertial Sensing-Based Activity Recognition with Acoustic Models

References

Pattern Recognition and Machine Learning

Pattern Recognition and Machine Learning

Pattern Recognition and Machine Learning (Information Science and Statistics)

Fundamentals of speech recognition

On the use of windows for harmonic analysis with the discrete Fourier transform

Related Papers (5)

Sensing meets mobile social networks: the design, implementation and evaluation of the CenceMe application

SurroundSense: mobile phone localization via ambience fingerprinting

A survey of mobile phone sensing

Nericell: rich monitoring of road and traffic conditions using mobile smartphones

PEIR, the personal environmental impact report, as a platform for participatory sensing systems research