A Low Power, Fully Event-Based Gesture Recognition System

doi:10.1109/CVPR.2017.781

Proceedings ArticleDOI

A Low Power, Fully Event-Based Gesture Recognition System

Arnon Amir, +15 more

- pp 7388-7397

Chats0

TLDR

This work presents the first gesture recognition system implemented end-to-end on event-based hardware, using a TrueNorth neurosynaptic processor to recognize hand gestures in real-time at low power from events streamed live by a Dynamic Vision Sensor (DVS).

Abstract:

We present the first gesture recognition system implemented end-to-end on event-based hardware, using a TrueNorth neurosynaptic processor to recognize hand gestures in real-time at low power from events streamed live by a Dynamic Vision Sensor (DVS). The biologically inspired DVS transmits data only when a pixel detects a change, unlike traditional frame-based cameras which sample every pixel at a fixed frame rate. This sparse, asynchronous data representation lets event-based cameras operate at much lower power than frame-based cameras. However, much of the energy efficiency is lost if, as in previous work, the event stream is interpreted by conventional synchronous processors. Here, for the first time, we process a live DVS event stream using TrueNorth, a natively event-based processor with 1 million spiking neurons. Configured here as a convolutional neural network (CNN), the TrueNorth chip identifies the onset of a gesture with a latency of 105 ms while consuming less than 200 mW. The CNN achieves 96.5% out-of-sample accuracy on a newly collected DVS dataset (DvsGesture) comprising 11 hand gesture categories from 29 subjects under 3 illumination conditions.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Event-based Vision: A Survey

Guillermo Gallego, +10 more

- 10 Jul 2020 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This paper provides a comprehensive overview of the emerging field of event-based vision, with a focus on the applications and the algorithms developed to unlock the outstanding properties of event cameras.

...read moreread less

Journal ArticleDOI

Deep Learning With Spiking Neurons: Opportunities and Challenges.

Michael Pfeiffer, +1 more

- 25 Oct 2018 -

Frontiers in Neuroscience

TL;DR: This review addresses the opportunities that deep spiking networks offer and investigates in detail the challenges associated with training SNNs in a way that makes them competitive with conventional deep learning, but simultaneously allows for efficient mapping to hardware.

...read moreread less

Proceedings Article

SLAYER: Spike Layer Error Reassignment in Time

Sumit Bam Shrestha, +1 more

TL;DR: A new general back Propagation mechanism for learning synaptic weights and axonal delays which overcomes the problem of non-differentiability of the spike function and uses a temporal credit assignment policy for backpropagating error to preceding layers is introduced.

...read moreread less

Proceedings ArticleDOI

Modeling Point Clouds With Self-Attention and Gumbel Subset Sampling

Jiancheng Yang, +6 more

TL;DR: This work develops Point Attention Transformers (PATs), using a parameter-efficient Group Shuffle Attention (GSA) to replace the costly Multi-Head Attention, and proposes an end-to-end learnable and task-agnostic sampling operation, named Gumbel Subset Sampling (GSS), to select a representative subset of input points.

...read moreread less

Journal ArticleDOI

Event-Based Vision: A Survey

- 01 Jan 2022 -

IEEE Transactions on Pattern Analysis an...

TL;DR: Event cameras as discussed by the authors are bio-inspired sensors that differ from conventional frame cameras: instead of capturing images at a fixed rate, they asynchronously measure per-pixel brightness changes, and output a stream of events that encode the time, location and sign of the brightness changes.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, +1 more

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.

...read moreread less

Journal ArticleDOI

3D Convolutional Neural Networks for Human Action Recognition

Shuiwang Ji, +3 more

- 01 Jan 2013 -

IEEE Transactions on Pattern Analysis an...

TL;DR: Wang et al. as mentioned in this paper developed a novel 3D CNN model for action recognition, which extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames.

...read moreread less

Proceedings Article

3D Convolutional Neural Networks for Human Action Recognition

Shuiwang Ji, +3 more

TL;DR: A novel 3D CNN model for action recognition that extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames.

...read moreread less

Proceedings ArticleDOI

MatConvNet: Convolutional Neural Networks for MATLAB

Andrea Vedaldi, +1 more

TL;DR: MatConvNet exposes the building blocks of CNNs as easy-to-use MATLAB functions, providing routines for computing convolutions with filter banks, feature pooling, normalisation, and much more.

...read moreread less