scispace - formally typeset
Open AccessJournal ArticleDOI

TinyRadarNN: Combining Spatial and Temporal Convolutional Neural Networks for Embedded Gesture Recognition With Short Range Radars

Reads0
Chats0
TLDR
A low-power high-accuracy embedded hand-gesture recognition algorithm targeting battery-operated wearable devices using low- power short-range RADAR sensors is proposed, demonstrating that real-time prediction is feasible with only 21 mW of power consumption for the full TCN sequence prediction network, while a system-level power consumption of less than 120 mW is achieved.
Abstract
This work proposes a low-power high-accuracy embedded hand-gesture recognition algorithm targeting battery-operated wearable devices using low-power short-range RADAR sensors. A 2-D convolutional neural network (CNN) using range-frequency Doppler features is combined with a temporal convolutional neural network (TCN) for time sequence prediction. The final algorithm has a model size of only 46 thousand parameters, yielding a memory footprint of only 92 KB. Two data sets containing 11 challenging hand gestures performed by 26 different people have been recorded containing a total of 20’210 gesture instances. On the 11 hand gesture data set, accuracies of 86.6% (26 users) and 92.4% (single user) have been achieved, which are comparable to the state of the art, which achieves 87% (10 users) and 94% (single user), while using a TCN-based network that is $7500\times $ smaller than the state of the art. Furthermore, the gesture recognition classifier has been implemented on a parallel ultralow power processor, demonstrating that real-time prediction is feasible with only 21 mW of power consumption for the full TCN sequence prediction network, while a system-level power consumption of less than 120 mW is achieved. We provide open-source access to example code and all data collected and used in this work on tinyradar.ethz.ch.

read more

Citations
More filters
Journal ArticleDOI

μBrain: An Event-Driven and Fully Synthesizable Architecture for Spiking Neural Networks.

TL;DR: In this article, the authors present μBrain, the first digital yet fully event-driven without clock architecture, with co-located memory and processing capability that exploits event-based processing to reduce an always-on system's overall energy consumption.
Proceedings ArticleDOI

Radar Trajectory-based Air-Writing Recognition using Temporal Convolutional Network

TL;DR: In this paper, a 1D temporal convolutional network (TCN) was proposed for simultaneous feature extraction and temporal modeling to recognize the drawn character from the local target trajectory.
Proceedings ArticleDOI

A Microcontroller is All You Need: Enabling Transformer Execution on Low-Power IoT Endnodes

TL;DR: TinyTransformer as mentioned in this paper proposes a new set of execution kernels tuned for efficient execution on MCU-class RISC-V and ARM Cortex-M cores, which focus on minimizing memory movements while maximizing data reuse in the Attention layers.
Proceedings ArticleDOI

SmartHand: Towards Embedded Smart Hands for Prosthetic and Robotic Applications

TL;DR: In this article, a smart embedded system, called SmartHand, was designed to enable real-time processing of high-resolution tactile information from a hand-shaped multi-sensor array for prosthetic and robotic applications.
Journal ArticleDOI

Spiking Neural Network-Based Radar Gesture Recognition System Using Raw ADC Data

TL;DR: This work presents an embedded gesture recognition system using a 60 GHz frequency modulated continuous wave radar using spiking neural networks (SNNs) applied directly to raw analog-to-digital converter (ADC) data.
References
More filters
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Journal ArticleDOI

Long short-term memory

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
Journal ArticleDOI

An introduction to hidden Markov models

TL;DR: The purpose of this tutorial paper is to give an introduction to the theory of Markov models, and to illustrate how they have been applied to problems in speech recognition.
Journal ArticleDOI

3D Convolutional Neural Networks for Human Action Recognition

TL;DR: Wang et al. as mentioned in this paper developed a novel 3D CNN model for action recognition, which extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames.
Proceedings Article

3D Convolutional Neural Networks for Human Action Recognition

TL;DR: A novel 3D CNN model for action recognition that extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames.
Related Papers (5)