Journal ArticleDOI

A 201.4 GOPS 496 mW Real-Time Multi-Object Recognition Processor With Bio-Inspired Neural Perception Engine

TL;DR
In the proposed hardware architecture, the three recognition tasks (visual perception, descriptor generation, and object decision) are directly mapped to the neural perception engine, 16 SIMD processors comprising 128 processing elements, and the decision processor, respectively, and executed in a pipeline to maximize object-recognition throughput.
Abstract
A 201.4 GOPS real-time multi-object recognition processor is presented with a three-stage pipelined architecture. A visual-perception-based multi-object recognition algorithm is applied to direct multiple attentions to multiple objects in the input image. For human-like multi-object perception, a neural perception engine is proposed with biologically inspired neural networks and fuzzy logic circuits. In the proposed hardware architecture, the three recognition tasks (visual perception, descriptor generation, and object decision) are directly mapped to the neural perception engine, 16 SIMD processors comprising 128 processing elements, and the decision processor, respectively, and executed in a pipeline to maximize object-recognition throughput. For efficient task pipelining, the proposed task/power manager balances the execution times of the three stages based on intelligent workload estimations. In addition, a 118.4 GB/s multi-casting network-on-chip is proposed as the communication architecture, interconnecting 21 IP blocks in total. For low-power object recognition, workload-aware dynamic power management is performed at the chip level. The 49 mm² chip is fabricated in a 0.13 µm 8-metal CMOS process and contains 3.7 M gates and 396 KB of on-chip SRAM. It achieves 60 frame/s multi-object recognition of up to 10 different objects for VGA (640 × 480) video input while dissipating 496 mW at 1.2 V. The resulting energy efficiency of 8.2 mJ/frame is 3.2 times better than that of the state-of-the-art recognition processor.
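The abstract describes a three-stage pipeline (visual perception, descriptor generation, object decision) whose stage execution times are balanced by a task/power manager using workload estimates. The Python sketch below illustrates that idea at a purely behavioral level; the names (Frame, estimate_workload, the per-stage functions) and the simple workload model are illustrative assumptions, not the paper's implementation.

# Behavioral sketch of a three-stage recognition pipeline whose per-stage
# time budget is scaled by an estimated workload (number of attended
# objects per frame). Names and the workload model are illustrative.
from dataclasses import dataclass, field
from typing import List


@dataclass
class Frame:
    frame_id: int
    attended_objects: int                     # regions selected by visual perception
    descriptors: List[str] = field(default_factory=list)
    labels: List[str] = field(default_factory=list)


def visual_perception(frame: Frame) -> Frame:
    # Stand-in for the neural perception engine: select regions of interest.
    frame.attended_objects = min(frame.attended_objects, 10)   # chip handles up to 10 objects
    return frame


def descriptor_generation(frame: Frame) -> Frame:
    # Stand-in for the 16 SIMD processors: one descriptor set per attended object.
    frame.descriptors = [f"desc_{frame.frame_id}_{i}" for i in range(frame.attended_objects)]
    return frame


def object_decision(frame: Frame) -> Frame:
    # Stand-in for the decision processor: map descriptors to object labels.
    frame.labels = [d.replace("desc", "obj") for d in frame.descriptors]
    return frame


def estimate_workload(frame: Frame) -> float:
    # Crude workload estimate: proportional to the number of attended objects.
    return frame.attended_objects / 10.0


def run_pipeline(frames: List[Frame], frame_budget_ms: float = 16.7) -> None:
    # In hardware the three stages run concurrently on consecutive frames;
    # the sequential loop here only makes the stage ordering and the
    # workload-driven budgeting (a proxy for dynamic power management) explicit.
    for frame in frames:
        stage_budget = frame_budget_ms / 3 * max(estimate_workload(frame), 0.1)
        for stage in (visual_perception, descriptor_generation, object_decision):
            frame = stage(frame)
        print(f"frame {frame.frame_id}: {len(frame.labels)} objects, "
              f"budget/stage ~{stage_budget:.2f} ms")


if __name__ == "__main__":
    run_pipeline([Frame(0, 3), Frame(1, 12), Frame(2, 1)])

In the chip the three stages overlap on consecutive frames, so balancing their execution times is what keeps the pipeline fully utilized; the sequential loop above only exposes that budgeting logic.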



Citations
Proceedings ArticleDOI

DianNao: a small-footprint high-throughput accelerator for ubiquitous machine-learning

TL;DR: This study designs an accelerator for large-scale CNNs and DNNs, with a special emphasis on the impact of memory on accelerator design, performance and energy, and shows that it is possible to design an accelerator with a high throughput, capable of performing 452 GOP/s in a small footprint.
Journal ArticleDOI

PRIME: a novel processing-in-memory architecture for neural network computation in ReRAM-based main memory

TL;DR: This work proposes a novel PIM architecture, called PRIME, to accelerate NN applications in ReRAM based main memory, and distinguishes itself from prior work on NN acceleration, with significant performance improvement and energy saving.
Proceedings ArticleDOI

Project Adam: building an efficient and scalable deep learning training system

TL;DR: This paper presents the design and implementation of Adam, a distributed system built from commodity server machines for training large deep neural network models; it exhibits world-class performance, scaling, and task accuracy on visual recognition tasks and shows that task accuracy improves with larger models.
Posted Content

A Survey of Neuromorphic Computing and Neural Networks in Hardware.

TL;DR: An exhaustive review of the research conducted in neuromorphic computing since the inception of the term is provided to motivate further work by illuminating gaps in the field where new research is needed.
Proceedings ArticleDOI

NeuFlow: A runtime reconfigurable dataflow processor for vision

TL;DR: A scalable dataflow hardware architecture optimized for the computation of general-purpose vision algorithms (neuFlow) and a dataflow compiler (luaFlow) that transforms high-level flow-graph representations of these algorithms into machine code for neuFlow are presented.
References
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images that can then be used to reliably match objects in differing images.
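Both entries above summarize Lowe's SIFT, which extracts scale-invariant keypoints and descriptors that can be matched across differing views of an object. A minimal matching sketch using OpenCV's SIFT implementation follows; it assumes opencv-python 4.4 or later (where SIFT_create is in the main module), and the two image paths are placeholders.

# Sketch: match SIFT keypoints between two views of an object.
# Assumes opencv-python >= 4.4; the image paths are placeholders.
import cv2

img1 = cv2.imread("object_view1.png", cv2.IMREAD_GRAYSCALE)
img2 = cv2.imread("object_view2.png", cv2.IMREAD_GRAYSCALE)

sift = cv2.SIFT_create()
kp1, des1 = sift.detectAndCompute(img1, None)
kp2, des2 = sift.detectAndCompute(img2, None)

# Keep only matches that pass Lowe's ratio test (0.75 is a common setting).
matcher = cv2.BFMatcher(cv2.NORM_L2)
matches = matcher.knnMatch(des1, des2, k=2)
good = [m for m, n in matches if m.distance < 0.75 * n.distance]
print(f"{len(good)} putative correspondences after the ratio test")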
Journal ArticleDOI

A model of saliency-based visual attention for rapid scene analysis

TL;DR: In this article, a visual attention system inspired by the behavior and the neuronal architecture of the early primate visual system is presented, where multiscale image features are combined into a single topographical saliency map.
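The saliency model summarized above combines multiscale feature maps (intensity, color, orientation) into a single topographical saliency map through center-surround differencing and normalized summation. The NumPy/SciPy sketch below reduces this to an intensity-only illustration of that combination step; the chosen scales and the min-max normalization are simplified assumptions, not the published model.

# Reduced sketch of the saliency-map combination step: center-surround
# differences of an intensity image at two scales are normalized and
# summed into a single map. Intensity-only; not the full Itti-Koch model.
import numpy as np
from scipy.ndimage import gaussian_filter


def center_surround(img: np.ndarray, center_sigma: float, surround_sigma: float) -> np.ndarray:
    # A difference of Gaussians approximates a center-surround receptive field.
    return np.abs(gaussian_filter(img, center_sigma) - gaussian_filter(img, surround_sigma))


def normalize(feature_map: np.ndarray) -> np.ndarray:
    # Rescale each feature map to [0, 1] before summation so no scale dominates.
    span = feature_map.max() - feature_map.min()
    return (feature_map - feature_map.min()) / span if span > 0 else feature_map


def saliency(intensity: np.ndarray) -> np.ndarray:
    maps = [center_surround(intensity, c, s) for c, s in ((1, 4), (2, 8))]
    return sum(normalize(m) for m in maps)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    img = rng.random((64, 64))
    img[20:28, 20:28] += 2.0          # a bright patch should pop out
    sal = saliency(img)
    y, x = np.unravel_index(np.argmax(sal), sal.shape)
    print(f"most salient location: ({y}, {x})")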
Book

The Organization of Behavior

D. O. Hebb

A model of saliency-based visual attention for rapid scene analysis

Laurent Itti
TL;DR: A visual attention system, inspired by the behavior and the neuronal architecture of the early primate visual system, is presented, which breaks down the complex problem of scene understanding by rapidly selecting conspicuous locations to be analyzed in detail.