scispace - formally typeset
P

Peter A. Beerel

Researcher at University of Southern California

Publications -  236
Citations -  3784

Peter A. Beerel is an academic researcher from University of Southern California. The author has contributed to research in topics: Asynchronous communication & Computer science. The author has an hindex of 30, co-authored 208 publications receiving 3403 citations. Previous affiliations of Peter A. Beerel include Intel & University of California, San Diego.

Papers
More filters
Proceedings ArticleDOI

qBSA: Logic Design of a 32-bit Block-Skewed RSFQ Arithmetic Logic Unit

TL;DR: This paper proposes to increase the throughput of SFQ pipelines by redesigning the datapath to accept and operate on least-significant bits (LSBs) clock cycles earlier than more significant bits, and develops a block-skewed MIPS-compatible 32-bit ALU.
Journal ArticleDOI

Voltage-pulse driven harmonic resonant rail drivers for low-power applications

TL;DR: The proposed circuit topology can be used to generate any desired periodic 50% duty-cycle waveform by superimposing multiple harmonics of the desired waveform, however, this paper focuses on the generation of trapezoidal-wave clock signals.
Proceedings ArticleDOI

P2M-DeTrack: Processing-in-Pixel-in-Memory for Energy-efficient and Real-Time Multi-Object Detection and Tracking

TL;DR: P 2 M-DeTrack is a algorithm-hardware co-design framework based on a custom faster R-CNN-based model that is distributed partly inside the pixel array (front-end) and partly in a separate FPGA/ASIC (back-end), which reduces the data bandwidth between sensor and back-end by up to 24 × .
Journal ArticleDOI

Pre-Defined Sparse Neural Networks with Hardware Acceleration

TL;DR: The results show that storage and computational complexity can be reduced by factors greater than 5X without significant performance loss.
Proceedings ArticleDOI

An asynchronous pipeline comparisons with application to DCT matrix-vector multiplication

TL;DR: Comprehensive energy-throughput comparisons of two well-known asynchronous design styles applied to a matrix-vector multiplication core of the discrete cosine transforms (DCT) show that all QDI designs achieve higher throughput at the cost of larger area and energy and in particular have 22% better E/spl tau//sup 2/ metric.