scispace - formally typeset
S

Sunil Shukla

Researcher at IBM

Publications -  43
Citations -  913

Sunil Shukla is an academic researcher from IBM. The author has contributed to research in topics: Field-programmable gate array & Reconfigurable computing. The author has an hindex of 12, co-authored 41 publications receiving 671 citations. Previous affiliations of Sunil Shukla include University of Queensland & Karlsruhe Institute of Technology.

Papers
More filters
Journal ArticleDOI

A Scalable Multi-TeraOPS Core for AI Training and Inference

TL;DR: This letter presents a multi-TOPS AI accelerator core for deep learning training and inference that achieves >90% sustained utilization across the range of neural network topologies by employing a dataflow architecture to provide high throughput and an on-chip scratchpad hierarchy to meet the bandwidth demands of the compute units.
Proceedings ArticleDOI

QUKU: A FPGA Based Flexible Coarse Grain Architecture Design Paradigm using Process Networks

TL;DR: A modification of Kahn process network is used to solve the problem of finding an optimum architectural template for coarse grain array on per application basis by applying the model at architectural level in QUKU.
Journal ArticleDOI

QUKU: A dual-layer reconfigurable architecture

TL;DR: The experimental results demonstrate that a dual layered reconfigurable architecture provides significant potential benefits in terms of flexibility, area and processing efficiency over existing reconfigured computing architectures for DSP.
Patent

Tightly coupled processor arrays using coarse grained reconfigurable architecture with iteration level commits

TL;DR: In this paper, an apparatus and method for supporting simultaneous multiple iterations (SMI) in a course-grained reconfigurable architecture (CGRA) is presented, which includes hardware structures that connect all of multiple processing engines (PEs) to a load-store unit (LSU) configured to keep track of which compiled program code iterations have completed, which ones are in flight, and a control unit including hardware structures to maintain synchronization and initiate and terminate loops within the PEs.