K
Khalid Al-Hawaj
Researcher at Cornell University
Publications - 8
Citations - 94
Khalid Al-Hawaj is an academic researcher from Cornell University. The author has contributed to research in topics: Computer science & Speedup. The author has an hindex of 4, co-authored 6 publications receiving 50 citations.
Papers
More filters
Proceedings ArticleDOI
A New Approach to Automatic Memory Banking using Trace-Based Address Mining
TL;DR: This paper proposes TraceBanking, a novel and flexible trace-driven address mining algorithm that can automatically generate efficient memory banking schemes by analyzing a stream of memory address bits and achieves competitive performance and resource usage compared to the state-of-the-art across a set of real-life benchmarks with affine memory accesses.
Proceedings ArticleDOI
A 1.4 GHz 695 Giga Risc-V Inst/s 496-Core Manycore Processor With Mesh On-Chip Network and an All-Digital Synthesized PLL in 16nm CMOS
Austin Rovinski,Chun Zhao,Khalid Al-Hawaj,Paul Gao,Shaolin Xie,Christopher Torng,Scott Davidson,Aporva Amarnath,Luis Vega,Bandhav Veluri,Anuj Rao,Tutu Ajayi,Julian Puscar,Steve Dai,Ritchie Zhao,Dustin Richmond,Zhiru Zhang,Ian Galton,Christopher Batten,Michael Taylor,Ronald G. Dreslinski +20 more
TL;DR: The main feature is the NoC architecture, which uses only 1881μm2 per router node, enables highly scalable and dense compute, and provides up to 361 Tb/s of aggregate bandwidth.
Proceedings ArticleDOI
Towards a Reconfigurable Bit-Serial/Bit-Parallel Vector Accelerator using In-Situ Processing-In-SRAM
TL;DR: This paper is the first work to perform a rigorous evaluation of bit-serial vs. bit-parallel in-situ processing-in-SRAM, and shows that both approaches have similar area overheads.
Journal ArticleDOI
Evaluating Celerity: A 16-nm 695 Giga-RISC-V Instructions/s Manycore Processor With Synthesizable PLL
Austin Rovinski,Bandhav Veluri,Anuj Rao,Tutu Ajayi,Julian Puscar,Steve Dai,Ritchie Zhao,Dustin Richmond,Zhiru Zhang,Ian Galton,Christopher Batten,Chun Zhao,Michael Taylor,Ronald G. Dreslinski,Khalid Al-Hawaj,Paul Gao,Shaolin Xie,Christopher Torng,Scott Davidson,Aporva Amarnath,Luis Vega +20 more
TL;DR: This letter presents a 16-nm 496-core RISC-V network-on-chip (NoC) architecture, which enables highly scalable and dense compute, and provides up to 361 Tb/s of aggregate bandwidth.
Proceedings ArticleDOI
CAPE: A Content-Addressable Processing Engine
Helena Caminal,Kailin Yang,Srivatsa Srinivasa,Akshay Krishna Ramanathan,Khalid Al-Hawaj,Tianshu Wu,Vijaykrishnan Narayanan,Christopher Batten,Jose F. Martinez +8 more
TL;DR: The content-addressable parallel processing paradigm (CAPP) as discussed by the authors is an in-situ PIM architecture that leverages content addressable memories to realize bit-serial arithmetic and logic operations via sequences of search and update operations over multiple memory rows in parallel.