M
Mohammad Alian
Researcher at University of Illinois at Urbana–Champaign
Publications - 19
Citations - 473
Mohammad Alian is an academic researcher from University of Illinois at Urbana–Champaign. The author has contributed to research in topics: Server & Network interface. The author has an hindex of 9, co-authored 19 publications receiving 235 citations. Previous affiliations of Mohammad Alian include University of Kansas.
Papers
More filters
Posted Content
The gem5 Simulator: Version 20.0+
Jason Lowe-Power,Abdul Ahmad,Adria Armejach,Adrian Herrera,Alec Roelke,Amin Farmahini-Farahani,Andrea Mondelli,Andreas Hansson,Andreas Sandberg,Anthony Gutierrez,Austin Harris,Ayaz Akram,Bagus Hanindhito,Binh Pham,Bobby R. Bruce,Boris Shingarov,Brad Beckmann,Carlos Escuin,Christian Menard,Christian Weis,Daniel Rodrigues Carvalho,Darien Wood,Dibakar Gope,Éder F. Zulian,Gabe Black,Gedare Bloom,Giacomo Travaglini,Hamidreza Khaleghzadeh,Hanhwi Jang,Hoa Nguyen,Hongil Yoon,Ilias Vougioukas,Javier Setoain,Jayneel Gandhi,Jeronimo Castrillon,Krishnendra Nathella,Lena E. Olson,Lizhong Chen,Mahyar Samani,Marc S. Orr,Marjan Fariborz,Matteo Andreozzi,Matthew D. Sinclair,Matthew James Horsnell,Matthias Jung,Michael Upton,Miquel Moreto,Mohammad Alian,Nicolas Derumigny,Nikos Nikoleris,Nilay Vaish,Nils Asmussen,Norbert Wehn,Omar Naji,Pablo Prieto,Pouya Fotouhi,Radhika Jagtap,Rahul Thakur,Raza Jafri,Reiley Jeyapaul,Rico Amslinger,Ryan Gambord,Srikant Bharadwaj,Stephan Diestelhorst,Subash Kannoth,Swapnil Haria,Syed Ali,Thomas Grass,Tiago Muck,Timothy Hayes,Timothy M. Jones,Tommaso Marinelli,Trivikram Reddy,Tuan Ta,Tushar Krishna,Wendy Arnott Elsasser,William S.-Y. Wang,Yuetsu Kodama,Zhengrong Wang +78 more
TL;DR: How the gem5 simulator has transitioned to a formal governance model to enable continued improvement and community support for the next 20 years of computer architecture research is discussed.
Proceedings ArticleDOI
A network-centric hardware/algorithm co-design to accelerate distributed training of deep neural networks
Youjie Li,Jongse Park,Mohammad Alian,Yifan Yuan,Zheng Qu,Peitian Pan,Ren Wang,Alexander G. Schwing,Hadi Esmaeilzadeh,Nam Sung Kim +9 more
TL;DR: This paper sets out to reduce this significant communication cost by embedding data compression accelerators in the Network Interface Cards (NICs) and proposes an aggregator-free training algorithm that exchanges gradients in both legs of communication in the group, while the workers collectively perform the aggregation in a distributed manner.
Proceedings ArticleDOI
Planaria: Dynamic Architecture Fission for Spatial Multi-Tenant Acceleration of Deep Neural Networks
Soroush Ghodrati,Byung Hoon Ahn,Joon Kyung Kim,Sean Kinzer,Brahmendra Reddy Yatham,Navateja Alla,Hardik Sharma,Mohammad Alian,Eiman Ebrahimi,Nam Sung Kim,Cliff Young,Hadi Esmaeilzadeh +11 more
TL;DR: This paper defines Planaria1, a microarchitectural capability that can dynamically fission (break) into multiple smaller yet full-fledged DNN engines at runtime that enables spatially co-locating multiple DNN inference services on the same hardware, offering simultaneous multi-tenant DNN acceleration.
Proceedings ArticleDOI
Flashshare: punching through server storage stack from kernel to firmware for ultra-low latency SSDs
Jie Zhang,Miryeong Kwon,Donghyun Gouk,Sungjoon Koh,Changlim Lee,Mohammad Alian,Myoungjun Chun,Mahmut Kandemir,Nam Sung Kim,Jihong Kim,Myoungsoo Jung +10 more
TL;DR: FLASHSHARE is a holistic cross-stack approach, which can significantly reduce I/O interferences among co-running applications at a server without any change in applications, and can shorten the average and 99th-percentile turnaround response times of co- running applications by 22% and 31%, respectively.
Proceedings ArticleDOI
Application-transparent near-memory processing architecture with memory channel network
Mohammad Alian,Seungwon Min,Hadi Asghari-Moghaddam,Ashutosh Dhar,Dong Kai Wang,Thomas Roewer,Adam J. McPadden,Oliver O'Halloran,Deming Chen,Jinjun Xiong,Daehoon Kim,Wen-mei W. Hwu,Nam Sung Kim +12 more
TL;DR: Memory Channel Network can serve as an application-transparent framework which can seamlessly unify near-memory processing within a server and distributed computing across such servers for data-intensive applications.