C
C. Richard Ho
Researcher at D. E. Shaw Research
Publications - 7
Citations - 5054
C. Richard Ho is an academic researcher from D. E. Shaw Research. The author has contributed to research in topics: Parallel algorithm & Massively parallel. The author has an hindex of 7, co-authored 7 publications receiving 3888 citations. Previous affiliations of C. Richard Ho include Stanford University.
Papers
More filters
Posted Content
In-Datacenter Performance Analysis of a Tensor Processing Unit
Norman P. Jouppi,Cliff Young,Nishant Patil,David A. Patterson,Gaurav Agrawal,Raminder Bajwa,Sarah Bates,Suresh Bhatia,Nan Boden,Albert T. Borchers,Rick Boyle,Pierre-luc Cantin,Clifford Chao,Christopher Aaron Clark,Jeremy Coriell,Michael J. Daley,Matt Dau,Jeffrey Dean,Ben Gelb,Tara Vazir Ghaemmaghami,Rajendra Gottipati,William John Gulland,Robert Hagmann,C. Richard Ho,Doug Hogberg,John Hu,Robert Hundt,D. Hurt,Julian Ibarz,Aaron Jaffey,Alek Jaworski,Alexander Kaplan,Khaitan Harshit,Andy Koch,Naveen Kumar,Steve Lacy,James Laudon,James Law,Diemthu Le,Chris Leary,Zhuyuan Liu,Kyle Lucke,Alan Lundin,Gordon MacKean,Adriana Maggiore,Maire Mahony,Kieran Miller,Rahul Nagarajan,Ravi Narayanaswami,Ray Ni,Kathy Nix,Thomas Norrie,Mark Omernick,Narayana Penukonda,Andrew Everett Phelps,Jonathan Ross,Matt Ross,Amir Salek,Emad Samadiani,Chris Severn,Gregory Sizikov,Matthew Snelham,Jed Souter,Dan Steinberg,Andy Swing,Mercedes Tan,Gregory Michael Thorson,Bo Tian,Horia Toma,Erick Tuttle,Vijay K. Vasudevan,Richard Walter,Walter Wang,Eric Wilcox,Doe Hyun Yoon +74 more
TL;DR: This paper evaluates a custom ASIC-called a Tensor Processing Unit (TPU)-deployed in datacenters since 2015 that accelerates the inference phase of neural networks (NN) and compares it to a server-class Intel Haswell CPU and an Nvidia K80 GPU, which are contemporaries deployed in the samedatacenters.
Journal ArticleDOI
Anton, a special-purpose machine for molecular dynamics simulation
David E. Shaw,Martin M. Deneroff,Ron O. Dror,Jeffrey S. Kuskin,Richard H. Larson,John K. Salmon,Cliff Young,Brannon Batson,Kevin J. Bowers,Jack C. Chao,Michael P. Eastwood,Joseph Gagliardo,J. P. Grossman,C. Richard Ho,Douglas J. Ierardi,István Kolossváry,John L. Klepeis,Timothy Layman,Christine McLeavey,Mark A. Moraes,Rolf Mueller,Edward C. Priest,Yibing Shan,Jochen Spengler,Michael Theobald,Brian Towles,Stanley C. Wang +26 more
TL;DR: A massively parallel machine called Anton is described, which should be capable of executing millisecond-scale classical MD simulations of such biomolecular systems and has been designed to use both novel parallel algorithms and special-purpose logic to dramatically accelerate those calculations that dominate the time required for a typical MD simulation.
Proceedings ArticleDOI
Anton 2: raising the bar for performance and programmability in a special-purpose molecular dynamics supercomputer
David E. Shaw,J. P. Grossman,Joseph A. Bank,Brannon Batson,J. Adam Butts,Jack C. Chao,Martin M. Deneroff,Ron O. Dror,Amos Even,Christopher H. Fenton,Anthony Forte,Joseph Gagliardo,Gennette Gill,Brian Greskamp,C. Richard Ho,Douglas J. Ierardi,Lev Iserovich,Jeffrey S. Kuskin,Richard H. Larson,Timothy Layman,Li-Siang Lee,Adam Lerer,Chester Li,Daniel Killebrew,Kenneth M. Mackenzie,Shark Yeuk-Hai Mok,Mark A. Moraes,Rolf Mueller,Lawrence J. Nociolo,Jon L. Peticolas,Terry Quan,Daniel Ramot,John K. Salmon,Daniele Paolo Scarpazza,U. Ben Schafer,Naseer Siddique,Christopher W. Snyder,Jochen Spengler,Ping Tak Peter Tang,Michael Theobald,Horia Toma,Brian Towles,Benjamin Vitale,Stanley C. Wang,Cliff Young +44 more
TL;DR: The architecture of Anton 2 is tailored for fine-grained event-driven operation, which improves performance by increasing the overlap of computation with communication, and also allows a wider range of algorithms to run efficiently, enabling many new software-based optimizations.
Proceedings ArticleDOI
Anton, a special-purpose machine for molecular dynamics simulation
David E. Shaw,Martin M. Deneroff,Ron O. Dror,Jeffrey S. Kuskin,Richard H. Larson,John K. Salmon,Cliff Young,Brannon Batson,Kevin J. Bowers,Jack C. Chao,Michael P. Eastwood,Joseph Gagliardo,J. P. Grossman,C. Richard Ho,Douglas J. Ierardi,István Kolossváry,John L. Klepeis,Timothy Layman,Christine McLeavey,Mark A. Moraes,Rolf Mueller,Edward C. Priest,Yibing Shan,Jochen Spengler,Michael Theobald,Brian Towles,Stanley C. Wang +26 more
TL;DR: A massively parallel machine called Anton is described, which should be capable of executing millisecond-scale classical MD simulations of such biomolecular systems and is designed to use both novel parallel algorithms and special-purpose logic to dramatically accelerate those calculations that dominate the time required for a typical MD simulation.
Proceedings ArticleDOI
Architecture validation for processors
TL;DR: This paper uses techniques from formal verification to derive transition tours of a fully enumerated state graph of the control logic of the processor to validate an embedded dual-issue processor in the node controller of the Stanford FLASH Multiprocessor.