T
Todd Massengill
Researcher at Microsoft
Publications - 8
Citations - 1359
Todd Massengill is an academic researcher from Microsoft. The author has contributed to research in topics: Microarchitecture & Stratix. The author has an hindex of 5, co-authored 8 publications receiving 1021 citations.
Papers
More filters
Proceedings ArticleDOI
A cloud-scale acceleration architecture
Adrian M. Caulfield,Eric S. Chung,Andrew Putnam,Hari Angepat,Jeremy Fowers,Michael Haselman,Stephen F. Heil,Matt Humphrey,Puneet Kaur,Joo-Young Kim,Lo Daniel,Todd Massengill,Kalin Ovtcharov,Michael K. Papamichael,Lisa Woods,Sitaram Lanka,Derek Chiou,Doug Burger +17 more
TL;DR: A new cloud architecture that uses reconfigurable logic to accelerate both network plane functions and applications, and is much more scalable than prior work which used secondary rack-scale networks for inter-FPGA communication.
Proceedings ArticleDOI
A configurable cloud-scale DNN processor for real-time AI
Jeremy Fowers,Kalin Ovtcharov,Michael K. Papamichael,Todd Massengill,Ming Liu,Lo Daniel,Shlomi Alkalay,Michael Haselman,Logan Adams,Mahdi Ghandi,Stephen F. Heil,Prerak Patel,Adam Sapek,Gabriel Weisz,Lisa Woods,Sitaram Lanka,Steven K. Reinhardt,Adrian M. Caulfield,Eric S. Chung,Doug Burger +19 more
TL;DR: This paper describes the NPU architecture for Project Brainwave, a production-scale system for real-time AI, and achieves more than an order of magnitude improvement in latency and throughput over state-of-the-art GPUs on large RNNs at a batch size of 1.5 teraflops.
Journal ArticleDOI
Serving DNNs in Real Time at Datacenter Scale with Project Brainwave
Eric S. Chung,Jeremy Fowers,Kalin Ovtcharov,Michael K. Papamichael,Adrian M. Caulfield,Todd Massengill,Ming Liu,Lo Daniel,Shlomi Alkalay,Michael Haselman,Maleen Abeydeera,Logan Adams,Hari Angepat,Christian Boehn,Derek Chiou,Oren Firestein,Alessandro Forin,Kang Su Gatlin,Mahdi Ghandi,Stephen F. Heil,Kyle Holohan,Ahmad M. El Husseini,Tamas Juhasz,Kara Kagi,Ratna Kumar Kovvuri,Sitaram Lanka,Friedel van Megen,Dima Mukhortov,Prerak Patel,Brandon Perez,Amanda Rapsang,Steven K. Reinhardt,Bita Darvish Rouhani,Adam Sapek,Raja Seera,Sangeetha Shekar,Balaji Sridharan,Gabriel Weisz,Lisa Woods,Phillip Yi Xiao,Dan Zhang,Ritchie Zhao,Doug Burger +42 more
TL;DR: Project Brainwave, Microsofts principal infrastructure for AI serving in real time, accelerates deep neural network inferencing in major services such as Bings intelligent search features and Azure by exploiting distributed model parallelism and pinning over low-latency hardware microservices.
Journal ArticleDOI
Configurable Clouds
Adrian M. Caulfield,Eric S. Chung,Andrew Putnam,Hari Angepat,Daniel Firestone,Jeremy Fowers,Michael Haselman,Stephen F. Heil,Matt Humphrey,Puneet Kaur,Joo-Young Kim,Lo Daniel,Todd Massengill,Kalin Ovtcharov,Michael K. Papamichael,Lisa Woods,Sitaram Lanka,Derek Chiou,Doug Burger +18 more
TL;DR: The authors deploy the Configurable Cloud architecture over a production server bed and show how it can be used to accelerate applications that were explicitly ported to FPGAs and support hardware-first services.
Journal ArticleDOI
Inside Project Brainwave's Cloud-Scale, Real-Time AI Processor
Jeremy Fowers,Kalin Ovtcharov,Michael K. Papamichael,Todd Massengill,Ming Liu,Lo Daniel,Shlomi Alkalay,Michael Haselman,Logan Adams,Mahdi Ghandi,Stephen F. Heil,Prerak Patel,Adam Sapek,Gabriel Weisz,Lisa Woods,Sitaram Lanka,Steven K. Reinhardt,Adrian M. Caulfield,Eric S. Chung,Doug Burger +19 more
TL;DR: The Project Brainwave NPU is described, a parameterized microarchitecture specialized at synthesis time for convolutional and recurrent DNN workloads that achieves sustained performance of 35 teraflops at a batch size of 1 on a large recurrent neural network (RNN).