K
Kalin Ovtcharov
Researcher at Microsoft
Publications - 27
Citations - 2463
Kalin Ovtcharov is an academic researcher from Microsoft. The author has contributed to research in topics: Artificial neural network & Cloud computing. The author has an hindex of 11, co-authored 27 publications receiving 1942 citations.
Papers
More filters
Proceedings ArticleDOI
A cloud-scale acceleration architecture
Adrian M. Caulfield,Eric S. Chung,Andrew Putnam,Hari Angepat,Jeremy Fowers,Michael Haselman,Stephen F. Heil,Matt Humphrey,Puneet Kaur,Joo-Young Kim,Lo Daniel,Todd Massengill,Kalin Ovtcharov,Michael K. Papamichael,Lisa Woods,Sitaram Lanka,Derek Chiou,Doug Burger +17 more
TL;DR: A new cloud architecture that uses reconfigurable logic to accelerate both network plane functions and applications, and is much more scalable than prior work which used secondary rack-scale networks for inter-FPGA communication.
Proceedings ArticleDOI
A configurable cloud-scale DNN processor for real-time AI
Jeremy Fowers,Kalin Ovtcharov,Michael K. Papamichael,Todd Massengill,Ming Liu,Lo Daniel,Shlomi Alkalay,Michael Haselman,Logan Adams,Mahdi Ghandi,Stephen F. Heil,Prerak Patel,Adam Sapek,Gabriel Weisz,Lisa Woods,Sitaram Lanka,Steven K. Reinhardt,Adrian M. Caulfield,Eric S. Chung,Doug Burger +19 more
TL;DR: This paper describes the NPU architecture for Project Brainwave, a production-scale system for real-time AI, and achieves more than an order of magnitude improvement in latency and throughput over state-of-the-art GPUs on large RNNs at a batch size of 1.5 teraflops.
Accelerating Deep Convolutional Neural Networks Using Specialized Hardware
TL;DR: Hardware specialization in the form of GPGPUs, FPGAs, and ASICs offers a promising path towards major leaps in processing capability while achieving high energy efficiency, and combining multiple FPGA over a low-latency communication fabric offers further opportunity to train and evaluate models of unprecedented size and quality.
Proceedings Article
Azure accelerated networking: SmartNICs in the public cloud
Daniel Firestone,Andrew Putnam,Mundkur Sambhrama Madhusudhan,Derek Chiou,Alireza Dabagh,Mike Andrewartha,Hari Angepat,Vivek Bhanu,Adrian M. Caulfield,Eric S. Chung,Chandrappa Harish Kumar,Chaturmohta Somesh,Matt Humphrey,Jack Lavier,Lam Norman C,Fengfen Liu,Kalin Ovtcharov,Jitu Padhye,Gautham Popuri,Shachar Raindel,Tejas Sapre,Mark Shaw,Gabriel Silva,Madhan Sivakumar,Nisheeth Srivastava,Anshuman Verma,Qasim Zuhair,Deepak Bansal,Doug Burger,Kushagra Vaid,David A. Maltz,Albert Greenberg +31 more
TL;DR: The design of AccelNet is presented, including the hardware/software codesign model, performance results on key workloads, and experiences and lessons learned from developing and deploying Accel net on FPGA-based Azure SmartNICs.
Journal ArticleDOI
Serving DNNs in Real Time at Datacenter Scale with Project Brainwave
Eric S. Chung,Jeremy Fowers,Kalin Ovtcharov,Michael K. Papamichael,Adrian M. Caulfield,Todd Massengill,Ming Liu,Lo Daniel,Shlomi Alkalay,Michael Haselman,Maleen Abeydeera,Logan Adams,Hari Angepat,Christian Boehn,Derek Chiou,Oren Firestein,Alessandro Forin,Kang Su Gatlin,Mahdi Ghandi,Stephen F. Heil,Kyle Holohan,Ahmad M. El Husseini,Tamas Juhasz,Kara Kagi,Ratna Kumar Kovvuri,Sitaram Lanka,Friedel van Megen,Dima Mukhortov,Prerak Patel,Brandon Perez,Amanda Rapsang,Steven K. Reinhardt,Bita Darvish Rouhani,Adam Sapek,Raja Seera,Sangeetha Shekar,Balaji Sridharan,Gabriel Weisz,Lisa Woods,Phillip Yi Xiao,Dan Zhang,Ritchie Zhao,Doug Burger +42 more
TL;DR: Project Brainwave, Microsofts principal infrastructure for AI serving in real time, accelerates deep neural network inferencing in major services such as Bings intelligent search features and Azure by exploiting distributed model parallelism and pinning over low-latency hardware microservices.