Home
/
Authors
/
Kelvin Lin

Author

Kelvin Lin

Bio: Kelvin Lin is an academic researcher from University of Washington. The author has contributed to research in topics: Hardware acceleration & Throughput (business). The author has an hindex of 3, co-authored 4 publications receiving 28 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

GPU coprocessors as a service for deep learning inference in high energy physics

[...]

Jeffrey Krupa¹, Kelvin Lin², Maria Acosta Flechas³, Jack Dinsmore¹, Javier Duarte⁴, Philip Harris¹, Scott Hauck², Burt Holzman³, Shih-Chieh Hsu², Thomas Klijnsma³, Mia Liu³, Kevin Pedro³, Dylan Rankin¹, Natchanon Suaysom², Matthew Trahms², Nhan Tran⁵, Nhan Tran³ - Show less +13 more•Institutions (5)

Massachusetts Institute of Technology¹, University of Washington², Fermilab³, University of California, San Diego⁴, Northwestern University⁵

04 Mar 2021

TL;DR: A comprehensive exploration of the use of GPU-based hardware acceleration for deep learning inference within the data reconstruction workflow of high energy physics is presented.

...read moreread less

Abstract: In the next decade, the demands for computing in large scientific experiments are expected to grow tremendously. During the same time period, CPU performance increases will be limited. At the CERN Large Hadron Collider (LHC), these two issues will confront one another as the collider is upgraded for high luminosity running. Alternative processors such as graphics processing units (GPUs) can resolve this confrontation provided that algorithms can be sufficiently accelerated. In many cases, algorithmic speedups are found to be largest through the adoption of deep learning algorithms. We present a comprehensive exploration of the use of GPU-based hardware acceleration for deep learning inference within the data reconstruction workflow of high energy physics. We present several realistic examples and discuss a strategy for the seamless integration of coprocessors so that the LHC can maintain, if not exceed, its current performance throughout its running.

...read moreread less

16 citations

Journal Article•DOI•

GPU coprocessors as a service for deep learning inference in high energy physics

[...]

Jeffrey Krupa¹, Kelvin Lin², Maria Acosta Flechas³, Jack Dinsmore¹, Javier Duarte⁴, Philip Harris¹, Scott Hauck², Burt Holzman³, Shih-Chieh Hsu², Thomas Klijnsma³, Mia Liu³, Kevin Pedro³, Dylan Rankin¹, Natchanon Suaysom², Matthew Trahms², Nhan Tran³, Nhan Tran⁵ - Show less +13 more•Institutions (5)

Massachusetts Institute of Technology¹, University of Washington², Fermilab³, University of California, San Diego⁴, Northwestern University⁵

20 Jul 2020-arXiv: Computational Physics

TL;DR: In this article, the authors explore the use of GPU-based hardware acceleration for deep learning inference within the data reconstruction workflow of high energy physics at the CERN Large Hadron Collider (LHC).

...read moreread less

15 citations

Proceedings Article•DOI•

FPGAs-as-a-Service Toolkit (FaaST)

[...]

Dylan Rankin¹, Jeffrey Krupa¹, Philip Harris¹, Maria Acosta Flechas², Burt Holzman², Thomas Klijnsma², Kevin Pedro², Nhan Tran², Scott Hauck³, Shih-Chieh Hsu³, Matthew Trahms³, Kelvin Lin³, Yu Lou³, Ta-Wei Ho⁴, Javier Duarte⁵, Mia Liu⁶ - Show less +12 more•Institutions (6)

Massachusetts Institute of Technology¹, Fermilab², University of Washington³, National Tsing Hua University⁴, University of California, San Diego⁵, Purdue University⁶

01 Nov 2020

TL;DR: In this paper, the performance of FPGAs as a service for high energy physics applications is evaluated using a series of workflows developed to establish the performance capabilities of the FPGA as a Service.

...read moreread less

Abstract: Computing needs for high energy physics are already intensive and are expected to increase drastically in the coming years. In this context, heterogeneous computing, specifically as-a-service computing, has the potential for significant gains over traditional computing models. Although previous studies and packages in the field of heterogeneous computing have focused on GPUs as accelerators, FPGAs are an extremely promising option as well. A series of workflows are developed to establish the performance capabilities of FPGAs as a service. Multiple different devices and a range of algorithms for use in high energy physics are studied. For a small, dense network, the throughput can be improved by an order of magnitude with respect to GPUs as a service. For large convolutional networks, the throughput is found to be comparable to GPUs as a service. This work represents the first open-source FPGAs-as-a-service toolkit.

...read moreread less

11 citations

Posted Content•

FPGAs-as-a-Service Toolkit (FaaST)

[...]

Massachusetts Institute of Technology¹, Fermilab², University of Washington³, National Tsing Hua University⁴, University of California, San Diego⁵, Purdue University⁶

16 Oct 2020-arXiv: Computational Physics

TL;DR: This work represents the first open-source FPGAs-as-a-service toolkit and shows that for large convolutional networks, the throughput is found to be comparable to GPUs as a service.

...read moreread less

9 citations

Journal Article•DOI•

Real-time semantic segmentation on FPGAs for autonomous vehicles with hls4ml

[...]

Nico Ghielmetti, Vladimir Loncar, Maurizio Pierini, Marcel Roed, Sioni Summers, Thea Klaeboe Aarrestad, Christoffer Petersson, Hampus Linander, Jennifer Ngadiuba, Kelvin Lin, Philip Harris - Show less +7 more

16 May 2022-Machine learning: science and technology

TL;DR: In this article , the authors investigate how field programmable gate arrays can serve as hardware accelerators for real-time semantic segmentation tasks relevant for autonomous driving, and demonstrate a fully-on-chip deployment with a latency of 4.9 ms per image, using less than 30% of the available resources on a Xilinx ZCU102 evaluation board.

...read moreread less

Abstract: In this paper, we investigate how field programmable gate arrays can serve as hardware accelerators for real-time semantic segmentation tasks relevant for autonomous driving. Considering compressed versions of the ENet convolutional neural network architecture, we demonstrate a fully-on-chip deployment with a latency of 4.9 ms per image, using less than 30% of the available resources on a Xilinx ZCU102 evaluation board. The latency is reduced to 3 ms per image when increasing the batch size to ten, corresponding to the use case where the autonomous vehicle receives inputs from multiple cameras simultaneously. We show, through aggressive filter reduction and heterogeneous quantization-aware training, and an optimized implementation of convolutional layers, that the power consumption and resource utilization can be significantly reduced while maintaining accuracy on the Cityscapes dataset.

...read moreread less

1 citations

Cited by

PDF

Open Access

More filters

Posted Content•

Graph Neural Networks for Particle Tracking and Reconstruction

[...]

Javier Duarte, Jean-Roch Vlimant

03 Dec 2020-arXiv: High Energy Physics - Phenomenology

TL;DR: This chapter recapitulate the mathematical formalism of GNNs and highlight aspects to consider when designing these networks for HEP data, including graph construction, model architectures, learning objectives, and graph pooling.

...read moreread less

Abstract: Machine learning methods have a long history of applications in high energy physics (HEP). Recently, there is a growing interest in exploiting these methods to reconstruct particle signatures from raw detector data. In order to benefit from modern deep learning algorithms that were initially designed for computer vision or natural language processing tasks, it is common practice to transform HEP data into images or sequences. Conversely, graph neural networks (GNNs), which operate on graph data composed of elements with a set of features and their pairwise connections, provide an alternative way of incorporating weight sharing, local connectivity, and specialized domain knowledge. Particle physics data, such as the hits in a tracking detector, can generally be represented as graphs, making the use of GNNs natural. In this chapter, we recapitulate the mathematical formalism of GNNs and highlight aspects to consider when designing these networks for HEP data, including graph construction, model architectures, learning objectives, and graph pooling. We also review promising applications of GNNs for particle tracking and reconstruction in HEP and summarize the outlook for their deployment in current and future experiments.

...read moreread less

41 citations

Posted Content•

Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

[...]

Aneesh Heintz, Lindsey Gray, Nhan Tran, Savannah Thais, Gage Dezoort, Maurizio Pierini, Vesal Razavimaleki, Edward Kreinar, Mia Liu, Thea Klaeboe Aarrestad, Dylan Rankin, Jennifer Ngadiuba, Isobel Ojalvo, Mark Neubauer, Javier Duarte, Zhenbin Wu, Sioni Summers, Philip Harris, Sergo Jindariani, Markus Atkinson, Vladimir Loncar - Show less +17 more

30 Nov 2020-arXiv: Instrumentation and Detectors

TL;DR: A considerable speedup over CPU-based execution is possible, potentially enabling such algorithms to be used effectively in future computing workflows and the FPGA-based Level-1 trigger at the CERN Large Hadron Collider.

...read moreread less

Abstract: We develop and study FPGA implementations of algorithms for charged particle tracking based on graph neural networks. The two complementary FPGA designs are based on OpenCL, a framework for writing programs that execute across heterogeneous platforms, and hls4ml, a high-level-synthesis-based compiler for neural network to firmware conversion. We evaluate and compare the resource usage, latency, and tracking performance of our implementations based on a benchmark dataset. We find a considerable speedup over CPU-based execution is possible, potentially enabling such algorithms to be used effectively in future computing workflows and the FPGA-based Level-1 trigger at the CERN Large Hadron Collider.

...read moreread less

34 citations

Journal Article•DOI•

MLPF: efficient machine-learned particle-flow reconstruction using graph neural networks

[...]

Joosep Pata¹, Joosep Pata², Javier Duarte³, Jean-Roch Vlimant¹, Maurizio Pierini⁴, Maria Spiropulu¹ - Show less +2 more•Institutions (4)

California Institute of Technology¹, National Institute of Chemical Physics and Biophysics², University of California, San Diego³, CERN⁴

21 Jan 2021-European Physical Journal C

TL;DR: In this paper, an end-to-end trainable, machine-learned particle-flow algorithm based on parallelizable, computationally efficient, and scalable graph neural network optimized using a multi-task objective on simulated events is presented.

...read moreread less

Abstract: In general-purpose particle detectors, the particle-flow algorithm may be used to reconstruct a comprehensive particle-level view of the event by combining information from the calorimeters and the trackers, significantly improving the detector resolution for jets and the missing transverse momentum. In view of the planned high-luminosity upgrade of the CERN Large Hadron Collider (LHC), it is necessary to revisit existing reconstruction algorithms and ensure that both the physics and computational performance are sufficient in an environment with many simultaneous proton–proton interactions (pileup). Machine learning may offer a prospect for computationally efficient event reconstruction that is well-suited to heterogeneous computing platforms, while significantly improving the reconstruction quality over rule-based algorithms for granular detectors. We introduce MLPF, a novel, end-to-end trainable, machine-learned particle-flow algorithm based on parallelizable, computationally efficient, and scalable graph neural network optimized using a multi-task objective on simulated events. We report the physics and computational performance of the MLPF algorithm on a Monte Carlo dataset of top quark–antiquark pairs produced in proton–proton collisions in conditions similar to those expected for the high-luminosity LHC. The MLPF algorithm improves the physics response with respect to a rule-based benchmark algorithm and demonstrates computationally scalable particle-flow reconstruction in a high-pileup environment.

...read moreread less

31 citations

Journal Article•DOI•

Performance of a geometric deep learning pipeline for HL-LHC particle tracking

[...]

Xiangyang Ju¹, Daniel Murnane¹, P. Calafiura¹, Nicholas Choma¹, Sean Conlon¹, Steven Farrell¹, Yaoyuan Xu¹, Maria Spiropulu², Jean-Roch Vlimant², Adam Aurisano³, J. Hewes³, Giuseppe Benedetto Cerati⁴, Lindsey Gray⁴, Thomas Klijnsma⁴, Jim Kowalkowski⁴, Markus Atkinson⁵, Mark Neubauer⁵, Gage Dezoort⁶, Savannah Thais⁶, Aditi Chauhan⁷, Alex Schuy⁷, Shih-Chieh Hsu⁷, Alex Ballow⁸, Alina Lazar⁸ - Show less +20 more•Institutions (8)

Lawrence Berkeley National Laboratory¹, California Institute of Technology², University of Cincinnati³, Fermilab⁴, University of Illinois at Urbana–Champaign⁵, Princeton University⁶, University of Washington⁷, Youngstown State University⁸

01 Oct 2021-European Physical Journal C

TL;DR: The Exa.TrkX project as mentioned in this paper applied geometric learning concepts such as metric learning and graph neural networks to HEP particle tracking and achieved tracking efficiency and purity similar to production tracking algorithms.

...read moreread less

Abstract: The Exa.TrkX project has applied geometric learning concepts such as metric learning and graph neural networks to HEP particle tracking. Exa.TrkX’s tracking pipeline groups detector measurements to form track candidates and filters them. The pipeline, originally developed using the TrackML dataset (a simulation of an LHC-inspired tracking detector), has been demonstrated on other detectors, including DUNE Liquid Argon TPC and CMS High-Granularity Calorimeter. This paper documents new developments needed to study the physics and computing performance of the Exa.TrkX pipeline on the full TrackML dataset, a first step towards validating the pipeline using ATLAS and CMS data. The pipeline achieves tracking efficiency and purity similar to production tracking algorithms. Crucially for future HEP applications, the pipeline benefits significantly from GPU acceleration, and its computational requirements scale close to linearly with the number of particles in the event.

...read moreread less

17 citations

DOI•

Charged Particle Tracking via Edge-Classifying Interaction Networks

[...]

Gage Dezoort¹, Savannah Thais¹, Javier Duarte², Vesal Razavimaleki², Markus Atkinson³, Isobel Ojalvo¹, Mark Neubauer³, Peter Elmer¹ - Show less +4 more•Institutions (3)

Princeton University¹, University of California, San Diego², University of Illinois at Urbana–Champaign³

01 Dec 2021

TL;DR: In this article, the authors adapt the physics-motivated interaction network (IN) GNN to the problem of particle tracking in pileup conditions similar to those expected at the high-luminosity Large Hadron Collider.

...read moreread less

Abstract: Recent work has demonstrated that geometric deep learning methods such as graph neural networks (GNNs) are well suited to address a variety of reconstruction problems in high-energy particle physics. In particular, particle tracking data are naturally represented as a graph by identifying silicon tracker hits as nodes and particle trajectories as edges, given a set of hypothesized edges, edge-classifying GNNs identify those corresponding to real particle trajectories. In this work, we adapt the physics-motivated interaction network (IN) GNN toward the problem of particle tracking in pileup conditions similar to those expected at the high-luminosity Large Hadron Collider. Assuming idealized hit filtering at various particle momenta thresholds, we demonstrate the IN’s excellent edge-classification accuracy and tracking efficiency through a suite of measurements at each stage of GNN-based tracking: graph construction, edge classification, and track building. The proposed IN architecture is substantially smaller than previously studied GNN tracking architectures; this is particularly promising as a reduction in size is critical for enabling GNN-based tracking in constrained computing environments. Furthermore, the IN may be represented as either a set of explicit matrix operations or a message passing GNN. Efforts are underway to accelerate each representation via heterogeneous computing resources towards both high-level and low-latency triggering applications.

...read moreread less

15 citations

1
2
3
4
…
5
6

Collapse