Home
/
Authors
/
Felix Baum

Author

Felix Baum

Bio: Felix Baum is an academic researcher from Qualcomm. The author has contributed to research in topics: Mobile device & Android (operating system). The author has an hindex of 2, co-authored 2 publications receiving 128 citations.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

AI Benchmark: All About Deep Learning on Smartphones in 2019

[...]

Andrey Ignatov¹, Radu Timofte¹, Andrei Kulik², Seung-Soo Yang³, Ke Wang⁴, Felix Baum⁵, Max Wu⁶, Lirong Xu, Luc Van Gool¹ - Show less +5 more•Institutions (6)

ETH Zurich¹, Google², Samsung³, Huawei⁴, Qualcomm⁵, MediaTek⁶

02 Nov 2019

TL;DR: In this article, the authors evaluate the performance and compare the results of all chipsets from Qualcomm, HiSilicon, Samsung, MediaTek and Unisoc that are providing hardware acceleration for AI inference.

...read moreread less

Abstract: The performance of mobile AI accelerators has been evolving rapidly in the past two years, nearly doubling with each new generation of SoCs. The current 4th generation of mobile NPUs is already approaching the results of CUDA-compatible Nvidia graphics cards presented not long ago, which together with the increased capabilities of mobile deep learning frameworks makes it possible to run complex and deep AI models on mobile devices. In this paper, we evaluate the performance and compare the results of all chipsets from Qualcomm, HiSilicon, Samsung, MediaTek and Unisoc that are providing hardware acceleration for AI inference. We also discuss the recent changes in the Android ML pipeline and provide an overview of the deployment of deep learning models on mobile devices. All numerical results provided in this paper can be found and are regularly updated on the official project website: http://ai-benchmark.com.

...read moreread less

145 citations

Posted Content•

AI Benchmark: All About Deep Learning on Smartphones in 2019

[...]

Andrey Ignatov¹, Radu Timofte¹, Andrei Kulik², Seung-Soo Yang³, Ke Wang⁴, Felix Baum⁵, Max Wu⁶, Lirong Xu, Luc Van Gool¹ - Show less +5 more•Institutions (6)

ETH Zurich¹, Google², Samsung³, Huawei⁴, Qualcomm⁵, MediaTek⁶

15 Oct 2019-arXiv: Performance

TL;DR: This paper evaluates the performance and compares the results of all chipsets from Qualcomm, HiSilicon, Samsung, MediaTek and Unisoc that are providing hardware acceleration for AI inference and discusses the recent changes in the Android ML pipeline.

...read moreread less

88 citations

Cited by

PDF

Open Access

More filters

Posted Content•

Meta-Learning in Neural Networks: A Survey

[...]

Timothy M. Hospedales¹, Antreas Antoniou¹, Paul Micaelli¹, Amos Storkey¹•Institutions (1)

University of Edinburgh¹

11 Apr 2020-arXiv: Learning

TL;DR: A new taxonomy is proposed that provides a more comprehensive breakdown of the space of meta-learning methods today, including few-shot learning, reinforcement learning and architecture search, and promising applications and successes.

...read moreread less

Abstract: The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent years. Contrary to conventional approaches to AI where tasks are solved from scratch using a fixed learning algorithm, meta-learning aims to improve the learning algorithm itself, given the experience of multiple learning episodes. This paradigm provides an opportunity to tackle many conventional challenges of deep learning, including data and computation bottlenecks, as well as generalization. This survey describes the contemporary meta-learning landscape. We first discuss definitions of meta-learning and position it with respect to related fields, such as transfer learning and hyperparameter optimization. We then propose a new taxonomy that provides a more comprehensive breakdown of the space of meta-learning methods today. We survey promising applications and successes of meta-learning such as few-shot learning and reinforcement learning. Finally, we discuss outstanding challenges and promising areas for future research.

...read moreread less

831 citations

Proceedings Article•DOI•

MLPerf inference benchmark

[...]

Vijay Janapa Reddi¹, Christine Cheng², David Kanter, Peter Mattson³, Guenther Schmuelling⁴, Carole-Jean Wu⁵, Brian M. Anderson³, Maximilien Breughe⁶, Mark Charlebois⁷, William Chou⁷, Ramesh Chukka², Cody Coleman⁸, Sam Davis, Pan Deng⁹, Greg Diamos, Jared Duke³, Dave Fick, J. Scott Gardner, Itay Hubara, Sachin Satish Idgunji⁶, Thomas B. Jablin³, Jeff Jiao, Tom St. John, Pankaj Kanwar³, David Lee¹⁰, Jeffery Liao¹¹, Anton Lokhmotov, Francisco Massa⁵, Peng Meng⁹, Paulius Micikevicius⁶, Colin Osborne, Gennady Pekhimenko¹², Arun Tejusve Raghunath Rajan², Dilip Sequeira⁶, Ashish Sirasao¹³, Fei Sun⁵, Hanlin Tang², Michael Thomson¹⁴, Frank Wei¹⁵, Ephrem C. Wu¹³, Lingjie Xu, Koichi Yamada², Bing Yu¹⁰, George Yuan⁶, Aaron Zhong, Peizhao Zhang⁵, Yuchen Zhou¹⁶ - Show less +43 more•Institutions (16)

Harvard University¹, Intel², Google³, Microsoft⁴, Facebook⁵, Nvidia⁶, Qualcomm⁷, Stanford University⁸, Tencent⁹, MediaTek¹⁰, Synopsys¹¹, University of Toronto¹², Xilinx¹³, Centaur Technology¹⁴, Alibaba Group¹⁵, General Motors¹⁶

30 May 2020

TL;DR: This paper presents the benchmarking method for evaluating ML inference systems, MLPerf Inference, and prescribes a set of rules and best practices to ensure comparability across systems with wildly differing architectures.

...read moreread less

Abstract: Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devices to data-center solutions. Fueling the hardware are a dozen or more software frameworks and libraries. The myriad combinations of ML hardware and ML software make assessing ML-system performance in an architecture-neutral, representative, and reproducible manner challenging. There is a clear need for industry-wide standard ML benchmarking and evaluation criteria. MLPerf Inference answers that call. In this paper, we present our benchmarking method for evaluating ML inference systems. Driven by more than 30 organizations as well as more than 200 ML engineers and practitioners, MLPerf prescribes a set of rules and best practices to ensure comparability across systems with wildly differing architectures. The first call for submissions garnered more than 600 reproducible inference-performance measurements from 14 organizations, representing over 30 systems that showcase a wide range of capabilities. The submissions attest to the benchmark’s flexibility and adaptability.

...read moreread less

284 citations

Journal Article•DOI•

Pruning and quantization for deep neural network acceleration: A survey

[...]

Tailin Liang¹, John Glossner¹, Lei Wang¹, Shaobo Shi¹, Xiaotong Zhang¹ - Show less +1 more•Institutions (1)

University of Science and Technology Beijing¹

21 Oct 2021-Neurocomputing

TL;DR: A survey on two types of network compression: pruning and quantization is provided, which compare current techniques, analyze their strengths and weaknesses, provide guidance for compressing networks, and discuss possible future compression techniques.

...read moreread less

266 citations

Proceedings Article•DOI•

SPINN: synergistic progressive inference of neural networks over device and cloud

[...]

Stefanos Laskaridis¹, Stylianos I. Venieris¹, Mario Almeida¹, Ilias Leontiadis¹, Nicholas D. Lane¹ - Show less +1 more•Institutions (1)

University of Cambridge¹

21 Sep 2020

TL;DR: SPINN is proposed, a distributed inference system that employs synergistic device-cloud computation together with a progressive inference method to deliver fast and robust CNN inference across diverse settings, and provides robust operation under uncertain connectivity conditions and significant energy savings compared to cloud-centric execution.

...read moreread less

Abstract: Despite the soaring use of convolutional neural networks (CNNs) in mobile applications, uniformly sustaining high-performance inference on mobile has been elusive due to the excessive computational demands of modern CNNs and the increasing diversity of deployed devices. A popular alternative comprises offloading CNN processing to powerful cloud-based servers. Nevertheless, by relying on the cloud to produce outputs, emerging mission-critical and high-mobility applications, such as drone obstacle avoidance or interactive applications, can suffer from the dynamic connectivity conditions and the uncertain availability of the cloud. In this paper, we propose SPINN, a distributed inference system that employs synergistic device-cloud computation together with a progressive inference method to deliver fast and robust CNN inference across diverse settings. The proposed system introduces a novel scheduler that co-optimises the early-exit policy and the CNN splitting at run time, in order to adapt to dynamic conditions and meet user-defined service-level requirements. Quantitative evaluation illustrates that SPINN outperforms its state-of-the-art collaborative inference counterparts by up to 2× in achieved throughput under varying network conditions, reduces the server cost by up to 6.8× and improves accuracy by 20.7% under latency constraints, while providing robust operation under uncertain connectivity conditions and significant energy savings compared to cloud-centric execution.

...read moreread less

131 citations

Posted Content•

MLPerf Inference Benchmark

[...]

06 Nov 2019-arXiv: Learning

TL;DR: MLPerf Inference as mentioned in this paper is a benchmarking method for evaluating ML inference systems with different architectures and architectures. And it is based on the first call for submissions garnered more than 600 reproducible inference-performance measurements from 14 organizations, representing over 30 systems that showcase a wide range of capabilities.

...read moreread less

89 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40

Collapse