Author

Yukun Ding

Bio: Yukun Ding is an academic researcher from the University of Notre Dame. The author has contributed to research on topics including artificial neural networks and segmentation. The author has an h-index of 8 and has co-authored 25 publications receiving 324 citations. Previous affiliations of Yukun Ding include Beijing University of Posts and Telecommunications.

Papers
Journal ArticleDOI
01 Apr 2018
TL;DR: For edge inference, there are growing gaps between the computational complexity and energy efficiency required by the continued scaling of deep neural networks and the hardware capacity actually delivered by current CMOS technology scaling.
Abstract: Deep neural networks offer considerable potential across a range of applications, from advanced manufacturing to autonomous cars. A clear trend in deep neural networks is the exponential growth of network size and the associated increases in computational complexity and memory consumption. However, the performance and energy efficiency of edge inference, in which the inference (the application of a trained network to new data) is performed locally on embedded platforms that have limited area and power budgets, is bounded by technology scaling. Here we analyse recent data and show that there are increasing gaps between the computational complexity and energy efficiency required by data scientists and the hardware capacity made available by hardware architects. We then discuss various architecture and algorithm innovations that could help to bridge the gaps.

354 citations

Proceedings ArticleDOI
14 Jun 2020
TL;DR: In this paper, the authors focus on the two main use cases of uncertainty estimation, i.e., selective prediction and confidence calibration, propose new quality metrics for both, and apply them to explore the trade-off between model complexity and uncertainty estimation quality.
Abstract: Accurately estimating the uncertainty of neural network predictions is of great importance in building trusted DNN-based models, and there is increasing interest in providing accurate uncertainty estimation in applications such as security cameras and autonomous vehicles. In this paper, we focus on the two main use cases of uncertainty estimation, i.e., selective prediction and confidence calibration. We first reveal potential issues with the quality metrics commonly used for uncertainty estimation in both use cases, and propose new metrics that mitigate them. We then apply these new metrics to explore the trade-off between model complexity and uncertainty estimation quality, an analysis critically missing from the literature. Our empirical results validate the superiority of the proposed metrics and reveal interesting trends in the complexity-uncertainty trade-off.
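To make the selective-prediction use case concrete, below is a minimal sketch of a risk-coverage evaluation, a standard way selective prediction quality is measured: predictions are ranked by confidence and the error rate is tracked as coverage grows. The function name and the synthetic data are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def risk_coverage_curve(confidences, correct):
    """Selective risk at every coverage level: accept samples in order of
    decreasing confidence and track the error rate on the accepted set."""
    order = np.argsort(-confidences)                 # most confident first
    accepted_correct = correct[order].astype(float)
    n = len(accepted_correct)
    coverage = np.arange(1, n + 1) / n               # fraction of samples accepted
    risk = np.cumsum(1.0 - accepted_correct) / np.arange(1, n + 1)
    return coverage, risk

# Illustrative use on synthetic scores (not data from the paper):
rng = np.random.default_rng(0)
conf = rng.uniform(size=1000)
correct = rng.uniform(size=1000) < conf              # confident samples tend to be right
cov, risk = risk_coverage_curve(conf, correct)
print(f"selective risk at 80% coverage: {risk[int(0.8 * len(cov)) - 1]:.3f}")
```

Summary metrics of the kind the paper critiques, such as the area under this curve, can be computed directly from the returned arrays.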

34 citations

Proceedings Article
27 Sep 2018
TL;DR: This paper proves the universal approximability of quantized ReLU networks on a wide class of functions and provides upper bounds on the number of weights and the memory size for a given approximation error bound and the bit-width of weights for function-independent and function-dependent structures.
Abstract: Compression is a key step in deploying large neural networks on resource-constrained platforms. As a popular compression technique, quantization constrains the number of distinct weight values and thus reduces the number of bits required to represent and store each weight. In this paper, we study the representation power of quantized neural networks. First, we prove the universal approximability of quantized ReLU networks on a wide class of functions. Then we provide upper bounds on the number of weights and the memory size for a given approximation error bound and bit-width of weights, for both function-independent and function-dependent structures. Our results reveal that, to attain an approximation error bound of $\epsilon$, the number of weights needed by a quantized network is no more than $\mathcal{O}\left(\log^5(1/\epsilon)\right)$ times that of an unquantized network. This overhead is of much lower order than the lower bound on the number of weights needed for the error bound, supporting the empirical success of various quantization techniques. To the best of our knowledge, this is the first in-depth study of the complexity bounds of quantized neural networks.
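For readers unfamiliar with the operation being analyzed, the sketch below shows uniform symmetric weight quantization, one common way to constrain weights to a small set of distinct values. The scheme and all names are illustrative assumptions; this is not the construction used in the paper's proofs.

```python
import numpy as np

def quantize_weights(w, bits):
    """Uniform symmetric quantization to at most 2**bits - 1 distinct values
    (zero included). Purely illustrative; not the paper's construction."""
    half_levels = 2 ** (bits - 1) - 1            # positive levels on each side of zero
    scale = np.max(np.abs(w)) / half_levels      # step size of the integer grid
    q = np.clip(np.round(w / scale), -half_levels, half_levels)
    return q * scale                             # de-quantized weights actually stored

w = np.random.default_rng(1).standard_normal(256)
for b in (2, 4, 8):
    err = np.max(np.abs(w - quantize_weights(w, b)))
    print(f"{b}-bit: max quantization error {err:.4f}")
```

The error printed shrinks roughly geometrically with the bit-width, which is the trade-off between memory size and approximation error that the paper's bounds formalize.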

18 citations

25 Jan 2020
TL;DR: A novel method is presented that incorporates prediction uncertainty into the training process to maximize accuracy on the confident subset rather than accuracy on the whole dataset.
Abstract: State-of-the-art deep learning based methods have achieved remarkable performance on medical image segmentation. Their applications in the clinical setting are, however, limited by a lack of trustworthiness and reliability. Selective image segmentation has been proposed to address this issue by letting a DNN model process instances with high confidence while referring difficult ones with high uncertainty to experienced radiologists. As such, model performance is only affected by the predictions on the high-confidence subset rather than the whole dataset. Existing selective segmentation methods, however, ignore this unique property of selective segmentation and train their DNN models by optimizing accuracy on the entire dataset. Motivated by this discrepancy, we present a novel method that considers such uncertainty in the training process to maximize accuracy on the confident subset rather than accuracy on the whole dataset. Experimental results on whole-heart and great-vessel segmentation and on gland segmentation show that such a training scheme can significantly improve the performance of selective segmentation.
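A minimal sketch of the general idea of optimizing accuracy on a confident subset is shown below, as a cross-entropy loss masked to the most confident pixels. This illustrates the training principle only; it is not the authors' exact formulation, and the function name, keep ratio, and tensor shapes are all assumptions.

```python
import torch
import torch.nn.functional as F

def confident_subset_loss(logits, target, keep_ratio=0.8):
    """Cross-entropy restricted to the most confident pixels (illustrative only,
    not the paper's formulation). logits: (N, C, H, W), target: (N, H, W)."""
    with torch.no_grad():
        probs = logits.softmax(dim=1)
        confidence = probs.max(dim=1).values              # per-pixel confidence
        k = int(keep_ratio * confidence.numel())          # pixels to keep
        threshold = confidence.flatten().kthvalue(confidence.numel() - k + 1).values
        mask = confidence >= threshold                    # the confident subset
    per_pixel = F.cross_entropy(logits, target, reduction="none")
    return (per_pixel * mask).sum() / mask.sum().clamp(min=1)

# Illustrative usage on random tensors:
logits = torch.randn(2, 4, 64, 64, requires_grad=True)
target = torch.randint(0, 4, (2, 64, 64))
loss = confident_subset_loss(logits, target)
loss.backward()
```

The key design point the paper motivates is that the loss ignores the uncertain pixels that would be referred to a radiologist anyway, so training effort is spent where the model will actually be trusted.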

17 citations

Journal ArticleDOI
01 Sep 2020
TL;DR: The relationship between hardware platforms and the competency awareness of a neural network is examined, highlighting how hardware developments can impact uncertainty estimation quality and exploring the innovations required to build competency-aware neural networks on resource-constrained hardware platforms.
Abstract: The ability to estimate the uncertainty of predictions made by a neural network is essential when applying neural networks to tasks such as medical diagnosis and autonomous vehicles. The approach is of particular relevance when deploying the networks on devices with limited hardware resources, but existing competency-aware neural networks largely ignore any resource constraints. Here we examine the relationship between hardware platforms and the competency awareness of a neural network. We highlight the impact on uncertainty estimation quality of two key areas of hardware development: the increasing memory size of accelerator architectures, and device-to-device variation in the emerging devices typically used for in-memory computing. We also consider the challenges that developments in uncertainty estimation methods impose on hardware designs. Finally, we explore the innovations required in terms of hardware, software, and hardware-software co-design in order to build future competency-aware neural networks.
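One way to see why device-to-device variation matters for uncertainty estimation is a toy simulation: perturb the weights of a linear classifier with per-device conductance noise and observe how the averaged prediction's entropy shifts. Everything below (the lognormal noise model, the names, the sizes) is an illustrative assumption, not a model taken from the Perspective.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def entropy_under_variation(W, x, sigma, n_devices=100, seed=0):
    """Toy model: each 'chip' applies lognormal conductance noise to the weights;
    we measure the entropy of the device-averaged prediction. Illustrative only."""
    rng = np.random.default_rng(seed)
    probs = []
    for _ in range(n_devices):
        W_dev = W * rng.lognormal(mean=0.0, sigma=sigma, size=W.shape)
        probs.append(softmax(x @ W_dev))
    mean_p = np.mean(probs, axis=0)
    return -(mean_p * np.log(mean_p + 1e-12)).sum(axis=-1)

W = np.random.default_rng(1).standard_normal((16, 4))
x = np.random.default_rng(2).standard_normal((1, 16))
for s in (0.0, 0.1, 0.3):
    print(f"sigma={s}: predictive entropy {entropy_under_variation(W, x, s)[0]:.3f}")
```

As the assumed variation grows, the averaged prediction's entropy drifts away from the noise-free value, so a confidence score calibrated in software no longer reflects the deployed chip.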

17 citations


Cited by
Posted Content
TL;DR: A new taxonomy is proposed that provides a more comprehensive breakdown of today's space of meta-learning methods, and promising applications and successes of meta-learning, such as few-shot learning, reinforcement learning, and architecture search, are surveyed.
Abstract: The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent years. Contrary to conventional approaches to AI where tasks are solved from scratch using a fixed learning algorithm, meta-learning aims to improve the learning algorithm itself, given the experience of multiple learning episodes. This paradigm provides an opportunity to tackle many conventional challenges of deep learning, including data and computation bottlenecks, as well as generalization. This survey describes the contemporary meta-learning landscape. We first discuss definitions of meta-learning and position it with respect to related fields, such as transfer learning and hyperparameter optimization. We then propose a new taxonomy that provides a more comprehensive breakdown of the space of meta-learning methods today. We survey promising applications and successes of meta-learning such as few-shot learning and reinforcement learning. Finally, we discuss outstanding challenges and promising areas for future research.

831 citations

Journal ArticleDOI
TL;DR: By consolidating information scattered across the communication, networking, and DL areas, this survey can help readers to understand the connections between enabling technologies while promoting further discussions on the fusion of edge intelligence and intelligent edge, i.e., Edge DL.
Abstract: Ubiquitous sensors and smart devices from factories and communities are generating massive amounts of data, and ever-increasing computing power is driving the core of computation and services from the cloud to the edge of the network. As an important enabler broadly changing people’s lives, from face recognition to ambitious smart factories and cities, developments of artificial intelligence (especially deep learning, DL) based applications and services are thriving. However, due to efficiency and latency issues, the current cloud computing service architecture hinders the vision of “providing artificial intelligence for every person and every organization at everywhere”. Thus, unleashing DL services using resources at the network edge near the data sources has emerged as a desirable solution. Therefore, edge intelligence, aiming to facilitate the deployment of DL services by edge computing, has received significant attention. In addition, DL, as the representative technique of artificial intelligence, can be integrated into edge computing frameworks to build intelligent edge for dynamic, adaptive edge maintenance and management. With regard to mutually beneficial edge intelligence and intelligent edge, this paper introduces and discusses: 1) the application scenarios of both; 2) the practical implementation methods and enabling technologies, namely DL training and inference in the customized edge computing framework; 3) challenges and future trends of more pervasive and fine-grained intelligence. We believe that by consolidating information scattered across the communication, networking, and DL areas, this survey can help readers to understand the connections between enabling technologies while promoting further discussions on the fusion of edge intelligence and intelligent edge, i.e., Edge DL.

611 citations

Journal ArticleDOI
01 Jul 2019
TL;DR: A programmable neuromorphic computing chip based on passive memristor crossbar arrays integrated with analogue and digital components and an on-chip processor enables the implementation of neuromorphic and machine learning algorithms.
Abstract: Memristors and memristor crossbar arrays have been widely studied for neuromorphic and other in-memory computing applications. To achieve optimal system performance, however, it is essential to integrate memristor crossbars with peripheral and control circuitry. Here, we report a fully functional, hybrid memristor chip in which a passive crossbar array is directly integrated with custom-designed circuits, including a full set of mixed-signal interface blocks and a digital processor for reprogrammable computing. The memristor crossbar array enables online learning and forward and backward vector-matrix operations, while the integrated interface and control circuitry allow mapping of different algorithms on chip. The system supports charge-domain operation to overcome the nonlinear I–V characteristics of memristor devices through pulse width modulation and custom analogue-to-digital converters. The integrated chip offers all the functions required for operational neuromorphic computing hardware. Accordingly, we demonstrate a perceptron network, a sparse coding algorithm, and principal component analysis with an integrated classification layer using the system.
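The core operation such a chip accelerates is analog vector-matrix multiplication: input voltages on the rows and conductances at the cross-points produce column currents equal to dot products, per Ohm's and Kirchhoff's laws. The sketch below shows the idealized math; the differential-column mapping of signed weights onto non-negative conductances is an assumption of this sketch, one of several schemes used in practice.

```python
import numpy as np

def crossbar_vmm(G, v_in):
    """Idealized analog vector-matrix multiply on a memristor crossbar.

    Input voltages v_in drive the rows; the current collected on column j is
    I_j = sum_i v_i * G[i, j], i.e. a full dot product computed in one step
    in the analog domain.
    """
    return v_in @ G  # column currents

# Conductances are non-negative, so signed weights are mapped here to the
# difference of two crossbar columns (G_plus - G_minus).
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 4))               # signed weights
G_plus, G_minus = np.maximum(W, 0), np.maximum(-W, 0)
v = rng.standard_normal(8)                    # input voltage vector
I = crossbar_vmm(G_plus, v) - crossbar_vmm(G_minus, v)
print(np.allclose(I, v @ W))                  # matches the digital result: True
```

The pulse-width modulation and custom ADCs described in the abstract exist precisely because real devices deviate from this ideal linear I-V model.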

460 citations