Proceedings ArticleDOI

CRADLE: Cross-Backend Validation to Detect and Localize Bugs in Deep Learning Libraries

TLDR
This work proposes CRADLE, a new approach that performs cross-implementation inconsistency checking to detect bugs in DL libraries, and leverages anomaly propagation tracking and analysis to localize faulty functions in DL libraries that cause the bugs.
Abstract
Deep learning (DL) systems are widely used in domains including aircraft collision avoidance systems, Alzheimer's disease diagnosis, and autonomous driving cars. Despite the requirement for high reliability, DL systems are difficult to test. Existing DL testing work focuses on testing the DL models, not the implementations (e.g., DL software libraries) of the models. One key challenge of testing DL libraries is the difficulty of knowing the expected output of DL libraries given an input instance. Fortunately, there are multiple implementations of the same DL algorithms in different DL libraries. Thus, we propose CRADLE, a new approach that focuses on finding and localizing bugs in DL software libraries. CRADLE (1) performs cross-implementation inconsistency checking to detect bugs in DL libraries, and (2) leverages anomaly propagation tracking and analysis to localize faulty functions in DL libraries that cause the bugs. We evaluate CRADLE on three libraries (TensorFlow, CNTK, and Theano), 11 datasets (including ImageNet, MNIST, and KGS Go game), and 30 pre-trained models. CRADLE detects 12 bugs and 104 unique inconsistencies, and highlights functions relevant to the causes of inconsistencies for all 104 unique inconsistencies.
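The cross-implementation checking described above can be illustrated with a minimal sketch: run the same input through each backend's prediction function and flag backend pairs whose outputs deviate beyond a threshold. All names here are illustrative, not CRADLE's API, and the relative-error metric is a stand-in for the distance metrics the paper actually uses.

```python
import numpy as np

def relative_inconsistency(out_a, out_b, eps=1e-7):
    """Maximum element-wise relative deviation between two backend outputs.
    A simplified metric; CRADLE uses its own output-distance metrics."""
    return float(np.max(np.abs(out_a - out_b) / (np.abs(out_b) + eps)))

def check_backends(predict_fns, x, threshold=1e-3):
    """Run input x through each backend's predict function (a dict of
    name -> callable) and return pairs whose outputs are inconsistent."""
    outputs = {name: fn(x) for name, fn in predict_fns.items()}
    names = list(outputs)
    inconsistent = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            d = relative_inconsistency(outputs[names[i]], outputs[names[j]])
            if d > threshold:
                inconsistent.append((names[i], names[j], d))
    return inconsistent
```

Because a bug typically affects only one backend, the buggy implementation shows up as the common member of the inconsistent pairs, which is the intuition behind localizing it before tracking anomaly propagation through individual functions.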


Citations
Posted Content

Machine Learning Testing: Survey, Landscapes and Horizons

TL;DR: This paper provides a comprehensive survey of techniques for testing machine learning systems (Machine Learning Testing, or ML testing), covering 144 papers on testing properties, testing components, and application scenarios.
Proceedings ArticleDOI

Deep learning library testing via effective model generation

TL;DR: This work designs a series of mutation rules for DL models to explore different invoking sequences of library code and hard-to-trigger behaviors, and proposes a heuristic strategy that guides model generation toward amplifying the degree of inconsistency between different DL libraries caused by bugs.
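The guided-generation idea in this summary amounts to a search that keeps a mutant when it increases measured cross-library inconsistency. A minimal hill-climbing sketch, with hypothetical `inconsistency` and mutation-rule callables standing in for the paper's actual rules and metric:

```python
import random

def guided_mutation(model, inconsistency, mutate_rules, steps=10, seed=0):
    """Hill-climb over model mutants: apply a randomly chosen mutation
    rule and keep the mutant only if it raises the inconsistency score.
    `model` is any representation the rules understand; all names here
    are illustrative, not the paper's API."""
    rng = random.Random(seed)
    best, best_score = model, inconsistency(model)
    for _ in range(steps):
        rule = rng.choice(mutate_rules)
        candidate = rule(best)
        score = inconsistency(candidate)
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score
```

In practice the mutation rules would insert or reorder layers to exercise different library-code invoking sequences, and the score would compare outputs of the same mutant across libraries.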
Proceedings ArticleDOI

Repairing deep neural networks: fix patterns and challenges

TL;DR: This work presents a comprehensive study of bug-fix patterns for Deep Neural Networks (DNNs), investigating the challenges developers face when repairing DNNs and the patterns they use when fixing bugs manually.
Proceedings ArticleDOI

Problems and opportunities in training deep learning software systems: an analysis of variance

TL;DR: In this paper, the authors study the variance of deep learning systems and the awareness of this variance among researchers and practitioners, and find that only 19.5±3% of papers in recent top software engineering (SE), artificial intelligence (AI), and systems conferences use multiple identical training runs to quantify the variance in their DL approaches.
Proceedings ArticleDOI

Audee: automated testing for deep learning frameworks

TL;DR: Audee as discussed by the authors adopts a search-based approach and implements three different mutation strategies to generate diverse test cases by exploring combinations of model structures, parameters, weights and inputs, which is able to detect three types of bugs: logical bugs, crashes and Not-a-Number (NaN) errors.
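The three bug classes this summary names (logical bugs, crashes, and NaN errors) can be separated with a small triage step on each test execution. The sketch below is illustrative only, not Audee's implementation; detecting logical bugs would additionally require a cross-framework reference output, as in CRADLE-style inconsistency checking:

```python
import numpy as np

def triage(run_model, x):
    """Classify one test execution: an exception is a crash candidate,
    a NaN in the output is a NaN error, otherwise the output is returned
    for downstream inconsistency (logical-bug) checking."""
    try:
        out = run_model(x)
    except Exception as exc:
        return ("crash", repr(exc))
    if np.isnan(out).any():
        return ("nan", None)
    return ("ok", out)
```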
References
Journal ArticleDOI

Gradient-based learning applied to document recognition

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.
Proceedings ArticleDOI

TensorFlow: a system for large-scale machine learning

TL;DR: TensorFlow as mentioned in this paper is a machine learning system that operates at large scale and in heterogeneous environments, using dataflow graphs to represent computation, shared state, and the operations that mutate that state.
Proceedings Article

Intriguing properties of neural networks

TL;DR: It is found that there is no distinction between individual high-level units and random linear combinations of high-level units under various methods of unit analysis, suggesting that it is the space, rather than the individual units, that contains the semantic information in the high layers of neural networks.
Proceedings Article

Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning

TL;DR: In this article, the authors show that training with residual connections accelerates the training of Inception networks significantly, and they also present several new streamlined architectures for both residual and non-residual Inception Networks.