Hidden technical debt in Machine learning systems

Open AccessProceedings Article

Hidden technical debt in Machine learning systems

D. Sculley, +9 more

- Vol. 28, pp 2503-2511

Chats0

TLDR

It is found it is common to incur massive ongoing maintenance costs in real-world ML systems, and several ML-specific risk factors to account for in system design are explored.

Abstract:

Machine learning offers a fantastically powerful toolkit for building useful complex prediction systems quickly. This paper argues it is dangerous to think of these quick wins as coming for free. Using the software engineering framework of technical debt, we find it is common to incur massive ongoing maintenance costs in real-world ML systems. We explore several ML-specific risk factors to account for in system design. These include boundary erosion, entanglement, hidden feedback loops, undeclared consumers, data dependencies, configuration issues, changes in the external world, and a variety of system-level anti-patterns.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Marco Tulio Ribeiro, +2 more

TL;DR: In this article, the authors propose LIME, a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem.

...read moreread less

Posted Content

Towards A Rigorous Science of Interpretable Machine Learning

Finale Doshi-Velez, +1 more

- 28 Feb 2017 -

arXiv: Machine Learning

TL;DR: This position paper defines interpretability and describes when interpretability is needed (and when it is not), and suggests a taxonomy for rigorous evaluation and exposes open questions towards a more rigorous science of interpretable machine learning.

...read moreread less

Proceedings Article

Anchors: High-Precision Model-Agnostic Explanations

Marco Tulio Ribeiro, +2 more

TL;DR: This work introduces a novel model-agnostic system that explains the behavior of complex models with high-precision rules called anchors, representing local, “sufficient” conditions for predictions, and proposes an algorithm to efficiently compute these explanations for any black-box model with high probability guarantees.

...read moreread less

Proceedings ArticleDOI

Software engineering for machine learning: a case study

Saleema Amershi, +8 more

TL;DR: A study conducted on observing software teams at Microsoft as they develop AI-based applications finds that various Microsoft teams have united this workflow into preexisting, well-evolved, Agile-like software engineering processes, providing insights about several essential engineering challenges that organizations may face in creating large-scale AI solutions for the marketplace.

...read moreread less

Journal ArticleDOI

Massive MIMO is a reality—What is next?: Five promising research directions for antenna arrays

Emil Björnson, +4 more

- 01 Nov 2019 -

Digital Signal Processing

TL;DR: In this paper, the authors explain how the first chapter of the massive MIMO research saga has come to an end, while the story has just begun, and outline five new massive antenna array related research directions.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Refactoring: Improving the Design of Existing Code

Martin Fowler

TL;DR: Almost every expert in Object-Oriented Development stresses the importance of iterative development, but how do you add function to the existing code base while still preserving its design integrity?

...read moreread less

Proceedings ArticleDOI

Refactoring improving the design of existing code

Mauricio A. Saca

TL;DR: The present document details the how, why and when to apply refactoring in computer systems that have been poorly designed, this in order to a better performance and maintenance of the constituent components.

...read moreread less

Proceedings ArticleDOI

Scaling Distributed Machine Learning with the Parameter Server

Mu Li

TL;DR: View on new challenges identified are shared, and some of the application scenarios such as micro-blog data analysis and data processing in building next generation search engines are covered.

...read moreread less

Proceedings ArticleDOI

Scaling distributed machine learning with the parameter server

Mu Li, +8 more

TL;DR: In this paper, the authors propose a parameter server framework for distributed machine learning problems, where both data and workloads are distributed over worker nodes, while the server nodes maintain globally shared parameters, represented as dense or sparse vectors and matrices.

...read moreread less

Book

AntiPatterns: Refactoring Software, Architectures, and Projects in Crisis

William H. Brown, +3 more

TL;DR: An entertaining and often enlightening text that defines what seasoned developers have long suspected: despite advances in software engineering, most software projects still fail to meet expectations--and about a third are cancelled altogether.

...read moreread less

Nature

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

Hidden technical debt in Machine learning systems

Citations

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Towards A Rigorous Science of Interpretable Machine Learning

Anchors: High-Precision Model-Agnostic Explanations

Software engineering for machine learning: a case study

Massive MIMO is a reality—What is next?: Five promising research directions for antenna arrays

References

Refactoring: Improving the Design of Existing Code

Refactoring improving the design of existing code

Scaling Distributed Machine Learning with the Parameter Server

Scaling distributed machine learning with the parameter server

AntiPatterns: Refactoring Software, Architectures, and Projects in Crisis

Related Papers (5)

Software engineering for machine learning: a case study

Scikit-learn: Machine Learning in Python

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Deep learning

ImageNet Classification with Deep Convolutional Neural Networks