Hidden Technical Debt in Machine Learning Systems

Part of Advances in Neural Information Processing Systems 28 (NIPS 2015)

Authors

D. Sculley, Gary Holt, Daniel Golovin, Eugene Davydov, Todd Phillips, Dietmar Ebner, Vinay Chaudhary, Michael Young, Jean-François Crespo, Dan Dennison

Abstract

Machine learning offers a fantastically powerful toolkit for building useful complexp rediction systems quickly. This paper argues it is dangerous to think of these quick wins as coming for free. Using the software engineering framework of technical debt, we find it is common to incur massive ongoing maintenance costs in real-world ML systems. We explore several ML-specific risk factors to account for in system design. These include boundary erosion, entanglement, hidden feedback loops, undeclared consumers, data dependencies, configuration issues, changes in the external world, and a variety of system-level anti-patterns.

Read Full Document Here

Hidden Technical Debt in Machine Learning Systems

Authors

Abstract

Previous PostCase Study - How to Reduce Churn and Increase Revenue

Next PostRoadmap to (Truly) being a data-driven company, Combining Data Science, KPIs and Experimentation