© Joshua Cook 2017

Joshua Cook, Docker for Data Science, https://doi.org/10.1007/978-1-4842-3012-1_1

1. Introduction

Joshua Cook

(1)Santa Monica, California, USA

The typical data scientist consistently has a series of extremely complicated problems on their mind beyond considerations stemming from their system infrastructure. Still, it is inevitable that infrastructure issues will present themselves. To oversimplify, we might draw a distinction between the “modeling problem” and the “engineering problem.” The data scientist is uniquely qualified to solve the former, but can often come up short in solving the latter.

Docker has been widely adopted by the system administrator and DevOps community as a modern solution to the challenges presented in ...

Get Docker for Data Science: Building Scalable and Extensible Data Infrastructure Around the Jupyter Notebook Server now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.