© Joshua Cook 2017

Joshua Cook, Docker for Data Science, https://doi.org/10.1007/978-1-4842-3012-1_8

8. The Data Stores

Joshua Cook

(1) Santa Monica, California, USA

In this chapter, let’s extend the discussion beyond the Jupyter Notebook server to explore open source data store technologies and how Docker can simplify working with them. I propose that Docker can streamline the process to the point that using a data store becomes practical for even the smallest of datasets. I’ll show you a series of best practices for designing and deploying data stores, practices that will be sufficient for working with all but the largest of datasets. Conforming to Docker best practice, ...
