Chapter 2. Setting Up Your Notebook Environment

Each approach we discuss is beneficial to go through, as you will be exposed to various technologies and services that are useful for you to understand. The examples in this book are designed to be run in a laptop-scale environment, which means that they can run on a single node. No GPU or multinode (cluster) resources will be needed. Figure 2-1 shows the technologies that you will be working with in this book.

Once you become familiar with these technologies at this scale, you can consider investing resources in the advanced hardware that is needed to run enterprise-scale workloads. The great benefit of working with the code at this smaller scale is that the syntax is essentially unchanged on the more sophisticated hardware configurations.

The entire repository of notebooks used for this book can be found at https://github.com/nilmeier/DSatEnterpriseScale.

images/ch2Ecosystem.png
Figure 2-1. The enterprise-scale data science ecosystem covered in this book

Quick Start with Watson Studio

Watson Studio is IBM’s hosted notebook service, and you can create a free account at https://www.ibm.com/cloud/watson-studio. Other hosted notebook services can be used to run the noteooks as well, but Watson Studio offers all of the frameworks and languages that are used for this book’s examples. Once you have created an account and logged in, you can begin by creating ...

Get Data Science and Engineering at Enterprise Scale now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.