Book description
As enterprise-scale data science sharpens its focus on data-driven decision making and machine learning, new tools have emerged to help facilitate these processes. This practical ebook shows data scientists and enterprise developers how the notebook interface, Apache Spark, and other collaboration tools are particularly well suited to bridge the communication gap between their teams.
Through a series of real-world examples, author Jerome Nilmeier demonstrates how to generate a model that enables data scientists and developers to share ideas and project code. You’ll learn how data scientists can approach real-world business problems with Spark and how developers can then implement the solution in a production environment.
- Dive deep into data science technologies, including Spark, TensorFlow, and the Jupyter Notebook
- Learn how Spark and Python notebooks enable data scientists and developers to work together
- Explore how the notebook environment works with Spark SQL for structured data
- Use notebooks and Spark as a launchpad to pursue supervised, unsupervised, and deep learning data models
- Learn additional Spark functionality, including graph analysis and streaming
- Explore the use of analytics in the production environment, particularly when creating data pipelines and deploying code
Table of contents
- Foreword
- Preface
- 1. Sharing Information Across Disciplines in the Enterprise
- 2. Setting Up Your Notebook Environment
- 3. Data Science Technologies
- 4. Introduction to Machine Learning
- 5. Classic Machine Learning Examples and Applications
- 6. Advanced Machine Learning Examples and Applications
Product information
- Title: Data Science and Engineering at Enterprise Scale
- Author(s):
- Release date: April 2019
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492039334
You might also like
book
Managing Data Science
Understand data science concepts and methodologies to manage and deliver top-notch solutions for your organization Key …
book
The Applied Data Science Workshop - Second Edition
Designed with beginners in mind, this workshop helps you make the most of Python libraries and …
book
Machine Learning in Production: Developing and Optimizing Data Science Workflows and Applications
The typical data science task in industry starts with an “ask” from the business. But few …
video
Building Effective Data Science Infrastructure in 30 Minutes
Ville Tuulos demonstrates how to develop and deploy a production-grade machine learning application on the fly, …