Book description
The typical data science task in industry starts with an “ask” from the business. But few data scientists have been taught what to do with that ask. This book shows them how to assess it in the context of the business’s goals, reframe it to work optimally for both the data scientist and the employer, and then execute on it. Written by two of the experts who’ve achieved breakthrough optimizations at BuzzFeed, it’s packed with real-world examples that take you from start to finish: from ask to actionable insight.
Andrew Kelleher and Adam Kelleher walk you through well-formed, concrete principles for approaching common data science problems, giving you an easy-to-use checklist for effective execution. Using their principles and techniques, you’ll gain a deeper understanding of your data, learn how to analyze noise and confounding variables so they don’t compromise your analysis, and save weeks of iterative improvement by planning your projects more effectively up front.
Once you’ve mastered their principles, you’ll put them to work in two realistic, beginning-to-end site optimization tasks. These extended examples come complete with reusable code examples and recommended open-source solutions designed for easy adaptation to your everyday challenges. They will be especially valuable for anyone seeking their first data science job – and everyone who’s found that job and wants to succeed in it.
Table of contents
- Cover
- About This E-Book
- Title Page
- Copyright Page
- Dedication
- Contents
- Foreword
- Preface
- About the Authors
- I: Principles of Framing
- II: Algorithms and Architectures
- III: Bottlenecks and Optimizations
- Bibliography
- Index
- Credits
- Code Snippets
Product information
- Title: Machine Learning in Production: Developing and Optimizing Data Science Workflows and Applications
- Author(s): Andrew Kelleher, Adam Kelleher
- Release date: May 2019
- Publisher(s): Addison-Wesley Professional
- ISBN: 9780134116556
You might also like
book
Foundational Python for Data Science
Data science and machine learning, two of the world’s hottest fields, are attracting talent from a …
book
Data Science with Python and Dask
Dask is a native parallel analytics tool designed to integrate seamlessly with the libraries you’re already …
book
Machine Learning for Streaming Data with Python
Apply machine learning to streaming data with the help of practical examples, and deal with challenges …
book
Data Mining for Business Analytics
Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python presents an applied approach to …