Book description
Data science libraries, frameworks, modules, and toolkits are great for doing data science, but they’re also a good way to dive into the discipline without actually understanding data science. With this updated second edition, you’ll learn how many of the most fundamental data science tools and algorithms work by implementing them from scratch.
If you have an aptitude for mathematics and some programming skills, author Joel Grus will help you get comfortable with the math and statistics at the core of data science, and with hacking skills you need to get started as a data scientist. Today’s messy glut of data holds answers to questions no one’s even thought to ask. This book provides you with the knowhow to dig those answers out.
Publisher resources
Table of contents
 Preface to the Second Edition
 Preface to the First Edition
 1. Introduction

2. A Crash Course in Python
 The Zen of Python
 Getting Python
 Virtual Environments
 Whitespace Formatting
 Modules
 Functions
 Strings
 Exceptions
 Lists
 Tuples
 Dictionaries
 Counters
 Sets
 Control Flow
 Truthiness
 Sorting
 List Comprehensions
 Automated Testing and assert
 ObjectOriented Programming
 Iterables and Generators
 Randomness
 Regular Expressions
 Functional Programming
 zip and Argument Unpacking
 args and kwargs
 Type Annotations
 Welcome to DataSciencester!
 For Further Exploration
 3. Visualizing Data
 4. Linear Algebra
 5. Statistics
 6. Probability
 7. Hypothesis and Inference
 8. Gradient Descent
 9. Getting Data
 10. Working with Data
 11. Machine Learning
 12. kNearest Neighbors
 13. Naive Bayes
 14. Simple Linear Regression
 15. Multiple Regression
 16. Logistic Regression
 17. Decision Trees
 18. Neural Networks
 19. Deep Learning
 20. Clustering
 21. Natural Language Processing
 22. Network Analysis
 23. Recommender Systems
 24. Databases and SQL
 25. MapReduce
 26. Data Ethics
 27. Go Forth and Do Data Science
 Index
Product information
 Title: Data Science from Scratch, 2nd Edition
 Author(s):
 Release date: April 2019
 Publisher(s): O'Reilly Media, Inc.
 ISBN: 9781492041085
You might also like
book
SQL for Data Analysis
With the explosion of data, computing power, and cloud data warehouses, SQL has become an even …
book
Learning SQL, 3rd Edition
As data floods into your company, you need to put it to work right away—and SQL …
book
Machine Learning with Python Cookbook
This practical guide provides nearly 200 selfcontained recipes to help you solve machine learning challenges you …
book
Practical Time Series Analysis
Time series data analysis is increasingly important due to the massive production of such data through …