Book description
NoneTable of contents
-
Preface
- Motivation
- Origins of the Class
- Origins of the Book
- What to Expect from This Book
- How This Book Is Organized
- How to Read This Book
- How Code Is Used in This Book
- Who This Book Is For
- Prerequisites
- Supplemental Reading
- About the Contributors
- Conventions Used in This Book
- Using Code Examples
- O’Reilly Online Learning
- How to Contact Us
- Acknowledgments
- 1. Introduction: What Is Data Science?
- 2. Statistical Inference, Exploratory Data Analysis, and the Data Science Process
- 3. Algorithms
- 4. Spam Filters, Naive Bayes, and Wrangling
- 5. Logistic Regression
-
6. Time Stamps and Financial Modeling
- Kyle Teague and GetGlue
- Timestamps
- Cathy O’Neil
- Thought Experiment
-
Financial Modeling
- In-Sample, Out-of-Sample, and Causality
- Preparing Financial Data
- Log Returns
- Example: The S&P Index
- Working out a Volatility Measurement
- Exponential Downweighting
- The Financial Modeling Feedback Loop
- Why Regression?
- Adding Priors
- A Baby Model
- Exercise: GetGlue and Timestamped Event Data
- Exercise: Financial Data
- 7. Extracting Meaning from Data
-
8. Recommendation Engines: Building a User-Facing Data Product at Scale
-
A Real-World Recommendation Engine
- Nearest Neighbor Algorithm Review
- Some Problems with Nearest Neighbors
- Beyond Nearest Neighbor: Machine Learning Classification
- The Dimensionality Problem
- Singular Value Decomposition (SVD)
- Important Properties of SVD
- Principal Component Analysis (PCA)
- Alternating Least Squares
- Fix V and Update U
- Last Thoughts on These Algorithms
- Thought Experiment: Filter Bubbles
- Exercise: Build Your Own Recommendation System
-
A Real-World Recommendation Engine
- 9. Data Visualization and Fraud Detection
- 10. Social Networks and Data Journalism
- 11. Causality
- 12. Epidemiology
- 13. Lessons Learned from Data Competitions: Data Leakage and Model Evaluation
- 14. Data Engineering: MapReduce, Pregel, and Hadoop
- 15. The Students Speak
- 16. Next-Generation Data Scientists, Hubris, and Ethics
- Index
Product information
- Title: Doing Data Science
- Author(s):
- Release date:
- Publisher(s):
- ISBN: None
You might also like
book
Fundamentals of Data Engineering
Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and …
book
Designing Data-Intensive Applications
Data is at the center of many challenges in system design today. Difficult issues need to …
book
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition
Through a series of recent breakthroughs, deep learning has boosted the entire field of machine learning. …
book
The Staff Engineer's Path
For years, companies have rewarded their most effective engineers with management positions. But treating management as …