Book description
Learn how to apply test-driven development (TDD) to machine-learning algorithms—and catch mistakes that could sink your analysis. In this practical guide, author Matthew Kirk takes you through the principles of TDD and machine learning, and shows you how to apply TDD to several machine-learning algorithms, including Naive Bayesian classifiers and Neural Networks.
Machine-learning algorithms often have tests baked in, but they can’t account for human errors in coding. Rather than blindly rely on machine-learning results as many researchers have, you can mitigate the risk of errors with TDD and write clean, stable machine-learning code. If you’re familiar with Ruby 2.1, you’re ready to start.
- Apply TDD to write and run tests before you start coding
- Learn the best uses and tradeoffs of eight machine learning algorithms
- Use real-world examples to test each algorithm through engaging, hands-on exercises
- Understand the similarities between TDD and the scientific method for validating solutions
- Be aware of the risks of machine learning, such as underfitting and overfitting data
- Explore techniques for improving your machine-learning models or data extraction
Table of contents
- Preface
- 1. Test-Driven Machine Learning
- 2. A Quick Introduction to Machine Learning
- 3. K-Nearest Neighbors Classification
- 4. Naive Bayesian Classification
- 5. Hidden Markov Models
- 6. Support Vector Machines
- 7. Neural Networks
- 8. Clustering
- 9. Kernel Ridge Regression
- 10. Improving Models and Data Extraction
- 11. Putting It All Together
- Index
Product information
- Title: Thoughtful Machine Learning
- Author(s):
- Release date: October 2014
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781449374068
You might also like
book
Bayesian Statistics the Fun Way
Probability and statistics are increasingly important in a huge range of professions. But many people use …
book
Feature Engineering for Machine Learning
Feature engineering is a crucial step in the machine-learning pipeline, yet this topic is rarely examined …
book
Practical Time Series Analysis
Time series data analysis is increasingly important due to the massive production of such data through …
book
Practical Statistics for Data Scientists, 2nd Edition
Statistical methods are a key part of data science, yet few data scientists have formal statistical …