Data Science Revealed: With Feature Engineering, Data Visualization, Pipeline Development, and Hyperparameter Tuning

by Tshepo Chris Nokeri

Released March 2021

Publisher(s): Apress

ISBN: 9781484268704


Book description

Get insight into data science techniques such as data engineering and visualization, statistical modeling, machine learning, and deep learning. This book teaches you how to select variables, optimize hyperparameters, develop pipelines, and train, test, and validate machine learning and deep learning models. Each chapter includes a set of examples that help you understand the concepts, assumptions, and procedures behind each model.

The book covers parametric methods, or linear models, that combat underfitting and overfitting using regularization techniques such as Lasso and Ridge. It includes complex regression analysis with time series smoothing, decomposition, and forecasting. It takes a fresh look at binary classification (logistic regression analysis) and at other classification methods such as decision trees, support vector machines, and naive Bayes. It covers the most popular non-parametric method for time-to-event data (the Kaplan-Meier estimator). It also covers ways of solving classification problems using artificial neural networks such as restricted Boltzmann machines, multi-layer perceptrons, and deep belief networks. The book discusses unsupervised clustering techniques such as the K-means method and the agglomerative and DBSCAN approaches, as well as dimension reduction techniques such as feature importance, principal component analysis, and linear discriminant analysis. Finally, it introduces driverless artificial intelligence using H2O.

After reading this book, you will be able to develop, test, validate, and optimize statistical, machine learning, and deep learning models, and to engineer, visualize, and interpret data sets.

What You Will Learn

Design, develop, train, and validate machine learning and deep learning models
Find optimal hyperparameters for superior model performance
Improve model performance using techniques such as dimension reduction and regularization
Extract meaningful insights for decision making using data visualization

Who This Book Is For

Beginning and intermediate level data scientists and machine learning engineers

