Examine the latest technological advancements in building a scalable machine learning model with Big Data using R. This book shows you how to work with a machine learning algorithm and use it to build a ML model from raw data.
All practical demonstrations will be explored in R, a powerful programming language and software environment for statistical computing and graphics. The various packages and methods available in R will be used to explain the topics. For every machine learning algorithm covered in this book, a 3-D approach of theory, case-study and practice will be given. And where appropriate, the mathematics will be explained through visualization in R. All the images are available in color and hi-res as part of the code download.
This new paradigm of teaching machine learning will bring about a radical change in perception for many of those who think this subject is difficult to learn. Though theory sometimes looks difficult, especially when there is heavy mathematics involved, the seamless flow from the theoretical aspects to example-driven learning provided in this book makes it easy for someone to connect the dots..
What You'll Learn
Use the model building process flow
Apply theoretical aspects of machine learning
Review industry-based cae studies
Understand ML algorithms using R
Build machine learning models using Apache Hadoop and Spark
Who This Book is For
Data scientists, data science professionals and researchers in academia who want to understand the nuances of machine learning approaches/algorithms along with ways to see them in practice using R.
The book will also benefit the readers who want to understand the technology behind implementing a scalable machine learning model using Apache Hadoop, Hive, Pig and Spark.
Table of Contents
- 1. Introduction to Machine Learning and R
- 2. Data Preparation and Exploration
- 3. Sampling and Resampling Techniques
- 4. Data Visualization in R
- 5. Feature Engineering
- 6. Machine Learning Theory and Practices
- 7. Machine Learning Model Evaluation
- 8. Model Performance Improvement
- 9. Scalable Machine Learning and Related Technologies
- Title: Machine Learning Using R
- Release date: December 2016
- Publisher(s): Apress
- ISBN: 9781484223345