O'Reilly logo

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Hands-On Machine Learning with Scala and Spark

Video Description

Implement machine learning algorithms and evaluate how well they perform with the Scala programming language

About This Video

  • Learn how to extract ML features from unstructured data for input to ML models so you can build models for any input data
  • Leverage Spark's Powerful ML toolkit to build models by learning how to choose the best model for your problem
  • Use Deep Learning methods with Apache Spark to stay on the cutting edge of ML techniques

In Detail

Programmers face multiple challenges while implementing ML; dealing with unstructured data and picking the proper ML model are among the hardest.

In this course we will go through day-to-day challenges that programmers face when implementing ML pipelines and consider different approaches and models to solve complex problems.

You will learn about the most effective machine learning techniques and implement them in your favor. You will implement algorithms in practical hands-on projects, building data models and understanding how they work by using different types of algorithm.

Each section of the course deals with a specific machine learning problem and analysis and gives you insights by using real-world datasets.

By the end of this course, you will be able to take huge datasets, extract features from it, and apply a machine learning model that is well suited to your problem.

The code bundle for the course is available at: https://github.com/PacktPublishing/Hands-On-Machine-Learning-with-Scala-and-Spark

Downloading the example code for this course: You can download the example code files for all Packt video courses you have purchased from your account at http://www.PacktPub.com. If you purchased this course elsewhere, you can visit http://www.PacktPub.com/support and register to have the files e-mailed directly to you.

Table of Contents

  1. Chapter 1 : Advanced Text Processing in Spark and Building a Classification Model
    1. The Course Overview 00:02:26
    2. Analyzing Text Input Data 00:04:14
    3. Feature Generation from Text – Count Vectorizer, TFIDF, LDA 00:03:49
    4. Extracting Features from Data – Transforming Text into Vector of Numbers 00:06:09
    5. Bag-of-Words and Skip Gram 00:04:59
    6. Training Classification Models – Implementing Word2Vect Using Apache Spark 00:04:15
  2. Chapter 2 : Building a Regression Model with Spark
    1. Logistic Regression Explanation 00:05:00
    2. Writing a Logistic Regression Model Per Author in Apache Spark 00:05:31
    3. Training Regression Model 00:04:04
    4. Key Concepts, Machine Learning Pipelines, and Operations 00:02:47
    5. Learn How to Validate Models Using Cross-Validation 00:04:47
  3. Chapter 3 : Building a Clustering Model with Spark
    1. Analyzing Time of Post Using Clustering – (GMM Explanation) 00:03:52
    2. Implementing GMM in Apache Spark 00:05:21
    3. K-Means Clustering Explanation and Use Cases 00:02:42
    4. Implementing K-Means Clustering in Apache Spark 00:04:30
    5. Measure Accuracy Using Area Under ROC 00:03:04
  4. Chapter 4 : Dimensionality Reductions and Recommendation Engines
    1. Dimensionality Reduction Using Singular Value Decomposition (SVD) 00:03:42
    2. Building Recommendation Engine in Spark Using Collaborative Filtering 00:04:59
    3. Using Recommendation Engine to Get Top Recommendations 00:05:01
    4. Dense and Sparse Vectors 00:04:19
    5. LabeledPoints, Rating, and Other Data Types 00:02:41
  5. Chapter 5 : Deep Learning with Spark
    1. The Spark versus Deep Learning Use Case 00:04:30
    2. Spark for Parallelizing Deep Learning Evaluation 00:03:24
    3. Deep Learning As a Feature Generator for Existing Spark ML Algorithms 00:03:11
    4. Spark/Deep Learning Made Simple 00:02:53