Skip to Main Content
Apache Spark Deep Learning Cookbook
book

Apache Spark Deep Learning Cookbook

by Ahmed Sherif, Amrith Ravindra, Michal Malohlava, Adnan Masood
July 2018
Intermediate to advanced content levelIntermediate to advanced
474 pages
13h 37m
English
Packt Publishing
Content preview from Apache Spark Deep Learning Cookbook

How it works...

This section will walk through our design process for joining tables together as well as which final columns will be kept:

  1. As was mentioned in the previous section, the ratings dataframe will serve as our fact table, since it contains all the main transactions of ratings for each user over time. The columns in ratings will be used in each subsequent join with the other three tables, and to maintain a uniqueness of the columns, we will attach a _1 to the end of each column name, as seen in the following screenshot:
  1. We can now join the three lookup tables to the ratings table. The first two joins to ratings are inner joins, ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Apache Spark for Data Science Cookbook

Apache Spark for Data Science Cookbook

Padma Priya Chitturi
Learning Apache Spark 2

Learning Apache Spark 2

Muhammad Asif Abbasi

Publisher Resources

ISBN: 9781788474221Supplemental Content