Skip to Main Content
Hands-On Deep Learning with Apache Spark
book

Hands-On Deep Learning with Apache Spark

by Guglielmo Iozzia
January 2019
Intermediate to advanced content levelIntermediate to advanced
322 pages
7h 29m
English
Packt Publishing
Content preview from Hands-On Deep Learning with Apache Spark

Data ingestion through DataVec and transformation through Spark

Data can come from many sources and in many types, for example:

  • Log files
  • Text documents
  • Tabular data
  • Images
  • Videos

When working with neural nets, the end goal is to convert each data type into a collection of numerical values in a multidimensional array. Data could also need to be pre-processed before it can be used to train or test a net. Therefore, an ETL process is needed in most cases, which is a sometimes underestimated challenge that data scientists have to face when doing ML or DL. That's when the DL4J DataVec library comes to the rescue. After data is transformed through this library API, it comes into a format (vectors) understandable by neural networks, so DataVec ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Next-Generation Machine Learning with Spark: Covers XGBoost, LightGBM, Spark NLP, Distributed Deep Learning with Keras, and More

Next-Generation Machine Learning with Spark: Covers XGBoost, LightGBM, Spark NLP, Distributed Deep Learning with Keras, and More

Butch Quinto
Apache Spark Deep Learning Cookbook

Apache Spark Deep Learning Cookbook

Ahmed Sherif, Amrith Ravindra, Michal Malohlava, Adnan Masood

Publisher Resources

ISBN: 9781788994613Supplemental Content