Book description
Access real-world documentation and examples for the Spark platform for building large-scale, enterprise-grade machine learning applications.
Next-Generation Machine Learning with Spark provides a gentle introduction to Spark and Spark MLlib and advances to more powerful, third-party machine learning algorithms and libraries beyond what is available in the standard Spark MLlib library. By the end of this book, you will be able to apply your knowledge to real-world use cases through dozens of practical examples and insightful explanations.
What You Will Learn
- Be introduced to machine learning, Spark, and Spark MLlib 2.4.x
- Achieve lightning-fast gradient boosting on Spark with the XGBoost4J-Spark and LightGBM libraries
- Detect anomalies with the Isolation Forest algorithm for Spark
- Use the Spark NLP and Stanford CoreNLP libraries that support multiple languages
- Optimize your ML workload with the Alluxio in-memory data accelerator for Spark
- Use GraphX and GraphFrames for Graph Analysis
- Perform image recognition using convolutional neural networks
- Utilize the Keras framework and distributed deep learning libraries with Spark
Who This Book Is For
Data scientists and machine learning engineers who want to take their knowledge to the next level and use Spark and more powerful, next-generation algorithms and libraries beyond what is available in the standard Spark MLlib library; also serves as a primer for aspiring data scientists and engineers who need an introduction to machine learning, Spark, and Spark MLlib.
Product information
- Title: Next-Generation Machine Learning with Spark: Covers XGBoost, LightGBM, Spark NLP, Distributed Deep Learning with Keras, and More
- Author(s):
- Release date: February 2020
- Publisher(s): Apress
- ISBN: 9781484256695
You might also like
book
Scaling Machine Learning with Spark
Learn how to build end-to-end scalable machine learning solutions with Apache Spark. With this practical guide, …
book
Machine Learning with PySpark: With Natural Language Processing and Recommender Systems
Master the new features in PySpark 3.1 to develop data-driven, intelligent applications. This updated edition covers …
book
Machine Learning Using TensorFlow Cookbook
Master TensorFlow to create powerful machine learning algorithms, with valuable insights on Keras, Boosted Trees, Tabular …
book
Interpretable Machine Learning with Python - Second Edition
A deep dive into the key aspects and challenges of machine learning interpretability using a comprehensive …