T. C. NokeriData Science Solutions with Pythonhttps://doi.org/10.1007/978-1-4842-7762-1_9

9. Principal Component Analysis with Scikit-Learn, PySpark, and H2O

Tshepo Chris Nokeri¹

(1)

Pretoria, South Africa

This chapter executes a simple dimension reducer (a principal component method) by implementing a diverse set of Python frameworks (Scikit-Learn, PySpark, and H2O). To begin, it clarifies how the method computes components.

Exploring the Principal Component Method

The principal component method is a simple dimension reducer. It carries out linear transformations on the entire data set to attain vectors (identified as eigenvalues), then identifies incremental ...

Get Data Science Solutions with Python: Fast and Scalable Models Using Keras, PySpark MLlib, H2O, XGBoost, and Scikit-Learn now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Data Science Solutions with Python: Fast and Scalable Models Using Keras, PySpark MLlib, H2O, XGBoost, and Scikit-Learn by Tshepo Chris Nokeri

9. Principal Component Analysis with Scikit-Learn, PySpark, and H2O

Exploring the Principal Component Method

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly