Index
A
accuracy_score() function
Alternating Least Squares (ALS)
Amazon Web Services (AWS)
Apache Airflow
Apache Hadoop
Apache Hadoop YARN
Apache Mesos
Apache Spark
Area Under ROC (AUC)
Area Under the Curve (AUC)
B
“Black-box” algorithms
Black-box models
Breast Cancer Wisconsin dataset
built-in print() function
build() method
build_random_forest() function
C
Cloud-based deployment
Colab notebook
col() function
Collaborative filtering
collect() method
confusion_matrix() function
Content-based filtering
count() method
createDataFrame() function
createDataFrame() method
create_pandas_dataframe() function
D
Databricks
DataFrame() constructor
DataFrame() function
Decision tree classification
    advantages
    dataset
    imbalanced ...
