© Sridhar Alla, Suman Kalyan Adari 2021
S. Alla, S. K. AdariBeginning MLOps with MLFlowhttps://doi.org/10.1007/978-1-4842-6549-9_2

2. Building Models

Sridhar Alla1   and Suman Kalyan Adari2
(1)
Delran, NJ, USA
(2)
Tampa, FL, USA
 

In this chapter, we will go over how to build a simple logistic regression model in both scikit-learn and PySpark. We will also go over the process of k-fold cross validation to tune a hyperparameter in scikit-learn.

Introduction

In the previous chapter, you loaded the credit card data set and analyzed the distribution of its data. You also looked at the relationships between the features and got a general idea of how heavily they influence the labels.

Now that you’ve gained a better understanding of the data set, you will proceed ...

Get Beginning MLOps with MLFlow: Deploy Models in AWS SageMaker, Google Cloud, and Microsoft Azure now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.