S. Alla, S. K. AdariBeginning MLOps with MLFlowhttps://doi.org/10.1007/978-1-4842-6549-9_2

2. Building Models

Sridhar Alla¹ and Suman Kalyan Adari²

(1)

Delran, NJ, USA

(2)

Tampa, FL, USA

In this chapter, we will go over how to build a simple logistic regression model in both scikit-learn and PySpark. We will also go over the process of k-fold cross validation to tune a hyperparameter in scikit-learn.

Introduction

In the previous chapter, you loaded the credit card data set and analyzed the distribution of its data. You also looked at the relationships between the features and got a general idea of how heavily they influence the labels.

Now that you’ve gained a better understanding of the data set, you will proceed ...

Get Beginning MLOps with MLFlow: Deploy Models in AWS SageMaker, Google Cloud, and Microsoft Azure now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Beginning MLOps with MLFlow: Deploy Models in AWS SageMaker, Google Cloud, and Microsoft Azure by Sridhar Alla, Suman Kalyan Adari

2. Building Models

Introduction

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly