Low-Code AI

Book description

Take a data-first and use-case-driven approach with Low-Code AI to understand machine learning and deep learning concepts. This hands-on guide presents three problem-focused ways to learn ML: no-code using AutoML, low-code using BigQuery ML, and custom code using scikit-learn and Keras. In each case, you'll learn key ML concepts by using real-world datasets with realistic problems.

Business and data analysts get a project-based introduction to ML/AI using a detailed, data-driven approach: loading and analyzing data; feeding data into an ML model; building, training, and testing the model; and deploying it into production. Authors Michael Abel and Gwendolyn Stripling show you how to build machine learning models for retail, healthcare, financial services, energy, and telecommunications.
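
To make that workflow concrete, here is a minimal scikit-learn sketch of the custom-code path: load data with pandas, inspect it, train a linear regression model, and evaluate it on held-out data. The file name (advertising.csv) and label column (sales) are hypothetical placeholders for illustration, not a dataset shipped with the book.

    # Minimal sketch of the load -> analyze -> train -> evaluate workflow.
    # The file path and column names below are hypothetical placeholders.
    import pandas as pd
    from sklearn.linear_model import LinearRegression
    from sklearn.metrics import mean_squared_error
    from sklearn.model_selection import train_test_split

    # Load and inspect the data with pandas.
    df = pd.read_csv("advertising.csv")  # hypothetical file
    print(df.describe())

    # Separate the features from the label to predict.
    X = df.drop(columns=["sales"])  # hypothetical label column
    y = df["sales"]

    # Hold out a test set so evaluation reflects unseen data.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=42
    )

    # Train a simple linear regression model.
    model = LinearRegression()
    model.fit(X_train, y_train)

    # Evaluate on the held-out test set.
    rmse = mean_squared_error(y_test, model.predict(X_test)) ** 0.5
    print(f"Test RMSE: {rmse:.2f}")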

You'll learn how to:

  • Distinguish between structured and unstructured data and the challenges they present
  • Visualize and analyze data
  • Preprocess data for input into a machine learning model
  • Differentiate between regression and classification supervised learning models
  • Compare different ML model types and architectures, from no code to low code to custom training
  • Design, implement, and tune ML models (see the Keras sketch after this list)
  • Export data to a GitHub repository for data management and governance
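
And as a taste of the deep learning track, here is a comparably small Keras sketch of designing and training a classifier. The synthetic data, layer sizes, and training settings are illustrative assumptions, not a model from the book.

    # Minimal sketch of a small Keras binary classifier.
    # The synthetic data and layer sizes are illustrative assumptions.
    import numpy as np
    from tensorflow import keras

    # Synthetic stand-in data: 1,000 examples, 10 numeric features, binary label.
    X = np.random.rand(1000, 10).astype("float32")
    y = (X.sum(axis=1) > 5).astype("float32")

    # A small feed-forward network with one hidden layer.
    model = keras.Sequential([
        keras.layers.Input(shape=(10,)),
        keras.layers.Dense(16, activation="relu"),
        keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])

    # Train briefly, holding out 20% of the data for validation.
    model.fit(X, y, epochs=5, validation_split=0.2, verbose=0)
    print(model.evaluate(X, y, verbose=0))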

Table of contents

  1. Preface
    1. Who Should Read This Book?
    2. What Is and Isn’t in This Book
    3. Conventions Used in This Book
    4. Using Code Examples
    5. O’Reilly Online Learning
    6. How to Contact Us
    7. Acknowledgments
  2. 1. How Data Drives Decision Making in Machine Learning
    1. What Is the Goal or Use Case?
    2. An Enterprise ML Workflow
      1. Defining the Business Objective or Problem Statement
      2. Data Collection
      3. Data Preprocessing
      4. Data Analysis
      5. Data Transformation and Feature Selection
      6. Researching the Model Selection or Using AutoML (a No-Code Solution)
      7. Model Training, Evaluation, and Tuning
      8. Model Testing
      9. Model Deployment (Serving)
      10. Maintaining Models
    3. Summary
  3. 2. Data Is the First Step
    1. Overview of Use Cases and Datasets Used in the Book
      1. 1. Retail: Product Pricing
      2. 2. Healthcare: Heart Disease Campaign
      3. 3. Energy: Utility Campaign
      4. 4. Insurance: Advertising Media Channel Sales Prediction
      5. 5. Financial: Fraud Detection
      6. 6. Energy: Power Production Prediction
      7. 7. Telecommunications: Customer Churn Prediction
      8. 8. Automotive: Improving Custom Model Performance
    2. Data and File Types
      1. Quantitative and Qualitative Data
      2. Structured, Unstructured, and Semistructured Data
      3. Data File Types
      4. How Data Is Processed
    3. An Overview of GitHub and Google’s Colab
      1. Use GitHub to Create a Data Repository for Your Projects
      2. Using Google’s Colaboratory for Low-Code AI Projects
    4. Summary
  4. 3. Machine Learning Libraries and Frameworks
    1. No-Code AutoML
    2. How AutoML Works
    3. Machine Learning as a Service
    4. Low-Code ML Frameworks
      1. SQL ML Frameworks
      2. Open Source ML Libraries
    5. Summary
  5. 4. Use AutoML to Predict Advertising Media Channel Sales
    1. The Business Use Case: Media Channel Sales Prediction
      1. Project Workflow
      2. Project Dataset
    2. Exploring the Dataset Using Pandas, Matplotlib, and Seaborn
      1. Load Data into a Pandas DataFrame in a Google Colab Notebook
      2. Explore the Advertising Dataset
    3. Use AutoML to Train a Linear Regression Model
      1. No-Code Using Vertex AI
      2. Create a Managed Dataset in Vertex AI
      3. Select the Model Objective
      4. Build the Training Model
      5. Evaluate Model Performance
      6. Model Feature Importance (Attribution)
      7. Get Predictions from Your Model
    4. Summary
  6. 5. Using AutoML to Detect Fraudulent Transactions
    1. The Business Use Case: Fraud Detection for Financial Transactions
      1. Project Workflow
      2. Project Dataset
    2. Exploring the Dataset Using Pandas, Matplotlib, and Seaborn
      1. Loading Data into a Pandas DataFrame in a Google Colab Notebook
      2. Exploring the Dataset
      3. Exporting the Dataset
    3. Classification Models and Metrics
    4. Using AutoML to Train a Classification Model
      1. Creating a Managed Dataset and Selecting the Model Objective
      2. Exploring Dataset Statistics
      3. Training the Model
      4. Evaluating Model Performance
      5. Model Feature Importances
      6. Getting Predictions from Your Model
    5. Summary
  7. 6. Using BigQuery ML to Train a Linear Regression Model
    1. The Business Use Case: Power Plant Production
    2. Cleaning the Dataset Using SQL in BigQuery
      1. Loading a Dataset into BigQuery
      2. Exploring Data in BigQuery Using SQL
    3. Linear Regression Models
      1. Feature Selection and Correlation
      2. Google Colaboratory
      3. Plotting Feature Relationships to the Label
      4. The CREATE MODEL Statement in BigQuery ML
    4. Introducing Explainable AI
      1. Explainable AI in BigQuery ML
      2. Exercises
    5. Neural Networks in BigQuery ML
      1. Brief Overview of Neural Networks
      2. Activation Functions and Nonlinearity
      3. Training a Deep Neural Network in BigQuery ML
      4. Exercises
    6. Deep Dive: Using Cloud Shell to View Your Cloud Storage File
    7. Summary
  8. 7. Training Custom ML Models in Python
    1. The Business Use Case: Customer Churn Prediction
    2. Choosing Among No-Code, Low-Code, or Custom Code ML Solutions
    3. Exploring the Dataset Using Pandas, Matplotlib, and Seaborn
      1. Loading Data into a Pandas DataFrame in a Google Colab Notebook
      2. Understanding and Cleaning the Customer Churn Dataset
      3. Transforming Features Using Pandas and Scikit-Learn
    4. Building a Logistic Regression Model Using Scikit-Learn
      1. Logistic Regression
      2. Training and Evaluating a Model in Scikit-Learn
      3. Classification Evaluation Metrics
      4. Serving Predictions with a Trained Model in Scikit-Learn
      5. Pipelines in Scikit-Learn: An Introduction
    5. Building a Neural Network Using Keras
      1. Introduction to Keras
      2. Training a Neural Network Classifier Using Keras
    6. Building Custom ML Models on Vertex AI
    7. Summary
  9. 8. Improving Custom Model Performance
    1. The Business Use Case: Used Car Auction Prices
    2. Model Improvement in Scikit-Learn
      1. Loading the Notebook with the Preexisting Model
      2. Loading the Datasets and the Training-Validation-Test Data Split
      3. Exploring the Scikit-Learn Linear Regression Model
      4. Feature Engineering and Improving the Preprocessing Pipeline
      5. Hyperparameter Tuning
    3. Model Improvement in Keras
      1. Introduction to Preprocessing Layers in Keras
      2. Creating the Dataset and Preprocessing Layers for Your Model
      3. Building a Neural Network Model
      4. Hyperparameter Tuning in Keras
    4. Hyperparameter Tuning in BigQuery ML
      1. Loading and Transforming Car Auction Data
      2. Training a Linear Regression Model and Using the TRANSFORM Clause
      3. Configure a Hyperparameter Tuning Job in BigQuery ML
    5. Options for Hyperparameter Tuning Large Models
      1. Vertex AI Training and Tuning
      2. Automatic Model Tuning with Amazon SageMaker
      3. Azure Machine Learning
    6. Summary
  10. 9. Next Steps in Your AI Journey
    1. Going Deeper into Data Science
      1. Working with Unstructured Data
      2. Generative AI
      3. Explainable AI
    2. ML Operations
    3. Continuous Training and Evaluation
    4. Summary
  11. Index
  12. About the Authors

Product information

  • Title: Low-Code AI
  • Author(s): Gwendolyn Stripling, Michael Abel
  • Release date: September 2023
  • Publisher(s): O'Reilly Media, Inc.
  • ISBN: 9781098146825