O'Reilly logo

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Predictive Analytics Using Oracle Data Miner

Book Description

Build Next-Generation In-Database Predictive Analytics Applications with Oracle Data Miner

“If you have an Oracle Database and want to leverage that data to discover new insights, make predictions, and generate actionable insights, this book is a must read for you! In Predictive Analytics Using Oracle Data Miner: Develop & Use Oracle Data Mining Models in Oracle Data Miner, SQL & PL/SQL, Brendan Tierney, Oracle ACE Director and data mining expert, guides you through the basic concepts of data mining and offers step-by-step instructions for solving data-driven problems using SQL Developer’s Oracle Data Mining extension. Brendan takes it full circle by showing you how to deploy advanced analytical methodologies and predictive models immediately into enterprise-wide production environments using the in-database SQL and PL/SQL functionality. Definitely a must read for any Oracle data professional!” --Charlie Berger, Senior Director Product Management, Oracle Data Mining and Advanced Analytics

Perform in-database data mining to unlock hidden insights in data. Written by an Oracle ACE Director, Predictive Analytics Using Oracle Data Miner shows you how to use this powerful tool to create and deploy advanced data mining models. Covering topics for the data scientist, Oracle developer, and Oracle database administrator, this Oracle Press guide shows you how to get started with Oracle Data Miner and build Oracle Data Miner models using SQL and PL/SQL packages. You'll get best practices for integrating your Oracle Data Miner models into applications to automate the discovery and distribution of business intelligence predictions throughout the enterprise.

  • Install and configure Oracle Data Miner for Oracle Database 11g Release 11.2 and Oracle Database 12c
  • Create Oracle Data Miner projects and workflows
  • Prepare data for data mining
  • Develop data mining models using association rule analysis, classification, clustering, regression, and anomaly detection
  • Use data dictionary views and prepare your data using in-database transformations
  • Build and use data mining models using SQL and PL/SQL packages
  • Migrate your Oracle Data Miner models, integrate them into dashboards and applications, and run them in parallel
  • Build transient data mining models with the Predictive Queries feature in Oracle Database 12c

Table of Contents

  1. Cover 
  2. About the Author
    1. About the Technical Editor
  3. Title Page
  4. Copyright Page
  5. Contents at a Glance
  6. Contents 
  7. Acknowledgments
  8. Introduction
  9. PART I: Oracle Data Miner Fundamentals
    1. Chapter 1: Oracle Data Miner
      1. In-Database Data Mining with Oracle Data Mining
      2. Oracle Advanced Analytics Option
        1. Oracle Data Mining
        2. Oracle R Enterprise (ORE)
      3. History of Data Mining in Oracle
      4. Oracle Data Mining Components
        1. Oracle Data Mining Architecture
        2. Oracle Data Miner GUI Tool
        3. Oracle Data Mining Using SQL and PL/SQL
      5. Oracle Statistical Functions
      6. Applications Powered by Oracle Data Mining
      7. How Are Customers Using Oracle Advanced Analytics
        1. Customer Success Stories
    2. Chapter 2: The Predictive Modeling Lifecycle
      1. Predictive Modeling Lifecycles
        1. Knowledge Discovery in Data (KDD) Process
        2. SEMMA
      2. CRISP-DM
        1. Business Understanding
        2. Data Understanding
        3. Data Preparation
        4. Modeling
        5. Evaluation
        6. Deployment
      3. Summary
    3. Chapter 3: How to Install, Set Up, and Get Started
      1. Enabling the Oracle Advanced Analytics Option
      2. Creating a Data Mining Tablespace
      3. Creating an ODM Schema
      4. Creating a Connection for Your DM User in SQL Developer
      5. Creating the Oracle Data Mining Repository
        1. Using SQL Developer to Create the ODM Repository
        2. Using SQL Scripts to Create the ODM Repository
      6. Setting Up Additional Users to Access ODM
      7. ODM Schema System Privileges
      8. Setting Up and Using the Pre-built Database Appliance
      9. Summary
  10. PART II: Using the Oracle Data Miner Tool
    1. Chapter 4: ODM Menus, Projects, and Workflows
      1. The ODM Menus
      2. Creating a Project
      3. Creating, Exporting, and Importing a Workflow
        1. Creating a Workflow
        2. Exporting a Workflow
        3. Importing a Workflow
      4. Adjusting the Layout
      5. The ODM Workflow Menu
      6. The Components Workflow Editor
      7. Summary
    2. Chapter 5: Exploring Your Data
      1. Gathering Statistics and Exploring Your Data
      2. Adding a Data Source
        1. Data Source Properties
      3. The Explore Data Node
        1. Building Up a Story About the Data
        2. Exploring the Data Based on Attribute Grouping
      4. Graphs
        1. Creating a Graph
      5. SQL Query Node
      6. Feature Selection
      7. Summary
    3. Chapter 6: Data Preparation
      1. Aggregate
        1. Using the Aggregation Wizard
        2. Adding a New Aggregation-Level Attribute
        3. Adding an Aggregation Expression
      2. Filter Columns
      3. Filter Columns Details
      4. Filter Rows
      5. Join
      6. Sample
      7. Transform
      8. Automatic Data Preparation (ADP)
      9. Summary
    4. Chapter 7: Association Rule Analysis
      1. What Is Association Rule Analysis?
      2. Association Rule Analysis in Oracle
      3. Building Association Rules Using ODM
      4. Defining the Data Source
      5. Creating the Association Node
        1. Model Settings
        2. Association Node Properties Tab
      6. Viewing the Association Rules
        1. Generated Rules
        2. Generated Itemsets
      7. Adding a Data Source Node for Transaction Descriptions
      8. Applying Filters to the Association Rules
      9. Outputting and Persisting the Association Rules
      10. Summary
    5. Chapter 8: Classification
      1. What Is Classification?
      2. Classification Methods Available in Oracle
      3. Building Classification Models
        1. Model Settings
        2. Property Inspector
        3. Using Different Build and Test Data Sets
        4. Creating Additional Models and Removing Unwanted Models
      4. Generating the Models
      5. Evaluating the Classification Models
        1. Performance
        2. Performance Matrix
        3. ROC
        4. Lift
        5. Profit
      6. Applying a Classification Model to New Data
      7. Summary
    6. Chapter 9: Clustering
      1. What Is Clustering?
      2. Clustering Methods Available in Oracle
      3. Building Clustering Models
        1. Model Settings
        2. Property Inspector
        3. Creating Additional Models and Removing Unwanted Models
      4. Generating the Models
      5. Evaluating the Classification Models
        1. View Your Cluster Models
        2. Cluster Details
        3. Comparing Clusters: Multicluster-Multivariable Comparison
        4. Renaming Clusters
      6. Applying a Clustering Model to New Data
      7. Summary
    7. Chapter 10: Regression
      1. What Is Regression?
      2. Regression Methods Available in Oracle
      3. Preparing Your Data for Regression
      4. Building Regression Models
        1. Regression Model Settings
        2. Property Inspector
      5. Generating the Regression Models
      6. Viewing the Regression Models
      7. Regression Model Test Results
      8. Applying a Regression Model to New Data
      9. Summary
    8. Chapter 11: Anomaly Detection
      1. What Is Anomaly Detection?
      2. Anomaly Detection in Oracle
      3. Building an Anomaly Detection Model
        1. Model Settings
        2. Property Inspector
        3. Generating the Models
      4. Evaluating the Anomaly Detection Model
      5. Applying the Model to Your Data
      6. Summary
  11. PART III: Data Mining Using SQL and PL/SQL
    1. Chapter 12: The ODM Data Dictionary, SQL, and PL/SQL Packages
      1. ODM Data Dictionary Views
      2. ODM SQL Functions
      3. ODM PL/SQL Packages
        1. DBMS_DATA_MINING PL/SQL Package
      4. Summary
    2. Chapter 13: Data Preparation
      1. Data Preparation for Data Mining
        1. Data Sampling
        2. Data Aggregation and Pivoting the Data
        3. Handling Missing Data
        4. Histograms and Binning
        5. Creating a Target Variable/Attribute
      2. Automatic Data Preparation in ODM
        1. ADP with Transformation Lists
        1. List of Package Procedures and Functions
        2. Example of Using the DBMS_DATA_MINING_TRANSFORM Package
      4. Embedding Transformation List into the Model
      5. Summary
    3. Chapter 14: Association Rule Analysis
      1. Setting Up Your Data
      2. Settings Table
      3. Creating the Association Rule Analysis Model
      4. Viewing the Association Rule Model Item Sets and Rule
        1. Viewing the Frequent Item Sets
        2. Viewing the Association Rules
      5. Summary
    4. Chapter 15: Classification
      1. Setting Up Your Data
      2. Settings Table
      3. Creating the Classification Models
      4. Evaluating the Classification Models
        1. Preparing the Data
        2. Computing the Confusion Matrix
        3. Computing the Lift
        4. Computing the ROC
      5. Applying the Model to New Data
        1. Applying the Model in Real Time
        2. Applying the Model in Batch
      6. Summary
    5. Chapter 16: Clustering
      1. Setting Up Your Data
      2. Viewing Your Existing Cluster Models
      3. Settings Table
      4. Creating a Cluster Model
      5. Examining the Cluster Model
        1. Querying the Cluster Models in Your Schema
        2. Examining the Cluster Details
      6. Applying the Cluster Model to New Data
        1. Applying the Cluster Model in Real Time
        2. Applying the Cluster Model in Batch Mode
      7. Combining Clusters
      8. Summary
    6. Chapter 17: Regression
      1. Examining the Existing Regression Model(s)
      2. Settings Table for Regression
      3. Creating a Regression Model
      4. Examining and Evaluating the Regression Models
        1. Global Statistics for a GLM Regression Model
        2. GLM Regression Model Details
        3. SVM Regression Model Details
        4. Residual Statistics
      5. Applying Regression Model to Your Data
        1. Using the Regression Model in Real Time
        2. Using the Regression Model in Batch Mode
      6. Summary
    7. Chapter 18: Anomaly Detection
      1. Examining the Existing Anomaly Detection Model(s)
      2. Settings Table
      3. Creating an Anomaly Detection Model
      4. Applying the Anomaly Detection Model to Your Data
        1. Using the Anomaly Detection Model in Real Time
        2. Using the Anomaly Detection Model in Batch Mode
      5. Summary
  12. PART IV: Migration and Implementations
    1. Chapter 19: How to Migrate Your ODM Models
      1. Oracle Data Miner Script Generation
        1. Running the ODM Workflow Scripts
      2. PL/SQL Procedures for ODM Model Migration
        1. System Privileges Needed for Exporting and Importing ODM Models
        2. Exporting an ODM Model
        3. Importing and the ODM Model
        4. Dropping an ODM Model
        5. Renaming an ODM Model
      3. Summary
    2. Chapter 20: Implementation-Related Topics
      1. How to Add Your ODM Models to Your OBI Dashboards
        1. Importing the ODM Model
        2. Creating a View to Include the ODM Model
        3. Importing the View to the Physical Layer of the BI Repository (RPD)
        4. Adding New Columns to the Business Model Layer
        5. Adding to the OBI Dashboards
      2. How to Build and Apply ODM Models in Parallel
        1. How to Run Your ODM Workflows and ODM Models in Parallel
        2. How to Run Your ODM Model in Real Time Using Parallel Query
        3. How to Run Your ODM Model in Batch Mode Using Parallel Query
      3. Predictive Queries
      4. Summary
  13. Index