Book description
Written for anyone involved in the data preparation process for analytics, Gerhard Svolba's Data Preparation for Analytics Using SAS offers practical advice in the form of SAS coding tips and tricks, and provides the reader with a conceptual background on data structures and considerations from a business point of view. The tasks addressed include viewing analytic data preparation in the context of its business environment, identifying the specifics of predictive modeling for data mart creation, understanding the concepts and considerations of data preparation for time series analysis, using various SAS procedures and SAS Enterprise Miner for scoring, creating meaningful derived variables for all data mart types, using powerful SAS macros to make changes among the various data mart structures, and more!
Table of contents
- Preface
-
Part 1 Data Preparation: Business Point of View
- Chapter 1 Analytic Business Questions
-
Chapter 2 Characteristics of Analytic Business Questions
- 2.1 Introduction
- 2.2 Analysis Complexity: Real Analytic or Reporting?
- 2.3 Analysis Paradigm: Statistics or Data Mining?
- 2.4 Data Preparation Paradigm: As Much Data As Possible or Business Knowledge First?
- 2.5 Analysis Method: Supervised or Unsupervised?
- 2.6 Scoring Needed: Yes/No?
- 2.7 Periodicity of Analysis: One-Shot Analysis or Re-run Analysis?
- 2.8 Need for Historic Data: Yes/No?
- 2.9 Data Structure: One-Row-per-Subject or Multiple-Rows-per-Subject?
- 2.10 Complexity of the Analysis Team
- 2.11 Conclusion
- Chapter 3 Characteristics of Data Sources
- Chapter 4 Different Points of View on Analytic Data Preparation
-
Part 2 Data Structures and Data Modeling
- Chapter 5 The Origin of Data
- Chapter 6 Data Models
- Chapter 7 Analysis Subjects and Multiple Observations
- Chapter 8 The One Row-per-Subject Data Mart
- Chapter 9 The Multiple-Rows-per-Subject Data Mart
- Chapter 10 Data Structures for Longitudinal Analysis
- Chapter 11 Considerations for Data Marts
- Chapter 12 Considerations for Predictive Modeling
-
Part 3 Data Mart Coding and Content
- Chapter 13 Accessing Data
- Chapter 14 Transposing One- and Multiple-Rows-per-Subject Data Structures
- Chapter 15 Transposing Longitudinal Data
- Chapter 16 Transformations of Interval-Scaled Variables
- Chapter 17 Transformations of Categorical Variables
- Chapter 18 Multiple Interval-Scaled Observations per Subject
-
Chapter 19 Multiple Categorical Observations per Subject
- 19.1 Introduction
- 19.2 Absolute and Relative Frequencies of Categories
- 19.3 Concatenating Absolute and Relative Frequencies
- 19.4 Calculating Total and Distinct Counts of the Categories
- 19.5 Using ODS to Create Different Percent Variables
- 19.6 Business Interpretation of Percentage Variables
- 19.7 Other Methods
- Chapter 20 Coding for Predictive Modeling
- Chapter 21 Data Preparation for Multiple-Rows-per-Subject and Longitudinal Data Marts
-
Part 4 Sampling, Scoring, and Automation
- Chapter 22 Sampling
-
Chapter 23 Scoring and Automation
- 23.1 Introduction
- 23.2 Scoring Process
- 23.3 Explicitly Calculating the Score Values from Parameters and Input Variables
- 23.4 Using the Respective SAS/STAT Procedure for Scoring
- 23.5 Scoring with PROC SCORE of SAS/STAT
- 23.6 Using the Respective SAS/ETS Procedure for Scoring
- 23.7 The Score Code That Can Be Produced in SAS Enterprise Miner
- 23.8 The Pre-checks on the Data That Are Useful before Scoring
- 23.9 Automation of Data Mart Creation in General
- Chapter 24 Do’s and Don’ts When Building Data Marts
-
Part 5 Case Studies
- Chapter 25 Case Study 1—Building a Customer Data Mart
- Chapter 26 Case Study 2—Deriving Customer Segmentation Measures from Transactional Data
-
Chapter 27 Case Study 3—Preparing Data for Time Series Analysis
- 27.1 Introduction
- 27.2 The Business Context
- 27.3 The Data
- 27.4 From Transactional Data to the Most Appropriate Aggregation
- 27.5 Comparing PROC SQL, PROC MEANS, and PROC TIMESERIES
- 27.6 Additional Aggregations
- 27.7 Derived Variables
- 27.8 Creating Observations for Future Months
- 27.9 The Results and Their Usage
- Chapter 28 Case Study 4—Data Preparation in SAS Enterprise Miner
-
Appendix A Data Structures from a SAS Procedure Point of View
- A.1 Introduction
- A.2 Relationship between Data Mart Elements and SAS Procedure Statements
- A.3 Data Mart Structure Requirements for Selected Base SAS Procedures
- A.4 Data Mart Structure Requirements for Selected SAS/STAT Procedures
- A.5 Data Mart Structure Requirements forSelected SAS/ETS Procedures
- A.6 Data Mart Structure Requirements for Selected SAS/QC Procedures
- A.7 Data Mart Structure Requirements for Selected SAS/GRAPH Procedures
- A.8 Data Mart Structure Requirements for SAS Enterprise Miner Nodes
-
Appendix B The Power of SAS for Analytic Data Preparation
- B.1 Motivation
- B.2 Overview
- B.3 Extracting Data from Source Systems
- B.4 Changing the Data Mart Structure: Transposing
- B.5 Data Management for Longitudinal and Multiple-Rows-per-Subject Data Sets
- B.6 Selected Features of the SAS Language for Data Management
- B.7 Benefits of the SAS Macro Language
- B.8 Matrix Operations with SAS/IML
- Appendix C Transposing with DATA Steps
- Glossary
- Index
Product information
- Title: Data Preparation for Analytics Using SAS
- Author(s):
- Release date: November 2006
- Publisher(s): SAS Institute
- ISBN: 9781629597904
You might also like
book
Data Quality for Analytics Using SAS
Analytics offers many capabilities and options to measure and improve data quality, and SAS is perfectly …
book
Big Data Analytics with SAS
Leverage the capabilities of SAS to process and analyze Big Data About This Book Combine SAS …
book
Practical Business Analytics Using SAS: A Hands-on Guide
Practical Business Analytics Using SAS: A Hands-on Guide shows SAS users and businesspeople how to analyze …
book
SAS Visual Analytics for SAS Viya
Discover your data and create reports in an elegant and intuitive web-based interface! SAS Visual Analytics …