IBM WebSphere Information Analyzer and Data Quality Assessment

Book description

IBM Information Server is a revolutionary new software platform that helps organizations derive more value from the complex heterogeneous information that is spread across their systems. It enables organizations to integrate disparate data and deliver trusted information wherever and whenever needed, in line and in context, to specific people, applications, and processes.

IBM WebSphere Information Analyzer is a data profiling and analysis tool that is a critical component of IBM Information Server. It is designed to help business and data analysts understand the content, quality, and structure of their data sources by automating the data discovery process. Bundled with IBM WebSphere Information Analyzer is AuditStage, a data rule monitoring tool that is designed to help business and data analysts validate data and assess ongoing data quality trends.

This book describes a usage scenario that covers all dimensions of profiling, rule building, deployment, and quality monitoring through a data integration life cycle.

Table of contents

  1. Figures
  2. Tables
  3. Examples
  4. Notices
    1. Trademarks
  5. Preface
    1. The team that wrote this book
    2. Become a published author
    3. Comments welcome
  6. Chapter 1: IBM WebSphere Information Analyzer overview
    1. Introduction
    2. Data quality assessment (DQA) methodology
      1. Data assessment approach
      2. Data assessment tools
      3. Data assessment benefits
    3. IBM WebSphere Information Analyzer architecture
    4. Main functions
    5. Main components
    6. Setting up your system
      1. SETUPSTEP1: Set up the various roles
      2. SETUPSTEP2: Configure ODBC to access data sources
      3. SETUPSTEP3: Optionally, configure Analysis Settings
      4. SETUPSTEP4: Configure data source connections
      5. SETUPSTEP5: Import metadata
      6. SETUPSTEP6: Create/open a project
      7. SETUPSTEP7: Configure the project
    7. Column analysis
      1. Column analysis functions
      2. Column analysis results
      3. Column analysis usage scenario
    8. Primary key analysis
      1. Primary key analysis functions
      2. Primary key analysis results
      3. Primary key analysis usage scenario
    9. Foreign key analysis
      1. Foreign key analysis functions
      2. Foreign key analysis results
      3. Foreign key analysis usage scenario
    10. Cross domain analysis
      1. Cross domain analysis functions
      2. Cross domain analysis results
      3. Cross domain analysis usage scenario
    11. Publish analysis results
      1. Publish an analysis result
      2. Review a published analysis result in DataStage
    12. IBM WebSphere AuditStage business rule validation
      1. Configure the DSN
      2. Business rule examples
    13. Baseline analysis
      1. Baseline analysis functions
      2. Baseline analysis results
      3. Baseline analysis usage scenario
    14. Reports
      1. Generate a report
      2. Sample reports
  7. Chapter 2: Financial services business scenario
    1. Introduction
    2. Business requirements
    3. Environment configuration
    4. General approach
      1. Step 1: General guidelines for the process
      2. Step 2: Identify differences between the sources and targets
      3. Step 3: Determine action in specific cases
      4. Step 4: Determine strategy and plan to execute action
      5. Step 5: Execute the plan
      6. Step 6: Review success of the process
    5. Migration from North American Bank systems to Northern California Bank systems
      1. Assumptions about the migration
      2. IBM WebSphere Information Analyzer features used
      3. IBM WebSphere AuditStage features used
      4. North American Bank analysis
      5. Northern California Bank analysis
      6. Migration Analysis
    6. Data integration of North American Bank and Northern California Bank systems
      1. Assumptions about data integration
      2. IBM WebSphere Information Analyzer features used
      3. IBM WebSphere AuditStage features used
      4. North American Bank non-core services analysis
      5. Northern California Bank non-core services analysis
      6. Data integration analysis
  8. Appendix A: IBM Information Server overview
    1. Introduction
    2. IBM Information Server architecture
      1. Unified user interface
      2. Common services
      3. Key integration functions
      4. Unified parallel processing
      5. Unified metadata
      6. Common connectivity
      7. Client application access to services
    3. Configuration flow
      1. Step1a: Create connection to an Information Server provider
      2. Step1b: Create a project
      3. Step1c: Create an application
      4. Step1d: Generate SOA services
      5. Step1e: Deploy SOA services
      6. Step1f: Test deployed SOA services
      7. Step1g: Optionally export service to WebSphere Service Registry and Repository
    4. Runtime flow
      1. Service artifacts
      2. Flow of a request
  9. Appendix B: IBM Information Integrator Classic Federation setup
    1. Introduction
    2. Configure ODBC data sources on the z/OS platform
  10. Appendix C: Miscellaneous tips regarding IBM WebSphere Information Analyzer
    1. General tips
  11. Appendix D: Code and scripts used in the business scenario
    1. Introduction
  12. Appendix E: Additional material
    1. Locating the Web material
    2. Using the Web material
      1. How to use the Web material
  13. Related publications
    1. IBM Redbooks
    2. Other publications
    3. Online resources
    4. How to get IBM Redbooks publications
    5. Help from IBM
  14. Index
  15. Back cover

Product information

  • Title: IBM WebSphere Information Analyzer and Data Quality Assessment
  • Author(s): Nagraj Alur, Reginald Joseph, Harshita Mehta, Jorgen Tang Nielsen, Denis Vasconcelos
  • Release date: December 2007
  • Publisher(s): IBM Redbooks