View attendee comments

INFORMATION QUALITY IN THE DATA WAREHOUSE:

Principles and Practice


Why this Course?

A manufacturing firm wasted $1 million on its data warehouse before it recognized the need for quality data architecture and for data quality control in its data warehousing processes. A major bank scrapped a $29 million data warehouse to start over from scratch. The reason? Failing to understand and avoid the pitfalls described here. This tutorial details the essential ingredients of an effective information quality environment for the data warehouse.

Mr. English describes how to assess information quality (IQ) at the data sources and in the warehouse. You learn processes for correcting defective data and for preventing recurrence of data defects. Mr. English defines how to control data movement from source to warehouse. You learn both the technical and management requirements for a sustainable information quality environment for data warehousing.

 

"This course has been the top-rated course in our last two [Data Warehousing Institute] Implementation Conferences."

- Data Warehousing Institute Leadership Conference


Learning Objectives:

You learn how to:

  • Define information quality and its meaning to data warehouse customers
  • Assess data warehouse data definition and data architecture quality
  • Assess information quality
  • Implement a data cleansing and data reengineering process
  • Audit and control the ECTL (Extract, Correction, Transformation and Load) processes
  • Describe strengths of IQ tools, their limitations and how to overcome them
  • Improve information processes to eliminate the causes of defective data
  • Implement an information quality environment for data warehousin
Audience: Data Warehouse Managers and staff, Information Quality Managers and staff, Data Administrators, Data Analysts, Database Administrators, Systems Analysts and Business Analysts, Quality Assurance Personnel, Information Stewards, Internal Auditors, Quality Program staff and business management who need quality data
Format: Lecture with discussion to reinforce the concepts
Duration:
1 Day
Pre-requisites: Basic data management concepts

 

 

Course Outline:

       

1. Business Intelligence and Information Quality

  • The high—and hidden—costs of low quality information
  • Why information quality is a critical success factor for data warehousing
  • Defining information quality
  • Three components of information quality in the data warehouse

 

2. Applying Quality Principles to the Data Warehouse

  • TQM principles and information quality
  • Information as a product
  • Data "customers" and "suppliers"
  • Components of data quality
  • Total Quality data Management (TQdM®) process for data warehousing

 

3. Information Quality Tool and Processes

  • Categories of DQ tools
  • IQ tools in the TQdM® process

 

4. Assessing Data Definition (Metadata) Quality

  • Guidelines for a quality data warehouse data model
  • Data standards quality characteristics
  • Data definition quality characteristics
  • Metrics for data definition quality
  • Assuring data definition quality

5. Assessing Information Quality

    • Business information quality characteristics
    • Metrics for information quality
    • How to take a random sample
    • Automated assessments for business rule conformance
    • Assessment tools: examples, strengths, limitations and guidelines
    • Physical data assessments for accuracy
    • Information quality interpretation and reporting techniques
    • Assuring consistency of source and warehouse data

 

6. Improving Information Product Quality: Data Reengineering and Cleansing

  • Legacy data problems
  • Defining a data clean-up strategy and process
  • Identifying data sources
  • "Discovering" business rules in legacy data
  • Mapping and reengineering legacy data to data warehouse architecture
  • Cleansing tools: examples, strengths, limitations and guidelines
  • Transformation tools: examples, strengths, limitations and guidelines
  • Audiding and controlling the ECTL (Extract, Correction, Transformation and Load) processes

 

7. Improving Information Process Quality:
Data Defect Prevention

  • The Plan-Do-Check-Act cycle for information quality improvement
  • Root cause analysis: solving the right problems
  • Planning and implementing process improvements for sustainable informaiton quality
  • Best practices in information quality

8. Implementing an Information Quality Environment for the Data Warehouse

  • Steps to start information quality
  • Steps to sustain information quality

 

 

INFORMATION IMPACT International, Inc.
871 Nialta Lane, Suite 100, Brentwood, TN 37027
Phone: +1 615-837-1211 - Fax: +1 615-837-8804
Email:
Larry.English@infoimpact.com

TQdM® is a registered trademark of INFORMATION IMPACT International, Inc.
© Copyright 1996, 1997, 1998, 1999, 2000, 2001 - INFORMATION IMPACT International Inc.
All materials on this site are protected by international copyright laws.
Reproduction in any form is prohibited without permission.
ALL RIGHTS RESERVED - Code of Ethics - Privacy Statement