Building Making It Happen
Establishing Making-it-Happen as ‘Formal & Measurable’ Business Discipline

Data Warehouse Data Quality Assurance

Data warehouse operations run mostly as batch processes. Validations are designed into these batches to ensure that data keeps its integrity on its journey from the source system to the DW repository.

The subject of 'Data Quality' is covered comprehensively in earlier sections as a 'Foundation Element' that applies equally across the data warehouse, source systems, destination systems and all other data repositories.

The 'Data Quality' section provides about 90% of the input you need for a data warehouse. This page looks only at the items specific to the data warehouse. Data quality assurance is handled during the extraction and transformation processes, and through improvements in the source systems for upstream quality. The data quality assurance actions described here are mainly multi-cornered reconciliation routines, which compare the same data at several stages of its path.

There are no specific rules or guidelines on when to reconcile and check. It depends on the complexity of the staging process, your confidence in it, and the criticality of the data. The quality assurance steps include:

Data Quality Assurance between Source Systems and Staging Areas

This validates that the fact and dimension values have not changed at the base level between the source systems and the post-transformation staging data. The typical reconciliation consists of aggregate checks on all tables and sample checks on specific values.
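As a minimal sketch of the two kinds of check described above, the following uses an in-memory SQLite database. The table names (`src_orders`, `stg_orders`), the columns and the helper functions `aggregate_check` and `sample_check` are hypothetical and chosen only for illustration; a real warehouse would run equivalent queries against its own source and staging connections.

```python
import sqlite3

def aggregate_check(conn, source_table, staging_table, measure):
    """Compare row count and the sum of one measure between two tables."""
    results = {}
    for label, table in (("source", source_table), ("staging", staging_table)):
        # Table and column names are trusted identifiers here, not user input.
        row = conn.execute(
            f"SELECT COUNT(*), COALESCE(SUM({measure}), 0) FROM {table}"
        ).fetchone()
        results[label] = {"rows": row[0], "total": row[1]}
    results["match"] = results["source"] == results["staging"]
    return results

def sample_check(conn, source_table, staging_table, key, measure, sample_keys):
    """Return the sampled keys whose measure differs between the two tables."""
    mismatched = []
    for k in sample_keys:
        values = [
            conn.execute(
                f"SELECT {measure} FROM {table} WHERE {key} = ?", (k,)
            ).fetchone()
            for table in (source_table, staging_table)
        ]
        if values[0] != values[1]:
            mismatched.append(k)
    return mismatched

# Hypothetical demo data: one order amount is altered in staging.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE src_orders (order_id INTEGER PRIMARY KEY, amount INTEGER);
    CREATE TABLE stg_orders (order_id INTEGER PRIMARY KEY, amount INTEGER);
    INSERT INTO src_orders VALUES (1, 100), (2, 250), (3, 75);
    INSERT INTO stg_orders VALUES (1, 100), (2, 205), (3, 75);
""")

summary = aggregate_check(conn, "src_orders", "stg_orders", "amount")
bad_keys = sample_check(conn, "src_orders", "stg_orders", "order_id", "amount", [1, 2])
```

The aggregate check flags that something changed (the totals differ even though the row counts agree), while the sample check points to a specific record. The two are complementary, which is why both appear in a typical reconciliation.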

Completion of Data Transformation in terms of Data-Sets, Attributes and Facts

Many people question the need for this check, since the scripts that run already take care of it. However, if you have good metadata, you have one more reference against which to check that all dimensions, attributes and facts were created.
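One way to use metadata as that extra reference is to compare a list of expected tables and columns with what actually exists after transformation. The sketch below assumes the metadata can be expressed as a simple dictionary; the `EXPECTED` structure, the table names and the `missing_objects` helper are hypothetical, and SQLite's `PRAGMA table_info` stands in for whatever catalog query your database offers.

```python
import sqlite3

# Hypothetical metadata: each expected table and the columns it should contain.
EXPECTED = {
    "dim_customer": {"customer_key", "customer_name", "region"},
    "fact_sales": {"customer_key", "sale_date", "amount"},
}

def missing_objects(conn, expected):
    """Return {table: missing columns}; a missing table reports all its columns."""
    missing = {}
    for table, columns in expected.items():
        # PRAGMA table_info returns one row per column; index 1 is the column name.
        actual = {row[1] for row in conn.execute(f"PRAGMA table_info({table})")}
        gap = columns - actual
        if gap:
            missing[table] = gap
    return missing

# Demo schema in which the 'region' attribute was never created.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_customer (customer_key INTEGER, customer_name TEXT);
    CREATE TABLE fact_sales (customer_key INTEGER, sale_date TEXT, amount INTEGER);
""")
gaps = missing_objects(conn, EXPECTED)
```

An empty result means every dimension, attribute and fact named in the metadata was created; anything else lists exactly what is missing.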

Data Quality Assurance between the Staging Area and Loaded Area

This step verifies the completeness of loading. One can run a script to compare the aggregate values between the staged data sets and the loaded area.
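Such a script might compare totals per group rather than a single grand total, so that a shortfall can be traced to a particular slice of the data. The following is an illustrative sketch only: the rows are plain dictionaries, and the field names (`region`, `amount`) and the helpers `totals_by_group` and `load_differences` are assumptions, not part of any specific tool.

```python
from collections import defaultdict

def totals_by_group(rows, group_field, measure_field):
    """Sum one measure per group value."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[group_field]] += row[measure_field]
    return dict(totals)

def load_differences(staged_rows, loaded_rows, group_field, measure_field):
    """Return {group: (staged_total, loaded_total)} for groups that disagree."""
    staged = totals_by_group(staged_rows, group_field, measure_field)
    loaded = totals_by_group(loaded_rows, group_field, measure_field)
    diffs = {}
    # The union of keys also catches groups present on only one side.
    for group in staged.keys() | loaded.keys():
        staged_total = staged.get(group, 0)
        loaded_total = loaded.get(group, 0)
        if staged_total != loaded_total:
            diffs[group] = (staged_total, loaded_total)
    return diffs

# Hypothetical demo: one 'East' row failed to load.
staged = [
    {"region": "East", "amount": 100},
    {"region": "East", "amount": 50},
    {"region": "West", "amount": 70},
]
loaded = [
    {"region": "East", "amount": 100},
    {"region": "West", "amount": 70},
]
diffs = load_differences(staged, loaded, "region", "amount")
```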

Data Quality Assurance between the Loaded Area and Destination Systems

Check the data congruency between the loaded area and the destination systems: the desktop-level cubes, the reports, and the data modeling systems' databases.

Data Quality Assurance between the Destination Systems and Source Systems

Check the data congruency between the source production systems and the desktop-level cubes, the reports, and the data modeling systems' databases.
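Taken together, the steps above compare the same figures at every corner of the pipeline. A minimal sketch of that idea follows, assuming each stage has already produced one comparable total for a given measure; the stage names and the `reconcile_stages` helper are hypothetical.

```python
from itertools import combinations

def reconcile_stages(stage_totals):
    """Compare every pair of stages and return the pairs whose totals differ."""
    return [
        (a, b)
        for (a, total_a), (b, total_b) in combinations(stage_totals.items(), 2)
        if total_a != total_b
    ]

# Hypothetical totals for one measure: the report disagrees with every other stage.
totals = {"source": 425, "staging": 425, "loaded": 425, "report": 400}
mismatches = reconcile_stages(totals)
```

When a single stage appears in every mismatched pair, as `report` does here, the fault most likely lies in that stage or in the step that feeds it.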

Quality assurance is generally the biggest component of data warehouse testing. A data warehouse is not a transactional system, so comparing aggregates across the stages of processing will test most of its operations.

 
