Sales Management Customer Relationship Human Resources Business Performance BI & Data Quality IT Tools & Vendors

Sign-in   Register
Establishing 'Making it Happen' as a 'Formal & Predictable' Discipline
Principles and Rules Listing Page

You may not be a able to have a perfect ETL

Due to various data cleansing and transformation activities done, the transformed data and the data in the source systems may not match 100%.
 
This page of 'Principles and Rules' is linked to:  Data Warehousing, Data Analysis/OLAP, BI platform Tools Evaluation, BI business intelligence end-to-end view,

Refer Data Quality in Data Warehouse for more context.

For even a medium size data warehouse, there can be thousands of check you have to run to assess the accuracy of ETL for the previous night data. The most common approach is to run some reports on source systems and Data-warehouse and compare the two. The level at which you will do depend upon the reliability experience of the ETL. For example, you may like to do it for overall sales ' For each sales office' for each sales officer in each sales office..

Typically you can fix the level of deviation which you will allow. You may:

  • Sset the tolerance range of say +/
  • X% to that of value in the source system, depending upon the level of data quality and complexity of the data transformation done. This delta can be for the same day or the average across certain number of days in the past (wit maximum cap for any single day)
  • Set-up the maximum standard deviation allowed across multiple instances (multiple sales offices) for a given day.
  • Set-up the maximum standard deviation allowed across the multiple days for the same instance (for last 30 days for the same office)

If the data is not meeting the standards, one then need to drill down further and investigate. Experience will let you optimize on the level of checks you should apply.


Quick Feedback- Was this information helpful ?
Relevant Links to this page
TOPIC - Slowly Changing Dimensions SCD in Dimensional Modeling → Principles & Rules → Dimensional model has to be aligned to the Entity-Relationship → Principles & Rules → Always Use Conformed Dimensions → Principles & Rules → You may not be a able to have a perfect ETL → Practice Techniques → Handling Sparse Dimensional tables → Principles & Rules → Do not separate the parent and child line item data → Practice Techniques → Managing time-stamps across multiple time-zones → Practice Techniques → Recording events in multiple currencies → Practice Techniques → Handle different units of measure in the same fact table → Principles & Rules → Handling of Null foreign Keys in fact tables → Principles & Rules → Dimension Attributes as NULL → Principles & Rules → Don't rely too much on Meta Data Tools to enforce Business Intelligence → Principles & Rules → Don't wait for universal models for Data Marting → Principles & Rules → Add extra buffer for ETL phase → Principles & Rules → Homework before interviews is must (Business Requirements Phase in Data Warehouse) → Principles & Rules → Excel is the competition, which should be challenged → Principles & Rules → Avoid Pure MOLAP → Practice Techniques → Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Consolidate Data-Marts → Practice Techniques → Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Licensing & Maintenance Contracts → Practice Techniques → Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Governance & Standards → Practice Techniques → Field Tips Series- Streamlining & reducing cost of Business Intelligence- Evaluate Open Source → Principles & Rules → Master Data Management- Making a Right Start → Practice Techniques → How to integrate stand-alone BI environments- Gradual Approach → Principles & Rules → Business owned applications are a reality- Manage it → Principles & Rules → New Data Standards- What about existing data and applications? → Principles & Rules → Handle Each Time-stamp in the Fact Table as a separate dimension → Principles & Rules → Keep Aggregates and Details data in different Fact tables → Principles & Rules → Some considerations for Infrastructure in Data Warehouse → Principles & Rules → For Core BI platform go for a single, established and robust player → Principles & Rules → Don't be guided only by the business requirements for your Business Intelligence → Practice Techniques → Using Synonyms and Views → 
 
Back
 
Relevant links to this page
Slowly Changing Dimensions SCD
Dimensional model has to be aligned to the Entity-Relationship
Always Use Conformed Dimensions
You may not be a able to have a perfect ETL
Handling Sparse Dimensional tables
Do not separate the parent and child line item data
Managing time-stamps across multiple time-zones
Recording events in multiple currencies
Handle different units of measure in the same fact table
Handling of Null foreign Keys in fact tables
Dimension Attributes as NULL
Don't rely too much on Meta Data Tools to enforce Business Intelligence
Don't wait for universal models for Data Marting
Add extra buffer for ETL phase
Homework before interviews is must (Business Requirements Phase in Data Warehouse)
Excel is the competition, which should be challenged
Avoid Pure MOLAP
Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Consolidate Data-Marts
Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Licensing & Maintenance Contracts
Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Governance & Standards
Field Tips Series- Streamlining & reducing cost of Business Intelligence- Evaluate Open Source
Master Data Management- Making a Right Start
How to integrate stand-alone BI environments- Gradual Approach
Business owned applications are a reality- Manage it
New Data Standards- What about existing data and applications?
Handle Each Time-stamp in the Fact Table as a separate dimension
Keep Aggregates and Details data in different Fact tables
Some considerations for Infrastructure in Data Warehouse
For Core BI platform go for a single, established and robust player
Don't be guided only by the business requirements for your Business Intelligence
Using Synonyms and Views
Additional Channels
Principles & Rules
Free Templates
Glossary
Key Performance Indicators

Most Popular Zones with list of pages crossing 25000 hits  →→→ 
Maximising Sales Performance
Sales facility Infrastructure
Sales Leads Management SWOT
Sales Synergies
Sales Compensation System
Sales Compensation Structure Decision
Read more...
  Customer Relationship Management
Supply Chain for Customer Service and Support
Customer Value and Profitability-Overview
Customer Knowledge and Organizational Knowledge
Drivers for Customer Satisfaction & Retention
Customer Segmentation Data Management
Read more...
  Human Resources & Leadership
Competencies Definitions
Maximize the output first and then the potential
Fitting leadership dimension in employee performance
Roles and Level based Competency Segregation
Business and Financial Acumen
Read more...
 
 
Business Performance & Planning
For important KPIs- Install first & Fix later
Strategic Vision and Mission
Scorecards need manual finish
Strategic Planning leadership commitment
Scorecard Health Checklist
Read more...
  Business Intelligence & Data Quality
Time Trending Data Analysis
Give & Take for Source systems
Drill (horizontal) and Cross (horizontal) Navigation
Source system mapping matrix
Data Warehouse ETL Loading
Read more...
  IT Vendors & Tools Management
Report objects for Enterprise Reporting
Vendor Delivery Evaluation Governance
Vendor Delivery Project Evaluation
Delivery Evaluation Matrix
OLAP Architecture Cache Management
Read more...