Sales Management Customer Relationship Human Resources Business Performance BI & Data Quality IT Tools & Vendors

Sign-in   Register
Establishing 'Making it Happen' as a 'Formal & Predictable' Discipline
Principles and Rules Listing Page

Documenting your data-integration system

There is no data-integration system, which can fully self-document the data integration flow and process.
 
This page of 'Principles and Rules' is linked to:  Data Warehousing, Data Analysis/OLAP, BI platform Tools Evaluation, BI business intelligence end-to-end view, Metadata Management, Core Data Management Tools,

The reasons are:

  • Source system documentation: Its generally not possible to have a complete description of :
    • the source system tables
    • inter-linkages between the tables,
    • the same data-elements with different field names in different tables
  • The transformation rules: Many transformation rules are fairly complex, and cannot be implemented through standard options given in a tool and have to be programmed into the system.
  • The reasoning: While it might be possible for an ETL system to self-document the simple transformational rules, it cannot document the reasons and objectives behind the transformation. For example, why are you splitting a customer_ID (AA302456) into sub-components like Customer_type(AA), customer location (30) and customer_number(2456). The different reasons for this splitting could be:
    • Enabling the specific type of queries around customer type.
    • Two different tables could be having different field structure for customer type (AA v/s XXAA)
  • The documentation on risks related to the efficacy of ETL: During the design of an ETL system, one comes to know the limitations of the ETL. For example, you may not be able to achieve 100% perfect extractions or transformations given the:
    • limitations of data quality in source systems
    • limitations of the extraction flexibility,
    • the performance load due to a complex extraction query.
  • These limitations should always be documented, which give a more realistic view of the level of accuracy around the data and the output information.

  • The flow of ETL: A set of data goes sometimes go through multiple transformation routines before it reaches end-state and be ready for loading. A good documentation should be able to provide an end-to-end view of this entire flow so that one can understand the purpose behind this flow. This end-to-end view should be able to answer the following questions:
    • Why we are following X steps and not Y steps to do a transformation?
    • What is the completion criteria related to each step of transformation/Extraction?
  • Data Quality checks: An ETL system generally does not document the data quality checks, which need to be done and their reasoning.

Quick Feedback- Was this information helpful ?
Relevant Links to this page
Principles & Rules → Dimensional model has to be aligned to the Entity-Relationship → Principles & Rules → Always Use Conformed Dimensions → Principles & Rules → You may not be a able to have a perfect ETL → Practice Techniques → Handling Sparse Dimensional tables → Principles & Rules → Do not separate the parent and child line item data → Practice Techniques → Managing time-stamps across multiple time-zones → Practice Techniques → Recording events in multiple currencies → Practice Techniques → Handle different units of measure in the same fact table → Principles & Rules → Handling of Null foreign Keys in fact tables → Principles & Rules → Dimension Attributes as NULL → Principles & Rules → Don't rely too much on Meta Data Tools to enforce Business Intelligence → Principles & Rules → Don't wait for universal models for Data Marting → Principles & Rules → Add extra buffer for ETL phase → Principles & Rules → Homework before interviews is must (Business Requirements Phase in Data Warehouse) → Principles & Rules → Excel is the competition, which should be challenged → Principles & Rules → Avoid Pure MOLAP → Practice Techniques → Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Consolidate Data-Marts → Practice Techniques → Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Licensing & Maintenance Contracts → Practice Techniques → Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Governance & Standards → Practice Techniques → Field Tips Series- Streamlining & reducing cost of Business Intelligence- Evaluate Open Source → Principles & Rules → Master Data Management- Making a Right Start → Practice Techniques → How to integrate stand-alone BI environments- Gradual Approach → Principles & Rules → Business owned applications are a reality- Manage it → Principles & Rules → New Data Standards- What about existing data and applications? → Principles & Rules → Handle Each Time-stamp in the Fact Table as a separate dimension → Principles & Rules → Keep Aggregates and Details data in different Fact tables → Principles & Rules → Some considerations for Infrastructure in Data Warehouse → Principles & Rules → For Core BI platform go for a single, established and robust player → Principles & Rules → Don't be guided only by the business requirements for your Business Intelligence → Practice Techniques → Using Synonyms and Views → 
 
Back
 
Relevant links to this page
Dimensional model has to be aligned to the Entity-Relationship
Always Use Conformed Dimensions
You may not be a able to have a perfect ETL
Handling Sparse Dimensional tables
Do not separate the parent and child line item data
Managing time-stamps across multiple time-zones
Recording events in multiple currencies
Handle different units of measure in the same fact table
Handling of Null foreign Keys in fact tables
Dimension Attributes as NULL
Don't rely too much on Meta Data Tools to enforce Business Intelligence
Don't wait for universal models for Data Marting
Add extra buffer for ETL phase
Homework before interviews is must (Business Requirements Phase in Data Warehouse)
Excel is the competition, which should be challenged
Avoid Pure MOLAP
Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Consolidate Data-Marts
Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Licensing & Maintenance Contracts
Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Governance & Standards
Field Tips Series- Streamlining & reducing cost of Business Intelligence- Evaluate Open Source
Master Data Management- Making a Right Start
How to integrate stand-alone BI environments- Gradual Approach
Business owned applications are a reality- Manage it
New Data Standards- What about existing data and applications?
Handle Each Time-stamp in the Fact Table as a separate dimension
Keep Aggregates and Details data in different Fact tables
Some considerations for Infrastructure in Data Warehouse
For Core BI platform go for a single, established and robust player
Don't be guided only by the business requirements for your Business Intelligence
Using Synonyms and Views
Additional Channels
Principles & Rules
Free Templates
Glossary
Key Performance Indicators

Most Popular Zones with list of pages crossing 25000 hits  →→→ 
Maximising Sales Performance
Sales Leads Generation through advertising
Sales force density
Sales Behavior
Sales Leads Generation through Events
Sales Campaign SWOT analysis
Read more...
  Customer Relationship Management
Customer Satisfaction & Retention- Data Management
Customer Value and Profitability- BI
Customer Service and Support Overview
Customer Value and Profitability Data Management
Drivers for Customer Satisfaction & Retention
Read more...
  Human Resources & Leadership
Deliver Results
Lead diverse and collaborative teams
Empower Front-line Employees
Customer Focus
Fostering Innovation
Read more...
 
 
Business Performance & Planning
Stakeholder test for Scorecard
SWOT Assessment Report
Strategy Map Objectives Measures and Initiatives
Review Session should stay focused
Strategy Map to Strategic theme
Read more...
  Business Intelligence & Data Quality
Data Mapping and Assessment
Back-Room Data Warehouse Metadata
Business Ownership of Data Quality
Data Warehouse Metadata
Data Warehouse Project Scoping and Planning
Read more...
  IT Vendors & Tools Management
Report objects for Enterprise Reporting
Vendor Credentials and Track-Record Evaluation
Single point vendor needs to be cost-effective
Data Quality Tools Wizards
Vendor Company structure Evaluation
Read more...