Building Making It Happen
Building Making It Happen
  Sign-in         Register
    
Principles and Rules Listing Page
Keep the same names and definitions for all data elements
This tip is basically for not to cut corners, when you are designing your dimensional model. Having some mapping tables to map inconsistent names of same data elements, is an almost criminal short-cut to having same names in the dimensional model design itself.
 
This page of 'Principles and Rules' is linked to:  Data Warehousing, Data Analysis/OLAP, BI platform Tools Evaluation, Metadata Management, Core Data Management Tools,


Sometimes the same data element carries different names in different dimensional tables, due to the following reasons..

  • Inconsistencies in data element names in source systems is so huge (like the same customer ID data element is named in ten different ways in ten different tables- cust_ID, CustomerID, C_ID..).
  • There being multiple teams designing different parts of the dimensional models.
  • Your dimensional model having evolved over time.

To accelerate the DW project, you may create a mapping table in metadata which can link these different names. You do it as you don't want to disturb your transformation routines, and want to have an alternate method to align your dimensional models.

The reason you should not do this is:

  • Data warehouse is storehouse. People may or may not choose to refer to the mapping tables or Meta-data definitions. This may severely impact the potential of DW, especially when you need drill across capabilities.
  • With field-names being different, you will end up having more transformation and loading routines to create and run.
  • The maintenance of dimensional model becomes more complex over time.
  • You end up promoting the culture of inconsistent naming in the DW and source system.

Benefits of same names and definitions across all schemas

  • High productivity: Same names mean that there is no confusion and same queries can be used in different schemas.
  • Automated joins across schemas: As you might see in the multi-cube architecture, you can join two schemas if they share a common dimension. With same names, it becomes easier to create these joins.

A good naming strategy has to be:

  • Intuitive and consistent: the naming approach should be easily understood and should be applied universally.
  • Extendible: With new facts and dimensions and attributes, the naming structure should be able to absorb these changes.
  • Easy to understand: Names should not be too cryptic. The names should be more in a business like language (Use Customer_ID instead of Cusid)
  • Should have identifiers linked to the business process, the dimension, fact table to which it belong etc.

Naming strategy should be defined by the dimensional modeling team as they have a got a wide view of all the dimensions and fact tables involved. Post the first draft, we have to go through the iterative process of fine-tuning given the feedback from business, technical and information managers.

   Access more details on this page   

Quick Feedback- Was this information helpful ?
Relevant Links to this page
Principles & Rules → Add extra buffer for ETL phase → Principles & Rules → Homework before interviews is must (Business Requirements Phase in Data Warehouse) → Principles & Rules → Excel is the competition, which should be challenged → Principles & Rules → Avoid Pure MOLAP → Practice Techniques → Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Consolidate Data-Marts → Practice Techniques → Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Licensing & Maintenance Contracts → Practice Techniques → Field Tips Series- Streamlining & Cost-Reduction in Business Intelligence- Governance & Standards → Practice Techniques → Field Tips Series- Streamlining & reducing cost of Business Intelligence- Evaluate Open Source → Principles & Rules → Master Data Management- Making a Right Start → Practice Techniques → How to integrate stand-alone BI environments- Gradual Approach → Principles & Rules → Business owned applications are a reality- Manage it → Principles & Rules → New Data Standards- What about existing data and applications? → Principles & Rules → Handle Each Time-stamp in the Fact Table as a separate dimension → Principles & Rules → Keep Aggregates and Details data in different Fact tables → Principles & Rules → Some considerations for Infrastructure in Data Warehouse → Principles & Rules → For Core BI platform go for a single, established and robust player → Principles & Rules → Don't be guided only by the business requirements for your Business Intelligence → Practice Techniques → Using Synonyms and Views → 
 
Back
Featured Pages
Data Domain and Data Standards Controls
Two tier Data Warehouse Architecture
Customer Data Variations
Data Warehouse Performance Management

Make 'Executable' Strategy
Maximize Results
Maximize People
Manage Execution

Featured Pages
Data Warehouse sharing and browsing
Enterprise Intelligence' Evaluation- Data Pipelines
Fix Business Intelligence at functional level first
Don't rely too much on Meta Data Tools