Building Making It Happen
Building Making It Happen
  Sign-in         Register
    
Principles and Rules Listing Page
Enabling Metadata Generation for Unstructured Content
Unstructured content in an organization is typically the last phase of evolution of a metadata environment. Most of the unstructured content is linked to business metadata. You can use various tools for content management, collaboration management and business process management to enable automatic generation of Metadata. There are some more basic methods like using shared drives and encouraging standard templates.
 
This page of 'Principles and Rules' is linked to:  Metadata Management, Core Data Management Tools,


Unstructured content is perhaps the biggest challenge in the way of building a holistic metadata. Metadata is essentially carries the details on the data, information and knowledge existing within an organization environment. This includes all kind of structured (like the data in business applications) and unstructured (like word documents, paper documents) content. Business Metadata (like process maps, memos etc...)is mostly in the form of unstructured content.

One impractical way to create the metadata for unstructured content is to manually enter the details into the metadata repository. However, given the volume of this data, it will not be possible to implement this method and to sustain it. That's the reason that business metadata around the unstructured content is typically the last phase of a metadata repository evolution. As a general principle- A metadata repository works only if it can be updated mostly automated.

Here are some methods, which you can deploy to enable metadata repository around unstructured metadata:

Storing Data in the shared drives:

You can bar or discourage people from storing their files in local drives. If you cannot bar, you can ensure that the files are replicated to the shared drives. Shared drives will help the metadata extraction programs to sniff out the details about these documents (like a crawler).

Bringing your files and documents within the ambit of versioning system

While there are versioning systems for software applications, the same is not done for the unstructured content used by business. By having versioning tool encircling these documents, one can create the metadata on the change history of this unstructured content.

Encourage people to use standard templates

You can create standard templates for different kinds of documents. Examples are:

  • Minutes of the meetings
  • Memos
  • Policies and rules
  • Business process maps
  • Desk Instructions...

For each of the document category, a template can be added in the MS word (one can also use InfoPath). Users should be encouraged to use these templates. This will enforce a discipline to enter the details around document title, description, date of creation, authors(s) etc...

Use a content management system

You can deploy a content management system and encourage people to use it. Many smart organizations make it mandatory for employees to use these platforms to manage their content. These content management platforms add huge value in terms of creating an easily accessible metadata. They have the typical capabilities of versioning, indexing, tagging, document pathing etc. You can go to the sites for Stellent and share-point to understand more about these tools.

Use an effective collaboration platform

This is the world of forums, wikis, blogs and intranet. Having robust platforms for people to collaborate will enable a better management of what people are writing a sharing. This will create metadata for the metadata repository. For example, a good Wiki platform will have a database of all wikis, their purpose, the participants, subjects, tags, indexes etc

Use a business process management

A business process management (BPM) platform essentially allows you to capture, design, store, change manage and implement a business process. These applications can have close tightness with the other applications like work flow and distribution systems. For example, if you make changes in BPM, it can trigger changes in the workflow programs. These BPM applications have the metadata related to the business process flows and process maps etc.

Scan and Image the documents

In this automated environment, there are many documents which are not in form of a soft file. This includes original legal documents, invoices etc. One way to handle it is to have the scans of these documents and make them part of image management software.

Leverage Record Retention Services

There are 3rd party record retention companies which have the expertise on creating metadata on the physical documents. While record retention companies are primarily interested in building indexes for retrieving documents, they can expand their brief by building more details around these physical documents to serve the needs of metadata repositories.

   Access more details on this page   

Quick Feedback- Was this information helpful ?
Relevant Links to this page
Principles & Rules → Business intelligence need not wait for legacy conversion → Principles & Rules → Beware of Data Federation as an ultimate solution to your data integration solution → Principles & Rules → Periodic Rationalization & Prioritization of Information has multiple benefits → Practice Techniques → Which Metadata Architecture to use and when → 
 
Back
Featured Pages
Data Warehouse Benefits Usage
Data Monitoring Request Form
Data Quality Assurance Track
Don't worry for NULL as facts

Make 'Executable' Strategy
Maximize Results
Maximize People
Manage Execution

Featured Pages
Master-Data-Management CDI Usage pattern
Data Map & Assessment Report
Avoid Pure MOLAP
Master-Data-Management CDI Architecture