Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 36 Next »

A Definition of Master Data Management (MDM)

Master data management (MDM) is an approach to reducing data redundancy by maintaining a definitive "record of truth," or Master Data, for critical data in order to supply a single source as a reference. Ideally, MDM organizes data sharing among multiple applications or departments.

It Starts with Data Integration

It's common for organizations to have duplicate information on different systems. For example, customer information could be stored in both a CRM and accounting system:

While some data in both systems is identical, some is similar but not the same. 

With small organizations, there usually isn't an issue keeping just a few systems up to date. The process of keeping data up to date between disparate systems is called Data Integration (DI). As organizations bring more and more systems online, their business data typically gets increasingly duplicated and the problem of keeping all of the systems up to date becomes problematic.

For example, imagine a delivery service that stores information about its customers in several systems:

Delivery Service Example


As customer information changes over time (i.e. customer address or phone number), keeping all the systems up to date becomes exponentially more difficult.

Considering the diagram above:

  • Where is the truth?

A customer's email address would be stored on several systems and there may be conflicting values. If this is the case, which system holds the true/correct value?

  • Which system gets notified of a change and what data elements do other systems need? 

If all systems need to be updated, then each system needs to know how to transform the data to be consumed by each of the other systems. This requires programming many transformations. In an extreme case, the example above could take five transformations for each of the six systems, totaling 30 transformation modules or adaptors.

Note: An adaptor is software located within a system that connects and shares data through the YOUnite Data Hub. The adaptor focuses on Extract, Transform, and Load (ETL): ensuring any system "outbound data" meets defined format requirements and then gets transformed into "inbound data" format that another system requires. 

  • Are changes handled in real time or in batch?

The easiest and most common change handling is via batch updates. However, latency between batch updates may cause business process issues.  

  • How is DI handled?

The onus of transferring data between systems can become a daunting task. While some applications have built-in adaptors to handle transformations, these generally handle only a subset of the required data. Where the built-in adaptors fall short or don't exist, the organization may end up spending resources developing "one-off" adaptors to meet the ongoing transformation need. 

  • How does the organization manage access to the data?

Using the delivery service example above, the Warehouse Management system should get access to only a subset of the customer data for security reasons. And it shouldn't get any level of access to the Credit Card Processing system. This level of access is called Access Control Lists or "ACLs". Data Governance is used to describe managing the ACLs. Data Governance also defines where the data records are stored.

The problem doesn't end with just the customer data. Other data such as product, inventory, and employee data may need to be kept up to date on several systems as well:

In Master Data Management, the set of fields or properties (e.g. name, address, phone number) that define a set of data (e.g. Customer) is called a data domain

Migrating to Master Data Management

MDM solves the problem of keeping interrelated systems up to date by creating a separate system where data domains (or domains) are defined for all systems inside the organization. The domains provide neutral data formats or schemas for all systems, facilitating data sharing. The latest (most recent) version of a given record is called the Data Record.

The data for a data domain defined in YOUnite can:

  1. Be stored in the MDM data store or,
  2. Remain in the organization's systems and, through adaptors, shares data with other systems via the YOUnite Data Hub. This is called federated MDM. With federated MDM, YOUnite's data store does not store the data.

Using the delivery service example above:

  • MDM Data Store: The customer Data Records are stored in YOUnite's data store and can be retrieved, in whole or in part, by the delivery service company's system applications that have appropriate access.
  • Federated MDM: If the customer domain is federated, then the customer Data Record is NOT stored in YOUnite's data store but is created in real-time by referencing customer domain properties as they reside inside the various systems (i.e. the customer name, address, and phone number properties as stored in the Customer Service, Accounting, and CRM systems). 

In the federated model, data records can be stored in the YOUnite data store or in one or more systems connected to YOUnite. Many systems may hold similar data but generally the organization as a whole decides which system(s) hold  the data records. Note that with YOUnite's federated model, different groups inside of an organization can designate which system holds the master data.

YOUnite's governance model can manage who can access the data. In the federated example, governance can be set so that the Warehouse Management system only has access to data stored in the Distribution and CRM systems, but allow the Accounting System to have access to all the other systems. Based on the governance setting, when the Warehouse Management Division looks up the data record for its customer, the Acme Company, it may get a different result (appropriate for that division's needs) than the Accounting Division which has access, or ACLs, to Acme Company's information on all systems, including Credit Card Processing for example. 

MDM is the process of determining where the data records are stored (mdm data store or federated) and managing who (a person or a system) has read, write, update, and delete privileges to those systems. 

Data Records vs Master Data

Data records are the latest, most recent version of a record. They are stored in many systems connected to YOUnite, including the YOUnite data store. Master data is a process performed by the organization's data governance to establish which of these systems will be considered to contain the "record of truth" for a given data domain. It's not always necessary or appropriate for systems to access the organization's master data, as many data access requests are for data records that may or may not contain master data. However, one feature of YOUnite is the ability to propagate changes from a system that contains data records to others in the YOUnite eco-system on a permission-appropriate basis (i.e. governance).

Reviewing New Terms

Several terms have been introduced and it may be helpful to review them before moving on:

  • Data Integration (DI) The process of transforming and transferring data from one system to another.
  • Adaptor Applications, modules, or some software or hardware component that transforms data from one format to another so that the data can be consumed by another system.
  • Data Governance Managing data access, i.e. who accesses certain data sets based on role, application, etc. Defining where the Master Data is stored.
  • Data Domain (Domain) A set of fields or properties that define a set of data (i.e. the "Customer" data is comprised of the name, address, and phone number properties).
  • Master Data Management (MDM) An approach to reducing data redundancy across systems by maintaining a master file for critical data.
  • Data Record The latest (most recent) version of a given record.
  • Master Data (MD) The master or golden version of a record (for a customer, for example). The "record of truth."
  • YOUnite Data Store  Whereby the latest change to a record is saved inside of YOUnite (in the YOUnite MDM domain). 
  • Federated MDM Whereby the latest change to a record is noted in one of the organization's systems without actually storing the data in YOUnite (in the YOUnite MDM data store). 
  • Acess Control Lists (ACLs) The defined access limit to MDM data records for any given system, application, or role.

Further Reading

An good source for more MDM background is Mark Allen and Cervo Dalton's Multi-domain Master Data Management: Advanced MDM and Data Governance in Practice. Waltham, MA: Morgan Kaufmann, 2015. (ISBM 978-0-12-800835-5).

Next

An Introduction to YOUnite


  • No labels