Search FCW


Subscribe Now!
Table of Contents
Business
BPM
CXOs
Columns
Columnists
Defense
E-Government
Elections 2008
Enterprise Architecture
Funding
Homeland Security
Health IT
IPv6
LOB
Management
Procurement
Privacy
Policy
Program Management
State and Local
Security
Technology
Telework
Workforce

More Topics
resourcecenter
Home
Letters to the Editor
Current Issue/Download
Print/Online Archives
Editorial Calendar
researchstore
resourcecenter
Sprint Communications for Continuity Operations
Oracle Resource Center
GSA: Your Customer Service Agency
Government Leadership Survey
Green Solutions Guide
Report: Information Sharing
DISA IT Strategy & Vision
Emergency Preparedness Report
Report: Green Computing
PEO EIS Guidebook
Content Library

More >>



Latest News
ADVERTISEMENT





 

The data reference model gets real

XML schema is major step in tearing down agencies' data walls

By David Perera
Published on June 27, 2005

Comment

Click here to comment on this article


Related story links

Keeping data flowing

Sharing drives DHS data project


Newsletters

You might also be interested in these FCW newsletters:

Daily

To learn more, click here.


When data resides in incompatible systems, useful information can be hard to find, which means links are missed and conclusions are incomplete. It's expensive, too, when agencies duplicate data-collection efforts. Despite calls for greater information sharing governmentwide, agencies continue to gather and store data in incompatible ways.

The Office of Management and Budget aims to improve data sharing through governmentwide adoption of a data reference model (DRM) schema based on Extensible Markup Language (XML). The schema is a hierarchy of specifications for agencies to describe and exchange data.

OMB officials have selected the model — the fifth and final portion of the federal enterprise architecture — to fulfill a section of the E-Government Act of 2002 requiring government information to be organized, categorized and electronically searchable. Under that timeline, the DRM must be finalized by Dec. 17, say members of the CIO Council's Data Reference Model Task Force.

"This is a detailed blueprint for how organizations are going to describe the structure, categorization and exchange of their information," said Michael Daconta, the task force's leader, during the draft schema's public release June 13.

If agencies widely adopt the schema, they could more easily spot complementary or overlapping datasets, proponents say.

"You have to be able to have an organization find the other information assets and logical data models of other organizations," Daconta said. Public access to government information would likewise improve.

A draft version of the proposed schema is available for public discussion; the deadline for comments is Sept. 14.

The schema would be the template for the DRM documents produced by all agencies, said Owen Ambur, chief XML strategist at the Interior Department. Daconta credits Ambur with devising the schema's approach.

But not all agency data needs to be expressed within the schema.

"That's too big, too scary," Daconta said. "We very clearly state that this is information that you share or will share within a year."

Ideally, OMB would direct agencies to identify information that should adhere to the schema to achieve annual information-sharing objectives, Ambur said.

Some of the schema's elements would be optional when tagging data. "You populate different things to achieve different purposes," Daconta said.

Agencies already invested in their own approaches to data tagging will be able to reference them within the DRM schema.

"The intent is having to save them from redoing the effort they've already done and just capitalize on it automatically," Ambur said.

Unlike the data exchange model proposed in the first DRM version, which was released last October, the new XML approach will support the exchange of structured, unstructured and semi-structured data, Daconta said.

That is important because "80 percent of government information is either unstructured or semi-structured," said Andy Hoskinson, one of the contractors OMB employs in the Federal Enterprise Architecture Program Management Office.

The DRM schema allows unstructured information, such as text documents or photos, to be tagged with metadata that identifies the information's subject, source and creator. That information can also be linked to other resources.

The revised reference model will allow agencies to exchange information and query data via a registry, preferably federated, Daconta said. The Core.gov site is under consideration as a registry.

The DRM is more abstract than an agency- specific effort called the National Information Exchange Model (NIEM), which seeks to identify and standardize a core set of XML schema terminology. NIEM participants include most law enforcement communities within the Justice and Homeland Security departments and state and local organizations.

They have also agreed to standardize the data elements for people, places, dates and other items.

"NIEM is creating a framework for how you assemble messages rapidly that are interoperable out of the box," Daconta said. He added that the DRM is "more about how does everything tie together to answer high-level questions, not all the details of how do you exchange it and why do you exchange it."

Experts say that inventorying data is a separate task from harmonizing data. The latter step will be an ongoing bottom-up and top-down process, with emphasis on the former, Daconta said.

"This approach will work," he said. "Let's move forward with it."



upcoming event

Enterprise Architecture 2008 - Washington, DC
September 9 - September 10, 2008

Occupational Health & Safety Executive Summit - Arlington, VA
October 6 - October 7, 2008


 

head
fcw
issue
First Name State
Last Name Zip
Title Email