US Government Joins the Dots with Irish ‘Linked Data’ Technologies

Thursday, 26 April 2012

Agencies in the US Government have adopted a set of web tools and standards developed in Ireland by researchers at NUI Galway’s Digital Enterprise Research Institute (DERI).

DERI’s technologies are being utilised by Data.gov, a portal developed to bring an unprecedented level of transparency to the US Government. DERI’s research, which is funded by Science Foundation Ireland, focuses on enabling networked knowledge, using the latest Semantic Web and Linked Data technologies. Its technologies allow related data that was not previously linked to be connected together, so that a person or computer can see the bigger picture through interlinked datasets. Data.gov allows the linking of open government data from agency publishers to contributions from other public and private organisations.

DERI’s Dr John Breslin, who also lectures in Electronic Engineering at NUI Galway, explains: “I recently saw a universal toy adaptor that allowed you to connect plastic building blocks to wooden construction sets. Linked Data is a bit like that – it’s based on a universal data format that allows you to bring datasets from different realms together, making them more useful as a whole. Your planning applications could be linked to your broadband penetration rates or your traffic congestion data to help identify issues and trends.”

Among the DERI outputs being used by Data.gov and the related Healthdata.gov site are Neologism and the GRefine RDF Extension. Neologism is a new tool which allows for the easy creation of ‘vocabularies’ needed to link data and is built on the powerful open source content management platform Drupal. One such vocabulary that is listed in vocab.data.gov is the Vocabulary of Interlinked Datasets (VOID), which was co-created by DERI researchers. The second technology in use, the RDF Extension for Google Refine, is a graphical user interface for exporting data from Google Refine (a tool for working with messy data) as interlinked Semantic Web data.

George Thomas, Enterprise Architect with the US Health and Human Services Administration, has said: “More behind the scenes work that routinely benefits from substantial DERI engagement includes an ongoing contribution to the creation and promulgation of open standards related to open government data catalogs and communities. But DERI doesn’t stop there, they put these new standards into practice through enhancements to Drupal 7 core, helping make it an even more powerful publishing and visualization tool for the emerging Web of Data.”

He added: “We hope to leverage all of these features and capabilities in our current and ongoing Healthdata.gov modernization efforts. They also create lots of other useful tools and pen helpful blog posts that promote the proper use and integration of standards. Furthermore, DERI folks are active in many other efforts to promote structured data using open standards and help to clarify best practices that will ultimately lead to better integration of international government statistics.”

Joint work between DERI and Mr. Thomas’ team on Patient Controlled Privacy (using Linked Health Data) will be presented at the Semantic Technologies Conference in San Francisco in June, that makes use of the Privacy Preference Ontology and related privacy management web applications from DERI’s Social Software Unit.

Data.gov is part of a global initiative referred to as the Open Data movement, with the goal to motivate governments to make public information freely available and easily accessible online. Others examples include data.gov.uk and data.london.gov.uk from the UK, and data.fingal.ie and dublinked.ie from Ireland.

Researchers at DERI in NUI Galway are in the vanguard of this new technology space. The largest research organisation of its kind in the world, DERI with its 140 researchers, it is collaborating with industry and governments to revolutionise the utilisation of data.

Today, more than 200 regions and countries are publishing their government data online. Three years ago, DERI announced the adoption of its SIOC data format by a website in the Obama administration. The SIOC format is one of the Open Data formats being produced by a number of US Government websites that use the latest Drupal platform, including energy.gov (the US Energy Department), policy.house.gov (the Republican Policy committee), lsc.gov (the civil legal aid program), and oag.ca.gov (the California Attorney General). The DCAT vocabulary from DERI is also used by various government sites for describing government datasets and data catalogs. DERI also collaborates with the European Commission on common semantic vocabularies, such as the Asset Description Metadata Schema (ADMS).

Professor Stefan Decker, Director of DERI at NUI Galway, says that while we are seeing Open Data being used to improve public services and promote more transparent and effective government - that is only part of the story. “Open Data has been described recently by the UK’s Cabinet Office Minister Francis Maude as the raw material of a ‘new industrial revolution’. Making more data freely available is resulting in people using it to build new businesses and grow existing ones, creating jobs.

In Ireland, the Open Data movement is being pioneered by the likes of Fingal County Council, the Dublinked consortium and the National Cross-Industry Working Group on Open Data. DERI participates at a national and international level through the provision of best practices, standards and technologies. Open Data is key to supporting a truly transparent and participatory democratic system.”

In Ireland, DERI collaborates closely with local and the Local Government Computer Services Board, as well as the National Cross-Industry Working Group on Open Data to promote Open Data.

Professor Decker concluded: “These are exciting times and a true spirit of innovation and entrepreneurship is engulfing the IT world as networked knowledge begins to come into its own. Undoubtedly, ten years from now when we look back, we will wonder how we managed with the volumes of unconnected data we have now.”

DERI was founded in 2003 at NUI Galway with support from the Irish Government’s Science Foundation Ireland, as part of a strategic investment in Semantic Web research and business development.

-ends-

Author: Marketing and Communications Office, NUI Galway
« Back