Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data Access and Retrieval

The RDF data model allows the description of domain-level knowledge that is understandable by both humans and machines. RDF data can be derived from different source formats and diverse access points, ranging from databases or files in CSV format to data retrieved from Web apis in JSON, Web Services in XML or any other speciality formats. To this end, machine-interpretable mapping languages, such as rml, were introduced to uniformly define how data in multiple heterogeneous sources is mapped to the rdf data model, independently of their original format. However, the way in which this data is accessed and retrieved still remains hard-coded, as corresponding descriptions are often not available or not taken into account. In this paper, we introduce an approach that takes advantage of widely-accepted vocabularies, originally used to advertise services or datasets, such as Hydra or DCAT, to define how to access Web-based or other data sources. Consequently, the generation of RDF representations is facilitated and further automated, while the machine-interpretable descriptions of the connectivity to the original data remain independent and interoperable, offering a granular solution for accessing and mapping data.

Speakers:

Anastasia Dimou

Gent University
http://www.ugent.be/en

Ruben Verborgh

Gent University
http://www.ugent.be/en

Ruben Verborgh is a professor of Semantic Web technology at Ghent University – imec and a research affiliate at the Decentralized Information Group at MIT. He aims to build a more intelligent generation of clients for a decentralized Web at the intersection of Linked Data and hypermedia-driven Web APIs.

Search form

Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data Access and Retrieval

Speakers:

Anastasia Dimou

Ruben Verborgh

Miel Vander Sande

Erik Mannens

Rik Van de Walle