Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Metadata serves as the backbone of effective data management and analysis in the life sciences and healthcare domains, enabling researchers, clinicians, and policymakers to derive meaningful insights from vast amounts of data while ensuring its integrity, reliability, and confidentiality.

🔎 Metadata standards

A metadata standard is a set of rules, guidelines, and conventions that define how metadata should be structured, formatted, and described within a particular domain or context. These standards ensure consistency, interoperability, and effective management of metadata across different systems, organizations, and disciplines.

Here are some common metadata standards we use at Health-RI:

Dublin Core (DC): Dublin Core is a widely used metadata standard designed to provide a simple and standardized way to describe digital resources such as documents, web pages, images, videos, and other types of content on the internet. It was originally developed in 1995 by the Dublin Core Metadata Initiative (DCMI)

DCAT: Data Catalog Vocabulary, is a metadata standard specifically designed for describing datasets and data catalogs on the web. DCAT is based on the RDF (Resource Description Framework), which is is a standard model for representing and exchanging metadata and data on the web in a machine-readable format.

DCAT-AP: DCAT Application Profile for Data Portals in Europe, is a metadata standard developed by the European Commission to facilitate the interoperability of data catalogs and portals across European countries. It builds upon the DCAT (Data Catalog Vocabulary) standard and extends it with additional requirements and recommendations tailored to the European context.

🎯 HRI Metadata Set

The National Health Data Catalogue currently works with a Core Metadata Set. The Core Metadata Schema is a formal shared conceptualisation of the requirements to find and reuse information across Health-RI nodes via the national catalogue. It represents a set of minimal elements for describing each resource (including dataset) with common metadata. The current version of the Core metadata schema includes DCAT v3 and some selected DCAT-AP mandatory classes and their definitions. DCAT v3 is a data catalogue vocabulary with DCAT-AP being an application profile for describing public sector datasets in Europe.

...

Code Block
@prefix dcat: <http://www.w3.org/ns/dcat#> .
@prefix dct: <http://purl.org/dc/terms/> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .

<> a dcat:Dataset ;
    dct:identifier "BS001" ;
    dct:title "Blood Sample" ;
    dct:description "Metadata for a blood sample" ;
    dct:issued "2024-01-15T08:30:00"^^xsd:dateTime ;
    dct:publisher "Lab Technician, Sarah Lee" ;
    dct:subject "Hypertension" ;
    dcat:landingPage "https://example.com/blood_sample" ;
    dcat:accessRights "Informed consent obtained" ;
    dcat:theme "Health" ;
    dcat:keyword "Blood sample, Hypertension, CBC, Lipid Panel, Glucose Test" ;
    dcat:temporal "2024-01-15T08:30:00/2024-01-15T10:00:00"^^xsd:dateTime ;
    dcat:hasVersion "1.0" ;
    dcat:conformsTo <https://eurl.link/dcat-ap> .

The produced metadata mapping

🔎 Metadata standards

A metadata standard is a set of rules, guidelines, and conventions that define how metadata should be structured, formatted, and described within a particular domain or context. These standards ensure consistency, interoperability, and effective management of metadata across different systems, organizations, and disciplines.

Here are some common metadata standards we use at Health-RI:

Dublin Core (DC): Dublin Core is a widely used metadata standard designed to provide a simple and standardized way to describe digital resources such as documents, web pages, images, videos, and other types of content on the internet. It was originally developed in 1995 by the Dublin Core Metadata Initiative (DCMI)

DCAT: Data Catalog Vocabulary, is a metadata standard specifically designed for describing datasets and data catalogs on the web. DCAT is based on the RDF (Resource Description Framework), which is is a standard model for representing and exchanging metadata and data on the web in a machine-readable format.

DCAT-AP: DCAT Application Profile for Data Portals in Europe, is a metadata standard developed by the European Commission to facilitate the interoperability of data catalogs and portals across European countries. It builds upon the DCAT (Data Catalog Vocabulary) standard and extends it with additional requirements and recommendations tailored to the European context.

...

To map your metadata you first need to understand the structure of your metadata and their semantic meaning and the ontology (vocabulary) used to to describe your data in a Resource Description Framework (RDF), in our case DCAT V3 format.

(tick) Next steps

After mapping transfroming your data properties to the classes and variables of the HRI model you need to validate your model. This step ensures that the new model both accurately represent the original data as well as adheres to the HRI metadata structure.

Once your RDF data is ready, you can publish your data to an FDP where it can be harvested by the catalogue. More information about this step can be found here: 3. Exposing metadata

Additional resources

Technical details on DCAT AP and FAIR Datapoints - Youtube video, Health-RI

HRI Github - You can find recourses and examples on the Health-RI metadata Github. 

Resources from the EU Open Data Explained, including a general training on metadata and basic and advanced level resourses on DCAT and DCAT-AP.

Panel
panelIconId1f5e8
panelIcon:speech_left:
panelIconText🗨️
bgColor#E6FCFF

Questions?

If you have questions about the onboarding process or would like to learn more. Reach out to our https://www.health-ri.nl/health-ri-servicedesk

📧 servicedesk@health-ri.nl

...