...
Table of Contents | ||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
📌 Introduction
In this section, we describe the basics of metadata and explain what metadata mapping is. We also look at the Health-RI Core Metadata Schema and the metadata standards it builds upon. This page is intended for a general audience. For details on the standards and the schema, please visit the github specifications dedicated for data experts or data stewards https://github.com/Health-RI/health-ri-metadata/ .
🧠 What is metadata
Metadata is essentially data about data. It provides information that describes various aspects of your data, such as its description, the owner of the data, the type of data. In other words, metadata helps understanding and managing data effectively by providing additional information about it.
Metadata serves as the backbone of effective data management and analysis in the life sciences and healthcare domains. It enables researchers, clinicians, and policymakers to derive meaningful insights from vast amounts of data while ensuring its integrity, reliability and confidentiality, by provide a standardization, interoperability and machine readabilty of the shared metadata.
🔎 Metadata standards
A metadata standard is a set of rules, guidelines and conventions that define how metadata should be structured, formatted and described within a particular domain or context. Adhering to such standards ensures consistency, interoperability and effective management of metadata across different systems, organizations and disciplines.
...
DCAT-AP: DCAT Application Profile for Data Portals in Europe is a metadata standard developed by the European Commission to facilitate the interoperability of data catalogs and portals across European countries. It builds upon the DCAT (Data Catalog Vocabulary) standard and extends it with additional requirements and recommendations tailored to the European context.
🎯 HRI Metadata Schema
The National Health Data Catalogue currently works with a Core Metadata Schema. This Core Metadata Schema is a formal shared conceptualisation of the requirements to find and reuse information across Health-RI nodes via the National Catalogue. It represents a set of minimal elements for describing each resource (including dataset) with common metadata. The current version of the Core Metadata Schema includes DCAT v3 and some selected DCAT-AP mandatory classes and their definitions.
The set is split into several classes describing the data. At the moment four classes (Dataset, Catalog, Resource, and Agent) are mandatory. Each class is populated by a set of mandatory and recommended variables. You can find all of the descriptions of variables and classes here: Core Metadata Schema Specification
📋 What is metadata mapping
Info |
---|
Metadata mapping and creation of a metadata schema will likely require involvement of a semantic expert, data steward or equivalent. |
...
To map your metadata you first need to understand the structure of your metadata and their semantic meaning and the ontology (vocabulary) used to to describe your data in a Resource Description Framework (RDF), in our case DCAT V3, format. The general outline of the mapping pipeline can be found here: https://health-ri.atlassian.net/wiki/spaces/FSD/pages/edit-v2/290291734?draftShareId=ff45a2e2-80ee-49aa-b6d6-c04dedb6f9f8
Next steps
After mapping/transforming your data properties to the classes and variables of the HRI model, you need to validate your model. This step ensures that the new model both accurately represent the original data as well as adheres to the HRI metadata structure.
Once your RDF data is ready, you can publish it to FAIR Data Point, where it can be harvested by the Catalogue. More information about this step can be found here: 3. Exposing metadata
Additional resources
Technical details on DCAT AP and FAIR Datapoints - Youtube video, Health-RI
HRI Github - You can find recourses and examples on the Health-RI metadata Github.
Resources from the EU Open Data Explained, including a general training on metadata and basic and advanced level resourses on DCAT and DCAT-AP.
FAIR Metrolines (note: some pages under developement):
Metroline Step: Register resource level metadata
Metroline Step: Analyse data semantics
Panel | ||||||||
---|---|---|---|---|---|---|---|---|
| ||||||||
Questions?If you have questions about the onboarding process or would like to learn more. Reach out to our https://www.health-ri.nl/health-ri-servicedesk |
...