5. Metadata harvesting
Introduction
In this section we describe the harvesting process of the onboarding of metadata to the National Health Data Catalogue. To reach this step a your metadata should be mapped to the Health-RI metadata schema. https://health-ri.atlassian.net/wiki/spaces/FSD/pages/279281676/Metadata+mapping and FDP should be implemented with metadata exposed: https://health-ri.atlassian.net/wiki/spaces/FSD/pages/279183386/Exposing+metadata .
Validate metadata
The metadata needs to conform to Health-RI specification. You can use the validator to validate the metadata in your FDP.
The metadata can be validated with
Health-RI RDF Validator (see screenshot below)
The metadata validator does not traverse through the pages withing the FDP. Each page should be validated separately. If multiple datasets, catalogs etc. are present, validate at least one full pathway (ie. Catalog>Dataset>Distribution).
For Content to validate, select βURIβ. At URI, enter the URI provided by the data holder.
Content syntax: leave default
Validate As: select the metadata version you would like to validate against.
Make sure thereβs β0 errorsβ
If metadata does not pass validation, this must be fixed first by the data holder unless given dispensation by both data holder and Health-RI team. If you have issues, please contact servicedesk@health-ri.nl
Β
Contact Health-RI service desk
To harvest the exposed metadata the data holder/provider contacts the Health-RI service desk with an onboarding request and includes the details of the FAIR Data Point to harvest by sending an email to servicedesk@health-ri.nl.
Health-RI then performs the harvesting.
Check the harvested metadata
The metadata is harvested into a testing environment where a check of the data is performed by HRI and the data holder/provider, before reaching the Catalogue. You will receive an email requesting a check of the harvested metadata in the testing environment. If we encountered any issues during the harvesting, we will also notify you. If issues have been identified by the data holder or Health-RI they should be fixed and the process repeated. The Catalogue is currently updated daily for changes in the available FDPs, so changes in the metadata can take up to 24 hours to update.
Β
Metadata is onboarded
If no issues are detected during the harvesting and the harvesting to production environment is approved by the data holder, the data is then onboarding the National Health Data Catalogue. Data can be added to the production environment after 72 hours in the acceptance environment if approved by the data holder.
Β
Questions?
If you have questions about the onboarding process or would like to learn more. Reach out to our Health-RI Servicedesk | Health-RI
Β