...
To make information scattered across different sources findable and reusable, the research community in the Netherlands has agreed to index its resources in a national catalogue. The national catalogue in turn aims to be indexed in international catalogues. One of the necessary conditions for achieving this goal is a common agreement on the minimum elements needed to search, find and reuse such resources. These elements and their properties are components of the Core Metadata Schema.
...
The Core Metadata Schema is a formal, shared conceptualisation of the requirements for finding and reusing information across Health-RI nodes via the national catalogue. It represents a minimal set of elements for describing each resource (including datasets) with common metadata. This scope is consistent with the functionality provided in the Plateau 1 National Catalogue release package.
...
This version of the Core Metadata Schema includes DCAT v3 and selected mandatory DCAT-AP classes with their definitions. The most important entities are those that form the core of the DCAT application profile. DCAT-AP is a DCAT application profile for exchanging information about catalogues of datasets and descriptions of data services in Europe. This version is to be reviewed by the community of nodes that make up the Health-RI ecosystem. Its final release will be a prerequisite for resources to be included in the national catalogue and to benefit from the features it offers. How to implement this model and connect to the national catalogue is described in the onboarding documentation. The features offered by the national catalogue are defined in the Plateau 1 documentation.
...
- dcat:Dataset is a type of rdfs:Resource representing a collection of data.
- dcat:Distribution is a type of rdfs:Resource representing an available form or representation of a dataset.
- dcat:Catalog is a type of rdfs:Resource representing a collection of datasets.
- dcat:DataService, introduced in DCAT version 2, is a type of Resource representing a service for accessing data.
- foaf:Project is a type of rdfs:Resource representing project-level information.
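The prefixed class names in the list above are shorthand for full IRIs. The sketch below shows one way to expand them, using the standard namespace IRIs for DCAT, FOAF and RDF Schema. The `PREFIXES` table and the `expand` helper are illustrative names, not part of the schema.

```python
# Standard namespace IRIs for the prefixes used in the class list.
PREFIXES = {
    "dcat": "http://www.w3.org/ns/dcat#",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
}


def expand(curie: str) -> str:
    """Expand a prefixed name such as 'dcat:Dataset' into a full IRI."""
    prefix, sep, local = curie.partition(":")
    if not sep or prefix not in PREFIXES:
        raise ValueError(f"unknown or missing prefix in {curie!r}")
    return PREFIXES[prefix] + local


# The five classes named in the list, expanded one by one.
for name in ("dcat:Dataset", "dcat:Distribution", "dcat:Catalog",
             "dcat:DataService", "foaf:Project"):
    print(name, "->", expand(name))
```

Keeping the prefix table in one place avoids scattering hard-coded IRIs through code that produces or consumes catalogue records.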
...
Class name | Definition | Usage Note | URI |
---|---|---|---|
Dataset | A resource type. | Used to describe one or more datasets and their details. A single dataset can be made available to potential users in different ways. How the data in a dataset can be accessed is defined in the Distribution. | |
Catalogue | A catalogue that is listed in the National Catalogue. | Used to describe a bundle of datasets, data services, biobanks, patient registries, or guidelines together under a single title. | |
Agent | An entity that is associated with Catalogues and/or Datasets. | If the Agent is an organisation, the use of the Organization Ontology is recommended. http://www.w3.org/TR/2013/CR-vocab-org-20130625/ | |
Resource | Anything described by RDF. | This is an abstract class. We do not use this class directly; instead we use specialisations of it (e.g. Dataset). It serves mainly for high-level grouping and the reuse of properties. | |
...
Class name | Definition | Usage Note | URI |
---|---|---|---|
Distribution | A physical embodiment of the Dataset in a particular media format. | Used to describe the different ways in which a single dataset can be made available, i.e. it can be downloaded or accessed online in one or more distributions (e.g. one as a downloadable .csv file, another as an access or query webpage). | |
Dataset Series | A resource type. Dataset series are defined in [ISO-19115] as a collection of datasets […] sharing common characteristics. However, their use is not limited to geospatial data, although in other domains they can be named differently (e.g., time series, data slices) and defined more or less strictly (see, e.g., the notion of "dataset slice" in [VOCAB-DATA-CUBE]). | With "dataset series" we refer to data, somehow interrelated, that are published separately. An example is budget data split by year and/or country, instead of being made available in a single dataset. | |
Data Service | A resource type. | The kind of service can be indicated using the DRAFT EXAMPLE: Health-RI offers ?? distinct service categories: one that allows querying of secure data/metadata through an interface, and another that offers analytical or statistical insights without directly serving the data, especially if it is sensitive. The first type is linked to the Dataset record it derives from, while the second is connected to a Catalogue. | |
Project | A project (a collective endeavour of some kind). | Used to describe a project that is connected to one or more datasets. A resource type. | |
...
Core Metadata Schema Properties per Class
...
Catalogue
A curated collection of metadata about resources. A web-based data catalogue is typically represented as a single instance of this class.
...
Property name | Definition | URI | rdfs:Range | Usage Note | Cardinality |
---|---|---|---|---|---|
dataset | Relates every catalogue to the datasets it contains. | | | The connection to the one or more datasets that this catalogue describes. | 1..* |
description | A free-text account of the record. | | | A brief description of the catalogue. It can consist of multiple strings. For example: "This catalogue describes breast cancer imaging datasets." | 1..* |
publisher | The entity responsible for making the resource available. | | | The organisation or person that has published the catalogue. | 1..* |
title | A name given to the resource. | | | The name of the catalogue. This is a required field and needs to be unique. | 1..* |
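The Cardinality column can be checked mechanically. The following is a minimal sketch, assuming a catalogue record is held as a plain dictionary that maps each property name from the table to a list of values. The record layout, the function names and the example values are hypothetical; only the property names and cardinalities come from the table.

```python
from typing import Dict, List, Optional, Tuple


def parse_cardinality(text: str) -> Tuple[int, Optional[int]]:
    """Turn '1..*' into (1, None) and '1..1' into (1, 1); None means unbounded."""
    low, sep, high = text.partition("..")
    if not sep:
        raise ValueError(f"not a cardinality: {text!r}")
    return int(low), None if high == "*" else int(high)


# Catalogue properties and cardinalities, copied from the table above.
CATALOGUE_PROPERTIES = {
    "dataset": "1..*",
    "description": "1..*",
    "publisher": "1..*",
    "title": "1..*",
}


def check_record(record: Dict[str, List[str]],
                 properties: Dict[str, str]) -> List[str]:
    """Return one message per property whose value count violates its cardinality."""
    problems = []
    for name, cardinality in properties.items():
        low, high = parse_cardinality(cardinality)
        count = len(record.get(name, []))
        if count < low:
            problems.append(f"{name}: expected at least {low}, found {count}")
        elif high is not None and count > high:
            problems.append(f"{name}: expected at most {high}, found {count}")
    return problems


# Hypothetical record that lacks a publisher.
example = {
    "title": ["Example catalogue"],
    "description": ["This catalogue describes breast cancer imaging datasets."],
    "dataset": ["https://example.org/dataset/1"],
}
print(check_record(example, CATALOGUE_PROPERTIES))
```

The same `check_record` function can be reused for the other classes by passing a different property table, for instance one in which `identifier` carries the cardinality `1..1`.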
...
Property name | Definition | URI | rdfs:Range | Usage Note | Cardinality |
---|---|---|---|---|---|
contact point | Relevant contact information for the catalogued resource. | | | Contact information that can be used, for example, for sending requests for further information about or access to the Dataset. | 1..* |
creator | The entity responsible for producing the resource. | | | An agent (person or organisation) responsible for producing the Dataset. | 1..* |
description | A free-text account of the record. | | | A free-text description of the Dataset. This property can be repeated for parallel language versions of the description. | 1..* |
release date | Date of formal issuance (e.g., publication) of the resource. | | | NA | 1..* |
identifier | A unique identifier of the resource being described or catalogued. | | | The main identifier for the Dataset, e.g. the URI or other unique identifier in the context of the Catalogue. | 1..1 |
modified | Most recent date on which the catalogue entry was changed, updated or modified. | | | The most recent date on which the Dataset was changed or modified. | 1..* |
publisher | The entity responsible for making the resource available. | | | An agent (organisation or person) responsible for making the Dataset available. | 1..* |
theme | A main category of the resource. A resource can have multiple themes. | | | One or more IRIs (links), separated by commas. When set, they specify relevant ontology concepts that classify the Dataset. Typically, these can be looked up using the Ontology Lookup Service (OLS) or BioPortal. | 1..* |
title | A name given to the record. | | | A name given to the Dataset. This property can be repeated for parallel language versions of the name. | 1..* |
type | The nature or genre of the resource. | | | A type of the Dataset. A recommended controlled vocabulary data-type is foreseen. | 1..* |
license | A legal document under which the resource is made available. | | | This should contain a URL that provides details regarding the license that applies to this Dataset. | 1..* |
relation | Defines a relation. | | foaf:Project | | 1..* |
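To illustrate what a dataset description could look like once serialised, the sketch below writes a few literal-valued properties as Turtle using only the standard library. The URI column is empty in the table above, so the mapping to `dcterms:title`, `dcterms:description` and `dcterms:identifier` is an assumption based on the DCAT vocabulary. The dataset IRI, the record values and the function name are hypothetical.

```python
import json
from typing import Dict, List

PREFIXES = {
    "dcat": "http://www.w3.org/ns/dcat#",
    "dcterms": "http://purl.org/dc/terms/",
}

# Assumed mapping from table property names to DCAT/DCTERMS terms.
LITERAL_PROPERTIES = {
    "title": "dcterms:title",
    "description": "dcterms:description",
    "identifier": "dcterms:identifier",
}


def dataset_to_turtle(iri: str, record: Dict[str, List[str]]) -> str:
    """Serialise the literal-valued properties of one dataset as Turtle."""
    lines = [f"@prefix {prefix}: <{ns}> ." for prefix, ns in PREFIXES.items()]
    lines.append("")
    statements = [f"<{iri}> a dcat:Dataset"]
    for name, term in LITERAL_PROPERTIES.items():
        for value in record.get(name, []):
            # json.dumps is used here as a simple way to quote and escape
            # the string; this covers common cases, not every Turtle edge case.
            statements.append(f"    {term} {json.dumps(value, ensure_ascii=False)}")
    lines.append(" ;\n".join(statements) + " .")
    return "\n".join(lines)


# Hypothetical dataset record.
print(dataset_to_turtle(
    "https://example.org/dataset/1",
    {
        "title": ["Example imaging dataset"],
        "description": ["A hypothetical dataset used for illustration."],
        "identifier": ["https://example.org/dataset/1"],
    },
))
```

A production pipeline would more likely use an RDF library and cover the IRI-valued properties (publisher, theme, license and so on) as well. The string-building approach here is only meant to make the shape of the output visible.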
...
Property name | Definition | URI | rdfs:Range | Usage Note | Cardinality |
---|---|---|---|---|---|
dataset distribution | An available distribution of the dataset. | | | Use this property to point to the distribution of this dataset when a distribution is available. | 0..* |
project | Connects the dataset to the corresponding projects. | | | Use this property to point to the related project of this dataset when a project is available. | 0..* |
has version | This resource has a more specific, versioned resource [PAV]. | | | This property refers to a related Dataset that is a version, edition, or adaptation of the described Dataset. | 0..* |
in series | A dataset series of which the dataset is part. | | | NA | 0..* |
Data Service
A collection of operations that provides access to one or more datasets or data processing functions.
...
Property name | Definition | URI | rdfs:Range | Usage Note | Cardinality |
---|---|---|---|---|---|
access URL | A URL of the resource that gives access to a distribution of the dataset, e.g. a landing page, feed or SPARQL endpoint. | | | This property contains a URL that gives access to a Distribution of the Dataset. The resource at the access URL may contain information about how to get the Dataset. | 1..* |
media type | The media type of the distribution as defined by IANA [IANA-MEDIA-TYPES]. | | | This property SHOULD be used when the media type of the distribution is defined in IANA [IANA-MEDIA-TYPES], otherwise | 1..* |
title | A name given to the distribution. | | | The name of the dataset in combination with the format of the distribution can be used. | 1..* |
description | A unique identifier of the resource being described or catalogued. | | | NA | 1..* |
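Two of the properties above lend themselves to a simple syntactic check before a record is submitted: the access URL should be an absolute URL, and the media type should have the `type/subtype` shape used by IANA. The sketch below assumes HTTP(S) access URLs and checks syntax only; it does not verify that a media type is actually registered with IANA. The function names are illustrative.

```python
import re
from urllib.parse import urlparse

# 'type/subtype' built from the characters allowed in media type names.
MEDIA_TYPE = re.compile(
    r"^[A-Za-z0-9][A-Za-z0-9!#$&^_.+-]*/[A-Za-z0-9][A-Za-z0-9!#$&^_.+-]*$"
)


def is_absolute_http_url(value: str) -> bool:
    """True if the value has an http(s) scheme and a host part."""
    parts = urlparse(value)
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def looks_like_media_type(value: str) -> bool:
    """True if the value is syntactically a 'type/subtype' media type."""
    return MEDIA_TYPE.match(value) is not None


# Hypothetical distribution values.
print(is_absolute_http_url("https://example.org/data/export.csv"))
print(looks_like_media_type("text/csv"))
```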
Recommended Properties
Property name | Definition | URI | rdfs:Range | Usage Note | Cardinality |
---|---|---|---|---|---|
access service | A data service that gives access to the distribution of the dataset. | | | | 0..* |
download URL | The URL of the downloadable file in a given format, e.g. a CSV file or RDF file. The format is indicated by the distribution's | | | NA | 0..* |
Agent
An entity that is associated with Catalogues and/or Datasets. Agents can be individuals or organisations. If the Agent is an organisation, the use of the Organization Ontology is recommended. [link]
Mandatory Properties
Property name | Definition | URI | rdfs:Range | Usage Note | Cardinality |
---|---|---|---|---|---|
name | A name for some thing. | | | This property contains a name of the agent. This property can be repeated for different versions of the name (e.g. the name in different languages). | 1..* |
identifier | A unique identifier of the resource being described or catalogued. | | | | 1..1 |
...
Property name | Definition | URI | rdfs:Range | Usage Note | Cardinality |
---|---|---|---|---|---|
description | Description of the project. | | | NA | 1..* |
identifier | A unique identifier of the resource being described or catalogued. | | | NA | 1..1 |
title | A name given to the resource. | | | NA | 1..* |
funded by | An organization funding a project or person. | | | NA | 1..* |
relation | Link to the project datasets. | | | NA | 1..* |
Recommended Properties
...
All things described by RDF are called resources, and they are instances of the class rdfs:Resource. This is the class of everything; all other classes are subclasses of it. To read more, go to https://www.w3.org/TR/rdf12-schema/#ch_resource.
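The statement that every class is a subclass of rdfs:Resource can be made concrete with a small lookup. The sketch below encodes the five classes from this schema as direct children of rdfs:Resource, following the class list in this document and flattening any intermediate classes that DCAT itself defines. The `SUBCLASS_OF` table and the helper name are illustrative.

```python
from typing import Optional

# Direct parent of each class, following the class list in this document.
SUBCLASS_OF = {
    "dcat:Dataset": "rdfs:Resource",
    "dcat:Distribution": "rdfs:Resource",
    "dcat:Catalog": "rdfs:Resource",
    "dcat:DataService": "rdfs:Resource",
    "foaf:Project": "rdfs:Resource",
}


def is_subclass_of(cls: str, ancestor: str) -> bool:
    """Walk up the parent chain; a class counts as a subclass of itself."""
    seen = set()
    current: Optional[str] = cls
    while current is not None and current not in seen:
        if current == ancestor:
            return True
        seen.add(current)  # guards against accidental cycles in the table
        current = SUBCLASS_OF.get(current)
    return False


print(is_subclass_of("dcat:Dataset", "rdfs:Resource"))
print(is_subclass_of("dcat:Dataset", "foaf:Project"))
```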