Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Req category

...

Status
colourRed
titleDRAFT - WORK IN PROGRESS

Example of Competency Questions used for designing core metadata schema (current release)

As part of creation of core metadata schema (based on DCAT-AP), Health-ri drafted have crafted a set of competency questions which showcase how the minimum core metadata schema can be useful for fulfilling the needs for finding and searching a different types of research resource. We first present a table of our competency question list with the setting they correspond to and answers, and, we then present SPARQL query implementations for these questions.

The requirements covers the minimum needs to fulfill the requirements for finding (discoverability) research resources in particular datasets on health-ri portal.

The core metadata schema provides generic, top level elements and is independent of the health and scientific domain.

The four main categories of requirements are:

findability

reusability

Interoperability

access

status: in development

Core metadata requirements

The initial goal and functionality that the metadata model must support is to enable the users in finding (Findability) and access/requesting (Access, Request) the research resource on the national catalogue.

We collected the metadata requirements in the form of competency queries.

Req category

Example of Competency Question

Candidate Answers

Findability

F

Q1. What types of datasets are available in the catalog?

Findability

Q2. What metadata fields are used to describe each dataset?

reuse

Q3. What types of data formats are supported in the catalog?

Findability

Q4. How can users search for datasets based on their domain or subject matter?

Findability

Q5. Can users search for datasets by the creator or author person or institution?

Findability

Q6. How can users filter datasets by publication date or update date?

Findability

Q7. Are there any restrictions or licensing information associated with the datasets, and can users filter by licensing type?

Findability

Q8. Can users search for datasets based on geographic location or spatial extent?

Findability

Q9. How can users discover datasets related to specific research topics or keywords?

Findability and reuse

Q10.Are there any APIs or programmatic access methods available for retrieving dataset information?

Findability

Q11. What are the available methods for accessing and downloading datasets?

Findability

Q12. How are datasets categorized or grouped into thematic collections or categories?

Findability

Q13. Can users find datasets based on their data source or origin?

Findability

Q14. Are there any specific quality or validation criteria associated with the datasets, and can users filter by dataset quality?

Findability

Q15. Can I find datasets related to specific research initiative or institution or Project? what can be the search keywords?

Findability

Q16. What data transformation or preprocessing options are available through the service (e.g., data aggregation, filtering, sorting)?

Findability

Q17. What other research resources can be searched in the catalogue?

...

What keywords, or topics are there to discover the datasets?

Collect vocabularies and their relations for defining the health research domains (health, clinical, omics, imaging)

F

What types of research resources can i search for?

Dataset, Distributions, Registries, Bio Samples, Questionnaires, Codes, Protocols, Image collections,

F

Can user search for datasets based on temporal information?

date published? date range?

F

What are the comprehensive set of metadata elements to describe research datasets?

dataset characteristics, title, description, theme

F

Which institution, Research group is the provider of this dataset?

F, Reuse

What processes has been done on the dataset? what services can be shared with the dataset?

dataset service, provenance and lineage of the data

F, Reuse

What are the versions of a dataset?

Access

What are the types of accessibility of a dataset, including licensing and access restrictions?

types of license,

A, Reuse

Can user specify the data format and data distribution methods for requesting the dataset?

distributions

Reuse

what are the information on how to access and download datasets

Joining dataset

Can users discover relationships between different datasets?

Reuse

Can I track changes or modifications made to a dataset over time?

access/reuse

Can I find information on data usage policies and terms related to a dataset?

F, interoperabiltity

Can I find the related resources, such as publications. study, project, services to the dataset?

Interoperability

can dataset metadata be integrated with other data catalogs or systems?

F/interoperabiltiy

what is the URIs for metadata records (instances) (global unique, persistent and resolvable)

URIs naming conventions for the metadatasets

F/interoperability

What is URIs for the metadata schemas ?

URIs naming convenstions for the model

F-Interoperabolity

What schema language is used for formal representing of generic concepts

RDFS, OWL, SKOS,

F-Interoperabolity

What schema language is used for defining data validating rules, constraints, profilings and …

Shacl

Given these generic requirements, we collected the core terms, definitions to be used for semantic modelling.

https://docs.google.com/spreadsheets/d/1KKfAxn4ftoOAM2v3WsqT2XcPhdmTjnf1BZkvFf9FqF8/edit#gid=0

...

Health Specific requirements

Domain Specific requirements

Imaging