DRAFT - WORK IN PROGRESS

Example of Competency Questions used for designing core metadata schema (current release)

As part of creation of core metadata schema (based on DCAT-AP), Health-ri drafted a set of competency questions which showcase what minimum core metadata is needed and that can be useful for fulfilling the needs for finding and searching a different types of research resource.

We first present a table of our competency question list with the setting they correspond to and answers, and, we then present SPARQL query implementations for these questions.

The requirements covers the minimum needs to fulfill the requirements for finding (discoverability) research resources in particular datasets on health-ri portal.

The core metadata schema provides generic, top level elements and is independent of the health and scientific domain.

The four main categories of requirements are:

Req category	Example Competency Question	Candidate Answers
Findability	Q1. What types of research resources are available in the catalog?
Findability	Q2. What metadata fields are used to describe each resourece?
reuse	Q3. What types of data formats are supported in the catalog?
Findability	Q4. How can users search for datasets based on their domain or subject matter?
Findability	Q5. Can users search for datasets by the creator or author person or institution?
Findability	Q6. How can users filter datasets by publication date or update date?
Findability	Q7. Are there any restrictions or licensing information associated with the datasets, and can users filter by licensing type?
Findability	Q8. Can users search for datasets based on geographic location or spatial extent?
Findability	Q9. How can users discover datasets related to specific research topics or keywords?
Findability and reuse	Q10.Are there any APIs or programmatic access methods available for retrieving dataset information?
Findability	Q11. What are the available methods for accessing and downloading datasets?
Findability	Q12. How are datasets categorized or grouped into thematic collections or categories?
Findability	Q13. Can users find datasets based on their data source or origin?
Findability	Q14. Are there any specific quality or validation criteria associated with the datasets, and can users filter by dataset quality?
Findability	Q15. Can I find datasets related to specific research initiative or institution or Project? what can be the search keywords?
Findability	Q16. What data transformation or preprocessing options are available through the service (e.g., data aggregation, filtering, sorting)?
Findability	Q17. What other research resources can be searched in the catalogue?

Given these generic questions, we collected the core nomenclature in order to use for classification and modelling purposes. The terminology sheet is accessible here.

https://docs.google.com/spreadsheets/d/1KKfAxn4ftoOAM2v3WsqT2XcPhdmTjnf1BZkvFf9FqF8/edit#gid=0

Requirement set for Core Metadata Schema

Example of Competency Questions used for designing core metadata schema (current release)

SPARQL QUERIES