STATUS: IN DEVELOPMENT
Short Description
To be able to reach your FAIRification goals, having a team with the right skillset is important. The composition of the team depends on the exact goals and different skills may be necessary in different phases of the of the process. See, for example:
FAIRopoly – FAIRification Guidance for ERN Patient Registries
FAIR in action - a flexible framework to guide FAIRification
In this step we present a list of common roles and resources involved in the FAIRification process listed by expertise and by FAIR principle. This will help you identify which team members and expertise are required and available (or missing) in your team.
Since a FAIR data steward is essential for reaching the FAIRification goals, a separate step has been dedicated to this role. See “Metroline Step: Have a FAIR data steward on board” for details on this crucial role.
Why is this step important
FAIRification is a complicated process and requires expertise from a variety of fields. Hence, assembling the right team is essential to meet your goals.
Expertise requirements for this step
TODO
How to
[Mijke: Another RDMkit page on this: https://rdmkit.elixir-europe.org/dm_coordination ]
[Sander] Would it make sense that, if we mention roles in this section in other pages, these roles are actually specified in this page’s How to? We could even create hyperlinks to this page.
RDMkit has a nice section about Roles in Data Management (with more details than I copied below) [Mijke coordinated/wrote most of it this]
In this section, information is organised based on the different roles a professional can have in research data management. You will find:
A description of the main tasks usually handled by each role.
A collection of research data management responsibilities for each role.
Links to RDMkit guidelines and advice on useful information for getting started with data management specific to each role.
Roles:
Data Steward: Data stewardship is a relatively new profession and a catch-all term for numerous support functions, roles and activities. It implies professional and careful treatment of data throughout all stages of a research process.
Policy maker: As a policy maker, you are responsible for the development of a strategic data management framework and the coordination and implementation of research data management guidelines and practices.
Principal Investigator: As a Principal Investigator (PI), you may have recently acquired project funding. More and more funders require data management plans (DMP), stimulating the researcher to consider, from the beginning of a project, all relevant aspects of data management.
Researcher: Your research data is a major output from your research project, it supports your research conclusions, and guides yourself and others towards future research. Therefore, managing the data well throughout the project, and sharing it, is a crucial aspect of research.
Research Software Engineer: Research software engineers (RSE) in the life sciences design, develop and maintain software systems that help researchers manage their software and data. The RSE’s software tools and infrastructure are critical in enabling scientific research to be conducted effectively.
Trainer: As a trainer, you design and deliver training courses in research data management with a focus on bioinformatics data. Your audience is mainly people in biomedical sciences: PhD students, postdocs, researchers, technicians and PIs.
The VASCA FAIRification core team consisted of a local data steward, an external FAIR datasteward, and an EDC system specialist. Throughout the project, additional expertise wasconsulted, such as a clinician specialised in vascular anomalies, the Institutional Ethical ReviewBoard, FAIR software developers, and researchers. A full overview of the different kinds ofexpertise and which part of the FAIRification process they contributed to can be found in TableS1
Expert | Description | Metroline Steps |
---|---|---|
Clinicians specialised in the domain | May have relevant expertise about:
| Has understanding/knowledge about: I changed to the latter, since they don’t necessarily always have the knowledge you’re looking for. Let’s decide on Friday. Domain Experts are individuals who possess deep knowledge and expertise in a particular domain or industry. They have a deep understanding of the intricacies, challenges, and nuances of their field. Their expertise comes from their years of experience and interactions within their specific domain. (copy-paste) |
Data manager | Has understanding/knowledge about:
| A data manager is a professional who oversees the development and use of data systems, ensuring effective data management, secure procedures, and data analysis. They enforce policies, establish data sharing rules, and troubleshoot data-related issues for organizations (copy-pasted). |
EDC system specialist | Has understanding/knowledge about:
| I’m not sure what job this is (something you could find on e.g. indeed) Part of Clinical Data Manager? If I look here in example 3 that seems to overlap? We could also write our own description, e.g.: |
FAIR data stewards | <See the other page>
| Maybe we can add FAIR and local data stewards as 1 entry here - data stewards or perhaps (FAIR) data stewards? We keep the list (add the “access policies” entry to make it complete?). It’s probably easier to discuss data stewards on the separate page, also given the distinction made in both Fieke’s link and rdmkit |
Health-RI expert team | Has understanding/knowledge about:
| Should HRI expert team be in here? |
Institutional Ethical Review Board | Has understanding/knowledge about:
| |
Local data stewards = Data Steward | Data stewardship is a relatively new profession and a catch-all term for numerous support functions, roles and activities. It implies professional and careful treatment of data throughout all stages of a research process. Has understanding/knowledge about:
| (copy-pasted) |
Patient advocate for the domain | Has understanding/knowledge about:
| |
Semantic data modelling specialists | Has understanding/knowledge about:
| |
Senior expert of standards for automated access protocols and privacy preservation | Has understanding/knowledge about:
| |
Senior healthcare interoperability expert | Has understanding/knowledge about:
| |
Software developer = Research Software Engineer | Research software engineers (RSE) in the life sciences design, develop and maintain software systems that help researchers manage their software and data. The RSE’s software tools and infrastructure are critical in enabling scientific research to be conducted effectively. Has understanding/knowledge about:
| (copy-pasted) |
In the FAIRification objectives step the following expertise is mentioned:
Domain expert; provides context to the FAIRification efforts from the perspective of a domain
Data stewards; helps defining FAIR objectives to meet the project’s, funder’s, journal’s and/or institute’s requirements
FAIR experts, such as metadata/semantics specialists; helps specifying the metadata/modeling aspects of FAIR objectives
ELSI experts, help identifying the legal compliance and ethical aspects of your FAIR objectives.
[Generic]
Data FAIRification requires different types of expertise and should therefore be carried out in a multidisciplinary team guided by FAIR data steward(s). The different sets of expertise are on i) the data to be FAIRified and how they are managed, ii) the domain and the aims of the data resource within it, iii) architectural features of the software that is (or will be) used for managing the data, iv) access policies applicable to the resource, v) the FAIRification process (guiding and monitoring it), vi) FAIR software services and their deployment, vii) data modelling, viii) global standards applicable to the data resource, and ix) global standards for data access. A good working approach is to organize a team that contains or has access to the required expertise. The core of such a team may be formed by data stewards, with at least expertise of the local environment and of the FAIRification process in general.
→ In list form, expertise required:
The data to be FAIRified and how they are managed,
the domain and the aims of the data resource within it
architectural features of the software that is (or will be) used for managing the data
access policies applicable to the resource
the FAIRification process (guiding and monitoring it),
FAIR software services and their deployment,
data modelling,
global standards applicable to the data resource
global standards for data access.
[RDMkit]
Perhaps: https://rdmkit.elixir-europe.org/dm_coordination
[Health-RI_FAIRification_Step_Report]
Expertise and Example Experts - Source: [De Novo]
| Expertise/Knowledge | Example Experts |
a | On the data to be FAIRified and how they are managed |
|
b | On the domain and on what a data resource is used for |
|
c | On architectural features of the software that is (or will be) used for managing the data |
|
d | On access policies applicable to the resource |
|
e | On the FAIRification process (guiding and monitoring it) |
|
f | On FAIR software services and their deployment |
|
g | On semantic data modelling |
|
h | On global standards applicable to the data resource interoperability |
|
i | On global standards for data access |
|
Resource glossary
Tool/Standard # can be used to #
Goal Modelling (see link) is a standard that can be used to represent goals that are connected to each other and it helps defining clear FAIRification objectives for both research question and process perspectives.
FAIR data point (see link) is a tool guarantees many FAIR principles and can be used to describe metadata completely in accordance to the DCAT standard, you can create and publish metadata in the FAIR data point which is a searchable and indexable resource (see fair data index, every fair data point is indexed in the fair data index),
DCAT (see link) is a standard to describe metadata of, from detail to general levels: distribution, dataset, catalogue
RDF (see link) extensible knowledge representation model is a way to describe and structure datasets
Smart Guidance (see link) is a tool that defines the specific steps for RD registries data FAIRification
Semantic data model for (e.g. Data model for set of common data elements for rare disease registration, Data model for Omics data, data model for WHO Rapid COVID CRF, Data models from EBI in the ‘documentation’ links on this page http://www.ebi.ac.uk/rdf/)
Practical Examples from the Community
This section should show the step applied in a real project. Links to demonstrator projects.
References & Further reading
Mijke Jetten, Marjan Grootveld, Annemie Mordant, Mascha Jansen, Margreet Bloemers, Margriet Miedema, & Celia W.G. van Gelder. (2021). Professionalising data stewardship in the Netherlands. Competences, training and education. Dutch roadmap towards national implementation of FAIR data stewardship (1.1). Zenodo. https://doi.org/10.5281/zenodo.4623713
Salome Scholtens, Mijke Jetten, Jasmin Böhmer, Christine Staiger, Inge Slouwerhof, Marije van der Geest, & Celia W.G. van Gelder. (2022). Final report: Towards FAIR data steward as profession for the lifesciences. Report of a ZonMw funded collaborative approach built on existing expertise (Versie 4). Zenodo. https://doi.org/10.5281/zenodo.7225070
Toolkit for building your dream team: “a resource intended to make it as easy as possible to organise a workshop aimed at raising awareness of and facilitating discussion around the diversity of roles that contribute to research”. […] “[t]he knowledge sector is now looking towards a team-based approach bringing together more overtly diverse team members with specific skills in funding, research design, data analysis, data management, software development, research ethics, political relationships, dealing with business, interdisciplinarity, communications etc.” https://research-dream-team-toolkit.readthedocs.io/en/latest/index.html
[FAIRopoly] https://www.ejprarediseases.org/fairopoly/
[FAIRinAction] https://www.nature.com/articles/s41597-023-02167-2
[Generic] https://direct.mit.edu/dint/article/2/1-2/56/9988/A-Generic-Workflow-for-the-Data-FAIRification
Contributors
Dena Tahvildari; Sander de Ridder; Jolanda Strubel; Bruna dos Santos Vieira; Mijke Jetten; Fieke Schoots; Ines De Oliveira Coelho Henriques; Shuxin Zhang; Alberto Cámara; César Bernabé; Joeri van der Velde