Status | ||||
---|---|---|---|---|
|
Short
...
description
‘Human resources are the most important part of the FAIRification process. Having a team with the right skillset will play an important role in achieving your FAIRification goals.’(FAIRopoly)
To be able to reach your FAIRification goals, having a team with the right skillset is important. The composition of the team depends on the exact goals and different skills may be necessary in different phases of the of the process. See, for example:
FAIRopoly – FAIRification Guidance for ERN Patient Registries ;
FAIR in action - – a flexible framework to guide FAIRification.
In this step we present a list of common roles and resources involved in the FAIRification process listed by expertise and by FAIR principle. This will help you identify which team members and expertise are required and available (or missing) in your team.
Since a FAIR data steward is essential for reaching the FAIRification goals, the step a separate step has been dedicated to this role. See “Metroline Step: Have a FAIR data steward on board” for details on this crucial role.
Why is this step important
FAIRification is a complicated process and requires expertise from a variety of fields. Hence, assembling Assembling the right team is essential to meet your goalsFAIR objectives.
Expertise requirements for this step
TODO
How to
[Mijke: Another RDMkit page on this: https://rdmkit.elixir-europe.org/dm_coordination ]
[Sander] Would it make sense that, if we mention roles in this section in other pages, these roles are actually specified in this page’s How to? We could even create hyperlinks to this page.
RDMkit has a nice section about Roles in Data Management (with more details than I copied below) [Mijke coordinated/wrote most of it this]
In this section, information is organised based on the different Step 1
Define the FAIRification Objectives you want to reach in your project. These objectives define which FAIR Metroline steps are relevant and each step suggests the expertise necessary.
Step 2
The table below gives an overview of many roles a professional can have in research data management. You In this table you will find:
A description of the main tasks usually handled by each role.
A collection of research data management responsibilities for each role.
Links to RDMkit guidelines and advice (where applicable) on useful information for getting started with data management specific to each role.
Roles:
Data Steward/Data manager: Individuals responsible for managing and curating research or healthcare data within organizations or projects. Job title and exact activities and responsibilities vary between organisations. In the Metroline steps we will refer to this role as “data steward”. Details on this role in the team are described in a separate step “Have a FAIR data steward on board”.
Policy maker: Decision-maker involved in shaping data management policies that promote FAIR data practices.
Principal Investigator: leads a clinical trial or research project. Responsible for following the data management requirements and guidelines of the organisation and/or funder. Decisions regarding data management are documented in the DMP (data management plan).
Researcher /scientist: Professionals involved in collecting, analyzing, and sharing data as part of a clinical trial, research project or other scientific endeavors.
Information Professionals: Librarians, archivists, and information scientists involved in organizing and preserving data assets.
IT and Systems Administrators: Professionals responsible for maintaining data infrastructure and ensuring technical compatibility and accessibility for an organisation or department.
Research Software Engineer: Research software engineers (RSE) in the life sciences design, develop and maintain software systems that help researchers manage their software and data. The RSE’s software tools and infrastructure are critical in enabling scientific research to be conducted effectively.
Trainer: Professionals involved in training and educating others, such as PhD students, postdocs, researchers, technicians and PIs. In case of FAIR related training this includes practices for managing and sharing data.
...
Expert
...
Description
...
Metroline Steps
...
Clinicians specialised in the domain
May have relevant expertise about:
Access policies applicable to the resource
Semantic data modelling
The data to be FAIRified and how they are managed
The domain and on what is a data resource is used for
Has understanding/knowledge about:
or
May have relevant expertise about:
I changed to the latter, since they don’t necessarily always have the knowledge you’re looking for. Let’s decide on Friday.
Domain Experts are individuals who possess deep knowledge and expertise in a particular domain or industry. They have a deep understanding of the intricacies, challenges, and nuances of their field. Their expertise comes from their years of experience and interactions within their specific domain. (copy-paste)
...
Data Steward/Data manager
Individuals responsible for managing and curating research or healthcare data within organizations or projects. Job title and exact activities and responsibilities vary between organisations. In the Metroline steps we will refer to this role as “data steward”. Details on this role in the team are described in a separate step “Have a FAIR data steward on board”.
Has understanding/knowledge of:
The data to be FAIRified and how they are managed
Global standards applicable to the data resource interoperability
Global standards for data access
Semantic data modelling
The data to be FAIRified and how they are managed
The FAIRification process (guiding and monitoring it)
Access policies applicable to the resource
Global standards applicable to the data resource interoperability
Global standards for data access
Semantic data modelling
The data to be FAIRified and how they are managed
The FAIRification process (guiding and monitoring it)
...
A data manager is a professional who oversees the development and use of data systems, ensuring effective data management, secure procedures, and data analysis. They enforce policies, establish data sharing rules, and troubleshoot data-related issues for organizations (copy-pasted).
...
EDC system specialist
Has understanding/knowledge about:
...
Architectural features of the software that is (or will be) used for managing the data
...
FAIR software services and their deployment
...
Global standards for data access
...
Global standards applicable to the data resource interoperability
The data to be FAIRified and how they are managed
I’m not sure what job this is (something you could find on e.g. indeed) Part of Clinical Data Manager? If I look here in example 3 that seems to overlap?
We could also write our own description, e.g.:
A professional who has knowledge of EDC systems.
...
FAIR data stewards
...
Maybe we can add FAIR and local data stewards as 1 entry here - data stewards or perhaps (FAIR) data stewards? We keep the list (add the “access policies” entry to make it complete?). It’s probably easier to discuss data stewards on the separate page, also given the distinction made in both Fieke’s link and rdmkit
...
Health-RI expert team
Has understanding/knowledge about:
FAIR software services and their deployment
...
Should HRI expert team be in here?
...
Institutional Ethical Review Board
Has understanding/knowledge about:
Access policies applicable to the resource
...
Patient advocate for the domain
Has understanding/knowledge about:
The data to be FAIRified and how they are managed
The domain and on what is a data resource is used for
...
Semantic data modelling specialists
Has understanding/knowledge about:
Semantic data modelling
...
Senior expert of standards for automated access protocols and privacy preservation
Has understanding/knowledge about:
Global standards for data access
...
Senior healthcare interoperability expert
Has understanding/knowledge about:
Global standards applicable to the data resource interoperability
...
Software developer = Research Software Engineer
Research software engineers (RSE) in the life sciences design, develop and maintain software systems that help researchers manage their software and data. The RSE’s software tools and infrastructure are critical in enabling scientific research to be conducted effectively.
Has understanding/knowledge about:
Architectural features of the software that is (or will be) used for managing the data
FAIR software services and their deployment
...
(copy-pasted)
In the FAIRification objectives step the following expertise is mentioned:
...
Domain expert; provides context to the FAIRification efforts from the perspective of a domain
...
Data stewards; helps defining FAIR objectives to meet the project’s, funder’s, journal’s and/or institute’s requirements
...
ELSI experts, help identifying the legal compliance and ethical aspects of your FAIR objectives.
[Generic]
Data FAIRification requires different types of expertise and should therefore be carried out in a multidisciplinary team guided by FAIR data steward(s). The different sets of expertise are on i) the data to be FAIRified and how they are managed, ii) the domain and the aims of the data resource within it, iii) architectural features of the software that is (or will be) used for managing the data, iv) access policies applicable to the resource, v) the FAIRification process (guiding and monitoring it), vi) FAIR software services and their deployment, vii) data modelling, viii) global standards applicable to the data resource, and ix) global standards for data access. A good working approach is to organize a team that contains or has access to the required expertise. The core of such a team may be formed by data stewards, with at least expertise of the local environment and of the FAIRification process in general.
→ In list form, expertise required:
The data to be FAIRified and how they are managed,
the domain and the aims of the data resource within it
architectural features of the software that is (or will be) used for managing the data
access policies applicable to the resource
the FAIRification process (guiding and monitoring it),
FAIR software services and their deployment,
data modelling,
global standards applicable to the data resource
global standards for data access.
[RDMkit]
Perhaps: https://rdmkit.elixir-europe.org/dm_coordination
[Health-RI_FAIRification_Step_Report]
Expertise and Example Experts - Source: [De Novo]
...
...
Expertise/Knowledge
...
Example Experts
...
a
...
On the data to be FAIRified and how they are managed
...
Local data steward
FAIR data steward
Data manager
EDC system specialist
Clinicians specialised in the domain
Patient advocate for the domain
...
b
...
On the domain and on what a data resource is used for
...
Clinicians specialised in the domain
Patient advocate for the domain
...
c
...
On architectural features of the software that is (or will be) used for managing the data
...
EDC system specialist
Software developer
...
d
...
On access policies applicable to the resource
...
Local data steward
Clinicians specialised in the domain
Institutional Ethical Review Board
...
e
...
On the FAIRification process (guiding and monitoring it)
...
Local data stewards
FAIR data stewards
...
f
...
On FAIR software services and their deployment
...
EDC system specialist
Software developer
Health-RI expert team
...
g
...
On semantic data modelling
...
Local and FAIR data steward
Semantic data modelling specialists
Clinicians specialised in the domain
...
h
...
On global standards applicable to the data resource interoperability
...
Local and FAIR data stewards
EDC system specialist
Senior healthcare interoperability expert
...
i
...
On global standards for data access
...
Local data and FAIR stewards
EDC system specialist
Senior expert of standards for automated access protocols and privacy preservation
Resource glossary
Tool/Standard # can be used to #
Goal Modelling (see link) is a standard that can be used to represent goals that are connected to each other and it helps defining clear FAIRification objectives for both research question and process perspectives.
FAIR data point (see link) is a tool guarantees many FAIR principles and can be used to describe metadata completely in accordance to the DCAT standard, you can create and publish metadata in the FAIR data point which is a searchable and indexable resource (see fair data index, every fair data point is indexed in the fair data index),
DCAT (see link) is a standard to describe metadata of, from detail to general levels: distribution, dataset, catalogue
RDF (see link) extensible knowledge representation model is a way to describe and structure datasets
Smart Guidance (see link) is a tool that defines the specific steps for RD registries data FAIRification
Semantic data model for (e.g. Data model for set of common data elements for rare disease registration, Data model for Omics data, data model for WHO Rapid COVID CRF, Data models from EBI in the ‘documentation’ links on this page http://www.ebi.ac.uk/rdf/)
Practical Examples from the Community
Example team:
The VASCA FAIRification core team consisted of a local data steward, an external FAIR data steward, and an EDC system specialist. Throughout the project, additional expertise was consulted, such as a clinician specialised in vascular anomalies, the Institutional Ethical Review Board, FAIR software developers, and researchers. A full overview of the different kinds of expertise and which part of the FAIRification process they contributed to can be found in TableS1
Links to demonstrator projects.
References & Further reading
Mijke Jetten, Marjan Grootveld, Annemie Mordant, Mascha Jansen, Margreet Bloemers, Margriet Miedema, & Celia W.G. van Gelder. (2021). Professionalising data stewardship in the Netherlands. Competences, training and education. Dutch roadmap towards national implementation of FAIR data stewardship (1.1). Zenodo. https://doi.org/10.5281/zenodo.4623713
Salome Scholtens, Mijke Jetten, Jasmin Böhmer, Christine Staiger, Inge Slouwerhof, Marije van der Geest, & Celia W.G. van Gelder. (2022). Final report: Towards FAIR data steward as profession for the lifesciences. Report of a ZonMw funded collaborative approach built on existing expertise (Versie 4). Zenodo. https://doi.org/10.5281/zenodo.7225070
Toolkit for building your dream team: “a resource intended to make it as easy as possible to organise a workshop aimed at raising awareness of and facilitating discussion around the diversity of roles that contribute to research”. […] “[t]he knowledge sector is now looking towards a team-based approach bringing together more overtly diverse team members with specific skills in funding, research design, data analysis, data management, software development, research ethics, political relationships, dealing with business, interdisciplinarity, communications etc.” https://research-dream-team-toolkit.readthedocs.io/en/latest/index.html
[FAIRopoly] https://www.ejprarediseases.org/fairopoly/
[FAIRinAction] https://www.nature.com/articles/s41597-023-02167-2
[Generic] https://direct.mit.edu/dint/article/2/1-2/56/9988/A-Generic-Workflow-for-the-Data-FAIRification
Contributors
...
role, including nearly identical roles between brackets;
identical roles are not used on Metroline pages;
if you’re interested in pages that use an identical role (e.g. “data manager”) , look for pages with the main role (e.g. “data steward”);
note that the identical roles mentioned are not exhaustive.
a description of the role;
specific variants of a role, such as “a researcher with domain knowledge”;
in which steps (the variant of) a role is used.
The roles and descriptions in the table are adjusted from the EOSC Digital skills for FAIR and open science report and the NPOS Professionalising data stewardship in the Netherlands: competences, training and education report, Some roles not considered relevant were left out from the table and some that were deemed missing were added. With the with the exception of the researcher and citizen role, the mentioned roles are often summarised as (research) data support professionals.
Role | Description | Usage | Metroline steps |
---|---|---|---|
Researcher (Scientist) | A researcher obtains, processes, produces, deposits and shares research data. | Researcher with domain knowledge |
|
Researcher with XYZ | |||
Data scientist | A data scientist is an expert on data processing, not necessarily from a specific discipline, who is capable of evaluating data quality, extracting relevant knowledge from data and representing such knowledge. | Data scientist | |
Research software engineer | A growing number of people in academia combine expertise in programming with an intricate understanding of research. These Research Software Engineers may start off as researchers who spend time developing software to progress their research or they may start off from a more conventional software-development background and be drawn to research by the challenge of using software to further research. For an elaborate overview of this role see the aforementioned NPOS report, chapter 4. | Research software engineer | |
Infrastructure professional (IT and Systems Administrators) | An infrastructure professional is an IT expert who manages and operates infrastructures and the necessary services for the storage, preservation and processing of data. | Infrastructure professional | |
Trainer (Educator) | A trainer is an expert who designs, organises, shapes content and manages and/or coordinates training activities, participating in the delivery of the training. | Trainer | |
Data curator | A data curator is an expert on the management and oversight of an organisation's entire data to ensure compliance with policy and/or regulatory obligations for longterm preservation and to provide higher-level users with high quality data that is easily accessible in a consistent manner. | Data curator | |
Data steward (Data librarian, Data manager) | A person responsible for keeping the quality, integrity, and access arrangements of data and metadata in a manner that is consistent with applicable law, institutional policy, and individual permissions. Data stewardship implies professional and careful treatment of data throughout all stages of a research process. A data steward aims at guaranteeing that data is appropriately treated at all stages of the research cycle (i.e., design, collection, processing, analysis, preservation, data sharing and reuse). Details on this role in the team are described in a separate step Have a FAIR data steward on board. | FAIR data steward |
|
Semantic expert (Metadata expert, interoperability expert) |
| ||
Data steward with EDC knowledge | |||
Citizen | Citizens in this context are any kind of people having interest in one or several scientific disciplines (including, but not limited to, the open source community or commercial companies undertaking research), who want to get information or contribute to a citizen science initiative or other initiatives of general public interest, or have their own interest in learning and addressing a specific challenge which is not part of his/her professional activity. | Citizen with domain knowledge |
|
Policy maker | Policy makers gather information through consultation and research, and reduce and extract from the information a policy, set of policies or a strategic framework which serve to promote a preferred course of action and could include financial support to research. | Policy maker | |
ELSI expert | ELSI experts provide guidance and answers to the ethical, legal and social implications of research. | ELSI expert |
|
To members of the Writing group: if the necessary expertise cannot be found in the table above, check the one below. If you need one of the roles described there, let Sander/Mijke/Jolanda know.
If you still cannot find a suitable role, tell us what role you need and we can discuss where/how it should be added.
Expert | Description | Metroline Steps |
---|---|---|
Institutional Review Board (IRB) / Medical Ethics Review Committee (METC) | Evaluate research protocols and ensure the research complies with regulatory requirements and ethical standards. For research to which the WMO (Medical Research Involving Human Subjects Act) is applicable, evaluation must be done by an accredited METC or by the CCMO (Central Committee on Research Involving Human Subjects). <On access policies applicable to the resource> | |
Leads a clinical trial or research project. Responsible for following the data management requirements and guidelines of the organisation and/or funder. Decisions regarding data management are documented in the DMP (data management plan). |
Expertise requirements for this step
To be able to define your team, you may need the experts described below.
Project manager. Knows the goals of the project and can help decide what team members are necessary to reach those goals.
HR. Involved when hiring new people.
Practical examples from the community
VASCERN (European Reference Network on Rare Multisystemic Vascular Diseases) describe the team used for the VASCA (Vascular Anomalies Registry) FAIRification in their De Novo paper, with a detailed description available in the paper’s supplementary material, table S1.
VASCA is a demonstrator project. More information can be found on its demonstrator page on the Health-RI website.
Training
Research Dream Team Toolkit: a resource for organising a workshop around the diversity of roles that contribute to research.
The Turing Way handbook contains several chapters with lessons and recommendations on research teams: E.g. Teamwork; Research Infrastructure Roles; Team manual, etc.
More relevant training will be added in the future if available.
Suggestions
Visit our How to contribute page for information on how to get in touch if you have any suggestions about this page.