Health-RI wiki v4.0 -> consultatie (open tot 03-12-2024)
Glossary
A number of concepts are essential for the correct understanding of the architecture. We now define it here; in the future, these will become part of a general glossary (in English and Dutch) that is kept centrally in the Thesaurus Healthcare and Welfare (TZW in dutch). This glossary is also part of solution 1 of the Obstacle Removal Trajectory. See also this article
In the dutch version of the glossary there are references to the TZW when applicable. Unfortunately the TZW does not support the English language yet, so in this article there are no references to the TZW.
Term | Definition |
---|---|
Adressing service | Generic feature, both for healthcare itself (primary use) and for research & innovation (secondary use), which indicates which parties participate in which functions and how to address them |
Anonymize | Anonymized data are personal data that is processed so that the identifying features are irreversibly removed/hidden so that it is no longer possible to trace it back to a person |
API | An API (Application Programming Interface) is used by software components to exchange data in a formalized way |
Application overview | Generic feature, both for healthcare itself (primary use) and for research & innovation (secondary use), which provides an overview of who has done what with personal health data. |
Authorative source | An authorative source is a record of data that is considered the primary source of that information. |
Catalogue | The roles article refers to the Data Guide Role. This refers to the Catalogue, as implemented in the Health-RI hub as a National Health Data Catalogue. This role performs services that allow a data user to find existing health data that can be used for research with the functions
|
Central data provider review commitee | Generic feature, both for healthcare itself (primary use) and for research & innovation (secondary use), responsible for:
|
Citizen | In the context of the Health-RI ecosystem, a citizen is an individual who controls his or her data in the context of research and innovation. Related roles: client, patient, subject More: Roles | Citizen |
Clinical data | Clinical data is data that has to do with the health or treatment of people. They may come from clinical research, where treatments or other medical interventions are tested on human subjects, or from clinical practice, where Healthcare Professionals or patients collect data about their patients or in the case patients themselves. |
Clinical (patient) research | In this research, an attempt is made to learn more about the condition by conducting research with patients. Examples of clinical research are the collection of clinical data to gain a good insight into the course of a condition. Also comparing the most efficient epilepsy or sleeping medication or identifying the best communication technique are examples of clinical research. |
Cohort | A group of people who share certain characteristics or traits and are the subject of a study or dataset. |
Data-archiver | A data archiver archives data in order to guarantee continuity of data provision in the event of the potential termination of original data source of a data producer. The data archiver role is one of the roles that must be done by the data retention role. The data holder is the controller of the data that the data archiver archives. |
Datacentric approach | A way of looking at where the data is central and not the purpose for which it was collected or used. |
Data cleansing | The process of detecting and correcting or removing corrupt, duplicate, incomplete, incorrect, or irrelevant data from a dataset, table or database. |
Data dictionary | A data dictionary is a codebook or code list: an agreed set of values for a metadata field. |
Data Governance Committee | The Data Governance Committee is preferably one authorized coordinating party that draws up a standard data dictionary for a relevant domain, together with domain experts. The data governance committee determines and manages:
The Data Governance Committee is related to the generic features |
Data guide | The data guide provides services that allow a user to find existing health data that can be used for the research with the functions
Also known as: catalog |
Data holder | A data holder is a person or organization responsible for the management and storage of health data within the Health-RI ecosystem for research and innovation. This can be, for example, a (collection of) hospital(s), a healthcare institution, a government agency, or a research organization that manages and stores health data. The data holder is a collection of roles that correctly make original health data available to the Health-RI ecosystem:
More Roles | Data holder In GDPR the term 'data controller' is used as well for the data holder |
Data management plan | A Data Management Plan (DMP) is a formal document developed at the beginning of your research project that describes all aspects of data management, both during and after the project. It contains, among other things:
source: uu.nl |
Data minimisation | Data minimisation means that a dataset does not contain more data than is necessary for the purpose for which the dataset was requested. |
Data Preparator | Based on the data-centric principle, the data preparator separates the original health data from the original application and prepares the original health data, where necessary, for multiple use (including for use for research and innovation) in a persistent data platform. The data preparator role is one of the roles that must be filled by the data holder role. The data holder is the controller of the data that the data preparator transforms. |
Data processor | Within the Health-RI ecosystem for research and innovation, there are two forms of "Data Processing":
|
Data producer | The data producer produces, creates and stores the original health data, preferably with
For original data sources, where the data user does not have the basis to know personally sensitive information (researchers data sources), the data producer will store the original health data in a pseudonymized or anonymized manner. The data producer role is one of the roles that must be done by the data holder role The data holder is the controller of the data that the data producer produces. |
Dataprovider | The dataprovider sets up the delivery process of the dataset based on the terms of use of the data preparator and the target transformation. The dataprovider role is one of the roles that must be done by the data holder. The data holder is the controller of the data made available by the data provider. More Roles | Dataprovider |
Data-request register | Data request and issue register, in which the reseearch data request service manages the requests. |
Dataset | A dataset is a collection of similar data relating to a group of data subjects. The collection has a certain uniformity, such as the presence of certain data items or data types, and similar data acquisition and processing techniques, so that it makes sense to view the dataset as a group that can be drawn upon for reuse. |
Datasubject | A datasubject is an identifiable natural person, whose health data is collected, processed, stored or shared within the Health-RI ecosystem. This could be, for example, a patient whose medical data is stored and shared between different healthcare professionals, researchers or government agencies. |
Data user | A data user is a natural person who has access to health data within the Health-RI ecosystem for research and innovation, for the performance of specific tasks or purposes. For example, a data user may be a researcher who needs data for scientific research, a public authority that needs data for monitoring public health or a healthcare professional who needs access to health data for diagnosis or treatment. |
Data user institute | A data user institute is the organization to which the data user belongs. The institute is legally responsible/liable for what happens to the data and also the contracting party for any agreements. |
Data visiting | A form of data processing on one (existing) dataset at one location (e.g. Plugin) |
(De-) identification service | Generic feature which pseudonymizes or anonymizes data. |
Distributed processing | In the case of distributed processing, the data user instructs from a central point to perform processing on one or more environments that are set up under the control of each of the data holder(s) concerned. |
Economy of scale | Economies of scale refers to the phenomenon whereby average costs per unit decrease as the scale of production increases. |
FAIR Data Point | FAIR Data Points are used to describe your data sets in a FAIR way, using standard metadata and make them available through simple web protocols. |
FAIR principles | FAIR is an acronym used in the context of data management. FAIR stands for Findable, Accessible, Interoperable and Reusable. |
Feasibility study | A feasibility study is an assessment that determines how likely it is that a dataset will provide added value to a study. If a study concerns, for example, the elderly and they are not included in the dataset or are only included to a limited extent, the researcher may decide not to request the dataset in question. |
Federated analysis coordinator | The federated analysis coordinator provides the tooling to submit the analysis question (in the case of a federated analysis) and ensures that the analysis question becomes available to the federated analysis implementers and that the results of the federated analysis are collected. |
Federated analysis performer | The Federated analysis performer ensures that an applicant's assignment federated analysis is executed locally at a data holder, by
|
Federated processing | Federated processing is a way of processing that does not depend on a centrally coordinated structure, so has no single point of failure or single point of power. |
Generic Features | The nodes from the Health-RI ecosystem use generic features. Preferably, these generic features are also used in healthcare itself (primary use). This promotes transparency and consistency of the application of these services and avoids unnecessary multiple costs for implementing and managing the solutions. The following generic features can be provided to the nodes of the Health-RI ecosystem:
|
Health data | Health data is healthcare data + research data + all other data applicable to health (IoT data and citizen-generated data). |
Health data access body (HDAB) | A legal entity designated by a Member State of the European Union to manage access to health data for research purposes. The role of an HDAB is to facilitate access to health data while ensuring the protection of personal data and privacy rights. source: EHDS Regulation |
Healthcare data | Healthcare data is data that is used to support the care processes and/or is recorded during the care processes Healthcare data is a form of health data. |
Healthcare Organisation | A healthcare organization is a provider and/or recipient of data and services in the field of research and innovation. |
Healthcare Professional | A Healthcare Professional is a natural person who provides medical or health-related services to patients or clients, whose purpose is to promote the health of patients or clients, to prevent, treat and cure diseases, and to support them in maintaining or restoring their functional skills and quality of life. |
Health-RI ecosystem | The integrated health data infrastructure for research and innovation that complies with the Health-RI Architecture. The Integrated Health Data Infrastructure for Research and Innovation refers to the totality of all parties involved and elements in scientific research and how they interact with each other. This includes not only the technological resources and facilities, but also the social, cultural, economic and political environment in scientific research. Within the ecosystem, involved parties can exchange health data with each other in a secure way. |
Health-RI foundation node | The Health-RI Foundation Node is a Nationwide node, which initially provides for the data user role. In time, this node will also provide the data-holding role for parties who cannot, may and/or do not want to make their original health data available through existing nodes. |
Horizontal partitioning of data | With horizontally partitioned health data, each source contains data from different patients/ citizens/ participants (the rows) and each source contains the same characteristics (the columns)). |
IAA | IAA stands for Identification, Authentication and Authorization. These are steps in the access control process. See the articles Identification and Authentication Service and Authorization Service for more information. |
Identity repositories | Different target groups use the Health-RI ecosystem. These different target groups each have specific roles and attributes. The NEN standard is currently looking at identification and authentication of healthcare professionals, which also faces the challenge that not all different groups of healthcare professionals fall under the UZI identification register. The Health-RI Foundation advocates a national approach or standardization of identity registers, in which the design of roles and attributes is looked at across domains and internationally. This also makes it easier to combine socio-economic information from ODISSEI with health data from the Health-RI ecosystem for prevention purposes. The target groups that we distinguish for the Health-RI ecosystem are:
Identity registers fall under the Generic features |
Infrastructure | Infrastructure is the set of facilities (organization, process, information, application and technology) for processing, storing, securing, managing and transporting digital data. |
Innovator | The Innovator is a special researcher who produces reusable research results and algorithms for data users who want to have simplified access to health data, such as healthcare professionals, policy makers and citizens. Tasks that an innovator performs are
|
Linking service | Generic feature, both for healthcare itself (primary use) and for research & innovation (secondary use), to be able to combine data of a person over various sources (data linking) based on identifiers or keys. |
Local review committee data providing | The local review committee data providing
|
Local review committee research | The organization to which the researcher or the data is affiliated has a review committee that determines whether studies that affiliated researchers want to start can be assessed on ethical and legal grounds. |
Make data FAIR | This process involves assigning unique and persistent identifiers to the data, enriching metadata, and registering or indexing this data in searchable resources. |
Making data comply with FAIR principles | This refers to the process of assessing existing data against the FAIR principles and making any necessary adjustments to meet these principles. This may involve adding missing metadata, improving the discoverability of data, standardizing formats, and so on. The goal is to transform the data so that it fully meets the FAIR criteria. |
Mapping service | The mapping service is a generic feature, both for healthcare itself (primary use) and for research & innovation (secondary use), which ensures that data is consistently translated between the different target groups of data holders and data users. In case of change in coding and modeling of standards within the unit of language, it is implemented once here. |
MDR Notified Body | Generic feature, both for healthcare itself (primary use) and for research & innovation (secondary use), which grants Medical Device Registry (MDR) certification |
Medical Research Ethics Committee (MREC) | A Medical Research Ethics Committee (MREC) assesses medical research with test subjects under the Medical Research Involving Human Subjects Act (WMO). The purpose of this assessment is to safeguard the rights, safety and well-being of the participants. A study can only be conducted after an approved MREC as issued a positive recommendation. |
Metadata | Metadata is data about the data. It gives people and systems information about the context of the data. This context is needed to make the data easier to find, understand, and relevant. It also helps to get a better grip on the data. Metadata provides people and systems with information about the context of the data. That context is necessary to make the data more findable, understandable and relevant. It also helps to get a better grip on the data. Metadata can concern both datasets and data points. Metadata about data points describes the context about a specific data point, for example a blood pressure measurement or image recording. The term metadata can have different meanings in different processes. Something that is 'metadata' in the primary care process can be 'data' in secondary use, for example the time or the measuring device that is recorded during a blood pressure measurement in the primary care process. |
Multiple use | Use both within healthcare (so-called primary use) and outside it, such as research and innovation (so-called secondary use). The responsibility for making data suitable for multiple use is shared between the data holder and the data user (in the role of data holder). Alternate term: further use |
National contact point for secondary use of electronic health data | An organizational and technical gateway enabling the cross-border secondary use of electronic health data, under the responsibility of a Member State of the European Union. The NCP-2nd-use is also responsible for ensuring interoperability and compatibility and for providing guidance and support. source: EHDS Regulation |
National node | A National node provides the role of connecting and/or applying nationally oriented specific data sources from the Health-RI ecosystem. The Health-RI Foundation is currently exploring a national categorical hub specifically for oncological health data. |
Node | A node forms the access to the Health-RI Ecosystem of the role of Data holder and/or Data user. There are several types of nodes:
More Nodes |
Onboarding | The term onboarding is used in the context of metadata and involves registering the metadata of a dataset with a FAIR Data point. |
Ontology | Terminology file characterized by the presence of specified relationships between the concepts; There are not only synonym relationships and hierarchical relationships such as in a thesaurus, but also all types of relationships are broken down |
Open standards | Open standards are standards that are publicly available, developed and approved through a process based on collaboration and consensus with the community. Open standards facilitate interoperability and data exchange. |
Payment service | A service that can charge a user an amount depending on the services/data sets purchased |
Persistent dataplatform | A persistent data platform is a central and durable storage environment for data that must remain available and consistent for a long time. Research data is stored within this environment and this data is made available from this environment |
Primary use | Use of health data on behalf of direct patient treatment. |
Processing centrally | With Processing centrally, the data from different data holders is processed in one central place. The administrator of the environment is not necessarily affiliated with the same institution as the data user. In general, the processing environment will be a virtual environment that will be set up specifically for the analysis or the directly encompassing project and will no longer be accessible when the analysis or project is complete. In the environment, the data user is given the opportunity to analyze the data from various sources together (by downloads or remote mounts or a combination). The data user can also collaborate with colleagues in the environment. In the central processing environment, the data user can select and use processing tools, or add their own processing tools. Think of sectoral facilities, (commercial) cloud facilities (such as myDRE of anDREa, SURF research cloud, a SURF HPC cluster, or commercial platforms such as Amazon AWS, Google Cloud or Microsoft Azure provided that they meet security requirements). |
Processing data | Within the Health-RI ecosystem for research and innovation, there are two forms of “Processing data”:
|
Processing environment | A processing environment is an environment where research data can be processed. The digital processing environments exist with different characteristics. For example, you can have a local processing environment, a central or a distributed one. |
Processing facility | A processing facility is an appliciation component in the central analysis processing environment (e.g. a virtual machine or some software) |
Pseudonymization | Pseudonymization is the reversible encryption, replacement and/or deletion of (directly) identifiable personal data in a dataset to protect the privacy of the data subject |
Regional node | A regional hub provides the role of connecting the region, by performing roles on behalf of regional parties from economy of scale, as a shared service center. Currently, the Health-RI ecosystem has eight regional nodes, which perform the role of Data Holder for the benefit of their region and also different Data User roles per regional node. More Nodes | Regional hub |
Registry Holder | A Registry Holder is a holder of an authorative source. |
Requester federated analysis | Together with the applicant for federated analysis, the researcher can instruct the federated analysis executor(s) to perform a local analysis calculation at the data holder. |
Research data | Research data is data that serves as input for a study, which is generated during a study and/or that is the result of a study. Care data, if suitable for use for research, can serve as research data after conducting a target transformation. |
Researchdata request service | Generic feature, both for healthcare itself (primary use) and for research & innovation (secondary use), responsible for:
|
Researcher | Researchers are natural persons engaged in conducting scientific research in the field of health care. This can be, for example, research into new treatments or therapies for diseases, discovering the causes of diseases or improving the quality of care for patients. Their work includes designing research protocols, conducting experiments, and analyzing data. More Roles | Reseacher |
Research Organisation | A research organization is a legal entity, a provider and/or recipient of data and services in the field of research and innovation. The data and services are primarily used in the investigation process. In addition, it is the ambition of the Health-RI foundation that this data from the learning healthcare system will also be part of the healthcare itself. |
Research register | A register that records which studies exist and what condition the studies are in (such as PaNaMa) |
Review | Medical-scientific research using data or bodily materials that come from people is often tested, especially in the areas of ethics and privacy, before implementation. The testing of health research that is not subject to the WMO is not required by law. This is in contrast to medical-scientific research involving humans, which falls under the WMO (Medical-Scientific Research Involving Humans Act). This last category is mandatory tested by an METC (Medical-Ethical Review Committee). Which studies from the first category, non-WMO-required research (nWMO research), are tested currently differs between institutions. The composition of the committee and the assessment criteria also differ between institutions. To reduce these differences, VWS has supported a project in which field parties have developed an assessment framework for nWMO research. Health-RI has taken over and is coordinating the follow-up process, in which implementation steps will be taken. |
Review committee | A review committee (also known as a Data Access Committee or DAC) has the responsibility to review and approve or reject requests for access to data. Examples are:
|
Secondary use | Use of healthdata outside the primary healthcare process, for example for research, innovation, quality |
Secure processing environment | A secure processing environment, which meets the requirements of the Health-RI ecosystem and provides the following services:
In general, the secure processing environment will be a virtual environment that is specially set up for the analysis or the directly encompassing project, and will no longer be accessible when the analysis or project is complete. In this environment, the data user is given the opportunity to analyze data from different sources together (e.g. via downloads or remote mounts or a combination). The Data User can also collaborate with colleagues in the environment for this purpose. In the secure processing environment, the data user can select and use processing tools, or add their own processing tools. A secure processing environment includes sectoral facilities, (commercial) cloud facilities (such as the myDRE from anDREa, SURF research cloud, a SURF HPC cluster), or commercial platforms such as Amazon AWS, Google Cloud or Microsoft Azure, provided they comply with privacy requirements. and safety requirements. |
Shared service center | A Shared Service Center (SSC) is a central department within an organization that provides specialized services and support to various internal departments or business units. The purpose of an SSC is to promote efficiency, save costs and improve the quality of service through centralized and standardized processes. |
Standardise | Choose by design for generic services and standards as components in stead of specific solutions to enable a node to join the Health-RI Ecosystem |
Storyline | A Storyline is a short, simple description of an application of the Health-RI ecosystem for research and innovation of an end user in outline. A storyline is not a functional description, but makes it clear what an end user wants, or needs and also why that is necessary. |
System Administrator Health-RI | The System Administrator Health-RI is a Health-RI ecosystem specific Generic feature that provides services such as:
|
Target transformation | The process of transforming healthdata from its original purpose, which is for example to support patient care, to other purposes, such as research or analysis. Healthcare data for example often contains personally identifiable information about patients and should be handled with care. To ensure patient privacy and confidentiality, healthcare data should be anonymized or pseudonymized before being used for purposes other than patient care. On the other hand, research data that is for example gathered using an EDC tool may need target transformation to be suitable for secondary use. |
Technology Transfer Office | Technology Transfer Offices (TTOs) within a university or university medical Center manage its intellectual property rights (IP) and the transfer of knowledge and technology to public and private partners. |
Terms of use service | The terms of use service is a service in which a citizen records preferences that describe who is allowed to do what with the health data of that citizen. This is a generic service used both for healthcare itself (primary use) and for research & innovation (secondary use). |
Ticketing booth | A place where participants in the Health-RI ecosystem can ask their questions |
Tool provider | A company that specializes in providing tools/software used to handle health data. |
Unity of language | Unity of language does not prescribe that every target group should use the same way of coding and modeling. Loss of quality while mapping the various encodings and modelling used, should be avoided as far as possible by harmonizing or, if necessary, standardizing the different encodings and modelling methods. source: RIVM ‘grondlegger van eenheid van taal’. |
Vertical partitioning of data | In the case of vertically partitioned data, sources contain data with different characteristics (the columns) of the same patients (the rows). |