Viewpoint
- Amelie Gyrard1, PhD ;
- Somayeh Abedian2, PhD ;
- Philip Gribbon3,4, PhD ;
- George Manias5, PhD ;
- Rick van Nuland6, PhD ;
- Kurt Zatloukal7, MD ;
- Irina Emilia Nicolae8, PhD ;
- Gabriel Danciu8, PhD ;
- Septimiu Nechifor8, PhD ;
- Luis Marti-Bonmati9, Dr med ;
- Pedro Mallol9, MSc ;
- Stefano Dalmiani10, MSc ;
- Serge Autexier11, PhD ;
- Mario Jendrossek12, MSc ;
- Ioannis Avramidis13, MSc ;
- Eva Garcia Alvarez14, PhD ;
- Petr Holub14, PhD ;
- Ignacio Blanquer15, Prof Dr ;
- Anna Boden16, Dr med ;
- Rada Hussein2, PhD
1Trialog, Paris, France
2Ludwig Boltzmann Institute for Digital Health and Prevention, Salzburg, Austria
3Discovery Research - ScreeningPort, Fraunhofer Institute for Translational Medicine and Pharmacology, Hamburg, Germany
4Fraunhofer Cluster of Excellence for Immune-Mediated Diseases, Frankfurt, Germany
5Department of Digital Systems, University of Piraeus, Piraeus, Greece
6Lygature, Utrecht, The Netherlands
7Diagnostic and Research Center for Molecular Biomedicine, Diagnostic and Research Institute of Pathology, Medical University of Graz, Graz, Austria
8Siemens Foundational Technologies, Brasov, Romania
9La Fe University Hospital Valencia, Valencia, Spain
10Monasterio Research Hospitals, Pisa, Italy
11Deutsches Forschungszentrum für Künstliche Intelligenz GmbH, Bremen, Germany
12Health Data Hub, Paris, France
13Ubitech, Athens, Greece
14Biobanking and Biomolecular Resources Research Infrastructure – European Research Infrastructure Consortium, Graz, Austria
15Universitat Politècnica de València, València, Spain
16Department of Clinical Pathology, Centre for Medical Image Science and Visualization, Linköping University, Linköping, Sweden
Corresponding Author:
Amelie Gyrard, PhD
Trialog
25 rue du Général Foy
Paris, 75008
France
Phone: 33 033 1 44 70 61
Email: amelie.gyrard@trialog.com
Abstract
The adoption of the European Health Data Space (EHDS) regulation has made integrating health data critical for both primary and secondary applications. Primary use cases include patient diagnosis, prognosis, and treatment, while secondary applications support research, innovation, and regulatory decision-making. Additionally, leveraging large datasets improves training quality for artificial intelligence (AI) models, particularly in cancer prevention, prediction, and treatment personalization. The European Union (EU) has recently funded multiple projects under Europe’s Beating Cancer Plan. However, these projects face challenges related to fragmentation and the lack of standardization in metadata, data storage, access, and processing. This paper examines interoperability standards used in six EU-funded cancer-related projects: IDERHA (Integration of Heterogeneous Data and Evidence Towards Regulatory and Health Technology Assessments Acceptance), EUCAIM (European Cancer Imaging Initiative), ASCAPE (Artificial Intelligence Supporting Cancer Patients Across Europe), iHelp, BigPicture, and the HealthData@EU pilot. These initiatives aim to enhance the analysis of heterogeneous health data while aligning with EHDS implementation, specifically for the EHDS for the secondary use of data (EHDS2). Between October 2023 and July 2024, we organized meetings and workshops among these projects to assess how they adopt health standards and apply Internet of Things (IoT) semantic interoperability. The discussions focused on interoperability standards for health data, knowledge graphs, the data quality framework, patient-generated health data, AI reasoning, federated approaches, security, and privacy. Based on our findings, we developed a template for designing the EHDS2 interoperability framework in alignment with the new European Interoperability Framework (EIF) and EHDS governance standards. This template maps EHDS2-recommended standards to the EIF model and principles, linking the proposed EHDS2 data quality framework to relevant International Organization for Standardization (ISO) standards. Using this template, we analyzed and compared how the recommended EHDS2 standards were implemented across the studied projects. During workshops, project teams shared insights on overcoming interoperability challenges and their innovative approaches to bridging gaps in standardization. With support from HSbooster.eu, we facilitated collaboration among these projects to exchange knowledge on standards, legal implementation, project sustainability, and harmonization with EHDS2. The findings from this work, including the created template and lessons learned, will be compiled into an interactive toolkit for the EHDS2 interoperability framework. This toolkit will help existing and future projects align with EHDS2 technical and legal requirements, serving as a foundation for a common EHDS2 interoperability framework. Additionally, standardization efforts include participation in the development of ISO/IEC 21823-3:2021—Semantic Interoperability for IoT Systems. Since no ISO standard currently exists for digital pathology and AI-based image analysis for medical diagnostics, the BigPicture project is contributing to ISO/PWI 24051-2, which focuses on digital pathology and AI-based, whole-slide image analysis. Integrating these efforts with ongoing ISO initiatives can enhance global standardization and facilitate widespread adoption across health care systems.
J Med Internet Res 2025;27:e66273doi:10.2196/66273
Keywords
Introduction
Cancer is one of the main causes of death in Europe and worldwide, after cardiovascular diseases. According to the World Health Organization (WHO), cancer is the second leading cause of death and morbidity in Europe, with more than 3.7 million new cases and 1.9 million deaths each year [Cancer. World Health Organization. 2025. URL: https://www.who.int/europe/health-topics/cancer [accessed 2025-02-18] 1]. In response to the urgent need to renew European political commitment to tackle cancer, Europe’s Beating Cancer Plan (EBCP; 2021-2027) was launched and structured around 4 key action areas: prevention, early detection, diagnosis and treatment, and the quality of life of patients with cancer and cancer survivors [Communication from the Commission to the European Parliament and the Council. BRILL. 2012. URL: https://primary sources.brillonline.com/browse/human-rights-documents-online/communication-from-the-commission-to-the-european- parliament-and-the-council;hrdhrd46790058 [accessed 2023-02-01] 2]. Since 2021, the European Commission (EC) has supported collaborative projects focused on cancer diagnostics and treatment using high-performance computing and artificial intelligence (AI). To maximize the potential of data and digitalization, the EBCP also addressed the interactions and alignment of cancer data projects and initiatives with the European Health Data Space (EHDS). In autumn 2023, the EC organized 3 online workshops on the reuse of health data resources in the field of cancer research and recently published the results of these workshops [Report on workshops - landscaping data driven projects and initiatives in the cancer field. European Commission. 2024. URL: https://digital-strategy.ec.europa.eu/en/library/report-workshops-landscaping-data-driven-projects-and-initiatives -cancer-field [accessed 2024-05-23] 3], where “fragmentation and the lack of standardization in metadata, data storage, access, and processing” were identified as key challenges facing data-driven cancer projects. The nascent infrastructure for the application of AI in medical imaging (European Cancer Imaging Initiative [EUCAIM]) reported on the experiences of 5 projects developing big data infrastructures that will enable European, ethical, General Data Protection Regulation (GDPR)–compliant, quality-controlled, and cancer-related medical imaging platforms, where both large-scale data and AI algorithms will coexist [Kondylakis H, Kalokyri V, Sfakianakis S, Marias K, Tsiknakis M, Jimenez-Pastor A, et al. Data infrastructures for AI in medical imaging: a report on the experiences of five EU projects. Eur Radiol Exp. 2023;7(1):20. [FREE Full text] [CrossRef] [Medline]4]. These projects include the following:
- Chameleon: a project focused on developing AI algorithms for cancer diagnosis and prognosis.
- EuCanImage: a project aimed at creating a large-scale cancer image database.
- ProCAncer-I: a project focused on developing AI-based tools for personalized cancer treatment.
- Incisive: a project contributing a significant amount of cancer image data to EUCAIM.
- Primage: a project focused on developing AI-based image analysis techniques for cancer diagnosis.
These projects along with the RadioVal project established the AI for Health Imaging (AI4HI) network to develop cancer imaging data repositories and AI solutions based on medical imaging to improve clinical practice.
Table 1 summarizes the list of projects that participated in the EC workshop entitled “Landscaping data-driven projects and initiatives in the cancer field–rationale and directions for better collaboration and integration” [Report on workshops - landscaping data driven projects and initiatives in the cancer field. European Commission. 2024. URL: https://digital-strategy.ec.europa.eu/en/library/report-workshops-landscaping-data-driven-projects-and-initiatives -cancer-field [accessed 2024-05-23]
3], as well as the established project-level synergies and collaborations among ongoing projects.
The workshop also addressed the current challenges facing data-driven cancer projects [Report on workshops - landscaping data driven projects and initiatives in the cancer field. European Commission. 2024. URL: https://digital-strategy.ec.europa.eu/en/library/report-workshops-landscaping-data-driven-projects-and-initiatives -cancer-field [accessed 2024-05-23]
3], gaps existing in existing standards, and recommendations for future semantic interoperability (given in Table 2).
Collaboration scope | Synergy projects |
Data representation and interoperability | HealthData@EU pilot and CanSERV |
Infrastructure and services for benchmarking | EOSC4Cancer, EUCAIMa, and TEF-Health |
Federated data infrastructure and data structure | EUCAIM, EOSC4Cancer, and GDI |
Secure processing environment | SOLACE, EUCAIM, CanScreen-ECIS, IDERHAb, and Optima |
aEUCAIM: European Cancer Imaging Initiative.
bIDERHA: Integration of Heterogeneous Data and Evidence Towards Regulatory and Health Technology Assessments Acceptance.
Challenges [Report on workshops - landscaping data driven projects and initiatives in the cancer field. European Commission. 2024. URL: https://digital-strategy.ec.europa.eu/en/library/report-workshops-landscaping-data-driven-projects-and-initiatives -cancer-field [accessed 2024-05-23] 3] | Gaps in existing standards | Recommendations |
Fragmentation and lack of standardization in metadata, data storage, access, and processing. | The DCAT-APa health extension is still under development by the HealthData@EU pilot. | TEHDASb JAc recommendations to enhance interoperability within HealthData@EU—a framework for semantic, technical, and organizational interoperability. |
Lack of access to diverse and high-quality datasets. | Data quality framework for primary care data sources is not sufficiently addressed in the EHDSd framework. | Data quality frameworks provided by TEHDAS, EMAe, and QUANTUM projects. |
Evolving legal landscape | Relevant standards to the AIf Act are under development. | Participating in developing or extending the relevant standards. |
aDCAT-AP: DCAT Application Profile for Data Portals in Europe.
bTEHDAS: Towards the European Health Data Space.
cJA: joint action.
dEHDS: European Health Data Space.
eEMA: European Medicines Agency.
fAI: artificial intelligence.
HSbooster Health Project Cluster in Cancer
In this work, we focus on the interoperability challenges and existing gaps in health care standards building on the challenges and synergies highlighted in the EC workshop [Report on workshops - landscaping data driven projects and initiatives in the cancer field. European Commission. 2024. URL: https://digital-strategy.ec.europa.eu/en/library/report-workshops-landscaping-data-driven-projects-and-initiatives -cancer-field [accessed 2024-05-23]
3]. Through the EC European Standardization Booster (HSbooster.eu) initiative, we create synergy among six cancer data-driven projects (ie, IDERHA [Integration of Heterogeneous Data and Evidence towards Regulatory and Health Technology Assessments Acceptance], EUCAIM, ASCAPE [Artificial Intelligence Supporting Cancer Patients Across Europe], iHelp, BigPicture, and the HealthData@EU pilot project) by using health standards. These 6 innovative projects aim to transform digital health in oncology by leveraging advanced technologies and collaborative frameworks for enhancing cancer diagnosis, treatment, and research, improving patient outcomes, and accelerating scientific advancements. We initially created a template for the EHDS for the secondary use of data (EHDS2) interoperability framework based on the new European Interoperability Framework (EIF). The recommended standards and governance model from the joint action (JA) Towards the European Health Data Space (TEHDAS) [Second joint action towards the European Health Data Space – TEHDAS2. TEHDAS2. 2024. URL: https://tehdas.eu/project/ [accessed 2025-02-18]
5] for the secondary use of data were then used to harmonize the standards in the template (given in Textbox 1).
Starting October 2023, we conducted several meetings and workshops among the 6 projects to elucidate how they adopt health standards and Internet of Things (IoT) semantic interoperability. This included examining interoperability standards for health data, knowledge graphs–related technologies, the Smart Applications Reference Ontology (SAREF), the data quality framework (DQF), patient-generated health data (PGHD), AI reasoning, federated approaches, security, and privacy.
As per the TEHDAS recommendations, we compared the health-standardized models, ontologies, and terminologies used in these projects, including, Health Level 7 (HL7) Fast Healthcare Interoperability Resources (FHIR), Open Health Data Science and Informatics (OHDSI), Observational Medical Outcomes Partnership (OMOP)–Common Data Model (CDM), Digital Imaging and Communications in Medicine (DICOM), International Organization for Standardization (ISO) Technical Committee (TC) 215, and World Wide Web Consortium (W3C) Data Catalog Vocabulary (DCAT).
As EHDS is still evolving and its implementation is still under development, this study aims to examine how the 6 projects implement the recommended EHDS2 standards. Consequently, we introduce a new template to support the EHDS2 interoperability framework and highlight the results of comparing data standards across projects. By summarizing the lessons learned, we provide recommendations for future directions in EHDS2 implementation.
European Health Data Space
- The European Health Data Space (EHDS) [European Health Data Space. European Commission. 2024. URL: https://health.ec.europa.eu/ehealth-digital-health-and-care/european-health-data-space_en [accessed 2025-02-18] 6] aims to harmonize health data usage across Europe [Palojoki S, Lehtonen L, Vuokko R. Semantic interoperability of electronic health records: systematic review of alternative approaches for enhancing patient information availability. JMIR Med Inform. 2024;12:e53535. [FREE Full text] [CrossRef] [Medline]7]. It includes comprehensive rules, standards, practices, infrastructures, and a robust governance framework to improve health care delivery, drive research and innovation, and inform policymaking.
- A central focus of the EHDS is empowering patients by providing them greater digital access and control over their health data, fostering a more transparent and efficient health care system. In addition, the EHDS seeks to standardize health data across member states, ensuring interoperability, enhancing clinical outcomes, and accelerating medical research and innovation through a unified dataset.
- The governance framework ensures data privacy and security, addresses ethical concerns, and fosters stakeholder trust. The EHDS also aims to streamline regulatory processes and support cross-border health care initiatives by aligning national and regional policies.
- Therefore, the EHDS will significantly impact Europe’s digital health landscape by promoting a more connected, efficient, and patient-centered health care ecosystem.
Joint Action Toward the European Health Data Space
- The Toward the European Health Data Space (TEHDAS) project [Second joint action towards the European Health Data Space – TEHDAS2. TEHDAS2. 2024. URL: https://tehdas.eu/project/ [accessed 2025-02-18] 5] is a joint action that developed European principles for the secondary use of health data, involving 25 countries. It advocates for using existing International Organization for Standardization (ISO) standards to ensure consistency and interoperability. TEHDAS focuses on data interoperability, identifying standards for data discovery and common data models. A synthesis table details essential standards including DCAT Application Profile for Data Portals in Europe (DCAT-AP) [DCAT application profile for data portals in Europe. European Comission. 2015. URL: https://interoperable-europe.ec.euro pa.eu/collection/semic-support-centre/solution/dcat-application-profile-data-portals-europe [accessed 2015-10-26] 8], INSPIRE, FairSharing, Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT), and Health Level 7 (HL7) Fast Healthcare Interoperability Resources (FHIR).
- The project addresses syntactic interoperability, providing guidelines on data structure and format for seamless sharing across systems. In addition, TEHDAS emphasizes data quality, proposing frameworks to regulate and enhance the reliability of health data for secondary uses, such as research and policymaking. Overall, TEHDAS’s work supports a standardized and interoperable EHDS, leading to improved health care, research, and innovation in Europe.
European Interoperability Framework
- The new European Interoperability Framework (EIF) [The New European Interoperability Framework. European Commission. URL: https://ec.europa.eu/isa2/eif_en/ [accessed 2017-03-23] 9] adopted on March 23, 2017, fosters electronic communication and information exchange among European public administrations by providing common models, principles, and recommendations. The EIF delineates four layers of interoperability challenges: legal, organizational, semantic, and technical. This paper concentrates on the semantic interoperability challenge, encompassing both semantic and syntactic aspects. Semantic interoperability ensures that the meaning of exchanged data is preserved and understood across different systems. This can be achieved through the use of shared vocabularies and schemas. On the other hand, syntactic interoperability involves defining the grammar and format for data exchange.
- The EIF provides guidance for European public administrations to improve interoperability, ensuring smooth data exchange and efficient digital service delivery. Conversely, ISO 23903 concentrates on semantic interoperability within Information and Communication Technology (ICT) systems, aiming to unify the interpretation of information across various platforms.
- Aligning the EIF with ISO 23903 synchronizes European interoperability principles with semantic standards, fostering easier communication and cooperation among public administrations. This alignment ensures both technical harmony and uniformity in the meaning and semantics of exchanged data, thereby enhancing the efficiency and efficacy of digital service provision for citizens and businesses across Europe.
Template for the EHDS2 Interoperability Framework Based on the EIF
Figure 1 shows the process of creating the template for the EHDS2 interoperability framework.

Exploring How EHDS Adopts the New EIF Architecture and Principles
Recent studies showed the importance of adopting the EIF in establishing the EHDS interoperability framework [Palojoki S, Lehtonen L, Vuokko R. Semantic interoperability of electronic health records: systematic review of alternative approaches for enhancing patient information availability. JMIR Med Inform. 2024;12:e53535. [FREE Full text] [CrossRef] [Medline]7,Kouroubali A, Katehakis DG. Policy and strategy for interoperability of digital health in Europe. Stud Health Technol Inform. 2022;290:897-901. [CrossRef] [Medline]10,Stellmach C, Muzoora MR, Thun S. Digitalization of health data: interoperability of the proposed European health data space. Stud Health Technol Inform. 2022;298:132-136. [CrossRef] [Medline]11] (as shown in Textbox 1). In addition, several studies have investigated the interoperability requirements for heterogeneous health information systems, as well as the associated health care standards [Torab-Miandoab A, Samad-Soltani T, Jodati A, Rezaei-Hachesu P. Interoperability of heterogeneous health information systems: a systematic literature review. BMC Med Inform Decis Mak. 2023;23(1):18. [FREE Full text] [CrossRef] [Medline]12,Moreno-Conde J, Salas-Fernandez S, Moreno-Conde A. MedicalForms: integrated management of semantics for electronic health record systems and research platforms. Appl Sci. 2022;12(9):4322. [CrossRef]13].
Aligning the TEHDAS Results to the New EIF
The TEHDAS JA involves 25 European countries in developing the principles that will shape EHDS2 by providing guidance and recommendations on interoperability, data quality, and standards (as shown in panel 2 in Textbox 1). In 2023, the TEHDAS JA published a report titled “Options for governance models for the European Health Data,” which discusses EHDS governance using the EIF [Options for governance models for the European Health Data Space. TEHDAS. 2023. URL: https://tehdas.eu/app/uploads/2023/01/tehdas-options-for-governance-models-for-the-european-health-data-space.pdf [accessed 2023-01-17]
14]. In addition, they assessed 19 standards that provide a layer of semantic interoperability, supporting the cataloging of data sources and the exchange of data between different nodes [Recommendations to enhance interoperability within HealthData@EU- a framework for semantic, technical and organisational interoperability. TEHDAS. 2022. URL: https://tehdas.eu/tehdas1/app/uploads/2023/10/tehdas-recommendations-to-enhance -interoperability.pdf [accessed 2022-12-21]
15].
Mapping the TEHDAS Recommended Standards and Governance Model to the New EIF
Following the recommendation of the EC workshops on landscaping data-driven projects and initiatives in cancer [Report on workshops - landscaping data driven projects and initiatives in the cancer field. European Commission. 2024. URL: https://digital-strategy.ec.europa.eu/en/library/report-workshops-landscaping-data-driven-projects-and-initiatives -cancer-field [accessed 2024-05-23]
3], we adopted the TEHDAS results on recommended standards for EHDS2 interoperability, as well as the principles of data quality frameworks. We then mapped these results to the new EIF interoperability framework and linked the data quality framework to the corresponding ISO standards. Figure 2 shows how the TEHDAS results are mapped to the new EIF interoperability framework. Notably, TEHDAS framed the recommended EHDS2 interoperability into 3 categories of standardization:
- Data discoverability (DCAT-AP)
- Enabling semantics interoperability (OMOP-CDM)
- Health data exchange (DICOM for imaging data and FHIR for health records).
The template shows the importance of incorporating and harmonizing the underpinning standards to comply with new regulations, such as the AI Act [The role of harmonised standards as tools for AI act compliance. DLA Piper. URL: https://www.dlapiper.com/es-pr/insights/publications/2024/01/the-role-of-harmonised-standards-as-tools-for-ai-act-compliance [accessed 2025-02-18] 16]. As a result, integrating advanced AI methodologies within the proposed framework enhances TEHDAS principles and advances beyond EIF by embedding robust data quality frameworks, enabling federated learning for secure and scalable data sharing, and incorporating privacy-preserving mechanisms that comply with GDPR. This framework delivers actionable templates that bridge theoretical interoperability principles with real-world AI applications, such as cancer diagnostics and personalized treatment. By aligning technical standards with semantic requirements and supporting adaptability to evolving regulations, such as the European AI Act, the framework provides a scalable, future-ready solution for health care interoperability and AI-driven innovation.
The EIF does not classify standards according to interoperability layers (technical, semantic, legal, and organizational). However, we need to consider the relevant standards for data privacy and quality assessment [Kouroubali A, Katehakis DG. Policy and strategy for interoperability of digital health in Europe. Stud Health Technol Inform. 2022;290:897-901. [CrossRef] [Medline]10].
Furthermore, the work is a foundation for creating an interactive tool for the EHDS2 Interoperability Framework, which has been submitted to JMIR as part 2 (Towards EHDS2 interoperability framework: An interactive EIF-based standards compliance toolkit for AI-driven projects).

Linking DQF to the Relevant ISO Standards
In May 2024, the Big Data Value Association (BDVA) published a study entitled “Elevating Data Quality A Paradigm Shift for Data Spaces and AI Needs” to explore the relationships between data quality and data spaces with respect to the AI Act [Elevating data quality a paradigm shift for data spaces and AI needs. Big Data Value Association (BDVA). URL: https://bdva.eu/news/new-paper-?-elevating-data-quality-a-paradigm-shift-for-data-spaces-and-ai-needs/ [accessed 2024-05-29] 17]. The study proposed that the data quality and utility label for EHDS should comply with data documentation, technical quality, data quality management processes, coverage, access and provision, and data enrichment procedures.
From the EHDS2 perspective, the TEHDAS JA provided a generic DQF [European Health Data Space Data Quality Framework. TEHDAS. 2022. URL: https://tehdas.eu/tehdas1/app/uploads/2022/05/tehdas-european-health-data-space-data-quality-framework-2022-05-18.pdf [accessed 2022-05-18]
18], which includes both technical quality elements and six utility dimensions: relevance, accuracy and reliability, coherence, coverage, completeness, and timelines. This was followed by the publication of associated recommendations affecting data quality and utility implementation in HealthData@EU [Recommendations on a Data Quality Framework for the European Health Data Space for secondary use. TEHDAS. URL: https://tehdas.eu/tehdas1/app/uploads/2023/09/tehdas-recommendations-on-a-data-quality-framework.pdf [accessed 2023-09-26]
19]. Based on these deliverables, the European Medicine Agency (EMA) published its own DQF for EU medicines regulation [Data Quality Framework for EU medicines regulation. European Medicines Agency (EMA). 2023. URL: https://www.ema.europa.eu/en/documents/regulatory-procedural-guideline/data-quality-framework-eu-medicines-regulation_en.pdf [accessed 2023-10-30]
20]. This publication provides an analysis of the data quality actions and metrics, as well as a maturity model, to guide the evolution of automation to support data-driven regulatory decision-making (as shown in Figure 3) [Data Quality Framework for EU medicines regulation. European Medicines Agency (EMA). 2023. URL: https://www.ema.europa.eu/en/documents/regulatory-procedural-guideline/data-quality-framework-eu-medicines-regulation_en.pdf [accessed 2023-10-30]
20].
Ensuring the accuracy, completeness, consistency, and reliability of cancer research data is of utmost importance, and adherence to data quality standards plays a crucial role in achieving this goal. These standards not only help maintain data integrity but also promote data interoperability through the use of standardized data models and vocabularies. This enables seamless data exchange and integration across different projects and platforms. In addition, adhering to these standards supports informed decision-making by providing high-quality data for clinical decisions, research insights, and policymaking related to cancer treatment and prevention. Compliance with international legal and regulatory requirements is also ensured, upholding proper data governance and ethical data usage. Finally, establishing a unified framework for data quality encourages collaboration among various stakeholders within the cancer research community. We compare data quality standards in Table S4 in Detailed results from analysis of six involved projects. Comparison of the usage of standards: artificial intelligence data quality.Multimedia Appendix 1

Landscape of the Involved Projects and Used Standards
In this study, we used the EHDS2 interoperability framework template to compare 6 projects with cancer use cases where AI is applied (as shown in panel 4 in Textbox 2;
Table 3). The selected projects vary in scope, cancer domain, categories of health data, scale of infrastructure, AI implementation approach, and time spans [Gyrard A, Gribbon P, Hussein R, Abedian S, Bonmati LM, Cabornero GL, et al. Synergies among health data projects with cancer use cases based on health standards. Stud Health Technol Inform. 2024;316:1292-1296. [CrossRef] [Medline]21].
We focused on semantic interoperability standards (as shown in Textbox 3), recommended by the TEHDAS JA and related ISO standards. The analysis explored how health standards support health ontologies, for example, how HL7 FHIR supports the Findability, Accessibility, Interoperability, and Reusability (FAIR) principles through the FHIR for FAIR implementation guide [FHIR for FAIR - FHIR Implementation Guide. HL7 FHIR. URL: http://build.fhir.org/ig/HL7/fhir-for-fair/terminology.html [accessed 2024-01-05]
28].
We explore 6 innovative projects that are transforming digital health in oncology. By leveraging advanced technologies and collaborative frameworks, these projects aim to enhance cancer diagnosis, treatment, and research, improving patient outcomes, and accelerating scientific advancements:
- IDERHA (Integration of Heterogeneous Data and Evidence Towards Regulatory and Health Technology Assessments Acceptance) [IDERHA. URL: https://www.iderha.org/integrating-health-data [accessed 2025-02-18] 22] (2023-2028): IDERHA advances digital health in the context of lung cancer. It seeks to enhance health care delivery and improve patient outcomes by integrating various health data sources to support effective clinical decision-making and the development of improved treatments.
- BigPicture [A central repository of digital pathology slides to boost the development of artificial intelligence. BigPicture. URL: https://bigpicture.eu/ [accessed 2025-02-18] 23] (2021-2027): BigPicture aims to revolutionize pathology practice by establishing a repository of high-quality, annotated images, and developing artificial intelligence (AI) tools to improve diagnostic accuracy and efficiency. Data are collected from all types of tissues, including 1 million clinical samples and 2 million nonclinical samples from toxicology studies.
- EUCAIM (European Cancer Imaging Initiative) [Cancer Image Europe. URL: https://cancerimage.eu/ [accessed 2023-09-29] 24] (2023-2026): EUCAIM provides a federated infrastructure for sharing and analyzing cancer images. This supports AI research and clinical practices to advance cancer diagnosis and treatment.
- iHelp [iHelp. URL: https://ihelp-project.eu/ [accessed 2025-02-18] 25] (2021-2024): The iHelp project is dedicated to developing an intelligent, AI-driven health monitoring platform for early disease detection and management. It offers personalized health care solutions that enhance patient engagement and outcomes, with a specific focus on pancreatic cancer.
- ASCAPE (Artificial Intelligence Supporting Cancer Patients Across Europe) [ASCAPE. URL: https://www.ascape-project.eu/ [accessed 2025-02-18] 26] (2020-2023): ASCAPE leverages AI to enhance the quality of life for patients with cancer. By using advanced analytics on health data, the project develops predictive models and personalized treatment plans to support patient care and survivorship.
- HealthData@EU Pilot [EHDS2 Pilot. URL: https://ehds2pilot.eu/ [accessed 2025-02-18] 27] (2022-2024): This project aims to establish a technical infrastructure to enable cross-border health data access and use within the European Union, empowering the secondary use of health data more broadly. This initiative supports the creation of a cohesive digital health ecosystem, fostering innovation and improving health care services across Europe.
Project | Duration | Cancer domain | AIa or MLb | Comments |
IDERHAc | 2023-2028 | Lung cancer | Federated machine learning | Under implementation |
BigPicture | 2021-2027 | Pan-cancer whole slide images | Central repository for digital pathology and platform for AI development | Ongoing |
EUCAIMd | 2023-2026 | Pan-cancer images | Federated Research Infrastructure | Ongoing |
iHelp | 2021-2024 | Pancreatic cancer | Explainable AI, deep neural networks, predictive algorithms, ML techniques, and federated queries on distributed infrastructures | Ended on June 30, 2024 |
ASCAPEe | 2020-2023 | Breast and Prostate cancer | Explainable AI, federated deep learning, and ML on homomorphically encrypted data | Finished |
HealthData@EU pilot | 2022-2024 | Colorectal cancer and other nonrelated cancer use cases | Federated query in noncancer use case: machine learning for analysis of care pathways | Pilot for EHDS2 |
aAI: artificial intelligence.
bML: machine learning.
cIDERHA: Integration of Heterogeneous Data and Evidence Towards Regulatory and Health Technology Assessments Acceptance.
dEUCAIM: European Cancer Imaging Initiative.
eASCAPE: Artificial Intelligence Supporting Cancer Patients Across Europe.
Our attention is drawn to the critical role that standards play in examining semantic interoperability within the digital health realm. These standards form the foundation for seamless data exchange, ensuring consistency and coherence across various health care platforms. The main standards are as follows:
- Health Level 7 (HL7) Fast Healthcare Interoperability Resources (FHIR) [HL7 FHIR. URL: https://fhir.org/ [accessed 2025-02-18] 29]: Positioned as a cornerstone in health care interoperability, FHIR stands out for its efficiency, adaptable architecture, and robust data exchange mechanisms, which facilitate smooth communication across disparate systems.
- DICOM (Digital Imaging and Communications in Medicine) [DICOM. URL: https://www.dicomstandard.org [accessed 2025-02-18] 30]: DICOM is widely adopted in clinical and research settings, underscoring its effectiveness in enabling image exchange.
- Common Data Model (CDM)–OMOP (Observational Medical Outcomes Partnership) [Observational Health Data Sciences and Informatics (OHDSI). 2024. URL: https://www.ohdsi.org/ [accessed 2021-01-06] 31]: While not strictly a standard, CDM-OMOP allows for data harmonization and analysis. Its synergy with established standards enhances overall interoperability.
- International Organization for Standardization (ISO) [International Organization for Standardization (ISO). URL: https://www.iso.org/home.html [accessed 2025-01-22]
32]:
- ISO TC 215 and CEN TC 251 Health Informatics: These cover a range of ISO standards, such as 13606 (5 parts), 27269, and 29585, alongside the Health Informatics Service Architecture (HISA)–compliant data models.
- ISO 13606: This focuses on exchanging information on electronic health records between different health care information systems. It has a comprehensive structure for organizing clinical data, including patient demographics, medical history, diagnoses, treatments, and other relevant information.
- ISO/AWI TR 24305: This guideline, derived from ISO standards 13940 and 13606, offers a structured HL7 FHIR implementation framework. It strengthens interoperability infrastructure, aligns with ISO norms, and promotes uniformity and adherence to best practices.
- ISO 23903: Titled “Health Informatics - Interoperability and Integration Reference Architecture (IIRA),” this standard addresses the integration and interoperability challenges in health information systems and services, ensuring they can communicate effectively and share data seamlessly.
- ISO 13972: Titled “Health Informatics - Detailed Clinical Models (DCM),” this standard focuses on the standardization and interoperability of clinical information models. DCMs are specifications that represent the clinical concepts and data elements used in electronic health records (EHRs) and health information systems.
- Internet of Things and eHealth coding standards
- ETSI SmartM2M Smart Applications Reference Ontology (SAREF) [SAREF4EHAW: an extension of SAREF for eHealth Ageing Well domain. ETSI. URL: https://saref.etsi.org/saref4ehaw/v1.1.1/ [accessed 2020-02-01] 33]: Tailored for the eHealth and healthy aging domains, representing a specialization within standardization. Its integration expands interoperability across different targeted demographic groups.
- Systematized Nomenclature of Medicine (SNOMED) [SNOMED International. URL: https://www.snomed.org [accessed 2024-12-16] 34]: SNOMED is a comprehensive clinical terminology system that captures, encodes, and shares EHRs and other health care data. It provides a standardized vocabulary for describing clinical concepts, enabling interoperability and semantic consistency across health care systems. SNOMED facilitates accurate and meaningful exchange of clinical information, supporting clinical decision-making, research, and public health reporting.
- Logical Observation Identifiers Names and Codes (LOINC) [LOINC. URL: https://loinc.org/ [accessed 2025-02-18] 35]: LOINC is a standardized system for identifying and exchanging clinical laboratory test results and other clinical observations. It provides a universal set of codes and terms for describing laboratory tests, measurements, and observations, ensuring interoperability and semantic consistency in healthcare data exchange. LOINC enables seamless integration of laboratory data into EHRs and other health information systems.
- International Classification of Diseases (ICD) [International Statistical Classification of Diseases and Related Health Problems (ICD). World Health Organization. URL: https://www.who.int/standards/classifications/classification-of-diseases [accessed 2025-02-18] 36]: It is promoted by the World Health Organization (WHO). ICD codes are used to define diagnoses for clinical treatment, medical billing, and statistics collections. It contains codes for diseases, signs and symptoms, abnormal findings, complaints, social circumstances, and external causes of injury or diseases. These codes help physicians in determining which conditions are relevant for clinical decisions, injuries, and causes of death. However, ICD has some limitations (eg, diabetes has more than 2 dozen different codes). Patient data recorded with ICD can be used for administrative functions, epidemiologic studies, research subject recruitment, interventional protocols, and clinical decision support systems.
The other standards regarding data quality, security, artificial intelligence (AI), and wearables are listed in the Detailed results from analysis of six involved projects. Comparison of the usage of standards: artificial intelligence data quality.Multimedia Appendix 1
Abbreviations of semantic metadata or terminologies used by standards
- ATC: Anatomical Therapeutic Chemical Classification System; Birnlex: Biomedical Research Integrated Domain Group; CPT: Current Procedural Terminology; ICD: International Classification of Diseases; ICDO: International Classification of Diseases for Oncology; NAACR: North American Association of Central Cancer Registries; NCIT: National Cancer Institute Thesaurus; OSIRIS: Open Source Infrared Imaging System; RADLEX: Radiology Lexicon; RDFS: Resource Description Framework Schema; SPARQL: SPARQL Protocol and RDF Query Language; RxNorm: Prescription Normative Terminology; UCUM: Unified Code for Units of Measure; UMLS: Unified Medical Language System.
Comparing the Health Data Projects
To identify synergies among the 6 projects, we used the EHDS2 interoperability framework template to analyze adopted standards, highlighting similarities and differences in the implementation approach (detailed results given in Tables S1-S7 in Detailed results from analysis of six involved projects. Comparison of the usage of standards: artificial intelligence data quality.Multimedia Appendix 1
Mapping the Projects to the Created Template for the EHDS2 Interoperability Framework
The EHDS2 interoperability framework template identified key standards and technologies for the implementation of EHDS2, including HL7 FHIR; DICOM; OMOP-CDM; upcoming standards developed by ISO TC 215, ISO TC212, CEN TC 251; and ontology technologies, including W3C DCAT-AP. The key standards used in 6 projects, their focus areas, and planned future implementations are summarized in Table 4.
Project | Used standards | Planned or future standards |
IDERHA | HL7a FHIRb, DICOMc, and OMOP | ISOd TCe 215, DCAT-APf: HealthDCAT-AP |
BigPicture | DICOM, SNOMEDg, and ICD | ISO TC 212 (digital pathology and AIh), and Kidney Biopsy Codes |
EUCAIMi | Using its own hyperontology and common data model based on: DICOM, DICOM Seg, OMOP, FHIR, mCODE, DCAT-AP-Health, OSIRIS Ontologies used so far: LOINCj, SNOMED, UCUMk, RADLEXl, ICDO3, ICD-10m, CPT4, ICD10PCS, ATCn, NCITo, Birnlexp, NAACRq, Cancer Modifier | —r |
iHelp | HL7 FHIR, OMOP-CDM, ISO 27799:2016, SNOMED, LOINC, ICD-9, ICD-10, UMLS, SPARQL, RDFSs, RxNorm | — |
ASCAPE | HL7 FHIR, ISO/CEN 13606, LOINC, SNOMED | — |
HealthData@EU pilot | Observing and collecting standardization efforts rather than direct implementation | The first version of the Health DCAT-AP metadata standard is being developed in the project |
aHL7: Health Level 7.
bFHIR: Fast Health Interoperability Resources.
cDICOM: Digital Imaging and Communications in Medicine.
dISO: International Organization for Standardization.
eTC: Technical Committee.
fDCAT-AP: DCAT Application Profile for Data Portals in Europe.
gSNOMED: Systematized Medical Nomenclature for Medicine.
hAI: artificial intelligence.
iEUCAIM: European Cancer Imaging Initiative.
jLOINC: Logical Observation Identifiers Names and Codes.
kUCUM: Unified Code for Units of Measure.
lRADLEX: Radiology Lexicon.
mICD-10: International Classification of Diseases, Tenth Revision, Clinical Modification.
nATC: Anatomical Therapeutic Chemical classification system.
oNCIT: National Cancer Institute Thesaurus.
pBirnlex: Biomedical Research Integrated Domain Group.
qNAACR: North American Association of Central Cancer Registries.
rNot applicable.
sRDFS: Resource Description Framework Schema.
Main Findings and Lessons Learned
All projects support the recommended standards identified by the TEHDAS JA for EHDS2 interoperability. Although the DICOM standard is widely used for collecting, storing, and transferring medical imaging data, it lacks important information required to identify relevant images because DICOM metadata are not standardized [Park C, You SC, Jeon H, Jeong CW, Choi JW, Park RW. Development and validation of the radiology common data model (R-CDM) for the international standardization of medical imaging data. Yonsei Med J. 2022;63(Suppl):S74-S83. [FREE Full text] [CrossRef] [Medline]37]. To overcome this challenge, the EUCAIM project CDM is built upon the FHIR resources ImagingStudy and ImagingSeries, the Medical Imaging–CDM extension of the OMOP-CDM, the ProCAncer-I imaging extension [Kalokyri V, Kondylakis H, Sfakianakis S, Nikiforaki K, Karatzanis I, Mazzetti S, et al. MI-common data model: extending observational medical outcomes partnership-common data model (OMOP-CDM) for registering medical imaging metadata and subsequent curation processes. JCO Clin Cancer Inform. 2023;7:e2300101. [CrossRef] [Medline]38], and the OSIRIS imaging component. In this way, proper integration of imaging and clinical data was provided in alignment with the Integrating the Healthcare Enterprise [Integrating the Healthcare Enterprise (IHE Europe). URL: https://www.ihe-europe.net/ [accessed 2025-02-18] 39].
The BigPicture project selected the DICOM format as a standard for archiving whole-slide images (WSI) to support back-and-forth conversion of proprietary file formats. The BigPicture profile for DICOM WSI is based on work by the DICOM pathology working group WG26. The format is designed to allow efficient storage of a large number of annotations, for example, generated by AI algorithms [D4.03 report on unified open digital slide and annotation format specification. BigPicture. URL: https://bigpicture.eu/sites/default/files/2023-04/945358-BIGPICTURE_D4.03_Report%20on%20unified%20open%20digital%20slide%20and%20an notation%20format%20specification.pdf [accessed 2025-02-18] 40]. The project developed a Python tool (Wsidicom) to serve as a reference implementation of a DICOM WSI reader [imi-bigpicture / wsidicom. GitHub. URL: https://github.com/imi-bigpicture/wsidicom [accessed 2023-12-06] 41]. In addition, two tools, Opentile and Wsidicomizer, were developed to read other WSI formats and convert them to DICOM resulting in faster format conversion and maintained image quality [imi-bigpicture / wsidicomizer. GitHub. URL: https://github.com/imi-bigpicture/wsidicomizer [accessed 2023-12-06] 42].
To ensure structured data exchange and harmonized-consistent metadata interaction for data not captured in DICOM, BigPicture has developed additional metadata standards that define and comprise a set of substandards: (1) the data model (Common Mandatory Metadata Structure[CMMS]) comprising of all metadata, (2) a set of mandatory information that must be provided in relation to CMMS entities and their relations to each other, (3) a metadata file format that is used within BigPicture (flexible metadata file exchange format), and (4) a standard file structure of datasets containing metadata and data files that can be found on the repository. Standards are based on the European Genome-phenome Archive (EGA), and where possible existing standards have been incorporated (SNOMED, ICD for clinical data, Standardization for Exchange of Nonclinical Data [SEND] terminology, and International Harmonization of Nomenclature and Diagnostic Criteria (INHAND) nomenclature for nonclinical data). Results from BigPicture, such as metadata formats or quality control criteria, will feed into the ISO/AWI 24051-2 “Medical laboratories—Part 2: Digital pathology and artificial intelligence (AI)-based image analysis.”
Regarding the metadata standards, the HealthData@EU pilot published a landscape analysis of available metadata catalogs and metadata standards [Report on the landscape analysis of available metadata catalogues and the metadata standards in use. EHDS. URL: https://ehds2pilot.eu/available_results/report-on-the-landscape-analysis-of-available-metadata-catalogues-and-the-metadata-standards -in-use/ [accessed 2023-03-30] 43]. The pilot also develops an extension of DCAT-AP: HealthDCAT-AP that will be adopted by EHDS2 projects, like IDERHA [Hussein R, Balaur I, Burmann A, Ćwiek-Kupczyńska H, Gadiya Y, Ghosh S, et al. Getting ready for the European health data space (EHDS): IDERHA's plan to align with the latest EHDS requirements for the secondary use of health data. Open Res Eur. 2024;4:160. [CrossRef] [Medline]44]. This standard could be adopted later as the mandatory metadata standard foreseen under the EHDS regulation. The EUCAIM project designed its Hyper Ontology [EUCAIM's HyperOntology_v0.2beta. Zenodo. URL: https://zenodo.org/records/11109765 [accessed 2024-05-03] 45] using FHIR, UMLS, SNOMED-CT, and OMOP-CDM vocabularies.
While data from wearables are available in high volumes, a substantial amount of data is lost because the usability of the data is governed and limited by proprietary data formats from an increasing number of manufacturers, which shows the necessity of standardization to enable interoperability [El Saddik A. Digital twins: the convergence of multimedia technologies. IEEE MultiMedia. 2018;25(2):87-92. [CrossRef]46]. Among the six projects and initiatives, IDERHA, iHelp, and ASCAPE use sensors and wearables technologies. The iHelp project uses the holistic health records (HHR) model to enable the aggregation of data from different sources, sensors, and online platforms to support the seamless integration of multiple health dimensions [Manias G, Op DAH, Azqueta A, Burgos D, Capocchiano ND, Crespo BL, et al. iHELP: personalised health monitoring and decision support based on artificial intelligence and holistic health records. 2021. Presented at: IEEE Symposium on Computers and Communications (ISCC); September 08-10, 2021; Athens, Greece. [CrossRef]47].
The HL7 FHIR standard-compatible data structures were used in the context of the iHelp project to integrate primary and secondary data and to compile them into the holistic health records FHIR model [Manias G, Op DAH, Azqueta A, Burgos D, Capocchiano ND, Crespo BL, et al. iHELP: personalised health monitoring and decision support based on artificial intelligence and holistic health records. 2021. Presented at: IEEE Symposium on Computers and Communications (ISCC); September 08-10, 2021; Athens, Greece. [CrossRef]47]. FHIR can be used for clinical data and also for streaming data from sensors [Manias G, Azqueta-Alzúaz A, Dalianis A, Griffiths J, Kalogerini M, Kostopoulou K, et al. Advanced data processing of pancreatic cancer data integrating ontologies and machine learning techniques to create holistic health records. Sensors (Basel). 2024;24(6):1739. [FREE Full text] [CrossRef] [Medline]48]. The ASCAPE project successfully integrated EN/ISO 13606–standardized extracts from a patient mobile app into an electronic health record [Frid S, Fuentes Expósito MA, Grau-Corral I, Amat-Fernandez C, Muñoz Mateu M, Pastor Duran X, et al. Successful integration of EN/ISO 13606-standardized extracts from a patient mobile app into an electronic health record: description of a methodology. JMIR Med Inform. 2022;10(10):e40344. [FREE Full text] [CrossRef] [Medline]49]. IDERHA plans to extend the OMOP-CDM to address the PGHD, including patient-reported outcomes and patient-reported experience measures.
The BigPicture project highlighted that a dedicated standardization project focused on digital pathology is currently missing. It addressed the need for ISO/AWI 24051-2 “Medical laboratories—Part 2: Digital pathology and artificial intelligence (AI)-based image analysis” (under development, April 2024) that will build on ISO 20166-4:2021 “Molecular in vitro diagnostic examinations Specifications for pre-examination processes for formalin-fixed and paraffin-embedded (FFPE) tissue Part 4: In situ detection techniques” [D5.05 - Report on landscape of guidelines, standards and regulatory requirements relevant for digital pathology, clinical. BigPicture. URL: https://bigpicture.eu/sites/default/files/2023-04/945358-BIGPICTURE_D5.05%20-%20Report%20on%20 landscape%20of%20guidelines%2C%20clinical_final.pdf [accessed 2025-02-18]
50]. Figure 4 summarizes the main findings of the projects and how they can feed each other.

It is interesting to notice that the ETSI SmartM2M SAREF extensions for health and aging well (SAREF4EHAW) and wearables (SAREF4WEAR) are not known and not used on those six projects that we compared, even projects using IoT technologies with sensors, devices.
Key reasons for this gap include insufficient promotion, perceived relevance, integration challenges, and the presence of competitive standards. ETSI should enhance outreach, showcase successful implementations, collaborate with key projects, and simplify integration processes to improve adoption.
The other aspects, AI and federated learning, security and privacy, and data quality are covered by several approaches and standards. The adoption of federated standards and AI in cancer research projects across Europe represents a shift toward collaborative yet privacy-preserving medical research. These initiatives are essential for creating robust, scalable, and interpretable AI models that can significantly advance early detection, treatment, and overall patient care in oncology. The projects leverage various AI standards to achieve these goals. Some of the key AI standards being used have been included in Table S7 in Detailed results from analysis of six involved projects. Comparison of the usage of standards: artificial intelligence data quality.Multimedia Appendix 1
Creating Synergy Among the Involved Projects
We remarked on similarities and differences as we navigated the implementation of various standards among the projects. Despite the projects’ varied focuses, each project prioritizes interoperability, relying on various standards. This collective effort promotes knowledge exchange and innovation and fosters a digitally unified health care ecosystem for EHDS2. We also observed a trend towards common standards like HL7 FHIR and DICOM. While projects may vary in domain-specific standards and implementation nuances, leveraging established standards offers numerous advantages. This includes improved interoperability, streamlined development processes, scalability, and knowledge sharing.
Accordingly, intensive discussions accelerated the creation of the synergy among the groups that will be introduced in the Medical Informatics Europe 2024 conference [Gyrard A, Gribbon P, Hussein R, Abedian S, Bonmati LM, Cabornero GL, et al. Synergies among health data projects with cancer use cases based on health standards. Stud Health Technol Inform. 2024;316:1292-1296. [CrossRef] [Medline]21]. In addition, both the HealthData@EU pilot and the IDERHA project were addressed as EHDS use cases in the white paper on “IoT/Edge Computing and Health Data and Data Spaces” published by the Alliance for IoT and Edge Computing Innovation [AIOTI White Paper IoT/Edge Computing and Health Data and Data Spaces. AIOTI. 2024. URL: https://aioti.eu/aioti-white -paper-iot-edge-computing-and-health-data-and-data-spaces [accessed 2024-08-03] 51], and both projects also shared expertise on informing EHDS2 development. Experience from completed projects working with standards helps ongoing projects to decide on which standards to focus on, learn from their limitations, etc. For example, BigPicture designed a new standard ISO/AWI 24051-2.
Conclusion and Outlook
Standardization and legalization are the main pillars of EHDS. The HSbooster.eu initiative enabled intensive analysis of these aspects among six data-driven EU projects focusing on cancer. We aimed to create synergy among these projects to share the lessons learned in standards and legal implementation, sustainability of these projects, and harmonization with these with the EHDS2 implementation. Furthermore, we are involved in the development of standards such as ISO/IEC 21823-3:2021 “IoT - Interoperability for IoT Systems - Part 3 Semantic interoperability” (as editors), and since there is no ISO Standard for digital pathology and AI-based analysis of (whole slide) images for medical diagnosis, BigPicture is involved in ISO/PWI 24051-2 “Medical laboratories—Part 2: Digital pathology and artificial intelligence (AI)-based image analysis.”
Future work requires integration of results from TEHDAS 1&2 and HealthData@EU pilot as well as other EHDS supporting projects, such as QUANTUM [QUANTUM. URL: https://quantumproject.eu/ [accessed 2024-03-01] 52], which is seeking to develop the EHDS data quality and utility label. Additionally, enhancements to the EHDS2 interoperability framework template are needed, with more standards covering health ontologies, wearables, personal devices, data quality, as well as the usage of FAIR principles. The template can help researchers choose appropriate standards for their projects, thereby reducing time and effort in standard selection and implementation and improving EHDS2 implementation. Moreover, the template will be used in designing and creating an interactive toolkit for the EHDS2 interoperability framework. In this way, the existing and future projects can ensure alignment with governance and interoperability requirements for EHDS2.
EHDS has recently gained additional relevance since the European AI Act that has been approved by the European Parliament by resolution on March 13, 2024, explicitly refers to the EHDS as needing to “facilitate non-discriminatory access to health data and the training of AI algorithms on those data sets, in a privacy-preserving, secure, timely, transparent and trustworthy manner, and with an appropriate institutional governance.”
Acknowledgments
AG declares funding from HSBOOSTER 101058391, StandICT.eu 2026 101091933. The IDERHA (Integration of Heterogeneous Data and Evidence Towards Regulatory and Health Technology Assessments Acceptance) project is supported by the Innovative Health Initiative (IHI) Joint Undertaking (JU) under grant agreement 101112135. The JU receives support from the European Union’s Horizon Europe research and innovation program and life science industries represented by COCIR, EFPIA/Vaccines Europe, EuropaBio, and MedTech Europe. The BigPicture project has received funding from the Innovative Medicines Initiative 2 JU under grant agreement 945358. This JU receives support from the European Union’s Horizon 2020 research and innovation program and EFPIA. The iHelp project received funding from the European Union's Horizon 2020 research and innovation program under grant agreement 101017441. The ASCAPE (Artificial Intelligence Supporting Cancer Patients Across Europe) project received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement No 875351. The EUCAIM (European Cancer Imaging Initiative) project is cofunded by the European Union under grant agreement 101100633. The HealthData@EU pilot is cofunded by the European Union.
Authors' Contributions
AG and RH contributed to conceptualization, methodology, and writing – original draft. SA and PG assisted with methodology and writing – original draft. GM, RvN, KZ, IEN, GD, SN, LM-B, PM, SD, SA, MJ, IA, EGA, PH, IB, and AB contributed to validation and writing – review & editing.
Conflicts of Interest
None declared.
Multimedia Appendix 1
Detailed results from analysis of six involved projects. Comparison of the usage of standards: artificial intelligence data quality.
DOCX File , 51 KBReferences
- Cancer. World Health Organization. 2025. URL: https://www.who.int/europe/health-topics/cancer [accessed 2025-02-18]
- Communication from the Commission to the European Parliament and the Council. BRILL. 2012. URL: https://primary sources.brillonline.com/browse/human-rights-documents-online/communication-from-the-commission-to-the-european- parliament-and-the-council;hrdhrd46790058 [accessed 2023-02-01]
- Report on workshops - landscaping data driven projects and initiatives in the cancer field. European Commission. 2024. URL: https://digital-strategy.ec.europa.eu/en/library/report-workshops-landscaping-data-driven-projects-and-initiatives -cancer-field [accessed 2024-05-23]
- Kondylakis H, Kalokyri V, Sfakianakis S, Marias K, Tsiknakis M, Jimenez-Pastor A, et al. Data infrastructures for AI in medical imaging: a report on the experiences of five EU projects. Eur Radiol Exp. 2023;7(1):20. [FREE Full text] [CrossRef] [Medline]
- Second joint action towards the European Health Data Space – TEHDAS2. TEHDAS2. 2024. URL: https://tehdas.eu/project/ [accessed 2025-02-18]
- European Health Data Space. European Commission. 2024. URL: https://health.ec.europa.eu/ehealth-digital-health-and-care/european-health-data-space_en [accessed 2025-02-18]
- Palojoki S, Lehtonen L, Vuokko R. Semantic interoperability of electronic health records: systematic review of alternative approaches for enhancing patient information availability. JMIR Med Inform. 2024;12:e53535. [FREE Full text] [CrossRef] [Medline]
- DCAT application profile for data portals in Europe. European Comission. 2015. URL: https://interoperable-europe.ec.euro pa.eu/collection/semic-support-centre/solution/dcat-application-profile-data-portals-europe [accessed 2015-10-26]
- The New European Interoperability Framework. European Commission. URL: https://ec.europa.eu/isa2/eif_en/ [accessed 2017-03-23]
- Kouroubali A, Katehakis DG. Policy and strategy for interoperability of digital health in Europe. Stud Health Technol Inform. 2022;290:897-901. [CrossRef] [Medline]
- Stellmach C, Muzoora MR, Thun S. Digitalization of health data: interoperability of the proposed European health data space. Stud Health Technol Inform. 2022;298:132-136. [CrossRef] [Medline]
- Torab-Miandoab A, Samad-Soltani T, Jodati A, Rezaei-Hachesu P. Interoperability of heterogeneous health information systems: a systematic literature review. BMC Med Inform Decis Mak. 2023;23(1):18. [FREE Full text] [CrossRef] [Medline]
- Moreno-Conde J, Salas-Fernandez S, Moreno-Conde A. MedicalForms: integrated management of semantics for electronic health record systems and research platforms. Appl Sci. 2022;12(9):4322. [CrossRef]
- Options for governance models for the European Health Data Space. TEHDAS. 2023. URL: https://tehdas.eu/app/uploads/2023/01/tehdas-options-for-governance-models-for-the-european-health-data-space.pdf [accessed 2023-01-17]
- Recommendations to enhance interoperability within HealthData@EU- a framework for semantic, technical and organisational interoperability. TEHDAS. 2022. URL: https://tehdas.eu/tehdas1/app/uploads/2023/10/tehdas-recommendations-to-enhance -interoperability.pdf [accessed 2022-12-21]
- The role of harmonised standards as tools for AI act compliance. DLA Piper. URL: https://www.dlapiper.com/es-pr/insights/publications/2024/01/the-role-of-harmonised-standards-as-tools-for-ai-act-compliance [accessed 2025-02-18]
- Elevating data quality a paradigm shift for data spaces and AI needs. Big Data Value Association (BDVA). URL: https://bdva.eu/news/new-paper-?-elevating-data-quality-a-paradigm-shift-for-data-spaces-and-ai-needs/ [accessed 2024-05-29]
- European Health Data Space Data Quality Framework. TEHDAS. 2022. URL: https://tehdas.eu/tehdas1/app/uploads/2022/05/tehdas-european-health-data-space-data-quality-framework-2022-05-18.pdf [accessed 2022-05-18]
- Recommendations on a Data Quality Framework for the European Health Data Space for secondary use. TEHDAS. URL: https://tehdas.eu/tehdas1/app/uploads/2023/09/tehdas-recommendations-on-a-data-quality-framework.pdf [accessed 2023-09-26]
- Data Quality Framework for EU medicines regulation. European Medicines Agency (EMA). 2023. URL: https://www.ema.europa.eu/en/documents/regulatory-procedural-guideline/data-quality-framework-eu-medicines-regulation_en.pdf [accessed 2023-10-30]
- Gyrard A, Gribbon P, Hussein R, Abedian S, Bonmati LM, Cabornero GL, et al. Synergies among health data projects with cancer use cases based on health standards. Stud Health Technol Inform. 2024;316:1292-1296. [CrossRef] [Medline]
- IDERHA. URL: https://www.iderha.org/integrating-health-data [accessed 2025-02-18]
- A central repository of digital pathology slides to boost the development of artificial intelligence. BigPicture. URL: https://bigpicture.eu/ [accessed 2025-02-18]
- Cancer Image Europe. URL: https://cancerimage.eu/ [accessed 2023-09-29]
- iHelp. URL: https://ihelp-project.eu/ [accessed 2025-02-18]
- ASCAPE. URL: https://www.ascape-project.eu/ [accessed 2025-02-18]
- EHDS2 Pilot. URL: https://ehds2pilot.eu/ [accessed 2025-02-18]
- FHIR for FAIR - FHIR Implementation Guide. HL7 FHIR. URL: http://build.fhir.org/ig/HL7/fhir-for-fair/terminology.html [accessed 2024-01-05]
- HL7 FHIR. URL: https://fhir.org/ [accessed 2025-02-18]
- DICOM. URL: https://www.dicomstandard.org [accessed 2025-02-18]
- Observational Health Data Sciences and Informatics (OHDSI). 2024. URL: https://www.ohdsi.org/ [accessed 2021-01-06]
- International Organization for Standardization (ISO). URL: https://www.iso.org/home.html [accessed 2025-01-22]
- SAREF4EHAW: an extension of SAREF for eHealth Ageing Well domain. ETSI. URL: https://saref.etsi.org/saref4ehaw/v1.1.1/ [accessed 2020-02-01]
- SNOMED International. URL: https://www.snomed.org [accessed 2024-12-16]
- LOINC. URL: https://loinc.org/ [accessed 2025-02-18]
- International Statistical Classification of Diseases and Related Health Problems (ICD). World Health Organization. URL: https://www.who.int/standards/classifications/classification-of-diseases [accessed 2025-02-18]
- Park C, You SC, Jeon H, Jeong CW, Choi JW, Park RW. Development and validation of the radiology common data model (R-CDM) for the international standardization of medical imaging data. Yonsei Med J. 2022;63(Suppl):S74-S83. [FREE Full text] [CrossRef] [Medline]
- Kalokyri V, Kondylakis H, Sfakianakis S, Nikiforaki K, Karatzanis I, Mazzetti S, et al. MI-common data model: extending observational medical outcomes partnership-common data model (OMOP-CDM) for registering medical imaging metadata and subsequent curation processes. JCO Clin Cancer Inform. 2023;7:e2300101. [CrossRef] [Medline]
- Integrating the Healthcare Enterprise (IHE Europe). URL: https://www.ihe-europe.net/ [accessed 2025-02-18]
- D4.03 report on unified open digital slide and annotation format specification. BigPicture. URL: https://bigpicture.eu/sites/default/files/2023-04/945358-BIGPICTURE_D4.03_Report%20on%20unified%20open%20digital%20slide%20and%20an notation%20format%20specification.pdf [accessed 2025-02-18]
- imi-bigpicture / wsidicom. GitHub. URL: https://github.com/imi-bigpicture/wsidicom [accessed 2023-12-06]
- imi-bigpicture / wsidicomizer. GitHub. URL: https://github.com/imi-bigpicture/wsidicomizer [accessed 2023-12-06]
- Report on the landscape analysis of available metadata catalogues and the metadata standards in use. EHDS. URL: https://ehds2pilot.eu/available_results/report-on-the-landscape-analysis-of-available-metadata-catalogues-and-the-metadata-standards -in-use/ [accessed 2023-03-30]
- Hussein R, Balaur I, Burmann A, Ćwiek-Kupczyńska H, Gadiya Y, Ghosh S, et al. Getting ready for the European health data space (EHDS): IDERHA's plan to align with the latest EHDS requirements for the secondary use of health data. Open Res Eur. 2024;4:160. [CrossRef] [Medline]
- EUCAIM's HyperOntology_v0.2beta. Zenodo. URL: https://zenodo.org/records/11109765 [accessed 2024-05-03]
- El Saddik A. Digital twins: the convergence of multimedia technologies. IEEE MultiMedia. 2018;25(2):87-92. [CrossRef]
- Manias G, Op DAH, Azqueta A, Burgos D, Capocchiano ND, Crespo BL, et al. iHELP: personalised health monitoring and decision support based on artificial intelligence and holistic health records. 2021. Presented at: IEEE Symposium on Computers and Communications (ISCC); September 08-10, 2021; Athens, Greece. [CrossRef]
- Manias G, Azqueta-Alzúaz A, Dalianis A, Griffiths J, Kalogerini M, Kostopoulou K, et al. Advanced data processing of pancreatic cancer data integrating ontologies and machine learning techniques to create holistic health records. Sensors (Basel). 2024;24(6):1739. [FREE Full text] [CrossRef] [Medline]
- Frid S, Fuentes Expósito MA, Grau-Corral I, Amat-Fernandez C, Muñoz Mateu M, Pastor Duran X, et al. Successful integration of EN/ISO 13606-standardized extracts from a patient mobile app into an electronic health record: description of a methodology. JMIR Med Inform. 2022;10(10):e40344. [FREE Full text] [CrossRef] [Medline]
- D5.05 - Report on landscape of guidelines, standards and regulatory requirements relevant for digital pathology, clinical. BigPicture. URL: https://bigpicture.eu/sites/default/files/2023-04/945358-BIGPICTURE_D5.05%20-%20Report%20on%20 landscape%20of%20guidelines%2C%20clinical_final.pdf [accessed 2025-02-18]
- AIOTI White Paper IoT/Edge Computing and Health Data and Data Spaces. AIOTI. 2024. URL: https://aioti.eu/aioti-white -paper-iot-edge-computing-and-health-data-and-data-spaces [accessed 2024-08-03]
- QUANTUM. URL: https://quantumproject.eu/ [accessed 2024-03-01]
Abbreviations
AI: artificial intelligence |
AI4HI: Artificial Intelligence for Health Imaging |
ASCAPE: Artificial Intelligence Supporting Cancer Patients Across Europe |
BDVA: Big Data Value Association |
CDM: common data model |
CMMS: Common Mandatory Metadata Structure |
DCAT: Data Catalog Vocabulary |
DCAT-AP: DCAT Application Profile for Data Portals in Europe |
DICOM: Digital Imaging and Communications in Medicine |
DQF: data quality framework |
EBCP: Europe Beating Cancer Plan |
EC: European Commission |
EHDS: European Health Data Space |
EHDS2: European Health Data Space for the secondary use of data |
EIF: European Interoperability Framework |
EMA: European Medicine Agency |
EGA: European Genome-Phenome Archive |
EU: European Union |
EUCAIM: European Cancer Imaging Initiative |
FAIR: Findability, Accessibility, Interoperability, and Reusability |
FHIR: Fast Healthcare Interoperability Resource |
GDPR: General Data Protection Regulation |
HL7: Health Level 7 |
IDERHA: Integration of Heterogeneous Data and Evidence Towards Regulatory and Health Technology Assessments Acceptance |
IEC: International Electrotechnical Commission |
INHAND: International Harmonization of Nomenclature and Diagnostic Criteria |
IoT: Internet of Things |
ISO: International Organization for Standardization |
JA: joint action |
OHDSI: Observational Health Data Sciences and Informatics |
OMOP: Observational Medical Outcomes Partnership |
PGHD: patient-generated health data |
SAREF: Smart Applications Reference Ontology |
SEND: Standardization for Exchange of Nonclinical Data |
SNOMED CT: Systematized Medical Nomenclature for Medicine–Clinical Terminology |
TC: Technical Committee |
TEHDAS: Towards the European Health Data Space |
WHO: World Health Organization |
WSI: whole-slide images |
W3C: World Wide Web Consortium |
Edited by A Coristine; submitted 09.09.24; peer-reviewed by A Billis, Z Hou, FA Causio; comments to author 18.11.24; revised version received 10.12.24; accepted 31.01.25; published 24.03.25.
Copyright©Amelie Gyrard, Somayeh Abedian, Philip Gribbon, George Manias, Rick van Nuland, Kurt Zatloukal, Irina Emilia Nicolae, Gabriel Danciu, Septimiu Nechifor, Luis Marti-Bonmati, Pedro Mallol, Stefano Dalmiani, Serge Autexier, Mario Jendrossek, Ioannis Avramidis, Eva Garcia Alvarez, Petr Holub, Ignacio Blanquer, Anna Boden, Rada Hussein. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 24.03.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.