Published on in Vol 27 (2025)

This is a member publication of University College London (Jisc)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/57237, first published .
Balancing Risks and Opportunities: Data-Empowered-Health Ecosystems

Balancing Risks and Opportunities: Data-Empowered-Health Ecosystems

Balancing Risks and Opportunities: Data-Empowered-Health Ecosystems

Viewpoint

1University College London, London, United Kingdom

2World Organisation for Animal Health, Paris, France

3University of Sussex, Brighton, United Kingdom

4Our Future Health, London, United Kingdom

5UK Health Security Agency, London, United Kingdom

Corresponding Author:

Lan Li, PhD

University College London

Gower Street

London, WC1E6BT

United Kingdom

Phone: 44 07529917633

Email: lan.li.19@ucl.ac.uk


This viewpoint paper addresses the ongoing challenges and opportunities within the data-for-health ecosystem, drawing insights from a multistakeholder workshop. Despite notable progress in the digitization of health care systems, data sharing and interoperability remain limited, so the full potential of health care data is not realized. There is a critical need for data ecosystems that can enable the timely, safe, efficient, and sustainable collection and sharing of health care data. However, efforts to meet this need face risks related to privacy, data protection, security, democratic governance, and exclusion. Key challenges include poor interoperability, inconsistent approaches to data governance, and concerns about the commodification of data. While emerging platforms such as social media play a growing role in gathering and sharing health information, their integration into formal data systems remains limited. A robust and secure data-for-health ecosystem requires stronger frameworks for data governance, interoperability, and citizen engagement to build public trust. This paper argues that reframing health care data as a common good, improving the transparency of data acquisition and processing, and promoting the use of application programming interfaces (APIs) for real-time data access are essential to overcoming these challenges. In addition, it highlights the need for international norms and standards guided by multisector leadership, given the multinational nature of data sharing. Ultimately, this paper emphasizes the need to balance risks and opportunities to create a socially acceptable, secure, and effective data-sharing ecosystem in health care.

J Med Internet Res 2025;27:e57237

doi:10.2196/57237

Keywords



Data are essential to the delivery of effective health care and the improvement of health outcomes. However, the power of data relevant to health has not been fully realized. There is a need to nurture and develop data-for-health ecosystems to collect and share valuable data through transformative and responsible technological systems in timely, safe, efficient, and sustainable ways that maximize data utility for the public good.

A “data ecosystem” refers to a network of entities engaged in generating, sharing, and using data resources through various technologies [Oliveira MIF, Lóscio BF. What is a data ecosystem. 2018. Presented at: dg.o '18: Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age; June 1, 2018; New York, NY. [CrossRef]1]. In the context of a data-for-health ecosystem, this network is diverse and decentralized, comprising public and private institutions, as well as individuals acting in various roles such as citizens, patients, and consumers. Data are actively and passively generated across all levels of this ecosystem, intentionally recorded for health purposes (eg, National Health Service [NHS] patient records) or initially generated for different reasons but repurposed for health care improvement (eg, geospatial air quality data shared to assist citizens with respiratory issues). Data management within this ecosystem varies across a spectrum: from tightly controlled data, such as patient records protected by strict privacy regulations and access protocols to ensure confidentiality, to fully open data, such as anonymized census or survey data that are freely accessible to the public for research and policy-making purposes. This range encompasses well-structured databases and unstructured streams from sources such as social media and Internet of Things devices, with knock-on effects for how data can be used.

Electronic devices generate digital traces that contribute valuable insights into personal health, extending beyond traditional patient records to include data held by IT and mobile corporations, and even loyalty card accounts [Brewer HR, Hirst Y, Chadeau-Hyam M, Johnson E, Sundar S, Flanagan JM. Association between purchase of over-the-counter medications and ovarian cancer diagnosis in the cancer loyalty card study (CLOCS): observational case-control study. JMIR Public Health Surveill. 2023;9:e41762. [CrossRef] [Medline]2]. The wide spectrum and variety of spatial-temporal geo-located big data streams that are collected include web-based search keywords, internet and store-based purchases, sensors and Internet of Things (IoT) data, health and fitness app data, and mobile phone location data [Salathé M. Digital epidemiology: what is it, and where is it going? Life Sci Soc Policy. 2018;14(1):1. [CrossRef] [Medline]3,Gardy JL, Loman NJ. Towards a genomics-informed, real-time, global pathogen surveillance system. Nat Rev Genet. 2018;19(1):9-20. [CrossRef] [Medline]4].

The rapid evolution of this ecosystem poses risks related to privacy, data protection and security, transparency, access, and exclusion, as highlighted in the 2014 United Nations report “A World That Counts” [Gonzalez L, Hsu YC, Poole J, Rae B, Rutherford I. A world that counts: mobilising the data revolution for sustainable development. UN Data Revolution. 2014. URL: https://www.undatarevolution.org/wp-content/uploads/2014/11/A-World-That-Counts.pdf [accessed 2025-02-22] 5]. The COVID-19 pandemic heightened concerns, due to the proliferation of systems capturing personal information for public health benefit, but often at the expense of robust debate about privacy, data ownership, and technology regulation.

Despite the potential for leveraging digital traces in health care, limitations persist, revealing gaps and missed opportunities [Li L, Novillo-Ortiz D, Azzopardi-Muscat N, Kostkova P. Digital data sources and their impact on people's health: a systematic review of systematic reviews. Front Public Health. 2021;9:645260. [FREE Full text] [CrossRef]6]. The lack of interoperability between platforms and databases hinders data sharing across sectors, disciplines, countries, and international agencies. While such data holds substantial value for individuals, clinicians, health system planners, policy makers, researchers, and innovators, restrictions on visibility, access, and sharing impede its full potential. For instance, integrating data from health and fitness apps with electronic patient records could enhance clinical understanding, but business interests often deter app companies from developing interfaces for seamless integration.

Beyond interoperability, cross-border data sharing presents another critical hurdle, particularly in addressing global health challenges. Despite its growing importance, legal, regulatory, and technical barriers continue to limit progress. The lack of harmonized data protection standards across countries complicates the sharing of health data, particularly when navigating diverse privacy regulations like the General Data Protection Regulation (GDPR) in the European Union (EU) and less stringent frameworks in other regions. Countries often have conflicting data protection laws, making it difficult to ensure compliance while enabling international data flows. In addition, concerns over data sovereignty, where countries insist on controlling their citizens’ data within national borders, further limit cross-border collaboration [Chin YC, Zhao J. Governing cross-border data flows: international trade agreements and their limits. Laws. 2022;11(4):63. [CrossRef]7,Hummel P, Braun M, Tretter M, Dabrock P. Data sovereignty: a review. Big Data Soc. 2021;8(1):205395172098201. [CrossRef]8]. In public health emergencies, such restrictions can delay data access, hindering rapid responses. For example, during the COVID-19 pandemic, differing regulations and limited cross-border collaboration created obstacles for real-time sharing of epidemiological data, impeding global tracking efforts [Towett G, Snead RS, Grigoryan K, Marczika J. Geographical and practical challenges in the implementation of digital health passports for cross-border COVID-19 pandemic management: a narrative review and framework for solutions. Global Health. 2023;19(1):98. [CrossRef] [Medline]9].

In 2016, the “Findable, Accessible, Interoperable, Reusable” principle was introduced as a guide for improving data reusability, focusing on both machine and human accessibility by stakeholders from academia, industry, funding agencies, and scholarly publishers [Wilkinson MD, Dumontier M, Aalbersberg IJJ, Appleton G, Axton M, Baak A, et al. The FAIR guiding principles for scientific data management and stewardship. Sci Data. 2016;3(1):160018. [FREE Full text] [CrossRef] [Medline]10]. Nevertheless, government agencies encounter significant challenges in leveraging population-level data for decision-making due to issues like delayed and inconsistent official data collection and aggregation, and the cost of accessing commercial datasets. Such difficulties were exacerbated during the COVID-19 pandemic, particularly for agencies in low- and middle-income countries with limited funding and infrastructure [Salama C. First lessons from government evaluations of COVID-19 responses: a synthesis. OECD. 2022. URL: https://tinyurl.com/yecuc8j9 [accessed 2023-06-12] 11]. Increasingly, nonofficial data is relied upon to provide timely information. This may be generated by citizens or communities as an evidence base for change or to enhance social accountability, but it is often collected by private providers, with individual consumers contributing their data either actively or passively. Increasingly, multinational technology companies engaging in “massive and passive” data collection are dominating data-for-health ecosystems. This trend was accelerated during the COVID-19 pandemic [Roberts S, Kostkova P. Disease surveillance, digital futures, and data-sharing in a world 'after' COVID-19. Global Policy Journal. 2021. URL: https:/​/www.​globalpolicyjournal.com/​articles/​science-and-technology/​disease-surveillance-digital-futures-and-data-sharing-world-after [accessed 2023-06-12] 12] when social media platforms and search tools such as Twitter, Facebook, Google, and Bing played a pivotal role in providing timely information for researchers to monitor public sentiment and assess intervention impacts [Li L, Aldosery A, Vitiugin F, Nathan N, Novillo-Ortiz D, Castillo C, et al. The response of governments and public health agencies to COVID-19 pandemics on social media: a multi-country analysis of Twitter discourse. Front Public Health. 2021;9:716333. [FREE Full text] [CrossRef] [Medline]13]. While the technology industry certainly benefits from leveraging personal data for web-based marketing, digital epidemiologists demonstrate how these new data streams can also enhance understanding of citizen behavior, support decision-making, and enable reporting of disease symptoms.

The aim of this viewpoint paper is to explore the key challenges and opportunities within these evolving ecosystems, synthesizing insights from the “Data Enabled Society for Health: Challenges and Opportunities” workshop held in London by University College London. This workshop brought together health care providers, policymakers, technologists, and researchers, to consider how to create a secure and effective data-for-health ecosystem that facilitates the gathering and sharing of valuable data for both individual and population-level health benefits, while effectively managing the associated risks. We suggest that to optimize the use of health data, governments and institutions should work on three overlapping areas: data governance, interoperability, and participation and citizen engagement..


Ongoing issues with data governance, data access, and privacy highlighted during the workshop extend beyond regulatory and technical concerns to profound ethical dilemmas. These include the challenge of obtaining meaningful informed consent in complex data ecosystems, the risk of data commodification that reduces individuals to mere data points, and the ethical responsibility of both public and private entities to prioritize the rights of individuals over profit or efficiency. These concerns also intersect with issues of data equity, where marginalized groups may face greater risks of exclusion or exploitation.

Within the public sector, citizens are not the only ones to limit access to their data due to privacy or security concerns. Governmental organizations and health care institutions are also very wary of granting access to their datasets, sometimes even to researchers, fearing either reputational damage due to mishaps or the loss of future financial opportunities if they share their assets.

This protective attitude towards an organization’s data assets reflects the perception of data as a commodity: something that can be “extracted” from individuals or communities and then traded. We think it is important to reframe this narrative and look at personal data in the same way we look at common goods, with regulated access and shared collective benefits [Taylor MJ, Wilson J. Reasonable expectations of privacy and disclosure of health data. Med Law Rev. 2019;27(3):432-460. [FREE Full text] [CrossRef] [Medline]14,Kostkova P, Brewer H, de Lusignan S, Fottrell E, Goldacre B, Hart G, et al. Who owns the data? Open data for healthcare. Front Public Health. 2016;4:7. [FREE Full text] [CrossRef] [Medline]15].

Attitudes against sharing medical information can stem from confusing messages and a lack of controls in the past, as well as fear of data being shared with third parties in ways that invade privacy, lead to increased surveillance and authoritarianism, or enable personal data exploitation against the individual’s interests (eg, with an insurance provider). Citizen engagement needs to be specific and honest about the risks and benefits of data sharing, rather than relying on an abstract notion of how data sharing may benefit humankind, to avoid pitfalls such as the failed launch of NHS Care.data initiatives [Carter P, Laurie GT, Dixon-Woods M. The social licence for research: why care.data ran into trouble. J Med Ethics. 2015;41(5):404-409. [FREE Full text] [CrossRef] [Medline]16] and the abandoned centralized NHS COVID-19 Test and Trace app [Wise J. Covid-19: UK drops its own contact tracing app to switch to apple and google model. BMJ. 2020;369:m2472. [CrossRef] [Medline]17]. At the policy level, the key challenges are increasing the transparency of data governance and developing robust deidentification techniques that preserve the usability of data for research, rather than anonymization. Transparency and open dialogue with citizens are paramount for regaining public trust and setting cornerstones for a balanced agenda. This must be seen as an ongoing process, as trust is never a given.

The above-described scenario of data sharing sits in stark contrast to the vast quantity of personal physiological and health data collected through social media, wearables, tracking devices, MedTech, and geo-located mobile apps. While there are differences in legal frameworks across geographies, for example, in Europe (where GDPR applies, giving more control to citizens over their data), the United States, and China, personal data are generally subject to industry-defined terms and conditions, often with no clear or accessible opt-out clauses allowing use for personalized web-based or mobile marketing. This happens without much awareness or evident concern from citizens and in the absence of robust policy debate. To rebalance the current situation, there is an urgent need for international regulation, oversight, and a restoration of user controls over personal data.

The evolving nature of data-for-health ecosystems therefore raises ethical issues that require deeper consideration. These include mechanisms for ensuring informed consent, particularly in settings where data collection occurs passively or through third-party platforms, often without clear communication with individuals. Traditional models of consent are increasingly inadequate in such ecosystems, leading to a growing call for more dynamic or granular consent mechanisms that allow individuals greater control over specific types of data use. Moreover, the balance between individual privacy and public health is a critical ethical dilemma. While sharing health data has clear societal benefits, especially during public health emergencies like the COVID-19 pandemic, it raises the risk of compromising individual privacy. Finding an ethical equilibrium requires robust deidentification techniques, transparent data governance frameworks, and public engagement in decision-making processes. Finally, we must address the ethical issues of data commodification, where individuals’ health data is treated as a commercial product. This commodification risks not only violating privacy but also entrenching inequalities, as those with fewer resources may have less control over their data and greater exposure to exploitation.


In addition to governance and access to data, the lack of common platforms and standards for data sharing limits collaboration across institutions and countries. This can have potentially devastating consequences in public health emergencies when a rapid and coordinated response is needed.

A notable example of successful collaboration is the Innovative Medicines Initiative ConcePTION consortium (2019), a European partnership involving public and private organizations that collect or access data related to pregnancy, childbirth, and lactation. Innovative Medicines Initiative ConcePTION aims to enhance the availability of health data and transform it into actionable evidence to improve clinical practices and outcomes [Hollestelle MJ, van der Graaf R, Sturkenboom MC, van Delden JJ. Stimulating solidarity to improve knowledge on medications used during pregnancy: a contribution from the ConcePTION project. BMC Med Ethics. 2023;24(1):44. [FREE Full text] [CrossRef] [Medline]18]. Another valuable framework is the Unified Information and Interoperability Governance model, which integrates principles of both information governance and interoperability. The Unified Information and Interoperability Governance framework was designed to improve adaptability, flexibility, and efficiency in health information usage across the entire health care system [Tumulak A, Tin J, Keshavjee K. Towards a Unified framework for information and interoperability governance. The Role of Digital Health Policy and Leadership. 2024:49-53. [CrossRef]19]. Similarly, the United Nations Educational, Scientific and Cultural Organization - Committee on Data for Science and Technology (UNESCO-CODATA) Global Project contributes to the UNESCO Open Science Toolkit by providing comprehensive guidance, checklists, and factsheets specifically aimed at fostering data transparency, accessibility, and collaboration during crises [Data policy for times of crisis facilitated by Open Science. CODATA. URL: https://codata.org/initiatives/data-policy/dptc/ [accessed 2024-09-18] 20]. The project delivers tools for both policy makers and scientists to support data collection, management, and governance. Such frameworks could be instrumental in future health crises, aligning with the goals of creating more interoperable and transparent data ecosystems.

One underused approach to health data access is that of application programming interfaces (APIs), which facilitate real-time, secure data sharing between systems. When integrated with emerging technologies like blockchain and federated learning, APIs can support more secure, privacy-preserving, and decentralized data exchanges. Blockchain can provide the security backbone for APIs, ensuring that data integrity and ownership are maintained, while federated learning can allow institutions to collaborate on data analytics without sharing sensitive raw data. Combined with machine learning algorithms, APIs can further enhance real-time data analysis, enabling rapid public health responses based on predictive insights. This is of particular importance for real-time data streams, such as those generated by social media or IoT sensors. An example of this approach was Twitter allowing free API access to 1% of its database—a bold move that provided real-time data and fueled thousands of research projects in social computing, epidemic intelligence, sentiment analysis, and network and graph computing over a decade [Sinnenberg L, Buttenheim AM, Padrez K, Mancheno C, Ungar L, Merchant RM. Twitter as a tool for health research: a systematic review. Am J Public Health. 2017;107(1):e1-e8. [CrossRef] [Medline]21]. It was also incorporated into the epidemic intelligence platforms used by the World Health Organization (WHO) and its member states, such as MediSys, Epidemic Intelligence from Open Sources (EIOS), and Global Public Health Intelligence Network. This provision ceased under Twitter’s new ownership [Ledford H. Researchers scramble as Twitter plans to end free data access. Nature. 2023;614(7949):602-603. [CrossRef] [Medline]22]. Free access to the Twitter API was suspended in February 2023, causing significant disruption to academic research that relied on social media data [Murtfeldt R, Alterman N, Kahveci I, West JD. RIP Twitter API: a eulogy to its vast research contributions. arXiv:2404.07340. 2024. [CrossRef]23]. Other examples of real-time data gathering and integration platforms include the flights portal Skyscanner and the financial products portal MoneySuperMarket. However, real-time data access for health applications is lacking.

Blockchain offers a groundbreaking solution for managing health data by using distributed, immutable ledgers that enhance transparency, security, and trust. In a data-for-health ecosystem, blockchain technology ensures that health data remains unaltered and that individuals retain control over their personal information through the use of smart contracts. These smart contracts facilitate secure, automated agreements between data owners and users, ensuring that data sharing complies with predefined privacy rules. The decentralized nature of blockchain makes it especially suitable for secure cross-border data sharing among institutions, allowing real-time access while safeguarding data integrity. Moreover, blockchain can support granular consent mechanisms, enabling individuals to specify who can access their data and under what conditions. A notable example is Estonia’s health care system, which has successfully integrated blockchain technology to secure over 1 million patient records, ensuring both data privacy and transparency [Alexopoulos C, Charalabidis Y, Loutsaris MA, Lachana Z. How blockchain technology changes government: a systematic analysis of applications. Int J Public Adm Digit Age. 2021;8(1):1-20. [CrossRef]24].

Meanwhile, machine learning (ML) is revolutionizing health care by enabling the analysis of large and complex datasets to identify patterns, predict outcomes, and support better decision-making. In data-for-health ecosystems, ML can be used to analyze real-time data streams from various sources, such as social media, wearable devices, and electronic medical records, to predict public health trends like disease outbreaks or shifts in vaccination behavior. ML algorithms enhance real-time epidemiological surveillance, facilitating the early detection of potential health threats through the analysis of social media activity, internet search trends, or mobile health app data. This predictive capability aids in timely public health interventions and resource allocation. For instance, Google’s DeepMind has applied ML to the NHS to improve diagnostic accuracy and predict patient outcomes by analyzing historical health records [Powles J, Hodson H. Google DeepMind and healthcare in an age of algorithms. Health Technol (Berl). 2017;7(4):351-367. [FREE Full text] [CrossRef] [Medline]25].

Other innovative methods enable ML models to be trained on decentralized data without requiring the sharing of raw data between institutions. Federated learning allows health care institutions to collaborate on model development while maintaining their data locally, thus ensuring data privacy and compliance with regulatory standards like the GDPR. It promotes interoperability without compromising data ownership, making it ideal for cross-institutional collaborations. For example, federated learning has been successfully implemented in medical imaging projects, improving disease detection without the need for direct data sharing across institutions [Darzidehkalani E, Ghasemi-Rad M, van Ooijen PMA. Federated learning in medical imaging: part I: toward multicentral health care ecosystems. J Am Coll Radiol. 2022;19(8):969-974. [FREE Full text] [CrossRef] [Medline]26]. Similarly, zero-knowledge proofs offer a method for verifying the truth of information without revealing the underlying data. This can significantly enhance data-for-health ecosystems by allowing health institutions to share valuable insights while preserving the privacy of sensitive patient information. For example, health systems could confirm the validity of vaccination status or other health data without exposing any personal identifiers. Cryptographic zero-knowledge proof systems have been explored in secure identity verification services, such as confirming COVID-19 results, while keeping patient identities confidential [Jo U, Oktian YE, Kim D, Oh S, Lee H, Kim H. A zero-knowledge-range-proof-based privacy-preserving blockchain platform for COVID-19 contact tracing. 2022. Presented at: International Conference on Platform Technology and Service (PlatCon); August 24, 2022:53-58; Jeju, Korea. [CrossRef]27]. In addition, the emerging role of the Metaverse in health care is gaining attention, as its potential applications continue to grow with advancements in Web 3.0 technology [Li Y, Gunasekeran DV, RaviChandran N, Tan TF, Ong JCL, Thirunavukarasu AJ, et al. The next generation of healthcare ecosystem in the metaverse. Biomed J. 2024;47(3):100679. [FREE Full text] [CrossRef] [Medline]28]. The Metaverse offers an immersive, interoperable ecosystem that could redefine the traditional health care system through avatar-based meetings, simulations, and social interactions between patients, providers, and organizations. However, technological innovation, regulatory oversight, and sound governance are needed to address current challenges in its development.


Limited access to technology in certain sectors of society and parts of the world can be attributed to resource and infrastructure constraints. Collaborative efforts across governments and communities are essential to address these challenges and prevent exclusion based on factors such as age or socioeconomic status [Vassilakopoulou P, Hustad E. Bridging digital divides: a literature review and research agenda for information systems research. Inf Syst Front. 2023;25(3):955-969. [FREE Full text] [CrossRef] [Medline]29].

Recently, concerns about data use and potential misuse have intensified due to health care provider hacking incidents and a lack of transparent communication regarding consequences and damages [Bharti N, O'Donovan C, Smallman M, Wilson J. Public trust, deliberative engagement and health data projects: beyond legal provisions. Engag Sci Technol Soc. 2021;7(1):125-133. [CrossRef]30]. Case studies such as the Care.data program [Taylor MJ, Wilson J. Reasonable expectations of privacy and disclosure of health data. Med Law Rev. 2019;27(3):432-460. [FREE Full text] [CrossRef] [Medline]14] and General Practice Data for Planning and Research [Bharti N, O'Donovan C, Smallman M, Wilson J. Public trust, deliberative engagement and health data projects: beyond legal provisions. Engag Sci Technol Soc. 2021;7(1):125-133. [CrossRef]30] underscore the critical role of public trust in the success of health data initiatives. Public skepticism, evidenced by opt-out rates and project pauses, demonstrates the need for responsible data use [Borda A, Molnar A, Neesham C, Kostkova P. Ethical issues in AI-enabled disease surveillance: perspectives from global health. Appl Sci. 2022;12(8):3890. [CrossRef]31].

Diminishing public trust, exemplified by incidents like the NHS Royal Free Trust data privacy breach, has profound consequences for current and future health data initiatives [Bharti N, O'Donovan C, Smallman M, Wilson J. Public trust, deliberative engagement and health data projects: beyond legal provisions. Engag Sci Technol Soc. 2021;7(1):125-133. [CrossRef]30]. To foster meaningful citizen participation, mechanisms such as multistakeholder platforms, cocreation exercises, and public consultations can be used. Engaging the public in deliberative discussions, as demonstrated by the Understanding Patient Data (UPD) initiative, is crucial [Understanding Patient Data. 2023. URL: https://understandingpatientdata.org.uk/about-us [accessed 2023-04-17] 32]. Through cocreative workshops [Asher R, Jesper-Mir E. Why do you need my data? A co-creative approach to enriching public dialogues around health data. National Co-ordinating Centre for Public Engagement. 2022. URL: https:/​/www.​publicengagement.ac.uk/​whats-new/​blog/​why-do-you-need-my-data-co-creative-approach-enriching-public-dialogues-around-health [accessed 2023-07-26] 33], UPD facilitates meaningful discussions among diverse stakeholders, ensuring public perspectives and concerns are not only heard but also integrated into decision-making processes [Scott K, Burall S, Perrin N, Shelton P, White D, Irvine G. Data for Public Benefit Balancing the risks and benefits of data sharing. Understanding Patient Data. 2018. URL: https://understandingpatientdata.org.uk/sites/default/files/2018-04/Data%20for%20public%20good_0.pdf [accessed 2017-05-28] 34]. This inclusive approach cultivates trust in responsible data usage and ensures citizen expectations shape data governance practices. Collaborative governance models offer a structure where stakeholders can jointly manage and govern data processes [Ansell C, Gash A. Collaborative governance in theory and practice. J Public Adm Res Theory. 2008;18(4):543-571. [CrossRef]35]. These models emphasize shared accountability, transparency, and continuous feedback, helping to mitigate power imbalances between different groups. Such models can be adapted to include conflict-resolution strategies to address disagreements or tensions that may arise during the decision-making process. Furthermore, achieving alignment across disparate groups requires consensus-building strategies. Techniques such as participatory decision-making frameworks and deliberative processes can help in forming agreement on key issues such as data ownership, privacy, and access rights [Kaner S. Facilitator's Guide to Participatory Decision-Making. San Francisco, CA. John Wiley & Sons; 2014. 36].

Another notable initiative that frames health data as a common good is the Our Future Health program delivered in collaboration with the NHS. This program aims to collect health information from up to 5 million adults across the United Kingdom on a voluntary basis [Our Future Health. The UK's largest health research programme. 2024. URL: https://study.ourfuturehealth.org.uk/welcome [accessed 2024-09-18] 37]. By September 2024, more than 1.8 million people had taken part. Researchers from universities, charities, the NHS, and health-related companies can apply for access to Our Future Health resources. It is anticipated that this will enable them to explore and test innovative approaches for predicting, detecting, and treating common diseases such as dementia, cancer, diabetes, heart disease, and stroke [Our Future Health. The UK's largest health research programme. 2024. URL: https://study.ourfuturehealth.org.uk/welcome [accessed 2024-09-18] 37].

In summary, Textbox 1 outlines the key future directions, recommendations, and research agenda emerging from the workshop discussions.

Textbox 1. Future directions, recommendations, and research agenda.

Research priorities

Several key areas require further investigation to advance the development of robust and secure data-for-health ecosystems:

  • Data privacy and security: Future research should focus on developing and refining privacy-preserving technologies that enable secure data sharing without compromising individual privacy. Researchers should also explore the effectiveness of granular consent mechanisms in empowering individuals to control their data.
  • Interoperability: Addressing the lack of interoperability between health care systems, both within and across national borders, is vital. Building on successful interoperability and disease coding standards such as SNOMED-CD (Systematized Medical Nomenclature for Medicine–Clinical Terminology) and ICD-10 (International Statistical Classification of Diseases and Related Health Problems, 10th Revision) [HL7 Version 2 Product Suite. HL7 International. URL: https://www.hl7.org/implement/standards/product_brief.cfm?product_id=185 [accessed 2025-02-06] 38,ICD-10 Version. 2019. URL: https://icd.who.int/browse10/2019/en [accessed 2025-02-06] 39], research should prioritize the development of standardized data-sharing protocols and application programming interface–based solutions that can facilitate real-time data exchange while ensuring compliance with differing regulatory frameworks.
  • Ethical governance models: As data-for-health ecosystems evolve, the development of inclusive governance models that balance the interests of diverse stakeholders will be essential. Researchers should focus on designing collaborative governance structures that integrate citizen engagement and address concerns about data commodification and trust.
  • Cross-border data sharing: Given the global nature of health challenges, it is crucial that future research addresses solutions for overcoming the legal and regulatory barriers to cross-border data sharing. This effort should focus on developing and endorsing international agreements, as well as harmonizing standards, to ensure the secure and ethical exchange of health data. Key stakeholders in this process should include organizations such as the World Health Organization, European Commission, and national Ministries of Health. An important step in this direction is building on existing frameworks, like the European Health Information Gateway [WHO European health information at your fingertips. URL: https://gateway.euro.who.int/en/datasets/ [accessed 2025-02-06] 40], to facilitate greater collaboration. In addition, fostering partnerships between the World Health Organization, the Association of European Operational Research Societies, and the European Centre for Disease Prevention and Control could further enhance data accessibility, with the creation of a joint data portal and application programming interface for developers to access health data securely and seamlessly.

Recommendations for policy makers

Policy makers play a crucial role in shaping the future of data-for-health ecosystems. While this is the hardest challenge to address due to the varying priorities of member states and IT industry commercial pressure We recommend the following actions:

  • Regulatory harmonization: policy makers should work toward the harmonization of data protection standards globally, to facilitate cross-border data sharing while safeguarding privacy. Collaborative efforts between international organizations and governments are key to establishing global frameworks for data governance.
  • Incentivizing technological innovation: governments should provide incentives for the adoption of privacy-enhancing technologies and interoperable data systems, including the use of federated learning and blockchain for secure data sharing.
  • Public trust and engagement: public trust in data systems must be a priority. Policy makers should encourage the development of transparent governance frameworks that involve citizens in decision-making processes and ensure public accountability in the use of health data.

Guidelines for practitioners and researchers

Health care practitioners and researchers should focus on the practical implementation of new technologies and frameworks. We recommend:

  • Adopting best practices for data sharing: health care institutions should adopt best practices for data sharing that prioritize data security, interoperability, and patient consent, while supporting the reuse of anonymized and pseudonymized data for research, policy, and social good. These practices can be informed by pilot projects and case studies that have successfully integrated innovative technologies such as machine learning and federated learning into health systems.
  • Collaborative research initiatives: researchers should pursue collaborative initiatives that bring together stakeholders from academia, industry, and government to cocreate solutions. These collaborations will be critical for testing the feasibility and effectiveness of emerging technologies in real-world settings.
  • Focus on equity and inclusion: researchers and practitioners must ensure that the benefits of data-for-health ecosystems are equitably distributed. This includes prioritizing access to technology for underrepresented populations and ensuring that governance models include mechanisms for inclusive participation.

Perspectives on the risks and opportunities within a data-for-health ecosystem vary based on the vantage points of government agencies, private entities, and individual citizens. Balancing top-down regulation with bottom-up citizen engagement is crucial for enabling data generation, usage, and sharing while safeguarding individuals. Examining successful practices, such as the NHS’s Data Utility framework, and the case studies (Textbox 2) highlighted in this paper can inform the development of adaptable models across countries.

Textbox 2. Case studies in data-for-health ecosystems.

Successful interoperability: The COVID-19 vaccination rollout in Israel [Rosen B, Hartal M, Waitzberg R. The Israeli health system's rapid responses during the COVID-19 pandemic. Isr J Health Policy Res. 2024;13(1):11. [CrossRef] [Medline]41]

Israel’s national health infrastructure enabled the rapid, seamless integration of vaccination data across health care providers during the COVID-19 pandemic. Interoperability between public health databases and electronic medical records allowed real-time data sharing, which facilitated efficient tracking, resource allocation, and public health decision-making. This success was driven by a well-established national health database and the effective use of application programming interfaces (APIs) that allowed data exchange between different systems. The case highlights how clear governance and technical standards can result in radical interoperability and improve public health outcomes.

  • Key factors for success:A centralized national database, clear data-sharing protocols, and government-driven API adoption.
  • Lessons learned:This case aligns with our proposed interoperability suggestions by demonstrating the value of standardized APIs and centralized databases in promoting real-time data sharing.

Interoperability challenges: Data sharing during the Ebola Epidemic (West Africa, 2014-2016) [Jalloh MF, Sengeh P, James N, Bah S, Jalloh MB, Owen K, et al. Integrated digital system for community engagement and community-based surveillance during the 2014-2016 Ebola outbreak in Sierra Leone: lessons for future health emergencies. BMJ Glob Health. 2020;5(12):e003936. [FREE Full text] [CrossRef] [Medline]42]

During the 2014-2016 Ebola epidemic in West Africa, the lack of interoperability between different health care systems and organizations severely hindered real-time data sharing and coordination among international health agencies. Various platforms were used to track the epidemic, but without standardized protocols, the response was delayed, costing lives. This case demonstrates the dangers of fragmented data systems during public health emergencies.

  • Key factors for failure:Lack of standardized data-sharing protocols, limited use of APIs, and a fragmented health infrastructure.
  • Lessons learned:The case reinforces the importance of adopting standardized interoperability solutions, such as APIs, to enable real-time data sharing across countries and organizations.

Governance challenges:The failure of the National Health Service Care.data program (United Kingdom) [Group BMJP. Problems with care.data and other stories. BMJ. 2015;351:h4613. [CrossRef]43]

The NHS Care.data initiative, launched in 2014, intended to pool health data across general practices to improve patient care and medical research. However, the initiative faced severe backlash due to inadequate communication, a lack of clear consent mechanisms, and fears over data commodification. Concerns about patient privacy and the potential for data misuse led to public outcry, and the program was ultimately abandoned in 2016.

  • Key factors for failure:Poor communication, lack of transparent consent processes, and public distrust.
  • Lessons learned:This case emphasizes the need for clear, dynamic consent mechanisms and transparent governance practices. It also underlines the importance of trust-building initiatives before embarking on large-scale health data programs.

Citizen engagement:The Understanding Patient Data Initiative (United Kingdom) [Hopkins H, Kinsella S, Mil A, Mil HV. Foundations of fairness: views on uses of NHS patients' data and NHS operational data. Understanding Patient Data. URL: https:/​/understandingpatientdata.​org.uk/​sites/​default/​files/​2020-03/​Foundations%20of%20Fairness%20-%20Full%20Research%20Report.​pdf [accessed 2020-02-13] 44]

The Understanding Patient Data initiative in the United Kingdom is an example of effective citizen engagement in data governance. By involving the public in cocreation workshops, the initiative has fostered dialogue on how patient data should be used, addressing concerns about privacy and data ownership. Through transparency reports and educational programs, this initiative has built a stronger foundation of trust and enabled more meaningful public participation in decision-making about health data.

  • Key factors for success:Transparent communication, active public involvement, and education initiatives.
  • Lessons learned:This case supports our proposed citizen engagement advocacy, particularly the importance of transparency and cocreation in building trust in data governance.

The innovative deployment of transformative technologies in dynamic data-for-health ecosystems is a far-reaching goal. However, the international agenda must also focus on equity, access to technology, and IT literacy, embracing the needs of citizens of all ages and in all regions of the world, to avoid a growing digital divide. Furthermore, international norms and standards are required, given the burgeoning role of multinational companies in this space and the vital importance of transboundary data sharing when addressing public health emergencies of international concern.

Global collaboration plays a pivotal role in advancing standardized data-sharing frameworks, ensuring interoperability, and fostering mutual trust among nations. Establishing robust international partnerships can accelerate the adoption of common standards while respecting local contexts and priorities. Such collaboration will also help to harmonize efforts toward equitable access to technology and shared public health goals. Large institutions have a key role in shaping the direction of this ecosystem and steering policies in specific directions. For example, the rapid and ubiquitous growth of efficient data collection tools—which have become an essential part of our lives—prompted the EU to devise the GDPR framework that became law in 2018. Due to the size of the market it covers and the EU’s geopolitical role, it has effectively set the standard for data protection and management worldwide, building on the WHO and European Centre for Disease Prevention and Control European Health information Gateway [WHO European health information at your fingertips. URL: https://gateway.euro.who.int/en/datasets/ [accessed 2025-02-06] 40]. Besides regulations and limitations, governmental bodies can promote and support private endeavors that create digital tools with a privacy-by-design approach. A recent joint statement—the European Health Data Space proposal—from health care stakeholders emphasizes the need for regulated industry access to health data for innovation and societal benefits, recognizing the interconnected nature of the data-for-health ecosystem [Gatpandan N. Stakeholders' consensus response to the proposed European Health Data Space. iHD. 2022. URL: https://www.i-hd.eu/stakeholders-consensus-response-to-the-proposed-european-health-data-space/ [accessed 2023-06-12] 45]. Such initiatives, alongside the recommendations in this paper, offer the potential for effective management of this ecosystem. This is crucial to balance diverse interests and ensure that the value of health data benefits society as a whole.

Acknowledgments

The “Data Enabled Society for Health: Challenges and Opportunities” workshop was supported by the Institute of Healthcare Engineering and the Centre for Digital Public Health in Emergencies, University College London. This workshop created a platform for knowledge exchange across a community of professionals passionate about data for health. We acknowledge the commitment of the organizers and attendees. We would like to extend our gratitude to ChatGPT, an AI language model developed by OpenAI, for its assistance in proofreading and refining the language of this article. The “Data Enabled Society for Health: Challenges and Opportunities” workshop and the paper writing was financially supported by the Institute of Healthcare Engineering, University College London.

Authors' Contributions

LL, EB, SL, SLR, AC, and PK contributed to conceptualization and writing – original draft. RS, NM, SE, MS, JW, IY, GR, BG, and VM assisted with data curation and writing – review & editing.

Conflicts of Interest

None declared.

  1. Oliveira MIF, Lóscio BF. What is a data ecosystem. 2018. Presented at: dg.o '18: Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age; June 1, 2018; New York, NY. [CrossRef]
  2. Brewer HR, Hirst Y, Chadeau-Hyam M, Johnson E, Sundar S, Flanagan JM. Association between purchase of over-the-counter medications and ovarian cancer diagnosis in the cancer loyalty card study (CLOCS): observational case-control study. JMIR Public Health Surveill. 2023;9:e41762. [CrossRef] [Medline]
  3. Salathé M. Digital epidemiology: what is it, and where is it going? Life Sci Soc Policy. 2018;14(1):1. [CrossRef] [Medline]
  4. Gardy JL, Loman NJ. Towards a genomics-informed, real-time, global pathogen surveillance system. Nat Rev Genet. 2018;19(1):9-20. [CrossRef] [Medline]
  5. Gonzalez L, Hsu YC, Poole J, Rae B, Rutherford I. A world that counts: mobilising the data revolution for sustainable development. UN Data Revolution. 2014. URL: https://www.undatarevolution.org/wp-content/uploads/2014/11/A-World-That-Counts.pdf [accessed 2025-02-22]
  6. Li L, Novillo-Ortiz D, Azzopardi-Muscat N, Kostkova P. Digital data sources and their impact on people's health: a systematic review of systematic reviews. Front Public Health. 2021;9:645260. [FREE Full text] [CrossRef]
  7. Chin YC, Zhao J. Governing cross-border data flows: international trade agreements and their limits. Laws. 2022;11(4):63. [CrossRef]
  8. Hummel P, Braun M, Tretter M, Dabrock P. Data sovereignty: a review. Big Data Soc. 2021;8(1):205395172098201. [CrossRef]
  9. Towett G, Snead RS, Grigoryan K, Marczika J. Geographical and practical challenges in the implementation of digital health passports for cross-border COVID-19 pandemic management: a narrative review and framework for solutions. Global Health. 2023;19(1):98. [CrossRef] [Medline]
  10. Wilkinson MD, Dumontier M, Aalbersberg IJJ, Appleton G, Axton M, Baak A, et al. The FAIR guiding principles for scientific data management and stewardship. Sci Data. 2016;3(1):160018. [FREE Full text] [CrossRef] [Medline]
  11. Salama C. First lessons from government evaluations of COVID-19 responses: a synthesis. OECD. 2022. URL: https://tinyurl.com/yecuc8j9 [accessed 2023-06-12]
  12. Roberts S, Kostkova P. Disease surveillance, digital futures, and data-sharing in a world 'after' COVID-19. Global Policy Journal. 2021. URL: https:/​/www.​globalpolicyjournal.com/​articles/​science-and-technology/​disease-surveillance-digital-futures-and-data-sharing-world-after [accessed 2023-06-12]
  13. Li L, Aldosery A, Vitiugin F, Nathan N, Novillo-Ortiz D, Castillo C, et al. The response of governments and public health agencies to COVID-19 pandemics on social media: a multi-country analysis of Twitter discourse. Front Public Health. 2021;9:716333. [FREE Full text] [CrossRef] [Medline]
  14. Taylor MJ, Wilson J. Reasonable expectations of privacy and disclosure of health data. Med Law Rev. 2019;27(3):432-460. [FREE Full text] [CrossRef] [Medline]
  15. Kostkova P, Brewer H, de Lusignan S, Fottrell E, Goldacre B, Hart G, et al. Who owns the data? Open data for healthcare. Front Public Health. 2016;4:7. [FREE Full text] [CrossRef] [Medline]
  16. Carter P, Laurie GT, Dixon-Woods M. The social licence for research: why care.data ran into trouble. J Med Ethics. 2015;41(5):404-409. [FREE Full text] [CrossRef] [Medline]
  17. Wise J. Covid-19: UK drops its own contact tracing app to switch to apple and google model. BMJ. 2020;369:m2472. [CrossRef] [Medline]
  18. Hollestelle MJ, van der Graaf R, Sturkenboom MC, van Delden JJ. Stimulating solidarity to improve knowledge on medications used during pregnancy: a contribution from the ConcePTION project. BMC Med Ethics. 2023;24(1):44. [FREE Full text] [CrossRef] [Medline]
  19. Tumulak A, Tin J, Keshavjee K. Towards a Unified framework for information and interoperability governance. The Role of Digital Health Policy and Leadership. 2024:49-53. [CrossRef]
  20. Data policy for times of crisis facilitated by Open Science. CODATA. URL: https://codata.org/initiatives/data-policy/dptc/ [accessed 2024-09-18]
  21. Sinnenberg L, Buttenheim AM, Padrez K, Mancheno C, Ungar L, Merchant RM. Twitter as a tool for health research: a systematic review. Am J Public Health. 2017;107(1):e1-e8. [CrossRef] [Medline]
  22. Ledford H. Researchers scramble as Twitter plans to end free data access. Nature. 2023;614(7949):602-603. [CrossRef] [Medline]
  23. Murtfeldt R, Alterman N, Kahveci I, West JD. RIP Twitter API: a eulogy to its vast research contributions. arXiv:2404.07340. 2024. [CrossRef]
  24. Alexopoulos C, Charalabidis Y, Loutsaris MA, Lachana Z. How blockchain technology changes government: a systematic analysis of applications. Int J Public Adm Digit Age. 2021;8(1):1-20. [CrossRef]
  25. Powles J, Hodson H. Google DeepMind and healthcare in an age of algorithms. Health Technol (Berl). 2017;7(4):351-367. [FREE Full text] [CrossRef] [Medline]
  26. Darzidehkalani E, Ghasemi-Rad M, van Ooijen PMA. Federated learning in medical imaging: part I: toward multicentral health care ecosystems. J Am Coll Radiol. 2022;19(8):969-974. [FREE Full text] [CrossRef] [Medline]
  27. Jo U, Oktian YE, Kim D, Oh S, Lee H, Kim H. A zero-knowledge-range-proof-based privacy-preserving blockchain platform for COVID-19 contact tracing. 2022. Presented at: International Conference on Platform Technology and Service (PlatCon); August 24, 2022:53-58; Jeju, Korea. [CrossRef]
  28. Li Y, Gunasekeran DV, RaviChandran N, Tan TF, Ong JCL, Thirunavukarasu AJ, et al. The next generation of healthcare ecosystem in the metaverse. Biomed J. 2024;47(3):100679. [FREE Full text] [CrossRef] [Medline]
  29. Vassilakopoulou P, Hustad E. Bridging digital divides: a literature review and research agenda for information systems research. Inf Syst Front. 2023;25(3):955-969. [FREE Full text] [CrossRef] [Medline]
  30. Bharti N, O'Donovan C, Smallman M, Wilson J. Public trust, deliberative engagement and health data projects: beyond legal provisions. Engag Sci Technol Soc. 2021;7(1):125-133. [CrossRef]
  31. Borda A, Molnar A, Neesham C, Kostkova P. Ethical issues in AI-enabled disease surveillance: perspectives from global health. Appl Sci. 2022;12(8):3890. [CrossRef]
  32. Understanding Patient Data. 2023. URL: https://understandingpatientdata.org.uk/about-us [accessed 2023-04-17]
  33. Asher R, Jesper-Mir E. Why do you need my data? A co-creative approach to enriching public dialogues around health data. National Co-ordinating Centre for Public Engagement. 2022. URL: https:/​/www.​publicengagement.ac.uk/​whats-new/​blog/​why-do-you-need-my-data-co-creative-approach-enriching-public-dialogues-around-health [accessed 2023-07-26]
  34. Scott K, Burall S, Perrin N, Shelton P, White D, Irvine G. Data for Public Benefit Balancing the risks and benefits of data sharing. Understanding Patient Data. 2018. URL: https://understandingpatientdata.org.uk/sites/default/files/2018-04/Data%20for%20public%20good_0.pdf [accessed 2017-05-28]
  35. Ansell C, Gash A. Collaborative governance in theory and practice. J Public Adm Res Theory. 2008;18(4):543-571. [CrossRef]
  36. Kaner S. Facilitator's Guide to Participatory Decision-Making. San Francisco, CA. John Wiley & Sons; 2014.
  37. Our Future Health. The UK's largest health research programme. 2024. URL: https://study.ourfuturehealth.org.uk/welcome [accessed 2024-09-18]
  38. HL7 Version 2 Product Suite. HL7 International. URL: https://www.hl7.org/implement/standards/product_brief.cfm?product_id=185 [accessed 2025-02-06]
  39. ICD-10 Version. 2019. URL: https://icd.who.int/browse10/2019/en [accessed 2025-02-06]
  40. WHO European health information at your fingertips. URL: https://gateway.euro.who.int/en/datasets/ [accessed 2025-02-06]
  41. Rosen B, Hartal M, Waitzberg R. The Israeli health system's rapid responses during the COVID-19 pandemic. Isr J Health Policy Res. 2024;13(1):11. [CrossRef] [Medline]
  42. Jalloh MF, Sengeh P, James N, Bah S, Jalloh MB, Owen K, et al. Integrated digital system for community engagement and community-based surveillance during the 2014-2016 Ebola outbreak in Sierra Leone: lessons for future health emergencies. BMJ Glob Health. 2020;5(12):e003936. [FREE Full text] [CrossRef] [Medline]
  43. Group BMJP. Problems with care.data and other stories. BMJ. 2015;351:h4613. [CrossRef]
  44. Hopkins H, Kinsella S, Mil A, Mil HV. Foundations of fairness: views on uses of NHS patients' data and NHS operational data. Understanding Patient Data. URL: https:/​/understandingpatientdata.​org.uk/​sites/​default/​files/​2020-03/​Foundations%20of%20Fairness%20-%20Full%20Research%20Report.​pdf [accessed 2020-02-13]
  45. Gatpandan N. Stakeholders' consensus response to the proposed European Health Data Space. iHD. 2022. URL: https://www.i-hd.eu/stakeholders-consensus-response-to-the-proposed-european-health-data-space/ [accessed 2023-06-12]


API: application programming interface
EIOS: Epidemic Intelligence from Open Sources
EU: European Union
GDPR: General Data Protection Regulation
IoT: Internet of Things
ML: machine learning
NHS: National Health Service
UNESCO-CODATA: United Nations Educational, Scientific and Cultural Organization - Committee on Data for Science and Technology
UPD: Understanding Patient Data
WHO: World Health Organization


Edited by A Mavragani; submitted 08.02.24; peer-reviewed by J Lv, R van Der Graaf, M Toma; comments to author 03.06.24; revised version received 24.09.24; accepted 18.02.25; published 25.03.25.

Copyright

©Lan Li, Emma Back, Suna Lee, Rebecca Shipley, Néo Mapitse, Stefan Elbe, Melanie Smallman, James Wilson, Ifat Yasin, Geraint Rees, Ben Gordon, Virginia Murray, Stephen L Roberts, Anna Cupani, Patty Kostkova. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 25.03.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.