Viewpoint
Abstract
Registries have long been a cornerstone of medical research and public health, providing systematically collected data on diseases, treatments, and health outcomes. However, in the era of digital health, we argue that the traditional model of stand-alone registries needs reconsideration, given the context of increasingly digitized and interoperable health data ecosystems. Unless registries evolve to embrace embedded, standards-based data services, operating across interoperable infrastructure, they will become obsolete while digitalization is reshaping how data can be collected, shared, and used. In this viewpoint, we recount how the present health data ecosystem came to be and what role registries have come to play therein. Following that, we show how recent regulatory initiatives such as the Trusted Exchange Framework and Common Agreement in the United States or the European Health Data Space Regulation signal a shift toward cross-network health information exchange, promoting patient-centric data integration within electronic health record systems. We further illustrate how electronic health records are consequently set to evolve into information hubs, acting as the primary gateway for individuals through which they may access and control their personal health data spread throughout increasingly connected health data ecosystems. This, in turn, might stimulate the creation of digital twins and continuous learning health systems in practice. Following this line of thought, we discuss the opportunities and challenges of interconnected health data ecosystems. Ultimately, we propose that next-generation registries need to be designed as dynamic, service-oriented software stacks for research, leveraging the common data infrastructures that are currently being established around the world. Given the points raised in this viewpoint, we invite health care professionals and researchers alike to equally rethink the role that registries should play within the globally emerging interconnected health data ecosystems and contribute their findings. References included in this viewpoint were identified through searches of PubMed and Google Scholar with various search terms and combinations thereof pertinent to the topics touched on, for example, “patient registry,” “clinical registry,” “digital twin,” “healthcare,” “clinical research,” “virtual twin,” “TEFCA,” or “EHDS.” Only papers in English were reviewed. The final reference list was generated on the basis of originality and relevance to the broad scope of topics covered in this viewpoint, aiming to present a balanced overview of topic-related findings and arguments.
J Med Internet Res 2026;28:e92696doi:10.2196/92696
Keywords
Introduction
Registries have been instrumental in medical research for over 150 years []. By design, all registries share a common feature: they systematically collect uniform data on predefined populations to address specific clinical, epidemiological, or operational questions []. Analysts use registry data, for instance, to track the natural history of diseases, evaluate treatment effectiveness, monitor safety, improve quality of care, or conduct public health surveillance []. In the predigital era and well into the late 20th century, registries primarily relied on de novo data entry dedicated to the registry’s purpose, frequently in parallel with routine clinical record-keeping.
Over the past 2 decades, patient registries have gained renewed prominence as a key source of real-world data (RWD) for medical and health research []. Even though multiple, slightly differing definitions exist [], data at present commonly qualify as RWD if their collection is not subject to any rules or conditions outside the standard care setting [-]. For example, the US Food and Drug Administration broadly defines RWD as “data relating to patient health status and/or the delivery of health care routinely collected from a variety of sources,” and considers real-world evidence to be the clinical insights derived from analysis of RWD.
RWD use has long been advocated in medical research to bridge the efficacy-effectiveness gap, but has experienced a remarkable surge in recent years [,]. The COVID-19 pandemic further underscored the need for timely RWD, as policymakers required rapid evidence for decision-making during the crisis []. Health technology assessment (HTA) bodies and regulators now increasingly accept and sometimes even request real-world evidence to inform decisions on the effectiveness, safety, and cost-effectiveness of interventions [,].
Notably, HTA bodies regard registries as one of the most important RWD sources. In a survey conducted in 2020, 19 of 22 HTA organizations across 21 countries indicated that they accept registry data for decision-making—the highest acceptance among RWD sources studied []. Registries’ structured nature and curated data make them appealing for research. However, the same survey revealed a paradox: although registries were deemed the most acceptable RWD source, respondents—when asked to rank barriers to RWD acceptance on a scale from 1 (most important) to 9 (least important)—noted that registries often lack important variables, limiting their usefulness (average=4.2) []. This highlights the fact that registries in practice often still fail to appropriately capture the full spectrum of data needed for today’s health questions. Respondents further identified the lack of necessary data sources (average=3.3) and policy structures/information governance (average=3.5) as the top 2 barriers to using RWD.
Indeed, the volume and variety of health data have exploded in the digital age []. Nearly every facet of patient care and daily life can generate health-relevant data, from entries in electronic health records (EHRs) and laboratory results to smartphone sensor readings and billing accounts [,]. This wealth of data holds enormous potential for insights, but also challenges the traditional paradigm where registries are established as stand-alone data repositories with a predefined structure of fixed tabular fields, but with no or only a few connections to other data sources in use for data collection. Modern health data, however, goes beyond that and nowadays include unstructured notes, images, device telemetry, and more. Restricting analyses to a registry’s predefined dataset can mean losing granularity and context that reside in these other data modalities [], which is why the recurrent use of fixed electronic case report forms for collecting data in registries deserves reconsideration. Even more so, considering recent regulatory initiatives such as the US Trusted Exchange Framework and Common Agreement or the European Health Data Space Regulation that have been designed to facilitate the transition from disconnected data sources to a “network of networks” for cross-network health information exchange (HIE).
Given the survey results cited above and based on our own personal experience, we believe that researchers and practitioners urgently need to rethink the role of registries in health care and health research. Other authors have even called for the formalization of “Registry Science” as a new research discipline []. This viewpoint aims to synthesize topics and scientific findings surrounding health data use to propose a way forward. It accordingly describes the evolution of health data ecosystems and the role that registries have come to play therein. It outlines how digitalization and associated major changes in legislation and regulation pertaining to health data use in the European Union and the United States are transforming these ecosystems. Although the digital twin (DT) concept as such has been widely discussed in the literature in recent years with regard to health care, existing DT applications in practice remain scarce and limited in scope, as far as we are aware. To our knowledge, existing papers have also not yet discussed the implementation of DTs for health care against the backdrop of the aforementioned recent legislative and regulatory changes. With the publication of this viewpoint, we aim to connect the dots and intend to show why and how recent regulatory changes may stimulate the creation and use of DTs in clinical research and care. Eventually, we discuss the opportunities and challenges of the globally emerging interconnected health data ecosystems, based on a narrative review of pertinent literature. In doing so, we hope to inspire more research that supports the modernization of registries while contributing our own personal vision for their future.
Health Data Are Everywhere
Once IT entered the health sector, professional health data stakeholders such as health care providers or payers eagerly used it to convert paper records to electronic form and digitized their record-keeping processes, but while doing so, largely replicated old workflows in digital format []. True digitalization—meaning redesigning processes to fully exploit digital technology []—has long lagged behind []. As long as fast, stable, and affordable internet connections were still long out of sight, large-scale connectivity and data sharing enabling efficient secondary use of data were not a priority yet. Neither for users and developers, nor for policymakers and regulators. This has left us with a legacy of noninteroperable systems, each a “lock-in” environment with proprietary designs and architectures tied to specific vendors [,].
Nowadays, computing power, storage capacity, and network bandwidth are abundant and ever-increasing, marking the maturation and convergence of social, mobile, analytics, and cloud technologies []. Global internet and social media use, as well as adoption of mobile and smartphones, continue to grow, passing the 5 billion user threshold of the global population on all accounts today []. Meanwhile, the global volume of data is exploding exponentially on the order of zettabytes, and the health sector is a major contributor to this big data revolution [,]. Notably, most of these data are RWD [,,], potentially bearing health-relevant insights ( [,,,,]).

provides a nonexhaustive overview of RWD sources, arranged along a continuum from data typically considered directly health-relevant to data that may become health-relevant when linked with other information. Data may be compiled at the individual or population level depending on the purpose of analysis. For public health reporting, the target population is often defined from a political-geographic perspective, ranging from regional to national to global level data. Other parameters, such as age or income group, can also be used to define the data range and scope for a certain population under investigation.
Not all data are equally helpful for answering health-related questions. For instance, one can distinguish between (1) direct health data, which are generated in the process of health care delivery or health research to answer questions with explicit health relevance (eg, laboratory data), (2) indirect health-relevant data, which are created outside a health care delivery or research context, but that could still be used to answer a question with explicit health relevance (eg, activity tracking via fitness apps), and (3) potentially health-relevant data, which may not seem health-relevant just by themselves, but have the potential to answer a health-relevant question if linked with other data (eg, purchasing data or posts on social media) []. Indirect health-relevant data and potentially health-relevant data have received increasing attention in the wake of personalized medicine and precision medicine, fueled by the fast-progressing development and use of artificial intelligence for data-driven health applications worldwide [-].
Technological progress has not only considerably expanded the range of RWD that can be deemed health-relevant but has also increased the number of instances in which data are generated in either a personal or professional use context ( [,,,,,,]).

In broad terms, professional stakeholders generate/collect, store, and process data either for health research or health care delivery purposes, while private individuals (private end consumers) generate/collect, store, and process data for all kinds of personal purposes, which may be health-related, but do not necessarily have to be. The clear separation between the 2 major professional use purposes in highlights the historic divergent evolution of separate information systems for one purpose or the other [], which reinforces interoperability issues we are still facing today. Secondary use overall remains a challenge, as consumer or patient-generated data originating from personal use also are not easily accessible for professional use and vice versa. But no matter where the data resides, who holds or owns it, data that can be helpful for answering a health-related question is data that can be linked to a certain individual. The individual is the point of reference, which is why we have placed in the center of all data sources.
provides a more detailed characterization of RWD sources, while extends the overview of source categories depicted in .
| Unsupervised data sources (self-managed) | Supervised data sources (professionally managed) | |
| Primary use purpose |
|
|
| Examples for primary use |
|
|
| Data review/quality check |
|
|
| Health relevance of data included in these sources |
|
|
Unsupervised real-world data (RWD) sources (self-managed)
- Personal use
- Internet and social media
- Online communities
- Blog and vlogs
- Twitter, Facebook, Instagram, TikTok, YouTube, etc.
- Digital health and wellness apps
- Trackers and diaries (eg, habits and activities, symptoms, and menstruation)
- Motoric and cognitive tests
- Surveys and interviews
- Educational content (eg, videos, quizzes, and information texts)
- Digital apps, however, can be equipped with technical features that allow for the automatic check of data entry. For this reason, digital health apps that have been approved as a medical device or apps whose use is overseen by health care professionals (eg, app use as part of a study) would constitute supervised sources.
- Internet and social media
Supervised RWD sources (professionally managed)
- Research
- Noninterventional studies (observational studies)
- Cross-sectional studies
- Case-control studies
- Cohort studies
- Pragmatic trials
- Surveys and interviews
- Health and patient surveys
- Interviews and focus group discussions
- Registries
- Patient registries
- Disease registries
- Health service registries
- Literature and statistics
- Scientific journals and publications (preferably peer-reviewed)
- Health figures and recommendations published by professional associations, patient organizations, governments, and nongovernmental organizations
- Noninterventional studies (observational studies)
- Health care delivery
- Clinical, administrative, and business records
- Electronic health records and medical records
- Insurance claims and billing datasets
- Employment records
- Case notes and medication orders
- Admission/discharge and progress reports
- Medication administration records (dose, route, and National Drug Code/RxNorm codes)
- Point of sale records (prescriptions including refills and over-the-counter drugs, health and wellness goods/services)
- Tax records
- Clinical, administrative, and business records
In conclusion, present health data ecosystems encompass a vast array of stakeholders that use various digital devices and applications to generate, collect, process, and store more data of varying health-relevance for either personal or professional health-related use purposes than ever before, while health data sharing remains complicated [-].
The Rise of Interoperability Initiatives
There is a growing consensus among stakeholders that data should be collected once, and used many times, provided proper governance and consent are in place (eg, the summary of research survey results from multiple countries in [] and the scoping review by Alam et al []). This requires integrated architectures through which data from multiple sources can be accessed and reused for multiple purposes. Along these lines, the concept of the learning health system (LHS), sometimes more narrowly conceptualized as a learning health care system [], has gained traction over the past 2 decades, with stakeholders envisioning a cycle where practice and research through large-scale data integration mutually inform each other and thereby induce a cycle of improvement [,]. In parallel, demands for greater individual autonomy in terms of data access and control have been articulated [], accelerating efforts to integrate data collection across platforms.
Over the past decade, many attempts to link EHRs and registries have been made, accompanied by a multitude of studies investigating the extent to which EHR systems could be adopted and used for research purposes [,-]. But any such integration of information systems has typically been tailored to a specific use case or a specific pair of systems again. As a result, the broader interoperability problem has yet to be resolved: numerous custom pipelines have been built, but each is limited in scope (eg, linking 1 hospital’s EHR to 1 specialty registry). HIE takes place over many point-to-point connections rather than an open network.
In the United States, following the introduction of the Health Information Technology for Economic and Clinical Health Act in 2009, numerous vendor-, state-, and consortium-led efforts, while aiming to resolve this, have produced a “patchy landscape” of health information networks with divergent architectures and standards []. Other countries, too, have taken actions to transform their HIE infrastructures and processes, but the level of HIE maturity still varies widely, even within countries that already exhibit a comparably high level of digitalization (for an overview of different countries, see [-]). More recently, however, 2 regulatory initiatives have attracted particular attention because of their scope and expected impact.
In 2022, the 21st Century Cures Act’s Trusted Exchange Framework and Common Agreement (TEFCA) was introduced to create a nationwide “network of networks” for HIE within the United States. In the same year, the European Health Data Space (EHDS) Regulation was proposed, which likewise sets out a legal and technical framework for cross-border HIE and use of health data across all member states of the European Union []. The EHDS entered into force on March 26, 2025. Like TEFCA, it reinforces data portability to ensure individual data access and control, which is crucial for building dynamic, composite EHRs. presents a comparison of the core characteristics of both initiatives.
| TEFCAa [] | EHDSb [] | |
| Scope of application |
|
|
| Standardization |
|
|
| Infrastructure |
|
|
| Data exchange purposes |
|
|
| Enforcement |
|
|
aTEFCA: Trusted Exchange Framework and Common Agreement.
bEHDS: European Health Data Space.
cRCE: Recognized Coordinating Entity.
dHL7 FHIR: Health Level 7 Fast Healthcare Interoperability Resources.
In summary, policy efforts globally are rapidly aligning to promote data sharing, reuse, and individual data sovereignty, while implementation must still navigate issues of data protection and sovereignty across different jurisdictions [,].
Large-Scale Data Integration Enables the Creation of DTs
By design (), both TEFCA and the EHDS Regulation now inherently position EHRs as the primary gateway to the health data ecosystem. Both grant private individuals the right to aggregate their health data from different data holders through any compliant application of their choice. Comprehensive data access and control rights enable private individuals to actively participate in the use of their data and steer data flows, which might also provide an extra incentive for them to donate their data for research.
When various types of health-related RWD () are combined with reference to the individual, the result is sometimes referred to as medical DT or health DT [,]. A DT is a digital representation that mirrors an individual’s health status in real time, constructed from all available data streams []. As EHR vendors adapt their systems to keep up with regulatory requirements and stay competitive, the EHR as we know it is likely to evolve into an interconnected data hub. From the perspective of the private individual, the next-generation EHR is set to become their personal data trove. As such, it can be viewed as the “data backbone” for the individual’s DT. Controlled by the individual, it locates and assembles the data that can be fed into mechanistic or statistical models that, for instance, may simulate disease trajectories or predict treatment responses. If such models are directly integrated into EHR software applications, which can be rerun any time new data are added or become known to the EHR (ie, direct data entry into the EHR or reference to updated data in connected sources), the EHR basically becomes the DT of that individual (lower-right part of []).

Access to a common data infrastructure enables large-scale data integration. This enables the generation of comprehensive longitudinal EHRs. For instance, all factors that influence a disease can be described by networks (multilayer network construction). Using personal measures from the individual’s EHR as input for these networks creates the individual’s DT []. Large-scale data integration across many individuals can then give rise to 2 distinct DT population types: virtual cohorts consisting of the data twins of real people and virtual cohorts of computer-generated “twins,” that is, data from real people is used to generate synthetic data representations of “as-good-as-real” individuals but are not associated with a real living human being. The latter type of cohort, more precisely, is rather a potential application of digital twins that is made possible by leveraging data from real humans to build or improve mechanistic data and process models of different scales and dimensions (eg, organs and organ systems, cellular interactions) with the help of artificial intelligence and use insights gained from these models to create new synthetic data representations []. These representations, however, do not constitute actual DTs in strict terms [-].
In an LHS, DT cohorts—both “real” and synthetic ones—are invaluable (upper-right part of []). The former type of cohorts allows researchers and clinicians to study real-world complexity, diversity, and longitudinal care patterns, while the latter is particularly useful for drug development, for example, when recruitment of a sufficiently large number of real patients is difficult []. Marrying the DT concept with population health, one can imagine continuously learning “virtual trials” or simulations that update as new data streams in. This is a sharp departure from the static, periodic data capture of traditional registries.
In traditional registries, data are collected at set intervals or only during certain health care encounters and often with a narrow scope defined by the registry protocol. In contrast, a DT approach can incorporate layers of data (clinical, behavioral, environmental, etc) in near real time [,-,,]. The ultimate goal, as some authors have articulated, is to move from surveillance to improving clinical care in real time by integrating these rich data layers in a big data health information system ([,,]; ie, an LHS).
If every individual can decide at any time to make their data accessible for analyses on demand, the registry as a separate entity may not need to collect data in the traditional sense anymore. Instead, it would be the registry’s job to assemble relevant data from the network upon provision of a cohort’s definition. Instead of establishing a new registry and recruiting patients into it, researchers could simply obtain consent from patients to query their DT’s data through the network for specific analyses using the registry’s services if individuals have agreed to be contacted for this purpose. Likewise, individuals should be able to discover trials or studies fitting their individual profile via their EHR interface, making use of the common infrastructure to locate potential matches in connected registries.
Essentially, next-generation registries need to have a software architecture that allows researchers to aggregate data on the fly via queries to a distributed network and contact individuals if needed, for instance, to ask them to provide extra data or invite them to participate in a clinical trial. For example, rather than maintaining a dedicated registry for a disease like multiple sclerosis (MS) and asking clinicians to enter data into it, a future MS registry should be able to identify all patients with MS across EHRs (using standardized definitions and codes) and then retrieve their pertinent data (laboratory data, multiomics, functional test data, imaging data, etc) from various connected sources (digital health apps, insurance claims and billing datasets, medical records, etc) as allowed. On top of this, machine learning models could be used to generate predictive insights (eg, disease progression forecasts) from these integrated datasets, which then could be fed back into care via clinical decision support. In fact, several pilot projects have already explored DTs of various types (eg, DTs of organs such as the lung, the heart, and the spine; DTs of diseases such as Alzheimer or type 2 diabetes) [,]. Such models thrive on integrating multimodal data, reinforcing the need for interoperability and propelling the uptake of regulatory initiatives such as TEFCA or the EHDS regulation.
The implementation of these initiatives flips the traditional script: private individuals control their data and decide how to share it, and professional stakeholders assemble the data dynamically as needed and permitted to conduct analyses for their own use or as a service for other stakeholders (including private consumers). The ongoing evolution from a fragmented landscape of siloed information systems to interconnected health data ecosystems represents a paradigm shift in how we approach clinical research, public health surveillance, and health care delivery.
Opportunities in Interconnected Health Data Ecosystems
Individual Data Access and Control
Private individuals are gaining the technical means and legal rights to access and consolidate their health information, effectively dissolving the historical monopoly that professional data stakeholders have held over data access [,]. This democratization of data could catalyze greater patient engagement in care and research while spurring innovation in digital health technologies that harness comprehensive data. Community-driven data-sharing platforms such as PatientsLikeMe showcase the potential.
Better Care and Workload Relief
Large-scale data integration can provide more immediate insights for decision support. If physicians can, for instance, query the connected network infrastructure through EHR systems and instantly establish a repository of pooled data to see how similar patients have fared on a certain therapy, they can efficiently leverage knowledge gained from population analyses at the point of care. In this regard, the transition to interconnected health data ecosystems is the prerequisite for achieving the LHS vision where “the process of discovery [is] a natural outgrowth of patient care” []. And if data can be reused effectively such that reentry of data is avoided wherever possible, this also means that quality improvement initiatives and safety surveillance can happen in shorter time intervals than before.
Value-Based Care
A significant driver behind large-scale data integration is the global shift toward value-based care (VBC) [,]. Unlike volume-based care (fee-for-service models that reward quantity of services), VBC ties reimbursement and incentives to outcomes and cost-efficiency, essentially rewarding the “value” (quality of care in terms of health outcomes) delivered to patients [-]. The implementation of VBC requires a shared definition of health outcomes such that qualitative differences in one and the same outcome can be quantified and compared across providers. Michael Porter, who pioneered the VBC concept, emphasized that data must be “organized around the patient, not the department,” and that all information involved in the care cycle should be transparent and accessible to all stakeholders involved in that patient’s care to drive competition for the best care in terms of value []. VBC programs increasingly recognize the importance of patient-reported outcomes and experiences as part of the value equation []. These often come from outside traditional clinical data sources, via surveys or apps. Integrating such data with clinical outcomes requires flexible data collection mechanisms that transcend siloed systems. The LHS approach dovetails with VBC when continuous feedback loops are implemented to contain rising health care costs in aging societies []: as data (including health outcomes and costs incurred in obtaining them) are collected and analyzed, insights can drive changes in practice to improve value, which are then measured in subsequent data, further incentivizing competition for the best value.
Population Health Management
Health care providers are increasingly responsible for managing the health of defined populations under accountable care models. This requires identifying high-risk patients, gaps in care, and social determinants of health—tasks that benefit from pooling data from different sources. Traditional registries might tell you about disease-specific outcomes, but large-scale HIE and data integration can provide a panoramic overview, enabling proactive interventions. For instance, it might reveal that certain neighborhoods have higher rates of emergency visits for asthma, correlating with environmental data.
Health Care Policy Setting
The push for VBC has also spurred multistakeholder engagement in defining what outcomes matter and how to measure them. A recent report by the World Health Organization underscored that many countries are moving to formally involve patients and citizens in valuing care and outcomes []. The concept of value is now being extended beyond clinical measures to include aspects like equity and patient experience [,,], and capturing these again requires tapping into various data sources.
Challenges in Interconnected Health Data Ecosystems
Data Quality and Bias
When integrating diverse data, especially RWD, we must confront what has been called the “big data paradox” []. With massive sample sizes, even tiny biases or errors can lead to statistically significant—but misleading—results []. Common issues include missing data, inconsistent data entry, and documentation biases. Additionally, the absence of evidence in a dataset is not evidence of absence (a condition not documented is not necessarily not present). Advanced data cleaning, provenance tracking, and bias analysis techniques will be needed. The process of data collection itself always has an effect on the resulting dataset. Data recording in the process of health care delivery may be professionally supervised, but supervision itself is still subject to differing conditions (eg, organizational workflows, experience of the supervising professional, and reimbursement policies). It has also been noted that data recorded in the process of health care delivery not only reflect the health of a patient, but also the patient’s interaction with the health system []. Professional supervision of data collection alone is no guarantee for high quality. It may as well introduce additional data biases through the underreporting of adverse events [] or the false application of diagnosis-related groups codes to receive higher reimbursements []. Data collection led by researchers, too, may be prone to biased selection of study participants, misclassification of individuals with regard to their exposures, outcomes, or covariates, or confounding if no adequate safeguards are taken []. Smart design choices in digital health applications, however, can help mitigate errors right at the point of data entry.
Governance and Privacy
In a traditional registry, governance is typically clear: a registry custodian (often a professional society or public health body) has authority over data use and access according to a protocol. In an open network like TEFCA, governance is distributed. Compliance and enforcement across many networks and their participants are challenging. Data sharing at this scale also heightens concerns about breaches or misuse. Once many systems interconnect, the attack surface expands []. Robust security infrastructure, continuous monitoring, and clear accountability when violations occur are essential to maintain trust in the network. The EHDS tries to mitigate that through the establishment of a separate infrastructure for secondary data use and the creation of health data access bodies that act as gatekeepers to that infrastructure.
Transparency and Individual Responsibility
Consent choices need to be presented in a transparent and understandable manner. Standards guiding the implementation of granular consent options in app interfaces [] and novel technologies such as blockchain encryption need to be explored []. Individuals need to be aware of who may access their data and for what purpose. There is also a need for public engagement and communication about the benefits of data sharing. If patients fear their data will be used against them (eg, by insurance companies or employers), they may opt out of sharing, which could reintroduce biases and silos.
Interoperability and Standardization Hurdles
Different health data stakeholders still use varying standards or versions of standards. Fast Healthcare Interoperability Resources (FHIR) is gaining traction, but not all legacy systems support it fully. Semantic interoperability (ensuring that a “hypertension” diagnosis means the same in all systems, or that units of measure are standardized) is an ongoing effort. Expanding the breadth of standardized data will be an iterative process. Moreover, not all data types are equally ready for exchange. Structured tabular data are easier to integrate than unstructured notes or medical images. As we integrate more data from different modalities, new standards and mapping efforts will be needed. In recognition that technology is constantly evolving and that standardization needs to be harmonized, TEFCA will be updated over time, iteratively incorporating consented recommendations from representatives from health data stakeholder groups. The EHDS Regulation similarly mandates the setup of a stakeholder forum and includes a step-by-step timeline for operationalization toward full implementation.
Cultural and Organizational Change
In the past, health data stakeholders have engaged in health data blocking due to commercial protection reasons or liability concerns [,,]. Despite all looming opportunities, professional stakeholders might be wary of patients accessing full records or skeptical of external data quality. There may be resistance to relying on networked data services for research when people are used to the graphical interface and control mechanisms of a dedicated registry. Overcoming resistance to change requires, for example, showing clinicians that using networked data services saves them time, or showing researchers that they can achieve high-quality results faster using such services than by manually assembling data from selected sources. Ensuring that those who traditionally ran registries (often specialty societies or public health departments) still have a vital role—perhaps as stewards of data quality, analytic expertise, or conveners of stakeholders—may also help to reduce resistance.
Data Noise
Traditional registries, by their focused nature, were good at homing in on a specific problem (eg, tracking all patients with a rare disease to understand its course). An interconnected health data ecosystem provides access to a large pool of data, but analysts will need new methodologies and possibly new tools to ensure that, in the vast sea of data, the critical signals (safety issues, treatment effects, etc) can be detected and acted on.
Discussion
Overview
Interoperability initiatives such as TEFCA and the EHDS Regulation signal a globally converging willingness among regulators and policymakers to actively promote HIE among health data stakeholders through the establishment of common rules and standards and the creation of a common technical infrastructure for data sharing.
Taking into account the growing participation in TEFCA within the United States as more Qualified Health Information Networks are designated and the gradual implementation of the EHDS across EU member states, EHR systems are poised to become the major interface through which individuals manage and track their health and encounters with health professionals, gradually turning EHRs into interconnected data hubs. Similarly, we expect that registries will turn into larger IT platforms, equipped with functionalities and a software architecture that enable data pooling and integration from a variety of data sources across a federated data network.
Even though traditional registries failing to adapt might risk becoming obsolete, this should not be misinterpreted as saying that the goals they served are obsolete. On the contrary, common goals—such as understanding diseases, evaluating interventions, monitoring safety, and improving quality—are more important than ever. It is the means of achieving those goals that is changing.
Labkoff et al [] have identified 3 drivers of change with regard to registries: agility—the demand for systems that can be used for multiple use cases; extreme longitudinality—the demand for systems that allow data tracking over more extended periods of time and with greater frequency; and multimodality—the demand for systems that can ingest multiple data types [].
Put differently, next-generation registries are the ones that strive to cater to all 3 of these demands. A 3D matrix can also be used to determine how well a certain registry is able to handle complexity along these 3 dimensions relative to other registries (for an exemplary comparison of various existing registries, see []).
It is also important to recognize that progress along these dimensions will occur at different paces in different contexts, which is also reflected in the step-by-step rollout of EHDS and TEFCA. Some specialized registries in niche domains or low-resource settings may continue to exist separately for some time, especially if the infrastructure or trust needed for integration is not yet in place. During the transition, hybrid models will persist (eg, registries not yet ready to connect to a common infrastructure but are already capable of pulling/pushing data over designated point-to-point connections). Traditional registries will not disappear overnight but will gradually change or be replaced. We imagine a timeline where more and more registries migrate to pooling data across federated network infrastructures, up to the point when starting a registry “from scratch” without leveraging existing data networks becomes a rarity.
The international alignment of policies, rules, and regulations will still require more work and time to fully exploit arising opportunities and overcome accompanying challenges. International organizations and standards bodies play a crucial role in filtering stakeholder voices and finding universal solutions to common problems, which promotes the development of interoperability standards [,]. A comprehensive overview and categorization of relevant interoperability standards can be found in Torab-Miandoab et al [].
In the literature, several levels of interoperability have been defined, but semantic interoperability can be considered the most important one as it ensures that 2 or more systems are able to exchange information with one another and interpret it correctly []. This requires the use of standardized terminology, which enables systems to literally speak the same language, using the same vocabulary []. The accurate mapping of local terms to a specific semantic standard, such as Systematized Nomenclature of Medicine–Clinical Terms (SNOMED CT), however, is labor-intensive and demands expertise. The diversity of human languages that are spoken around the world, along with domain-specific vocabulary and local peculiarities, poses significant challenges in the implementation of such standards []. Different approaches to tackle these challenges have already been proposed and applied with varying success, for instance, the development and application of guidelines for the adoption of SNOMED CT, or the use of large language models in the mapping process [,].
While semantic standards define the meaning of data, transport standards define the structure and format of data that is needed for mutual data exchange. They ensure that data can actually be sent and received []. Without transport standards, data cannot actually move between systems. Building common infrastructures is not possible without implementing corresponding standards, which is why they play a prominent role in the implementation of TEFCA and the EHDS Regulation.
Even though the EHDS Regulation does not explicitly mandate the use of Health Level 7 (HL7) FHIR, it is still globally considered one of the most important transport standards for the electronic exchange of health data [], and HL7 Europe has also published 3 FHIR Implementation Guides especially designed to support the EHDS []. In contrast to the EHDS regulation, TEFCA even directly incorporates it as a key standard. Despite its popularity, many HL7 FHIR resources, however, have not yet reached normative status. This means that many HL7 FHIR resources are still in development and undergoing iterative testing. This implies that their definitions are still subject to change or can even be removed in future versions of the FHIR specification. In the current FHIR specification (version 5.0.0.0), only 15 out of 162 resources are listed as normative []. On one hand, this may complicate the implementation of FHIR in practice. On the other hand, this also allows for more flexibility in its adoption and leaves room for adjustments to changes brought about by new technologies and accompanying changes in user needs.
Ultimately, many countries are tackling similar issues [-]. There is a need for interoperability across borders, especially for research on rare diseases and global health threats. This also highlights the need for joint international initiatives that enable the pooling of resources and provide a forum for discussions and shared decision-making to overcome these issues. The implementation of EHDS, TEFCA, or the Global Initiative on Digital Health by the World Health Organization demonstrates that this has been widely recognized, but implementation is still in the early stages.
This is also a major limitation of the views and ideas we have presented in this article. Future research will be needed to evaluate the real-world impact of these initiatives. Similarly, more research into the design and setup of registries that make use of today’s technological advancements and keep up with user demands in terms of functionalities is needed. The multidimensional rise in complexity of today’s user demands may even justify the formalization of “Registry Science” as a new discipline [].
Conclusion
In this viewpoint, we have aimed to describe the ongoing transformation of global health data ecosystems, considering recent global changes in associated legislation and regulations, which promote a shift toward cross-network HIE of unprecedented scale. For this purpose, we have reviewed ongoing discussions in the literature touching on relevant topics and findings concerning, for instance, the use of information systems in health care and clinical research in general and the use of EHRs and registries in particular; the meaning of digitization vs digitalization; contents of TEFCA and the EHDS Regulation; and the application of the DT concept in the sphere of health. In doing so, we have come to the conclusion that up until now, ongoing discussions have largely failed to acknowledge the inherent transformational power of TEFCA and EHDS concerning present health data ecosystems, the role that EHRs and registries have come to play therein, and, closely connected to that, the relevance of these measures for the wider application of DTs in health care and the establishment of LHSs. With the publication of this viewpoint, while connecting the dots between relevant topics, we aim to close this gap and hope to contribute a new angle to the general scientific discussion of registry use cases and the raison d’être of registries in general. In conclusion, we believe that unless registries evolve to embrace embedded, standards-based data services, operating across interoperable infrastructures, they will become obsolete while digitalization is reshaping how data can be collected, shared, and used.
When it comes to the implementation of recent regulatory initiatives and the rethinking of the role of registries as depicted in this viewpoint, however, we must measure the success not by technology adoption alone, but by outcomes. Does integrating data lead to faster discovery of effective treatments? Does it help eliminate redundant tests and procedures? Does it reduce the burden on patients who previously had to enroll in multiple registries or carry their records from doctor to doctor? Early signs are positive. For instance, integrated data networks have been used to rapidly study medication safety signals by aggregating EHR data from many sites, something that would have taken traditional registries years to accumulate []. There is also the possibility that large-scale data integration can enhance HTAs and surveillance by providing a more complete picture than any single data source could [,,]. To succeed, professional health data stakeholders need to further adopt common data models and application processing interface standards such as FHIR, support harmonization efforts, and dedicate more attention to the role of private individuals as major owners and donors of data. This, for instance, includes patient involvement in the creation of EHR applications and user-centered design of feedback loops in any health data applications. Private individuals, on the other hand, need not only to be made aware of their rights to data access and control, but must also make use of the opportunity, take action, and use those rights.
Acknowledgments
The authors declare that generative artificial intelligence was used for tasks that fall within classification 1 of the taxonomy published by the International Association of Scientific, Technical & Medical Publishers: “refinement, correction, editing, or formatting the manuscript to improve the clarity of language.”
Funding
During the process of manuscript preparation and revision, the authors were partially funded by the German Research Foundation (Deutsche Forschungsgemeinschaft) as part of Germany’s Excellence Strategy (EXC 2050/2 – Project ID 390696704) Cluster of Excellence “Centre for Tactile Internet with Human-in-the-Loop” (CeTI) of TUD Dresden University of Technology.
Authors' Contributions
JW, SG, and TZ contributed to the conceptualization and writing of the manuscript. JW contributed to the investigation. HI contributed to writing the manuscript. JW and IV contributed to the visualization of figures in the manuscript. TZ and SG contributed to supervision. TZ contributed resources for the preparation of the manuscript. All authors were involved in the review and editing of the manuscript.
Conflicts of Interest
HI received speaker honoraria from Roche, Merck, Sandoz, and Novartis and financial support for research activities from Merck, Neuraxpharm, Novartis, Teva, Biogen, and Alexion. SG has or has had consulting relationships with Una Health GmbH, Lindus Health Ltd., Flo Ltd, ICURA ApS, Rock Health Inc, Thymia Ltd, FORUM Institut für Management GmbH, High-Tech Gründerfonds Management GmbH, the European Commission Directorate General for Research & Innovation, Prova Health Ltd, Haleon Plc, Ada Health GmbH, and the Saudi Arabia Food and Drug Authority funded through the United Nations Development Programme, and holds share options in Ada Health GmbH. TZ reports serving on scientific advisory boards and/or providing consulting services for Biogen, Roche, Novartis, BMS, Sanofi, and Merck; remuneration for activities in speaker bureaus for Roche, Neuraxpharm, Novartis, Merck, Novo Nordisk, Sanofi, BMS, Teva, and Biogen; as well as research funding from Biogen, Novartis, Merck, Teva, and Sanofi. TZ holds shares in Roche and Bayer. JW and IV declare that they do not have and have not had any commercial or financial relationships that could be construed as a potential conflict of interest.
References
- Pop B, Fetica B, Blaga ML, Trifa AP, Achimas-Cadariu P, Vlad CI, et al. The role of medical registries, potential applications and limitations. Med Pharm Rep. 2019;92(1):7-14. [FREE Full text] [CrossRef] [Medline]
- Stausberg J, Harkener S, Semler SC. Recent trends in patient registries for health services research. Methods Inf Med. 2021;60(S 01):e1-e8. [FREE Full text] [CrossRef] [Medline]
- Patient registries. European Medicines Agency. 2018. URL: https://www.ema.europa.eu/en/human-regulatory-overview/post-authorisation/patient-registries [accessed 2024-08-29]
- Gliklich RE, Leavy MB, Dreyer NA. Patient registries. Regist eval patient outcomes user's guide internet 4th edition. Rockville (MD). Agency for Healthcare Research and Quality (US); 2020.
- de Pouvourville G, Armoiry X, Lavorel A, Bilbault P, Maugendre P, Bensimon L, et al. Real-world data and evidence in health technology assessment: when are they complementary, substitutes, or the only sources of data compared to clinical trials? Therapie. 2023;78(1):81-94. [CrossRef] [Medline]
- Liu F, Panagiotakos D. Real-world data: a brief review of the methods, applications, challenges and opportunities. BMC Med Res Methodol. Nov 05, 2022;22(1):287. [FREE Full text] [CrossRef] [Medline]
- Dang A. Real-world evidence: a primer. Pharmaceut Med. Jan 2023;37(1):25-36. [FREE Full text] [CrossRef] [Medline]
- Pisa F, Arias A, Bratton E, Salas M, Sultana J. Real world data for rare diseases research: the beginner’s guide to registries. Expert Opinion on Orphan Drugs. 2023;11(1):9-15. [CrossRef]
- Wilson BE, Booth CM. Real-world data: bridging the gap between clinical trials and practice. EClinicalMedicine. 2024;78:102915. [FREE Full text] [CrossRef] [Medline]
- Makady A, Ham R, de Boer A, Hillege H, Klungel O, Goettsch W, et al. GetReal Workpackage 1. Policies for use of real-world data in health technology assessment (HTA): a comparative study of six HTA agencies. Value Health. Apr 2017;20(4):520-532. [FREE Full text] [CrossRef] [Medline]
- Hogervorst MA, Pontén J, Vreman RA, Mantel-Teeuwisse AK, Goettsch WG. Real world data in health technology assessment of complex health technologies. Front Pharmacol. 2022;13:837302. [FREE Full text] [CrossRef] [Medline]
- Hogervorst MA, Vreman RA, Mantel-Teeuwisse AK, Goettsch WG. Reported challenges in health technology assessment of complex health technologies. Value Health. 2022;25(6):992-1001. [FREE Full text] [CrossRef] [Medline]
- Volume of data or information created, captured, copied, and consumed worldwide from 2010 to 2029. Statista. 2021. URL: https://www.statista.com/statistics/871513/worldwide-data-created/ [accessed 2024-08-23]
- Extract Systems. No, healthcare isn't 30% of the world's data. LinkedIn. Jun 24, 2024. URL: https://www.linkedin.com/pulse/healthcare-isnt-30-worlds-data-extract-systems-18jyc [accessed 2024-08-25]
- Health data: a holistic approach to unlock the value of health data. Deloitte. 2023. URL: https://www.deloitte.com/content/dam/assets-zone2/be/en/docs/industries/health-care/be-health-data.pdf [accessed 2024-08-23]
- Nelson EC, Dixon-Woods M, Batalden PB, Homa K, Van Citters AD, Morgan TS, et al. Patient focused registries can improve health, care, and science. BMJ. 2016;354:i3319. [FREE Full text] [CrossRef] [Medline]
- Labkoff SE, Quintana Y, Rozenblit L. Identifying the capabilities for creating next-generation registries: a guide for data leaders and a case for "registry science". J Am Med Inform Assoc. 2024;31(4):1001-1008. [FREE Full text] [CrossRef] [Medline]
- Lippeveld T, Sauerborn R, Bodart C. Design and Implementation of Health Information Systems. Geneva. World Health Organization; 2000:978-992.
- Iyamu I, Xu AXT, Gómez-Ramírez O, Ablona A, Chang H, Mckee G, et al. Defining digital public health and the role of digitization, digitalization, and digital transformation: scoping review. JMIR Public Health Surveill. 2021;7(11):e30399. [FREE Full text] [CrossRef] [Medline]
- Stegemann L, Gersch M. Interoperability–technical or economic challenge? It-Inf Technol De Gruyter Oldenbourg. 2019;61(5-6):243-252. [CrossRef]
- Reza F, Prieto J, Julien S. Electronic health records: origination, adoption, and progression. In: Magnuson JA, Dixon BE, editors. Public Health Informatics and Information Systems. Cham. Springer; 2020:183-201.
- Legner C, Eymann T, Hess T, Matt C, Böhmann T, Drews P, et al. Digitalization: opportunity and challenge for the business and information systems engineering community. Bus Inf Syst Eng. 2017;59(4):301-308. [CrossRef]
- Digital Around the World. DataReportal-Glob Digit Insights. Kepios Pte Ltd. 2025. URL: https://datareportal.com/global-digital-overview [accessed 2025-05-09]
- Zisis K, Pavi E, Geitona M, Athanasakis K. Real-world data: a comprehensive literature review on the barriers, challenges, and opportunities associated with their inclusion in the health technology assessment process. J Pharm Pharm Sci. 2024;27:12302. [CrossRef] [Medline]
- Graili P, Guertin JR, Chan KKW, Tadrous M. Integration of real-world evidence from different data sources in health technology assessment. J Pharm Pharm Sci. 2023;26:11460. [CrossRef] [Medline]
- Anagnostopoulos C, Devereson A, El TA, Arias E, Westra A. Real-world data quality: what are the opportunities and challenges? McKinsey & Company. Jan 05, 2023. URL: https://www.mckinsey.com/industries/life-sciences/our-insights/real-world-data-quality-what-are-the-opportunities-and-challenges#/ [accessed 2024-08-23]
- Fiske A, Degelsegger-Márquez A, Marsteurer B, Prainsack B. Value-creation in the health data domain: a typology of what health data help us do. Biosocieties. Apr 12, 2022:1-25. [FREE Full text] [CrossRef] [Medline]
- Acosta JN, Falcone GJ, Rajpurkar P, Topol EJ. Multimodal biomedical AI. Nat Med. 2022;28(9):1773-1784. [CrossRef] [Medline]
- Venkatesh KP, Brito G, Kamel Boulos MN. Health digital twins in life science and health care innovation. Annu Rev Pharmacol Toxicol. Jan 23, 2024;64:159-170. [FREE Full text] [CrossRef] [Medline]
- Sahu M, Gupta RK, Ambasta R, Kumar P. Artificial intelligence and machine learning in precision medicine: a paradigm shift in big data analysis. Prog Mol Biol Transl Sci. 2022;190(1):57-100. [CrossRef] [Medline]
- What is the Trusted Exchange Framework and Common Agreement™? ONC TEFCATM Recognized Coordinating Entity. 2025. URL: https://rce.sequoiaproject.org/tefca/ [accessed 2025-09-09]
- Icons. Flaticon. URL: https://www.flaticon.com/icons [accessed 2026-06-04]
- Goldhaber NH, Jacobs M, Laurent L, Knight R, Zhu W, Pham D, et al. Integrating clinical research into electronic health record workflows to support a learning health system. JAMIA Open. 2024;7(2):ooae023. [FREE Full text] [CrossRef] [Medline]
- Leavy MB, Swenson A. Data sources. In: Tools Technol Regist Interoperability Regist Eval Patient Outcomes User's Guide 3rd Ed Add 2 Internet Agency for Healthcare Research and Quality (US). Rockville (MD). Agency for Healthcare Research and Quality (US); 2019.
- Payne TH, Lovis C, Gutteridge C, Pagliari C, Natarajan S, Yong C, et al. Status of health information exchange: a comparison of six countries. J Glob Health. 2019;9(2):0204279. [FREE Full text] [CrossRef] [Medline]
- Holmgren AJ, Esdar M, Hüsers J, Coutinho-Almeida J. Health information exchange: understanding the policy landscape and future of data interoperability. Yearb Med Inform. 2023;32(1):184-194. [FREE Full text] [CrossRef] [Medline]
- Thiel R, Deimel L, Schmidtmann D, Piesche K, Hüsing T, Rennoch J, et al. #SmartHealthSystems: International comparison of digital strategies. Gütersloh. Bertelsmann Stiftung; 2018.
- Vest JR. Menachemi N, Singh S, editors. Health Information Exchange: National and International Approaches. Cambridge, MA (USA). Health Inf Technol Int Context Emerald Group Publishing Limited; 2012.
- OECD. Facilitating the secondary use of health data for public interest purposes across borders. OECD Digit Econ Pap Paris. OECD Publishing; 2025:376.
- Alam P, Bolio A, Lin L, Larson HJ. Stakeholders' perceptions of personal health data sharing: a scoping review. PLOS Digit Health. 2024;3(11):e0000652. [FREE Full text] [CrossRef] [Medline]
- Sheikh A. From learning healthcare systems to learning health systems. Learn Health Syst. 2020;4(3):e10216. [FREE Full text] [CrossRef] [Medline]
- Barker W, Chang W, Everson J, Gabriel M, Patel V, Richwine C, et al. The evolution of health information technology for enhanced patient-centric care in the United States: data-driven descriptive study. J Med Internet Res. 2024;26:e59791. [FREE Full text] [CrossRef] [Medline]
- Raboin K, Ellis D, Nichols G, Hughes M, Brimacombe M, Rubin K. Advancing newborn screening long-term follow-up: integration of -based registries, dashboards, and efficient workflows. Int J Neonatal Screen. Mar 25, 2024;10(2):27. [FREE Full text] [CrossRef] [Medline]
- Mankowski MA, Bae S, Strauss AT, Lonze BE, Orandi BJ, Stewart D, et al. Generalizability of kidney transplant data in electronic health records-the epic cosmos database vs the scientific registry of transplant recipients. Am J Transplant. 2025;25(4):744-755. [FREE Full text] [CrossRef] [Medline]
- Kruse CS, Stein A, Thomas H, Kaur H. The use of electronic health records to support population health: a systematic review of the literature. J Med Syst. 2018;42(11):214. [FREE Full text] [CrossRef] [Medline]
- Sun Z, Compeau D, Carter M. Mapping the landscape of health information exchange (HIE) networks in the United States. Commun Assoc Inf Syst. Nov 12, 2021;49(1):483-514. [CrossRef]
- Regulation (EU) 2025/327 of the European Parliament and of the Council of 11 February 2025 on the European Health Data Space and amending Directive 2011/24/EU and Regulation (EU) 2024/2847 (Text with EEA relevance). EUR-Lex. 2024. URL: http://data.europa.eu/eli/reg/2025/327/oj [accessed 2025-11-02]
- Abbasi AB, Layden J, Gordon W, Gregurick S, DeLew N, Grossman J, et al. A unified approach to health data exchange: a report from the US DHHS. JAMA. 2025;333(12):1074-1079. [CrossRef] [Medline]
- U.S. Department of Health and Human Services, U.S. Office of the Secretary. “Health Data, Technology, and Interoperability: Patient Engagement, Information Sharing, and Public Health Interoperability” Final Rule. Vols 2024-29163. 2025. URL: https://www.federalregister.gov/documents/2024/12/16/2024-29163/health-data-technology-and-interoperability-trusted-exchange-framework-and-common-agreement-tefca
- U.S. Department of Health and Human Services, Office of the National Coordinator for Health Information Technology. Examining Oversight of the Privacy & Security of Health Data Collected by Entities Not Regulated by HIPAA. 2016. URL: https://www.healthit.gov/sites/default/files/non-covered_entities_report_june_17_2016.pdf
- Brightwell C, Brückner S, Halpern O, Gilbert S. Trust and inclusion in digital health: the need to transform consent. Digit Soc. 2024;3(3):52. [CrossRef]
- Kertesz F. Collaboration in healthcare: implications of data sharing for secondary use in the European Union. Eur J Health Law Brill Nijhoff. 2024;31(5):497-517. [CrossRef] [Medline]
- Sadée C, Testa S, Barba T, Hartmann K, Schuessler M, Thieme A, et al. Medical digital twins: enabling precision medicine and medical artificial intelligence. Lancet Digit Health. Jul 2025;7(7):100864. [FREE Full text] [CrossRef] [Medline]
- Demuth S, De Sèze J, Edan G, Ziemssen T, Simon F, Gourraud P. Digital representation of patients as medical digital twins: data-centric viewpoint. JMIR Med Inform. 2025;13:e53542-e53542. [FREE Full text] [CrossRef] [Medline]
- Li X, Loscalzo J, Mahmud AKMF, Aly DM, Rzhetsky A, Zitnik M, et al. Digital twins as global learning health and disease models for preventive and personalized medicine. Genome Med. 2025;17(1):11. [FREE Full text] [CrossRef] [Medline]
- Katsoulakis E, Wang Q, Wu H, Shahriyari L, Fletcher R, Liu J, et al. Digital twins for health: a scoping review. NPJ Digit Med. Mar 22, 2024;7(1):77. [CrossRef] [Medline]
- Laubenbacher R, Adler F, An G, Castiglione F, Eubank S, Fonseca LL, et al. Forum on immune digital twins: a meeting report. NPJ Syst Biol Appl. Feb 16, 2024;10(1):19. [CrossRef] [Medline]
- Grieves MW. Digital twins: past, present, future. In: Crespi N, Drobot AT, Minerva R, editors. The Digital Twin. Springer. Cham; 2023:97-121.
- Laubenbacher R, Mehrad B, Shmulevich I, Trayanova N. Digital twins in medicine. Nat Comput Sci. Mar 2024;4(3):184-191. [CrossRef] [Medline]
- Voigt I, Inojosa H, Dillenseger A, Haase R, Akgün K, Ziemssen T. Digital twins for multiple sclerosis. Front Immunol. 2021;12:669811. [FREE Full text] [CrossRef] [Medline]
- Rubinger L, Ekhtiari S, Gazendam A, Bhandari M. Registries: big data, bigger problems? Injury. May 2023;54 Suppl 3:S39-S42. [CrossRef] [Medline]
- Miller RS, Mitchell K, Myslinski R, Rising J. Health information technology (IT) and patient registries. In: Tools Technol Regist Interoperability Regist Eval Patient Outcomes User's Guide 3rd Ed Add 2 Internet Agency for Healthcare Research and Quality (US). Rockville, MD. Agency for Healthcare Research and Quality (US); 2019.
- Gliklich RE, Leavy MB, Dreyer NA. Tools Technol Regist Interoperability Regist Eval Patient Outcomes User's Guide 3rd Ed Add 2. Rockville, MD. Agency for Healthcare Research and Quality (US); 2019.
- Dash S, Shakyawar SK, Sharma M, Kaushik S. Big data in healthcare: management, analysis and future prospects. J Big Data. 2019;6(1):54. [CrossRef]
- Porter ME, Teisberg EO. Redefining health care: creating value-based competition on results. Boston, Mass. Harvard Business School Press; 2006.
- Porter ME. Value-based health care delivery. Ann Surg. 2008;248:144-150. [CrossRef]
- Porter ME. What is value in health care? N Engl J Med Massachusetts Medical Society. 2010;363(26):2477-2481. [CrossRef] [Medline]
- Achungura G, Anand, aidev S, Peepre S, Srivastava A, Pandit P. A global review of value-based care theory, practice and lessons learned. World Health Organization Country Office for India. 2025. URL: https://www.who.int/publications/i/item/9789290220121 [accessed 2025-04-25]
- Jones CH, Dolsten M. Healthcare on the brink: navigating the challenges of an aging society in the United States. NPJ Aging. Apr 06, 2024;10(1):22. [CrossRef] [Medline]
- Smith PC, Sagan A, Siciliani L, Figueras J. Building on value-based health care: towards a health system perspective. Health Policy. 2023;138:104918. [FREE Full text] [CrossRef] [Medline]
- Al-Sahab B, Leviton A, Loddenkemper T, Paneth N, Zhang B. Biases in electronic health records data for generating real-world evidence: an overview. J Healthc Inform Res. 2024;8(1):121-139. [CrossRef] [Medline]
- Msaouel P. The big data paradox in clinical practice. Cancer Invest. 2022;40(7):567-576. [FREE Full text] [CrossRef] [Medline]
- Agniel D, Kohane IS, Weber GM. Biases in electronic health record data due to processes within the healthcare system: retrospective observational study. BMJ. 2018;361:k1479. [FREE Full text] [CrossRef] [Medline]
- García-Abeijon P, Costa C, Taracido M, Herdeiro MT, Torre C, Figueiras A. Factors associated with underreporting of adverse drug reactions by health care professionals: a systematic review update. Drug Saf. 2023;46(7):625-636. [CrossRef] [Medline]
- Barouni M, Ahmadian L, Anari HS, Mohsenbeigi E. Challenges and adverse outcomes of implementing reimbursement mechanisms based on the diagnosis-related group classification system: a systematic review. Sultan Qaboos Univ Med J. 2020;20(3):e260-e270. [FREE Full text] [CrossRef] [Medline]
- Brown JP, Hunnicutt JN, Ali MS, Bhaskaran K, Cole A, Langan SM, et al. Quantifying possible bias in clinical and epidemiological studies with quantitative bias analysis: common approaches and limitations. BMJ. 2024;385:e076365. [CrossRef] [Medline]
- Zhuang Y, Zhang L. Promoting TEFCA with blockchain technology: a decentralized approach to patient-centered healthcare data management. AMIA Annu Symp Proc. 2024:824-833. [FREE Full text] [Medline]
- Savage M, Savage LC. Doctors routinely share health data electronically under HIPAA, and sharing with patients and patients' third-party health apps is consistent: interoperability and privacy analysis. J Med Internet Res. 2020;22(9):e19818. [FREE Full text] [CrossRef] [Medline]
- Gansel X, Mary M, van Belkum A. Semantic data interoperability, digital medicine, and e-health in infectious disease management: a review. Eur J Clin Microbiol Infect Dis. 2019;38(6):1023-1034. [CrossRef] [Medline]
- Torab-Miandoab A, Samad-Soltani T, Jodati A, Rezaei-Hachesu P. Interoperability of heterogeneous health information systems: a systematic literature review. BMC Med Inform Decis Mak. 2023;23(1):18. [FREE Full text] [CrossRef] [Medline]
- Park Y, Kang H, Kim J, Shin SY, Cho D, Rhee SY, et al. Development and evaluation of SNOMED CT automated mapping tool: advancing terminology standardization and semantic i. JMIR Med Inform. 2026;14:e82670. [FREE Full text] [CrossRef] [Medline]
- Ehrsam J, Gaudet-Blavignac C, Mattei M, Baumann M, Lovis C. Semantics in action: a guide for representing clinical data elements with SNOMED CT. J Biomed Semantics. 2025;16(1):7. [FREE Full text] [CrossRef] [Medline]
- New HL7 Europe FHIR implementation guides to support the European Health Data Space. HL7 Europe. 2025. URL: https://www.hl7europe.org/new-hl7-europe-fhir-implementation-guides-to-support-the-european-health-data-space/ [accessed 2026-04-25]
- Resourcelist - FHIR v5.0.0. Health Level Seven International. URL: https://www.hl7.org/fhir/resourcelist.html [accessed 2026-04-25]
- Casey JD, Courtright KR, Rice TW, Semler MW. What can a learning healthcare system teach us about improving outcomes? Curr Opin Crit Care. 2021;27(5):527. [CrossRef]
- Bertelsen N, Oehrlein E, Lewis B, Westrich-Robertson T, Elliott J, Willgoss T, et al. Patient engagement and patient experience data in regulatory review and health technology assessment: where are we today? Ther Innov Regul Sci. 2025;59(4):737-752. [CrossRef] [Medline]
Abbreviations
| DT: digital twin |
| EHDS: European Health Data Space |
| EHR: electronic health record |
| FHIR: Fast Healthcare Interoperability Resources |
| HIE: health information exchange |
| HL7: Health Level 7 |
| HTA: health technology assessment |
| LHS: learning health system |
| MS: multiple sclerosis |
| RWD: real-world data |
| SNOMED CT: Systematized Nomenclature of Medicine–Clinical Terms |
| TEFCA: Trusted Exchange Framework and Common Agreement |
| VBC: value-based care |
Edited by A Mavragani; submitted 02.Feb.2026; peer-reviewed by S Thun, R Niebler, A Piętocha; comments to author 06.Apr.2026; accepted 19.May.2026; published 03.Jul.2026.
Copyright©Judith Wenk, Hernan Inojosa, Isabel Voigt, Stephen Gilbert, Tjalf Ziemssen. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 03.Jul.2026.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

