Abstract
Patient-generated health data (PGHD) encompass health-related information created, recorded, and gathered by patients in their daily lives, and are distinct from data collected in clinical settings. PGHD can offer insight into patients’ everyday health behaviors and conditions, supporting health management and clinical decision-making. The Veterans Health Administration (VHA) has developed a robust infrastructure to collect PGHD, including automatically collected data from digital sensors and patient-entered data. This effort is guided by comprehensive policy and strategy documents to ensure the secure storage and effective use of PGHD. This paper describes the development and implementation of an infrastructure to support PGHD within the VHA and highlights envisioned clinical and research uses of PGHD to advance health care for US veterans. The PGHD database was built to Fast Healthcare Interoperability Resources standards, facilitating secure data storage and exchange of PGHD. Clinical tools, such as the provider-facing dashboards, make PGHD accessible from the electronic health records. Research and evaluation efforts focus on evaluating PGHD’s impact on patient engagement, clinical outcomes, and health care equity. The VHA’s comprehensive PGHD infrastructure represents a significant advancement in personalized health care and patient engagement. The integration of PGHD into clinical practice can enhance shared decision-making and self-management, while research and evaluation efforts can address how to maximize the benefits of PGHD for veterans. The VHA’s approach sets a benchmark for other US health care systems in leveraging PGHD to achieve the broad aims of enhancing stakeholder health care experiences, improving population health and health equity, and reducing costs.
J Med Internet Res 2025;27:e70755doi:10.2196/70755
Keywords
Introduction
Patient-generated health data (PGHD) are health-related data that are created, recorded, or gathered by patients in their everyday lives. PGHD can be used to promote health and wellness or to help address a health concern [
]. PGHD overlap with the related concepts of digital phenotyping and personal sensing but can also include nondigital format data [ , ]. PGHD are distinct from data generated in clinical settings, as they provide insight from patients’ everyday lives, outside of the health care encounter [ ]. PGHD often include (but are not limited to) biometric data, symptomatology, and activity levels, and can be collected manually or through digital devices such as wearables (eg, smartwatches), mobile health apps, and Bluetooth-enabled medical devices (eg, blood pressure cuffs). Integrating PGHD into clinical care has the potential to improve outcomes by supporting care delivery processes across the care continuum, including disease prevention and diagnosis, health management, interventions, patient-provider communication, and shared decision-making [ ]. In addition, PGHD have the potential to empower and engage patients in managing their own health, including engagement in healthy behaviors and monitoring of chronic conditions [ ].The Veterans Health Administration (VHA) Digital Health Office and Office of Connected Care is responsible for the development of an evolving suite of veteran- and health care team–facing virtual care tools, many of which support the collection of PGHD. In 2001, the VHA launched the tethered online patient portal, My HealtheVet, and later allowed veterans to manually track their personal health data, such as vital measurements, diet, sleep, or physical activity for their own records. In 2014, the VHA launched Annie for Veterans (“Annie”), a VHA SMS text messaging system modeled after the Florence Simple Telehealth system created by the National Health Service in the United Kingdom [
]. Annie sends automated self-care reminders to veterans and allows veterans to send self-entered PGHD to their clinical teams.In 2020, the VHA’s Office of Connected Care launched a new initiative to expand the VHA’s ability to collect PGHD from veterans who want to share their data through home medical devices (eg, digital glucometers) and activity trackers (eg, Fitbit) (Newton and Frisbee, unpublished data, 2020). In May 2023, the Office of Connected Care released the Share My Health Data mobile health app, allowing veterans to sync Bluetooth-enabled medical devices, smartwatch and wearable activity trackers, and third-party mobile health apps (eg, Apple Health) and share the corresponding PGHD with the VHA (
). The Share My Health Data app also allows veterans to directly self-enter PGHD, such as body weight, and can sync with continuous glucose monitoring devices. Prior to using the app, veterans must first accept the end-user license agreement and the data collection agreement. The data collection agreement informs the user that the VHA may collect and store any PGHD that the user has agreed to share with the VHA. Consent for the VHA to use a veteran’s PGHD includes the purposes for use such as clinical care, veteran population health initiatives, quality improvement efforts, and research.
There is great interest, both across VHA and other US health care systems, in harnessing PGHD to reduce costs, improve clinical outcomes and health equity, and enhance stakeholder health care experiences [
]. To support these goals, the VHA’s Office of Connected Care has developed the PGHD database, where veterans’ PGHD are stored and transformed into data elements that can be accessed by the VHA health care team members and researchers. The Office of Connected Care aims to support researchers who study PGHD to improve clinical care processes and veteran health outcomes, as well as integrate the collection and use of PGHD within research and evaluation projects to support their aims and objectives [ ].This manuscript describes the technical requirements, infrastructure, unique attributes and limitations, and potential clinical, research, and evaluation uses of the PGHD database as part of the VHA’s broader efforts to realize the value of PGHD in advancing health care for US veterans. As the PGHD database evolves and lessons are learned from researchers, field testers, and pilot projects, the goal will be to integrate PGHD into fundamental clinical workflows and build a knowledge base for where integration provides the greatest value.
PGHD Technical Requirements
Data collected in the PGHD database are guided by a series of VHA policy and strategy documents, the development of which was overseen by the Office of Connected Care [
]. These documents provide guidance on key aspects of collecting, storing, and using PGHD within the health care system, including data access and management, clinical workflows, management of anticipated risks, implementation, and evaluation (U.S. Department of Veterans Affairs, unpublished data, 2021) [ ]. The VHA Directive 6506 defines PGHD as, “health data created, recorded or gathered electronically by or from veterans, beneficiaries, or their authorized delegates outside the clinical health care setting to help address a health concern” [ ]. The directive further defines PGHD as either solicited (ie, PGHD that the VHA providers request from veterans) or unsolicited (ie, PGHD that are provided by veterans without a request from the VHA providers) and outlines the VHA’s efforts around the use of PGHD as threefold: (1) supporting veterans with self-management; (2) supporting clinical decision-making and care delivery for individual veterans; and (3) supporting data use for health care analytics, including population health, quality improvement, and research. The expectations for provider documentation and for the use of PGHD within the electronic health record are also outlined in VHA Directive 6506.The VHA’s PGHD database contains data collected from several sources, including VHA mobile apps, third-party mobile health apps, and Bluetooth-enabled digital devices.
depicts the data flow from these sources to the end-users of the data. To ensure the standardization across sources, data in the PGHD database go through a Fast Healthcare Interoperability Resources (FHIR)–compliant application programming interface (API). The FHIR standard is designed to facilitate the exchange of data between different health care applications by organizing data elements into resources and transmission through an API [ ]. PGHD obtained through the API are stored in an analytic cloud environment via two processes: one is a direct copy of the FHIR-compliant data (in JavaScript Object Notation [JSON] format); and the other is a normative copy created by an Extract, Transform, and Load data integration process. Once loaded into the Corporate Data Warehouse and the Health Data and Analytics Platform, data can be displayed for viewing by VHA clinicians and staff through the Virtual Care Manager and Clinician Decision Support Consoles, and by veterans through the Share My Health Data App. A parallel copy is loaded on the VHA’s Informatics and Computing Infrastructure Workplace servers for operational and research use. Separate data integration and validation processes are applied to other sources, so that these data sources can also be stored in the PGHD database using the same FHIR standard.
PGHD Infrastructure
A formal PGHD governance structure ensures that data contained in the PGHD database adhere to the FHIR data standard and mappings, and that data are stored securely and in accordance with VHA policy. Rows in the database represent observations of specific PGHD measurements taken over a specific time period, such as the number of steps taken during a calendar day, systolic blood pressure taken at a specific date and time in mm Hg, or total sleep duration in minutes. Data elements in the PGHD are linked to specific patients in the VHA through a unique patient identifier and can be merged with other VHA data sources in the VHA’s Corporate Data Warehouse. Across data sources, over 130 million observations of PGHD have been collected from over 25,000 patients in the VHA since May 1, 2023. While the PGHD database contains data from Annie and patient-entered vital signs, most of the observations come from third-party health apps connected via the Share My Health Data App. Share My Health Data allows users to share data collected from health aggregator apps such as Apple Health, fitness trackers such as Fitbit or Garmin, pair over 50 Bluetooth-enabled medical devices, and input patient-entered vitals. The most common types of PGHD are from daily activity summaries (eg, daily steps and distance traveled), exercise, sleep, and vital signs. Recent updates enabling the collection of high-frequency data from digital sensor devices (eg, minute-by-minute heart rate [pulse] data, glucose readings from continuous glucose monitoring devices, etc) will allow even more granular insight into veterans’ experiences outside the clinical encounter.
Importantly, PGHD from clinical programs, such as remote home monitoring telehealth, are currently not stored in the PGHD database. These programs are closely monitored by clinicians through established protocols and procedures. Conversely, data contained in the PGHD database are more likely to fall under the category of “unsolicited” data and will not typically be actively monitored. Providers retain autonomy in determining with their patients when to review unsolicited PGHD and are not responsible for making use of the data unless doing so is specified in the veteran’s care plan [
].Unique Attributes and Potential Limitations of Unsolicited PGHD
Although PGHD can offer valuable insights into patients’ daily activities, vital signs, and sleep patterns, several limitations of such data must be acknowledged. Wearable digital devices, while improving in precision, often have inconsistencies in data capture due to various factors such as device placement, user adherence, battery life, and other technical issues [
]. For example, step counts can vary significantly based on the make, model, and type of device (eg, smartphone vs Bluetooth digital pedometer vs smartwatch), and heart rate measurements can be affected by movement or improper device placement [ , ]. Heterogeneity, variability, and data quality are also known challenges with sleep data [ ].Another limitation of PGHD relates to its completeness and representativeness. The data that are in the PGHD database, for example, depend on what are available through an API from third-party digital health companies. Metadata that come from Apple Health may differ from what are available through Garmin or Fitbit. Additionally, not all individuals consistently use their devices and, in some cases, users may make unintentional entry errors (in cases of manual, self-entered PGHD) or intentionally alter data reporting. These behaviors can lead to gaps in data that can affect the continuity and reliability of health monitoring.
It can also be difficult to determine if a daily summary measurement (eg, daily step count) represents a full or partial day of use. Additionally, individuals who use mobile health apps and digital devices are likely not representative of broader patient populations, as users of mobile apps and digital devices tend to be younger, more health-conscious, and of higher socioeconomic status, or in other cases, have a higher burden of disease and health care needs. These trends can limit the generalizability of research findings generated from PGHD [
]. Finally, a key distinction between PGHD from wearables and devices and traditional health care data collected in clinical settings is that PGHD are tied to the device generating the data. As a result, there is some inherent uncertainty regarding the identity of the individual operating the device transmitting PGHD to the VHA.Given these limitations, appropriately representing and analyzing PGHD are critical for health systems looking to integrate such data for clinical, research, and evaluation use. The lack of certain metadata from third-party health apps limits the ability to discern between older and newer makes and models of common wearables and other devices. Prior studies on older devices have raised concern about the performance in populations with chronic conditions and mobility issues [
]. As such, individuals and providers could become concerned (or reassured) about a patient’s status based on unreliable data from an unreliable device. In future iterations of the database, it is possible that more metadata from the third-party health aggregators (eg, make and model) will be available; however, availability will be dependent on what is available through the APIs, and may differ from one device or app to another.These limitations necessitate careful consideration when interpreting PGHD and underscore the importance of complementing it with more traditional data obtained during clinical encounters. As a tool for monitoring health and wellness, consistent collection of PGHD can help monitor important trends over time and can help minimize the risk of any one individual data point being inaccurate.
Potential Clinical Uses of the PGHD Database
The PGHD database establishes the infrastructure necessary to develop tools that support clinical use of PGHD. Currently, patients can view their own PGHD through the Share My Health Data mobile app’s daily dashboards and the VHA clinicians can view their patients’ PGHD through Power BI (Microsoft) clinical dashboards accessed through the VHA’s electronic health record and a provider-facing web application called Virtual Care Manager. These provider dashboards do not require additional logins but do require knowledge of where they are located within the electronic health record. These dashboards display PGHD alongside data from the patient’s medical record to facilitate the identification of trends and opportunities for clinical intervention. The Patient Generated Vitals and Health Data Dashboard contains tabs for different types of PGHD, such as blood pressure, blood glucose, weight, diet or nutrition, activity, and pulse oximetry data (
and ). The Diabetes Dashboard combines PGHD (blood glucose, blood pressure, weight, nutrition, activity, and blood oxygen levels) with data collected during in-person visits, laboratory results, and active medication information.

The Office of Connected Care is currently conducting pilot tests to assess the usability of PGHD in clinical care and the PGHD clinical dashboards. The pilot tests are designed to identify the level of effort required by veterans to sync and share their medical or activity devices, and to assess how clinical teams are incorporating PGHD into their clinical practice and workflows. Ongoing evaluations of the pilot tests will identify barriers and potential solutions to the integration of PGHD into clinical practice. Although the VHA has not yet committed to a set of metrics to define “success” for the expansion of PGHD, there is commitment to studying the lessons learned from the ongoing pilot studies to inform targets for growth.
Potential Research and Evaluation Uses for the PGHD Database
The VHA has identified five priority areas for virtual care research, two of which directly relate to the VHA’s PGHD initiatives: (1) identifying implementation strategies to increase patient and clinician adoption of effective virtual care technologies, and (2) examining how best to use PGHD in combination with electronic health record data to generate clinically valuable alerts and predictions for providers [
]. The development of the Share My Health Data app and accompanying clinician dashboards will facilitate research and evaluation initiatives aligned with both of these priorities.Identifying implementation strategies to increase patient and clinician adoption of effective virtual care technologies marks the beginning of an extensive research and evaluation effort to understand the value of PGHD. Specific areas of interest include the role of PGHD in facilitating patient engagement in treatment or self-management, shared decision-making, and population health management. The VHA’s PGHD-related research agenda also focuses on research to inform improvements to PGHD implementation and to evaluate the safety, effectiveness, and health equity of PGHD solutions [
]. Examples include studies that engage with clinicians to understand which types of PGHD should be prioritized for clinical use, the evaluation of practice protocols for PGHD, barriers and solutions to the interoperability of PGHD with the electronic health record, solutions to address disparities in access to PGHD, the impact of PGHD on patient outcomes and effectiveness of care, and the most effective ways to integrate PGHD into clinical workflows and decision-making processes.The PGHD database can also be leveraged as part of research and evaluation initiatives designed to measure the impact of broader health care interventions. Doing so is critical to examining how best to use PGHD in combination with electronic health record data to generate clinically valuable alerts and predictions. For example, rather than relying on data captured in the electronic health record collected during clinical encounters, the Share My Health Data app can be used to gather real-world data in between clinical encounters to assess the effectiveness of interventions being tested such as continuous glucose monitoring data, digital home blood pressure cuff readings, sleep quality, or daily step counts from wearables.
The aforementioned research and evaluation priorities represent cutting-edge work in the field of health services research and technology-assisted care. Thus, the advent of the PGHD database represents an avenue to disrupt the field of health care delivery such that clinicians are able to adopt a more personalized and data-informed approach to treatment.
Discussion
Both within and outside of the VHA, health care systems have been increasingly interested in exploring the potential value of PGHD. As personal digital devices that can seamlessly capture these data have become commonplace, the next era of health care will involve capturing and harnessing these rich data elements in patients’ day-to-day lives outside of the clinician’s office to facilitate delivering care that is informed by each patient’s unique digital phenotype. The advent of the Share My Health Data app and accompanying clinician dashboards represent an important step the VHA is taking to be on the cutting edge of this transformation. The extensive collection and thoughtful application of PGHD not only support the individual health journeys of veterans but can also contribute broadly to the efficiency and effectiveness of health care and service delivery.
The next step of the PGHD initiative at the VHA is focused on research and evaluation to identify how best to integrate PGHD into workflows and find where it provides the most value. The VHA is particularly interested in identifying high-priority clinical use cases for PGHD and understanding implementation strategies to encourage patient and provider adoption. On the provider end, evaluations of attitudes toward PGHD and barriers to integrating PGHD-driven insights into care are needed. On the patient end, work is needed to encourage engagement and better understand expectations around the use of PGHD. Veterans can currently access the PGHD that they share through the Share My Health Data app; plans exist for the development of rules and alerts, and an online platform where veterans can more fully interact with their data. Further, although the Share My Health Data app alerts users that their data may not be regularly monitored by their care teams, veterans may still expect their providers to review data prior to and during clinical appointments or to notify them of any concerns in between appointments. How to effectively communicate to veterans how PGHD will be used by their care teams is necessary to ensure that it aligns with veteran expectations and preferences.
As the PGHD infrastructure at the VHA evolves, developing and integrating clinical alerts and predictions that make care more proactive and individualized for patients is crucial. Certainly, in the digital phenotyping space, we have seen that important clinical occurrences such as mood episodes can be predicted with this type of PGHD [
, ], but we have yet to see the broad implementation of alerts or models of care that leverage these types of data in routine practice.The future of health care at the VHA, underpinned by the expansive use of PGHD, looks toward not only achieving the broad goals of enhancing stakeholder experiences, improving population health and health equity, and reducing costs but also setting a benchmark for other US health care systems. The VHA is committed to the broad dissemination of research and operational findings related to the implementation of PGHD across its clinics and facilities nationwide. As the VHA pursues the next step of rollout for the Share My Health Data app, the emphasis will be on establishing clear ethical standards for using this data, identifying and addressing challenges presented by the data itself, and streamlining clinical workflows for implementation success. Already, numerous lessons have been learned that have yielded valuable insights. As the VHA’s PGHD infrastructure evolves, the VHA is committed to sharing these lessons broadly, along with accompanying recommendations. The hope is that these lessons will be beneficial for other health care systems and their own efforts to better integrate PGHD into health care delivery.
Acknowledgments
The authors would like to acknowledge the critical contributions of Kevin Troutner, Vincent Catania, and their team members who built and supported the technical infrastructure necessary for the Veterans Health Administration's (VHA) Patient-Generated Health Data Program described in this manuscript. The authors would also like to thank Dr Meredith Josephs for her leadership as executive sponsor of the Veteran Affair's (VA) Patient-Generated Health Data Program. This work was supported by the US Department of Veterans Affairs; Office of Connected Care; and Office of Research and Development, Health Services Research and Development Service, Quality Enhancement Research Initiative Program (PEC 15-470; principal investigator: TPH). The views expressed in this article are those of the authors and do not necessarily reflect the position or policy of the Department of Veterans Affairs or the United States government.
Data Availability
The Veterans Health Administration (VHA) patient-generated health data (PGHD) on which this project is based are not permitted to leave the VHA firewall without a data use agreement. This limitation is consistent with other work based on VHA data. However, VHA data are made freely available to investigators behind the Department of Veterans Affairs' firewall with appropriate documentation.
Conflicts of Interest
None declared.
References
- Nazi KM, Newton T, Armstrong CM. Unleashing the potential for patient-generated health data (PGHD). J Gen Intern Med. Feb 2024;39(Suppl 1):9-13. [CrossRef] [Medline]
- Mohr DC, Zhang M, Schueller SM. Personal sensing: understanding mental health using ubiquitous sensors and machine learning. Annu Rev Clin Psychol. May 8, 2017;13:23-47. [CrossRef] [Medline]
- Onnela JP, Rauch SL. Harnessing smartphone-based digital phenotyping to enhance behavioral and mental health. Neuropsychopharmacology. Jun 2016;41(7):1691-1696. [CrossRef] [Medline]
- Tiase VL, Hull W, McFarland MM, et al. Patient-generated health data and electronic health record integration: a scoping review. JAMIA Open. Dec 2020;3(4):619-627. [CrossRef] [Medline]
- Lavallee DC, Lee JR, Austin E, et al. mHealth and patient generated health data: stakeholder perspectives on opportunities and barriers for transforming healthcare. Mhealth. 2020;6:8. [CrossRef] [Medline]
- Alaboud K, Shahreen M, Islam H, et al. Clinicians’ perspectives in using patient-generated health data to improve ischemic heart disease management. AMIA Jt Summits Transl Sci Proc. 2022;2022:112-119. [Medline]
- From Flo to Annie – how Flo is being used in the USA. The Health Foundation. URL: https://www.health.org.uk/article/from-flo-to-annie [Accessed 2025-05-05]
- Nundy S, Cooper LA, Mate KS. The quintuple aim for health care improvement: a new imperative to advance health equity. JAMA. Feb 8, 2022;327(6):521-522. [CrossRef] [Medline]
- Veterans Health Administration VHA Directive 6506 review and use of patient-generated health data under the office of connected care. Department of Veterans Affairs. 2021. URL: https://www.va.gov/vhapublications/ViewPublication.asp?pub_ID=9252 [Accessed 2025-05-05]
- VA Enterprise Cloud – Mobile Application Platform (Cloud) Assessing (VAEC-MAP) (173VA005OP2). Fed Regist. 2021;86(213):61852-61855. URL: https://www.govinfo.gov/content/pkg/FR-2021-11-08/pdf/2021-24368.pdf [Accessed 2025-05-05]
- Welcome to FHIR®. HL7 FHIR Release 5. URL: https://HL7.org/FHIR [Accessed 2025-05-05]
- Canali S, Schiaffonati V, Aliverti A. Challenges and recommendations for wearable devices in digital health: data quality, interoperability, health equity, fairness. PLOS Digit Health. Oct 2022;1(10):e0000104. [CrossRef] [Medline]
- Bent B, Goldstein BA, Kibbe WA, Dunn JP. Investigating sources of inaccuracy in wearable optical heart rate sensors. NPJ Digit Med. 2020;3:18. [CrossRef] [Medline]
- Angelucci A, Canali S, Aliverti A. Digital technologies for step counting: between promises of reliability and risks of reductionism. Front Digit Health. 2023;5:1330189. [CrossRef] [Medline]
- Perez-Pozuelo I, Zhai B, Palotti J, et al. The future of sleep health: a data-driven revolution in sleep science and medicine. NPJ Digit Med. 2020;3:42. [CrossRef] [Medline]
- Dobson R, Stowell M, Warren J, et al. Use of consumer wearables in health research: issues and considerations. J Med Internet Res. Nov 21, 2023;25:e52444. [CrossRef] [Medline]
- Lauritzen J, Muñoz A, Luis Sevillano J, Civit A. The usefulness of activity trackers in elderly with reduced mobility: a case study. Stud Health Technol Inform. 2013;192:759-762. [Medline]
- Hogan TP, Sherman SE, Dardashti N, McMahon N, Slightam C, Zulman DM. Realizing virtual care in VA: supporting the healthcare system’s journey towards enhanced access, engagement, and outcomes. J Gen Intern Med. Feb 2024;39(Suppl 1):1-4. [CrossRef] [Medline]
- Cho CH, Lee T, Kim MG, In HP, Kim L, Lee HJ. Mood prediction of patients with mood disorders by machine learning using passive digital phenotypes based on the circadian rhythm: prospective observational cohort study. J Med Internet Res. Apr 17, 2019;21(4):e11029. [CrossRef] [Medline]
- Lee HJ, Cho CH, Lee T, et al. Prediction of impending mood episode recurrence using real-time digital phenotypes in major depression and bipolar disorders in South Korea: a prospective nationwide cohort study. Psychol Med. Sep 2023;53(12):5636-5644. [CrossRef] [Medline]
Abbreviations
Annie: Annie for Veterans |
API: application programming interface |
FHIR: Fast Healthcare Interoperability Resources |
JSON: JavaScript Object Notation |
PGHD: patient-generated health data |
VHA: Veterans Health Administration |
Edited by Amaryllis Mavragani; submitted 13.01.25; peer-reviewed by Carolyn Clancy, Victoria Tiase; final revised version received 28.03.25; accepted 04.04.25; published 06.06.25.
Copyright© Terry J Newton, Nilesh Shah, Katherine Lewis, Mark S Zocchi, Felicia R Bixler, Bella Etingen, Jessica M Lipschitz, Stephanie A Robinson, Timothy P Hogan, Stephanie L Shimada. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 6.6.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.