Published on in Vol 26 (2024)

This is a member publication of University of Oxford (Jisc)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/57827, first published .
Value of Engagement in Digital Health Technology Research: Evidence Across 6 Unique Cohort Studies

Value of Engagement in Digital Health Technology Research: Evidence Across 6 Unique Cohort Studies

Value of Engagement in Digital Health Technology Research: Evidence Across 6 Unique Cohort Studies

Original Paper

14YouandMe, Seattle, WA, United States

2Department of Psychiatry, University of Oxford, Oxford, United Kingdom

3Crohn's & Colitis Foundation, New York, NY, United States

4Section of Urology and Renal Transplantation, Virginia Mason Francisan Health, Seattle, WA, United States

5MindMed Inc, New York, NY, United States

6Tufts University School of Medicine, Boston, MA, United States

7Department of Paediatrics, University of Toronto, Toronto, ON, Canada

8Department of Pediatric Hematology/Oncology, Schneider Children's Medical Center of Israel, Petach-Tikva, Israel

9Department of Pediatrics, University of Toronto, Toronto, ON, Canada

10Gasteroentology Unit, Oxford University Hospitals NHS Foundation Trust and Biomedical Research Centre, Oxford, United Kingdom

11The Dr. Henry D. Janowitz Division of Gastroenterology, Icahn School of Medicine at Mount Sinai, New York, NY, United States

12Department of Neurosurgery, Johns Hopkins University School of Medicine, Baltimore, MD, United States

13The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD, United States

14Princess Margaret Cancer Center, University Health Network, Toronto, ON, Canada

15Evidation Health Inc, Santa Mateo, CA, United States

16Sage Bionetworks, Seattle, WA, United States

Corresponding Author:

Sarah M Goodday, MSc, PhD

4YouandMe

2901 3rd Ave

Seattle, WA, 98121

United States

Phone: 1 (206) 928 8243

Email: sarah@4youandme.org


Background: Wearable digital health technologies and mobile apps (personal digital health technologies [DHTs]) hold great promise for transforming health research and care. However, engagement in personal DHT research is poor.

Objective: The objective of this paper is to describe how participant engagement techniques and different study designs affect participant adherence, retention, and overall engagement in research involving personal DHTs.

Methods: Quantitative and qualitative analysis of engagement factors are reported across 6 unique personal DHT research studies that adopted aspects of a participant-centric design. Study populations included (1) frontline health care workers; (2) a conception, pregnant, and postpartum population; (3) individuals with Crohn disease; (4) individuals with pancreatic cancer; (5) individuals with central nervous system tumors; and (6) families with a Li-Fraumeni syndrome affected member. All included studies involved the use of a study smartphone app that collected both daily and intermittent passive and active tasks, as well as using multiple wearable devices including smartwatches, smart rings, and smart scales. All studies included a variety of participant-centric engagement strategies centered on working with participants as co-designers and regular check-in phone calls to provide support over study participation. Overall retention, probability of staying in the study, and median adherence to study activities are reported.

Results: The median proportion of participants retained in the study across the 6 studies was 77.2% (IQR 72.6%-88%). The probability of staying in the study stayed above 80% for all studies during the first month of study participation and stayed above 50% for the entire active study period across all studies. Median adherence to study activities varied by study population. Severely ill cancer populations and postpartum mothers showed the lowest adherence to personal DHT research tasks, largely the result of physical, mental, and situational barriers. Except for the cancer and postpartum populations, median adherences for the Oura smart ring, Garmin, and Apple smartwatches were over 80% and 90%, respectively. Median adherence to the scheduled check-in calls was high across all but one cohort (50%, IQR 20%-75%: low-engagement cohort). Median adherence to study-related activities in this low-engagement cohort was lower than in all other included studies.

Conclusions: Participant-centric engagement strategies aid in participant retention and maintain good adherence in some populations. Primary barriers to engagement were participant burden (task fatigue and inconvenience), physical, mental, and situational barriers (unable to complete tasks), and low perceived benefit (lack of understanding of the value of personal DHTs). More population-specific tailoring of personal DHT designs is needed so that these new tools can be perceived as personally valuable to the end user.

J Med Internet Res 2024;26:e57827

doi:10.2196/57827

Keywords



Wearable digital health technologies (DHTs) [1,2] and mobile apps facilitate the remote, real-world assessment of health including objective signs of disease that are typically confined to health care visits and health care provider interpretation. These specific categories of DHTs, herein referred to as “personal DHTs,” hold promise for transforming health research through the new ability to capture high-resolution, high-frequency, in-the-moment health-related multimodal information in decentralized ways. Through the provision of personal DHTs in clinical care, individuals could be better empowered to navigate their health outside the health care system with greater accessibility, agency, and accuracy than currently possible [1,2]. One of the largest challenges in the future of digital health that involves the use of personal DHTs is end-user engagement. While direct comparisons of engagement in personal DHT research are challenging due to the heterogeneous reporting of retention and adherence factors, and a lack of consensus on a definition of “engagement” [3-6], accumulating evidence supports that so far engagement in the use of personal DHTs has been poor. Specifically, retention in personal DHT research studies and the use of health-related apps is low across diverse populations and applications [7-9]. Further, there is evidence of attrition biases in personal DHT research resulting in insufficient representation of minority populations [7]. In addition to poor retention, personal DHT research studies have low adherence to completing active app-based tasks resulting in large amounts of missing data. This missing data problem results in challenges in artificial intelligence models from insufficient volumes of data to follow individual patterns, and limits app-based context “label” data. This “label” data is crucial for validating passively collected information from personal DHTs, particularly given the early state of the field and as the utility of certain approaches such as knowledge graphs and large language models emerge.

Several personal DHTs health research studies have started to surface [7-12], resulting in the identification of barriers to engagement. These barriers include technical problems with the technology and in collecting the data, usability, privacy concerns, and digital literacy. Many of these barriers point to a need to retain a human element in the research process, and to include an aspect of co-designing with end users. Emerging personal DHT research studies that show better engagement retain some form of “human-in-the-loop” (regular contact with research staff) and co-design or end-user approach [11-15]. Among these studies, retention rates of 80% and higher have been observed, while average adherence to wearable device use and daily app surveys have been shown to be >90% and 70%, respectively [11-15].

The promise of digital health rests on the assumption that end users can be engaged in the long-term use of personal DHTs for health monitoring, yet this remains to be seen among most existing research applications. There have been increasing international calls for the inclusion of patients in the design and conduct of health research [16-18], and this seems particularly relevant for digital health research where the patient is the end user of these new remote tools. In this paper, we report on engagement across 6 unique personal DHT health research studies that adopted different aspects of a participant-centric design, but each with distinct population and design features. The objective is to describe how participant engagement techniques and different personal DHT designs affect participant adherence, retention, and overall engagement in personal DHT health research.


Study Design

In total, 6 personal DHT research studies are included in this quantitative and qualitative analysis of engagement that span diverse populations including a frontline health care population (the stress and recovery in frontline health care workers study) [11]; a conception, pregnancy, and postpartum population (Better Understanding the Metamorphosis of Pregnancy [BUMP] study) [19]; and populations with different diseases including Crohn disease (stress in Crohn: forecasting symptom transitions study), Li-Fraumeni syndrome (stress and LFS: a feasibility study of wearable technologies to detect stress in families with LFS), and patients with pancreatic and central nervous system (CNS) tumors (help enable real-time observations [HERO] in pancreatic [PANC] and CNS tumors studies) [20].

All of these studies were conducted by 4YouandMe—a US-based nonprofit (charitable) organization. 4YouandMe specializes in open-source research into the application of personal DHTs for health and wellness [20]. 4YouandMe has a particular focus on leveraging personal DHTs to empower the patient in navigating their unique disease or life transitional period. These 6 studies were included in this analysis as they reflect all of the completed studies by 4YouandMe at the time of this analysis. Characteristics of these studies can be found in Table 1 and additional methodological detail can be found in Multimedia Appendix 1. All studies involved the use of a bespoke study smartphone app built by 4YouandMe and the use of the Oura smart ring, the Garmin smartwatch, the Apple smartwatch, an Empatica smartwatch, and the Bodyport Cardiac Scale. Details of these devices can be found in Multimedia Appendix 2).

Table 1. Characteristics of included studies.
Study and populationSample sizeAge (years), median (IQR)Active study time (months)RecruitmentDevicesAverage (SD) app daily burdenCompensationEngagement strategy
Stress and recovery

Frontline health care workers36533.0 (28.0-42.0)4-6Remote: Social media and health care organization newsletters
  • App (iPhone only)
  • Oura ring
  • Garmin watch
5 (1.8) minutesNone (participants completing the study kept the wearable devices)
  • Biweekly check-in phone calls
  • Investigator-participant Zoom (Zoom Video Communications, Qumu Corporation) calls
  • Participant feedback-driven app changes
  • Coauthorship opportunity
Stress in Crohn

Patients with Crohn disease195 (MSSMa, N=139; Oxford, N=56)MSSM (median 29, IQR 24-37), Oxford (median 39, IQR 32-50)6-9In-clinic: through inflammatory bowel disease clinics
  • App (iPhone only)
  • Oura ring
  • Empatica Embrace
  • Bodyport scale
7.7 (1.0) minutesYes, participants could keep the ring or receive compensation based on points accumulated
  • Biweekly check-in phone calls
  • Participant feedback-driven app changes
HERO-CNSb

Patients with CNSc tumors1252 (43-56)7In-clinic: through cancer specialty clinics
  • App (iPhone or Android)
  • Oura ring
  • Garmin
5.3 (2.1) minutesNone (participants completing the study kept the wearable devices)
  • Biweekly check-in (phone or in-clinic)
HERO-PANCd

Patients with pancreatic cancer2657 (53-65)e1 to 14 monthsfIn-clinic: through cancer specialty clinics
  • App (iPhone or Android)
  • Oura ring
  • Garmin
  • Bodyport scale
3.1 (1.9) minutesNone (participants completing the study kept the wearable devices)
  • Biweekly check-in (phone or in-clinic)
Stress and LFSg

Affected and unaffected family members of a proband with LFS4939.0 (7.9-68.0)6In-clinic: through cancer specialty clinics
  • App (iPhone only)
  • Empatica EmbracePlus
2.3 (0.9) minutes

None
  • Biweekly check-in (phone or in-clinic)
BUMPh

Pregnant individuals (up to 15 weeks)52433.0 (30-36)Up to 12 monthsRemote: through patient-provider portals, social media, and community health clinics
  • App (iPhone or Android)
  • Oura ring
  • Garmin
  • Bodyport scale
5.0 (2.3) minutesYes, participants received compensation based on study points accumulated
  • Biweekly check-in phone calls
  • Investigator-participant Zoom calls
  • Participant feedback-driven app changes
BUMP-Ci

Individuals actively attempting to get pregnant27334.0 (31-36)Up to 6 monthsRemote: through patient-provider portals, social media, and community health clinics
  • App (iPhone or Android)
  • Oura ring
3.8 (2.0) minutesYes, participants could keep the ring or receive compensation
  • Biweekly check-in phone calls
  • Investigator-participant Zoom calls
  • Participant feedback-driven app changes

aMSSM: Mount Sinai School of Medicine.

bHERO-CNS: help enable real-time observations—central nervous system.

cCNS: central nervous system.

dHERO-PANC: help enable real-time observations—pancreatic cancer.

en=24, 2 unknown.

fUntil withdrawal, progression, death, or study completion (October 31, 2022).

gLFS: Li-Fraumeni syndrome.

hBUMP: Better Understanding the Metamorphosis of Pregnancy.

iBUMP-C: Better Understanding the Metamorphosis of Pregnancy—Conception.

Ethical Considerations

All included studies were approved by the local institutional research ethics boards (REB) at their local sites (Multimedia Appendix 1): stress and recovery in frontline health care workers study (institutional review board [IRB], Advarra [4UCOVID1901, Pro00043205]), BUMP study (IRB Advarra Pro00047893), stress in Crohn (Oxford site: Hampshire-A IRAS ID: 269286, Mount Sinai School of Medicine [MSSM] site: IRB of MSSM: GCO 19-1543 | IRB-19-02298), stress and LFS (Sick Kids: REB: 1000072240), HERO-CNS (John Hopkins Medicine IRB IRB00253818), and HERO-PANC (University Hospital Network REB: 20-5211).

Statistical Analysis

Definitions of adherence in digital health research studies are heterogeneous [3-6]. Consistent criteria for adherence across all included studies were attempted. While many different wearable features could be used as the basis for the use of the device, features that were most reliably monitored were selected. For studies using the Oura smart ring, daily adherence was defined as at least one sleep data event present for the prior night. The Oura ring was only expected to be worn at night for many of the included studies, which is why sleep data were used as the indicator for adherence. For studies using the Garmin smartwatch, daily adherence was defined as step data present for that day. For the Empatica smartwatch, daily adherence was defined as at least one data event (worn properly in a day). Adherence to the Bodyport Cardiac Scale was defined as the proportion of days where a weight event was present divided by the total number of expected follow-up days. Adherence to in-app task completion was defined as the proportion of tasks completed when prompted in the app divided by the total number of tasks that should have been completed over study follow-up. For example, all included studies had a daily survey. In a study with a minimum of 4 months of follow-up expected from participants, the total number of expected daily surveys is approximately 120. For a weekly app survey, the total number of expected surveys for a 4-month study follow-up would be 16. Adherence to biweekly check-in calls was defined as the proportion of calls completed divided by the total number of expected calls over study follow-up. Medians and ranges are described since the adherence distributions were nonnormally distributed. All adherence estimations were performed only among retained participants.

Differences in adherence and retention by sociodemographic characteristics were estimated using χ2, Fisher exact, Mann-Whitney U, and ANOVA tests where appropriate among studies that have sufficient sample sizes (stress and recovery, BUMP, and stress in Crohn). Survival probabilities using the Kaplan-Meier approach were calculated to display the probability of retention over the course of each included study. Retention (total proportion of participants completing the study among all enrolled) is also reported. Additional information on how retention was calculated for each unique study can be found in Multimedia Appendix 3.


Description of Included Studies

Study design characteristics of all studies are described in Table 1. All studies included the use of at least one wearable device plus a study app that involved daily, as well as intermittent surveys (daily question prompts, validated questionnaires) and active tasks (cognitive active or physical function tasks [eg, walk tests], video diaries). In all included studies, participants were required to use their own Android or iPhone smartphone for study activities. Recruitment mechanisms differed across studies with some including remote recruitment through digital advertisements on social media, professional organizations and newsletters, and patient portals (stress and recovery, and BUMP), while others recruited patients in-person through specialty clinics (stress in Crohn, HERO studies, and stress and LFS). The daily burden of app active tasks across studies ranged from 2 to 7 minutes. Study follow-up periods across studies ranged from 4 to 18 months. Across all studies except the stress and LFS study, participants were offered to keep some of the study wearable devices (most often the ring and the watch). Further, 2 studies included the option for modest financial compensation (BUMP and stress in Crohn).

All studies included an engagement strategy that centered around a biweekly phone check-in with a consistent engagement specialist that served the purpose of supporting participants, helping them with onboarding, resolving potential technological problems, and discussing and collecting study experience feedback. Additionally, all included studies implemented different strategies that focused on working with participants as co-designers. These strategies included making app changes that were driven by direct participant feedback during active follow-up, offering a “your data” section in the app that allowed participants to track key symptoms over time, hosting optional investigator-participant Zoom calls where participants could meet the study team, receive study updates, preliminary results, and could offer more feedback, and inviting participants to contribute to and be listed as coauthors on published work.

Adherence by Study Population

Median adherence in engagement phone check-in calls, wearable device use, daily app survey completion, and in-app active tasks can be found in Table 2. Median adherence varied across study populations. The stress in Crohn–MSSM site had a lower adherence on the engagement check-in calls (50%) compared to other studies, many of which had 100% adherence on these calls (Table 2). This study site is herein referred to as the low-engagement cohort. In this low-engagement cohort, median adherence to completing daily app surveys, to wearing the Empatica smartwatch, and to using the Bodyport Cardiac Scale were lower than all other study cohorts that included these studies’ activities (except the BUMP-postpartum cohort). Further, median adherence to using the Oura smart ring was lower in the low-engagement cohort compared to other cohorts except for the postpartum and severely ill cancer populations.

The HERO studies included the most severely ill participants including patients with active diagnoses of CNS and pancreatic tumors. Some HERO participants were undergoing chemotherapy, some had therapy-related complications, some had infections, and some had progressive, life-threatening tumor growth. While the total number of participants in these studies was low, these studies showed low adherence on the daily survey (<55%) and wearable device use (<65% HERO-CNS only). Interestingly, HERO-PANC participants exhibited high wearable device use median adherence (83.3%, IQR 51%-93.2%, Oura and 95.5%, IQR 75.2%-99.2%, Garmin), despite the health status of this population. Further, median adherence to in-app cognitive active tasks was higher among the HERO studies compared to most other studies. Engagement check-in call adherence was also high in the HERO studies. Among the BUMP postpartum cohort, there was consistently lower adherence on all study tasks except for the engagement check-in calls compared to other studies, particularly in comparison to the BUMP prenatal cohort. Specifically, median adherence to the Oura ring, Garmin smartwatch use, and the Bodyport Cardiac Scale in the BUMP-prenatal cohort compared to the BUMP postpartum cohort dropped from 87.2% (IQR 68.7%-96.7%) to 55% (IQR 5.5%-83.7%), 96.7% (IQR 82.9%-100%) to 62.5% (IQR 12.3%-96.4%), and 74.7% (IQR 52%-87.3%) to 33.1% (IQR 8.9%-67.7%), respectively (Table 2).

Table 2. Median adherence to study activities across studies.

Stress and recoveryBUMP-CaBUMPbBUMP-POSTcSINCd-MSSMeSINC-OxfordHERO-CNSfHERO-PANCgStress in LFSh
Participants, n297983793791175471945
ESi check-ins, median (IQR)75.0 (57.1-87.5)100.0 (87.9-100.0)100.0 (88.4-100.0)100.0 (100.0-100.0)50.0 (20.0-75.0)100.0 (90.9-100.0)85.7 (78.1-88.2)100.0 (100.0-100.0)60.0 (40.0-80.0)
Oura ring, median (IQR)97.0 (86.0-100.0)90.6 (76.3-97.7)87.2 (68.7-96.7)55.0 (5.5-83.7)80.5 (37.1-92.4)98.9 (94.0-99.6)42.3 (32.0-58.2)83.3 (51.0-93.2)j
Garmin watch, median (IQR)96.7 (82.9-100.0)62.4 (12.3-96.4)63.3 (54.7-64.3)95.5 (75.2-99.2)
Apple watch, median (IQR)98.1 (87.7-100.0)79.8 (32.4-96.3)
Empatica watch, median (IQR)26.0 (6.2-64.1)72.5 (37.1-96.8)86.8 (66.7-95.6)
Bodyport scale, median (IQR)74.7 (52.0-87.3)33.1 (8.9-67.7)38.5 (17.1-64.7)79.5 (52.7-88.4)
Daily survey, median (IQR)75.4 (57.2-88.2)42.4 (24.6-69.7)60.1 (34.4-81.7)18.4 (1.0-47.6)27.9 (10.4-51.9)70.3 (41.9-84.0)53.3 (47.8-71.5)49.1 (20.2-83.4)62.5 (40.96-82.59)
Reaction rime, median (IQR)88.9 (75.0-100.0)43.4 (24.3-72.8)30.4 (9.7-50.6)69.5 (46.6-89.3)59.0 (50.0-66.7)62.5 (20.9-86.6)
Trail making, median (IQR)88.9 (71.1-100.0)46.5 (24.0-73.7)28.7 (9.4-50.0)71.6 (45.0-87.3)61.5 (52.1-76.5)38.1 (4.2-76.2)57.7 (36.8-72.0)
EBTk, median (IQR)30.1 (16.2-54.1)44.6 (22.6-73.9)6.5 (0.0-33.3)23.1 (9.1-44.4)32.1 (0.0-58.6)
N-Back, median (IQR)51.4 (24.9-76.4)8.3 (0.0-44.4)
Gait task, median (IQR)25.0 (0.0-60.0)0.0 (0.0-0.0)24.5 (18.8-62.8)36.0 (2.2-74.0)
Walk test, median (IQR)14.3 (0.0-40.0)0.0 (0.0-0.0)23.1 (13.9-60.4)25.0 (7.8-49.5)
Video diary, median (IQR)4.3 (0.0-27.7)8.3 (0.0-50.0)0.0 (0.0-0.0)5.6 (0.0-22.2)9.4 (0.0-35.1)25.0 (8.7-77.1)0.0 (0.0-37.5)

aBUMP-C: Better Understanding the Metamorphosis of Pregnancy—Conception.

bBUMP: Better Understanding the Metamorphosis of Pregnancy.

cBUMP-POST: Better Understanding the Metamorphosis of Pregnancy—Postpartum.

dSINC: stress in Crohn.

eMSSM: Mount Sinai School of Medicine.

fHERO-CNS: help enable real-time observations—central nervous system.

gHERO-PANC: help enable real-time observations—pancreatic cancer.

hLFS: Li-Fraumeni syndrome.

iES: engagement specialist.

jNot available.

kEBT: emotional bias test.

Adherence by Study Activity

There were differences in adherence rates across different study activities. Adherence to wearable device use was consistently higher across studies compared to in-app activities, which is not surprising given the passive nature of these devices. Excluding the postpartum and HERO-CNS study, median adherence to Oura ring use was >80% across all studies, and as high as 99% (IQR 94.9%-99.6%; stress in Crohn-Oxford site; Table 2). There were also differences in adherence across specific wearable devices. Garmin and Apple smartwatch adherence was >95% in BUMP pregnant individuals and HERO-PANC participants, while median adherence for the Empatica Watch was lower among the studies that used this device (stress in Crohn-Oxford, 72.5%, IQR 37.1%-96.8%; stress in Crohn-MSSM, low-engagement cohort, 26%, IQR 6.2%-64.1%; and stress in LFS, 86.8%, IQR 0.7%-0.9%). Median adherence to the Bodyport Cardiac Scale was 74.7% (IQR 52%-87.3%) among BUMP pregnant individuals and 79.5% (IQR 52.7%-88.4%) in HERO-PANC participants (Table 2). Excluding the postpartum and HERO study populations and the low-engagement cohort, in-app daily survey adherence was >60% for all studies (Table 2). Finally, adherence to in-app active tasks was lower in general compared to other activities such as wearable device use or in-app surveys. Tasks that involved walking (gait and walk task) or speaking (video diaries) showed lower adherence compared to other active tasks (eg, cognitive and emotional bias tasks; Table 2).

Adherence by Study Recruitment and Engagement Strategy

There did not appear to be any meaningful difference in median adherence rates across study activities by study recruitment methods (in-clinic vs remote) or follow-up time. Further, 2 studies that included modest financial compensation in addition to engagement strategies showed higher adherence rates compared to some of the other studies (ie, BUMP and stress in Crohn), but the impact of compensation is difficult to disentangle from other study characteristics such as population differences, and these studies did not show superior adherence rates compared to the stress and recovery study that did not offer financial compensation.

Retention

The median proportion of participants retained in the study across the 6 studies was 77.2% (IQR 72.6%-88%; Table 3). The probability of staying in the study stayed above 80% for all studies during the first month of study participation and stayed above 50% for the entire active study period across all studies (Multimedia Appendix 4).

Table 3. Retention across studies.
StudyProportion retained at study completion, retained/enrolled (%)
Stress and recovery297/365 (81.4)
BUMP-Ca134/187 (72.7)b
BUMPc379/524 (72.3)
Stress in Crohn-MSSMd117/139 (84.2)
Stress in Crohn-Oxford54/56 (96.4)
HERO-CNSe7/12 (58.3)
HERO-PANCf19/26 (73.1)g
Stress and LFSh45/49 (91.8)

aBUMP-C: Better Understanding the Metamorphosis of Pregnancy—Conception.

bOnly includes participants who were enrolled in the Better Understanding the Metamorphosis of Pregnancy—Conception-specific app.

cBUMP: Better Understanding the Metamorphosis of Pregnancy.

dMSSM: Mount Sinai School of Medicine.

eHERO-CNS: help enable real-time observations—central nervous system.

fHERO-PANC: help enable real-time observations—pancreatic cancer.

gHelp enable real-time observations—pancreatic cancer has unique factors to consider when interpreting the proportion retained until study completion, since the study aimed to monitor patients until they developed progressive disease or died, or the study end date (October 31, 2022; see Multimedia Appendix 3).

hLFS: Li-Fraumeni syndrome.

Adherence and Retention by Participant Sociodemographic Characteristics

Median adherence for the Oura smart ring, a smartwatch (Garmin, Apple, and Empatica), and the Bodyport Cardiac Scale was lower among younger participants compared to older participants across most studies (Multimedia Appendix 5). Specifically, Oura smart ring adherence was significantly lower in those aged 18-25 years compared to those aged ≥26 years in the BUMP study (P=.03) and stress in Crohn-MSSM studies (P=.02), and was lower in the BUMP-C and stress and recover studies, but this difference was not statistically significant at P=.59 and P=.08, respectively. Median adherence for Apple smartwatch use was significantly lower in those aged 18-25 years compared to those aged ≥26 years in the BUMP study (P=.02), while median adherence for Garmin smartwatch use was lower but not statistically significant (P=.06). Median adherence for the Bodyport Cardiac Scale was significantly lower in those aged 18-25 years compared to those aged ≥26 years in BUMP (P<.005) and stress in Crohn-MSSM (P<.006).

In the BUMP study, Black or African American ethnicity had significantly higher median adherence to completing the in-app daily survey compared to other race or ethnicity groups (P=.01). This trend was observed in the stress and recovery study (P=.07) and the stress in Crohn-MSSM study (P=.24), although the difference was not statistically significant. In contrast, median adherence to Oura smart ring, smartwatch, and Bodyport Cardiac Scale use was lower among Black or African American individuals compared to other race or ethnicity groups, although these differences were not statistically significant (Multimedia Appendix 5).

Retention did not significantly differ by age group or gender (Multimedia Appendix 6).

Retention likelihood was significantly different by race or ethnicity groups in BUMP-C (P<.001) and BUMP (P=.001). Specifically, participants of White ethnicity were more likely to stay in the study in both BUMP-C and BUMP, while participants reporting their race or ethnicity as either unknown or not reporting this item were less likely to be retained (Multimedia Appendix 6).

Barriers to Engagement (Qualitative Synthesis of Participant Feedback)

Figure 1 describes key themes that impacted participant retention, adherence, and overall engagement that cut across all included studies. These themes include participant burden and forgetfulness, digital literacy, physical and mental barriers, personal and altruistic benefits, and privacy and confidentiality. Qualitative feedback from participants, research staff, and investigators across these 5 themes is summarized in Multimedia Appendix 7. The top three barriers to engagement in active study tasks were (1) participant burden and in particular fatigue with the repetitiveness of tasks; (2) physical or mental and situational barriers that prevented the ability to complete tasks; and (3) personal and altruistic benefit, namely the perception that the use of the personal DHTs was not personally useful for a health benefit or a lack of understanding as to why and how certain features (eg, heart rate variability) could be useful to track for health benefit. Qualitative feedback from participants in the 2 cohorts demonstrating lower adherence (HERO-PANC and BUMP post partum) suggested that while participants were highly engaged, they were either too ill, distracted, or tired to complete many of the study activities while navigating a serious illness or the early postpartum period.

Figure 1. Five key participant engagement barrier themes.

Principal Findings

Evidence across 6 unique and diverse studies involving the longitudinal use of personal DHTs supports that participant-centric engagement strategies aid in participant retention and maintaining good adherence in some populations. These strategies centered around (1) human contact with an engagement specialist as often as every 2 weeks, (2) investigator-participant meetings during active study follow-up, (3) offering returned symptom data in the app, (4) inviting participants to contribute as coauthors in published work, and (5) real-time modifications to the study app based on participant feedback.

In the majority of included studies, the probability of staying in the study stayed above 90% for the first month and stayed above 50% for active study periods for all studies. Lower retention or adherence was observed among studies that included a severely ill cancer population and a postpartum population. Barriers to participation in these cohorts were largely the result of physical and situational roadblocks. Excluding studies of a severely ill and postpartum population and the low-engagement cohort in the stress in Crohn study, adherence to Oura smart ring and Garmin smartwatch use was 80% and as high as 99% in some cohorts, while adherence to the Bodyport Cardiac Scale was 75% in a pregnant population. This supports that different populations can successfully be engaged in the use of active app assessments and wearable devices in the long term with adequate support.

Retention and adherence rates observed in these studies are higher than typically reported by other personal DHT research studies [7-9,12,13,21]. For example, a review of 8 large app-based DHT research studies in the United States reported that the probability of staying in the study dropped to or below 50% after the first 4 weeks of participation for all included studies [7]. Further, across the 8 included studies in this review, >50% of participants did not engage with the app for at least 7 days. Another large app-based study in the United States, the Warfighter Analytics Using Smartphones for Health study that collected daily active and passive app data reported a median retention of 45.2% (38/84 days), while the probability of staying in the study hit 50% at approximately 5.5 weeks [10]. A large app-based study in the United Kingdom (cloudy with a chance of pain study) involving daily active app assessments reported that 64% of participants fell into the low engagement or no engagement categories after baseline [12]. The RADAR study [14], a multinational study involving active and passive assessments from an app, and a Fitbit reported comparable retention results among participants with major depression to those reported here. This study reported a retention rate of 54.6% for 43 weeks of study participation; however, the probability of staying in the study stayed above 75% for the first several months of participation (~6 months). While the active app assessments in this study only included assessments every 2 weeks as opposed to daily assessments, this study additionally included aspects of a participant-centric design, which may have contributed to the higher reported retention [15].

Taken together, in comparison to other published personal DHT research studies, the 6 studies included in this paper reflect higher levels of engagement. Importantly, the included studies in this analysis involved high burden designs in comparison to other studies that request, for example, weekly or biweekly active tasks of participants [14] or only involve the use of a smartwatch. Specifically, across the included studies here, participants were expected to complete on average 4.6 (SD 1.62) minutes a day of app activities in addition to continuously using multiple wearable devices.

While different variations of participant-centric strategies were used across the 6 included studies, a key common feature was a biweekly check-in call with an engagement specialist. These calls served the purpose of providing support and building rapport with participants, working through onboarding and technological issues with study devices, tracking adherence, and receiving study-related feedback from participants. Numerous challenges arise in the conduct of remote, personal DHT research, and without frequent check-in and semiregular data monitoring by research staff, knowledge of these issues is a black box. The most significant drop in retention in personal DHT research studies tends to be during the first few weeks of participation [7]. These early onboarding weeks are crucial in working with participants to ensure they can get into a rhythm of participation. The passive sensing nature of personal DHTs has much potential to inform new objective measures of health, however, are not always intuitively understood as personally important for unique diseases (eg, heart rate variability or phone screen time). Personal DHT studies allow for “light touch” research approaches that enable data collection without traditional research coordinator contact, but this may come with a cost that inadvertently creates a less engaging study environment for participants and limits the opportunity to help participants understand the value in their participation. Of the included 6 studies, 1 cohort had much lower engagement on the check-in calls (50% adherence) compared to other included studies and, in turn, consistently demonstrated lower adherence to study-related activities. Still, even with extensive engagement designs, populations that had physical, mental, and situational barriers to study task completion (ie, severely ill, postpartum mothers) showed lower adherence to wearable device use and active smartphone tasks compared to other study populations. Top reported barriers to engagement included participant burden, physical, mental, and situational barriers, and low perceived value of personal DHTs for health care. These engagement barriers have been reported in previous literature [8,9] relating to DHT research and in the use of DHT interventions. However, the conveyed importance of the perceived value of the approach among participants in the current analysis is noteworthy. Given the foreign nature of personal DHTs for many individuals, particularly older populations, further work is needed to co-design and educate end users on the potential value of self-monitoring unique health-related data.

Irrespective of the engagement approach, adherence to in-app surveys and tasks was lower than wearable device use, which is not surprising given the higher burden related to in-app activities. The self-reported information captured from frequent or momentary in-app assessments is extremely valuable as context information. This context information or “label” data is useful for validating objectively captured information, yet remains the most difficult to capture in sufficient detail. Further, certain in-app activity adherences were consistently lower than others. Namely, activities that required the user to be active (walk in a straight line or complete a video diary) were low across studies. Still, adherence to daily in-app surveys was >60% for all studies excluding the postpartum and HERO study populations.

Limitations

This quantitative and qualitative analysis compared observational data across different digital health studies. However, no true comparison cohort that did not include engagement strategies was included. Therefore, the inferred casualty of participant check-ins with engagement specialists on retention and adherence rates cannot be not concluded. We are formally testing whether the biweekly check-in significantly increases adherence and retention in an ongoing study with an appropriate comparison arm without check-in support (NCT05753605). One of the included studies (stress and recovery) was conducted during the early 2020 COVID-19 pandemic. There is some evidence that engagement in research was higher during the early pandemic time periods [22]. It cannot be ruled out that the higher observed retention and adherence in this study compared to others was not due to this potential time period bias. The stress in the Crohn-Oxford site included a population of patients some of whom were already engaged in the use of web-based monitoring of symptoms. In turn, this could have contributed to the high retention and higher adherence observed at this site compared to the other stress in the Crohn-MSSM site. The results presented on barriers to engagement were primarily qualitative and collected from conversations with participants, research staff, and investigators across studies.

Conclusions

Globally, mobile apps are used for a variety of purposes in everyday life, while the use of smartwatches for activity monitoring is gaining increasing popularity. However, the use of these tools for health remains a challenge. These findings support that human support via phone and other participant-centric engagement strategies centered on giving back to participants and working with them as co-designers can support sufficient retention and adherence in personal DHT research across diverse populations. This has implications for the utility and potential necessity of a digital support worker in digital health care, as highlighted by others [23]. A power of personal DHTs is enabling the patient to be in control of their health through self-monitoring, but this new role comes with a responsibility. This important shift in role from doctor to patient outlines how crucial it is to include patients in the early design phase of personal DHT health research. Further work is needed to inform app designs that support habitual forming activities around task completion so that app-related activities become a part of participants’ daily routine and are perceived as personally valuable.

Acknowledgments

The stress and recovery study was supported in part by the Bill & Melinda Gates Foundation (INV-016651). The stress in Crohn study was funded by the Leona M. and Harry B. Helmsley Charitable Trust (1911-03376). The help enable real-time observation (HERO)–central nervus system study was funded by the Mark Foundation for Cancer Research through an ASPIRE award (19-024-ASP). The HERO–pancreatic cancer study was funded by the Mark Foundation for Cancer Research through an ASPIRE award (19-024-ASP), Pancreatic Cancer Canada, the Princess Margaret Cancer Foundation, and 4YouandMe. The Better Understanding the Metamorphosis of Pregnancy (BUMP) study was funded by 4YouandMe and Sema4 along with supplemental in-kind contributions from coalition partners (Evidation Health, Vector Institute, Cambridge Cognition, and Bodyport). The stress and LFS study was funded by in-kind contributions from 4YouandMe, SickKids Hospital, and the Vector Institute.

Conflicts of Interest

CB is a consultant for Depuy Synthes, Bionaut Labs, Galectin Therapeutics, Haystack Oncology, and Privo Technologies. CB is a cofounder of Belay Diagnostics and OrisDx. DRK is an officer, employee, and shareholder of MindMed; a consultant at Tempus, Nightware, and Limitless; and board member of Sonara. RPH is an advisory board member at Bristol Meyers Squibb. MH is an advisory board member for Servier, AnHeart, and Bayer; steering committee member for Novartis; honoraria from Novartis; data safety monitoring committee member for Advarra and Parexel. RG received a graduate scholarship from Pfizer and provided consulting or advisory roles for Astrazeneca, Tempus, Eisai, Incyte, Knight Therapeutics, Guardant Health, and Ipsen. The others declare no conflicts of interest.

Multimedia Appendix 1

Study descriptions.

DOCX File , 20 KB

Multimedia Appendix 2

Study wearable devices.

DOCX File , 16 KB

Multimedia Appendix 3

Retention calculations.

DOCX File , 28 KB

Multimedia Appendix 4

Probability of retaining in the study across studies.

DOCX File , 134 KB

Multimedia Appendix 5

Median adherence to study activities stratified by sociodemographic characteristics.

DOCX File , 28 KB

Multimedia Appendix 6

Sociodemographic differences in participants who were retained versus not retained.

DOCX File , 28 KB

Multimedia Appendix 7

Qualitative feedback from participants, research staff, and investigators surrounding barriers to engagement in digital health research, summarized across 6 unique studies.

DOCX File , 20 KB

  1. Friend SH, Ginsburg GS, Picard RW. Wearable digital health technology. N Engl J Med. 2023;389(22):2100-2101. [CrossRef] [Medline]
  2. Goodday SM, Geddes JR, Friend SH. Disrupting the power balance between doctors and patients in the digital era. Lancet Digit Health. 2021;3(3):e142-e143. [FREE Full text] [CrossRef] [Medline]
  3. Ng MM, Firth J, Minen M, Torous J. User engagement in mental health apps: a review of measurement, reporting, and validity. Psychiatr Serv. 2019;70(7):538-544. [FREE Full text] [CrossRef] [Medline]
  4. White KM, Williamson C, Bergou N, Oetzmann C, de Angel V, Matcham F, et al. A systematic review of engagement reporting in remote measurement studies for health symptom tracking. NPJ Digit Med. 2022;5(1):82. [FREE Full text] [CrossRef] [Medline]
  5. Daniore P, Nittas V, von Wyl V. Enrollment and retention of participants in remote digital health studies: scoping review and framework proposal. J Med Internet Res. 2022;24(9):e39910. [FREE Full text] [CrossRef] [Medline]
  6. Chan A, Chan D, Lee H, Ng CC, Yeo AHL. Reporting adherence, validity and physical activity measures of wearable activity trackers in medical research: a systematic review. Int J Med Inform. 2022;160:104696. [FREE Full text] [CrossRef] [Medline]
  7. Pratap A, Neto EC, Snyder P, Stepnowsky C, Elhadad N, Grant D, et al. Indicators of retention in remote digital health studies: a cross-study evaluation of 100,000 participants. NPJ Digit Med. 2020;3:21. [FREE Full text] [CrossRef] [Medline]
  8. Simblett S, Matcham F, Siddi S, Bulgari V, Barattieri di San Pietro C, Hortas López J, et al. RADAR-CNS Consortium. Barriers to and facilitators of engagement with mHealth technology for remote measurement and management of depression: qualitative analysis. JMIR mHealth uHealth. 2019;7(1):e11325. [FREE Full text] [CrossRef] [Medline]
  9. O'Connor S, Hanlon P, O'Donnell CA, Garcia S, Glanville J, Mair F. Understanding factors affecting patient and public engagement and recruitment to digital health interventions: a systematic review of qualitative studies. BMC Med Inform Decis Mak. 2016;16(1):120. [FREE Full text] [CrossRef] [Medline]
  10. Li SX, Halabi R, Selvarajan R, Woerner M, Fillipo IG, Banerjee S, et al. Recruitment and retention in remote research: learnings from a large, decentralized real-world study. JMIR Form Res. 2022;6(11):e40765. [FREE Full text] [CrossRef] [Medline]
  11. Goodday SM, Karlin E, Alfarano A, Brooks A, Chapman C, Desille R, Stress And Recovery Participants, et al. An alternative to the light touch digital health remote study: the stress and recovery in frontline COVID-19 health care workers study. JMIR Form Res. 2021;5(12):e32165. [FREE Full text] [CrossRef] [Medline]
  12. Druce KL, McBeth J, van der Veer SN, Selby DA, Vidgen B, Georgatzis K, et al. Recruitment and ongoing engagement in a UK smartphone study examining the association between weather and pain: cohort study. JMIR mHealth uHealth. 2017;5(11):e168. [FREE Full text] [CrossRef] [Medline]
  13. Reade S, Spencer K, Sergeant JC, Sperrin M, Schultz DM, Ainsworth J, et al. Cloudy with a chance of pain: engagement and subsequent attrition of daily data entry in a smartphone pilot study tracking weather, disease severity, and physical activity in patients with rheumatoid arthritis. JMIR mHealth uHealth. 2017;5(3):e37. [FREE Full text] [CrossRef] [Medline]
  14. Matcham F, Leightley D, Siddi S, Lamers F, White KM, Annas P, et al. RADAR-CNS consortium. Remote Assessment of Disease and Relapse in Major Depressive Disorder (RADAR-MDD): recruitment, retention, and data availability in a longitudinal remote measurement study. BMC Psychiatry. 2022;22(1):136. [FREE Full text] [CrossRef] [Medline]
  15. Zhang Y, Pratap A, Folarin AA, Sun S, Cummins N, Matcham F, et al. Long-term participant retention and engagement patterns in an app and wearable-based multinational remote digital depression study. NPJ Digit Med. Feb 17, 2023;6(1):25. [CrossRef] [Medline]
  16. How we engage the public. Wellcome. URL: https://wellcome.org/what-we-do/public-engagement [accessed 2024-02-02]
  17. Canada's strategy for patient-oriented research. Canadian Institutes of Health Research. URL: https://cihr-irsc.gc.ca/e/44000.html [accessed 2024-02-02]
  18. NIHR launches new centre for engagement and dissemination. National Institute of Health and Care Research. URL: https://www.nihr.ac.uk/news/nihr-launches-new-centre-for-engagement-and-dissemination/24576 [accessed 2024-02-02]
  19. Goodday SM, Karlin E, Brooks A, Chapman C, Karlin DR, Foschini L, et al. Better Understanding of the Metamorphosis of Pregnancy (BUMP): protocol for a digital feasibility study in women from preconception to postpartum. NPJ Digit Med. Mar 30, 2022;5(1):40. [CrossRef] [Medline]
  20. 4YouandMe. URL: http://4youandme.org [accessed 2023-06-07]
  21. Nowell WB, Curtis JR, Zhao H, Xie F, Stradford L, Curtis D, et al. Participant engagement and adherence to providing smartwatch and patient-reported outcome data: digital tracking of rheumatoid arthritis longitudinally (DIGITAL) real-world study. JMIR Hum Factors. 2023;10:e44034. [FREE Full text] [CrossRef] [Medline]
  22. Vargo D, Zhu L, Benwell B, Yan Z. Digital technology use during COVID-19 pandemic: a rapid review. Human Behav and Emerg Tech. Dec 28, 2020;3(1):13-24. [CrossRef]
  23. Perret S, Alon N, Carpenter-Song E, Myrick K, Thompson K, Li S, et al. Standardising the role of a digital navigator in behavioural health: a systematic review. Lancet Digit Health. 2023;5(12):e925-e932. [FREE Full text] [CrossRef] [Medline]


BUMP: Better Understanding the Metamorphosis of Pregnancy
CNS: central nervous system
DHT: digital health technology
HERO: help enable real-time observation
IRB: institutional review board
LFS: Li-Fraumeni syndrome
MSSM: Mount Sinai School of Medicine
PANC: pancreatic cancer
REB: research ethics board


Edited by G Eysenbach, T de Azevedo Cardoso; submitted 06.03.24; peer-reviewed by C Godoy Jr; comments to author 05.04.24; revised version received 12.04.24; accepted 29.05.24; published 03.09.24.

Copyright

©Sarah M Goodday, Emma Karlin, Alexa Brooks, Carol Chapman, Christiana Harry, Nelly Lugo, Shannon Peabody, Shazia Rangwala, Ella Swanson, Jonell Tempero, Robin Yang, Daniel R Karlin, Ron Rabinowicz, David Malkin, Simon Travis, Alissa Walsh, Robert P Hirten, Bruce E Sands, Chetan Bettegowda, Matthias Holdhoff, Jessica Wollett, Kelly Szajna, Kallan Dirmeyer, Anna Dodd, Shawn Hutchinson, Stephanie Ramotar, Robert C Grant, Adrien Boch, Mackenzie Wildman, Stephen H Friend. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 03.09.2024.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.