Abstract
Background: Stress is widespread and carries substantial mental health, social, and economic burdens. Yet, access to clinician-led stress management remains constrained by service capacity, cost, and stigma. In response, artificial intelligence (AI)–enabled tools have rapidly proliferated as scalable, self-directed options. However, evidence on how these systems support stress management outside formal clinical settings remains fragmented.
Objective: This systematic review aimed to synthesize empirical evidence on how AI-enabled technologies are used for self-directed stress management. We mapped the emerging functions of these tools, the psychological frameworks informing their design, the populations and settings studied, and the outcomes reported.
Methods: We conducted a PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses)–compliant systematic review of English-language studies published between 2000 and 2025. Six databases were searched (APA PsycINFO, PubMed, MEDLINE, Scopus, Web of Science Core Collection, ProQuest, and Google Scholar).
Results: Of 3008 records identified, 35 studies met the inclusion criteria. The methodological quality of included studies was critically appraised using the Mixed Methods Appraisal Tool (version 2018). Findings illustrated that AI-supported stress management can operate through 5 core functions, including psychological intervention, behavioral support, psychoeducation, companionship, and emotional support, and stress monitoring, detection, and triage. Across the reviewed studies, these functions supported self-directed stress management by helping users identify stress, regulate responses, and engage in coping outside formal clinical care.
Conclusions: AI-enabled systems show preliminary promise for supporting self-directed stress management through multiple user-facing functions grounded in established psychological frameworks.
Trial Registration: PROSPERO CRD420251135780; https://www.crd.york.ac.uk/PROSPERO/view/CRD420251135780
doi:10.2196/90709
Keywords
Introduction
The Emerging Roles of Artificial Intelligence in Self-Directed Stress Management
Overview
From minor hassles to major crises, everyday life is saturated with stress, commonly understood as a state of worry or mental tension caused by difficult situations []. Amidst the COVID-19 pandemic, stress symptomatology was estimated to affect up to 25% of individuals [,]. Global estimates indicate that stress prevalence ranges from approximately 13% to nearly 40% across population-level surveys and meta-analytic studies, with approximately 1 in 3 adults reporting daily stress and most countries showing worsening trajectories over the past decade [-]. These figures are often magnified in at-risk groups, such as informal caregivers. Meta-analytic evidence indicates that informal caregivers face a substantial mental health burden, with many reported to meet diagnostic criteria for mental health conditions, and nearly half experience clinically significant caregiver burden [,]. Therefore, stress can act as a symptom of existing difficulties and as a risk factor that worsens long-term mental health trajectories. However, scalable and flexible support that can accommodate diverse needs remains limited by both system-level and individual-level barriers. For instance, constrained service capacity means that groups such as students experience high stress but low uptake of treatment [,]. Meanwhile, financial, logistical, and stigma-related obstacles disproportionately impede access to clinical and counseling care, contributing to poorer mental health outcomes [-].
In light of this, self-directed stress management has gained traction as a practical alternative. Such approaches parallel self-help interventions, defined as standardized, evidence-based programs that participants implement independently of professional therapists or guides [,]. These self-directed solutions can enhance accessibility while offering greater autonomy and convenience, especially for those facing structural or social barriers to conventional care. Digital mental health interventions targeting youth have shown potential to achieve sustainable improvements in stress and anxiety outcomes by providing personalized, accessible, and culturally responsive resources. However, high dropout rates, limited long-term data, and the inherently low accountability of self-directed digital interventions [,] identify a clear need for novel strategies for supporting continued use and intervention efficacy.
AI and Self-Directed Stress Management Interventions
Given these limitations, artificial intelligence (AI) offers capabilities to address gaps in current self-directed stress management programs. Large language models (LLMs), natural language processing (NLP), and related machine learning approaches have become increasingly ubiquitous in many sectors. In this context, a key strength of these intelligent models is their ability to interpret complex user data and translate it into tailored, context-specific responses [-]. Leveraging these capabilities could enable more accessible and efficient delivery of stress management solutions that are responsive to individual needs. In broader health care contexts, AI has been proposed and increasingly applied to support personalized monitoring, reduce selected workflow burdens, and assist routine tasks [-]. These precedents suggest that AI-enabled models can harness similar efficiencies for independent stress self-management.
At the intervention level, workplace internet-based stress management interventions have demonstrated substantial reductions in perceived stress [] as well as cost-effective benefits [,]. AI-based systems further enhance these interventions by offering greater scalability, consistent availability, and anonymity [-], which may lower barriers for individuals who are hesitant to engage in face-to-face stress management programs. Conversational agents exemplify this potential by enabling dialogue that users experience as more therapeutic, personalized, and autonomous [,,]. Some evidence suggests that users may be more willing to disclose sensitive information to nonhuman agents, partly because of reduced perceived judgment [,-]. In addition, intelligent conversational models do not require ongoing clinician involvement, allowing them to scale far more efficiently than human-delivered or clinician-supported digital interventions [-]. Collectively, these capabilities indicate that AI-driven self-directed stress management tools can extend beyond static guidance to provide adaptive, ongoing support that responds to technological advances and broader societal needs for accessible, flexible, and scalable approaches to stress management.
Applications of AI for Self-Directed Stress Management
One prominent application of AI in this context is the use of physiological and behavioral signals to monitor stress. Specifically, stress monitoring is based on measuring physiological changes and comparing them with baseline metrics [-]. The symptomatology of stress involves consistent and predictable physiological changes, such as heart rate variability; AI may be used to detect and model these physiological signs to infer when an individual is under stress []. Machine learning models trained on such health data can classify stress states with promising accuracy [] and, when embedded in wearable or smartphone-based sensors, could provide continuous, unobtrusive monitoring across daily contexts [,]. This addresses a core limitation of traditional self-report measures, which capture only brief, time-bound snapshots of experience [,]. Related literature on digital phenotyping shows that passive smartphone-sensor data can be used to infer behavioral and symptom patterns, enabling the matching of individuals to mental health applications that suit specific needs [,], offering a route to more personalized and timely self-help support.
Beyond monitoring physiological symptoms, AI can also support behavior change processes that reduce stress in the long term. Intelligent systems are increasingly used to deliver habit tracking tools, adaptive nudges, and tailored coping suggestions, drawing on broader developments in digital behavior change technologies [-]. In this context, “user-centered” and “person-based” design approaches have become central to sustaining behavioral patterns in similar interventions []. In practice, virtual therapists and chatbots may offer relatively discreet spaces, reducing perceived stigma and the sense of judgment that often deters help-seeking [,]. Empathy-oriented conversational agents take this further by analyzing user sentiment and generating emotionally attuned responses []. Such intelligent dialogue platforms have shown promise in reducing student stress through supportive, user-centric conversation and increased perceived trust in the conversational agent [-].
At the same time, the broader adoption of AI in mental health raises substantial concerns, as generative and predictive models are inherently nondeterministic and may produce unreliable or erroneous outputs. Such risks are compounded when these probabilistic models are trained on noisy, incomplete, or low-fidelity data from consumer-grade devices unable to match the reliability of clinical-grade systems [,]. Data privacy concerns are especially salient for multimodal AI models that rely on facial images, voice recordings, or other sensitive user data [,]. Under regulations such as the Health Insurance Portability and Accountability Act (HIPAA), patient health information held by covered entities is strongly protected; however, most existing regulatory and data protection frameworks have yet to fully consider how AI developers and vendors collect, use, and share mental health–related data [-]. In practice, many of these systems are trained on datasets that are limited in scale and diversity, with systematic underrepresentation of minority groups [-]. This pattern of limitations systematically biases model predictions and performance for users in marginalized populations, potentially exacerbating existing mental health care inequalities [,,-]. As a result, appraising the value and impact of AI-driven interventions for stress management requires rigorous scrutiny of how these systems are conceptualized, designed, trained, and evaluated for different population needs.
This Research
Despite rapid growth in AI-enabled mental health tools, evidence on AI for self-directed stress management remains fragmented. Existing studies vary widely in the types of AI used, the theoretical frameworks guiding development, and the stress-related functions targeted (eg, monitoring, psychoeducation, and behavior change), and there has been no systematic synthesis focused on autonomous use outside formal clinical settings. This review addressed this gap through a systematic examination of empirical studies published between 2000 and 2025 that used AI-integrated platforms independently by individuals to cope with stress outside formal clinical care. Specifically, we identified the main categories of AI tools evaluated to date and noted the populations and settings they target. Finally, we analyzed the functions by which these tools support self-directed stress coping. In doing so, we aimed to provide an integrated overview of existing approaches and lay a foundation for future work on responsible, user-centered AI in self-directed stress management.
Methods
Transparency and Openness
We conducted this systematic review in accordance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines () []. The review protocol, including the search strategy and planned synthesis framework, was submitted for registration on PROSPERO (registration ID: CRD420251135780). Furthermore, citation management, record deduplication, and metadata extraction were conducted using Zotero version 7.0.24 (Corporation for Digital Scholarship) []. Relevant files, namely screening records and extraction data used in this work, will be made publicly available upon publication.
Search Strategy
A systematic search strategy was jointly developed by the first 2 authors (MKGR and SSMT) and conducted across 6 electronic databases: APA PsycINFO, PubMed or MEDLINE, Scopus, Web of Science Core Collection, ProQuest, and Google Scholar. The long-string series of keywords, (“artificial intelligence” OR “A.I.” OR AI OR GPT OR chatbot OR “large language model” OR LLM*) AND (stress OR “stress management” OR “stress response*” OR (“self-directed” AND stress*) OR “de-stress*” OR “self-care”), was applied to APA PsycINFO, PubMed, Scopus, and Web of Science. All records were extracted from these 4 databases. For supplementary searching in ProQuest and Google Scholar, a shortened query with the same Boolean structure was used, namely (“artificial intelligence” OR AI OR GPT OR LLM OR chatbot) AND (stress OR “stress management” OR “stress response*” OR “de-stress*"). Only the first 100 results were retrieved based on relevance. Records were limited to studies published in English from 2000 to 2025 that involved human participants, including early online publications where available. Exclusion filters were applied to exclude medical, biological, and clinical trial domains unrelated to self-directed stress management. The search and extraction of records were conducted on September 4, 2025.
Screening Criteria and Calibration
This review used a 2-stage screening process consisting of (1) title and abstract screening and (2) full-text screening. Both the first and second authors independently reviewed studies from all 6 sources, with a focus on human subjects who independently use AI or AI-enabled agents for personal stress management. We excluded records that involved AI and stress management in purely clinical settings, AI use in non–stress-related settings, and non–human-related stress (eg, stress on crop rotation). Records were retained according to the following inclusion criteria:
- The record was written in English.
- The record was unique and not a duplicate of any prior entry.
- The record was not retracted or withdrawn at the time of screening.
- The record explicitly addressed self-directed or personal stress management, rather than clinician-led, medically supervised, or biomarker-based clinical interventions.
- The record provided sufficient information to obtain full-text access.
A trial set of 100 records was independently screened to calibrate inclusion decisions and ensure consistency. Following this calibration, the remaining records were screened independently. Records with insufficient abstract detail but retrievable full text were advanced, whereas records with no abstract and insufficient retrievable information were excluded. Records meeting inclusion criteria at the title and abstract level proceeded to full-text review. Similarly, at this subsequent full-text screening stage, a trial set of 50 records was undertaken to ensure alignment, after which the same inclusion and exclusion criteria as those used in title and abstract screening were applied, with the addition of the following criteria:
- The record was peer reviewed and published.
- The record did not pertain to interventions delivered in conventional clinical mental health settings.
- The record explicitly involved AI use or AI-assisted technology (eg, chatbots, LLMs, and AI-enabled apps) for stress self-management.
- The record noted that AI use was “self-directed,” ie, initiated and managed by the user without clinician or therapist oversight.
- The record presented empirical data (quantitative, qualitative, or mixed methods) and collected data from human participants.
Data Extraction and Synthesis
Data extraction was conducted independently by both the first and second authors, with discrepancies resolved through discussion. As the included records comprised both qualitative and quantitative studies, thematic analysis was used to identify and categorize recurring themes in how AI supports stress management. This approach was appropriate because it enabled findings from qualitative, quantitative, and mixed method studies to be integrated into a coherent whole, consistent with guidance for mixed methods systematic review []. An exploratory coding procedure was applied to identify recurring patterns, emerging categories, and overlapping conceptual discussions within the agreed extraction criteria. A trial set of 20 records was screened to ensure consistency of data for extraction. When consolidating outcome data across heterogeneous study designs, we prioritized textual descriptions of reported effects; where quantitative results were available, these were also mapped onto a common "direction of effect" rubric (improvement, no or minimal change, and worsening), following recommendations for thematic synthesis in systematic reviews of mixed evidence [,].
To synthesize data from both qualitative and quantitative records, we followed Thomas and Harden’s [] procedure for thematic synthesis. After confirming the final set of included records, both first and second authors reviewed all records again to develop codes for the text and descriptive themes. We initially analyzed each record against our full-text inclusion criteria, focusing on the forms of AI used to manage stress, the notable frameworks on which AI-enabled agents were trained to provide help, and the significant or nonsignificant results of interventions. The first and second authors then discussed our preliminary findings to produce a list of themes that address the focus of our research: how AI models support stress management. Themes were internally refined to ensure coherence and clear differentiation. Both the first and second authors independently reviewed the final included records and categorized them. In addition, following the data extraction protocols of Aromataris et al [] for umbrella review, the following were also extracted from each record: (1) author and year of publication, (2) journal or publication, (3) country (of sample), (4) study type, (5) study pool, (6) sample size, (7) mean age (years), (8) age range (years), (9) intervention name and description, (10) intervention framework, and (11) key findings.
Results
Overview of Search and Inclusion Results
The search string yielded 3008 records (). The full list of all unique records identified and screened (N=2810), including those advanced to full-text review (n=173) and excluded full-text articles with reasons (n=138), is available in . After removing duplicates (198/3008, 6.58%), 2810 (93.42%) unique records remained. This review proceeded in 2 phases. In the first phase, both authors reviewed the titles and abstracts of all 2810 records to determine eligibility for the full-text phase based on the criteria outlined in the “Methods” section. A total of 2637 (93.84%) records were excluded after title and abstract screening, retaining 173 (6.16%) records for full-text screening. Specifically, 27 (1.02%) records lacked an abstract, 354 (13.45%) were clinical studies not explicitly related to stress self-management, 3 (0.11%) were not in English, and 2253 (85.44%) were outside the scope of this review. In the second phase, the 173 included records were reviewed in full by both authors. During this full-text screening phase, 63 (36.42%) records were removed for not being peer reviewed or in preprint, 8 (4.62%) records were studies set in purely clinical settings (ie, set in conventional therapeutic settings), 13 (7.51%) records involved the presence of a clinician or AI use for stress relief that was co-administered with clinical treatment, 40 (23.12%) records had AI use not for stress management, and 5 (2.89%) records did not have independent use of AI tools for stress management. During the data extraction and thematic analysis, a further 9 (5.20%) records were removed due to insufficient data. A total of 35 (20.23%) records were included in the final review. The overall agreement rates for title and abstract screening (2426/2810, 86.30%) and for full-text screening (138/173, 79.77%) were adequate, with data extraction agreement at >70%. Disagreements were resolved at the end of each stage through discussion with both authors.

Overview of Included Studies
This review synthesized 35 studies, including 3 (8.57%) qualitative studies, 11 (31.43%) randomized controlled trials, 6 (17.14%) nonrandomized quantitative studies, 4 (11.43%) quantitative descriptive studies, and 11 (31.43%) mixed methods studies. These included studies examined AI-enabled tools for self-directed stress management across chatbots, mobile apps, embodied agents, and other digital platforms. summarizes the key characteristics of the included records, including study-level information, intervention frameworks, AI architectures used, and sample characteristics where reported. Sample characteristics include the study pool or demographic focus of each sample (eg, employed nurses, fathers, and university students), as well as sample size, mean age, and age range. AI architecture classifications for each study (eg, LLM-based, rule-based, and hybrid) are also included in .
| Author, year of publication | Journal or publication | Country/Region | Study type | Study pool | Sample size | Mean age (y) | Age range (y) | Intervention framework | AI architecture used | Key findings |
| Alanazi et al, 2025 [] | Scientific Reports | Saudi Arabia | Mixed methods | Children with Down syndrome and caregivers of children with Down syndrome | N=130 (n=7 and n=123) | — | 3‐18 (children) and ≥18 (caregivers) | Mixed (CBT-, occupational-, and behavior-oriented components) | Several multimodal AI systems, combining computer vision, NLP, and machine learning | Moderate-large improvements were observed in children’s mobility (d≈0.65), communication (d≈0.72), and daily living or household tasks (d≈0.83); carers reported reduced caregiving stress and greater child independence, engagement, and confidence, although adoption was limited by cost, lack of support, and technical issues, with qualitative reports also indicating subjective stress reduction. |
| Alanezi, 2024 [] | Journal of Multidisciplinary Healthcare | Saudi Arabia | Qualitative | Mixed adult population | n=24 | — | ≥18 | CBT-based psychoeducation | ChatGPT-3.5 or / LLM-based conversational AI | Participants reported using AI mental health tools primarily for psychoeducation (~60%), emotional support, and feeling heard (~50%), goal setting or motivation, and accessing mental health resources; perceived benefits included CBT-like reframing, coping guidance, and self-monitoring, while concerns centered on limited assessment accuracy, data privacy or ethics, and cultural-linguistic mismatch. |
| Allan, 2024 [] | Behaviour and Information Technology | New Zealand | Quantitative nonrandomized | SME owners and managers | N=31 (n=14; n=17) | 36.9 | 21‐57 | Stress Mindset Theory [] | Rule-based decision-tree chatbot with keyword matching | A brief conversational agent intervention produced very large gains in a “stress-is-enhancing” mindset (eg, SMM scores approximately doubled, d≈2.19) and large improvements in self-reported work performance (d≈1.59), with very high engagement (100% adherence and 86% wanting continued access) and participants attributing productivity gains to stress-reframing processes. |
| Baek and Cha, 2025 [] | Worldviews on Evidence-Based Nursing | South Korea | Quantitative randomized controlled trials | Employed nurses | N=120 (n=40; n=40; n=40) | 31.55; 31.88; 32.05 | ≥18 | WHO-based program integrating mindfulness, ACT, storytelling, and laughter therapy | AI recommendation algorithm using similarity scoring and neural-network updating | Across 4 weeks, the AI-supported intervention (and especially self-selected use) produced significant reductions in client-related and personal burnout and stress responses, with job stress decreasing in all conditions; stress-response reductions were strongest in the AI and self-selection groups, but not evident in the minimal-control group. |
| Cho et al, 2024 [] | Journal of Medical Internet Research | South Korea | Quantitative nonrandomized | Employed nurses | N=603 (n=293; n=10; n=300) | 32.81 | ≥18 | WHO-based mindfulness, ACT, storytelling, laughter therapy modules | Collaborative-filtering recommender with artificial neural network updates | Burnout, job stress, and stress responses significantly decreased from baseline through posttest assessments (P<.001), with multicomponent modules (mindfulness, ACT, storytelling, laughter therapy) each improving different burnout facets and stress indices; satisfaction increased as the AI optimization progressed, and reductions in job stress and stress response were already evident after 2 weeks. |
| Fuhrmann et al, 2025 [] | Scientific Reports | Germany | Quantitative randomized controlled trials | Mixed adult population | N=166 (n=56; n=56; n=54) | 25.39; 23.25; 23.76 | 18‐60 | CBT-informed stress management (HRV biofeedback+CBT elements) | Sensor-based signal processing using accelerometer-derived HR biofeedback and facial emotion recognition | Both versions of the stress management app improved perceived stress, emotion regulation, and well-being, but the HR-based biofeedback version yielded significantly greater reductions in perceived stress than the waitlist control (significant time× condition interaction on PSS), with high usability ratings. |
| Gabrielli et al, 2021 [] | JMIR mHealth and uHealth | Italy | Mixed methods | University students | N=71 | 20.6 | 18‐34 | CBT, mindfulness, positive psychology | Scripted rule-based chatbot with predefined response options | Use of the app was associated with significant decreases in perceived stress (PSS-10) and severe anxiety symptoms, alongside improvements in mindfulness facets (describing, nonjudging); stress levels shifted from high to low categories over time, with moderate engagement (≈78 interactions; 58% completion). |
| Hiller et al, 2025 [] | JMIR mHealth and uHealth | Germany | Qualitative | Mixed adult population | N=61 (n=27; n=34) | 30.5; 20.38 | 18‐59 | Ecological momentary intervention or just-in-time adaptive intervention with CBT and ACT components | Predictive time-series machine learning for adaptive EMI delivery | A 4-week digital intervention significantly improved well-being and reduced perceived stress and depressive symptoms; qualitative feedback highlighted increased mental health awareness, the ability to identify stress triggers, and the value of ongoing structure, support, and emotional guidance. |
| Huang et al, 2022 [] | International Journal of Social Robotics | Taiwan | Quantitative nonrandomized | Mixed adult population | N=122 (n=23; n=40; n=59) | 25; 22.1; 23.87 | — | Study-specific human-robot Interaction framework (ability, appropriateness, and validity) | Hybrid NLP system using knowledge graph, Bayesian network, neural network, and AIML chatbot | Participants rated the robot’s stressor inference accuracy highly (~4.4/5) and judged its emotional support responses as more appropriate than those of a baseline chatbot; they perceived the agent as understanding their feelings, providing valid and reliable emotional support, and making self-disclosure easier. |
| Indrayanti et al, 2025 [] | SSM–Mental Health | Indonesia | Mixed methods | University students | N=32 (n=16; n=16) | 20.21 | 19‐22 | PFA | GPT-3 chatbot with NLP-based sentiment detection | Compared with controls, the PsyBot condition significantly reduced loneliness and improved well-being over 2 weeks, indicating that the chatbot was more effective than controls in alleviating social isolation. |
| Jiang and Yang, 2025 [] | BMC Psychology | China | Quantitative randomized controlled trials | Middle-school students | n=65 | 15.24; 15.75 | 14‐18 | CBT (Elomia) with Mindfulness and PFA-derived components | Hybrid recommendation and chatbot system using NLP and machine learning | Relative to controls, the intervention produced significant and sustained reductions in somatic symptoms, anxiety and insomnia, social dysfunction, and severe depression, reflecting overall alleviation of psychological distress, while the control group showed no meaningful change. |
| Keung and So, 2025 [] | Frontiers in Psychology | Hong Kong | Quantitative randomized controlled trials | University students | N=110 (n=55; n=55) | 29.93; 28.13 | 18‐71; 18‐60 | CBT-based supportive chatbot | GPT-4 or LLM-based counseling chatbot | Stress levels decreased from pre- to post-test in both AI and human support conditions, regardless of whether the chatbot’s identity was revealed, but higher “perceptual fear” of AI predicted lower perceived support quality; those with lower fear evaluated the chatbot more favorably. |
| Ko and Woo, 2025 [] | JMIR mHealth and uHealth | South Korea | Quantitative descriptive | Population not specified | N=7 (raters) | – | – | Mixed (CBT, health information provision, and physiological monitoring) | Mixed app ecosystem with algorithmic chat, AI mood checking, and recommender features | Among commercial apps, those providing rich mental health information, counseling services, and multiprovider details achieved the highest uMARS scores (17/41 apps >4.0, range of 3.08‐4.08) and were preferred by users, who particularly valued 24-hour access and not having to rely on a single provider. |
| Lai et al, 2025 [] | Journal of Nursing Research | Taiwan | Quantitative randomized controlled trials | Employed nurses | N=80 | 33.6 | ≥18 | Stress appraisal and coping theory+CBT | Hybrid GPT-supported chatbot embedded in structured intervention content | Relative to controls, the intervention produced significant improvements in sleep disturbance, perceived stress, anxiety, and depressive symptoms by 6-week follow-up (all P<.001); perceived stress was already lower in the intervention group by 3 weeks, whereas the control group showed a slight increase in stress over time. |
| Lee and Hahn, 2024 [] | Frontiers in Psychology | South Korea | Quantitative nonrandomized | University students | N=137 | 23.3 | 18‐49 | Study-specific mind perception and support-type manipulation framework | Rule-based chatbot with Dialogflow ML intent or emotion classification | Attributing “mind” to the chatbot (explicitly or implicitly) predicted higher perceived message effectiveness (explicit d≈1.09; implicit d≈0.50); emotional support reduced message effectiveness when users did not implicitly attribute mind, whereas informational support showed no such interaction, and participants described the interaction as facilitating introspection and self-evaluation. |
| Lee et al, 2025 [] | JMIR mHealth and uHealth | South Korea | Mixed methods | University students | N=170 (n=85; n=47; n=38) | 20.6; 20.2 | – | CBT-based skills and behavioral support chatbot | Semigenerative chatbot combining KoGPT2 generation, KoBERT classification, CNN, and fixed CBT scenarios | Only the chatbot group showed significant improvements in time management skills, perceived stress (eg, PSS reductions of approximately 2.5‐4 points), and procrastination, with particularly strong effects in a high-engagement cluster; control participants did not exhibit comparable improvements. |
| Li et al, 2025 [] | Journal of the American Psychiatric Nurses Association | Hong Kong | Quantitative nonrandomized | University students | N=30 | 23.03 | 18‐25 | Mindfulness-Based Stress Reduction | Scripted rule-based chatbot using tree-graph dialogue, keyword spotting, conditional logic, and database memory | A fully chatbot-delivered audio mindfulness program produced large reductions in stress (PSS from 34.11 to 24.65; d≈−1.95, P<.001) and significant gains in related well-being domains, with high adherence (~87%) and no adverse events; most participants reported improved sleep, mood, and emotional regulation. |
| Ly et al, 2017 [] | Internet Interventions | Sweden | Mixed methods | University students | N=37 (n=14, n=14, n=9) | 21.1, 25.4, 28.8 | 20‐49 | CBT+positive psychology | Rule-based chatbot using triggers, keywords, NLU, and lightweight database | After a 2-week intervention with CBT interface (SHIMS), only the adherence group (consistent use over 2 wk) showed significant interaction effects of group and time on perceived stress (F1, 27=4.30; P=.048), with a large-medium effect size (d=1.06). |
| Marwaha et al, 2025 [] | The Open Psychology Journal | India | Mixed methods | University students | N=214 | – | 18‐23 | CBT+mindfulness | ChatGPT or LLM-based digital intervention | Following chatbot use, COVID-19 stress scores significantly declined (from 16.50 to 15.80; P<.01), and qualitative data indicated reduced fear of being judged, less felt need for counseling or “someone to talk to,” and fewer felt stress-related mood swings, with concomitant improvements in sleep quality. |
| Medeiros et al, 2022 [] | IEEE Transactions on Human-Machine Systems | United Kingdom | Quantitative randomized controlled trials | Mixed adult population | N=188 (n=86; n=102) | 28.74 | 18‐35 | Emotion Regulation Theory (Gross) (study-specific implementation) | Hybrid chatbot using rule-based stressor mapping, NLP/text mining, Watson NLU, and ML classification | Compared with a neutral chatbot, interacting with a supportive chatbot produced modest reductions in perceived stress (~0.7 points; P≈.01), increased positive affect, and decreased negative affect. However, arousal-related stress indices did not show robust change. |
| Meng and Dai, 2021 [] | Journal of Computer-Mediated Communication | United States | Quantitative randomized controlled trials | University students | N=211 | 20.4 | – | Study-specific emotional support versus self-disclosure framework | Rule-based keyword-recognition chatbot with predefined scripts | Emotional support from the chatbot did not directly reduce stress (nonsignificant main effect) but produced significant reductions in worry and an indirect effect on stress via perceived supportiveness; receiving emotional support predicted stress reduction through feeling supported, whereas a chatbot that only self-disclosed (without emotional support) yielded the poorest stress outcomes. |
| Nelekar et al, 2022 [] | British Journal of Educational Technology | India | Quantitative randomized controlled trials | University students | N=61 | 20.52 | – | BDI model+ explanation-based agent architecture | Explainable rule-based BDI-style embodied agent using option-based dialogue | Stress significantly decreased across explanation conditions (eg, belief group from 7.65 to 6.91, P≈.01), and behavioral intentions increased for multiple health behaviors depending on explanation type; participants also reported heightened awareness of their own stress through interacting with the AI explanation system. |
| Orji et al, 2025 [] | Behaviour and Information Technology | Canada | Mixed methods | Mixed adult population | N=190 (n=95; n=72; n=23) | – | 18‐45 | Positive psychology, PERMA, flow theory, PSD model | Hybrid app combining smile detection (computer vision), ChatGPT API, and sentiment analysis | The intervention led to significant improvements in mood (BMIS pleasantness, calmness, and positivity all increased, d ≈ 0.6), with 80% of interviewees reporting stress relief and relaxation; both quantitative and qualitative data indicated that the app effectively helped users feel calmer. |
| Park et al, 2025 [] | International Journal of Educational Technology in Higher Education | South Korea | Mixed methods | University students | N=78 (n=50; n=28) | 22.44; 22.6; 22.28 | – | Self-reflection framework (self-disclosure–based design) | GPT-4o LLM chatbot with prompt engineering and self-disclosure manipulation | The self-disclosure chatbot elicited deeper reflective processing. This was seen in longer sessions, richer narrative content, and greater engagement across all reflection substages, which supported users’ awareness of stress triggers and appraisal patterns. Although objective assessment accuracy did not differ significantly from the non-self-disclosure version, self-disclosure interactions were perceived as more emotionally resonant and conducive to stress clarification, helping users articulate stressors and evaluate coping options more effectively than the NSD chatbot. |
| Park et al, 2019 [] | Journal of Medical Internet Research | South Korea | Qualitative | University students | N=30 | – | ≥18 | MI | Rule-based ELIZA-style chatbot with keyword-based response generation | Participants generally appreciated evocative, context-sensitive questions that facilitated self-reflection and motivation and valued the chatbot’s nonjudgmental stance and empathy toward graduate school stress; however, some responses were seen as repetitive or generic, and many participants desired more personalized and informational support. |
| Qi, 2025 [] | International Journal of Human-Computer Interaction | China | Quantitative nonrandomized | University students | N=206 (n=86; n=120) | 19.8 | – | CBT+crisis intervention components | Scripted CBT chatbot with preprepared responses | The intervention significantly reduced DASS-Stress scores (~4.2-point decrease, P<.01) and negative affect (SCL-90), with most students reporting improved emotional clarity, although between-group differences in stress outcomes were not statistically significant in this sample. |
| Silva et al, 2020 [] | Journal of Medical Systems | Portugal | Quantitative descriptive | Medical school students | N=83 | 22.13 | 17‐38 | Study-specific “EUSTRESS” system (physiological stress monitoring and prediction) | Sensor-based ML stress detection using HR/HRV and neural network classification | The system effectively detected examination-related stress: physiological markers (RR interval, SDNN, RMSSD) and PSS-13 scores all indicated significantly higher stress during examinations than at baseline; a neural network model achieved good discrimination (sensitivity~75%, specificity~78%) between examination and baseline states. |
| Sun et al, 2023 [] | Frontiers in Artificial Intelligence and Applications | The Netherlands | Quantitative randomized controlled trials | Mixed adult population | N=107 (n=30; n=77) | – | ≥16 | MI, BCTs, GET, mindfulness | Hybrid chatbot combining scripted expert dialogue, NLU/NLG, sentiment analysis, classification, and generation | Mindfulness-oriented chatbot conversations led to perceived stress reductions (belief in reduced stress increased; eg, stress ratings ≈3.6 vs neutral 3; P<.001) and strengthened beliefs in physical activity engagement for most barrier types, although in some cases control participants reported higher PA engagement beliefs than the intervention group. |
| Teague et al, 2025 [] | Behaviour & Information Technology | Australia | Mixed methods | Mixed adult population | N=186 (n=53; n=10; n=43; n=80) | 31.2; 31.23 | 24‐39 | CBT, mindfulness, BCT | Rule-based expert-system chatbot using hierarchical decision trees and menus | The father-focused app showed good usability (SUS≈ 75‐81; MARS≈3.8‐4.3) and high engagement (~8.7 interactions per day); fathers and clinicians rated it as having high perceived impact on fathers’ mental health awareness and support-seeking, with fathers reporting clearer mood awareness and benefits from mood and goal tracking. |
| Tong et al, 2025 [] | Journal of Medical Internet Research | Hong Kong | Quantitative randomized controlled trials | Mixed adult population | N=285 | 26.45 | 18‐61 | CBT, mindfulness, behavioral activation, positive psychology | Rule-based decision-tree chatbot with predefined categorical choices | The intervention produced small to moderate improvements in self-care behaviors (d≈0.36), mindfulness (d≈0.37), depressive symptoms (d≈−0.26), and positive emotions (d≈0.28); stress decreased over time in both intervention and control groups, yielding a main effect of time but no significant group× time interaction for stress. |
| Troitskaya and Batkhina, 2022 [] | Family Process | Russia | Quantitative randomized controlled trials | Mixed adult population | N=156 (n=58; n=98) | 31.0; 29.24; 31.08 | 18‐54 | CBT, mindfulness, positive psychology | Decision-tree chatbot with branching scripts and stored user variables | Over 21 days, participants using iCognito showed significant reductions in psychological distress (DASS-stress decreased by~2.7 points, P<.01) and qualitative reports of better emotional regulation and communication; there were moderate effects on relationship satisfaction and conflict reduction, although some interaction effects (eg, on quarrels and emotional intelligence) were not significant. |
| Weng et al, 2024 [] | Journal of Medical Internet Research | Singapore | Quantitative descriptive | Mixed adult population | N=786 | – | ≥18 | CBT, behavioral activation, mindfulness, positive psychology, motivational interviewing | Hybrid platform combining chatbot, screening algorithm, and CBT-based digital tools | Usage analytics indicated high engagement with grounding, sleep, and breathing tools, and ~57% of users reported feeling better after chatbot sessions, with greater improvements linked to completing structured CBT tools; however, outcomes were based on anonymous, uncontrolled usage data, and no RCT evidence was available. |
| Williams et al, 2021 [] | Social Sciences | New Zealand | Mixed methods | University students | N=256 (n=124; n=132) | 19.9 | 18‐25 | CBT, DBT skills, mindfulness, behavioral activation, positive psychology | Rule-based chatbot using quick reply options and branching paths | Participants showed moderate to large pre-post improvements in stress (PSS-10 reduction of ~5.7, P<.001) and well-being (WHO-5), alongside modest sleep improvements and high adherence (median 19.5 of 21 days completed), with effect sizes in the small-medium range (≈0.38‐0.49). |
| Yong, 2025 [] | Journal of Technology in Behavioral Science | France | Quantitative descriptive | University students | n=10 | – | 17‐21 | Study-specific emotional-AI sentiment-detection+ affective computing framework | Multimodal emotional-AI system combining ML, FACS computer vision, IoT sensors, and smart home orchestration | AI-guided sessions improved positive affect (PANAS+6.1) and reduced negative affect (~−4.3), with users reporting relaxation, clearer mood, and interest in more personalized recommendations; however, the evidence was primarily feasibility- and experience-focused, with limited formal hypothesis testing, and the broader work emphasized integrating AI-infused mental health supports into the home. |
| Yoon et al, 2024 [] | BMJ Health & Care Informatics | Singapore | Mixed methods | Mixed adult population | N=330 (n=278; n=12; n=40) | – | 18‐60 | CBT, DBT, mindfulness, behavioral activation, motivational interviewing | AI-enabled platform using chatbot with algorithmic self-assessment and resource recommendation | Platform logs showed strong engagement with breathing, grounding, reframing, and sleep tools; survey and interview data indicated reduced stress, improved emotional regulation, and high perceived usefulness, with most users finding the platform helpful (≈71%) and reporting mood improvements (≈62%); anonymity, convenience, and immediate access were valued, particularly for managing work-related stress. |
aAI: artificial intelligence.
bNot applicable.
cCBT: cognitive behavioral therapy.
dNLP: natural language processing.
eLLM: large language model.
fSME: small-to-medium-sized enterprise.
gSMM: Stress Mindset Measure.
hWHO: World Health Organization.
iACT: Acceptance and Commitment Therapy.
jHRV: heart rate variability.
kHR: heart rate.
lPSS-10: Perceived Stress Scale (10 item).
mEMI: ecological momentary intervention.
nAIML: Artificial Intelligence Markup Language.
oPFA: Psychological First Aid.
puMARS: user version of the Mobile Application Rating Scale.
qML: machine learning.
rCNN: convolutional neural network.
sNLU: natural language understanding.
tBDI: belief-desire-intention.
uPERMA: Positive Emotion, Engagement, Relationships, Meaning, Accomplishment model.
vPSD: Persuasive Systems Design model.
wAPI: Application Programming Interface.
xBMIS: Brief Mood Introspection Scale.
yNSD: non-self-disclosure.
zMI: motivational interviewing.
aaELIZA: “ELIZA-style chatbot”, an early natural language processing computer program, reacted to user input without the capacity for logic-driven decision making, made use of pre-programmed responses.
abDASS: Depression, Anxiety, and Stress Scale.
acSCL-90: Symptom Checklist-90.
adRR: R-wave to R-wave interval.
aeSDNN: standard deviation of normal-to-normal intervals.
afRMSSD: root mean square of successive differences.
agBCT: behavior change technique.
ahGET: Graded Exercise Therapy.
aiNLG: natural language generation.
ajPA: physical activity.
akSUS: System Usability Scale.
alMARS: Mobile Application Rating Scale.
amRCT: randomized control trial.
anDBT: Dialectical Behavior Therapy.
aoFACS: Facial Action Coding System.
apIoT: internet of things.
Several trends are evident across the included records. First, the records were predominantly published after 2020, suggesting an increase in interest in the use of AI-enabled services for self-initiated personal stress management. Second, the included records comprised studies using qualitative research methods, as well as studies incorporating experimentation and randomized controlled trials. This indicates that the field is moving beyond early feasibility work, although the evidence base remains uneven. Subsequent research can thus build on established experimental findings and protocols to assess the efficacy of AI-enabled stress management strategies. Finally, the included records span 20 countries across multiple regions, suggesting that stress management is a concern of broad international relevance and that AI-enabled stress management strategies have been examined across diverse cultural contexts.
Quality Appraisal
The methodological quality of the 35 included studies was assessed using the Mixed Methods Appraisal Tool (MMAT) [], which appraises qualitative, quantitative, and mixed methods studies using 5 design-specific criteria. Two independent reviewers (the first and second authors) evaluated each study. Initial interrater agreement was 83.8%, and discrepancies were resolved through discussion. The full MMAT ratings, criteria, and coding framework applied in this review are provided in .
What Emerging Roles Do AI-Enabled Tools Play in Self-Directed Stress Management?
Overview
From our included records, we identified the following themes: (1) psychological intervention, use of AI to administer psychological aid for individuals independent of a clinical or medical practitioner; (2) behavioral support, where AI agents support users in making intended behavioral changes to manage stress; (3) psychoeducation, where AI-enabled agents are equipped with resources to provide users access to information about their current stressors, how to manage stress and stressors, and access to multiple sources of medical intervention; (4) companionship and emotional support, where AI-enabled agents can be present for users in moments of high stress or need, and readily provide emotional support for users; and (5) stress monitoring, detection, and triage, where AI-enabled agents assist users in monitoring personal stress levels through language detection, physiological signals when paired with wearables, and provide advice for users’ next best steps. For clarity, the term AI-enabled agents is used as an umbrella term to refer to chatbots, conversational agents, and AI-driven applications.
Psychological Intervention
Making use of AI-enabled agents for immediate care in high stress or prolonged stressful situations without needing to wait for a clinician or medical practitioner is a prominent benefit of AI-enabled agents. Thirteen articles found that AI can be used for stress management through providing psychological intervention enabled by AI agents [,-,,,,,-,]. Within these studies, AI architectures included rule-based or scripted conversational systems [,,]; machine learning, recommendation, or predictive NLP-based systems [,,,,]; LLM or generative artificial intelligence (GenAI) systems [,]; sensor, computer vision, or multimodal hybrid systems [,]; and AI-chatbot plus screening or recommendation systems [].
In general, these AI agents have been shown to be effective in delivering timely interventions. For example, Allan [] showed that a brief conversation with an AI chatbot shifted users’ mindsets from viewing stress as harmful to viewing stress as enhancing (d≈2.19), and users attributed productivity gains to this mindset change. In high-stress work environments, such as hospitals, participants who engaged with AI-supported interventions reported significant reductions in burnout, job stress, and stress responses over a 4-week period [,]. AI-enabled interventions for high-stress work environments were particularly valuable to users, given the fast-paced nature of their jobs, which made it unlikely that they would seek clinical intervention. Stress-response reductions were the strongest in the AI-supported intervention group [], which supports the efficacy of immediate intervention.
AI-enabled interventions can also be combined with other forms of technology, such as biofeedback through wearables, where detecting a change in user physiology and administering immediate intervention yielded greater stress reductions in perceived stress []. This demonstrates the flexibility in how stress management interventions can be delivered without a clinician present. Immediate use of AI-enabled agents to mitigate stress also has positive effects. Marwaha et al [] found a significant decline in COVID-19–related student stress scores after interacting with the chatbot compared to baseline values, with several participants qualitatively reporting reduced fear of peer judgment and decreased stress-related mood swings. Moreover, the findings of Hiller et al [] highlight that the “just-in-time” ability of AI agents to act as a considerable merit, where users interacting with the AI-enabled system, mila, also saw significantly improved well-being and reduced perceived stress and depressive symptoms, with qualitative reports of improved ability to identify triggers when supported by the chatbot.
Beyond the immediate deployment of intervention, prolonged support with AI-enabled agents also benefits users by improving mood and affect. Weng et al [] showed that users who engaged with the AI support chatbot Wysa, hosted on the website platform mindline.sg, repeatedly reported feeling better after chatbot sessions. Similarly, users who interacted with SmileApp reported significant improvements in mood, experiencing stress relief and relaxation, thereby helping them feel calmer when dealing with stressors [].
In the context of psychological intervention, a significant number of AI-enabled systems implement evidence-based strategies derived from established frameworks such as cognitive behavioral therapy [,,,,], mindfulness [,,,,], and Gross’s emotion regulation theory []. The flexibility of chatbots to be programmed with various psychological aids and frameworks to address different scenarios and stressors demonstrates a significant use case for AI-supported agents in stress management and in catering to each user’s needs. The available evidence supports AI-enabled agents as a feasible means of delivering psychological intervention content in the reviewed contexts, particularly when systems are designed around established psychological frameworks.
Behavioral Support
AI-enabled agents can also help users achieve intended behaviors that mitigate stress. Ten studies found uses for AI to assist in helping users achieve behavioral changes [,,,-,,,,]. Among the behavioral support studies, AI architectures included rule-based, scripted, or decision tree chatbot systems [,,,,]. In these studies, AI agents primarily took the form of online chatbots, being delivered via websites or mobile apps [] and even integrated with smart home technology []. Other used AI architectures include hybrid GPT or semigenerative systems combined with structured intervention content or ML classifiers [,] and multimodal, AI-enabled, or ML-based assistive or recommendation systems [,,]. AI-supported agents were often programmed to address behaviors that contribute to stress or could improve stress outcomes. For example, Lai et al [] showed that a 6-week intervention using an AI agent based on stress appraisal and coping theory encouraged healthier sleep habits and adaptive coping strategies, resulting in significant improvements in perceived stress, anxiety, and depressive symptoms. Participants also reported that, following interaction with the chatbot, they experienced improved sleep, mood, and emotion regulation. Another targeted behavior was procrastination. Lee et al [] reported that engagement with a cognitive behavioral thCognitive Behavioural Therapy (CBT)–based supportive chatbot (Moa) was associated with greater improvements in time management skills, perceived stress, and procrastination behaviors compared with a control condition.
Beyond self-improvement domains, AI agents have helped users identify their personal conflict styles and adjust their approaches to interpersonal disagreements. For instance, Troitskaya and Batkhina [] showed that over 21 days, using an AI-supported application was associated with a considerable reduction in psychological distress for couples. Qualitative reports indicated improved emotional regulation and communication in their relationships []. Consistent use of AI support was also associated with significant decreases in perceived stress while yielding improvements in mindfulness and self-care behaviors [,,]. Taken together, these findings suggest that AI interventions can be effective aids in helping users make lasting behavioral changes in targeted areas, supporting them in discontinuing negative habits and encouraging positive ones. Furthermore, immediate access to AI platforms promotes greater accountability and user engagement, ensuring users remain on track to achieve their intended goals.
Psychoeducation
AI-enabled technology has also been applied to help users learn more about stress and stress mindsets, identify stressors, and access resources when experiencing stress. Four articles found that AI-enabled agents helped users through providing education about stress mindsets and assisting users to understand indicators of stress and how to mitigate harmful effects [,,,]. AI architectures used for psychoeducational interventions included rule-based chatbot or expert system designs [,], LLM- or GenAI-based support [], and mixed app ecosystem architectures involving algorithmic chatting, AI mood checking, or possible AI recommendation features []. The psychoeducational functions were primarily delivered through chatbots trained on established psychological frameworks, often including CBT-informed elements, which participants engaged with independently [,,,]. For instance, Alanezi [] reported that participants used AI agents to learn about stress and strategies for managing stressors and described experiences such as feeling heard and supported during interactions with the chatbot. The same study further noted that participants valued the ease of access to mental health resources, as well as features such as CBT-informed cognitive reframing, coping guidance, and stress self-monitoring []. AI-based psychoeducation can also be tailored to specific contexts or populations. Teague et al [] reported that fathers of newborn children who interacted with a chatbot (Rover) trained in cognitive behavioral therapy, mindfulness, and behavior change techniques described greater awareness of their mental health and increased support-seeking. The chatbot provided information tailored to the perinatal context and was reported to support fathers in identifying, understanding, and processing mood changes associated with caring for a newborn. Participants used these outputs to contextualize their stress levels in relation to demands and plan strategies with the AI to better cope with increased stress at predicted times. Commercial apps with AI support marketed for mass population use also have the potential to impart large amounts of information to the general population. Ko and Woo [] found that among highly rated mental health apps with AI-enabled support, frequent users reported 24-hour access and multiple sources of support and information, a significant advantage over relying on a single medical provider. Users also had access to extensive mental health information through the app, which could be tailored to their specific needs.
Moreover, findings suggest that AI-enabled technologies can effectively address population-specific stress management needs by customizing information for different groups while simultaneously broadening access to mental health information and support. Williams et al [] reported that consistent interaction with a chatbot informed by cognitive behavioral therapy, positive psychology, and mindfulness was associated with statistically significant pre-post improvements in well-being (5-item World Health Organization Well-Being Index) and perceived stress (Perceived Stress Scale-10). These changes corresponded to small-to-medium effect sizes (approximately 0.38‐0.49) and were accompanied by self-reported improvements in adherence to healthy sleep habits and sleep quality. In summary, AI-facilitated psychoeducation can empower users to independently monitor their stress levels and manage their stress responses more effectively by teaching them evidence-based strategies. By customizing educational content to the user’s context and providing on-demand support, AI tools may help increase users’ literacy about stress and coping, which is a critical component of long-term stress management.
Companionship and Emotional Support
Users can make use of AI-enabled agents for immediate emotional support and companionship during high stress periods. Several studies highlighted the potential for AI agents to serve as readily available sources of emotional support [,,,,]. Within these 5 studies, AI architectures were mixed. Two centered around LLM or GenAI conversational systems [,], one used a rule-based chatbot augmented by ML-based Dialogflow classification [], one used a rule-based keyword recognition chatbot with predefined scripts [], and one used an explainable rule-based belief-desire-intention–style embodied conversational agent [].
The immediacy of AI-based support makes these agents valuable, especially for individuals who may be unable or reluctant to seek human support through other means. For example, Indrayanti et al [] reported that the PsyBot chatbot supported users experiencing negative emotions during periods of distress, with participants describing the platform as providing a supportive space for emotional expression and feeling heard. Similarly, Nelekar et al [] reported that participants experienced significant reductions in stress after interacting with an AI chatbot designed to facilitate conversations about users’ beliefs, desires, and life goals, alongside increased awareness and understanding of their own stress through the system’s explanation-based features.
Although participants generally report benefits from AI-mediated emotional support, some frameworks do not produce a direct reduction in stress; instead, stress may decrease indirectly via reduced worry when the chatbot increases perceived support and prompts self-disclosure []. In Meng and Dai [], the Chatfuel-based supportive chatbot showed no direct effect on stress but demonstrated an indirect effect via perceived support and associated worry reductions, consistent with its design to provide emotional support or elicit self-disclosure. Within this theme, interventions grounded in established frameworks, such as PsyBot (Psychological First Aid) and ARU (Belief–Desire–Intention model with an explanation-based architecture), were associated with reductions in negative affect and stress and, in some cases, increases in positive affect; such findings highlight the value of validated psychological frameworks in guiding chatbot design in this context. Overall, the immediacy and availability of AI agents provide users with rapid access to emotional support, which may contribute to improvements in well-being over time.
Stress Monitoring, Detection, and Triage
AI-enabled technology can also enable users to monitor personal stress levels independently, without a clinician or other medical professional, and to detect changes in these levels before users are aware of them. Three studies fall into this category [,,]. Huang et al [] created a bespoke human-robot interaction system for adults (the RoBoHoN) that engaged participants in conversation. Across stress monitoring and triage studies, AI architectures included physiological or sensor-based ML systems [] and hybrid NLP, ML, or generative systems for stressor and barrier inference [,]. As such, the 3 studies in this category used physiological, sensor-based, or conversational data, rather than self-reported symptom scales, to support timely remote self-intervention recommendations. Users perceived this robot as an agent capable of understanding their feelings, providing valid and reliable emotional support in moments of distress, and facilitating self-disclosure to identify causes or effects of stress. Another study by Sun et al [] tested 6 chatbots each trained with a different therapeutic framework (eg, motivational interviewing and mindfulness). Participants in the mindfulness-oriented chatbot condition showed a significant reduction in perceived stress after the intervention, demonstrating that early detection and mitigation of stressors can be achieved with AI agents [].
In addition to conversational monitoring approaches, Silva et al [] described the EuStress solution, an AI-enabled information system designed for continuous, real-time stress assessment using wearable-derived physiological signals and machine learning classification. Consistent with a physiological stress monitoring approach, the system assesses the stress levels of the students using a wearable device to collect stress-related data with minimal user interaction. The solution was intended to interpret the stress reactivity patterns of the individual and predict stress states, cumulative effects of stress, and chronic stress, positioning the intervention as a detection-and-triage tool that translates biometric data into actionable stress-state inference []. While the independent use of AI-enabled systems for self-monitoring and self-management of stress is limited, the efficacy of AI in stress monitoring is supported, serving as a just-in-time strategy rather than allowing a stressful situation to spiral out of users’ control.
All 3 key studies examining AI for stress monitoring, detection, and triage [,,] indicate that AI shows potential for effectively monitoring stress-related indicators and supporting stress detection and triage. RoBoHoN, for instance, was an interactive robotic system that inferred users’ stressors from conversational cues and was rated highly for inference accuracy []. These findings indicate that AI technologies can be trained to detect subtle cues indicative of stress, even before users are consciously aware of their stress. Importantly, user experiences with RoBoHoN were positive. Participants reported that the interaction was enjoyable and that the agent appeared to understand their feelings. They also described the agent as providing valid and reliable emotional support, which facilitated emotional self-disclosure and contributed to stress relief. In a study by Sun et al [], a multichatbot study showed that reductions in perceived stress were accompanied by strengthened beliefs in physical activity as a stress mitigation strategy. This finding suggests that AI-based interventions may also influence users’ coping beliefs and attitudes, specifically by reinforcing exercise as a coping strategy when stress is detected. Moreover, the EuStress system described by Silva et al [] further demonstrated that stress-linked physiological patterns could be meaningfully distinguished in real-world contexts, such as baseline periods versus examination periods among students. ML models were able to classify stress states with promising performance using wearable sensor data. The authors noted practical constraints, including limited sample sizes and data gaps, that affected model robustness and generalizability. These observations underscore that although AI models can detect stress, they require sufficiently large and diverse datasets to ensure reliable performance across users and contexts.
Discussion
Overview
The sample characteristics indicate that the included records were drawn from a reasonably broad range of participant groups but were concentrated mainly in university student samples and mixed adult populations. Several studies also focused on more specific groups, including employed nurses, small-to-medium-sized enterprise owners and managers, medical students, fathers, and children with Down syndrome and their caregivers. This suggests that the evidence base spans educational, occupational, family, and caregiving contexts. Sample sizes varied widely across the included records, from small qualitative or feasibility samples to larger intervention, survey, and platform-based samples. This pattern is consistent with the methodological range of the review, which included qualitative, mixed methods, descriptive, and trial-based studies. Where age was reported, the samples were weighted mainly toward young adults, especially in student-focused studies. Student samples were generally concentrated in the late teens to 20s, whereas occupational- and community-based samples more often fell in the late 20s to 30s. Most studies included participants aged 18 years and older, although a smaller number included adolescents or child-caregiver samples. Overall, the evidence base reflects a range of applied contexts and participant groups while remaining centered largely on students, working adults, and other accessible community populations that are well suited to the use and evaluation of AI-based stress support tools.
This review synthesized 35 studies, comprising qualitative studies (n=3), randomized controlled trials (n=11), nonrandomized quantitative studies (n=6), quantitative descriptive studies (n=4), and mixed methods studies (n=11). Collectively, the included studies examined AI-enabled tools for self-directed stress management, including chatbots, mobile apps, embodied agents, and other digital platforms. Overall, these interventions show preliminary promise as scalable and accessible supports for psychological intervention, behavioral support, psychoeducation, companionship and emotional support, and stress monitoring. However, the evidence base remains early-stage and heterogeneous, so the findings should be interpreted cautiously. Quality appraisal showed that limitations tended to cluster by study design rather than by individual study. All included studies met the 2 MMAT screening criteria (ie, “Are there clear research questions?” and “Do the collected data allow addressing the research questions?”; []). However, methodological quality was more mixed across the design-specific criteria. Among the qualitative studies, all 3 met all 5 study type–specific MMAT criteria. Among the randomized controlled trials, most met criteria for randomization, baseline comparability, and outcome completeness, although blinding was less consistently established and participant adherence was unclear in 1 study. For the nonrandomized quantitative studies, the main limitations were inadequate control of confounding and, in some cases, limited representativeness of the target population. For the quantitative descriptive studies, statistical analysis was generally appropriate, but measurement appropriateness and nonresponse bias were less consistently addressed. For several mixed methods studies, the main weaknesses were incomplete integration of qualitative and quantitative components and limited consideration of inconsistencies between the 2 strands. Overall, the studies were sufficient to identify emerging roles of AI in self-directed stress management, but the methodological variability warrants cautious interpretation of effectiveness and generalizability going forward.
Key Findings
Overview
This review synthesized evidence on how AI can assist in self-directed stress management, identifying 5 distinct functional roles of AI-enabled agents. AI-enabled stress management systems took multiple forms, with several studies using chatbots that delivered support based on existing psychological frameworks, including CBT-informed approaches [,,,,] and mindfulness-based programs such as Mindfulness-Based Stress Reduction []. Other studies evaluated web- and app-based platforms that provided self-guided tools and conversational support [,,,]. In addition, certain systems used physiological monitoring or biofeedback to support stress regulation [,]. These tools are largely designed to function as on-demand support for users in need, providing alerts that inform them when they experience elevated physiological symptoms indicative of stress and offering companionship in moments of emotional distress. AI-based support can be made continuously available, and because many agents are designed around established psychological frameworks, they may support users in managing immediate stressors, although evidence of effectiveness remains mixed and study dependent. Our analysis revealed clear distinctions in how AI-integrated solutions enable self-directed stress management. Each of the 5 identified themes represents distinct mechanisms or use cases for AI agents, assistants, software, and interventions. In synthesizing the emerging field, we illustrated the broad impacts and the specific ways in which AI helps users, from tracking daily habits to coping with persistent workplace stress.
Furthermore, the diversity of AI functions observed across studies was also reflected in the technical architectures and classification frameworks used to describe them. Rule-based and decision tree systems were commonly used for structured tasks, such as CBT-style exercises, mindfulness sessions, psychoeducation, and behavioral prompts. LLM- or GenAI- and NLP-enabled systems appeared more relevant where the intended role involved open-ended conversation, emotional support, or companionship. Sensor-based, physiological, computer-vision, and multimodal ML systems were more prominent when AI was used to infer stress states, emotional cues, or behavioral patterns. Many of the discussed interventions systems also integrated multiple AI approaches, making broad labels such as “chatbot” or “AI app” insufficiently precise. Although existing taxonomies classify AI systems by service design characteristics, human-AI activities [eg, ,], or broader risk and evaluation considerations, these frameworks remain wider in scope than this review [eg, ,]. By organizing studies according to AI’s functional role in self-directed stress management, this review provides a clearer and more use case–specific basis for interpreting outcomes, clarifying what each system is intended to do and avoiding the treatment of “AI” as a single undifferentiated category. In particular, a psychoeducational system, for example, may be assessed by whether it delivers accurate and understandable information; a behavioral support tool by adherence and behavior change; a psychological intervention by stress-related outcomes; a companionship and emotional support system by perceived support and emotional relief; and a monitoring or triage system by detection accuracy, timeliness, and appropriate feedback [,]. Across the included studies, AI served distinct functions in self-directed stress management, delivering psychological interventions, supporting behavior change, providing psychoeducation, offering emotional companionship, and monitoring or triaging stress-related states, rather than functioning as a single intervention type. This role-based structure clarifies each system’s intended purpose and offers a more meaningful basis for evaluating efficacy than AI architecture alone.
Traditionally, stress has been viewed as a negative subject, with extreme stress labeled as the cause of physical ailments and declining mental health [,,]. Our findings, however, suggest that AI-based strategies aid in reframing stress in more constructive ways. For example, the use of AI tools can help individuals to reframe stress as a motivating factor [] or as a warning sign of burnout [], with readily available access to AI-enabled agents reducing another layer of stress in seeking clinical intervention or professional help []. AI-enabled stress management strategies also serve as a gateway to increasing mental health literacy about personal stressors [], allowing users to understand their experiences from a different perspective. The functional roles identified in our review provide a clear foundation for further research on AI and stress management and will help develop more targeted solutions for each individual. Furthermore, the impact of stress-related illnesses extends beyond immediate negative affect. Stress, when unmanaged on a widespread scale, can snowball into enormous health care costs, loss in productivity at work, and, in severe cases, amount to physical disabilities []. The ability of AI-enabled technology to intervene and provide assistance to individuals before their stress levels reach levels of psychological or physical harm is a significant use case for AI and could be used meaningfully to help individuals manage their stress in healthy ways, while remaining accessible at any time.
Research Gaps and Limitations
Through this systematic synthesis, we identified several gaps in the existing literature on AI-supported self-directed stress management. First, the evidence base is dominated by early-stage pilot and feasibility studies. Although many of these studies report short-term improvements in stress-related outcomes, few evaluate mature or clinically validated systems, and distinctions between AI-enabled interventions and broader digital interventions are often insufficiently specified. As a result, evidence regarding long-term effectiveness remains limited, and this literature does not yet support meaningful estimation of pooled effect sizes. In addition, the available evidence may be skewed toward positive findings, reflecting both the novelty of AI-based interventions and broader publication biases. Second, we observed substantial heterogeneity across AI models, psychological frameworks, stress outcome measures, and evaluation methods. While this diversity reflects the field’s interdisciplinary nature, it limits comparability and makes it difficult to determine which approaches are most effective. This challenge is further compounded by the absence of standardized terminology across studies, as rapidly evolving AI capabilities and inconsistent system labeling make it difficult to systematically capture, compare, and track meaningful distinctions between intervention types over time. Third, we identified that the existing study samples are narrow and demographically skewed, with most studies involving young adults or university students. This pattern is expected given typical technology adoption trends [], but it limits generalizability to older or less technologically proficient populations. Factors such as digital literacy and frequency of use may have differing effects on populations that were not captured within this study. Next, issues of safety, fairness, and reliability have been underexamined. Few studies systematically examined erroneous or inappropriate AI outputs, algorithmic bias, or differential performance across demographic groups. Reporting practices also tended to emphasize favorable outcomes, with comparatively little discussion of system failures, unintended effects, or potential harms. Addressing these gaps will be critical as AI-based stress management tools move toward broader real-world deployment. Finally, our search was limited to English-language publications, which introduce the potential for language bias. Both authors’ first language is English, which limits the representativeness of the evidence base outside of English.
Future Directions
To move the field of AI-based stress management beyond the scope of this thematic synthesis, future research should prioritize the development of a cumulative, methodologically rigorous evidence base. First, long-term randomized controlled trials are needed to assess whether stress reduction effects and behavioral changes are sustained beyond the short durations (often less than 3 months) reported in existing studies. Second, as the literature matures, meta-analytic work will be essential for estimating pooled effect sizes and identifying moderators such as age, cultural context, delivery modality, and underlying psychological frameworks. Achieving this will require more standardized outcome measures and longer follow-up intervals. Third, future studies should investigate user-AI interaction processes, including perceived mind attribution, trust, empathy, and fear of AI, as these dimensions appear to influence engagement and stress-related outcomes, in addition to examining the effects of such AI usage patterns on different age groups and demographics. Using validated scales of AI-specific individual differences, such as measures of trust, dependency, or perceived agency, would help clarify these mechanisms. Fourth, methodological standards must be strengthened across disciplines. As observed in this review, reporting practices were highly variable, and only a small subset of studies adhered to established AI-specific guidelines such as CONSORT-AI (Consolidated Standards of Reporting Trials - Artificial Intelligence) [] and SPIRIT-AI (Standard Protocol Items: Recommendations for Interventional Trials - Artificial Intelligence) []. Future work should therefore prioritize transparency, include protocols for error detection and explainability, and adopt mechanisms for real-time error handling to support replicability and user safety. Fifth, improving the diversity of datasets is critical to mitigate algorithmic bias and ensure that AI systems perform equitably across populations, particularly those currently underrepresented. Sixth, the cultural adaptation of AI tools warrants systematic attention. This includes accounting for linguistic variation, culturally specific stressors, and local norms in the expression of stress and coping. The ability of different AI tools to be personalized and scaled to each individual’s needs is an area that could be explored in the future. Across the included studies, none tested age-related moderation effects. Digital literacy was not consistently considered, measured, or modeled as a covariate. One study considered digital literacy as familiarity with AI [] but did not conduct a moderation analysis. Future research should examine whether age, digital literacy, and technological familiarity moderate engagement, acceptability, and effectiveness, given the current skew toward younger and technologically savvy populations. Seventh, research and health care organizations should begin evaluating AI-based stress interventions in real-world settings. In particular, studies set in workplaces, educational institutions, and community health platforms would be ideal for examining the extent to which emerging human-AI models improve engagement, accountability, and sustainability. Furthermore, multimodal AI systems that integrate conversational data with physiological and behavioral signals hold promise for improving ecological validity and enabling more dynamic, real-time stress detection and intervention. Together, these directions offer a roadmap for advancing the field toward more robust, inclusive, and context-sensitive AI applications in self-directed stress management.
Finally, regulation and legislation surrounding AI applications for stress management should be strengthened, especially concerning data security for clinically vulnerable populations. While AI may be beneficial to individual users, providers and governing bodies must be held accountable, ensuring users’ data is not compromised or shared without consent, so as not to harm users in the process. Existing AI guidance frameworks, including the Organisation for Economic Co-operation and Development AI Principles [], the European Commission’s Ethics Guidelines for Trustworthy AI [], and the ASEAN Guide on AI Governance and Ethics [], highlight principles such as transparency, safety, privacy, human oversight, fairness, and accountability. However, these frameworks offer broad governance guidance rather than reporting standards specific to AI use in nonclinical psychological research. A potential alternative could be to adapt principles of ethical use of AI from existing sources before deploying AI in research and scaling the use of AI systems in consumer-facing products. At present, psychology-specific reporting guidance for AI use in nonclinical research appears limited. Frameworks such as CONSORT-AI [] and SPIRIT-AI [] offer relevant examples for AI-related reporting in clinical trial reports and protocols, but they are not general regulatory or accountability frameworks. Researchers could therefore draw cautiously on existing AI governance principles when designing studies or developing consumer-facing AI tools, particularly where human participants are involved. Human oversight may also be useful to help researchers monitor AI outputs and respond to unexpected issues during deployment.
Conclusions
This systematic review consolidates current evidence on how AI supports self-directed stress management. We illustrated 5 distinct ways in which intelligent systems can assist users in managing stress, ranging from monitoring and intervention to behavior change and emotional support. Across these applications, one central insight emerges: several AI stress management interventions were informed by user-centered design principles or psychological frameworks that emphasize coping, behavior change, mindfulness, or emotional support [,,,,,]. As AI systems learn directly from their training data, model validation and reliability testing are as crucial as development studies on novel systems. Small, homogenous, or biased samples risk producing models that misrepresent or overlook entire segments of users [,]. As such, future work must give equal weight to large-scale, diverse data collection and rigorous validation studies, ensuring that AI tools trained for stress management generalize safely, equitably, and across contexts. At the same time, emerging techniques such as digital phenotyping [,] offer a promising path forward. By matching users to interventions that align with their behavioral and symptom profiles, such approaches directly address low engagement stemming from perceived unmet needs. Ultimately, these synthesized findings have wide-reaching implications across cultures, settings, and mental health domains. AI systems hold preliminary promise for expanding access and enhancing stress self-management. As these tools increasingly operate without clinician oversight, standards such as transparent communication of model capabilities, safeguards, and reporting requirements are critical to protect users while realizing the full potential of AI in self-directed interventions.
Acknowledgments
MKGR and SSMT should be considered joint first authors. The order of co-first authors provided was randomized. Co-first authors can prioritize their names when adding this paper’s reference to their resumes. The authors declare the use of generative artificial intelligence (GenAI) in the research and writing process. According to the Generative Artificial Intelligence Delegation Taxonomy (GAIDeT; 2025), the following tasks were delegated to GenAI tools under full human supervision: proofreading and editing. The GenAI tool used was ChatGPT-5.2. Responsibility for the final manuscript lies entirely with the authors. GenAI tools are not listed as authors and do not bear responsibility for the final outcomes.
Funding
Andree Hartanto was supported by Singapore Management University research grants from the Ministry of Education Academic Research Fund Tier 1 (25-SOSS-SMU-006) and the Lee Kong Chian Fund for Research Excellence.
Authors' Contributions
MKGR contributed to conceptualization, formal analysis, investigation, methodology, visualization, writing the original draft, and review and editing of the manuscript. SSMT contributed to conceptualization, formal analysis, investigation, methodology, visualization, writing the original draft, and review and editing of the manuscript. AH contributed to the conceptualization, supervision, and review and editing of the manuscript.
Conflicts of Interest
None declared.
Multimedia Appendix 1
Title and abstract screening log and full-text screening log (record-level decisions), including inclusion or exclusion decisions and reasons for exclusion at full-text stage.
XLSX File, 803 KBMultimedia Appendix 2
MMAT (2018) quality appraisal worksheet, documenting study-level methodological quality assessments for included studies.
XLSX File, 18 KBReferences
- Stress: questions and answers. World Health Organization. 2023. URL: https://www.who.int/news-room/questions-and-answers/item/stress#:~:text=What%20is%20stress%3F [Accessed 2025-11-28]
- Mahmud S, Mohsin M, Dewan MN, Muyeed A. The global prevalence of depression, anxiety, stress, and insomnia among general population during COVID-19 pandemic: a systematic review and meta-analysis. Trends Psychol. 2023;31(1):143-170. [CrossRef] [Medline]
- Salari N, Khazaie H, Hosseinian-Far A, et al. The prevalence of stress, anxiety and depression within front-line healthcare workers caring for COVID-19 patients: a systematic review and meta-regression. Hum Resour Health. Dec 17, 2020;18(1):100. [CrossRef] [Medline]
- Canaletti EF, Lun P, Stutzman LD, Chan M, Cheung F. Rising tide of stress: global trends and structural predictors over 18 years. Wellbeing, Space and Society. Jun 2026;10:100319. [CrossRef]
- Jalali A, Ziapour A, Karimi Z, et al. Global prevalence of depression, anxiety, and stress in the elderly population: a systematic review and meta-analysis. BMC Geriatr. Oct 4, 2024;24(1):809. [CrossRef] [Medline]
- Piao X, Xie J, Managi S. Continuous worsening of population emotional stress globally: universality and variations. BMC Public Health. Dec 23, 2024;24(1):3576. [CrossRef] [Medline]
- Smith MD, Wesselbaum D. Global evidence on the prevalence of and risk factors associated with stress. J Affect Disord. Apr 1, 2025;374:179-183. [CrossRef] [Medline]
- Cham CQ, Ibrahim N, Siau CS, et al. Caregiver burden among caregivers of patients with mental illness: a systematic review and meta-analysis. Healthcare (Basel). Nov 30, 2022;10(12):2423. [CrossRef] [Medline]
- Soh XC, Hartanto A, Ling N, Reyes M, Sim L, Majeed NM. Prevalence of depression, anxiety, burden, burnout, and stress in informal caregivers: an umbrella review of meta-analyses. Archives of Gerontology and Geriatrics Plus. Sep 2025;2(3):100197. [CrossRef]
- Amanvermez Y, Zhao R, Cuijpers P, et al. Effects of self-guided stress management interventions in college students: a systematic review and meta-analysis. Internet Interv. Apr 2022;28:100503. [CrossRef] [Medline]
- Bruffaerts R, Mortier P, Auerbach RP, et al. Lifetime and 12-month treatment for mental disorders and suicidal thoughts and behaviors among first year college students. Int J Methods Psychiatr Res. Jun 2019;28(2):e1764. [CrossRef] [Medline]
- Arahanthabailu P, Praharaj SK, Purohith AN, Yesodharan R, Rege S, Appaji R. Exploring barriers to seek mental health services among patients with severe mental illness and their caregivers in a modified assertive community treatment program: a qualitative thematic analysis. Indian J Psychiatry. Jul 2024;66(7):621-629. [CrossRef] [Medline]
- Ebert DD, Mortier P, Kaehlke F, et al. Barriers of mental health treatment utilization among first-year college students: first cross-national results from the WHO World Mental Health International College Student Initiative. Int J Methods Psychiatr Res. Jun 2019;28(2):e1782. [CrossRef] [Medline]
- Keast R, Lengyell M, Finnigan C, Brown J, Jay M. Barriers to mental health care for low-income clients as perceived by counsellors. Journal for Social Action in Counseling & Psychology. Aug 6, 2025;17(1):2-21. [CrossRef]
- Cuijpers P, Schuurmans J. Self-help interventions for anxiety disorders: an overview. Curr Psychiatry Rep. Aug 2007;9(4):284-290. [CrossRef] [Medline]
- Chen X, Chen L, Song D, Wang Y, Ryoo J. Digital mental health for young people: sustainably managing stress and anxiety. Mental Health and Digital Technologies. Oct 29, 2025;2(3):157-182. [CrossRef]
- Gaffney H, Mansell W, Tai S. Conversational agents in the treatment of mental health problems: mixed-method systematic review. JMIR Ment Health. Oct 18, 2019;6(10):e14166. [CrossRef] [Medline]
- Ghorbian M, Ghobaei-Arani M. Large language models for mental health diagnosis and treatment: a survey. Artif Intell Rev. Nov 14, 2025;59(1):9. [CrossRef]
- Köckritz J, İlgen B, Cohrdes C, Hattab G. Current applications and future directions in natural language processing for news media and mental health. Sci Rep. Sep 15, 2025;15(1):32532. [CrossRef] [Medline]
- Raza M, Jahangir Z, Riaz MB, Saeed MJ, Sattar MA. Industrial applications of large language models. Sci Rep. Apr 21, 2025;15(1):13755. [CrossRef] [Medline]
- Alowais SA, Alghamdi SS, Alsuhebany N, et al. Revolutionizing healthcare: the role of artificial intelligence in clinical practice. BMC Med Educ. Sep 22, 2023;23(1):689. [CrossRef] [Medline]
- Obodo OR. Exploring the role of AI-driven decision support systems in reducing diagnostic errors in rural healthcare settings. International Journal of Scientific Research and Modern Technology. Sep 20, 2025:131-135. [CrossRef]
- Pailaha AD. The impact and issues of artificial intelligence in nursing science and healthcare settings. SAGE Open Nurs. 2023;9:23779608231196847. [CrossRef] [Medline]
- Freund J, Smit F, Lehr D, et al. A universal digital stress management intervention for employees: randomized controlled trial with health-economic evaluation. J Med Internet Res. Oct 22, 2024;26:e48481. [CrossRef] [Medline]
- Peeters SB, Hilgersom M, van Krugten FCW, et al. Economic evaluations of eHealth interventions targeting mental health problems in the workplace: a systematic review. J Ment Health. Oct 2025;34(5):580-595. [CrossRef] [Medline]
- Ali M, Ali S, Abbas Q, Abbas Z, Lee SW. Artificial intelligence for mental health: a narrative review of applications, challenges, and future directions in digital health. Digit Health. 2025;11:20552076251395548. [CrossRef] [Medline]
- Feng Y, Hang Y, Wu W, et al. Effectiveness of AI-driven conversational agents in improving mental health among young people: systematic review and meta-analysis. J Med Internet Res. May 14, 2025;27:e69639. [CrossRef] [Medline]
- Jain G, Pareek S, Carlbring P. Revealing the source: how awareness alters perceptions of AI and human-generated mental health responses. Internet Interv. Jun 2024;36:100745. [CrossRef] [Medline]
- Beatty C, Malik T, Meheli S, Sinha C. Evaluating the therapeutic alliance with a free-text CBT conversational agent (Wysa): a mixed-methods study. Front Digit Health. 2022;4:847991. [CrossRef] [Medline]
- Mauriello ML, Lincoln T, Hon G, Simon D, Jurafsky D, Paredes P. SAD: a stress annotated dataset for recognizing everyday stressors in SMS-like conversational systems. Presented at: CHI EA ’21: Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems; May 8-13, 2021. [CrossRef]
- Croes EAJ, Antheunis ML, van der Lee C, de Wit JMS. Digital confessions: the willingness to disclose intimate information to a chatbot and its impact on emotional well-being. Interact Comput. Aug 15, 2024;36(5):279-292. [CrossRef]
- Lucas GM, Gratch J, King A, Morency LP. It’s only a computer: virtual humans increase willingness to disclose. Comput Human Behav. Aug 2014;37:94-100. [CrossRef]
- Merwin ER, Hagen AC, Keebler JR, Forbes C. Self-disclosure to AI: people provide personal information to AI and humans equivalently. Computers in Human Behavior: Artificial Humans. Aug 2025;5:100180. [CrossRef]
- Hua Y, Na H, Li Z, et al. A scoping review of large language models for generative tasks in mental health care. NPJ Digit Med. Apr 30, 2025;8(1):230. [CrossRef] [Medline]
- Olawade DB, Wada OZ, Odetayo A, David-Olawade AC, Asaolu F, Eberhardt J. Enhancing mental health with artificial intelligence: current trends and future prospects. Journal of Medicine, Surgery, and Public Health. Aug 2024;3:100099. [CrossRef]
- Pavuluri S, Sangal R, Sather J, Taylor RA. Balancing act: the complex role of artificial intelligence in addressing burnout and healthcare workforce dynamics. BMJ Health Care Inform. Aug 24, 2024;31(1):e101120. [CrossRef] [Medline]
- Crosswell AD, Lockwood KG. Best practices for stress measurement: how to measure psychological stress in health research. Health Psychol Open. 2020;7(2):2055102920933072. [CrossRef] [Medline]
- Neri L, Oberdier MT, van Abeelen KCJ, et al. Electrocardiogram monitoring wearable devices and artificial-intelligence-enabled diagnostic capabilities: a review. Sensors (Basel). May 16, 2023;23(10):4805. [CrossRef] [Medline]
- Smets E, Rios Velazquez E, Schiavone G, et al. Large-scale wearable data reveal digital phenotypes for daily-life stress detection. NPJ Digit Med. 2018;1(1):67. [CrossRef] [Medline]
- Castaldo R, Melillo P, Bracale U, Caserta M, Triassi M, Pecchia L. Acute mental stress assessment via short term HRV analysis in healthy adults: a systematic review with meta-analysis. Biomed Signal Process Control. Apr 2015;18:370-377. [CrossRef]
- Almadhor A, Sampedro GA, Abisado M, Abbas S. Efficient feature-selection-based stacking model for stress detection based on chest electrodermal activity. Sensors (Basel). Jul 25, 2023;23(15):6664. [CrossRef] [Medline]
- Scollon CN, Kim-Prieto C, Diener E. Experience sampling: promises and pitfalls, strengths and weaknesses. J Happiness Stud. Mar 2003;4(1):5-34. [CrossRef]
- Choi A, Ooi A, Lottridge D. Digital phenotyping for stress, anxiety, and mild depression: systematic literature review. JMIR Mhealth Uhealth. May 23, 2024;12:e40689. [CrossRef] [Medline]
- Dwyer B, Flathers M, Burns J, et al. Assessing digital phenotyping for app recommendations and sustained engagement: cohort study. JMIR Form Res. Nov 19, 2024;8:e62725. [CrossRef] [Medline]
- Loughnane C, Laiti J, O’Donovan R, Dunne PJ. Systematic review exploring human, AI, and hybrid health coaching in digital health interventions: trends, engagement, and lifestyle outcomes. Front Digit Health. 2025;7:1536416. [CrossRef] [Medline]
- Michie S, Yardley L, West R, Patrick K, Greaves F. Developing and evaluating digital interventions to promote behavior change in health and health care: recommendations resulting from an international workshop. J Med Internet Res. Jun 29, 2017;19(6):e232. [CrossRef] [Medline]
- Zhu Y, Long Y, Wang H, Lee KP, Zhang L, Wang SJ. Digital behavior change intervention designs for habit formation: systematic review. J Med Internet Res. May 24, 2024;26:e54375. [CrossRef] [Medline]
- Michie S, Wood CE, Johnston M, Abraham C, Francis JJ, Hardeman W. Behaviour change techniques: the development and evaluation of a taxonomic method for reporting and describing behaviour change interventions (a suite of five studies involving consensus methods, randomised controlled trials and analysis of qualitative data). Health Technol Assess. Nov 2015;19(99):1-188. [CrossRef] [Medline]
- Bendig E, Erb B, Schulze-Thuesing L, Baumeister H. The next generation: chatbots in clinical psychology and psychotherapy to foster mental health – a scoping review. Verhaltenstherapie. 2022;32(Suppl. 1):64-76. [CrossRef]
- Trappey AJC, Lin APC, Hsu KYK, Trappey CV, Tu KLK. Development of an empathy-centric counseling chatbot system capable of sentimental dialogue analysis. Processes (Basel). May 8, 2022;10(5):930. [CrossRef]
- Kraus M, Wagner N, Callejas Z, Minker W. The role of trust in proactive conversational assistants. IEEE Access. 2021;9:112821-112836. [CrossRef]
- Sanjeewa R, Iyer R, Apputhurai P, Wickramasinghe N, Meyer D. Empathic conversational agent platform designs and their evaluation in the context of mental health: systematic review. JMIR Ment Health. Sep 9, 2024;11:e58974. [CrossRef] [Medline]
- Ho JQH, Hartanto A, Koh A, Majeed NM. Gender biases within artificial intelligence and ChatGPT: evidence, sources of biases and solutions. Computers in Human Behavior: Artificial Humans. May 2025;4:100145. [CrossRef]
- Alkaeed M, Qayyum A, Qadir J. Privacy preservation in artificial intelligence and extended reality (AI-XR) metaverses: a survey. Journal of Network and Computer Applications. Nov 2024;231:103989. [CrossRef]
- Mandal A, Paradkar M, Panindre P, Kumar S. AI-based detection of stress using heart rate data obtained from wearable devices. Procedia Comput Sci. 2023;230:749-759. [CrossRef]
- Rezaeikhonakdar D. AI chatbots and challenges of HIPAA compliance for AI developers and vendors. J Law Med Ethics. 2023;51(4):988-995. [CrossRef] [Medline]
- Tavory T. Regulating AI in mental health: ethics of care perspective. JMIR Ment Health. Sep 19, 2024;11:e58493. [CrossRef] [Medline]
- Gianfrancesco MA, Tamang S, Yazdany J, Schmajuk G. Potential biases in machine learning algorithms using electronic health record data. JAMA Intern Med. Nov 1, 2018;178(11):1544-1547. [CrossRef] [Medline]
- Horsfall LJ, Bondaronek P, Ive J, Poduval S. Clinical algorithms and the legacy of race-based correction: historical errors, contemporary revisions and equity-oriented methodologies for epidemiologists. Clin Epidemiol. 2025;17:647-662. [CrossRef] [Medline]
- Joseph J. Algorithmic bias in public health AI: a silent threat to equity in low-resource settings. Front Public Health. 2025;13:1643180. [CrossRef] [Medline]
- Page MJ, McKenzie JE, Bossuyt PM, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. Mar 29, 2021;372:n71. [CrossRef] [Medline]
- Corporation for digital scholarship. Zotero. 2025. URL: https://www.zotero.org/
- Stern C, Lizarondo L, Carrier J, et al. Methodological guidance for the conduct of mixed methods systematic reviews. JBI Evid Synth. Oct 2020;18(10):2108-2118. [CrossRef] [Medline]
- Rodgers M, Sowden A, Petticrew M, et al. Testing methodological guidance on the conduct of narrative synthesis in systematic reviews: effectiveness of interventions to promote smoke alarm ownership and function. Evaluation (Lond). Jan 2009;15(1):49-73. [CrossRef]
- Popay J, Roberts H, Sowden A, et al. Guidance on the Conduct of Narrative Synthesis in Systematic Reviews: A Product from the ESRC Methods Programme. Lancaster University; 2006. [CrossRef]
- Thomas J, Harden A. Methods for the thematic synthesis of qualitative research in systematic reviews. BMC Med Res Methodol. Jul 10, 2008;8(1):45. [CrossRef] [Medline]
- Aromataris E, Fernandez R, Godfrey CM, Holly C, Khalil H, Tungpunkom P. Summarizing systematic reviews: methodological development, conduct and reporting of an umbrella review approach. Int J Evid Based Healthc. Sep 2015;13(3):132-140. [CrossRef] [Medline]
- Alanazi R, Alanazi AS, Alqazlan S, Benlaria H. Assessing the impact of AI tools on mobility and daily assistance for children with down syndrome in Saudi Arabia. Sci Rep. Aug 2025;15(1). [CrossRef]
- Alanezi F. Assessing the effectiveness of ChatGPT in delivering mental health support: a qualitative study. J Multidiscip Healthc. 2024;17:461-471. [CrossRef] [Medline]
- Allan DD. A mobile messaging-based conversational agent-led stress mindset intervention for New Zealand small-to-medium-sized enterprise owner-managers: effectiveness and acceptability study. Behaviour & Information Technology. Sep 9, 2024;43(12):2785-2798. [CrossRef]
- Crum AJ, Akinola M, Martin A, Fath S. The role of stress mindset in shaping cognitive, emotional, and physiological responses to challenging and threatening stress. Anxiety Stress Coping. Jul 2017;30(4):379-395. [CrossRef] [Medline]
- Baek G, Cha C. AI-assisted tailored intervention for nurse burnout: a three-group randomized controlled trial. Worldviews Evid Based Nurs. Feb 2025;22(1):e70003. [CrossRef] [Medline]
- Cho A, Cha C, Baek G. Development of an artificial intelligence-based tailored mobile intervention for nurse burnout: single-arm trial. J Med Internet Res. Jun 21, 2024;26:e54029. [CrossRef] [Medline]
- Fuhrmann LM, Lukas CA, Schindler-Gmelch L, Berking M. Evaluating a brief smartphone-based stress management intervention with heart rate biofeedback from built-in sensors in a three arm randomized controlled trial. Sci Rep. Jun 23, 2025;15(1):20257. [CrossRef] [Medline]
- Gabrielli S, Rizzi S, Bassi G, et al. Engagement and effectiveness of a healthy-coping intervention via chatbot for university students during the COVID-19 pandemic: mixed methods proof-of-concept study. JMIR Mhealth Uhealth. May 28, 2021;9(5):e27965. [CrossRef] [Medline]
- Hiller S, Götzl C, Rauschenberg C, et al. Health-promoting effects and everyday experiences with a mental health app using ecological momentary assessments and AI-based ecological momentary interventions among young people: qualitative interview and focus group study. JMIR Mhealth Uhealth. Apr 29, 2025;13:e65106. [CrossRef] [Medline]
- Huang YC, Gamborino E, Huang YJ, Qian X, Fu LC, Yeh SL. Inferring stressors from conversation: towards an emotional support robot companion. Int J of Soc Robotics. Sep 2022;14(7):1657-1671. [CrossRef]
- Indrayanti I, Salsabila AK, Amrita V, Alhaddad MM, Saskia AB, Ramadhani DP. PsyBot: a randomized controlled trial of WhatsApp-based psychological first aid to reduce loneliness among 18–22-year-old students in Yogyakarta, Indonesia. SSM - Mental Health. Dec 2025;8:100504. [CrossRef]
- Jiang J, Yang Y. GymBuddy and Elomia, AI-integrated applications, effects on the mental health of the students with psychological disorders. BMC Psychol. Aug 2025;13(1):350. [CrossRef]
- Keung WM, So TY. Attitudes towards AI counseling: the existence of perceptual fear in affecting perceived chatbot support quality. Front Psychol. 2025;16:1538387. [CrossRef] [Medline]
- Ko S, Woo H. Users’ needs for mental health apps: quality evaluation using the user version of the mobile application rating scale. JMIR Mhealth Uhealth. Jul 4, 2025;13:e64622. [CrossRef] [Medline]
- Lai HL, Chen CI, Liaw GH, Lu LC, Huang CY. Effectiveness of a chatbot-based internet-support-coping-caring intervention in improving sleep quality and work-related outcomes among nurses in taiwan during the COVID-19 pandemic: a pilot study. J Nurs Res. Oct 1, 2025;33(5):e411. [CrossRef] [Medline]
- Lee I, Hahn S. On the relationship between mind perception and social support of chatbots. Front Psychol. 2024;15:1282036. [CrossRef] [Medline]
- Lee S, Jeong J, Kim M, Lee S, Kim SP, Jung D. Development of a mobile intervention for procrastination augmented with a semigenerative chatbot for university students: pilot randomized controlled trial. JMIR Mhealth Uhealth. Apr 10, 2025;13:e53133. [CrossRef] [Medline]
- Li Y, Chung TY, Lu W, et al. Chatbot-based mindfulness-based stress reduction program for university students with depressive symptoms: intervention development and pilot evaluation. J Am Psychiatr Nurses Assoc. 2025;31(4):398-411. [CrossRef] [Medline]
- Ly KH, Ly AM, Andersson G. A fully automated conversational agent for promoting mental well-being: a pilot RCT using mixed methods. Internet Interv. Dec 2017;10:39-46. [CrossRef] [Medline]
- Marwaha EB, Gera J, Thareja R, Gupta S. The digital bridge: using digital intervention to understand psychosocial correlation between COVID-19 stress and sleep quality in undergraduate students before and after COVID-19. TOPSYJ. Feb 28, 2025;18(1):e18743501364965. [CrossRef]
- Medeiros L, Bosse T, Gerritsen C. Can a chatbot comfort humans? Studying the impact of a supportive chatbot on users’ self-perceived stress. IEEE Trans Human-Mach Syst. Jun 2022;52(3):343-353. [CrossRef]
- Meng J, Dai Y(. Emotional support from AI chatbots: should a supportive partner self-disclose or not? J Comput-Mediat Commun. Sep 9, 2021;26(4):207-222. [CrossRef]
- Nelekar S, Abdulrahman A, Gupta M, Richards D. Effectiveness of embodied conversational agents for managing academic stress at an Indian University (ARU) during COVID‐19. Brit J Educational Tech. May 2022;53(3):491-511. [CrossRef]
- Orji J, Chan G, Orji R. Design, development, and evaluation of an mHealth app to reduce stress and promote happiness through smiling. Behaviour & Information Technology. May 9, 2026;45(8):1546-1566. [CrossRef]
- Park M, Fels S, Seo K. Enhancing academic stress assessment through self-disclosure chatbots: effects on engagement, accuracy, and self-reflection. Int J Educ Technol High Educ. 2025;22(1). [CrossRef]
- Park S, Choi J, Lee S, et al. Designing a chatbot for a brief motivational interview on stress management: qualitative case study. J Med Internet Res. Apr 16, 2019;21(4):e12231. [CrossRef] [Medline]
- Qi Y. Pilot quasi-experimental research on the effectiveness of the Woebot AI chatbot for reducing mild depression symptoms among athletes. International Journal of Human–Computer Interaction. Jan 2, 2025;41(1):452-459. [CrossRef]
- Silva E, Aguiar J, Reis LP, Sá JOE, Gonçalves J, Carvalho V. Stress among Portuguese medical students: the EuStress Solution. J Med Syst. Jan 2, 2020;44(2):45. [CrossRef] [Medline]
- Sun X, Casula D, Navaratnam A, et al. Lukowicz P, Mayer S, Koch J, Shawe-Taylor J, Tiddi I, editors. Virtual Support for Real-World Movement: Using Chatbots to Overcome Barriers to Physical Activity. Front Artif Intell Appl IOS Press; 2023. [CrossRef] ISBN: 978-1-64368-394-2
- Teague SJ, Shatte ABR, Fuller-Tyszkiewicz M, Hutchinson DM. User-centred design and evaluation of an mHealth app for fathers’ perinatal mental health: a feasibility, acceptability, and usability study. Behav Inf Technol. May 9, 2025;44(20):1-15. [CrossRef] [Medline]
- Tong ACY, Wong KTY, Chung WWT, Mak WWS. Effectiveness of topic-based chatbots on mental health self-care and mental well-being: randomized controlled trial. J Med Internet Res. Apr 30, 2025;27:e70436. [CrossRef] [Medline]
- Troitskaya O, Batkhina A. Mobile application for couple relationships: results of a pilot effectiveness study. Fam Process. Jun 2022;61(2):625-642. [CrossRef] [Medline]
- Weng JH, Hu Y, Heaukulani C, et al. Mental wellness self-care in Singapore with mindline.sg: a tutorial on the development of a digital mental health platform for behavior change. J Med Internet Res. Jun 4, 2024;26:e44443. [CrossRef] [Medline]
- Williams R, Hopkins S, Frampton C, Holt-Quick C, Merry SN, Stasiak K. 21-Day stress detox: open trial of a universal well-being chatbot for young adults. Soc Sci (Basel). 2021;10(11):416. [CrossRef]
- Yong SC. Integrating emotional AI into mobile apps with smart home systems for personalized mental wellness. J Technol Behav Sci. 2025;10(4):761-778. [CrossRef]
- Yoon S, Goh H, Low XC, Weng JH, Heaukulani C. User perceptions and utilisation of features of an AI-enabled workplace digital mental wellness platform ’mindline at work’. BMJ Health Care Inform. Aug 17, 2024;31(1):e101045. [CrossRef] [Medline]
- Hong QN, Fàbregues S, Bartlett G, et al. The Mixed Methods Appraisal Tool (MMAT) version 2018 for information professionals and researchers. EFI. Dec 18, 2018;34(4):285-291. [CrossRef]
- Blaß M, Gimpel H, Karnebogen P. A taxonomy and archetypes of AI-Based Health Care Services: Qualitative Study. J Med Internet Res. Nov 27, 2024;26:e53986. [CrossRef] [Medline]
- Choong YY, Jensen T. AI use taxonomy: a human-centered approach. National Institute of Standards and Technology; 2024. [CrossRef]
- Uuk R, Gutierrez CI, Guppy D, et al. A taxonomy of systemic risks from general-purpose AI. SSRN. Preprint posted online on 2025. [CrossRef]
- Slattery P, Saeri AK, Grundy EAC, et al. The AI risk repository: a meta-review, database, and taxonomy of risks from artificial intelligence. Patterns. May 2026;7(5):101517. [CrossRef]
- Rahman M, Victoros E, Davis R, Duaa T, Shanjana Y, Islam M. Use of artificial intelligence in mental healthcare, health psychology, and related research: a narrative review to address challenges and opportunities. Health Sci Rep. Dec 2025;8(12):e71595. [CrossRef] [Medline]
- Alekseev A, Strobel C. A taxonomy of AI experiments. Journal of Behavioral and Experimental Economics. Mar 2026;121:102525. [CrossRef]
- Manley AE, Biddle L, Savović J, Moran P. The positive and negative consequences of stress and its relationship with coping in medical students: a qualitative study. Med Teach. Feb 2025;47(2):323-328. [CrossRef] [Medline]
- Faiz F, Le V, Masli EK. Determinants of digital technology adoption in innovative SMEs. Journal of Innovation & Knowledge. Oct 2024;9(4):100610. [CrossRef]
- Liu X, Cruz Rivera S, Moher D, Calvert MJ, Denniston AK, SPIRIT-AI and CONSORT-AI Working Group. Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension. Nat Med. Sep 2020;26(9):1364-1374. [CrossRef] [Medline]
- Rivera SC, Liu X, Chan AW, Denniston AK, Calvert MJ, SPIRIT-AI and CONSORT-AI Working Group. Guidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT-AI Extension. BMJ. Sep 9, 2020;370:m3210. [CrossRef] [Medline]
- Artificial intelligence. Organisation for Economic Co-operation and Development (OECD). 2026. URL: https://www.oecd.org/en/topics/policy-issues/artificial-intelligence.html [Accessed 2026-05-02]
- Ethics guidelines for trustworthy AI. European Commission. 2019. URL: https://digital-strategy.ec.europa.eu/en/library/ethics-guidelines-trustworthy-ai [Accessed 2026-05-02]
- ASEAN guide on AI governance and ethics. ASEAN Secretariat. 2024. URL: https://asean.org/book/asean-guide-on-ai-governance-and-ethics/ [Accessed 2026-05-02]
- Chen P, Wu L, Wang L. AI fairness in data management and analytics: a review on challenges, methodologies and applications. Appl Sci (Basel). Sep 13, 2023;13(18):10258. [CrossRef]
- Timmons AC, Duong JB, Simo Fiallo N, et al. A call to action on assessing and mitigating bias in artificial intelligence applications for mental health. Perspect Psychol Sci. Sep 2023;18(5):1062-1096. [CrossRef] [Medline]
Abbreviations
| AI: artificial intelligence |
| CBT: cognitive behavioral therapy |
| CONSORT-AI: Consolidated Standards of Reporting Trials - Artificial Intelligence |
| GAIDeT: Generative Artificial Intelligence Delegation Taxonomy |
| GenAI: generative artificial intelligence |
| HIPAA: Health Insurance Portability and Accountability Act |
| LLM: large language model |
| MMAT: Mixed Methods Appraisal Tool |
| NLP: natural language processing |
| PRISMA: Preferred Reporting Items for Systematic Reviews and Meta-Analyses |
| SPIRIT-AI: Standard Protocol Items: Recommendations for Interventional Trials-Artificial Intelligence |
| WHO: World Health Organization |
Edited by Matthew Balcarras; submitted 02.Jan.2026; peer-reviewed by Md Zakir Hossain, Ogochukwu Ndibe, Vibhuti Gupta, Zhao Liu; final revised version received 05.May.2026; accepted 05.May.2026; published 24.Jun.2026.
Copyright© Mary Kamillah Grace Reyes, Shauna Sha Min Teo, Andree Hartanto. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 24.Jun.2026.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

