Review
Abstract
Background: The number of people with diabetes is on the rise globally. Self-management and health education of patients are the keys to control diabetes. With the development of digital therapies and artificial intelligence, chatbots have the potential to provide health-related information and improve accessibility and effectiveness in the field of patient self-management.
Objective: This study systematically reviews the current research status and effectiveness of chatbots in the field of diabetes self-management to support the development of diabetes chatbots.
Methods: A systematic review and meta-analysis of chatbots that can help patients with diabetes with self-management was conducted. PubMed and Web of Science databases were searched using keywords around diabetes, chatbots, conversational agents, virtual assistants, and more. The search period was from the date of creation of the databases to January 1, 2023. Research articles in English that fit the study topic were selected, and articles that did not fit the study topic or were not available in full text were excluded.
Results: In total, 25 studies were included in the review. In terms of study type, all articles could be classified as systematic design studies (n=8, 32%), pilot studies (n=8, 32%), and intervention studies (n=9, 36%). Many articles adopted a nonrandomized controlled trial design in intervention studies (n=6, 24%), and there was only 1 (4%) randomized controlled trial. In terms of research strategy, all articles can be divided into quantitative studies (n=10, 40%), mixed studies (n=6, 24%), and qualitative studies (n=1, 4%). The evaluation criteria for chatbot effectiveness can be divided into technical performance evaluation, user experience evaluation, and user health evaluation. Most chatbots (n=17, 68%) provided education and management focused on patient diet, exercise, glucose monitoring, medications, and complications, and only a few studies (n=2, 8%) provided education on mental health. The meta-analysis found that the chatbot intervention was effective in lowering blood glucose (mean difference 0.30, 95% CI 0.04-0.55; P=.02) and had no significant effect in reducing weight (mean difference 1.41, 95% CI –2.29 to 5.11; P=.46) compared with the baseline.
Conclusions: Chatbots have potential for the development of self-management for people with diabetes. However, the evidence level of current research is low, and higher level research (such as randomized controlled trials) is needed to strengthen the evidence base. More use of mixed research in the research strategy is needed to fully use the strengths of both quantitative and qualitative research. Appropriate and innovative theoretical frameworks should be used in the research to provide theoretical support for the study. In addition, researchers should focus on the personalized and user-friendly interactive features of chatbots, as well as improvements in study design.
doi:10.2196/60380
Keywords
Introduction
Background
Diabetes, as a significant global public health problem, has resulted in a heavy burden on the health care system. Approximately 530 million people worldwide currently have diabetes, and by 2050, more than 1.31 billion people will have diabetes globally [
]. In 2021, the global health expenditure caused by diabetes was about US $966 billion, and it is estimated that this figure will exceed US $1054 billion by 2045 [ ]. Currently, there is no effective cure for diabetes, but it is somewhat controllable and preventable. The American Diabetes Association Professional Practice Committee suggests that diabetes may be diagnosed based on hemoglobin A1C (HbA1c) criteria or plasma glucose criteria, either the fasting plasma glucose value, 2-hour glucose value during a 75-g oral glucose tolerance test, or random glucose value accompanied by classic hyperglycemic symptoms or hyperglycemic crises. Therefore, the main control of diabetes mellitus is the blood glucose level [ ]. Diabetes control can reduce and delay the onset and progression of diabetes complications. Good diabetes control depends on a combination of diet, exercise, medication, diabetes education, and blood glucose monitoring [ ]. However, high-quality treatment of diabetes and its complications also depends on education about diabetes and active management of the patient’s own behavior (also known as self-management) [ ]. Health education is vital for the self-management of diabetes [ ]. The purpose of diabetes education is to help patients improve their self-care ability, that is, to increase knowledge about self-care of diabetes and to improve the ability of self-care practice [ ]. Diabetes self-management and support have been provided face-to-face (one-to-one or group based) for decades, with many trials and real-world studies showing improved diabetes outcomes [ ]. However, high costs and resource requirements have limited the scope and scalability of face-to-face programs [ ].Digital therapeutics is a new concept: it is technology produced by the digital integration of medicine and health. It combines disease, data, and algorithms [
]. The essence of digital therapeutics is a disease intervention program based on evidence-based medicine and presented by software programs that provide core functions such as the prevention, treatment, or management of diseases and high-risk factors [ ]. Digital therapy applications (Blue Star, Diabeo System, etc) use web-based applications or cloud platforms to provide evidence-based, personalized, and rapid care management for chronic and behavioral-changeable diseases, including diabetes [ ]. Digital therapy applications can improve patients’ compliance, treatment success rate, and economic achievements in diabetes management by enhancing patients’ participation, changing patients’ lifestyles, providing comprehensive medical care, and regular glucose monitoring [ ]. At present, the increase of patients with diabetes has brought a heavy burden to medical professionals and the medical care system. With the development of artificial intelligence (AI) and digital therapies, a convenient and effective diabetes health education system is gradually being developed [ ]. It meets the individual needs of patients with diabetes to acquire knowledge about diabetes and self-management, helps patients improve their daily lifestyles, and supports them in the self-management of diabetes.The chatbot is one of the products of scientific and technological progress and digital therapy, which has great potential in providing health education for patients and helping patients manage themselves. The chatbot can also be called a conversational agent or system, defined as a nonhuman virtual conversational bot, a computer program, or an AI program [
]. It participates in the conversation via audio or text input to answer the user’s questions, and the user can easily access the desired information by sending messages to the chatbot [ ]. AI is now gradually being integrated into chatbots. When a user asks a question or issues a command in human language, the chatbot understands the contextual information sent by the user and responds accordingly. Google Assistant, Apple’s Siri, and ChatGPT are common chatbots with voice-activated interfaces [ , ]. Chatbots can play an essential role in many areas, especially in health care. They can help patients anytime via a smartphone app or the web [ ]. Chatbots can provide fascinating health-related information and services and provide a personalized lifestyle [ ]. Chatbots can usually provide better answers to health care questions than other health care resources [ ]. For health care organizations, the use of chatbots significantly improves efficiency and reduces the workload of health care professionals and the burden on the health care system [ , ]. For patients, the use of chatbots can assist in self-management, improving behavioral compliance and quality of life [ ]. Previous findings indicate that chatbot interventions effectively improve depression, anxiety, stress, medication adherence, and smoking cessation [ ].Chatbots have the potential to provide personalized health education and assist in self-management for people with diabetes. According to our findings, although some studies have analyzed diabetes and self-management or investigated the use of chatbots in other diseases [
, ], no study has systematically reviewed the use of chatbots in self-management for patients with diabetes. Therefore, this study is innovative to some extent.Objectives
In summary, the aim of this study was to conduct a systematic review and meta-analysis of chatbots used in the field of diabetes self-management, exploring the specific characteristics of chatbots, research methods, research strategies, evaluation measures of outcome indicators, and some other information. By analyzing information about chatbots, this study aims to assess the potential and effectiveness of chatbots in diabetes self-management, thus helping researchers to sort out the current state of research and future development trends in this field. This will provide support for improving the development of chatbots in the field of diabetes self-management, which will ultimately help patients with diabetes to stay healthy.
Methods
Search Strategy
A systematic search of the literature was conducted in the PubMed and Web of Science databases, from the date of database establishment until January 1, 2023. PubMed and Web of Science are the 2 most commonly used databases for scholars in the biomedical field. PubMed is the largest biomedical retrieval platform in the world. Web of Science is the most classic and authoritative citation database in the scientific community, and the most important core collection of Web of Science contains more than 12,000 academic journals in more than 250 disciplines. The quality of articles in these 2 databases is very high, and hence, the researcher chose to extract high-quality literature from these 2 databases. Search terms were developed from a combination of keywords and Boolean operators in consultation with a professional medical librarian. The keywords of the search mainly revolve around “diabetes” and “Chatbot” (see Table S1 in
for the search strings).Data Management and Extraction
The data filtering process can be divided into four stages: (1) duplicate data removal, (2) primary filtering (filtering based on titles and abstracts), (3) secondary filtering (filtering based on full text), and (4) cross-checking. Two researchers (JZZ and GP) independently screened the articles according to the inclusion and exclusion criteria (see Table S2 in
). In the first step, for articles that were both in the Web of Science and PubMed databases, only one of them was retained, and duplicates were removed. In the second step, irrelevant articles were initially filtered out by reading titles and abstracts. In the third step, articles that did not fit the topic of the study were excluded by carefully reading the full text for judgment. Finally, independent screening results from JZZ and GP were cross-checked to ensure the accuracy of article screening. For articles with screening disagreements, the disagreements were resolved through discussion with the senior author, YBW. The PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) 2020 flow diagram was used to visualize the comprehensive search process and the results obtained.The quality of the included literature was evaluated by 2 researchers independently using the evaluation criteria for observational studies developed by the Agency for Healthcare Research and Quality [
]. The tool has a total of 11 entries, each of which contains 3 options: “Yes,” “No,” and “Unclear.” A score of 1 was given for “Yes” and 0 for “No” or “Unclear”; the total score was 11, with 0-3 as low quality, 4-7 as medium quality, and 8-11 as high quality. Disagreements were resolved through discussion between authors. For details of the quality evaluation results, refer to Table S3 in .Outcomes of Interest
The basic information and data from the articles were extracted and checked according to a predesigned data extraction table. The data information extracted for each study includes (1) basic information about the article (author, country, and year of publication); (2) specific features of the chatbot (name, consultable content, AI technology, question and answer database data sources, and input and output methods); and (3) research design aspects (research strategy, research type, theoretical framework, evaluation indexes of chatbot effect, etc).
Meta-Analysis Methods
Meta-analysis was planned if the included studies were sufficiently homogeneous in terms of statistical characteristics. RevMan 5.3 software (Cochrane Collaboration) was used for the statistical analysis of data. Mean, SD, and 95% CI were used as study effect indicators. The heterogeneity test was performed on the combined data. If P>.10 and I2<50%, it suggests that the heterogeneity among studies was small, and the fixed-effect model was selected for meta-analysis. If P<.10 and I2>50%, it suggests that there is obvious heterogeneity among the studies, and the random effect model was used for analysis. Sensitivity analysis was performed by comparing the results of the combined effect sizes of the fixed-effects model and random-effects model to ensure the stability and reliability of the results [
].Results
Search Results
In total, 25 articles from 14 countries were included in the analysis (see
for details). Authors from Australia and Portugal published 4 articles each, and authors from the United States and India published 3 articles each. Also, high-income countries, including Singapore, Norway, the Netherlands, Spain, Greece, and South Korea, and low-income countries, including China, Bangladesh, South Africa, and Ghana, have published relevant studies. The PRISMA flow diagram is shown in .Study | Year | Research type | Country | Consultable content | Evaluation metrics |
Krishnakumar et al [ | ]2021 | Interventional study | India | Diet, exercise, medications, and blood glucose | User experience and user health |
Stephens et al [ | ]2019 | Pilot study | United States | Diet, exercise, and mental health | Technical performance and user experience |
Sharma et al [ | ]2018 | System design | India | General diabetes knowledge | None (theoretical assumptions) |
Sowah et al [ | ]2020 | System design | Ghana | General diabetes knowledge | Technical performance |
Anastasiadou et al [ | ]2020 | System design | Greece | Diet, exercise, medications, blood glucose, treatment, learning resources, and doctor information | User experience |
Beaudry et al [ | ]2019 | Pilot study | United States | Drugs and general knowledge of chronic diseases | User experience |
Mitchell et al [ | ]2021 | Pilot study | United States | Diet and exercise | User experience |
Hossain et al [ | ]2022 | System design | Bangladesh | General diabetes knowledge | Technical performance |
Xie et al [ | ]2018 | System design | China | Diet, medication, and symptoms | Technical performance |
Maher et al [ | ]2020 | Interventional study | Australia | Diet and exercise | User experience and user health |
Sagstad et al [ | ]2022 | Interventional study | Norway | Diet, exercise, blood glucose, and gestational diabetes | User experience |
Mash et al [ | ]2022 | Interventional study | South Africa | Diet, exercise, medications, blood glucose, complications, tobacco, alcohol, and mental health | User experience |
Baptista et al [ | ]2020 | Interventional study | Australia | Diet, exercise, medications, blood glucose, and complications | User experience |
Hurmuz et al [ | ]2020 | Interventional study | Netherlands | Diet, exercise, and social support | User experience and user health |
Gong et al [ | ]2020 | Interventional study | Australia | Diet, exercise, medications, blood glucose, and complications | User experience and user health |
Roca et al [ | ]2021 | Interventional study | Spain | Medicine | User experience and user health |
Pimenta et al [ | ]2022 | System design | Portugal | Diet, exercise, and medication | None (theoretical assumptions) |
Balsa et al [ | ]2020 | Pilot study | Portugal | Diet, exercise, and medication | User experience |
Félix et al [ | ]2019 | System design | Portugal | Diet, exercise, and medication | None (theoretical assumptions) |
Saritha et al [ | ]2020 | System design | India | Diabetes general knowledge | Technical performance |
Buinhas et al [ | ]2019 | Pilot study | Portugal | Diet, exercise, and medication | User experience |
Dhinagaran et al [ | ]2021 | Interventional study | Singapore | Diet, exercise, sleep, and mental health | User experience and user health |
Dhinagaran and Car [ | ]2022 | Pilot study | Singapore | Diet, exercise, sleep, and stress management | User experience |
Hussain and Athula [ | ]2018 | Pilot study | Australia | General knowledge of diabetes | Technical performance and user experience |
Rehman et al [ | ]2020 | Pilot study | South Korea | Disease prediction | Technical performance and user experience |
Characteristics of Included Studies
Self-Management Content
Most chatbots provided self-management services (n=17, 68%) around lifestyle interventions (diet and exercise). In addition to providing information on diet and exercise, several studies simultaneously provided information about medications, blood glucose, and complications. In total, 5 (20%) studies provided the management of mental health dimensions such as anxiety, depression, social support, and stress. Two (8%) other studies provided functions in sleep quality, smoking, and alcohol consumption (see Tables S4-6 in
for details). The chatbots included in this study provide education and self-management services to different types of populations (including people with type 2 diabetes, adolescents, older adults, etc; see Tables S4-6 in for details). For example, the Tess chatbot is intended for adolescents. Tess is designed to respond to users with scripted statements that are crafted by mental health professionals to mimic empathy and compassion. Similar to how therapists can adapt their style or approach to address each client’s needs, Tess intervenes and responds based on an individual’s reported emotions or concerns [ ].Question and Answer Database Source
Researchers have mainly collected and extracted diabetes-related health information as a database source for chatbots through the International Diabetes Federation, clinical guidelines on diabetes published in various countries, authoritative health websites, and medical journal literature (see Tables S4-6 in
for details).Interaction and Input-Output Methods
When users interact with a chatbot to ask a question, the chatbot could provide 3 different forms of question and answer: free input (where the user freely enters a question), button input (where the user clicks on a topic or number button to ask a question), and mixed mode (where both free input and button modes are available). Many chatbots support users’ use of unrestricted free speech for queries (n=12, 48%). Ten (40%) studies supported users in making inquiries based on topic buttons provided by the chatbots, and 3 (12%) other studies supported mixed mode. Most chatbots allowed users to enter questions using text (n=19, 76%), and a small number of chatbots allowed users to make inquiries using voice (Tables S4-6 in
).Research Strategy
In our study, research strategies can be categorized into 3 types: qualitative research (n=9, 36%), quantitative research (n=1, 4%), and mixed research (n=7, 28%). Qualitative and quantitative research, in particular, have their own strengths and weaknesses, and mixed research allows quantitative and qualitative research to complement each other’s strengths (the advantages of quantitative, qualitative, and mixed studies are detailed in
).The Theoretical Framework
A total of 10 (40%) articles included in our study reported theoretical frameworks (Tables S4-6 in
). These theoretical frameworks mainly include the Behavior Change Wheel, Self-Determination Theory, Behavior Change Theory (Cross-Theoretical Model and Social Cognitive Theory), and so on. One of the more applied theoretical frameworks in the study was the Behavior Change Wheel, followed by the Behavior Change Theory and the Self-Determination Theory. The researchers integrated the theoretical frameworks in the design and intervention functions of the chatbot (see for specific explanations and applications of the theoretical frameworks in detail).Research Type
In our study, researchers classified all articles into 3 types of studies based on research characteristics: system design studies (n=8, 32%), pilot studies (n=8, 32%), and intervention studies (n=9, 36%). In system design studies, chatbots are at an early stage of the study. Most articles at this stage (n=5, 62.5%) consist of theoretical concepts or design chatbots based on theoretical concepts and test their performance. In pilot studies, researchers go deeper in their exploration based on system design. Articles at this stage choose small-scale, short-term experiments to evaluate the effect of chatbots. The intervention studies were a deeper exploration by the researcher based on the pilot study. This study phase has more participants, a longer intervention period, and a more standardized study design. Specific characteristics of chatbots in each study type are detailed in
and .Meta-Analysis Results
The studies included in the meta-analysis were pre-post trials. Based on the homogeneity characteristics of the included studies, the researchers used HbA1c and weight change as outcome indicators. In the meta-analysis, the researchers used the diabetes chatbot as an intervention to analyze the changes in HbA1c and body weight of the study participants after a period of intervention, compared to the baseline. The results of the meta-analysis are shown below.
HbA1c
Meta-analysis of HbA1c included 4 articles (Krishnakumar et al [
], Gong et al [ ], Baptista et al [ ], and Roca et al [ ]), totaling 4 groups of pre-post trials and 219 trial participants ( ). The results showed that there was no overall heterogeneity among the 4 groups of included trials (I2=0%, P=.94). The combined total effect size and 95% CI obtained using a fixed-effects model was (mean difference [MD] 0.30, 95% CI 0.04-0.55; P=.02), which reached the level of significance. Thus, the chatbot intervention had an effect on lowering HbA1c compared with the baseline. This analysis had consistent results under both fixed-effects and random-effects models. Article publication bias was assessed using funnel plots (Figure S1 in ).Weight
Meta-analysis of weight included 3 articles (Krishnakumar et al [
], Maher et al [ ], and Gong et al [ ]), totaling 3 sets of pre-post trials with 176 trial participants ( ). The results showed that there was no overall heterogeneity among the 3 groups of included trials (I2=0%, P>.99). The combined total effect size and 95% CI obtained using a fixed-effects model was (MD=1.41, 95% CI –2.29 to 5.11; P=.46), which did not reach the level of significance. Thus, there was no significant effect of the chatbot intervention in reducing body weight compared to baseline. This analysis had consistent results under both fixed-effects and random-effects models. Article publication bias was assessed using funnel plots (Figure S2 in ).Discussion
Principal Findings
This study is a systematic review and meta-analysis of the effectiveness of chatbots in diabetes self-management. A total of 25 articles were analyzed in this review. Based on the type of study, the 25 studies can be categorized as systematic design type studies, pilot studies, and intervention studies. Chatbots provide services involving both physical and psychological aspects. The criteria for assessing the effectiveness of chatbots can be categorized into technical performance assessment, user experience assessment, and user health assessment. Overall, the results of the evaluation of the chatbot model were good, and the overall acceptance of chatbots by users was high.
General Overview of Study
Chatbots could play an important role in future self-management for patients with diabetes. However, the current use of chatbots in diabetes self-management is still in its infancy. The number of publications in each country was low, and there was room for further research and development. Compared to high-income countries, there were fewer publications in low-income countries. However, low-income countries have a large population base, and the number of people with diabetes is increasing. Therefore, researchers should pay attention to conducting relevant research in low-income countries in the future.
The chatbot can provide management on physical dimensions such as diet, exercise, medication, blood glucose, and complications management. Only a few articles provided services on psychological dimensions such as anxiety, depression, stress, and social support. More than half of the diabetes chatbots enable users to enter questions freely. A few chatbots offer button input. More than half of the chatbots offer text-based counseling, and a few chatbots allow for voice-to-text counseling. Voice interaction can facilitate users’ interaction with chatbots. Therefore, this represents an area for future research. Chatbots need to establish an appropriate rapport with the user through personalized and friendly interactions for sustained and engaging interventions.
Meta-Analysis Results
Conducting a meta-analysis of the interventional studies, the researchers extracted 2 outcome metrics with a high degree of homogeneity—HbA1c and body weight—as criteria for evaluating the effectiveness of the chatbot intervention. Other intervention studies have used measures such as quality of life, BMI, waist circumference, blood pressure, diabetes knowledge, and risk perception. Meta-analysis of the study results showed that the chatbot intervention was effective in lowering blood glucose but had no significant effect on weight reduction. For example, a pilot study conducted over a 4-month period allowed patients to record their meals, weight, physical activity, and blood glucose in a chatbot. Patients received sessions on self-care behaviors and were provided with feedback from the AI-powered chatbot, as well as regular informational interactions with a diabetes educator via voice calls. To increase patient motivation and reduce the difficulty of performing specific tasks, the chatbot facilitates patient action through a variety of forms of brief educational content (e.g., informative lessons, videos, quizzes, and stories). The program uses a gamified approach to build self-management skills and rewards patients with virtual trophies as they complete lessons, tasks, and challenges. As participation in the program increased, patients’ HbA1c, fasting blood glucose, and postprandial blood glucose levels gradually decreased [
].The Role of Chatbots
The results of this study found that as technologies such as semantic analysis and natural language processing continue to be incorporated into chatbots, researchers have built a knowledge base of diabetes questions and answers with logical rules, so that chatbots can provide users with personalized advice and management services to help patients with diabetes improve their self-management.
Chatbots can help people with diabetes manage their diet, exercise, medication, blood glucose monitoring, prevention of complications, and mental health to control blood glucose levels and improve quality of life [
- , - ]. In addition, chatbots also have a significant advantage in out-of-hospital diabetes management. Chatbots can save many human medical resources, reducing the work pressure on primary care staff and the burden on the health care system [ ]. Therefore, chatbots with AI technology will play a prominent role in diabetes health education and self-management, which is in line with the trend of digital therapy.Based on the analysis of the effectiveness assessment indicators (technical performance indicators, user experience, and user metrics) of the chatbot, it was found that the technical assessment of the chatbot model was good [
], the overall acceptance of the chatbot by the users was high [ ], and the health status of the users improved after using the chatbot for self-management [ , , , ]. Overall, the use of chatbots in health education and self-management of diabetic patients is effective.Lack of High-Quality Research Evidence
Identifying the best evidence is an important part of evidence-based medicine. Evidence grading is an essential tool to help decision-makers obtain the best evidence. Although the criteria and methods for grading research evidence are not uniform, there is some grading of evidence that is generally accepted by scientists. These gradings have consistently endorsed meta-analyses and randomized controlled trials (RCTs) as the highest level of evidence [
]. Most of the articles in our study were system design studies and pilot studies that were in the primary and secondary research phases, while fewer intervention studies were in the analysis. Most of the studies in the system design category tested the model performance of chatbots through algorithms and techniques. However, the actual effectiveness of chatbot applications in the self-management of patients with diabetes has yet to be verified at this stage. In the pilot studies, the number of respondents was small (n=10-33), the duration of the intervention was short (from short-term measurements to 24 weeks), and the respondents included older adults and the general public in addition to people with diabetes. Most of the intervention research articles took a pre-post trial design, and only 1 document used an RCT design with the highest level of evidence. There is still a lack of higher level studies, especially RCTs, to evaluate the effectiveness of chatbots in diabetes self-management. RCTs are important for evaluating the effectiveness of chatbots in diabetes self-management. Therefore, more high-quality studies, such as RCTs, should be conducted in the future to evaluate the effectiveness of chatbots in diabetes self-management, providing stronger evidence that will contribute to the development of the field.Single Method of Research
Most of the articles included for analysis in our study used quantitative research methods. The data sources mainly relied on data collected from chatbots and user questions and answers, as well as user ratings of the experience, satisfaction, and usability of chatbots collected from questionnaires. In contrast, only a minority of articles used mixed research methods (MMRs) and qualitative research methods (semistructured interviews, focus group discussions, etc). For example, of the pilot studies, most used quantitative research methods, with only 2 MMR studies and 1 qualitative study. The quantitative studies were mainly based on questionnaires and data collected from chatbots, while MMR studies included interviews with users on top of that. MMR has unique advantages [
]. The original purpose of the MMR was to bridge the gap between the quantitative and qualitative paradigms of analysis, combining the strengths of both to allow a broader and deeper analysis of specific issues. Quantitative research is objective and reproducible but does not facilitate the researcher’s observation and communication with the research participants. Qualitative research allows the researcher to observe and communicate with the research participants in a close and more natural setting. Qualitative research makes it easier for the researcher to observe and understand the research participants’ behavior, attitudes, and motivations from the participant’s perspective, and the research design is more flexible. Adding qualitative research to quantitative research allows researchers to better understand what respondents think about chatbots. Respondents may raise issues that the researcher has not noticed and considered, helping the researcher to improve the research design. Through qualitative research, the researcher can fully play into the motivation of the researcher and interviewees by conducting semistructured interviews and focus group discussions with interviewees. The interview questions centered on the features that users want chatbots to have or their feelings when using chatbots. After the interviews, the researcher organizes the collected data and information and analyzes them to gain an in-depth understanding of the users’ experiences, perceptions, and barriers to the use of chatbots so as to provide assistance for subsequent research. Therefore, more MMRs should be used in future research on chatbots and diabetes self-management. By combining the respective strengths and unique perspectives of qualitative and quantitative, a more comprehensive understanding of the impact of chatbots on diabetes self-management can be achieved.Insufficiently Innovative Theoretical Framework
In our study, it was found that only a minority of studies had a theoretical framework to support them, while most studies did not have a theoretical framework. For example, only 4 of the pilot studies were guided by a theoretical framework. This phenomenon suggests that researchers have not paid enough attention to the theoretical framework. Theories are developed to explain, predict, and understand phenomena. A scientific theoretical framework actively supports and guides research design and experimentation and can inform and guide the selection of research types. A theoretical framework helps the researcher synthesize prior empirical findings within a theoretical framework, clarifying what the concepts are and how they relate to each other. It provides guidance for research at the methodological level. It helps the researcher to build bridges between theories, facilitates the researcher to recognize their own research, and discovers connections to the established body of knowledge.
The main theoretical frameworks used in the field of diabetes self-management and chatbots include Behavior Change Theory, Self-Determination Theory, Cross-Theoretical Models, and Social Cognitive Theory. These theoretical frameworks are relatively well-developed and widely used. Applying a theoretical framework can provide a theoretical foundation for research on chatbots in diabetes self-management. The use of appropriate theoretical frameworks can enhance understanding of the underlying mechanisms between chatbots and diabetes self-management and facilitate the development of effective interventions. Therefore, future research is expected to use more novel theoretical frameworks in studies related to chatbots and diabetes self-management. Or innovatively applying theoretical frameworks from other fields to enrich research in the field of chatbots and diabetes self-management (see
for specific explanations and applications of the theoretical frameworks in detail).Equitable Accessibility and Privacy Protection
In addition, although patients with diabetes and those at risk of developing diabetes are getting younger, older adults still account for a high proportion of patients with diabetes.
Chatbots, a new-era product of mobile health and AI, should consider the needs of different age groups in terms of system design and software interface usage. Especially for older adults, their acceptance of the new system, their learning level, whether the software interface is easy to use, whether the font size is comfortable, and whether the answers are easy to understand need special consideration. In order to ensure that digital health tools such as chatbots are accessible and beneficial to everyone, it is crucial to include diverse populations in future research.
At the same time, chatbots collect a large amount of health and personal data from users. If this information is disclosed, it will violate patients’ privacy rights. Lack of privacy protection will affect users’ trust. Chatbots can earn users’ trust through dialog and prompt them to reveal more personal data [
]. Therefore, it is crucial to safeguard patients’ personal privacy. First, regulators must act to ensure user safety and privacy. For example, on March 13, 2024, the AI Bill was passed in the European Union (EU) Parliament and will soon become EU law. The EU’s AI Bill will require high-risk AI systems to be assessed and monitored before approval [ ]. Second, operators should standardize the management of data usage rights of back-office personnel and strengthen core data positions and departmental prevention. Third, users should be clearly informed of the privacy policy. The privacy policy must be dynamic, and the privacy policy must provide a table of contents to facilitate users’ understanding of the content of the privacy policy. The collection of users’ personal information shall be based on the principles of clear purpose, explicit consent, and collection of the least necessary data, as well as a detailed description of the type of information to be collected, its purpose, and the manner in which it is to be collected. In addition, operators need to give the correct download uniform resource locator or store. According to their own services, they target the older adults and minors for safety knowledge popularization. When sharing and using personal information, it needs to be deidentified or anonymized so that the subject of the personal information cannot be identified [ ].Strengths and Limitations
This study is the first systematic review of chatbots and self-management for people with diabetes. Researchers searched for relevant literature from 2 authoritative databases, Web of Science and PubMed. The researchers analyzed and summarized the specific features of chatbots in the field of diabetes, as well as information on the research methodology, research design, and theoretical framework used in the articles. In addition, the articles included from different countries may increase the generalizability of our findings.
However, this article also has some limitations. First, researchers selected more authoritative and comprehensive databases, but some gray literature was not retrieved. Second, in the process of data extraction, the key information extracted was confirmed by experts and extracted and cross-checked by 2 researchers independently, but there was still a possibility of human error. There is some literature that fits the research theme very well, but it was not included in our study because full-text information was not available. Only English-language studies were included in this review, which may lead to a certain degree of publication bias. Finally, the trials in the meta-analysis were all pre-post trials with low evidence validity. Therefore, the effectiveness of chatbot interventions remains to be validated by conducting trials.
Conclusions
This study analyzes and summarizes information on the specific characteristics of chatbots in the field of self-management of patients with diabetes, such as research methods, research strategies, effectiveness evaluation metrics, etc, and provides suggestions for research related to chatbots and the self-management of patients with diabetes. The number of articles published in each country was low, and chatbots were at an early stage of development in the field of self-management of patients with diabetes. Chatbots provide advice and education focused on the physical aspects of patients, and researchers have focused less on the psychological aspects of patients. A number of positive effects of chatbots in supporting the self-management of people with diabetes. The overall acceptance of chatbots among people with diabetes was high. However, most of the current studies were at the stage of system design and pilot studies. Meta-analysis showed that the chatbot intervention was effective in lowering blood glucose but had no significant effect on weight reduction.
Based on the results derived from this study, we make the following appeal to researchers: (1) focus should be placed on the area of chatbots and diabetes self-management, with research to fully demonstrate the effectiveness of chatbot interventions; (2) future studies should address identified research gaps and use higher level evidence, such as RCTs; (3) in the choice of research strategies, more mixed research should be adopted to fully use the advantages of quantitative and qualitative research; (4) use of appropriate and innovative theoretical frameworks in the study to provide theoretical support for the study; and (5) the digital age requires the protection of users’ private data. It is hoped that researchers will continue to innovate and enrich the research in this field, which will help to continuously develop effective intervention measures of chat robots in diabetes self-management, and ultimately help patients with diabetes stay healthy.
Acknowledgments
Thanks to the authors of the articles that included analysis in our research. Their articles laid the foundation for our research. The research for this article is funded by the National Key Research and Development Program (2022YFC3600904) and the Capital Health Development Research Project (2022-1G-4252).
Data Availability
The data sets generated and analyzed during this study are available from the corresponding author on reasonable request.
Authors' Contributions
YBW involved in conceptualization, data curation, methodology, project administration, and writing – review & editing; JZZ involved in investigation, methodology, writing – original draft, and writing – review & editing; PG, TYD, JYZ, YWW, YNZ, SYL, XYL, and EYW involved in investigation and writing – review & editing; and XYS involved in project administration, supervision, and writing – review & editing. All authors contributed to the article and approved the submitted version. All authors ensure that generative AI was not used in the manuscript writing.
Conflicts of Interest
None declared.
Supplementary tables and figures.
DOCX File , 49 KBAnalysis of the advantages and disadvantages of quantitative, qualitative and mixed studies.
DOCX File , 14 KBOverview of theoretical frameworks and specific applications to chatbots and diabetes.
DOCX File , 18 KBResearch type.
DOCX File , 17 KBPRISMA 2020 checklist.
DOCX File , 30 KBReferences
- GBD 2021 Diabetes Collaborators. Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021. Lancet. 2023;402(10397):203-234. [FREE Full text] [CrossRef] [Medline]
- Sun H, Saeedi P, Karuranga S, Pinkepank M, Ogurtsova K, Duncan BB, et al. IDF Diabetes Atlas: global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res Clin Pract. 2022;183:109119. [FREE Full text] [CrossRef] [Medline]
- American Diabetes Association Professional Practice Committee. 2. Diagnosis and classification of diabetes: standards of care in diabetes-2024. Diabetes Care. 2024;47(Suppl 1):S20-S42. [CrossRef] [Medline]
- ElSayed NA, Aleppo G, Aroda VR, Bannuru RR, Brown FM, Bruemmer D, et al. 7. Diabetes technology: standards of care in diabetes-2023. Diabetes Care. 2023;46(Suppl 1):S111-S127. [FREE Full text] [CrossRef] [Medline]
- Association of Diabetes CareEducation Specialists, Kolb L. An effective model of diabetes care and education: the ADCES7 self-care behaviors™. Sci Diabetes Self Manag Care. 2021;47(1):30-53. [CrossRef] [Medline]
- Xie W, Ding R, Yan J, Qu Y. A mobile-based question-answering and early warning system for assisting diabetes management. Wirel Commun Mob Comput. 2018;2018(s1):1-14. [FREE Full text] [CrossRef]
- Chrvala CA, Sherr D, Lipman RD. Diabetes self-management education for adults with type 2 diabetes mellitus: a systematic review of the effect on glycemic control. Patient Educ Couns. 2016;99(6):926-943. [FREE Full text] [CrossRef] [Medline]
- Chatterjee S, Davies MJ, Heller S, Speight J, Snoek FJ, Khunti K. Diabetes structured self-management education programmes: a narrative review and current innovations. Lancet Diabetes Endocrinol. 2018;6(2):130-142. [CrossRef] [Medline]
- Kim M, Patrick K, Nebeker C, Godino J, Stein S, Klasnja P, et al. The digital therapeutics real-world evidence framework: an approach for guiding evidence-based digital therapeutics design, development, testing, and monitoring. J Med Internet Res. 2024;26:e49208. [FREE Full text] [CrossRef] [Medline]
- Hong JS, Wasden C, Han DH. Introduction of digital therapeutics. Comput Methods Programs Biomed. 2021;209:106319. [CrossRef] [Medline]
- Kaufman N. Digital therapeutics: leading the way to improved outcomes for people with diabetes. Diabetes Spectr. 2019;32(4):301-303. [FREE Full text] [CrossRef] [Medline]
- Kurniawan MH, Handiyani H, Nuraini T, Hariyati RTS, Sutrisno S. A systematic review of artificial intelligence-powered (AI-powered) chatbot intervention for managing chronic illness. Ann Med. 2024;56(1):2302980. [CrossRef] [Medline]
- McGreevey JD, Hanson CW, Koppel R. Conversational agents in health care-reply. J Am Med Assoc. 2020;324(23):2444-2445. [CrossRef] [Medline]
- d'Elia A, Gabbay M, Rodgers S, Kierans C, Jones E, Durrani I, et al. Artificial intelligence and health inequities in primary care: a systematic scoping review and framework. Fam Med Community Health. 2022;10(Suppl 1):e001670. [FREE Full text] [CrossRef] [Medline]
- Li L, Jiang B, Sun L. HNF1A:from monogenic diabetes to type 2 diabetes and gestational diabetes mellitus. Front Endocrinol (Lausanne). 2022;13:829565. [FREE Full text] [CrossRef] [Medline]
- Park D, Jeong S, Seo Y. Systematic review on chatbot techniques and applications. J Inf Procress Syst. 2022;18(1):26-47. [FREE Full text]
- Bickmore TW, Kimani E, Trinh H, Pusateri A, Paasche-Orlow MK, Magnani JW. Managing chronic conditions with a smartphone-based conversational virtual agent. 2018. Presented at: Proceedings of the 18th International Conference on Intelligent Virtual Agents; November 05, 2018:119-124; Sydney, Australia. URL: https://dl.acm.org/doi/abs/10.1145/3267851.3267908
- Pereira J, Díaz Ó. Using health chatbots for behavior change: a mapping study. J Med Syst. 2019;43(5):135. [CrossRef] [Medline]
- Marks M, Haupt CE. AI chatbots, health privacy, and challenges to HIPAA compliance. J Am Med Assoc. 2023;330(4):309-310. [CrossRef] [Medline]
- Lee P, Bubeck S, Petro J. Benefits, limits, and risks of GPT-4 as an AI chatbot for medicine. N Engl J Med. 2023;388(13):1233-1239. [CrossRef] [Medline]
- Ayers JW, Poliak A, Dredze M, Leas EC, Zhu Z, Kelley JB, et al. Comparing physician and artificial intelligence chatbot responses to patient questions posted to a public social media forum. JAMA Intern Med. 2023;183(6):589-596. [FREE Full text] [CrossRef] [Medline]
- Marcuzzi A, Nordstoga AL, Bach K, Aasdahl L, Nilsen TIL, Bardal EM, et al. Effect of an artificial intelligence-based self-management app on musculoskeletal health in patients with neck and/or low back pain referred to specialist care: a randomized clinical trial. JAMA Netw Open. 2023;6(6):e2320400. [FREE Full text] [CrossRef] [Medline]
- Singh B, Olds T, Brinsley J, Dumuid D, Virgara R, Matricciani L, et al. Systematic review and meta-analysis of the effectiveness of chatbots on lifestyle behaviours. NPJ Digit Med. 2023;6(1):118. [FREE Full text] [CrossRef] [Medline]
- Alsahli M, Abd-Alrazaq A, Househ M, Konstantinidis S, Blake H. The effectiveness of mobile phone messaging-based interventions to promote physical activity in type 2 diabetes mellitus: systematic review and meta-analysis. J Med Internet Res. 2022;24(3):e29663. [FREE Full text] [CrossRef] [Medline]
- Jiang Z, Huang X, Wang Z, Liu Y, Huang L, Luo X. Embodied conversational agents for chronic diseases: scoping review. J Med Internet Res. 2024;26:e47134. [FREE Full text] [CrossRef] [Medline]
- Shu X, Mai Q, Blatz M, Price R, Wang XD, Zhao K. Direct and indirect restorations for endodontically treated teeth: a systematic review and meta-analysis, IAAD 2017 consensus conference paper. J Adhes Dent. 2018;20(3):183-194. [CrossRef] [Medline]
- Deeks JJ, Higgins JP, Altman DG. Chapter 9: analysing dataundertaking meta-analyses. In: Higgins JP, Green S, editors. Cochrane Handbook for Systematic Reviews of Interventions. Sussex, UK. John Wiley & Sons; 2008:243-296.
- Krishnakumar A, Verma R, Chawla R, Sosale A, Saboo B, Joshi S, et al. Evaluating glycemic control in patients of South Asian origin with type 2 diabetes using a digital therapeutic platform: analysis of real-world data. J Med Internet Res. 2021;23(3):e17908. [FREE Full text] [CrossRef] [Medline]
- Stephens TN, Joerin A, Rauws M, Werk LN. Feasibility of pediatric obesity and prediabetes treatment support through tess, the AI behavioral coaching chatbot. Transl Behav Med. 2019;9(3):440-447. [CrossRef] [Medline]
- Sharma M, Singh G, Singh R. An advanced conceptual diagnostic healthcare framework for diabetes and cardiovascular disorders. EAI Endorsed Trans Scalable Inf Syst. 2018;5(18):154828. [FREE Full text]
- Sowah RA, Bampoe-Addo AA, Armoo SK, Saalia FK, Gatsi F, Sarkodie-Mensah B. Design and development of diabetes management system using machine learning. Int J Telemed Appl. 2020;2020:8870141. [FREE Full text] [CrossRef] [Medline]
- Anastasiadou M, Alexiadis A, Polychronidou E, Votis K, Tzovaras D. A prototype educational virtual assistant for diabetes management. 2020. Presented at: IEEE 20th International Conference on Bioinformatics and Bioengineering (BIBE); October 26-28, 2020:999-1004; Cincinnati, OH. URL: https://ieeexplore.ieee.org/abstract/document/9288129
- Beaudry J, Consigli A, Clark C, Robinson KJ. Getting ready for adult healthcare: designing a chatbot to coach adolescents with special health needs through the transitions of care. J Pediatr Nurs. 2019;49:85-91. [CrossRef] [Medline]
- Mitchell EG, Maimone R, Cassells A, Tobin JN, Davidson P, Smaldone AM, et al. Automated vs. human health coaching: exploring participant and practitioner experiences. Proc ACM Hum Comput Interact. 2021;5(CSCW1):99. [FREE Full text] [CrossRef] [Medline]
- Hossain E, Alshehri M, Almakdi S, Halawani H, Rahman MM, Rahman W, et al. Dm-Health app: diabetes diagnosis using machine learning with smartphone. Comput Mater Continua. 2022;72(1):1713-1746. [FREE Full text]
- Maher CA, Davis CR, Curtis RG, Short CE, Murphy KJ. A physical activity and diet program delivered by artificially intelligent virtual health coach: proof-of-concept study. JMIR Mhealth Uhealth. 2020;8(7):e17558. [FREE Full text] [CrossRef] [Medline]
- Sagstad MH, Morken N, Lund A, Dingsør LJ, Nilsen ABV, Sorbye LM. Quantitative user data from a chatbot developed for women with gestational diabetes mellitus: observational study. JMIR Form Res. 2022;6(4):e28091. [FREE Full text] [CrossRef] [Medline]
- Mash R, Schouw D, Fischer AE. Evaluating the implementation of the GREAT4Diabetes WhatsApp chatbot to educate people with type 2 diabetes during the COVID-19 pandemic: convergent mixed methods study. JMIR Diabetes. 2022;7(2):e37882. [FREE Full text] [CrossRef] [Medline]
- Baptista S, Wadley G, Bird D, Oldenburg B, Speight J, My Diabetes Coach Research Group. Acceptability of an embodied conversational agent for type 2 diabetes self-management education and support via a smartphone app: mixed methods study. JMIR Mhealth Uhealth. 2020;8(7):e17038. [FREE Full text] [CrossRef] [Medline]
- Hurmuz MZM, Jansen-Kosterink SM, Op den Akker H, Hermens HJ. User experience and potential health effects of a conversational agent-based electronic health intervention: protocol for an observational cohort study. JMIR Res Protoc. 2020;9(4):e16641. [FREE Full text] [CrossRef] [Medline]
- Gong E, Baptista S, Russell A, Scuffham P, Riddell M, Speight J, et al. My diabetes coach, a mobile app-based interactive conversational agent to support type 2 diabetes self-management: randomized effectiveness-implementation trial. J Med Internet Res. 2020;22(11):e20322. [FREE Full text] [CrossRef] [Medline]
- Roca S, Lozano ML, García J, Alesanco Á. Validation of a virtual assistant for improving medication adherence in patients with comorbid type 2 diabetes mellitus and depressive disorder. Int J Environ Res Public Health. 2021;18(22):12056. [FREE Full text] [CrossRef] [Medline]
- Pimenta N, Félix IB, Monteiro D, Marques MM, Guerreiro MP. Promoting physical activity in older adults with type 2 diabetes an anthropomorphic conversational agent: development of an evidence and theory-based multi-behavior intervention. Front Psychol. 2022;13:883354. [FREE Full text] [CrossRef] [Medline]
- Balsa J, Félix I, Cláudio AP, Carmo MB, Silva ICE, Guerreiro A, et al. Usability of an intelligent virtual assistant for promoting behavior change and self-care in older people with type 2 diabetes. J Med Syst. 2020;44(7):130. [CrossRef] [Medline]
- Félix IB, Guerreiro MP, Cavaco A, Cláudio AP, Mendes A, Balsa J, et al. Development of a complex intervention to improve adherence to antidiabetic medication in older people using an anthropomorphic virtual assistant software. Front Pharmacol. 2019;10:680. [FREE Full text] [CrossRef] [Medline]
- Saritha AK. Medibot A predictive generic diabetic chatbot using bagging ensemble hybrid learning. Int J Eng Adv Technol. 2020;9(4):2249-8958. [FREE Full text]
- Buinhas S, Cláudio AP, Carmo MB, Balsa J, Cavaco A, Mendes A, et al. Virtual assistant to improve self-care of older people with type 2 diabetes: first prototype. In: Gerontechnology. Cham, Switzerland. Springer; 2019:236-248.
- Dhinagaran DA, Sathish T, Soong A, Theng Y, Best J, Tudor Car L. Conversational agent for healthy lifestyle behavior change: web-based feasibility study. JMIR Form Res. 2021;5(12):e27956. [FREE Full text] [CrossRef] [Medline]
- Dhinagaran DA, Car LT. Public perceptions of a healthy lifestyle change conversational agent in Singapore: a qualitative study. Digit Health. 2022;8:20552076221131190. [FREE Full text] [CrossRef] [Medline]
- Hussain S, Athula G. Extending a conventional chatbot knowledge base to external knowledge source and introducing user based sessions for diabetes education. 2018. Presented at: 32nd International Conference on Advanced Information Networking and Applications Workshops (WAINA); May 16-18, 2018:698-703; Krakow, Poland. URL: https://ieeexplore.ieee.org/abstract/document/8418155
- Rehman UU, Chang DJ, Jung Y, Akhtar U, Razzaq MA, Lee S. Medical instructed real-time assistant for patient with glaucoma and diabetic conditions. Appl Sci. 2020;10(7):2216. [CrossRef]
- Andrès E, Meyer L, Zulfiqar A, Hajjam M, Talha S, Bahougne T, et al. Telemonitoring in diabetes: evolution of concepts and technologies, with a focus on results of the more recent studies. J Med Life. 2019;12(3):203-214. [FREE Full text] [CrossRef] [Medline]
- Guyatt G, Oxman AD, Akl EA, Kunz R, Vist G, Brozek J, et al. GRADE guidelines: 1. Introduction-GRADE evidence profiles and summary of findings tables. J Clin Epidemiol. 2011;64(4):383-394. [CrossRef] [Medline]
- Johnson RB, Onwuegbuzie AJ. Mixed methods research: a research paradigm whose time has come. Educ Res. 2004;33(7):14-26. [CrossRef]
- The Lancet. AI in medicine: creating a safe and equitable future. Lancet. 2023;402(10401):503. [CrossRef] [Medline]
- Gilbert S. The EU passes the AI Act and its implications for digital medicine are unclear. NPJ Digit Med. 2024;7(1):135. [FREE Full text] [CrossRef] [Medline]
- Liu Q, Liu H, Qin Z, Ma C. Research on users' privacy protection policy of m-health application based on content analysis. Chin Hosp. 2019;23(09):20-23.
Abbreviations
AI: artificial intelligence |
EU: European Union |
HbA1c: hemoglobin A1c |
MD: mean difference |
MMR: mixed research method |
PRISMA: Preferred Reporting Items for Systematic Reviews and Meta-Analyses |
RCT: randomized controlled trial |
Edited by X Ma; submitted 09.05.24; peer-reviewed by K Rawlings, K Obaideen; comments to author 16.07.24; revised version received 14.08.24; accepted 14.10.24; published 03.12.24.
Copyright©Yibo Wu, Jinzi Zhang, Pu Ge, Tingyu Duan, Junyu Zhou, Yiwei Wu, Yuening Zhang, Siyu Liu, Xinyi Liu, Erya Wan, Xinying Sun. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 03.12.2024.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.