Methodologies Used to Study the Feasibility, Usability, Efficacy, and Effectiveness of Social Robots For Elderly Adults: Scoping Review

doi:10.2196/37434

Review

¹Department of Research and Development, Iberian Institute of Research in Psycho-Sciences, INTRAS Foundation, Zamora, Spain

²Psycho-Sciences Research Group, Salamanca Biomedical Research Institute, Salamanca University, Salamanca, Spain

³Psychiatry and Mental Health Service, Assistance Complex of Zamora, Zamora, Spain

⁴Department on Aging, Netherlands Institute of Mental Health and Addiction, Trimbos Insititute, Utrecht, Netherlands

Corresponding Author:

Aysan Mahmoudi Asl, MSc

Department of Research and Development

Iberian Institute of Research in Psycho-Sciences

INTRAS Foundation

C. de la Hiniesta, 137

Zamora, 49024

Spain

Phone: 34 604543985

Email: amahmoudi@ides.es

Background: New research fields to design social robots for older people are emerging. By providing support with communication and social interaction, these robots aim to increase quality of life. Because of the decline in functioning due to cognitive impairment in older people, social robots are regarded as promising, especially for people with dementia. Although study outcomes are hopeful, the quality of studies on the effectiveness of social robots for the elderly is still low due to many methodological limitations.

Objective: We aimed to review the methodologies used thus far in studies evaluating the feasibility, usability, efficacy, and effectiveness of social robots in clinical and social settings for elderly people, including persons with dementia.

Methods: Dedicated search strings were developed. Searches in MEDLINE (PubMed), Web of Science, PsycInfo, and CINAHL were performed on August 13, 2020.

Results: In the 33 included papers, 23 different social robots were investigated for their feasibility, usability, efficacy, and effectiveness. A total of 8 (24.2%) studies included elderly persons in the community, 9 (27.3%) included long-term care facility residents, and 16 (48.5%) included people with dementia. Most of the studies had a single aim, of which 7 (21.2%) focused on efficacy and 7 (21.2%) focused on effectiveness. Moreover, forms of randomized controlled trials were the most applied designs. Feasibility and usability were often studied together in mixed methods or experimental designs and were most often studied in individual interventions. Feasibility was often assessed with the Unified Theory of the Acceptance and Use of Technology model. Efficacy and effectiveness studies used a range of psychosocial and cognitive outcome measures. However, the included studies failed to find significant improvements in quality of life, depression, and cognition.

Conclusions: This study identified several shortcomings in methodologies used to evaluate social robots, resulting in ambivalent study findings. To improve the quality of these types of studies, efficacy/effectiveness studies will benefit from appropriate randomized controlled trial designs with large sample sizes and individual intervention sessions. Experimental designs might work best for feasibility and usability studies. For each of the 3 goals (efficacy/effectiveness, feasibility, and usability) we also recommend a mixed method of data collection. Multiple interaction sessions running for at least 1 month might aid researchers in drawing significant results and prove the real long-term impact of social robots.

J Med Internet Res 2022;24(8):e37434

doi:10.2196/37434

Keywords

aged; dementia; social robots; pet-bots; community settings; long-term care; methods; scoping review

In the next few decades, we expect the global population to age due to a combination of high life expectancy, low birth rates, and the baby boomer generation entering their senior years. By 2030, 33% of the population in Western Europe will be over 60 years of age [1]. Dementia is one of the most common neurodegenerative diseases that affects 50 million older people around the world, and it is projected to reach 155 million in 2050 [2].

Dementia is characterized by deterioration in memory, cognition, behavior, and ability to perform everyday activities [3]. It is estimated that approximately one-third of people with dementia live alone [4]. They experience unmet needs because of living alone and are at a higher risk of faster deterioration. In addition, people with dementia who live alone are considered at a higher risk of medication use problems, falls and injuries, inadequate self-care, trouble with activities of daily living, and reduced social networks [5-8].

In the past decades, technological advances coincided with the great health challenge of aging societies [9]. New research fields in assistive technology are dedicated to designing social robots for older adults with or without cognitive impairment to promote their quality of life (QoL) through communication and social interactions [10]. Social robots are intended to provide and facilitate social contact, psychosocial and cognitive stimulation, and have the potential to support elderly people to maintain their autonomy and independence and enhance their well-being [11].

Socially assistive robots (SARs) can be grouped into 2 main categories based on their feature and function: (1) service robots, and (2) companion robots [12]. The main task of these robots is to establish any form of interaction and communication. This function can be performed by SARs in multiple manners, such as through touch sensors, cameras, (robotic) body movements, tablet interfaces, and sound and speech systems. Within the subgroup of the companion robots, humanoid robots like Pepper and Nao provide users with advanced applications that provide leisure activities (music, photos, and games), cognitive and physical stimulation activities, and assistance with mental or physical tasks. Pet robots, such as PARO, AIBO, and NeCoro as substitutes for pets and companion animals are intended to provide emotional and physiological stimulation, have calming effects, and lead to mood improvements [13].

For the successful implementation and large-scale uptake of social robots or any other psychosocial intervention, their feasibility, usability, and cost-effectiveness should be perceived as good by the end users (people with dementia and healthy older adults), clinicians, and other stakeholders (eg, health care insurers and policy makers). The Monitoring and Evaluating Digital Health Interventions framework recommends evaluating 4 factors to integrate and implement a digital health intervention: (1) feasibility, to assesses whether the digital health system works as intended in a given context; (2) usability, to assess whether the digital health system can be used as intended by users; (3) efficacy, to assess whether the digital health intervention can achieve the intended results in a research (controlled) setting, and (4) effectiveness, to assess whether the digital health intervention can achieve the intended results in a nonresearch (uncontrolled) setting [14].

Despite the rising interest in social robots after the COVID-19 pandemic, there is limited evidence on the effectiveness of social robots in elderly care. The methodological quality of studies on the effectiveness of social robots in elderly adults is still low, and inappropriate study designs, samples, form, duration of interventions, and data collection methods have affected the strength of study outcomes [12].

Currently, there is no state-of-the-art proof of concept for study designs to evaluate the use of social robots for elderly people. Since the degenerative nature of dementia can cause methodological challenges, specific attention should be paid to studies that include people with dementia. To determine what the appropriate research methods are to study feasibility, usability, efficacy, and effectiveness, this article aims to review the methodologies used thus far in studies with social robots in clinical and social settings with elderly people to pave the way for future researchers in this field.

Protocol Registration

The protocol of this scoping review was registered in Open Science Framework (OSF) [15].

Search Strategy

Searches were conducted on August 13, 2020, in MEDLINE (PubMed), Web of Science, PsycInfo, and CINAHL databases. No time window was applied. Three search strings covering the topics “social robots,” “community setting,” and “elderly people” were constructed. For each database, reverent subject headings were adapted. For MEDLINE, we used the following strings and keywords: ((robotics[MeSH Terms] OR robot*)) AND ((humanoid OR companion OR social* OR “socially assistive” OR interact* OR android)), (((((((aging[MeSH Terms]) OR (aged[MeSH Terms])) OR (elderly[MeSH Terms])) OR (vulnerable population[MeSH Terms])) OR (senior)) OR (ageing)) OR (geriatric)) OR (old*), and (((((community health service[MeSH Terms]) OR (social support[MeSH Terms])) OR (residential facilities[MeSH Terms])) OR (independent living[MeSH Terms])) OR (social support[MeSH Major Topic])) OR (“community dwelling” OR “home dwelling” OR “care home” OR “in-home” OR “at home” OR “home-based” OR “home setting” OR “nursing home” OR home).

Selection Criteria

Publications potentially eligible for inclusion had to study a social robot that was physically embedded in an experimental or clinical study in people aged 65 or above. Studies were excluded if they were (1) conducted in an acute care setting; (2) conference abstracts, case studies, dissertations, books, or review papers; (3) published in a language other than English or Spanish; (4) solely reporting a robot design, development process, or theoretical models (5) stakeholder opinions on robots without any interaction; (6) involved in the implementation of new hardware or software or an assessment tool on a robot (such as assessing a fall detection sensor); and (7) involving telepresence robots with only video call features.

Selection Procedure

After the removal of duplicates, 2 reviewers (authors AM and MM) independently applied the inclusion and exclusion criteria in 3 steps, starting with screening titles, abstracts, and then full texts. The selections were compared, and in case of disagreement, discussed by the 2 reviewers. In cases where no consensus could be reached, a third reviewer was consulted (author HR).

Data Extraction

The literature was mapped according to the following areas of interest: author and country, robot name, the aim of the robot, aim of the study, type of outcome measure, study design, study sample, study setting, methodology of data collection, interaction scenario, relevant outcome measures, measurement instruments, results, and reported limitations of the study. Information was synthesized descriptively, and findings were narratively summarized according to the areas of interest.

The quality of the included articles was appraised independently by 2 authors (AM and MM), through the quality assessment of digital health interventions within the Monitoring and Evaluating Digital Health Interventions framework established by the World Health Organization (WHO) [14]. The tool includes a list of methodological study criteria comprising (1) 23 essential criteria for all types of studies and (2) essential criteria for qualitative and quantitative studies (3 criteria each). The extent to which criteria were met by studies was rated by 3 independent researchers on a 3-point scale (0= poor, 1= fair, 2= good). We calculated the percentage of agreement between the ratings (Multimedia Appendix 1). We also applied this framework to categorize the studies according to their aims in 4 categories: (1) feasibility, (2) usability, (3) efficacy, and (4) effectiveness.

General Findings

The search resulted in a total of 723 individual publications. After the screening process, 33 articles met the selection criteria (Figure 1). In 33 papers [11,16-45], 23 different social robots were evaluated among elderly adults and people with dementia in 13 different countries. Moreover, 19 studies specifically evaluated either feasibility, usability, efficacy, or effectiveness and were considered as single aim studies. The remaining studies (n=14) had multiple aims, evaluating 2 or 3 of the aforementioned study aims. Overall, feasibility was studied in 17 (51.5%) studies, usability in 13 (39.3%), effectiveness in 12 (36.3%), and efficacy in 10 (30.3%).

The quality appraisal identified that primary and secondary outcomes were clearly defined in all studies. Additionally, the methods of data collection were described well, but the eligibility of the participants was not reported in 12 (36.4%) papers. Moreover, 12 out of 33 (36.4%) papers did not present a clear description of the study design. Multimedia Appendices 2 and 3 show a summary of the characteristics, methodologies, and outcomes of the included studies.

Figure 1. PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flow diagram for literature search.

Robots

Of the 23 different social robots, PARO (n=10, 30.3%), Nao (n=5, 15.1%), AIBO (n=2, 6%), and Hobbit (n=4, 12.1%) were the most often investigated.

Participants and Settings

Of the 33 identified studies, 16 (48.4%) focused on people with dementia [11,18,23,24,27,29-35,38,39,43,44], 9 (27.3%) were performed in samples of residents of long-term care facilities whose cognitive status was not mentioned [17,20,22,26,28,36,40,41,45], and the remaining 8 (24.2%) focused on elderly people living in the community whose cognitive status was also not clearly revealed [16,19,22,25,37,42,46,47]. Moreover, 4 (12.1%) studies additionally recruited care staff [22,24,26,38]. The age range of older adults was 65-98 years. Of the included studies, 3 (9.1%) did not report the number of participants [17,20,26]. The sample sizes used in the studies ranged from 5 to 139.

The social robots were studied in long-term facilities (n=17, 51.5%), private households (n=8, 24.2%), and laboratory settings (n=4, 12.1%). Additional settings were based in a care organization (n=1, 3%), a daycare center for dementia (n=1, 3%), and a health service facility (n=1, 3%). Four (12.1%) studies investigated the robots in 2 different settings [22,33,43,46].

Study Aims, Designs, and Outcome Measures

Single Aim Studies

Of the included studies, 3 (9.1%) focused solely on feasibility, using quasi-experimental designs [22,27,46], and 1 (3%) explicitly focused on usability in a private home setting [37]. Additionally, 7 (21.2%) studies aimed at studying efficacy [31-35,38,44], of which 2 (28.5%) applied a form of randomized controlled trial (RCT) design, 1 (14.3%) randomized crossover design, 1 (14.3%) pretest-posttest design, and the other 3 (42.8%) a form of quasi-experimental design. The effectiveness of the robots was explicitly studied in 8 (24.2%) articles [17,20,24,36,40,41,43,45] using randomized designs, with 1 (12.5%) RCT, 2 (25%) blocked RCTs, 2 (25%) quasi-experimental designs, 1 (12.5%) pretest-posttest design, 1 (12.5%) cross-sectional, and 1 (12.5%) qualitative study. The impact of robots was evaluated on QoL (n=6, 18.2%), mood and depression (n=6, 18.2%), behavioral (n=6, 18.2%) and neuropsychiatric symptoms (n=2, 6.1%), emotions and affect (n=5, 15.2%), cognition (n=4, 12,1%), engagement (n=8, 24.2%), participation and social interaction (n=8, 24.2%), care burden (n=1, 3%), loneliness (n=1, 3%), and physiological indicators (n=3, 9.1%). The sample size for the 22 studies with effectiveness/efficacy aims ranged from 11 to 139 participants, and 7 (31.8%) of these studies included samples of over 40 participants.

Multiple Aim Studies

A total of 14 (42.4%) studies had multiple aims. Of these, 7 (50%) focused on feasibility and usability [11,16,19,22,25,28,44], of which 3 (42.9%) applied a mixed methods design, and the remaining 4 (57.1%) applied either an experimental design or a field trial. Meanwhile, 3 (9.1%) focused on feasibility, usability, and effectiveness, and all applied a mixed methods design [26,29,39]. Additionally, 1 (3%) study investigated feasibility and efficacy and applied an experimental design [30], 1 (3%) focused on feasibility, usability, and efficacy using a pretest-posttest design [18], and 1 (3%) assessed the feasibility and effectiveness of the robot, applying a nonrandomized controlled trial design [22].

Study Aims and Settings

Only 5 (27.8%) of the 18 studies aiming to evaluate feasibility and/or usability were performed in nursing home settings; 5 (27.8%) were performed in laboratory settings, and the remaining 8 (44.4%) were performed in private households. In 7 (38.9%) of the 18 studies, people with dementia and those with cognitive impairment were included. In the remaining 11 (61.1%) studies, the cognitive status of the participants was not clearly indicated.

Of the 22 (66.7%) studies that focused on efficacy or effectiveness, all but 4 (81.8%) [29,33,39,47] were performed in long-term care settings. These 4 (18.2%) were performed in private households and a daycare facility. Of these studies, 13 (59.1%) included cognitively impaired samples, only 1 (4.5%) study included community-dwelling elderly persons without disclosing their cognitive status, and the remaining 8 (36.4%) included long-term care residents.

Study Interventions

Interaction between study participants and social robots was mostly investigated during individual sessions (n=18, 54.5%). In 12 (36.4%) studies, interactions were studied in group sessions. Only 3 (9.1%) studies applied both individual and group interactions [11,20,33], while 1 (3%) demonstrated the task performance of the robot without any close interaction with study participants [19]. Feasibility and/or usability (n=11, 33.3%) were mostly studied in individual settings (n = 8, 72.7%); 1 (9.1%) study was performed in a group, and 1 (9.1%) was applied to both individual and group settings. Individual (n=7, 21.2%) and group settings (n=6, 18.2%) were used most often to study efficacy and effectiveness (n=15, 45.5%); 2 (6.1%) studies applied the intervention individually and in a group. In studies with multiple aims, 4 (28.6%) individual and 2 (14.3%) group setting interventions were found. In 2 (6.1%) studies, social robots were available in the residents’ lounge in nursing homes, and participants were free to interact with the social robots during scheduled time slots [22,45]. In 8 (24.2%) studies, the robots were installed in participants’ private homes for a duration of 5 days to 3 months [16,27,29,33,37,39,44,47].

A total of 9 (27.3%) studies executed 1 or 2 interactive sessions [11,19,22-24,28,30,32,34], of which 6 (66.7%) investigated the usability and feasibility of the robot, 1 (11.1%) investigated effectiveness, and 2 (22.2%) investigated efficacy. Most of the studies conducted more than 2 interactive sessions: 5 (15.2%) studied feasibility and/or usability, 12 (36.4%) studied efficacy or effectiveness, and 8 (24.2%) were multiple aim studies. The interactive sessions ran from 10 to 90 minutes a day for a maximum of 4 months.

Data Collection

We identified 4 methods of data collection: (1) questionnaires (n=26, 78.8%), (2) observations (physical and videotape) (n=19, 57.6%), (3) interviews (n=13, 39.4%), and physiological measurements (n=3, 9.1%). Figures 2 and 3 show the data collection methods and the responsible administrator of data for the identified data collection methods, respectively.

Figure 3. Responsible administrator of data for identified methods of data collection.

Measurement Instruments

Outcomes regarding feasibility were assessed with the Unified Theory of the Acceptance and Use of Technology (UTAUT) model adapted by Heerink [11,19,23,29,30,46,48], the Robot User Acceptance Scale, the Robot Attitude Scale, the Mind Perception Scale [21,47], and the Negative Attitudes Toward Robots Scale [16,37]. Of the included studies, 5 (15.2%) utilized questionnaires regarding robot functions and acceptance that were specifically developed for the study [19,22,23,25,46].

Studies exploring usability applied the System Usability Scale [22,25,28], a modification of the Usefulness, Satisfaction, and Ease of Use [26] scale, and the Technology Usage Inventory [39]. Two (6.1%) qualitative studies performed conversation and video analysis [27,28] to extract statements on acceptability and usability.

Efficacy and effectiveness outcomes were evaluated by a wide range of neuropsychosocial measurement instruments: (1) mood: Geriatric Depression Screening [49], the Cornell Scale For Depression in Dementia [50], Apparent Emotion Rating Instrument [51], University of California, Los Angeles (UCLA) Loneliness Scale [52], Observed Emotion Rating Scale [53]; (2) cognition: Montreal Cognitive Assessment [54], Mini-Mental State Examination [55]; (3) QoL: QoL Alzheimer Disease, Dementia QoL Questionnaire [56], QoL in Late-Stage Dementia [57]; (4) behavior: Neuropsychiatric Inventory [58], Gottfries-Bråne-Steen Scale [59], Apathy Evaluation Scale [60], Cohen-Mansfield Agitation Inventory [61], Apathy Inventory [62], Apathy Scale for Institutionalized People With Dementia Nursing Home Version [63]; (5) Participation and Interaction: Activity Participation Scale [40], and Assessments of Communication and Interaction Skills [64].

Among the studies applying questionnaires, 5 (19.2%) indirectly collected data via care staff and informal caregivers, and 13 (50%) directly collected data via the researchers.

Study Outcomes

Concerning social robots’ feasibility outcomes, almost all studies (n=16, 94.1%) deemed social robots acceptable. Nevertheless, 1 (5.9%) study reported mixed results on acceptability by care staff [22], and 1 (5.9%) did not find any significant results on quantitative measurements for acceptability, but qualitative results were positive [16]. In 3 (17.6%) studies, the perceived agency [21,47] and perceived enjoyment [46] were found to decrease over time.

The reported usability (n=12, 36.4%) was overall positive, except in 2 (16.7%) studies in which the usability was negatively affected by technical issues or lack of robustness of the robots [28,37]. Only 3 (9.1%) studies assessed affordability for Hobbit and Nao, in which the participants did not consider the social robots affordable and were skeptical of buying them [25,28,37].

Most of the findings endorse the use of social robots by older adults. Improvements were mostly found in emotion and mood [20,31,33-35,38,44], engagement [24,29,30], and participation and social interaction [20,31-33,40,45]. Increased job satisfaction of staff [22], self-report pain reduction [38], and improved global psychiatric symptoms [43] were the other positive study outcomes. There were findings of reduced challenging behavior [20,31,34], sense of loneliness [17], and stress levels [45]. However, dementia symptoms like agitation and other problematic symptoms did not improve in 1 (3%) study [33].

Meanwhile, 4 (12.1%) studies did not find a significant impact on QoL [19,21,43,47]. Only 1 (3%) study found a moderate-to-large positive effect on QoL of people with dementia [35]. Social robot interventions also failed to significantly improve depression [18,21,43,47], perceived social support [18], medication adherence [47], and cognition [24]. There were mixed results regarding physiological measures, such as urine tests measuring stress levels and blood pressure [33,45]. No author declared a proven negative effect of social robots on older adults.

Reported Study Limitations

Of the 33 studies, 7 (21.2%) did not report any study limitations [16,17,23,25,30,42,46]. A wide range of limitations was reported, and the most common barrier considered in 17 (51.5%) studies was the small sample size, which was mostly reported for efficacy and effectiveness studies. In the feasibility and/or usability studies, the limitations were mainly attributed to technical problems or interaction difficulties. The use of unvalidated questionnaires, homogeneity in the sex of the study sample, inadequate observation, and short duration of interventions were reported as other limitations in general.

Quality Appraisal

The inter-rater agreement for the quality appraisal was 86.1%. Reports of the description of study design, bias, and enrollment procedure were mostly rated as “fair.” In most of the articles, the sampling methods, confounding factors, missing data in quantitative studies, and reflexivity of data interpretation in qualitative studies were poorly reported. Other criteria were mostly rated as “good” (Multimedia Appendix 1). The quality appraisal revealed unclear descriptions or insufficient details in 5 (15.2%) studies, especially those in disciplines other than health research [25,30,42,44,45].

Principal Findings

The results of this scoping review revealed a variety of applied study methods in studies with social robots concerning study design, sample size, study setting, method of data collection, interaction scenario (the sequence, duration, and setting of the intervention), outcome measures, measurement instruments, study results, and reported limitations. Feasibility and usability were mainly studied on preprototype social robots in laboratory settings. Considering the relatively short history of the use of social robots in psychosocial interventions, it is crucial to determine the main features and functions of the robots to be considered in the design and development phase. Hence, usability, feasibility, and implementation should be strategic research aims. Fully developed robots such as PARO were evaluated in terms of effectiveness in real-world settings. Most of the identified studies aimed to determine the neuropsychosocial impact of social robots on older adults.

For the studies that explicitly fall within a feasibility and/or usability evaluation, researchers applied experimental, mixed method, and field trial designs, mostly applied outside nursing home care settings. This might imply that feasibility and/or usability for persons that are more severely cognitively impaired are currently understudied. Most of the studies verified the acceptability and usability of the robots within single or multiple interactive sessions in individual or group settings, and all these studies reported positive outcomes in varying degrees on the feasibility and/or usability of the social robots. The quantitative and qualitative data were collected mostly through questionnaires and interviews and a few by direct observation. Regarding this point, researchers should consider using the direct observational methodology to capture main factors of the interaction and emotional relationships fostered by robot use. Within the questionnaires and interview questions based on the UTAUT model, some concepts such as trust, anxiety, perceived enjoyment, and social support can change over time [37,46,47]. Therefore, longer use of the robots might reveal these changes and reduce the novelty effect over time [46].

Overall, efficacy and effectiveness studies were conducted on study populations either with cognitive impairment or residing in long-term care facilities. The studies with significant results [17,24,29-31,34-36,45] mostly employed experimental designs including RCTs and quasi-experimental designs with larger sample sizes and longer intervention periods compared to studies showing slight or no improvements. RCTs are likely to be the most appropriate design and a gold standard to confidently demonstrate that a specific intervention has resulted in a change in a process or a health outcome [14]. Biased assessment of outcomes and any confounding effects can be avoidable by large-scale RCTs. However, due to the degenerative character of dementia and personal differences in capacities of people with dementia, difficulties in randomizing subjects often arise [14]. Additionally, when using long study periods, the dropout rate might be high, as participants’ cognitive deterioration can hinder their continued participation in the study. On the other hand, when it is not feasible or ethical to conduct an RCT, a quasi-experimental design may serve best for collecting quantitative data. We also recommend randomized controlled block designs in the case of heterogeneous study samples. When, for instance, including people with dementia in studies with long intervention periods, the dementia levels alter. With a randomized controlled block design, some variables in different blocks can be controlled for, or comparable approaches can be applied within the blocks.

Studies targeting the efficacy and effectiveness of the robots delivered interventions diverged in format, duration, dosage, and location. Two (6.1%) studies [32,38] highlighted a need for individual intervention sessions tailored to users’ preferences and capacities, and the findings of another study confirmed this approach [65]. Additionally, individualized sessions could omit confounding factors by reducing the chance of interactions with the facilitator or other participants [66]. Group interventions may lower the odds of interaction between potential users and the robot, potentially lowering the effect of the intervention, especially when the intervention is delivered in a noisy setting with many participants [18]. The issues of small sample sizes and short interactions were considered major limitations in studies that failed to find significant results, and they are considered the toughest challenges for researchers in this field [66,67]. These limitations are often cojoined with the study setting. In nursing homes, a larger number of residents are often enrolled in a clinical trial, and the robots are not personalized but must be shared by the entire group. Whereas in private homes, studies are conducted with individuals or dyads, which creates better possibilities for personalization of the robot. Overall, the personalization of the intervention and alleviation of loneliness are 2 advantages of home-based clinical trials. However, there are some challenges to these types of studies, such as the need for several robots, implementation difficulty, and personalization, but it is nevertheless a step in the right direction. We observe a paradox, in that new or experimental robots are employed in research with low numbers of participants, whereas commercially available robots are tested on large study samples. Commercialization allows for better testing and evaluation, which is logical from an economical perspective. However, we urge that before robots are marketed, developers should study the feasibility and usability appropriately in the target group, as well as the effectiveness in a substantial study sample with sufficient power. After bringing the robot to the market, producers should continue to invest in studies to improve their product to tailor it optimally to their users. This should be a joined mission of producers and policy makers to improve the sustainability of health care systems.

Apart from the aforementioned limitations of the studies, some weak aspects of the study designs led to failure of the social robots’ impacts. For instance, a mismatch between the studied construct and the main aim of the robot may lead to the poor conclusion that the robot is not efficient. An example of this is the studies on PARO that failed to demonstrate significant results for cognition, as PARO is not developed to stimulate cognitive functioning [31,33,41,43]. Additionally, to capture significant results in constructs such as cognition, QoL, and depression, a long intervention period is necessary because these are constructs that do not change very quickly. In studies with people with dementia, it might also be useful to study stability of these constructs instead of improvement, since it is inherent to the disease that these constructs deteriorate over time. Regarding the broad concept of QoL ranging from physical health to psychological state and social relationships, the application of a suitable QoL measurement instrument that corresponds to the robot’s aim should be taken into account.

Implications For Efficacy and Effectiveness Studies

Appropriate RCTs with large sample sizes and individual interaction sessions running for longer than 1 month would serve best for such studies to draw relatively robust and reliable results.

Implications For Feasibility and Usability Studies

The study methods are similar for both aims, so researchers could apply the same design. Experimental designs with mixed methods of data collection are recommended for these studies. Multiple interaction sessions might reveal the changes in feasibility and usability.

Implication For Studies With Multiple Aims

We recommend performing separate studies for multiple aims since the study designs for each aim are comparable.

We gathered further practical recommendations through which future work may address existing shortcomings (Table 1). Regarding the mixed methods of data collection, studies suggest a combination of qualitative and quantitative methods for data collection, which will enable the researcher to capture different details in users’ responses and address different aspects of the research question. A mixed methods approach was helpful in studies that could not derive positive results from quantitative data but did from qualitative data [16]. Regarding the difficulties of recruiting many users in case of availability of just a few robots, these mixed methods should be mandatory. Even though we did not find any negative results regarding the intervention dosage, there are shreds of evidence of highly intense intervention resulting in negative effects or exhaustion [18]. Hence, the dose response for specific measures remain an open question for future researchers.

Table 1. Further implications and recommendations for future studies.

Area of consideration	Type of study	Recommendation
Participant and setting	Efficacy/effectiveness	Gender homogeneity ‎ Different levels of dementia ‎ Realistic environments ‎
Intervention and data collection	Efficacy/effectiveness	Multiple intervention sessions for longer than 1 month ‎
	Feasibility/usability	Initialization phase before trial ‎
	All types of studies	Well-trained observers and professionals ‎ Tailored interventions ‎ Include an intervention facilitator apart from an observer ‎ Consistent observations ‎ Standardized and validated measurement instruments ‎ A client-centered approach to intervention design ‎ A combination of qualitative and quantitative methods of data collection ‎ Observational study when including people with severe dementia ‎
Gap in the existing literature to be filled	Efficacy/effectiveness	Best response-dosage of intervention for particular measure and participant condition ‎ Characteristics of subjects who benefit most from the social robots ‎

Limitations and Strengths

Although the use of social robots is promising to support people with dementia, we did not include dementia specifically in the search strings, since this scoping review focused on elderly people in general. However, we believe that our search captured most of the studies executed in the field of dementia because many of the identified studies included people with dementia in either mixed or specific study samples. However, some relevant studies on elderly people with dementia may be missing in this review, as well as may studies that are only traceable in databases that were not taken into account in this review. The searches were conducted in scientific databases deemed the most viable to retrieve valid and reliable results for this scoping review. The exclusion of studies focusing only on the development phase of social robots can be considered a limitation of this study. Some information on the evaluation of the feasibility and usability executed in the development stage might have been missed. In addition, studies on telepresence robots were excluded due to their relatively simple features. Compared to pet robots and humanoid robots, telepresence robots are limited in interactions with users, which occur merely through a touch screen, making use of visual and audio stimuli but omitting other sensory stimulation. Although mainly used for medical visits, some telepresence robots might support social functioning. Information on studies performed on these robots might have been missed.

Our study is the first scoping review on the methodologies for studying social robots in elderly people and people with dementia. The existing reviews on this topic mostly focus on design, use, effectiveness, facilitators, and barriers to the implementation of social robots [12,66,67,68-73]. This study might support future researchers to design a research study on social robots in elderly adults and answer some study design queries.

Conclusions

This review narratively synthesizes information on the methodology of studying social robots in elderly adults and people with dementia. Relevant recommendations were formulated, directed for studies with specific aims that may aid future researchers in developing adequate study designs to evaluate social robots, allowing for more reliable information on study outcomes. Our research leads us to the conclusion that more studies with large sample sizes are needed on the effectiveness of social robots in private households of elderly adults and people with dementia to demonstrate the actual usefulness of social robots on delaying institutionalization by improving QoL, cognition, and social contact, and counteracting loneliness. Most of the identified studies focused on usability, and the robots appeared to be favorably accepted in most cases. It is time to encourage investigations in private homes to supplement existing knowledge about the effectiveness of robots and personalization of their functions. We expect that additional research will corroborate the impact of social robots on loneliness, mood, QoL, and social health in people with dementia and the elderly.

Acknowledgments

This research was carried out as part of the Marie Curie Innovative Training Network (ITN) action, H2020-MSCA-ITN-2018, under grant agreement number 813196.

Conflicts of Interest

MFM has participated in clinical trials with Lundbeck, Acadia, Janssen, Otsuka, Angellini, Oxygen, Roche, Servier, Boehringer Ingelheim, and Gw Research Limited.

‎

Multimedia Appendix 1

Quality appraisal of the included studies assessed by two independent reviewers.

DOCX File , 56 KB

‎

Multimedia Appendix 2

Characteristics and methodologies of the studies.

DOCX File , 43 KB

‎

Multimedia Appendix 3

Outcomes.

DOCX File , 24 KB

World Population Prospects: 2015 Revision. New York, NY: United Nations Department of Economic and Social Affairs; 2015:1-66.
Dementia: key facts. World Health Organization. URL: https://www.who.int/news-room/fact-sheets/detail/dementia [accessed 2022-06-17]
Wimo A. The global economic impact of dementia. 2010. URL: https://abramsonseniorcare.org/media/1199/observed-emotion-rating-scale.pdf [accessed 2022-06-21]
Ebly EM, Hogan DB, Rockwood K. Living alone with dementia. Dement Geriatr Cogn Disord 1999 Nov 4;10(6):541-548. [CrossRef] [Medline]
Miranda-Castillo C, Woods B, Orrell M. People with dementia living alone: what are their needs and what kind of support are they receiving? Int Psychogeriatr 2010 Mar 10;22(4):607-617. [CrossRef]
Webber PA, Fox P, Burnette D. Living alone with Alzheimer's disease: effects on health and social service utilization patterns. Gerontologist 1994 Feb 01;34(1):8-14. [CrossRef] [Medline]
Meaney A, Croke M, Kirby M. Needs assessment in dementia. Int J Geriatr Psychiatry 2005 Apr;20(4):322-329. [CrossRef] [Medline]
Wilcox J. Identifying the support needs of people with dementia and older people with mental illness on a Joint Community Team: A preliminary report. J Ment Health 2009 Jul 06;4(2):157-164. [CrossRef]
Perrier E. Positive Disruption: Healthcare, Ageing and Participation in the Age of Technology. Sydney, Australia: The McKell Institute; Sep 2015:2015.
Mordoch E, Osterreicher A, Guse L, Roger K, Thompson G. Use of social commitment robots in the care of elderly people with dementia: a literature review. Maturitas 2013 Jan;74(1):14-20. [CrossRef] [Medline]
Fan J, Bian D, Zheng Z, Beuscher L, Newhouse P, Mion L, et al. A robotic coach architecture for elder care (ROCARE) based on multi-user engagement models. IEEE Trans Neural Syst Rehabil Eng 2017 Aug;25(8):1153-1163. [CrossRef]
Abdi J, Al-Hindawi A, Ng T, Vizcaychipi MP. Scoping review on the use of socially assistive robot technology in elderly care. BMJ Open 2018 Feb 12;8(2):e018815 [FREE Full text] [CrossRef] [Medline]
Robinson H, Macdonald B, Kerse N, Broadbent E. The psychosocial effects of a companion robot: a randomized controlled trial. J Am Med Dir Assoc 2013 Sep;14(9):661-667. [CrossRef] [Medline]
Mitchell K. Interventions. In: Monitoring and Evaluating Digital Health Interventions: A Practical Guide to Conducting Research and Assessment. Cambridge, UK: Cambridge University Press; 2010:15-22.
OSF. 2021. URL: https://osf.io/vpyg2/ [accessed 2022-02-14]
Bajones M, Fischinger D, Weiss A, Puente P, Wolf D, Vincze M, et al. Results of field trials with a mobile service robot for older adults in 16 private households. J Hum-Robot Interact 2020 Feb 07;9(2):1-27. [CrossRef]
Banks MR, Willoughby LM, Banks WA. Animal-assisted therapy and loneliness in nursing homes: use of robotic versus living dogs. J Am Med Dir Assoc 2008 Mar;9(3):173-177. [CrossRef] [Medline]
Barrett E, Burke M, Whelan S, Santorelli A, Oliveira BL, Cavallo F, et al. Evaluation of a companion robot for individuals with dementia: quantitative findings of the MARIO project in an Irish residential care setting. J Gerontol Nurs 2019 Jul 01;45(7):36-45. [CrossRef] [Medline]
Beer J, Prakash A, Smarr CA, Chen TL, Hawkins K, Nguyen H, et al. Older users' acceptance of an assistive robot: attitudinal changes following brief exposure. Gerontechnology 2017 Mar 29;16(1):21-36 [FREE Full text] [CrossRef] [Medline]
Birks M, Bodak M, Barlas J, Harwood J, Pether M. Robotic seals as therapeutic tools in an aged care facility: a qualitative study. J Aging Res 2016;2016:8569602-8569607 [FREE Full text] [CrossRef] [Medline]
Broadbent E, Kerse N, Peri K, Robinson H, Jayawardena C, Kuo T, et al. Benefits and problems of health-care robots in aged care settings: A comparison trial. Australas J Ageing 2016 Mar;35(1):23-29. [CrossRef] [Medline]
Cavallo F, Esposito R, Limosani R, Manzi A, Bevilacqua R, Felici E, et al. Robotic services acceptance in smart environments with older adults: user satisfaction and acceptability study. J Med Internet Res 2018 Sep 21;20(9):e264 [FREE Full text] [CrossRef] [Medline]
Cavallo F, Limosani R, Manzi A, Bonaccorsi M, Esposito R, Di Rocco M, et al. Development of a socially believable multi-robot solution from town to home. Cogn Comput 2014 Jul 3;6(4):954-967. [CrossRef]
Chu M, Khosla R, Khaksar SMS, Nguyen K. Service innovation through social robot engagement to improve dementia care quality. Assist Technol 2016 Apr 11;29(1):8-18. [CrossRef]
Fischinger D, Einramhof P, Papoutsakis K, Wohlkinger W, Mayer P, Panek P, et al. Hobbit, a care robot supporting independent living at home: First prototype and lessons learned. Rob Auton Syst 2016 Jan;75:60-78. [CrossRef]
Huisman C, Kort H. Two-year use of care robot Zora in Dutch nursing homes: an evaluation study. Healthcare (Basel) 2019 Feb 19;7(1):31 [FREE Full text] [CrossRef] [Medline]
Inoue T, Nihei M, Narita T, Onoda M, Ishiwata R, Mamiya I, et al. Field-based development of an information support robot for persons with dementia. TAD 2012 Dec 12;24(4):263-271. [CrossRef]
Olde Keizer RACM, van Velsen L, Moncharmont M, Riche B, Ammour N, Del Signore S, et al. Using socially assistive robots for monitoring and preventing frailty among older adults: a study on usability and user experience challenges. Health Technol 2019 Apr 9;9(4):595-605. [CrossRef]
Khosla R, Chu M, Khaksar SMS, Nguyen K, Nishida T. Engagement and experience of older people with socially assistive robots in home care. Assist Technol 2021 Mar 04;33(2):57-71. [CrossRef] [Medline]
Khosla R, Nguyen K, Chu M. Human robot engagement and acceptability in residential aged care. Int J Hum-Comput Interact 2016 Dec 27;33(6):510-522. [CrossRef]
Koh IS, Kang HS. Effects of intervention using PARO on the cognition, emotion, problem behavior, and social interaction of elderly people with dementia. J Korean Acad Community Health Nurs 2018;29(3):300. [CrossRef]
Kramer SC, Friedmann E, Bernstein PL. Comparison of the effect of human interaction, animal-assisted therapy, and AIBO-assisted therapy on long-term care residents with dementia. Anthrozoös 2015 Apr 28;22(1):43-57. [CrossRef]
Liang A, Piroth I, Robinson H, MacDonald B, Fisher M, Nater UM, et al. A pilot randomized trial of a companion robot for people with dementia living in the community. J Am Med Dir Assoc 2017 Oct 01;18(10):871-878. [CrossRef] [Medline]
Libin A, Cohen-Mansfield J. Therapeutic robocat for nursing home residents with dementia: preliminary inquiry. Am J Alzheimers Dis Other Demen 2004 Jun 30;19(2):111-116 [FREE Full text] [CrossRef] [Medline]
Moyle W, Cooke M, Beattie E, Jones C, Klein B, Cook G, et al. Exploring the effect of companion robots on emotional expression in older adults with dementia: a pilot randomized controlled trial. J Gerontol Nurs 2013 May;39(5):46-53. [CrossRef] [Medline]
Obayashi K, Kodate N, Masuyama S. Enhancing older people’s activity and participation with socially assistive robots: a multicentre quasi-experimental study using the ICF framework. Adv Robot 2018 Oct 14;32(22):1207-1216. [CrossRef]
Pripfl J, Körtner T, Batko-Klein D, Hebesberger D, Weninger M, Gisinger C. Social service robots to support independent living : Experiences from a field trial. Z Gerontol Geriatr 2016 Jun;49(4):282-287. [CrossRef] [Medline]
Pu L, Moyle W, Jones C. How people with dementia perceive a therapeutic robot called PARO in relation to their pain and mood: A qualitative study. J Clin Nurs 2020 Feb 02;29(3-4):437-446. [CrossRef] [Medline]
Schüssler S, Zuschnegg J, Paletta L, Fellner M, Lodron G, Steiner J, et al. Effects of a humanoid socially assistive robot versus tablet training on psychosocial and physical outcomes of persons with dementia: protocol for a mixed methods study. JMIR Res Protoc 2020 Feb 03;9(2):e14927 [FREE Full text] [CrossRef] [Medline]
Sung H, Chang S, Chin M, Lee W. Robot-assisted therapy for improving social interactions and activity participation among institutionalized older adults: a pilot study. Asia Pac Psychiatry 2015 Mar 01;7(1):1-6. [CrossRef] [Medline]
Thodberg K, Sørensen L, Videbech P, Poulsen P, Houbak B, Damgaard V, et al. Behavioral responses of nursing home residents to visits from a person with a dog, a robot seal or a toy cat. Anthrozoös 2016 Mar 08;29(1):107-121. [CrossRef]
Zsiga K, Tóth A, Pilissy T, Péter O, Dénes Z, Fazekas G. Evaluation of a companion robot based on field tests with single older adults in their homes. Assist Technol 2018;30(5):259-266. [CrossRef] [Medline]
Valentí Soler M, Agüera-Ortiz L, Olazarán Rodríguez J, Mendoza Rebolledo C, Pérez Muñoz A, Rodríguez Pérez I, et al. Social robots in advanced dementia. Front Aging Neurosci 2015;7:133 [FREE Full text] [CrossRef] [Medline]
Wada K, Shibata T, Saito T, Tanie K. Robot assisted activity at a health service facility for the aged for ten weeks: An interim report of a long-term experiment. P I MECH ENG I-J SYS 2006 Dec 27;220(8):709-715. [CrossRef]
Wada K, Shibata T. Social and physiological influences of robot therapy in a care house. IS 2008 May 26;9(2):258-276. [CrossRef]
Torta E, Werner F, Johnson DO, Juola JH, Cuijpers RH, Bazzani M, et al. Evaluation of a small socially-assistive humanoid robot in intelligent homes for the care of the elderly. J Intell Robot Syst 2014;76:57-71. [CrossRef]
Broadbent E, Peri K, Kerse N, Jayawardena C, Kuo I, Datta C, et al. Robots in older people’s homes to improve medication adherence and quality of life: a randomised cross-over trial. ICSR 2014;8755. [CrossRef]
Heerink M, Kröse B, Evers V, Wielinga B. Assessing acceptance of assistive social agent technology by older adults: the Almere model. Int J of Soc Robotics 2010 Sep 4;2(4):361-375. [CrossRef]
Yesavage JA, Brink T, Rose TL, Lum O, Huang V, Adey M, et al. Development and validation of a geriatric depression screening scale: A preliminary report. J Psychiatr Res 1982 Jan;17(1):37-49. [CrossRef]
Alexopoulos GS, Abrams RC, Young RC, Shamoian CA. Cornell scale for depression in dementia. Biol Psychiatry 1988 Feb;23(3):271-284. [CrossRef]
Snyder M, Ryden M, Shaver P, Wang J, Savik K, Gross C, et al. The apparent emotion rating instrument. Clinical Gerontologist 2008 Oct 11;18(4):17-29. [CrossRef]
Russell DW. UCLA Loneliness Scale (Version 3): reliability, validity, and factor structure. J Pers Assess 1996 Feb;66(1):20-40. [CrossRef] [Medline]
Lawton MP. Observed Emotion Rating Scale. 1999. URL: https://abramsonseniorcare.org/media/1199/observed-emotion-rating-scale.pdf [accessed 2022-06-21]
Nasreddine Z, Phillips NA, Bédirian V, Charbonneau S, Whitehead V, Collin I, et al. The Montreal Cognitive Assessment, MoCA: a brief screening tool for mild cognitive impairment. J Am Geriatr Soc 2005 Apr;53(4):695-699. [CrossRef] [Medline]
Folstein MF, Folstein SE, McHugh PR. "Mini-mental state". A practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res 1975 Nov;12(3):189-198. [CrossRef] [Medline]
The WHOQOL Group. Development of the World Health Organization WHOQOL-BREF quality of life assessment. Psychol Med 1998 May 01;28(3):551-558. [CrossRef] [Medline]
Weiner MF, Martin-Cook K, Svetlik DA, Saine K, Foster B, Fontaine CS. The quality of life in late-stage dementia (QUALID) scale. J Am Med Dir Assoc 2000;1(3):114-116. [Medline]
Cummings JL. The Neuropsychiatric Inventory: assessing psychopathology in dementia patients. Neurology 1997 May 01;48(5 Suppl 6):S10-S16. [CrossRef] [Medline]
Bråne G, Gottfries CG, Winblad B. The Gottfries-Bråne-Steen scale: validity, reliability and application in anti-dementia drug trials. Dement Geriatr Cogn Disord 2001;12(1):1-14. [CrossRef] [Medline]
Marin RS, Biedrzycki RC, Firinciogullari S. Reliability and validity of the apathy evaluation scale. Psychiatry Research 1991 Aug;38(2):143-162. [CrossRef]
Cohen-Mansfield J. Instruction Manual for the Cohen-Mansfield Agitation Inventory (CMAI). 1991. URL: https://uiowa2.instructure.com/files/22132/download?download_frd=1 [accessed 2022-06-21]
Robert P, Clairet S, Benoit M, Koutaich J, Bertogliati C, Tible O, et al. The apathy inventory: assessment of apathy and awareness in Alzheimer's disease, Parkinson's disease and mild cognitive impairment. Int J Geriatr Psychiatry 2002 Dec;17(12):1099-1105. [CrossRef] [Medline]
Agüera-Ortiz L, Gil-Ruiz N, Cruz-Orduña I, Ramos-García I, Osorio RS, Valentí-Soler M, et al. A novel rating scale for the measurement of apathy in institutionalized persons with dementia: the APADEM-NH. Am J Geriatr Psychiatry 2015 Feb;23(2):149-159. [CrossRef] [Medline]
Forsyth K, Lai J, Kielhofner G. The Assessment of Communication and Interaction Skills (ACIS): measurement properties. Br J Occup Ther 2016 Nov 05;62(2):69-74. [CrossRef]
Kachouie R, Sedighadeli S, Khosla R, Chu M. Socially assistive robots in elderly care: a mixed-method systematic literature review. Int J Hum-Comput Interact 2014 Apr;30(5):369-393. [CrossRef]
Pu L, Moyle W, Jones C, Todorovic M. The effectiveness of social robots for older adults: a systematic review and meta-analysis of randomized controlled studies. Gerontologist 2019 Jan 09;59(1):e37-e51. [CrossRef] [Medline]
Bemelmans R, Gelderblom GJ, Jonker P, de Witte L. Socially assistive robots in elderly care: a systematic review into effects and effectiveness. J Am Med Dir Assoc 2012 Feb;13(2):114-120.e1. [CrossRef] [Medline]
Oliveira R, Arriaga P, Santos FP, Mascarenhas S, Paiva A. Towards prosocial design: A scoping review of the use of robots and virtual agents to trigger prosocial behaviour. Comput Hum Behav 2021 Jan;114:106547. [CrossRef]
Koh WQ, Ang FXH, Casey D. Impacts of low-cost robotic pets for older adults and people with dementia: scoping review. JMIR Rehabil Assist Technol 2021 Feb 12;8(1):e25340 [FREE Full text] [CrossRef] [Medline]
Góngora Alonso S, Hamrioui S, de la Torre Díez I, Motta Cruz E, López-Coronado M, Franco M. Social robots for people with aging and dementia: a systematic review of literature. Telemed J E Health 2019 Jul;25(7):533-540. [CrossRef] [Medline]
Hirt J, Ballhausen N, Hering A, Kliegel M, Beer T, Meyer G. Social robot interventions for people with dementia: a systematic review on effects and quality of reporting. J Alzheimers Dis 2021;79(2):773-792 [FREE Full text] [CrossRef] [Medline]
Ozdemir D, Cibulka J, Stepankova O, Holmerova I. Design and implementation framework of social assistive robotics for people with dementia - a scoping review. Health Technol 2021 Feb 05;11(2):367-378. [CrossRef]
Koh WQ, Felding SA, Toomey E, Casey D. Barriers and facilitators to the implementation of social robots for older adults and people with dementia: a scoping review protocol. Syst Rev 2021 Feb 05;10(1):49 [FREE Full text] [CrossRef] [Medline]

‎

ADL: activities of daily living

ITN: innovative training network

LTCF: long-term care facility

OSF: Open Science Framework

QoL: quality of life

RCT: randomized controlled trial

SAR: socially assistive robot

UCLA: University of California, Los Angeles

UTAUT: Unified Theory of the Acceptance and Use of Technology

WHO: World Health Organization

Edited by R Kukafka; submitted 21.02.22; peer-reviewed by I Guemghar, V Stara; comments to author 28.03.22; revised version received 22.05.22; accepted 08.06.22; published 01.08.22

©Aysan Mahmoudi Asl, Mauricio Molinari Ulate, Manuel Franco Martin, Henriëtte van der Roest. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 01.08.2022.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

This paper is in the following e-collection/theme issue:

Methodologies Used to Study the Feasibility, Usability, Efficacy, and Effectiveness of Social Robots For Elderly Adults: Scoping Review