Establishing a Health CASCADE–Curated Open-Access Database to Consolidate Knowledge About Co-Creation: Novel Artificial Intelligence–Assisted Methodology Based on Systematic Reviews

doi:10.2196/45059

Review

¹School of Health and Life Sciences, Glasgow Caledonian University, Glasgow, United Kingdom

²Department of Community Medicine and Rehabilitation, Umeå University, Umeå, Sweden

³Institute of Informatics and Telecommunications, National Centre of Scientific Research Demokritos, Athens, Greece

⁴Department of Digital Systems, University of Piraeus, Piraeus, Greece

⁵Department of Public Health and Primary Care, Ghent University, Ghent, Belgium

⁶Department of Movement and Sports Sciences, Ghent University, Ghent, Belgium

⁷Department of Sport Sciences, Faculty of Psychology Education and Sport Sciences, Blanquerna - Ramon Llull University, Barcelona, Spain

⁸Department of Physical Therapy, Faculty of Health Sciences, Blanquerna - Ramon Llull University, Barcelona, Spain

⁹Department of Public and Occupational Health, Amsterdam Public Health Research Institute, Amsterdam University Medical Centers, Vrije Universiteit Amsterdam, Amsterdam, Netherlands

¹⁰Yunus Centre, Glasgow Caledonian University, Glasgow, United Kingdom

*these authors contributed equally

Corresponding Author:

Danielle Marie Agnello, BS, MSc

School of Health and Life Sciences

Glasgow Caledonian University

Cowcaddens Road

Glasgow, G4 0BA

United Kingdom

Phone: 44 7871788785

Email: danielle.agnello@gcu.ac.uk

Background: Co-creation is an approach that aims to democratize research and bridge the gap between research and practice, but the potential fragmentation of knowledge about co-creation has hindered progress. A comprehensive database of published literature from multidisciplinary sources can address this fragmentation through the integration of diverse perspectives, identification and dissemination of best practices, and increase clarity about co-creation. However, two considerable challenges exist. First, there is uncertainty about co-creation terminology, making it difficult to identify relevant literature. Second, the exponential growth of scientific publications has led to an overwhelming amount of literature that surpasses the human capacity for a comprehensive review. These challenges hinder progress in co-creation research and underscore the need for a novel methodology to consolidate and investigate the literature.

Objective: This study aimed to synthesize knowledge about co-creation across various fields through the development and application of an artificial intelligence (AI)–assisted selection process. The ultimate goal of this database was to provide stakeholders interested in co-creation with relevant literature.

Methods: We created a novel methodology for establishing a curated database. To accommodate the variation in terminology, we used a broad definition of co-creation that encompassed the essence of existing definitions. To filter out irrelevant information, an AI-assisted selection process was used. In addition, we conducted bibliometric analyses and quality control procedures to assess content and accuracy. Overall, this approach allowed us to develop a robust and reliable database that serves as a valuable resource for stakeholders interested in co-creation.

Results: The final version of the database included 13,501 papers, which are indexed in Zenodo and accessible in an open-access downloadable format. The quality assessment revealed that 20.3% (140/688) of the database likely contained irrelevant material, whereas the methodology captured 91% (58/64) of the relevant literature. Participatory and variations of the term co-creation were the most frequent terms in the title and abstracts of included literature. The predominant source journals included health sciences, sustainability, environmental sciences, medical research, and health services research.

Conclusions: This study produced a high-quality, open-access database about co-creation. The study demonstrates that it is possible to perform a systematic review selection process on a fragmented concept using human-AI collaboration. Our unified concept of co-creation includes the co-approaches (co-creation, co-design, and co-production), forms of participatory research, and user involvement. Our analysis of authorship, citations, and source landscape highlights the potential lack of collaboration among co-creation researchers and underscores the need for future investigation into the different research methodologies. The database provides a resource for relevant literature and can support rapid literature reviews about co-creation. It also offers clarity about the current co-creation landscape and helps to address barriers that researchers may face when seeking evidence about co-creation.

J Med Internet Res 2023;25:e45059

doi:10.2196/45059

Keywords

co-creation (24); co-production (10); co-design (203); database (76); participatory (38); methodology (70); artificial intelligence (1819)

Background

Co-creation emerges from the participatory research paradigm, is grounded in a history of community engagement, and was first documented in the 1970s [Grönroos C. Conceptualising value co-creation: a journey to the 1970s and back to the future. J Mark Manag. 2012;28(13-14):1520-1534. [FREE Full text] [CrossRef]1]. According to Greenhalgh et al [Greenhalgh T, Jackson C, Shaw S, Janamian T. Achieving research impact through co-creation in community-based health services: literature review and case study. Milbank Q. Jun 2016;94(2):392-429. [FREE Full text] [CrossRef] [Medline]2], co-creation is a promising strategy to increase the effectiveness and impact of health interventions and claims to address complex problems in public health (eg, the obesity epidemic, persisting poverty, and food insecurity), which are particularly resistant to resolution, specifically in vulnerable populations [Greenhalgh T, Jackson C, Shaw S, Janamian T. Achieving research impact through co-creation in community-based health services: literature review and case study. Milbank Q. Jun 2016;94(2):392-429. [FREE Full text] [CrossRef] [Medline]2,Halvorsrud K, Kucharska J, Adlington K, Rüdell K, Brown Hajdukova E, Nazroo J, et al. Identifying evidence of effectiveness in the co-creation of research: a systematic review and meta-analysis of the international healthcare literature. J Public Health (Oxf). Apr 12, 2021;43(1):197-208. [FREE Full text] [CrossRef] [Medline]3]. Co-creation also responds to the growing demand from the public to be involved in research aimed at ensuring health interventions are better tailored to their needs and circumstances [Greenhalgh T, Jackson C, Shaw S, Janamian T. Achieving research impact through co-creation in community-based health services: literature review and case study. Milbank Q. Jun 2016;94(2):392-429. [FREE Full text] [CrossRef] [Medline]2]. Furthermore, the explicit reference to co-creation in many fields of science and sectors has grown exponentially in recent years, and it is now expected, and often demanded, by funders, governments, and policy makers to democratize and accelerate the impact of research and innovation [Williams O, Sarre S, Papoulias SC, Knowles S, Robert G, Beresford P, et al. Lost in the shadows: reflections on the dark side of co-production. Health Res Policy Syst. May 07, 2020;18(1):43. [FREE Full text] [CrossRef] [Medline]4].

Co-creation aims to engage stakeholders from all the nodes of the quadruple helix (academia, industry, government, and users) to create collectively effective and sustainable solutions. However, without an evidence base or a robust and trustworthy methodology, using co-creation can remain a blind praxis [Schütz F, Heidingsfelder ML, Schraudner M. Co-shaping the future in quadruple helix innovation systems: uncovering public preferences toward participatory research and innovation. She Ji J Des Econ Innov. 2019;5(2):128-146. [FREE Full text] [CrossRef]5]. Co-creation is resource intensive and, if conducted poorly, can waste time and resources without clear impacts and outputs. There is a real danger that this will ultimately lead to growing distrust toward co-creation from researchers, the public, policy makers, and other key stakeholders. More importantly, it might put the public at risk of detrimental effects and potential unintended consequences (sensitive information leaked, ethical misconduct, etc) from poorly evidenced and poorly conducted co-creation [Williams O, Sarre S, Papoulias SC, Knowles S, Robert G, Beresford P, et al. Lost in the shadows: reflections on the dark side of co-production. Health Res Policy Syst. May 07, 2020;18(1):43. [FREE Full text] [CrossRef] [Medline]4].

Presently, the progress of co-creation in public health is hindered by the limited availability of comprehensive and integrated literature and empirical evidence, further complicated by the absence of a standardized methodology. Assembling a database of published literature from multidisciplinary sources is a key step toward facilitating the integration of diverse perspectives and enabling the identification and dissemination of best practices across multiple fields of research. However, there are two considerable challenges to achieving this. First, there is a lack of clarity about co-creation, including the interchangeable use of terminology when referring to co-creation. Second, the rapid increase in scientific publishing means that obtaining exhaustive and comprehensive literature requires dealing with a vast body of information, making it almost impossible to complete it on time within the standards for quality research. For instance, by the time the research is completed, it may already be outdated [Shojania KG, Sampson M, Ansari MT, Ji J, Doucette S, Moher D. How quickly do systematic reviews go out of date? a survival analysis. Ann Intern Med. Aug 21, 2007;147(4):224-233. [FREE Full text] [CrossRef] [Medline]6].

Fragmentation of Knowledge About Co-Creation

Bauman [Bauman A. A personal reflection on co-creation in public health: a dream partly realised. Public Health Res Pract. Jun 15, 2022;32(2):3222210. [FREE Full text] [CrossRef] [Medline]7] and Iwarsson et al [Iwarsson S, Edberg AK, Ivanoff SD, Hanson E, Jönson H, Schmidt S. Understanding user involvement in research in aging and health. Gerontol Geriatr Med. Dec 29, 2019;5:2333721419897781. [FREE Full text] [CrossRef] [Medline]8] highlight the interchangeable use of the terms co-creation, co-design, co-production, participatory research, patient and public involvement, and collaborative research when describing a development process involving multiple stakeholders. Vargas et al [Vargas C, Whelan J, Brimblecombe J, Allender S. Co-creation, co-design, co-production for public health - a perspective on definition and distinctions. Public Health Res Pract. Jun 15, 2022;32(2):3222211. [FREE Full text] [CrossRef] [Medline]9] argued that co-creation is considered an overarching principle that includes both co-design and co-production. Co-creation has been used in different fields, for example, it can be perceived as user involvement in designing goods, as well as codelivery of public services [Vargas C, Whelan J, Brimblecombe J, Allender S. Co-creation, co-design, co-production for public health - a perspective on definition and distinctions. Public Health Res Pract. Jun 15, 2022;32(2):3222211. [FREE Full text] [CrossRef] [Medline]9]. In addition, Masterson et al [Masterson D, Areskoug Josefsson K, Robert G, Nylander E, Kjellström S. Mapping definitions of co-production and co-design in health and social care: a systematic scoping review providing lessons for the future. Health Expect. Jun 2022;25(3):902-913. [FREE Full text] [CrossRef] [Medline]10] recently uncovered >500 different definitions of co-production and co-design, and the rate of introducing new definitions is increasing over the past decade.

The use of interchangeable terms and varying definitions in co-creation research has led to potential conceptual fragmentation [Taylor H, Vickers P. Conceptual fragmentation and the rise of eliminativism. Euro Jnl Phil Sci. 2017;7(1):17-40. [FREE Full text] [CrossRef]11]. Messiha et al [Messiha K, Chinapaw MJ, Ket HC, An Q, Anand-Kumar V, Longworth GR, et al. Systematic review of contemporary theories used for co-creation, co-design and co-production in public health. J Public Health (Oxf) (forthcoming). May 05, 2023:fdad046. [CrossRef] [Medline]12] recently discussed this issue when investigating systematic theory building, highlighting the risk of knowledge fragmentation when inconsistent terminologies related to co-creation are used. Furthermore, Gray et al [Gray PH, Meister DB. Introduction: fragmentation and integration in knowledge management research. Inf Technol People. Sep 2003;16(3):259-265. [FREE Full text] [CrossRef]13] discussed the work by Hirsch and Levin [Hirsch PM, Levin DZ. Umbrella advocates versus validity police: a life-cycle model. Organ Sci. Mar 1999;10(2):199-212. [FREE Full text]14] in characterizing the fragmentation process in organizational sciences and highlighting the life cycle of emerging ideas. When research emphasizes differences between aspects of a central idea, it can lead to divergence, resulting in nonoverlapping ideas and an increasing prevalence of contextualizing research, ultimately leading to fragmentation [Gray PH, Meister DB. Introduction: fragmentation and integration in knowledge management research. Inf Technol People. Sep 2003;16(3):259-265. [FREE Full text] [CrossRef]13]. Consequently, fragmentation and poor communication can impede scientific progress and undermine the original concept [Gray PH, Meister DB. Introduction: fragmentation and integration in knowledge management research. Inf Technol People. Sep 2003;16(3):259-265. [FREE Full text] [CrossRef]13,Balietti S, Mäs M, Helbing D. On disciplinary fragmentation and scientific progress. PLoS One. Mar 19, 2015;10(3):e0118747. [FREE Full text] [CrossRef] [Medline]15]. Although it is encouraging that many stakeholders recognize the value of co-creation, fragmentation hinders knowledge sharing and the effective application of co-creation in practice. Therefore, a unified concept of co-creation is necessary to address the complexity arising from the broad and interchangeable use of the terminology.

Given the current situation, it is increasingly challenging to gain a comprehensive overview of co-creation or to gather appropriate theoretical and methodological support in the literature [Steiner A, Farmer J, Yates S, Moran M, Carlisle K. How to systematically analyze co-production to inform future policies? introducing 5Ws of co-production. Public Adm Rev. Oct 25, 2022;83(3):503-521. [FREE Full text] [CrossRef]16]. This is compounded by the fact that in recent years, the overall scientific publication output has been increasing at an annual rate of 3% from 1980 to 2012, and the volume of publications doubles every 24 years [Bornmann L, Mutz R. Growth rates of modern science: a bibliometric analysis based on the number of publications and cited references. J Assoc Inf Sci Technol. Nov 01, 2015;66(11):2215-2222. [FREE Full text] [CrossRef]17,Anderson C, Peterson CJ, Dennis JA. Mass publication during the COVID-19 pandemic: too much of a good thing? Southwest Respir Crit Care Chron. 2022;10(42):22-24. [FREE Full text] [CrossRef]18]. Owing to this rate of increase, it is becoming difficult, if not impossible, to approach this vast amount of literature without having to make compromises in keywords and database selection [Marshall IJ, Wallace BC. Toward systematic review automation: a practical guide to using machine learning tools in research synthesis. Syst Rev. Jul 11, 2019;8(1):163. [FREE Full text] [CrossRef] [Medline]19]. Unfortunately, these concessions can jeopardize the quality of the research as well as decrease the possibility of gaining a comprehensive and exhaustive evidence base. To manage the potential fragmentation, an overinclusive approach to co-creation terms is needed in addition to an expanded systematic review process that includes an artificial intelligence (AI)–assisted selection process to manage the large volume of literature resulting from such an approach.

Objective

This study aimed to synthesize knowledge about co-creation across various fields through the development and application of an AI-assisted selection process. The ultimate goal of this database is to provide stakeholders interested in co-creation with relevant literature.

Overview

We developed a novel selection methodology using AI to include all relevant literature. To ensure the quality of our methodology, we based it on established frameworks such as the Cochrane Collaboration and PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) [Moher D, Liberati A, Tetzlaff J, Altman DG, PRISMA Group. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. Jul 21, 2009;6(7):e1000097. [FREE Full text] [CrossRef] [Medline]20]. Systematic reviews have traditionally used meticulously designed search strategies that use various combinations of precise terms to identify pertinent literature and reduce the need for extensive screening. However, the interchangeable use of co-creation terminology necessitated a broad search strategy to avoid the inadvertent exclusion of relevant literature. This broad search strategy resulted in a vast amount of literature.

To handle the search results, we used a set of selection criteria aiming to capture key aspects of co-creation. In addition, we used ASReview (Open Science Lab), an AI software [van de Schoot R, de Bruin J, Schram R, Zahedi P, de Boer J, Weijdema F, et al. An open source machine learning framework for efficient and transparent systematic reviews. Nat Mach Intell. Feb 01, 2021;3(2):125-133. [FREE Full text] [CrossRef]21], to assist a multidisciplinary team of researchers in efficiently excluding irrelevant literature during the selection process. This AI software allowed us to reduce the volume of papers requiring screening, whereas the researchers helped to prevent terminological bias and ensure a broad consensual conceptual understanding of co-creation. As the objective of this study was to identify and consolidate relevant literature about co-creation, only the conventional systematic review steps up to the selection process were followed [Moher D, Liberati A, Tetzlaff J, Altman DG, PRISMA Group. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. Jul 21, 2009;6(7):e1000097. [FREE Full text] [CrossRef] [Medline]20], and no protocol was registered.

Search Strategy

To ensure a comprehensive search strategy, we compiled an exhaustive list of keywords that could represent co-creation in a broad sense [Masterson D, Areskoug Josefsson K, Robert G, Nylander E, Kjellström S. Mapping definitions of co-production and co-design in health and social care: a systematic scoping review providing lessons for the future. Health Expect. Jun 2022;25(3):902-913. [FREE Full text] [CrossRef] [Medline]10]. The list was refined through rounds of discussions with a multidisciplinary group of co-creation researchers, resulting in the final selection of keywords. The final set of keywords in English included co-creat*, co-conception, co-production, public and patient involvement, public participation, participatory, experience-based design, co-design, user involvement, collaborative design, and citizen science. The team searched PubMed, CINAHL, and all 47 databases within ProQuest with tailored search strategies. Details on the search strategy and results are provided in

Multimedia Appendix 1

Search strategy and results.

DOCX File , 30 KB Multimedia Appendix 1.

All the references retrieved from the search were downloaded in Microsoft Excel format and merged into 1 document (duplicates were removed). Any article that did not contain an abstract was removed and archived for manual screening. The research team identified 10 relevant and 5 irrelevant articles to provide preliminary material to ASReview [Van de Schoot R, De Bruin J, Schram R, Zahedi P, De Boer J, Weijdema F, et al. ASReview: active learning for systematic reviews. Zenodo. Dec 20, 2021. URL: https://zenodo.org/record/5793275/export/hx [accessed 2022-12-01] 22] to begin its learning process [van de Schoot R, de Bruin J, Schram R, Zahedi P, de Boer J, Weijdema F, et al. An open source machine learning framework for efficient and transparent systematic reviews. Nat Mach Intell. Feb 01, 2021;3(2):125-133. [FREE Full text] [CrossRef]21]. Owing to the potential duplicates and to reduce the risk of overfitting, the full set of literature was randomized before the selection process began [Ying X. An overview of overfitting and its solutions. J Phys Conf Ser. 2019;1168(2):022022. [FREE Full text] [CrossRef]23].

Selection Process

A total of 14 researchers affiliated with the Health CASCADE Marie Skłodowska-Curie Innovative Training Network [Verloigne M, Altenburg T, Cardon G, Chinapaw M, Dall P, Deforche B, et al. Making co-creation a trustworthy methodology for closing the implementation gap between knowledge and action in health promotion: the Health CASCADE project. Perspect Public Health. Oct 07, 2022 [FREE Full text]24] were enlisted to conduct the selection process using ASReview [van de Schoot R, de Bruin J, Schram R, Zahedi P, de Boer J, Weijdema F, et al. An open source machine learning framework for efficient and transparent systematic reviews. Nat Mach Intell. Feb 01, 2021;3(2):125-133. [FREE Full text] [CrossRef]21]. The researchers were paired and performed double-blind screening on the same set of literature. However, it is important to note that because ASReview reacts to each decision of the researchers, the pairs may not have screened the same set of literature before reaching the stop rules (these rules are described below).

We used a multidisciplinary and AI-assisted approach to establish a unified concept of co-creation, despite the interchangeable use of terms in the literature. This allowed us to assemble a set of literature that comprehensively represented co-creation, regardless of the intended meaning of specific terms (Figure 1).

The selection criteria are presented in Textbox 1. The selection criteria included a new definition of co-creation that captured the essence of existing definitions and was inclusive of diverse fields while still accommodating the variation in terminology. Co-creation is defined as any act of collective creativity that involves a broad range of relevant and affected actors in creative problem-solving that aims to produce a desired outcome [Osborne SP, Strokosch K. It takes two to tango? Understanding the co-production of public services by integrating the services management and public administration perspectives. Br J Manag. Aug 07, 2013;24(S1):S31-S47. [FREE Full text] [CrossRef]25,Sanders EB, Stappers PJ. Co-creation and the new landscapes of design. CoDesign. Mar 2008;4(1):5-18. [FREE Full text] [CrossRef]26].

In the selection process, researchers screened records presented by ASReview until they reached the 2 stop rules: they had screened 10% (1440/14,393) of their packet of papers and they found 100 irrelevant papers in a row. Papers without an abstract were not included in the ASReview screening. Instead, they were evaluated manually by the researchers using the same criteria for selection [sofie. ASReview LAB class 101. ASReview. Nov 12, 2020. URL: https://asreview.nl/blog/asreview-class-101/ [accessed 2021-11-30] 27].

**Figure 1.** Screening with a unified concept of co-creation.

Textbox 1. Inclusion and exclusion criteria for the selection process.

Inclusion criteria

Includes at least 1 or more of the keywords in the title or abstract of the article.
On the basis of the abstract, the study adheres to the working definition of co-creation.
The title and abstract are both written in English.

Exclusion criteria

The title or abstract did not contain any of the keywords.
On the basis of the abstract, the study did not adhere to the working definition of co-creation.
The title or abstract is written in a language other than English.

Data Consolidation

During the selection process in ASReview, three possible outcomes were possible: (1) a paper was included, (2) a paper was excluded, or (3) the paper was not shown to the researcher (meaning ASReview did not show the reference to the researcher before the stop rule was reached). Therefore, during the screening step, each paper received a decision, creating 6 possible combinations. These combinations are depicted in Figure 2.

Figure 2 visualizes the decision support tree applied during the selection process. As our screening strategy prioritized overinclusion, to avoid excluding a relevant paper, papers that were included by at least 1 researcher were included in the database. Consequently, the following label numbers 1 (included/included), 2 (included/excluded), and 4 (included/not shown) were included in the database. To exclude a paper from the database, we required at least 2 researchers to exclude the paper or that ASReview did not show the paper twice owing to its irrelevance. Therefore, papers with 3 (excluded/excluded) or 6 (not shown/not shown) were excluded from the database.

Papers that were only excluded by 1 researcher, label 5 (excluded/not shown), were uploaded into ASReview and screened a second time by a researcher. All papers that were excluded again (label B) were then excluded from the database, and those that were included were added to the database (label A). All the papers not shown to the researcher in this second screening were excluded owing to their likely irrelevance (label C). The outputs from these screening steps were exported from ASReview, and the data were merged into 1 Microsoft Excel document.

**Figure 2.** Decision support tree for double screening of papers.

Quality Assurance of the Database

In this study, false positives (inclusion of an irrelevant paper) were less of a concern, as they could not endanger the integrity of the database, so our strategy prioritized reducing the number of false negatives (exclusion of a relevant paper). To achieve high accuracy with a low rate of false negatives, we implemented a strategy to guarantee the quality and accuracy of the database.

Checking for False Negatives

To assess the possibility of false negatives, we conducted an evaluation using 80 papers provided by Health CASCADE Marie Skłodowska-Curie Innovative Training Network, which met our inclusion criteria but were not included in the original data set used to train ASReview. We searched for these papers in our database and noted any paper that was not found as error. We then traced the sources of each paper back through each step of the methodology to determine at which point it was excluded.

Four possibilities of false negatives were considered: double exclusion by 2 researchers, exclusion by 1 researcher and ASReview did not show it to the second researcher, ASReview did not show it to any researcher, or a data management error. In addition, there was a possibility that a paper was missing from the database because it was not originally present in the result of the search, which was not considered a false negative. Any of these situations would have resulted in the absence of a relevant item in the database.

To calculate the false-negative rate, we summed the number of papers from the 4 possible sources of false negatives and divided it by the number of test papers minus the number of test papers that were absent from the original search.

Checking for False Positives

To evaluate the potential number of papers in the database that did not adhere to the screening selection criteria (false positives), 3 researchers double screened a random sample of 5% (688/13,760) of the final database. The aim was to identify any papers that should have been excluded. The estimation of the false-positive rate for the full database was calculated by determining the false-positive rate of the screened sample, which is the number of papers excluded divided by the total number of papers screened.

Analysis of the Database

To gain a better sense of the composition of the final database, we conducted a bibliometric analysis of the included literature. We used Rayyan (Qatar Computing Research Institute), a systematic review manager [Ouzzani M, Hammady H, Fedorowicz Z, Elmagarmid A. Rayyan-a web and mobile app for systematic reviews. Syst Rev. Dec 05, 2016;5(1):210. [FREE Full text] [CrossRef] [Medline]28], and VOSviewer (Centre for Science and Technology Studies), a tool for constructing and visualizing bibliometric networks [van Eck NJ, Waltman L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics. Aug 2010;84(2):523-538. [FREE Full text] [CrossRef] [Medline]29], to analyze the frequency of terms in titles and abstracts as well as their co-occurrence. We also used VOSviewer to analyze coauthorship links and authors’ citations to explore how co-creation research is distributed and shared over time. We performed an analysis of the source landscape to identify the dominant fields represented in the database. This involved mapping the source journals and examining their respective disciplines. The full method for replicating this analysis in Rayyan and VOSviewer is presented in

Multimedia Appendix 2

Rayyan and VOSviewer analysis.

DOCX File , 31 KB Multimedia Appendix 2.

Study Selection

The adapted PRISMA flowchart is shown in Figure 3, and a filled-in PRISMA checklist is provided in

Multimedia Appendix 3

PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) 2020 checklist.

DOCX File , 40 KB Multimedia Appendix 3. On the basis of the initial search and after duplicates were removed, 118,802 papers were identified. As shown in , there were 2 parallel screening processes for this literature: one for screening papers with titles and abstracts in ASReview (n=115,144) and another for manual screening of papers without an abstract (n=3658). During the screening assisted by ASReview (left side of ), a total of 6277 papers were excluded. During the screening of papers without abstracts (right side of ), these papers were compared manually to check for duplication or presence within the set of papers included or excluded during the ASReview screening, excluding 2945 papers. Then, the nonduplicate papers were screened manually, excluding an additional 638 papers.

After these screening stages, 6915 papers were excluded by the researchers, and 90,872 papers were excluded by ASReview based on irrelevancy. During our data cleaning stage, 4309 duplicates were removed and 1 publication was retracted, resulting in a total of 13,760 papers included in the database. Finally, following the quality check, 22 additional papers were included, 140 more papers were excluded, and 141 duplicates were removed. These steps resulted in a final database that contained 13,501 papers.

**Figure 3.** An adapted PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flowchart of the full methodology. The start indicates the citations that were not shown by ASReview to the researchers.

Database Quality

We evaluated the accuracy of the methodology by searching for 80 papers that adhered to our selection criteria but were not included in the original data set used to train ASReview. Of the 80 papers, 58 (72%) papers were found in the database and 22 (28%) were not included in the database. Further analysis of the 22 missing papers revealed that ASReview did not show 6 papers to any researcher during the AI-assisted selection process, whereas 16 papers were not present in the original search results. As a result, the false-negative rate was determined to be 9% (6/64), indicating that 91% (58/64) of the relevant literature was captured by the methodology.

The false-positive rate for the entire database was calculated by determining the false-positive rate of the screened sample, which is the number of papers excluded (n=140) divided by the total number of papers screened (n=688), resulting in a false-positive rate of 20.3%.

Terminology

The most frequent terms in the title or abstract of a paper in the database were participatory (n=8101) and different variations of the word co-creation (eg, c-ocreation, co-create, co-creating, and co-creators; n=2896). The other terms that appeared in the database were co-production (n=856), user involvement (n=779), co-design (n=760), and public participation (n=463). Finally, the least frequent terms were citizen science (n=184), collaborative design (n=113), public and patient involvement (n=64), and experience-based design (n=11). Co-conception was not present in the database (n=0).

The VOSviewer co-occurrence analysis of the keywords in the title and abstracts of the papers in the database is visualized as a network map in Figure 4. This network map included 93 keywords linked to each other 2818 times. VOSviewer also grouped the keywords into 5 clusters [van Eck NJ, Waltman L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics. Aug 2010;84(2):523-538. [FREE Full text] [CrossRef] [Medline]29]. Cluster 1 (in red) contained all forms of co-design, co-production, patient engagement, participatory design, and public involvement. Cluster 2 (in green) contained different forms of community-based participatory research (CBPR), participatory research, and qualitative and quantitative research. Cluster 3 (in blue) contained forms of community, public, and stakeholder development; engagement; participation; and citizen science and citizen participation. Cluster 4 (in yellow) contained forms of co-creation, innovation, and value creation. Cluster 5 (in purple) contained forms of participatory research, participatory action research, and collaborative research. In addition, among these clusters, the main connecting terms were the different forms of engagement, such as user involvement; public or patient involvement; and the less predominant methodologies, such as co-design and co-production. Finally, there were a few methodological terms that linked the clusters, such as the use of theoretical frameworks and models, process planning, mixed methods, collective action, and consultation. A summary of this analysis is provided in

Multimedia Appendix 4

VOSviewer results.

DOCX File , 31 KB Multimedia Appendix 4.

The VOSviewer co-occurrence analysis of the keywords in the title and abstracts of the papers in the database is visualized as a network map in Figure 5. Automatically adapted by the software for optimal relevancy [van Eck NJ, Waltman L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics. Aug 2010;84(2):523-538. [FREE Full text] [CrossRef] [Medline]29], the date ranges from 2010 to 2022. The clusters that contained the most recent publications, ranging from 2018 to 2022, were clusters 1, 4, and 5. The most frequent terms in these clusters were co-design, co-production, public involvement, co-creation, innovation, participatory action research, and collaborative research. The older clusters, clusters 2 and 3, ranged from as far back as 2011, and the most recent publication occurred around 2017. Clusters 2 and 3 contained the different forms of participatory research; CBPR; qualitative and quantitative research; community, public, and stakeholder development; engagement; participation; and citizen science. A summary of this analysis is provided in

Multimedia Appendix 4

VOSviewer results.

DOCX File , 31 KB Multimedia Appendix 4.

**Figure 4.** Keyword co-occurrence across the title and abstracts of the papers from January 2010 to November 2022. Each link between 2 keywords represents a co-occurrence. The size of the keyword bubble represents its importance in the number of co-occurrences. Five colors represent 5 clusters. cbpr: community-based participatory research.

**Figure 5.** Keyword co-occurrence across the title and abstracts of the papers in the database from January 2010 to November 2022. Each link between 2 keywords represents a co-occurrence. The size of the keyword bubble represents its importance in the number of co-occurrences. The color of the link represents the period where it occurs, and the color of the bubble represents the average of the co-occurrence’s periods. (date range on the displayed scale is automatically adapted by the software based on relevancy). cbpr: community-based participatory research.

Bibliometric Analysis

The VOSviewer analysis of coauthorship is visualized as a network map in Figure 6. The date range was optimized for relevancy by the software [van Eck NJ, Waltman L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics. Aug 2010;84(2):523-538. [FREE Full text] [CrossRef] [Medline]29], spanning from 2010 to 2022. In this network map, there were 1544 links between authors, with an overall total link strength of 3740. We aimed to determine the extent of coauthorship of each author by assessing their total link strength, which represents the number of publications that the author has coauthored. Among the authors, there were 15 authors with a total link strength of 64; hence, we considered all authors with a total link strength of >64 as having the most coauthorship. The 4 authors with the most coauthorship (highest total link strength) were Nina Wallerstein (134/3740, 3.58%), Bonnie Duran (80/3740, 2.14%), Brenda Happell (79/3470, 2.11%), and Julia Bocking (74/3470, 1.98%).

This network map included 18 clusters of 819 authors, with only 2 clusters forming around the 4 authors with the most coauthorships. One cluster included Brenda Happell (a nursing and midwifery researcher) and Julia Bocking (a nursing researcher), with publications from 2015 to 2020. The other cluster included Nina Wallerstein (a community-based participatory and participatory health researcher) and Bonnie Duran (a community-based participatory researcher), with publications from 2011 to 2020. A summary of this analysis is provided in Table 1 and

Multimedia Appendix 4

VOSviewer results.

DOCX File , 31 KB Multimedia Appendix 4.

The VOSviewer analysis of authorship citations is visualized as a network map in Figure 7. Automatically adapted by the software for optimal relevancy [van Eck NJ, Waltman L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics. Aug 2010;84(2):523-538. [FREE Full text] [CrossRef] [Medline]29], the date ranges from 2010 to 2022. This network map included 14 clusters of 389 authors, with a combined total of 121,037 citations among the authors. In this analysis, we were interested in the total citations of each author. The authors with <2500 citations were not included in the list of most-cited authors. On the basis of this criterion, there were 8 authors with the most citations, including Christian Grönroos (3550/121,037, 2.93%), Nina Wallerstein (3083/121,037, 2.55%), Mark S Reed (3045/121,037, 2.52%), Pennie Frow (2955/121,037, 2.44%), Bonnie Duran (2840/121,037, 2.35%), Kaj Storbacka (2792/121,037, 2.31%), Paul P Maglio (2652/121,037, 2.19%), and Meredith Minkler (2538/121,037, 2.09%). These most-cited authors were only grouped into 3 clusters. The first cluster was formed around Christian Grönroos (marketing and economics researcher), Pennie Frow (marketing researcher), Kaj Storbacka (economics researcher), and Paul P Maglio (management researcher). Mark S Reed (environmental governance researcher) was also present in that cluster, but he was on the periphery of it, away from the other authors. The second cluster included Nina Wallerstein and Bonnie Duran, who were both community-based participatory researchers. The third cluster only included Meredith Minkler, a participatory and public health researcher. Meredith’s cluster was very close to Nina’s and Bonnie’s cluster, whereas Christian’s cluster was on the other end of the network map. A summary of this analysis is provided in Table 2 and

Multimedia Appendix 4

VOSviewer results.

DOCX File , 31 KB Multimedia Appendix 4.

The VOSviewer analysis of the source journals, called source landscape, is visualized as a network map in Figure 8. This network map included 8 clusters of 188 journals, with a total of 5896 documents. In this analysis, we were interested in the total number of documents per journal, as it indicated the prevailing journals in the database. The journals with >100 documents were considered the predominant journals. On the basis of this criterion, there were 7 top journals, including Health Expectations (354/5896, 6%); Progress in Community Health Partnerships: Research, Education, and Action (317/5896, 5.38%); the International Journal of Integrated Care (167/5896, 2.83%); Sustainability (160/5896, 2.71%); Health Promotion Practice (149/5896, 2.53%); the International Journal of Environmental Research and Public Health (139/5896, 2.36%); BMJ Open (119/5896, 2.02%); and BMC Health Services Research (113/5896, 1.92%). These predominate journals were grouped into 4 clusters. The first cluster (in green) included Health Expectations and BMJ Open as well as other less dominant journals about different forms of health research. The second cluster (in red) included Progress in Community Health Partnerships: Research, Education, and Action; Health Promotion Practice; and the International Journal of Environmental Research and Public Health. This cluster also included different public health journals. The third cluster (in purple) included the International Journal of Integrated Care and BMC Health Services Research as well as less dominant journals about health services, research, and policy. The fourth cluster (in blue) included Sustainability and other less dominant journals about social and environmental sciences and action research. A summary of this analysis is provided in Table 3 and

Multimedia Appendix 4

VOSviewer results.

DOCX File , 31 KB Multimedia Appendix 4.

**Figure 6.** Mapping and clustering of coauthorship of the papers in the database from January 2010 to November 2022. The size of the author’s bubble represents its importance in the number of coauthorships. The color represents the average period where the coauthorships occur.

Table 1. Authors and field of research (based on ).

Authors	Field	Total link strength	Total links (co-authors), n	Average publication year
Nina Wallerstein	Community-based participatory research and participatory health research	134	50	2015
Bonnie Duran	Community-based participatory research	80	19	2017
Brenda Happell	Nursing and midwifery research	79	19	2015
Julia Bocking	Nursing research	74	18	2019

**Figure 7.** Mapping and clustering of author citations from January 2010 to November 2022. The author’s bubble’s size represents its importance in the number of citations. Each link between 2 authors represents a citation. The color of the link represents the period where the citation occur, and the bubble 1 represents the average periods of the author’s citations.

Table 2. Author citations and field of research (based on ).

Most-cited authors	Field	Citations (n=121,037), n (%)	Average publication year
Christian Grönroos	Marketing and economics research	3550 (2.93)	2015
Nina Wallerstein	Community-based participatory research	3083 (2.55)	2015
Mark S Reed	Environmental governance research	3045 (2.52)	2013
Pennie Frow	Marketing research	2955 (2.44)	2012
Bonnie Duran	Community-based participatory research	2840 (2.35)	2017
Kaj Storbacka	Economics research	2792 (2.31)	2010
Paul P Maglio	Management research	2652 (2.19)	2011
Meredith Minkler	Participatory and public health research	2538 (2.09)	2011

**Figure 8.** Source landscape mapping from January 1970 to November 2022. Each link between 2 journals represents a citation between 2 journals’ documents. The journal bubble size represents its importance in the number of citations. On 8 clusters in the analysis, 5 clusters are visible through 5 colors.

Table 3. Characterizing the source landscape analysis in .

Journal (source)	Papers (n=5896), n (%)	Citations (n=154,945), n (%)	Average publication year	Cluster number (color)
Health Expectations	354 (6)	9894 (6.39)	2017	2 (green)
Progress in Community Health Partnerships: Research, Education, and Action	317 (5.38)	3094 (2)	2014	1 (red)
International Journal of Integrated Care	167 (2.83)	120 (0.08)	2018	5 (purple)
Sustainability	160 (2.71)	1851 (1.19)	2020	3 (blue)
Health Promotion Practice	149 (2.53)	3173 (2.05)	2014	1 (red)
International Journal of Environmental Research and Public Health	139 (2.36)	1110 (0.72)	2020	1 (red)
BMJ Open	119 (2.02)	931 (0.6)	2019	2 (green)
BMC Health Services Research	113 (1.92)	1900 (1.23)	2018	5 (purple)