@Article{info:doi/10.2196/62754, author="Chevalier, Aline and Dosso, Cheyenne", title="The Influence of Medical Expertise and Information Search Skills on Medical Information Searching: Comparative Analysis From a Free Data Set", journal="JMIR Form Res", year="2025", month="Apr", day="17", volume="9", pages="e62754", keywords="information searching", keywords="credibility", keywords="internet", keywords="medicine", keywords="information search skills", abstract="Background: Nowadays, the internet has become the primary source of information for physicians seeking answers to medical questions about their patients before consulting colleagues. However, many websites provide low-quality, unreliable information that lacks scientific validation. Therefore, physicians must develop strong information search skills to locate relevant, accurate, and evidence-based content. However, previous studies have shown that physicians often have poor search skills and struggle to find information on the web, which may have detrimental consequences for patient care. Objective: This study aims to determine how medical students and residents searched for medical information on the internet, the quality of the web resources they used (including their nature and credibility), and how they evaluated the reliability of these resources and the answers they provided. Given the importance of domain knowledge (in this case, medicine) and information search skills in the search process, we compared the search behaviors of medical students and residents with those of computer science students. While medical students and residents possess greater medical-related knowledge, computer science students have stronger information search skills. Methods: A total of 20 students participated in this study: 10 medical students and residents, and 10 computer science students. Data were extracted from a freely accessible data set in accordance with FAIR (Findable, Accessible, Interoperable, and Reusable) principles. All participants searched for medical information online to make a diagnosis, select a treatment, and enhance their knowledge of a medical condition---3 primary activities they commonly perform. We analyzed search performance metrics, including search time, the use of medical-related keywords, and the accuracy of the information found, as well as the nature and credibility of web resources used by medical students and residents compared with computer science students. Results: Medical students and residents provided more accurate answers than computer science students without requiring additional time. Their medical expertise also enabled them to better assess the reliability of resources and select high-quality web sources, primarily from hospital websites. However, it is noteworthy that they made limited use of evidence-based tools such as PubMed. Conclusions: Although medical students and residents generally outperformed computer science students, they did not frequently use evidence-based tools. As previously observed, they may avoid databases due to the risk of encountering too many irrelevant articles and difficulties in applying appropriate filters to locate relevant information. Nevertheless, clinical and practical evidence-based medicine plays a crucial role in updating physicians' knowledge, improving patient care, and enhancing physician-patient relationships. Therefore, information search skills should be an integral part of medical education and continuing professional development for physicians. ", doi="10.2196/62754", url="https://formative.jmir.org/2025/1/e62754" } @Article{info:doi/10.2196/67361, author="Amoozegar, B. Jacqueline and Williams, Peyton and Giombi, C. Kristen and Richardson, Courtney and Shenkar, Ella and Watkins, L. Rebecca and O'Donoghue, C. Amie and Sullivan, W. Helen", title="Consumer Engagement With Risk Information on Prescription Drug Social Media Pages: Findings From In-Depth Interviews", journal="J Med Internet Res", year="2025", month="Mar", day="25", volume="27", pages="e67361", keywords="social media", keywords="prescription drugs", keywords="risk information", keywords="safety information", keywords="Facebook", keywords="Instagram", keywords="prescription", keywords="risk", keywords="information", keywords="safety", keywords="interview", keywords="consumer engagement", keywords="digital", keywords="drug promotion", keywords="user experience", keywords="promotion", abstract="Background: The volume of digital drug promotion has grown over time, and social media has become a source of information about prescription drugs for many consumers. Pharmaceutical companies currently present risk information about prescription drugs they promote in a variety of ways within and across social media platforms. There is scarce research on consumers' interactions with prescription drug promotion on social media, particularly on which features may facilitate or inhibit consumers' ability to find, review, and comprehend drug information. This is concerning because it is critical for consumers to know and weigh drug benefits and risks to be able to make informed decisions regarding medical treatment. Objective: We aimed to develop an understanding of the user interface (UI) and user experience (UX) of social media pages and posts created by pharmaceutical companies to promote drugs and how UI or UX design features impact consumers' interactions with drug information. Methods: We conducted in-person interviews with 54 consumers segmented into groups by device type (laptop or mobile phone), social media platform (Facebook or Instagram), and age. Interviewers asked participants to navigate to and review a series of 4 pages and 3 posts on their assigned device and platform. Interviewers encouraged participants to ``think aloud,'' as they interacted with the stimuli during a brief observation period. Following each observation period, participants were asked probing questions. An analyst reviewed video recordings of the observation periods to abstract quantitative interaction data on whether a participant clicked on or viewed risk information at each location it appeared on each page. Participants' responses were organized in a metamatrix, which we used to conduct thematic analysis. Results: Observational data revealed that 59\% of participants using Facebook and 70\% of participants using Instagram viewed risk information in at least 1 possible location on average across all pages tested during the observation period. There was not a single location across the Facebook pages that participants commonly clicked on to view risk information. However, a video with scrolling risk information attracted more views than other features. On Instagram, at least half of the participants consistently clicked on the highlighted story with risk information across the pages. Although thematic analysis showed that most participants were able to identify the official pages and risk information for each drug, auto-scrolling text and text size posed barriers to identification and comprehensive review for some participants. Participants generally found it more difficult to identify the drugs' indications than risks. Participants using Instagram more frequently reported challenges identifying risks and indications compared to those using Facebook. Conclusions: UI or UX design features can facilitate or pose barriers to users' identification, review, and comprehension of the risk information provided on prescription drugs' social media pages and posts. ", doi="10.2196/67361", url="https://www.jmir.org/2025/1/e67361" } @Article{info:doi/10.2196/67658, author="Tistad, Malin and Hultman, Lill and Wohlin Wottrich, Annica and von Koch, Lena", title="The Lived Experience of Participating in Online Peer-To-Peer Groups After Acquired Brain Injury: Phenomenological Study", journal="J Med Internet Res", year="2025", month="Mar", day="25", volume="27", pages="e67658", keywords="compassion", keywords="experiential knowledge", keywords="fatigue", keywords="self-compassion", keywords="stroke", keywords="social media", keywords="meaning", keywords="interview", keywords="normalization", abstract="Background: Stroke and other acquired brain injuries (ABIs) can present challenging experiences for individuals, both in recovery of functions affected by visible or invisible impairments and in learning to live with the new situation. Research has shown that sharing experiences face-to-face in peer groups can be beneficial during recovery. However, there is limited knowledge about the lived experiences of people with ABI who participate in online peer-to-peer groups. Objective: The aim of our study was to explore the lived experiences of participating in online peer-to-peer groups for people with ABI, where participants themselves set the agenda. Methods: Members of 2 Facebook groups (FBGs) for people with ABI were invited to participate in this study, and 20 individuals were included (14 women and 6 men; age range 24-74 years). One FBG focused on stroke and the other on fatigue caused by ABI. One group was private, and the other group was public. Data were collected through semistructured interviews, in which participants were encouraged to describe their experiences of engaging in FBGs in detail. The interviews were conducted over telephone or Zoom and digitally recorded. The audio recordings were then transcribed verbatim, resulting in 224 pages of text, and analyzed using the empirical phenomenological psychological method. Results: The analysis presented a common meaning structure with 1 main characteristic that is, ``validating self,'' common for all 20 participants, and 3 subcharacteristics, that is, ``learning---having one's own experiences confirmed,'' ``adjusting self---building competence and self-compassion,'' and ``supporting others---becoming a valued lived-experience expert/authority.'' Together, the subcharacteristics reflected a process of validating self from newcomer to lived-experience expert or authority. In this process, members of FBGs moved from being newcomers with pronounced needs for support and to learn and to have their experiences confirmed by others with similar experiences. Thus, participants were building competence and developing self-compassion. Gradually, they assumed the role of advisors, mentors, or coaches, acknowledging their experiences and competence as valuable to others, thereby validating themselves as compassionate lived-experience experts or authorities in supporting others. Conclusions: Participation in online peer-to-peer groups can offer unique opportunities for individuals with ABI to validate self through processes that involve learning, developing self-compassion and compassion for others, and offering support to others with similar experiences. Given that rehabilitation after an ABI is often of limited duration and that positive experiences can be achieved over time through involvement in digital peer-to-peer support, health care professionals should assist patients by providing information and directing them to digital networks for people with ABI. However, when recommending the use of online peer-to-peer support, impairments and insufficient digital competence that may complicate or prevent the use of social media should be assessed and support provided when relevant. ", doi="10.2196/67658", url="https://www.jmir.org/2025/1/e67658", url="http://www.ncbi.nlm.nih.gov/pubmed/40131323" } @Article{info:doi/10.2196/67677, author="Hou, Yu and Bishop, R. Jeffrey and Liu, Hongfang and Zhang, Rui", title="Improving Dietary Supplement Information Retrieval: Development of a Retrieval-Augmented Generation System With Large Language Models", journal="J Med Internet Res", year="2025", month="Mar", day="19", volume="27", pages="e67677", keywords="dietary supplements", keywords="knowledge representation", keywords="knowledge graph", keywords="retrieval-augmented generation", keywords="large language model", keywords="user interface", abstract="Background: Dietary supplements (DSs) are widely used to improve health and nutrition, but challenges related to misinformation, safety, and efficacy persist due to less stringent regulations compared with pharmaceuticals. Accurate and reliable DS information is critical for both consumers and health care providers to make informed decisions. Objective: This study aimed to enhance DS-related question answering by integrating an advanced retrieval-augmented generation (RAG) system with the integrated Dietary Supplement Knowledgebase 2.0 (iDISK2.0), a dietary supplement knowledge base, to improve accuracy and reliability. Methods: We developed iDISK2.0 by integrating updated data from authoritative sources, including the Natural Medicines Comprehensive Database, the Memorial Sloan Kettering Cancer Center database, Dietary Supplement Label Database, and Licensed Natural Health Products Database, and applied advanced data cleaning and standardization techniques to reduce noise. The RAG system combined the retrieval power of a biomedical knowledge graph with the generative capabilities of large language models (LLMs) to address limitations of stand-alone LLMs, such as hallucination. The system retrieves contextually relevant subgraphs from iDISK2.0 based on user queries, enabling accurate and evidence-based responses through a user-friendly interface. We evaluated the system using true-or-false and multiple-choice questions derived from the Memorial Sloan Kettering Cancer Center database and compared its performance with stand-alone LLMs. Results: iDISK2.0 integrates 174,317 entities across 7 categories, including 8091 dietary supplement ingredients; 163,806 dietary supplement products; 786 diseases; and 625 drugs, along with 6 types of relationships. The RAG system achieved an accuracy of 99\% (990/1000) for true-or-false questions on DS effectiveness and 95\% (948/100) for multiple-choice questions on DS-drug interactions, substantially outperforming stand-alone LLMs like GPT-4o (OpenAI), which scored 62\% (618/1000) and 52\% (517/1000) on these respective tasks. The user interface enabled efficient interaction, supporting free-form text input and providing accurate responses. Integration strategies minimized data noise, ensuring access to up-to-date, DS-related information. Conclusions: By integrating a robust knowledge graph with RAG and LLM technologies, iDISK2.0 addresses the critical limitations of stand-alone LLMs in DS information retrieval. This study highlights the importance of combining structured data with advanced artificial intelligence methods to improve accuracy and reduce misinformation in health care applications. Future work includes extending the framework to broader biomedical domains and improving evaluation with real-world, open-ended queries. ", doi="10.2196/67677", url="https://www.jmir.org/2025/1/e67677" } @Article{info:doi/10.2196/66279, author="{\vS}uvalov, Hendrik and Lepson, Mihkel and Kukk, Veronika and Malk, Maria and Ilves, Neeme and Kuulmets, Hele-Andra and Kolde, Raivo", title="Using Synthetic Health Care Data to Leverage Large Language Models for Named Entity Recognition: Development and Validation Study", journal="J Med Internet Res", year="2025", month="Mar", day="18", volume="27", pages="e66279", keywords="natural language processing", keywords="named entity recognition", keywords="large language model", keywords="synthetic data", keywords="LLM", keywords="NLP", keywords="machine learning", keywords="artificial intelligence", keywords="language model", keywords="NER", keywords="medical entity", keywords="Estonian", keywords="health care data", keywords="annotated data", keywords="data annotation", keywords="clinical decision support", keywords="data mining", abstract="Background: Named entity recognition (NER) plays a vital role in extracting critical medical entities from health care records, facilitating applications such as clinical decision support and data mining. Developing robust NER models for low-resource languages, such as Estonian, remains a challenge due to the scarcity of annotated data and domain-specific pretrained models. Large language models (LLMs) have proven to be promising in understanding text from any language or domain. Objective: This study addresses the development of medical NER models for low-resource languages, specifically Estonian. We propose a novel approach by generating synthetic health care data and using LLMs to annotate them. These synthetic data are then used to train a high-performing NER model, which is applied to real-world medical texts, preserving patient data privacy. Methods: Our approach to overcoming the shortage of annotated Estonian health care texts involves a three-step pipeline: (1) synthetic health care data are generated using a locally trained GPT-2 model on Estonian medical records, (2) the synthetic data are annotated with LLMs, specifically GPT-3.5-Turbo and GPT-4, and (3) the annotated synthetic data are then used to fine-tune an NER model, which is later tested on real-world medical data. This paper compares the performance of different prompts; assesses the impact of GPT-3.5-Turbo, GPT-4, and a local LLM; and explores the relationship between the amount of annotated synthetic data and model performance. Results: The proposed methodology demonstrates significant potential in extracting named entities from real-world medical texts. Our top-performing setup achieved an F1-score of 0.69 for drug extraction and 0.38 for procedure extraction. These results indicate a strong performance in recognizing certain entity types while highlighting the complexity of extracting procedures. Conclusions: This paper demonstrates a successful approach to leveraging LLMs for training NER models using synthetic data, effectively preserving patient privacy. By avoiding reliance on human-annotated data, our method shows promise in developing models for low-resource languages, such as Estonian. Future work will focus on refining the synthetic data generation and expanding the method's applicability to other domains and languages. ", doi="10.2196/66279", url="https://www.jmir.org/2025/1/e66279" } @Article{info:doi/10.2196/67033, author="Hao, Jie and Chen, Zhenli and Peng, Qinglong and Zhao, Liang and Zhao, Wanqing and Cong, Shan and Li, Junlian and Li, Jiao and Qian, Qing and Sun, Haixia", title="Prompt Framework for Extracting Scale-Related Knowledge Entities from Chinese Medical Literature: Development and Evaluation Study", journal="J Med Internet Res", year="2025", month="Mar", day="18", volume="27", pages="e67033", keywords="prompt engineering", keywords="named entity recognition", keywords="in-context learning", keywords="large language model", keywords="Chinese medical literature", keywords="measurement-based care", keywords="framework", keywords="prompt", keywords="prompt framework", keywords="scale", keywords="China", keywords="medical literature", keywords="MBC", keywords="LLM", keywords="MedScaleNER", keywords="retrieval", keywords="information retrieval", keywords="dataset", keywords="artificial intelligence", keywords="AI", abstract="Background: Measurement-based care improves patient outcomes by using standardized scales, but its widespread adoption is hindered by the lack of accessible and structured knowledge, particularly in unstructured Chinese medical literature. Extracting scale-related knowledge entities from these texts is challenging due to limited annotated data. While large language models (LLMs) show promise in named entity recognition (NER), specialized prompting strategies are needed to accurately recognize medical scale-related entities, especially in low-resource settings. Objective: This study aims to develop and evaluate MedScaleNER, a task-oriented prompt framework designed to optimize LLM performance in recognizing medical scale-related entities from Chinese medical literature. Methods: MedScaleNER incorporates demonstration retrieval within in-context learning, chain-of-thought prompting, and self-verification strategies to improve performance. The framework dynamically retrieves optimal examples using a k-nearest neighbors approach and decomposes the NER task into two subtasks: entity type identification and entity labeling. Self-verification ensures the reliability of the final output. A dataset of manually annotated Chinese medical journal papers was constructed, focusing on three key entity types: scale names, measurement concepts, and measurement items. Experiments were conducted by varying the number of examples and the proportion of training data to evaluate performance in low-resource settings. Additionally, MedScaleNER's performance was compared with locally fine-tuned models. Results: The CMedS-NER (Chinese Medical Scale Corpus for Named Entity Recognition) dataset, containing 720 papers with 27,499 manually annotated scale-related knowledge entities, was used for evaluation. Initial experiments identified GLM-4-0520 as the best-performing LLM among six tested models. When applied with GLM-4-0520, MedScaleNER significantly improved NER performance for scale-related entities, achieving a macro F1-score of 59.64\% in an exact string match with the full training dataset. The highest performance was achieved with 20-shot demonstrations. Under low-resource scenarios (eg, 1\% of the training data), MedScaleNER outperformed all tested locally fine-tuned models. Ablation studies highlighted the importance of demonstration retrieval and self-verification in improving model reliability. Error analysis revealed four main types of mistakes: identification errors, type errors, boundary errors, and missing entities, indicating areas for further improvement. Conclusions: MedScaleNER advances the application of LLMs and prompts engineering for specialized NER tasks in Chinese medical literature. By addressing the challenges of unstructured texts and limited annotated data, MedScaleNER's adaptability to various biomedical contexts supports more efficient and reliable knowledge extraction, contributing to broader measurement-based care implementation and improved clinical and research outcomes. ", doi="10.2196/67033", url="https://www.jmir.org/2025/1/e67033" } @Article{info:doi/10.2196/66683, author="Li, Hongmin and Li, Dongxu and Zhai, Min and Lin, Li and Cao, ZhiHeng", title="Associations Among Online Health Information Seeking Behavior, Online Health Information Perception, and Health Service Utilization: Cross-Sectional Study", journal="J Med Internet Res", year="2025", month="Mar", day="14", volume="27", pages="e66683", keywords="online health information seeking (OHIS)", keywords="online health information perception (OHIP)", keywords="mediating effect", keywords="health service utilization", keywords="health information", keywords="health perception", keywords="data", keywords="China", keywords="Chinese General Social Survey (CGSS)", keywords="database", keywords="medical information", keywords="survey", abstract="Background: Seeking online health information can empower individuals to better understand their health concerns, facilitating their ability to manage their health conditions more effectively. It has the potential to change the likelihood and frequency of health service usage. Although existing literature has demonstrated the prevalence of seeking online health information among different populations, the factors affecting online health information perception and discussions on the associations between seeking online health information and health service utilization are limited. Objective: We analyzed the associations between online health information seeking behavior and health service utilization, as well as the online health information perception delivery mechanism. Methods: We analyzed data from the Chinese General Social Survey, the first national representative survey conducted in mainland China. The independent variable was the online health information seeking behavior. The outcome variable was health service utilization by the respondents, and online health information perception was selected as the mediating variable in this analysis. Factor analysis was conducted to obtain online health information perception. Multiple regressions were performed to investigate the effect of online health information seeking behavior on physician visits. Bootstrap methods were conducted to test the mediation effects of online health information perception. Results: This analysis included 1475 cases. Among the participants, 939 (63.66\%) sought online health information in the last 12 months. The mean age of the respondents was 46.72 (SD 15.86) years, and 794 (53.83\%) were females. After controlling for other variables, individuals with online health information seeking behaviors showed 0.289 times more outpatient visits (P=.003), 0.131 times more traditional Chinese medicine outpatient visits (P=.01), and 0.158 times more Western medicine outpatient visits (P=.007) over the past year compared to those who did not seek health information online. Additionally, multiple regression analyses revealed statistically significant effects of gender, age, and health status on physician visits. The total effect revealed that seeking online health information significantly influenced the total physician visits ($\beta$=0.290; P=.003), indicating a certain correlation between online health information seeking behavior and physician visits. Seeking online health information had a significant positive impact on the perception ($\beta$=0.265; P<.001). The mediation effects analysis identified that online health information perception led to a significant increase in physician visits with the increase in the online health information seeking behaviors ($\beta$=0.232; P=.02). Conclusions: The online health information perception of an individual influences the effect online health information seeking has on the frequency of physician visits. The online health information seeking behavior impacts outpatient service utilization both directly and indirectly through online health information perception and significantly increases the frequency of clinic visits after controlling for other variables. Interventions can be explored to improve the health utilization of residents by increasing their online health information perception. ", doi="10.2196/66683", url="https://www.jmir.org/2025/1/e66683" } @Article{info:doi/10.2196/51804, author="Portela, Diana and Freitas, Alberto and Costa, El{\'i}sio and Giovannini, Mattia and Bousquet, Jean and Almeida Fonseca, Jo{\~a}o and Sousa-Pinto, Bernardo", title="Impact of Demographic and Clinical Subgroups in Google Trends Data: Infodemiology Case Study on Asthma Hospitalizations", journal="J Med Internet Res", year="2025", month="Mar", day="10", volume="27", pages="e51804", keywords="infodemiology", keywords="asthma", keywords="administrative databases", keywords="multimorbidity", keywords="co-morbidity", keywords="respiratory", keywords="pulmonary", keywords="Google Trends", keywords="correlation", keywords="hospitalization", keywords="admissions", keywords="autoregressive", keywords="information seeking", keywords="searching", keywords="searches", keywords="forecasting", abstract="Background: Google Trends (GT) data have shown promising results as a complementary tool to classical surveillance approaches. However, GT data are not necessarily provided by a representative sample of patients and may be skewed toward demographic and clinical groups that are more likely to use the internet to search for their health. Objective: In this study, we aimed to assess whether GT-based models perform differently in distinct population subgroups. To assess that, we analyzed a case study on asthma hospitalizations. Methods: We analyzed all hospitalizations with a main diagnosis of asthma occurring in 3 different countries (Portugal, Spain, and Brazil) for a period of approximately 5 years (January 1, 2012-December 17, 2016). Data on web-based searches on common cold for the same countries and time period were retrieved from GT. We estimated the correlation between GT data and the weekly occurrence of asthma hospitalizations (considering separate asthma admissions data according to patients' age, sex, ethnicity, and presence of comorbidities). In addition, we built autoregressive models to forecast the weekly number of asthma hospitalizations (for the different aforementioned subgroups) for a period of 1 year (June 2015-June 2016) based on admissions and GT data from the 3 previous years. Results: Overall, correlation coefficients between GT on the pseudo-influenza syndrome topic and asthma hospitalizations ranged between 0.33 (in Portugal for admissions with at least one Charlson comorbidity group) and 0.86 (for admissions in women and in White people in Brazil). In the 3 assessed countries, forecasted hospitalizations for 2015-2016 correlated more strongly with observed admissions of older versus younger individuals (Portugal: Spearman $\rho$=0.70 vs $\rho$=0.56; Spain: $\rho$=0.88 vs $\rho$=0.76; Brazil: $\rho$=0.83 vs $\rho$=0.82). In Portugal and Spain, forecasted hospitalizations had a stronger correlation with admissions occurring for women than men (Portugal: $\rho$=0.75 vs $\rho$=0.52; Spain: $\rho$=0.83 vs $\rho$=0.51). In Brazil, stronger correlations were observed for admissions of White than of Black or Brown individuals ($\rho$=0.92 vs $\rho$=0.87). In Portugal, stronger correlations were observed for admissions of individuals without any comorbidity compared with admissions of individuals with comorbidities ($\rho$=0.68 vs $\rho$=0.66). Conclusions: We observed that the models based on GT data may perform differently in demographic and clinical subgroups of participants, possibly reflecting differences in the composition of internet users' health-seeking behaviors. ", doi="10.2196/51804", url="https://www.jmir.org/2025/1/e51804" } @Article{info:doi/10.2196/57881, author="Muscat, M. Danielle and Hinton, Rachael and Kuruvilla, Shyama and Nutbeam, Don", title="``Your Life, Your Health: Tips and Information for Health and Well-Being'': Development of a World Health Organization Digital Resource to Support Universal Access to Trustworthy Health Information", journal="JMIR Form Res", year="2025", month="Mar", day="6", volume="9", pages="e57881", keywords="health communication", keywords="health literacy", keywords="consumer health information", keywords="digital health", keywords="universal health care", abstract="Background: Access to trustworthy, understandable, and actionable health information is a key determinant of health and is an essential component of universal health coverage and primary health care. The World Health Organization has developed a new digital resource for the general public to improve health and well-being across different life phases and to support people in caring for themselves, their families, and their communities. The goal was to make trustworthy health information accessible, understandable, and actionable for the general public in a digital format and at the global scale. Objective: The aim of this paper was to describe the multistage approach and methodology used to develop the resource Your life, your health: Tips and information for health and well-being (hereafter, Your life, your health). Methods: A 5-step process was used to develop Your life, your health, including (1) reviewing and synthesizing existing World Health Organization technical guidance, member state health and health literacy plans, and international human rights frameworks to identify priority messages; (2) developing messages and graphics that are accessible, understandable, and actionable for the public using health literacy principles; (3) engaging with experts and stakeholders to refine messages and message delivery; (4) presenting priority content in an accessible digital format; and (5) adapting the resource based on feedback and new evidences. Results: The Your life, your health online resource adopts a life-course approach to organize health information based on priority actions and rights that support peoples' health and well-being across different life stages and specific health topics. The resource promotes health literacy by offering advice on asking questions to health workers, making informed decisions about personal and family health, and effectively using digital media to obtain reliable health information. Additionally, it reflects the ambitions of the Sustainable Development Goals by providing essential information on the social determinants of health and clarifies the distinct roles of individuals, frontline workers, governments, and the media in promoting and protecting health. Conclusions: Making health information available---including to the public---is an essential step in strengthening the global health information system. The development process for the Your life, your health online resource outlined in this article offers a structured approach to translate technical health guidelines into accessible, understandable, and actionable health information for the general public. ", doi="10.2196/57881", url="https://formative.jmir.org/2025/1/e57881" } @Article{info:doi/10.2196/64672, author="DuPont-Reyes, J. Melissa and Villatoro, P. Alice and Tang, Lu", title="Health Information Scanning and Seeking in Diverse Language, Cultural and Technological Media Among Latinx Adolescents: Cross-Sectional Study", journal="J Med Internet Res", year="2025", month="Mar", day="5", volume="27", pages="e64672", keywords="adolescent behaviors", keywords="mental health", keywords="Latino", keywords="social media", keywords="adolescent", keywords="media use", keywords="internet use", keywords="health information seeking", keywords="health information scanning", keywords="mobile phone", abstract="Background: Continuous scientific and policy debate regarding the potential harm and/or benefit of media and social media on adolescent health has resulted, in part, from a deficiency in robust scientific evidence. Even with a lack of scientific consensus, public attitudes, and sweeping social media prohibitions have swiftly ensued. A focus on the diversity of adolescents around the world and their diverse use of language, culture, and social media is absent from these discussions. Objective: This study aims to guide communication policy and practice, including those addressing access to social media by adolescent populations. This study assesses physical and mental health information scanning and seeking behaviors across diverse language, cultural, and technological media and social media among Latinx adolescent residents in the United States. This study also explores how Latinx adolescents with mental health concerns use media and social media for support. Methods: In 2021, a cross-sectional survey was conducted among 701 US-based Latinx adolescents aged 13-20 years to assess their health-related media use. Assessments ascertained the frequency of media use and mental and physical health information scanning and seeking across various media technologies (eg, TV, podcasts, and social media) and language and cultural types (ie, Spanish, Latinx-tailored English, and general English). Linear regression models were used to estimate adjusted predicted means of mental and physical health information scanning and seeking across diverse language and cultural media types, net personal and family factors, in the full sample and by subsamples of mental health symptoms (moderate-high vs none-mild). Results: Among Latinx adolescents, media and social media use was similar across mental health symptoms. However, Latinx adolescents with moderate-high versus none-mild symptoms more often scanned general English media and social media for mental health information (P<.05), although not for physical health information. Also, Latinx adolescents with moderate-high versus none-mild symptoms more often sought mental health information on Latinx-tailored and general English media, and social media (P<.05); a similar pattern was found for physical health information seeking. In addition, Latinx adolescents with moderate-high versus none-mild symptoms often sought help from family and friends for mental and physical health problems and health care providers for mental health only (P<.05). Conclusions: While media and social media usage was similar across mental health, Latinx adolescents with moderate-high symptoms more often encountered mental health content in general English media and social media and turned to general English- and Latinx-tailored media and social media more often for their health concerns. Together these study findings suggest more prevalent and available mental health content in general English versus Spanish language and Latinx-tailored media and underscore the importance of providing accessible, quality health information across diverse language, cultural, and technological media and social networks as a viable opportunity to help improve adolescent health. ", doi="10.2196/64672", url="https://www.jmir.org/2025/1/e64672", url="http://www.ncbi.nlm.nih.gov/pubmed/40053766" } @Article{info:doi/10.2196/56831, author="Bayani, Azadeh and Ayotte, Alexandre and Nikiema, Noel Jean", title="Transformer-Based Tool for Automated Fact-Checking of Online Health Information: Development Study", journal="JMIR Infodemiology", year="2025", month="Feb", day="21", volume="5", pages="e56831", keywords="fact-checking automation", keywords="transformers", keywords="infodemic", keywords="credible health information", keywords="machine learning", keywords="automated", keywords="online health information", keywords="misinformation", keywords="natural language processing", keywords="epidemiology", keywords="health domain", abstract="Background: Many people seek health-related information online. The significance of reliable information became particularly evident due to the potential dangers of misinformation. Therefore, discerning true and reliable information from false information has become increasingly challenging. Objective: This study aimed to present a pilot study in which we introduced a novel approach to automate the fact-checking process, leveraging PubMed resources as a source of truth using natural language processing transformer models to enhance the process. Methods: A total of 538 health-related web pages, covering 7 different disease subjects, were manually selected by Factually Health Company. The process included the following steps: (1) using transformer models of bidirectional encoder representations from transformers (BERT), BioBERT, and SciBERT, and traditional models of random forests and support vector machines, to classify the contents of web pages into 3 thematic categories (semiology, epidemiology, and management), (2) for each category in the web pages, a PubMed query was automatically produced using a combination of the ``WellcomeBertMesh'' and ``KeyBERT'' models, (3) top 20 related literatures were automatically extracted from PubMed, and finally, (4) the similarity checking techniques of cosine similarity and Jaccard distance were applied to compare the content of extracted literature and web pages. Results: The BERT model for the categorization of web page contents had good performance, with F1-scores and recall of 93\% and 94\% for semiology and epidemiology, respectively, and 96\% for both the recall and F1-score for management. For each of the 3 categories in a web page, 1 PubMed query was generated and with each query, the 20 most related, open access articles within the category of systematic reviews and meta-analyses were extracted. Less than 10\% of the extracted literature was irrelevant; those were deleted. For each web page, an average of 23\% of the sentences were found to be very similar to the literature. Moreover, during the evaluation, it was found that cosine similarity outperformed the Jaccard distance measure when comparing the similarity between sentences from web pages and academic papers vectorized by BERT. However, there was a significant issue with false positives in the retrieved sentences when compared with accurate similarities, as some sentences had a similarity score exceeding 80\%, but they could not be considered similar sentences. Conclusions: In this pilot study, we have proposed an approach to automate the fact-checking of health-related online information. Incorporating content from PubMed or other scientific article databases as trustworthy resources can automate the discovery of similarly credible information in the health domain. ", doi="10.2196/56831", url="https://infodemiology.jmir.org/2025/1/e56831", url="http://www.ncbi.nlm.nih.gov/pubmed/39812653" } @Article{info:doi/10.2196/66910, author="Seinen, M. Tom and Kors, A. Jan and van Mulligen, M. Erik and Rijnbeek, R. Peter", title="Using Structured Codes and Free-Text Notes to Measure Information Complementarity in Electronic Health Records: Feasibility and Validation Study", journal="J Med Internet Res", year="2025", month="Feb", day="13", volume="27", pages="e66910", keywords="natural language processing", keywords="named entity recognition", keywords="clinical concept extraction", keywords="machine learning", keywords="electronic health records", keywords="EHR", keywords="word embeddings", keywords="clinical concept similarity", keywords="text mining", keywords="code", keywords="free-text", keywords="information", keywords="electronic record", keywords="data", keywords="patient records", keywords="framework", keywords="structured data", keywords="unstructured data", abstract="Background: Electronic health records (EHRs) consist of both structured data (eg, diagnostic codes) and unstructured data (eg, clinical notes). It is commonly believed that unstructured clinical narratives provide more comprehensive information. However, this assumption lacks large-scale validation and direct validation methods. Objective: This study aims to quantitatively compare the information in structured and unstructured EHR data and directly validate whether unstructured data offers more extensive information across a patient population. Methods: We analyzed both structured and unstructured data from patient records and visits in a large Dutch primary care EHR database between January 2021 and January 2024. Clinical concepts were identified from free-text notes using an extraction framework tailored for Dutch and compared with concepts from structured data. Concept embeddings were generated to measure semantic similarity between structured and extracted concepts through cosine similarity. A similarity threshold was systematically determined via annotated matches and minimized weighted Gini impurity. We then quantified the concept overlap between structured and unstructured data across various concept domains and patient populations. Results: In a population of 1.8 million patients, only 13\% of extracted concepts from patient records and 7\% from individual visits had similar structured counterparts. Conversely, 42\% of structured concepts in records and 25\% in visits had similar matches in unstructured data. Condition concepts had the highest overlap, followed by measurements and drug concepts. Subpopulation visits, such as those with chronic conditions or psychological disorders, showed different proportions of data overlap, indicating varied reliance on structured versus unstructured data across clinical contexts. Conclusions: Our study demonstrates the feasibility of quantifying the information difference between structured and unstructured data, showing that the unstructured data provides important additional information in the studied database and populations. The annotated concept matches are made publicly available for the clinical natural language processing community. Despite some limitations, our proposed methodology proves versatile, and its application can lead to more robust and insightful observational clinical research. ", doi="10.2196/66910", url="https://www.jmir.org/2025/1/e66910" } @Article{info:doi/10.2196/66696, author="Chen, Sihui and Ngai, Bik Cindy Sing and Cheng, Cecilia and Hu, Yangna", title="Analyzing Themes, Sentiments, and Coping Strategies Regarding Online News Coverage of Depression in Hong Kong: Mixed Methods Study", journal="J Med Internet Res", year="2025", month="Feb", day="13", volume="27", pages="e66696", keywords="online news coverage", keywords="depression", keywords="natural language processing", keywords="NLP", keywords="latent Dirichlet allocation", keywords="LDA", keywords="sentiment", keywords="coping strategies", keywords="content analysis", abstract="Background: Depression, a highly prevalent global mental disorder, has prompted significant research concerning its association with social media use and its impact during Hong Kong's social unrest and COVID-19 pandemic. However, other mainstream media, specifically online news, has been largely overlooked. Despite extensive research conducted in countries, such as the United States, Australia, and Canada, to investigate the latent subthemes, sentiments, and coping strategies portrayed in depression-related news, the landscape in Hong Kong remains unexplored. Objective: This study aims to uncover the latent subthemes presented in the online news coverage of depression in Hong Kong, examine the sentiment conveyed in the news, and assess whether coping strategies have been provided in the news for individuals experiencing depression. Methods: This study used natural language processing (NLP) techniques, namely the latent Dirichlet allocation topic modeling and the Valence Aware Dictionary and Sentiment Reasoner (VADER) sentiment analysis, to fulfill the first and second objectives. Coping strategies were rigorously assessed and manually labeled with designated categories by content analysis. The online news was collected from February 2019 to May 2024 from Hong Kong mainstream news websites to examine the latest portrayal of depression, particularly during and after the social unrest and the COVID-19 pandemic. Results: In total, 2435 news articles were retained for data analysis after the news screening process. A total of 7 subthemes were identified based on the topic modeling results. Societal system, law enforcement, global recession, lifestyle, leisure, health issues, and US politics were the latent subthemes. Moreover, the overall news exhibited a slightly positive sentiment. The correlations between the sentiment scores and the latent subthemes indicated that the societal system, law enforcement, health issues, and US politics revealed negative tendencies, while the remainder leaned toward a positive sentiment. The coping strategies for depression were substantially lacking; however, the categories emphasizing information on skills and resources and individual adjustment to cope with depression emerged as the priority focus. Conclusions: This pioneering study used a mixed methods approach where NLP was used to investigate latent subthemes and underlying sentiment in online news. Content analysis was also performed to examine available coping strategies. The findings of this research enhance our understanding of how depression is portrayed through online news in Hong Kong and the preferable coping strategies being used to mitigate depression. The potential impact on readers was discussed. Future research is encouraged to address the mentioned implications and limitations, with recommendations to apply advanced NLP techniques to a new mental health issue case or language. ", doi="10.2196/66696", url="https://www.jmir.org/2025/1/e66696", url="http://www.ncbi.nlm.nih.gov/pubmed/39946170" } @Article{info:doi/10.2196/64290, author="Mendel, Tamir and Singh, Nina and Mann, M. Devin and Wiesenfeld, Batia and Nov, Oded", title="Laypeople's Use of and Attitudes Toward Large Language Models and Search Engines for Health Queries: Survey Study", journal="J Med Internet Res", year="2025", month="Feb", day="13", volume="27", pages="e64290", keywords="large language model", keywords="artificial intelligence", keywords="LLMs", keywords="search engine", keywords="Google", keywords="internet", keywords="online health information", keywords="United States", keywords="survey", keywords="mobile phone", abstract="Background: Laypeople have easy access to health information through large language models (LLMs), such as ChatGPT, and search engines, such as Google. Search engines transformed health information access, and LLMs offer a new avenue for answering laypeople's questions. Objective: We aimed to compare the frequency of use and attitudes toward LLMs and search engines as well as their comparative relevance, usefulness, ease of use, and trustworthiness in responding to health queries. Methods: We conducted a screening survey to compare the demographics of LLM users and nonusers seeking health information, analyzing results with logistic regression. LLM users from the screening survey were invited to a follow-up survey to report the types of health information they sought. We compared the frequency of use of LLMs and search engines using ANOVA and Tukey post hoc tests. Lastly, paired-sample Wilcoxon tests compared LLMs and search engines on perceived usefulness, ease of use, trustworthiness, feelings, bias, and anthropomorphism. Results: In total, 2002 US participants recruited on Prolific participated in the screening survey about the use of LLMs and search engines. Of them, 52\% (n=1045) of the participants were female, with a mean age of 39 (SD 13) years. Participants were 9.7\% (n=194) Asian, 12.1\% (n=242) Black, 73.3\% (n=1467) White, 1.1\% (n=22) Hispanic, and 3.8\% (n=77) were of other races and ethnicities. Further, 1913 (95.6\%) used search engines to look up health queries versus 642 (32.6\%) for LLMs. Men had higher odds (odds ratio [OR] 1.63, 95\% CI 1.34-1.99; P<.001) of using LLMs for health questions than women. Black (OR 1.90, 95\% CI 1.42-2.54; P<.001) and Asian (OR 1.66, 95\% CI 1.19-2.30; P<.01) individuals had higher odds than White individuals. Those with excellent perceived health (OR 1.46, 95\% CI 1.1-1.93; P=.01) were more likely to use LLMs than those with good health. Higher technical proficiency increased the likelihood of LLM use (OR 1.26, 95\% CI 1.14-1.39; P<.001). In a follow-up survey of 281 LLM users for health, most participants used search engines first (n=174, 62\%) to answer health questions, but the second most common first source consulted was LLMs (n=39, 14\%). LLMs were perceived as less useful (P<.01) and less relevant (P=.07), but elicited fewer negative feelings (P<.001), appeared more human (LLM: n=160, vs search: n=32), and were seen as less biased (P<.001). Trust (P=.56) and ease of use (P=.27) showed no differences. Conclusions: Search engines are the primary source of health information; yet, positive perceptions of LLMs suggest growing use. Future work could explore whether LLM trust and usefulness are enhanced by supplementing answers with external references and limiting persuasive language to curb overreliance. Collaboration with health organizations can help improve the quality of LLMs' health output. ", doi="10.2196/64290", url="https://www.jmir.org/2025/1/e64290", url="http://www.ncbi.nlm.nih.gov/pubmed/39946180" } @Article{info:doi/10.2196/63149, author="Downing, J. Gregory and Tramontozzi, M. Lucas and Garcia, Jackson and Villanueva, Emma", title="Harnessing Internet Search Data as a Potential Tool for Medical Diagnosis: Literature Review", journal="JMIR Ment Health", year="2025", month="Feb", day="11", volume="12", pages="e63149", keywords="health", keywords="informatics", keywords="internet search data", keywords="early diagnosis", keywords="web search", keywords="information technology", keywords="internet", keywords="machine learning", keywords="medical records", keywords="diagnosis", keywords="health care", keywords="self-diagnosis", keywords="detection", keywords="intervention", keywords="patient education", keywords="internet search", keywords="health-seeking behavior", keywords="artificial intelligence", keywords="AI", abstract="Background: The integration of information technology into health care has created opportunities to address diagnostic challenges. Internet searches, representing a vast source of health-related data, hold promise for improving early disease detection. Studies suggest that patterns in search behavior can reveal symptoms before clinical diagnosis, offering potential for innovative diagnostic tools. Leveraging advancements in machine learning, researchers have explored linking search data with health records to enhance screening and outcomes. However, challenges like privacy, bias, and scalability remain critical to its widespread adoption. Objective: We aimed to explore the potential and challenges of using internet search data in medical diagnosis, with a specific focus on diseases and conditions such as cancer, cardiovascular disease, mental and behavioral health, neurodegenerative disorders, and nutritional and metabolic diseases. We examined ethical, technical, and policy considerations while assessing the current state of research, identifying gaps and limitations, and proposing future research directions to advance this emerging field. Methods: We conducted a comprehensive analysis of peer-reviewed literature and informational interviews with subject matter experts to examine the landscape of internet search data use in medical research. We searched for published peer-reviewed literature on the PubMed database between October and December 2023. Results: Systematic selection based on predefined criteria included 40 articles from the 2499 identified articles. The analysis revealed a nascent domain of internet search data research in medical diagnosis, marked by advancements in analytics and data integration. Despite challenges such as bias, privacy, and infrastructure limitations, emerging initiatives could reshape data collection and privacy safeguards. Conclusions: We identified signals correlating with diagnostic considerations in certain diseases and conditions, indicating the potential for such data to enhance clinical diagnostic capabilities. However, leveraging internet search data for improved early diagnosis and health care outcomes requires effectively addressing ethical, technical, and policy challenges. By fostering interdisciplinary collaboration, advancing infrastructure development, and prioritizing patient engagement and consent, researchers can unlock the transformative potential of internet search data in medical diagnosis to ultimately enhance patient care and advance health care practice and policy. ", doi="10.2196/63149", url="https://mental.jmir.org/2025/1/e63149", url="http://www.ncbi.nlm.nih.gov/pubmed/39813106" } @Article{info:doi/10.2196/66446, author="Zhang, Zhongmin and Xu, Hengyi and Pan, Jing and Song, Fujian and Chen, Ting", title="Spatiotemporal Characteristics and Influential Factors of Electronic Cigarette Web-Based Attention in Mainland China: Time Series Observational Study", journal="J Med Internet Res", year="2025", month="Feb", day="10", volume="27", pages="e66446", keywords="electronic cigarettes", keywords="Baidu index", keywords="web-based attention", keywords="spatiotemporal characteristics", keywords="China", abstract="Background: The popularity of electronic cigarettes (e-cigarettes) has steadily increased, prompting a considerable number of individuals to search for relevant information on them. Previous e-cigarette infodemiology studies have focused on assessing the quality and reliability of website content and quantifying the impact of policies. In reality, most low-income countries and low- and middle-income countries have not yet conducted e-cigarette use surveillance. Data sourced from web-based search engines related to e-cigarettes have the potential to serve as cost-effective supplementary means to traditional monitoring approaches. Objective: This study aimed to analyze the spatiotemporal distribution characteristics and associated sociodemographic factors of e-cigarette searches using trends from the Baidu search engine. Methods: The query data related to e-cigarettes for 31 provinces in mainland China were retrieved from the Baidu index database from January 1, 2015, to December 31, 2022. Concentration ratio methods and spatial autocorrelation analysis were applied to analyze the temporal aggregation and spatial aggregation of the e-cigarette Baidu index, respectively. A variance inflation factor test was performed to avoid multicollinearity. A spatial panel econometric model was developed to assess the determinants of e-cigarette web-based attention. Results: The daily average Baidu index for e-cigarettes increased from 53,234.873 in 2015 to 85,416.995 in 2021 and then declined to 52,174.906 in 2022. This index was concentrated in the southeastern coastal region, whereas the hot spot shifted to the northwestern region after adjusting for population size. Positive spatial autocorrelation existed in the per capita Baidu index of e-cigarettes from 2015 to 2022. The results of the local Moran's I showed that there were mainly low-low cluster areas of the per capita Baidu index, especially in the central region. Furthermore, the male-female ratio, the proportion of high school and above education, and the per capita gross regional domestic product were positively correlated with the per capita Baidu index for e-cigarettes. A higher urbanization rate was associated with a reduced per capita Baidu index. Conclusions: With the increasing popularity of web-based searches for e-cigarettes, a targeted e-cigarette health education program for individuals in the northwest, males, rural populations, high school and above educated individuals, and high-income groups is warranted. ", doi="10.2196/66446", url="https://www.jmir.org/2025/1/e66446", url="http://www.ncbi.nlm.nih.gov/pubmed/39928402" } @Article{info:doi/10.2196/66072, author="Xiong, Xin and Xiang, Linghui and Chang, Litao and Wu, XY Irene and Deng, Shuzhen", title="Forecasting the Incidence of Mumps Based on the Baidu Index and Environmental Data in Yunnan, China: Deep Learning Model Study", journal="J Med Internet Res", year="2025", month="Feb", day="6", volume="27", pages="e66072", keywords="mumps", keywords="deep learning", keywords="baidu index", keywords="forecasting", keywords="incidence prediction", keywords="time series analysis", keywords="Yunnan", keywords="China", abstract="Background: Mumps is a viral respiratory disease characterized by facial swelling and transmitted through respiratory secretions. Despite the availability of an effective vaccine, mumps outbreaks have reemerged globally, including in China, where it remains a significant public health issue. In Yunnan province, China, the incidence of mumps has fluctuated markedly and is higher than that in mainland China, underscoring the need for improved outbreak prediction methods. Traditional surveillance methods, however, may not be sufficient for timely and accurate outbreak prediction. Objective: Our study aims to leverage the Baidu search index, representing search volumes from China's most popular search engine, along with environmental data to develop a predictive model for mumps incidence in Yunnan province. Methods: We analyzed mumps incidence in Yunnan Province from 2014 to 2023, and used time series data, including mumps incidence, Baidu search index, and environmental factors, from 2016 to 2023, to develop predictive models based on long short-term memory networks. Feature selection was conducted using Pearson correlation analysis, and lag correlations were explored through a distributed nonlinear lag model (DNLM). We constructed four models with different combinations of predictors: (1) model BE, combining the Baidu index and environmental factors data; (2) model IB, combining mumps incidence and Baidu index data; (3) model IE, combining mumps incidence and environmental factors; and (4) model IBE, integrating all 3 data sources. Results: The incidence of mumps in Yunnan showed significant variability, peaking at 37.5 per 100,000 population in 2019. From 2014 to 2023, the proportion of female patients ranged from 41.3\% in 2015 to 45.7\% in 2020, consistently lower than that of male patients. After excluding variables with a Pearson correlation coefficient of <0.10 or P values of <.05, we included 3 Baidu index search term groups (disease name, symptoms, and treatment) and 6 environmental factors (maximum temperature, minimum temperature, sulfur dioxide, carbon monoxide, particulate matter with a diameter of 2.5 {\textmu}m or less, and particulate matter with a diameter of 10 {\textmu}m or less) for model development. DNLM analysis revealed that the relative risks consistently increased with rising Baidu index values, while nonlinear associations between temperature and mumps incidence were observed. Among the 4 models, model IBE exhibited the best performance, achieving the coefficient of determination of 0.72, with mean absolute error, mean absolute percentage error, and root-mean-square error values of 0.33, 15.9\%, and 0.43, respectively, in the test set. Conclusions: Our study developed model IBE to predict the incidence of mumps in Yunnan province, offering a potential tool for early detection of mumps outbreaks. The performance of model IBE underscores the potential of integrating search engine data and environmental factors to enhance mumps incidence forecasting. This approach offers a promising tool for improving public health surveillance and enabling rapid responses to mumps outbreaks. ", doi="10.2196/66072", url="https://www.jmir.org/2025/1/e66072" } @Article{info:doi/10.2196/58338, author="Kaushik, Aprajita and Barcellona, Capucine and Mandyam, Kanumoory Nikita and Tan, Ying Si and Tromp, Jasper", title="Challenges and Opportunities for Data Sharing Related to Artificial Intelligence Tools in Health Care in Low- and Middle-Income Countries: Systematic Review and Case Study From Thailand", journal="J Med Internet Res", year="2025", month="Feb", day="4", volume="27", pages="e58338", keywords="artificial intelligence", keywords="data sharing", keywords="health care", keywords="low- and middle-income countries", keywords="AI tools", keywords="systematic review", keywords="case study", keywords="Thailand", keywords="computing machinery", keywords="academic experts", keywords="technology developers", keywords="health care providers", keywords="internet connectivity", keywords="data systems", keywords="low health data literacy", keywords="cybersecurity", keywords="standardized data formats", keywords="AI development", keywords="PRISMA", abstract="Background: Health care systems in low- and middle-income countries (LMICs) can greatly benefit from artificial intelligence (AI) interventions in various use cases such as diagnostics, treatment, and public health monitoring but face significant challenges in sharing data for developing and deploying AI in health care. Objective: This study aimed to identify barriers and enablers to data sharing for AI in health care in LMICs and to test the relevance of these in a local context. Methods: First, we conducted a systematic literature search using PubMed, SCOPUS, Embase, Web of Science, and ACM using controlled vocabulary. Primary research studies, perspectives, policy landscape analyses, and commentaries performed in or involving an LMIC context were included. Studies that lacked a clear connection to health information exchange systems or were not reported in English were excluded from the review. Two reviewers independently screened titles and abstracts of the included articles and critically appraised each study. All identified barriers and enablers were classified according to 7 categories as per the predefined framework---technical, motivational, economic, political, legal and policy, ethical, social, organisational, and managerial. Second, we tested the local relevance of barriers and enablers in Thailand through stakeholder interviews with 15 academic experts, technology developers, regulators, policy makers, and health care providers. The interviewers took notes and analyzed data using framework analysis. Coding procedures were standardized to enhance the reliability of our approach. Coded data were reverified and themes were readjusted where necessary to avoid researcher bias. Results: We identified 22 studies, the majority of which were conducted across Africa (n=12, 55\%) and Asia (n=6, 27\%). The most important data-sharing challenges were unreliable internet connectivity, lack of equipment, poor staff and management motivation, uneven resource distribution, and ethical concerns. Possible solutions included improving IT infrastructure, enhancing funding, introducing user-friendly software, and incentivizing health care organizations and personnel to share data for AI-related tools. In Thailand, inconsistent data systems, limited staff time, low health data literacy, complex and unclear policies, and cybersecurity issues were important data-sharing challenges. Key solutions included building a conducive digital ecosystem---having shared data input platforms for health facilities to ensure data uniformity and to develop easy-to-understand consent forms, having standardized guidelines for data sharing, and having compensation policies for data breach victims. Conclusions: Although AI in LMICs has the potential to overcome health inequalities, these countries face technical, political, legal, policy, and organizational barriers to sharing data, which impede effective AI development and deployment. When tested in a local context, most of these barriers were relevant. Although our findings might not be generalizable to other contexts, this study can be used by LMICs as a framework to identify barriers and strengths within their health care systems and devise localized solutions for enhanced data sharing. Trial Registration: PROSPERO CRD42022360644; https://www.crd.york.ac.uk/prospero/display\_record.php?RecordID=360644 ", doi="10.2196/58338", url="https://www.jmir.org/2025/1/e58338" } @Article{info:doi/10.2196/63449, author="Hsu, Wan-Chen", title="eHealth Literacy and Cyberchondria Severity Among Undergraduate Students: Mixed Methods Study", journal="JMIR Form Res", year="2025", month="Feb", day="3", volume="9", pages="e63449", keywords="eHealth literacy", keywords="undergraduate student", keywords="cyberchondria", keywords="compucondria", keywords="web-based health information", keywords="health information seeking", keywords="college students", abstract="Background: With the development of the internet, health care websites have become increasingly important by enabling easy access to health information, thereby influencing the attitudes and behaviors of individuals toward health issues. However, few studies have addressed public access to health information and self-diagnosis. Objective: This study investigated the background factors and status of cyberchondria severity among college students by conducting a nationwide sample survey using the Cyberchondria Severity Scale. Further, we explored the perspective of eHealth literacy of those with scores higher than 1 SD from the mean by analyzing their recent experiences using web-based health information. Methods: A nationally representative sample of college students was surveyed, and 802 valid responses were obtained (male: 435/802, 54.2\%; female: 367/802, 45.8\%; mean age 20.3, SD 1.4 years). The Cyberchondria Severity Scale was used, which consisted of 4 dimensions (increased anxiety, obsessive-compulsive hypochondria, perceived controllability, and web-based physician-patient interaction). Additionally, we recruited 9 volunteers who scored more than 1 SD above the mean for in-depth interviews on their web-based health information--seeking behaviors. Results: Significant differences were found across the 4 dimensions of cyberchondria severity (F3,2403=256.26; P<.001), with perceived controllability scoring the highest (mean 2.75, SD 0.87) and obsessive-compulsive hypochondria scoring the lowest (mean 2.19, SD 0.77). Positive correlations were observed between perceived controllability, web-based physician-patient interactions, increased anxiety, and obsessive-compulsive hypochondria (r=0.46-0.75, P<.001). Regression analysis indicated that health concern significantly predicted perceived controllability ($\beta$ coefficient=0.12; P<.05) and web-based physician-patient interaction ($\beta$ coefficient=0.16; P<.001). Interview data revealed that students often experienced heightened anxiety (8/9, 89\%) and stress (7/9, 78\%) after exposure to web-based health information, highlighting the need for improved health literacy and reliable information sources. Conclusions: The study identified both benefits and risks in college students' use of web-based health information, emphasizing the importance of critical consciousness and eHealth literacy. Future research should examine how college students move from self-awareness to actionable change and the development of critical health literacy, which are essential for effective digital health engagement. ", doi="10.2196/63449", url="https://formative.jmir.org/2025/1/e63449" } @Article{info:doi/10.2196/55309, author="Chan, J. Garrett and Fung, Mark and Warrington, Jill and Nowak, A. Sarah", title="Understanding Health-Related Discussions on Reddit: Development of a Topic Assignment Method and Exploratory Analysis", journal="JMIR Form Res", year="2025", month="Jan", day="29", volume="9", pages="e55309", keywords="digital health", keywords="internet", keywords="open data", keywords="social networking", keywords="social media", abstract="Background: Social media has become a widely used way for people to share opinions about health care and medical topics. Social media data can be leveraged to understand patient concerns and provide insight into why patients may turn to the internet instead of the health care system for health advice. Objective: This study aimed to develop a method to investigate Reddit posts discussing health-related conditions. Our goal was to characterize these topics and identify trends in these social media--based medical discussions. Methods: Using an initial query, we collected 1 year of Reddit posts containing the phrases ``get tested'' and ``get checked.'' These posts were manually reviewed, and subreddits containing irrelevant posts were excluded from analysis. This selection of posts was manually read by the investigators to categorize posts into topics. A script was developed to automatically assign topics to additional posts based on keywords. Topic and keyword selections were refined based on manual review for more accurate topic assignment. Topic assignment was then performed on the entire 1-year Reddit dataset containing 347,130 posts. Related topics were grouped into broader medical disciplines. Analysis of the topic assignments was then conducted to assess condition and medical topic frequencies in medical condition--focused subreddits and general subreddits. Results: We created an automated algorithm to assign medical topics to Reddit posts. By iterating through multiple rounds of topic assignment, we improved the accuracy of the algorithm. Ultimately, this algorithm created 82 topics sorted into 17 broader medical disciplines. Of all topics, sexually transmitted infections (STIs), eye disorders, anxiety, and pregnancy had the highest post frequency overall. STIs comprised 7.44\% (5876/78,980) of posts, and anxiety comprised 5.43\% (4289/78,980) of posts. A total of 34\% (28/82) of the topics comprised 80\% (63,184/78,980) of all posts. Of the medical disciplines, those with the most posts were?psychiatry and mental health;?genitourinary and reproductive health; infectious diseases;?and endocrinology, nutrition, and metabolism. Psychiatry and mental health comprised 26.6\% (21,009/78,980) of posts, and genitourinary and reproductive health comprised 13.6\% (10,741/78,980) of posts. Overall, most posts were also classified under these 4 medical disciplines. During analysis, subreddits were also classified as general if they did not focus on a specific health issue and topic-specific if they discussed a specific medical issue. Topics that appeared most frequently in the top 5 in general subreddits included addiction and drug anxiety, attention-deficit/hyperactivity disorder, abuse, and STIs. In topic-specific subreddits, most posts were found to discuss the topic of that subreddit. Conclusions: Certain health topics and medical disciplines are predominant on Reddit. These include topics such as STIs, eye disorders, anxiety, and pregnancy. Most posts were classified under the medical disciplines of psychiatry and mental health, as well as genitourinary and reproductive health. ", doi="10.2196/55309", url="https://formative.jmir.org/2025/1/e55309", url="http://www.ncbi.nlm.nih.gov/pubmed/39879094" } @Article{info:doi/10.2196/69554, author="Gu, Chenyu and Qian, Liquan and Zhuo, Xiaojie", title="Mindfulness Intervention for Health Information Avoidance in Older Adults: Mixed Methods Study", journal="JMIR Public Health Surveill", year="2025", month="Jan", day="28", volume="11", pages="e69554", keywords="health information avoidance", keywords="cyberchondria", keywords="self-determination theory", keywords="mindfulness", keywords="elderly", abstract="Background: The global aging population and rapid development of digital technology have made health management among older adults an urgent public health issue. The complexity of online health information often leads to psychological challenges, such as cyberchondria, exacerbating health information avoidance behaviors. These behaviors hinder effective health management; yet, little research examines their mechanisms or intervention strategies. Objective: This study investigates the mechanisms influencing health information avoidance among older adults, emphasizing the mediating role of cyberchondria. In addition, it evaluates the effectiveness of mindfulness meditation as an intervention strategy to mitigate these behaviors. Methods: A mixed methods approach was used, combining quantitative and qualitative methodologies. Substudy 1 developed a theoretical model based on self-determination theory to explore internal (positive metacognition and health self-efficacy) and external (subjective norms and health information similarity) factors influencing health information avoidance, with cyberchondria as a mediator. A cross-sectional survey (N=236) was conducted to test the proposed model. Substudy 2 involved a 4-week mindfulness meditation intervention (N=94) to assess its impact on reducing health information avoidance behaviors. Results: Study 1 showed that positive metacognition ($\beta$=.26, P=.002), health self-efficacy ($\beta$=.25, P<.001), and health information similarity ($\beta$=.29, P<.001) significantly predicted health information avoidance among older adults. Cyberchondria mediated these effects: positive metacognition (effect=0.106, 95\% CI 0.035-0.189), health self-efficacy (effect=0.103, 95\% CI 0.043-0.185), and health information similarity (effect=0.120, 95\% CI 0.063-0.191). Subjective norms did not significantly predict health information avoidance ($\beta$=?.11, P=.13), and cyberchondria did not mediate this relationship (effect=?0.045, 95\% CI ?0.102 to 0.016). Study 2 found that after the 4-week mindfulness intervention, the intervention group (group 1: n=46) exhibited significantly higher mindfulness levels than the control group (group 2: n=48; Mgroup1=4.122, Mgroup2=3.606, P<.001) and higher levels compared with preintervention (Mt2=4.122, Mt1=3.502, P<.001, where t1=preintervention and t2=postintervention). However, cyberchondria levels did not change significantly (Mt1=2.848, Mt2=2.685, P=.18). Nevertheless, the results revealed a significant interaction effect between mindfulness and cyberchondria on health information avoidance (effect=?0.357, P=.002, 95\% CI ?0.580 to ?0.131), suggesting that mindfulness intervention effectively inhibited the transformation of cyberchondria into health information avoidance behavior. Conclusions: This study reveals the role of cyberchondria in health information avoidance and validates mindfulness meditation as an effective intervention for mitigating such behaviors. Findings offer practical recommendations for improving digital health information delivery and health management strategies for older adults. ", doi="10.2196/69554", url="https://publichealth.jmir.org/2025/1/e69554" } @Article{info:doi/10.2196/67192, author="Scherbakov, A. Dmitry and Hubig, C. Nina and Lenert, A. Leslie and Alekseyenko, V. Alexander and Obeid, S. Jihad", title="Natural Language Processing and Social Determinants of Health in Mental Health Research: AI-Assisted Scoping Review", journal="JMIR Ment Health", year="2025", month="Jan", day="16", volume="12", pages="e67192", keywords="natural language processing", keywords="datasets", keywords="mental health", keywords="automated review", keywords="depression", keywords="suicide", keywords="mental health research", keywords="NLP", keywords="artificial intelligence", keywords="AI", keywords="scoping review", keywords="determinant", keywords="large language model", keywords="LLM", keywords="quantitative", keywords="automation", abstract="Background: The use of natural language processing (NLP) in mental health research is increasing, with a wide range of applications and datasets being investigated. Objective: This review aims to summarize the use of NLP in mental health research, with a special focus on the types of text datasets and the use of social determinants of health (SDOH) in NLP projects related to mental health. Methods: The search was conducted in September 2024 using a broad search strategy in PubMed, Scopus, and CINAHL Complete. All citations were uploaded to Covidence (Veritas Health Innovation) software. The screening and extraction process took place in Covidence with the help of a custom large language model (LLM) module developed by our team. This LLM module was calibrated and tuned to automate many aspects of the review process. Results: The screening process, assisted by the custom LLM, led to the inclusion of 1768 studies in the final review. Most of the reviewed studies (n=665, 42.8\%) used clinical data as their primary text dataset, followed by social media datasets (n=523, 33.7\%). The United States contributed the highest number of studies (n=568, 36.6\%), with depression (n=438, 28.2\%) and suicide (n=240, 15.5\%) being the most frequently investigated mental health issues. Traditional demographic variables, such as age (n=877, 56.5\%) and gender (n=760, 49\%), were commonly extracted, while SDOH factors were less frequently reported, with urban or rural status being the most used (n=19, 1.2\%). Over half of the citations (n=826, 53.2\%) did not provide clear information on dataset accessibility, although a sizable number of studies (n=304, 19.6\%) made their datasets publicly available. Conclusions: This scoping review underscores the significant role of clinical notes and social media in NLP-based mental health research. Despite the clear relevance of SDOH to mental health, their underutilization presents a gap in current research. This review can be a starting point for researchers looking for an overview of mental health projects using text data. Shared datasets could be used to place more emphasis on SDOH in future studies. ", doi="10.2196/67192", url="https://mental.jmir.org/2025/1/e67192" } @Article{info:doi/10.2196/59625, author="Huijgens, Fiorella and Kwakman, Pascale and Hillen, Marij and van Weert, Julia and Jaspers, Monique and Smets, Ellen and Linn, Annemiek", title="How Patients With Cancer Use the Internet to Search for Health Information: Scenario-Based Think-Aloud Study", journal="JMIR Infodemiology", year="2025", month="Jan", day="16", volume="5", pages="e59625", keywords="web-based health information seeking", keywords="think aloud", keywords="scenario based", keywords="cancer", keywords="patient evaluation", keywords="information seeking", keywords="web-based information", keywords="health information", keywords="internet", keywords="pattern", keywords="motivation", keywords="cognitive", keywords="emotional", keywords="response", keywords="patient", keywords="survivor", keywords="caregiver", keywords="interview", keywords="scenario", keywords="women", keywords="men", abstract="Background: Patients with cancer increasingly use the internet to seek health information. However, thus far, research treats web-based health information seeking (WHIS) behavior in a rather dichotomous manner (ie, approaching or avoiding) and fails to capture the dynamic nature and evolving motivations that patients experience when engaging in WHIS throughout their disease trajectory. Insights can be used to support effective patient-provider communication about WHIS and can lead to better designed web-based health platforms. Objective: This study explored patterns of motivations and emotions behind the web-based information seeking of patients with cancer at various stages of their disease trajectory, as well as the cognitive and emotional responses evoked by WHIS via a scenario-based, think-aloud approach. Methods: In total, 15 analog patients were recruited, representing patients with cancer, survivors, and informal caregivers. Imagining themselves in 3 scenarios---prediagnosis phase (5/15, 33\%), treatment phase (5/15, 33\%), and survivor phase (5/15, 33\%)---patients were asked to search for web-based health information while being prompted to verbalize their thoughts. In total, 2 researchers independently coded the sessions, categorizing the codes into broader themes to comprehend analog patients' experiences during WHIS. Results: Overarching motives for WHIS included reducing uncertainty, seeking reassurance, and gaining empowerment. At the beginning of the disease trajectory, patients mainly showed cognitive needs, whereas this shifted more toward affective needs in the subsequent disease stages. Analog patients' WHIS approaches varied from exploratory to focused or a combination of both. They adapted their search strategy when faced with challenging cognitive or emotional content. WHIS triggered diverse emotions, fluctuating throughout the search. Complex, confrontational, and unexpected information mainly induced negative emotions. Conclusions: This study provides valuable insights into the motivations of patients with cancer underlying WHIS and the emotions experienced at various stages of the disease trajectory. Understanding patients' search patterns is pivotal in optimizing web-based health platforms to cater to specific needs. In addition, these findings can guide clinicians in accommodating patients' specific needs and directing patients toward reliable sources of web-based health information. ", doi="10.2196/59625", url="https://infodemiology.jmir.org/2025/1/e59625" } @Article{info:doi/10.2196/54460, author="Weidinger, Florian and Dietzel, Nikolas and Graessel, Elmar and Prokosch, Hans-Ulrich and Kolominsky-Rabas, Peter", title="Using Health Information Resources for People With Cognitive Impairment (digiDEM Bayern): Registry-Based Cohort Study", journal="JMIR Form Res", year="2025", month="Jan", day="15", volume="9", pages="e54460", keywords="dementia", keywords="mild cognitive impairment", keywords="cognitive impairment", keywords="information sources", keywords="health information", keywords="health information--seeking behavior", keywords="Digital Dementia Registry Bavaria", keywords="digiDEM", abstract="Background: Dementia is a growing global health challenge with significant economic and social implications. Underdiagnosis of dementia is prevalent due to a lack of knowledge and understanding among the general population. Enhancing dementia literacy through improved health information--seeking behavior is crucial for the self-determined management of the disease by those affected. Understanding the relationship between dementia literacy, health information--seeking behavior, and the use of various information sources among individuals with cognitive impairment is of high importance in this context. Objective: The aim of this study was to analyze the relevance of different sources of health information from the perspective of people with cognitive impairment, while also evaluating differences based on age, gender, and disease progression. Methods: This study is part of the ongoing project ``Digital Dementia Registry Bavaria -- digiDEM Bayern.'' The Digital Dementia Registry Bavaria is a multicenter, prospective, longitudinal register study in Bavaria, Germany. People with cognitive impairment rated several information sources by using Likert scales with the values unimportant (1) to very important (5). Data were analyzed descriptively, and multiple 2-sample, 2-tailed t tests were used to evaluate differences by cognitive status and gender and using multiple one-way ANOVA to evaluate differences by age group. Results: Data of 924 people with cognitive impairment (531 with dementia, 393 with mild cognitive impairment) were evaluated. The most relevant health information sources were ``Personal visit to a medical professional'' (mean 3.9, SD 1.1) and ``Family / Friends'' (mean 3.9, SD 1.2). ``Internet'' was 1 of the 2 lowest-rated information sources by people with cognitive impairment (mean 1.6, SD 1.1), with nearly three-quarters (684/924, 74\%) of the participants rating the source as unimportant. The age-specific analyses showed significant differences for the sources ``Internet'' (F2,921=61.23; P<.001), ``Courses / Lectures'' (F2,921=18.88; P<.001), and ``Family / Friends'' (F2,921=6.27; P=.002) for the 3 defined age groups. There were several significant differences between people with mild cognitive impairment and dementia whereby the first group evaluated most sources higher, such as ``Internet'' (mean difference=0.6; t640=7.52; P<.001). The only sources rated higher by the dementia group were ``TV / Radio'' and ``Family / Friends,'' with none of them showing significant differences. Gender-specific analyses showed women with cognitive impairment valuing every evaluated source higher than men apart from ``Internet'' (mean difference=0.4; t685=4.97; P<.001). Conclusions: To enhance health and dementia literacy, the best way to communicate health information to people with cognitive impairment is through interpersonal contact with medical professionals and their friends and family. Slight changes in valuation should be considered as the medical condition progresses, along with variations by age and gender. In particular, the evaluation and use of the internet are dependent on these factors. Further research is needed to capture potential changes in the valuation of the internet as a health information source. International Registered Report Identifier (IRRID): RR2-10.1136/bmjopen-2020-043473 ", doi="10.2196/54460", url="https://formative.jmir.org/2025/1/e54460" } @Article{info:doi/10.2196/50862, author="Lang, A. Iain and King, Angela and Boddy, Kate and Stein, Ken and Asare, Lauren and Day, Jo and Liabo, Kristin", title="Jargon and Readability in Plain Language Summaries of Health Research: Cross-Sectional Observational Study", journal="J Med Internet Res", year="2025", month="Jan", day="13", volume="27", pages="e50862", keywords="readability", keywords="jargon", keywords="reading", keywords="accessibility", keywords="health research", keywords="science communication", keywords="public understanding of science", keywords="open science", keywords="patient and public involvement", keywords="health literacy", keywords="plain language summary", keywords="health communication", abstract="Background: The idea of making science more accessible to nonscientists has prompted health researchers to involve patients and the public more actively in their research. This sometimes involves writing a plain language summary (PLS), a short summary intended to make research findings accessible to nonspecialists. However, whether PLSs satisfy the basic requirements of accessible language is unclear. Objective: We aimed to assess the readability and level of jargon in the PLSs of research funded by the largest national clinical research funder in Europe, the United Kingdom's National Institute for Health and Care Research (NIHR). We also aimed to assess whether readability and jargon were influenced by internal and external characteristics of research projects. Methods: We downloaded the PLSs of all NIHR National Journals Library reports from mid-2014 to mid-2022 (N=1241) and analyzed them using the Flesch Reading Ease (FRE) formula and a jargon calculator (the De-Jargonizer). In our analysis, we included the following study characteristics of each PLS: research topic, funding program, project size, length, publication year, and readability and jargon scores of the original funding proposal. Results: Readability scores ranged from 1.1 to 70.8, with an average FRE score of 39.0 (95\% CI 38.4-39.7). Moreover, 2.8\% (35/1241) of the PLSs had an FRE score classified as ``plain English'' or better; none had readability scores in line with the average reading age of the UK population. Jargon scores ranged from 76.4 to 99.3, with an average score of 91.7 (95\% CI 91.5-91.9) and 21.7\% (269/1241) of the PLSs had a jargon score suitable for general comprehension. Variables such as research topic, funding program, and project size significantly influenced readability and jargon scores. The biggest differences related to the original proposals: proposals with a PLS in their application that were in the 20\% most readable were almost 3 times more likely to have a more readable final PLS (incidence rate ratio 2.88, 95\% CI 1.86-4.45). Those with the 20\% least jargon in the original application were more than 10 times as likely to have low levels of jargon in the final PLS (incidence rate ratio 13.87, 95\% CI 5.17-37.2). There was no observable trend over time. Conclusions: Most of the PLSs published in the NIHR's National Journals Library have poor readability due to their complexity and use of jargon. None were readable at a level in keeping with the average reading age of the UK population. There were significant variations in readability and jargon scores depending on the research topic, funding program, and other factors. Notably, the readability of the original funding proposal seemed to significantly impact the final report's readability. Ways of improving the accessibility of PLSs are needed, as is greater clarity over who and what they are for. ", doi="10.2196/50862", url="https://www.jmir.org/2025/1/e50862" } @Article{info:doi/10.2196/58457, author="Wals Zurita, Jesus Amadeo and Miras del Rio, Hector and Ugarte Ruiz de Aguirre, Nerea and Nebrera Navarro, Cristina and Rubio Jimenez, Maria and Mu{\~n}oz Carmona, David and Miguez Sanchez, Carlos", title="The Transformative Potential of Large Language Models in Mining Electronic Health Records Data: Content Analysis", journal="JMIR Med Inform", year="2025", month="Jan", day="2", volume="13", pages="e58457", keywords="electronic health record", keywords="EHR", keywords="oncology", keywords="radiotherapy", keywords="data mining", keywords="ChatGPT", keywords="large language models", keywords="LLMs", abstract="Background: In this study, we evaluate the accuracy, efficiency, and cost-effectiveness of large language models in extracting and structuring information from free-text clinical reports, particularly in identifying and classifying patient comorbidities within oncology electronic health records. We specifically compare the performance of gpt-3.5-turbo-1106 and gpt-4-1106-preview models against that of specialized human evaluators. Objective: We specifically compare the performance of gpt-3.5-turbo-1106 and gpt-4-1106-preview models against that of specialized human evaluators. Methods: We implemented a script using the OpenAI application programming interface to extract structured information in JavaScript object notation format from comorbidities reported in 250 personal history reports. These reports were manually reviewed in batches of 50 by 5 specialists in radiation oncology. We compared the results using metrics such as sensitivity, specificity, precision, accuracy, F-value, $\kappa$ index, and the McNemar test, in addition to examining the common causes of errors in both humans and generative pretrained transformer (GPT) models. Results: The GPT-3.5 model exhibited slightly lower performance compared to physicians across all metrics, though the differences were not statistically significant (McNemar test, P=.79). GPT-4 demonstrated clear superiority in several key metrics (McNemar test, P<.001). Notably, it achieved a sensitivity of 96.8\%, compared to 88.2\% for GPT-3.5 and 88.8\% for physicians. However, physicians marginally outperformed GPT-4 in precision (97.7\% vs 96.8\%). GPT-4 showed greater consistency, replicating the exact same results in 76\% of the reports across 10 repeated analyses, compared to 59\% for GPT-3.5, indicating more stable and reliable performance. Physicians were more likely to miss explicit comorbidities, while the GPT models more frequently inferred nonexplicit comorbidities, sometimes correctly, though this also resulted in more false positives. Conclusions: This study demonstrates that, with well-designed prompts, the large language models examined can match or even surpass medical specialists in extracting information from complex clinical reports. Their superior efficiency in time and costs, along with easy integration with databases, makes them a valuable tool for large-scale data mining and real-world evidence generation. ", doi="10.2196/58457", url="https://medinform.jmir.org/2025/1/e58457" } @Article{info:doi/10.2196/64577, author="Lemieux, Mackenzie and Zhou, Cyrus and Cary, Caroline and Kelly, Jeannie", title="Changes in Reproductive Health Information-Seeking Behaviors After the Dobbs Decision: Systematic Search of the Wikimedia Database", journal="JMIR Infodemiology", year="2024", month="Dec", day="16", volume="4", pages="e64577", keywords="abortion", keywords="Dobbs", keywords="internet", keywords="viewer trends", keywords="Wikipedia", keywords="women's health", keywords="contraception", keywords="contraceptive", keywords="trend", keywords="information seeking", keywords="page view", keywords="reproductive", keywords="reproduction", abstract="Background: After the US Supreme Court overturned Roe v. Wade, confusion followed regarding the legality of abortion in different states across the country. Recent studies found increased Google searches for abortion-related terms in restricted states after the Dobbsv. Jackson Women's Health Organization decision was leaked. As patients and providers use Wikipedia (Wikimedia Foundation) as a predominant medical information source, we hypothesized that changes in reproductive health information-seeking behavior could be better understood by examining Wikipedia article traffic. Objective: This study aimed to examine trends in Wikipedia usage for abortion and contraception information before and after the Dobbs decision. Methods: Page views of abortion- and contraception-related Wikipedia pages were scraped. Temporal changes in page views before and after the Dobbs decision were then analyzed to explore changes in baseline views, differences in views for abortion-related information in states with restrictive abortion laws versus nonrestrictive states, and viewer trends on contraception-related pages. Results: Wikipedia articles related to abortion topics had significantly increased page views following the leaked and final Dobbs decision. There was a 103-fold increase in the page views for the Wikipedia article Roe v. Wade following the Dobbs decision leak (mean 372,654, SD 135,478 vs mean 3614, SD 248; P<.001) and a 67-fold increase in page views following the release of the final Dobbs decision (mean 8942, SD 402 vs mean 595,871, SD 178,649; P<.001). Articles about abortion in the most restrictive states had a greater increase in page views (mean 40.6, SD 12.7; 18/51, 35\% states) than articles about abortion in states with some restrictions or protections (mean 26.8, SD 7.3; 24/51, 47\% states; P<.001) and in the most protective states (mean 20.6, SD 5.7; 8/51, 16\% states; P<.001). Finally, views to pages about common contraceptive methods significantly increased after the Dobbs decision. ``Vasectomy'' page views increased by 183\% (P<.001), ``IUD'' (intrauterine device) page views increased by 80\% (P<.001), ``Combined oral contraceptive pill'' page views increased by 24\% (P<.001), ``Emergency Contraception'' page views increased by 224\% (P<.001), and ``Tubal ligation'' page views increased by 92\% (P<.001). Conclusions: People sought information on Wikipedia about abortion and contraception at increased rates after the Dobbs decision. Increased traffic to abortion-related Wikipedia articles correlated to the restrictiveness of state abortion policies. Increased interest in contraception-related pages reflects the increased demand for contraceptives observed after the Dobbs decision. Our work positions Wikipedia as an important source of reproductive health information and demands increased attention to maintain and improve Wikipedia as a reliable source of health information after the Dobbs decision. ", doi="10.2196/64577", url="https://infodemiology.jmir.org/2024/1/e64577" } @Article{info:doi/10.2196/63476, author="Ahn, Seong-Ho and Yim, Kwangil and Won, Hyun-Sik and Kim, Kang-Min and Jeong, Dong-Hwa", title="Discovering Time-Varying Public Interest for COVID-19 Case Prediction in South Korea Using Search Engine Queries: Infodemiology Study", journal="J Med Internet Res", year="2024", month="Dec", day="16", volume="26", pages="e63476", keywords="COVID-19", keywords="confirmed case prediction", keywords="search engine queries", keywords="query expansion", keywords="word embedding", keywords="public health", keywords="case prediction", keywords="South Korea", keywords="search engine", keywords="infodemiology", keywords="infodemiology study", keywords="policy", keywords="lifestyle", keywords="machine learning", keywords="machine learning techniques", keywords="utilization", keywords="temporal variation", keywords="novel framework", keywords="temporal", keywords="web-based search", keywords="temporal semantics", keywords="prediction model", keywords="model", abstract="Background: The number of confirmed COVID-19 cases is a crucial indicator of policies and lifestyles. Previous studies have attempted to forecast cases using machine learning techniques that use a previous number of case counts and search engine queries predetermined by experts. However, they have limitations in reflecting temporal variations in queries associated with pandemic dynamics. Objective: This study aims to propose a novel framework to extract keywords highly associated with COVID-19, considering their temporal occurrence. We aim to extract relevant keywords based on pandemic variations using query expansion. Additionally, we examine time-delayed web-based search behavior related to public interest in COVID-19 and adjust for better prediction performance. Methods: To capture temporal semantics regarding COVID-19, word embedding models were trained on a news corpus, and the top 100 words related to ``Corona'' were extracted over 4-month windows. Time-lagged cross-correlation was applied to select optimal time lags correlated to confirmed cases from the expanded queries. Subsequently, ElasticNet regression models were trained after reducing the feature dimensions using principal component analysis of the time-lagged features to predict future daily case counts. Results: Our approach successfully extracted relevant keywords depending on the pandemic phase, encompassing keywords directly related to COVID-19, such as its symptoms, and its societal impact. Specifically, during the first outbreak, keywords directly linked to COVID-19 and past infectious disease outbreaks similar to those of COVID-19 exhibited a high positive correlation. In the second phase of the pandemic, as community infections emerged, keywords related to the government's pandemic control policies were frequently observed with a high positive correlation. In the third phase of the pandemic, during the delta variant outbreak, keywords such as ``economic crisis'' and ``anxiety'' appeared, reflecting public fatigue. Consequently, prediction models trained by the extracted queries over 4-month windows outperformed previous methods for most predictions 1-14 days ahead. Notably, our approach showed significantly higher Pearson correlation coefficients than models based solely on the number of past cases for predictions 9-11 days ahead (P=.02, P<.01, and P<.01), in contrast to heuristic- and symptom-based query sets. Conclusions: This study proposes a novel COVID-19 case-prediction model that automatically extracts relevant queries over time using word embedding. The model outperformed previous methods that relied on static symptom-based or heuristic queries, even without prior expert knowledge. The results demonstrate the capability of our approach to track temporal shifts in public interest regarding changes in the pandemic. ", doi="10.2196/63476", url="https://www.jmir.org/2024/1/e63476" } @Article{info:doi/10.2196/58482, author="Ageng, Kidung and Inthiran, Anushia", title="Exploring Pregnancy-Related Information-Sharing Behavior Among First-Time Southeast Asian Fathers: Qualitative Semistructured Interview Study", journal="JMIR Pediatr Parent", year="2024", month="Dec", day="9", volume="7", pages="e58482", keywords="pregnancy", keywords="first-time fathers", keywords="information sharing", keywords="Southeast Asia", keywords="information-seeking behavior", keywords="cultural factors", abstract="Background: While the benefits of fathers' engagement in pregnancy are well researched, little is known about first-time expectant fathers' information-seeking practices in Southeast Asia regarding pregnancy. In addition, there is a notable gap in understanding their information-sharing behaviors during the pregnancy journey. This information is important, as cultural norms are prevalent in Southeast Asia, and this might influence their information-sharing behavior, particularly about pregnancy. Objective: This study aims to explore and analyze the pregnancy-related information-sharing behavior of first-time expectant fathers in Southeast Asia. This study specifically aims to investigate whether first-time fathers share pregnancy information, with whom they share it, through what means, and the reasons behind the decisions to share the information or not. Methods: We conducted semistructured interviews with first-time Southeast Asian fathers in Indonesia, a sample country in the Southeast Asian region. We analyzed the data using quantitative descriptive analysis and qualitative content theme analysis. A total of 40 first-time expectant fathers were interviewed. Results: The results revealed that 90\% (36/40) of the participants shared pregnancy-related information with others. However, within this group, more than half (22/40, 55\%) of the participants shared the information exclusively with their partners. Only a small proportion, 10\% (4/40), did not share any information at all. Among those who did share, the most popular approach was face-to-face communication (36/40, 90\%), followed by online messaging apps (26/40, 65\%). The most popular reason for sharing was to validate information (14/40, 35\%), while the most frequent reason for not sharing with anyone beyond their partner was because of the preference for asking for information rather than sharing (12/40, 30\%). Conclusions: This study provides valuable insights into the pregnancy-related information-sharing behaviors of first-time fathers in Southeast Asia. It enhances our understanding of how first-time fathers share pregnancy-related information and how local cultural norms and traditions influence these practices. In contrast to first-time fathers in high-income countries, the information-sharing behavior of first-time Southeast Asian fathers is defined by cultural nuances. Culture plays a crucial role in their daily decision-making processes. Therefore, this emphasizes the importance of cultural considerations in future discussions and the development of intervention programs related to pregnancy for first-time Southeast Asian fathers. In addition, this study sheds light on the interaction processes that first-time fathers engage in with others, highlighting areas where intervention programs may be necessary to improve their involvement during pregnancy. For example, first-time fathers actively exchange new information found with their partners; therefore, creating features or platforms that facilitate this process could improve their overall experience. Furthermore, health practitioners should take a more proactive approach in engaging with first-time fathers, as currently there is a communication gap between them. ", doi="10.2196/58482", url="https://pediatrics.jmir.org/2024/1/e58482" } @Article{info:doi/10.2196/54092, author="Na, Kilhoe and Zimdars, Melissa and Cullinan, E. Megan", title="Understanding Membership in Alternative Health Social Media Groups and Its Association with COVID-19 and Influenza Vaccination: Web-Based Cross-Sectional Survey", journal="JMIR Form Res", year="2024", month="Dec", day="5", volume="8", pages="e54092", keywords="alternative health", keywords="social media", keywords="misinformation", keywords="vaccination", keywords="COVID-19", keywords="Coronavirus", abstract="Background: Social media platforms have become home to numerous alternative health groups where people share health information and scientifically unproven treatments. Individuals share not only health information but also health misinformation in alternative health groups on social media. Yet, little research has been carried out to understand members of these groups. This study aims to better understand various characteristics of members in alternative health groups and the association between membership and attitudes toward vaccination and COVID-19 and influenza vaccination--related behaviors. Objective: This study aims to test hypotheses about different potential characteristics of members in alternative health groups and the association between membership and attitudes toward vaccination and vaccine-related behaviors. Methods: A web-based cross-sectional survey (N=1050) was conducted. Participants were recruited from 19 alternative health social media groups and Amazon's Mechanical Turk. A total of 596 participants were members of alternative health groups and 454 were nonmembers of alternative health groups. Logistic regressions were performed to test the hypotheses about the relationship between membership and the variables of interest. Results: Logistic regression revealed that there is a positive association between alternative health social media group membership and 3 personal characteristics: sharing trait (B=.83, SE=.11; P<.01; odds ratio [OR] 2.30, 95\% CI 1.85-2.86), fear of negative evaluations (B=.19, SE=.06; P<.001, OR 1.21, 95\% CI 1.06-1.37), and conspiratorial mentality (B=.33, SE=.08; P<.01; OR 1.40, 95\% CI 1.18-1.65). Also, the results indicate that there is a negative association between membership and 2 characteristics: health literacy (B=--1.09, SE=.17; P<.001; OR .33, 95\% CI 0.23-0.47) and attitudes toward vaccination (B=-- 2.33, SE=.09; P=.02; OR 0.79, 95\% CI 0.65-0.95). However, there is no association between membership and health consciousness (B=.12, SE=.10; P=.24; OR 1.13, 95\% CI 0.92-1.38). Finally, membership is negatively associated with COVID-19 vaccination status (B=--.84, SE=.17; P<.001; OR 48, 95\% CI 0.32-0.62), and influenza vaccination practice (B=--1.14, SE=.17; P<.001; OR .31, 95\% CI 0.22-0.45). Conclusions: Our findings indicate that people joining alternative health social media groups differ from nonmembers in different aspects, such as sharing, fear of negative evaluations, conspiratorial mentality, and health literacy. They also suggest that there is a significant relationship between membership and vaccination. By more thoroughly exploring the demographic, or by better understanding the people for whom interventions are designed, this study is expected to help researchers to more strategically and effectively develop and implement interventions. ", doi="10.2196/54092", url="https://formative.jmir.org/2024/1/e54092" } @Article{info:doi/10.2196/63281, author="Gaba, Ann and Bennett, Richard", title="Health-Related Messages About Herbs, Spices, and Other Botanicals Appearing in Print Issues and Websites of Legacy Media: Content Analysis and Evaluation", journal="JMIR Form Res", year="2024", month="Dec", day="4", volume="8", pages="e63281", keywords="legacy media", keywords="health applications", keywords="health communication", keywords="botanical products", keywords="content analysis", abstract="Background: Legacy media are publications that existed before the internet. Many of these have migrated to a web format, either replacing or in parallel to their print issues. Readers place an economic value on access to the information presented as they pay for subscriptions and place a higher degree of trust in their content. Much has been written about inaccurate and misleading health information in social media; however, the content and accuracy of information contained in legacy media has not been examined in detail. Discussion of herbs, spices, and other botanicals has been absent from this context. Objective: The objectives of this study were to (1) identify the health associations of botanical products mentioned in legacy media targeted to a range of demographic groups and (2) evaluate these health associations for accuracy against published scientific studies. Methods: In total, 10 popular magazines targeting a range of gender, race/ethnicity, and sexual orientation demographic groups were selected for analysis. Relevant content was extracted and coded over 1 year. Associations between specific botanical products and health factors were identified. For the most frequent botanical--health application associations, a PubMed search was conducted to identify reviews corresponding to each item's indicated applications. Where no systematic reviews were available, single research studies were sought. Results: A total of 237 unique botanical products were identified. There were 128 mentions of these in the print issues and 1215 on the websites. In total, 18 health applications were identified and used to categorize the indicated uses for the various products individually and as general categories. The most frequently mentioned applications were skin care, with 913 mentions, immunity enhancement, with 705 mentions, gastrointestinal health and probiotics, with 184 mentions, and cognitive function (stress and mental health), with 106 mentions. Comparison to published literature evaluating the efficacy of these functions identified positive support for aloe vera, argan oil, chamomile, jojoba oil, lavender, rosemary, and tea tree oil in skin care. Berries, ginger, turmeric, and green tea had the strongest evidence for a role in immunity enhancement. Ginger and oats were supported as having a role in gastrointestinal health. Finally, berries, lavender, ashwagandha, and cannabidiol were supported as having a role in managing stress. Other frequently mentioned items such as aloe vera, ashwagandha, or mushrooms for immunity were less strongly supported. Conclusions: Comparison of the most prevalent associations between botanical products and health applications to published literature indicates that, overall, these associations were consistent with current scientific reports about the health applications of botanical products. While some products had a greater degree of research support than others, truly egregious falsehoods were absent. Therefore, legacy media may be considered a credible source of information to readers about these topics. ", doi="10.2196/63281", url="https://formative.jmir.org/2024/1/e63281" } @Article{info:doi/10.2196/57718, author="Jagomast, Tobias and Finck, Jule and Tangemann-M{\"u}nstedt, Imke and Auth, Katharina and Dr{\"o}mann, Daniel and Franzen, F. Klaas", title="Google Trends Assessment of Keywords Related to Smoking and Smoking Cessation During the COVID-19 Pandemic in 4 European Countries: Retrospective Analysis", journal="Online J Public Health Inform", year="2024", month="Dec", day="3", volume="16", pages="e57718", keywords="internet", keywords="coronavirus", keywords="COVID-19", keywords="SARS-CoV-2", keywords="pandemics", keywords="public health", keywords="smoking cessation", keywords="tobacco products", keywords="Google Trends", keywords="relative search volume", keywords="Europe", keywords="online", keywords="search", keywords="smoking", keywords="addiction", keywords="quit", keywords="cessation", keywords="trend", keywords="cluster", keywords="public interest", keywords="lockdown", keywords="vaccination", keywords="spread", keywords="incidence", abstract="Background: Smoking is a modifiable risk factor for SARS-CoV-2 infection. Evidence of smoking behavior during the pandemic is ambiguous. Most investigations report an increase in smoking. In this context, Google Trends data monitor real-time public information--seeking behavior and are therefore useful to characterize smoking-related interest over the trajectory of the pandemic. Objective: This study aimed to use Google Trends data to evaluate the effect of the pandemic on public interest in smoking-related topics with a focus on lockdowns, vaccination campaigns, and incidence. Methods: The weekly relative search volume was retrieved from Google Trends for England, Germany, Italy, and Spain from December 31, 2017, to April 18, 2021. Data were collected for keywords concerning consumption, cessation, and treatment. The relative search volume before and during the pandemic was compared, and general trends were evaluated using the Wilcoxon rank-sum test. Short-term changes and hereby temporal clusters linked to lockdowns or vaccination campaigns were addressed by the flexible spatial scan statistics proposed by Takahashi and colleagues. Subsequently, the numbers of clusters after the onset of the pandemic were compared by chi-square test. Results: Country-wise minor differences were observed while 3 overarching trends prevailed. First, regarding cessation, the statistical comparison revealed a significant decline in interest for 58\% (7/12) of related keywords, and fewer clusters were present during the pandemic. Second, concerning consumption, significantly reduced relative search volume was observed for 58\% (7/12) of keywords, while treatment-related keywords exhibited heterogeneous trends. Third, substantial clusters of increased interest were sparsely linked to lockdowns, vaccination campaigns, or incidence. Conclusions: This study reports a substantial decline in overall relative search volume and clusters for cessation interest. These results underline the importance of intensifying cessation aid during times of crisis. Lockdowns, vaccination, and incidence had less impact on information-seeking behavior. Other public measures that positively affect smoking behavior remain to be determined. ", doi="10.2196/57718", url="https://ojphi.jmir.org/2024/1/e57718", url="http://www.ncbi.nlm.nih.gov/pubmed/39626237" } @Article{info:doi/10.2196/53440, author="Blank, Ann Carol and Biedka, Sarah and Montalmant, Abigail and Saft, Katelyn and Lape, Miranda and Mao, Kate and Bradt, Joke and Liou, T. Kevin", title="Scope, Findability, and Quality of Information About Music-Based Interventions in Oncology: Quantitative Content Analysis of Public-Facing Websites at National Cancer Institute--Designated Cancer Centers", journal="JMIR Cancer", year="2024", month="Nov", day="22", volume="10", pages="e53440", keywords="music-based interventions", keywords="cancer", keywords="oncology", keywords="symptom management", keywords="music therapy", keywords="music services", keywords="National Cancer Institute", abstract="Background: Music-based interventions (MBIs) are evidence-based, nonpharmacological treatments that include music therapy (MT) delivered by board-certified music therapists, as well as music services (MS) delivered by other health professionals and volunteers. Despite MBI's growing evidence base in cancer symptom management, it remains unclear how MBI-related information is presented to the public. Over 80\% of people with cancer use the internet to find health-related information. In the United States, the National Cancer Institute (NCI) identifies certain Cancer Centers (CCs) as NCI-designated CCs or Comprehensive Cancer Centers (CCCs) based on their excellence in research. As NCI-designated CCs and CCCs are considered the gold standard in cancer care, their websites are viewed by the public as important sources of information. Objective: We aimed to determine scope, findability, and quality of MBI-related information on public-facing websites of NCI-designated CCs/CCCs. Methods: We reviewed 64 NCI-designated CC/CCC websites (excluding basic laboratories) between November 2022 and January 2023. We extracted data on the scope of information: (1) type of MBI offered (MT or MS), (2) format (individual, group), (3) method of delivery (in person or remotely delivered), (4) setting (inpatient or outpatient), (5) target population (pediatric or adult), (6) MBI practitioner qualifications, (7) clinical indications or benefits, (8) presence of testimonials, (9) cost, and (10) scheduling or referral information. We also extracted data on findability (ie, presence of direct link or drop-down menu and the number of clicks to locate MBI-related information). Based on the scope and findability data, we rated the information quality as high, moderate, or low using an adapted scale informed by prior research. Results: Thirty-one (48\%) of the 64 CC/CCCs described MBIs on their websites. Of these, 6 (19\%) mentioned both MT and MS, 16 (52\%) mentioned MT only, and 9 (29\%) mentioned MS only. The most common format was hybrid, involving individuals and groups (n=20, 65\%). The most common delivery method was in person (n=16, 52\%). The most common target population was adults (n=12, 39\%). The most common MBI practitioners were board-certified music therapists (n=21, 68\%). The most described indications or benefits were psychological. Twenty-eight (90\%) websites lacked testimonials, and 26 (84\%) lacked cost information. Twenty-six (84\%) websites provided scheduling or referral information. MBI-related information was found with an average of 4 (SD 1) clicks. Nine (29\%) websites were of high quality, 18 (58\%) were moderate, and 4 (13\%) were low. Conclusions: Based on public websites, MBIs were most commonly delivered in person by board-certified music therapists to outpatient and inpatient adults, using individual and group formats to provide psychological benefits. The findability and quality of this information should be improved to promote the dissemination of MBIs for cancer symptom management. ", doi="10.2196/53440", url="https://cancer.jmir.org/2024/1/e53440" } @Article{info:doi/10.2196/49328, author="Zhou, Weiqiang and Liu, Dongliang and Yi, Zhaoxu and Lei, Yang and Zhang, Zhenming and Deng, Yu and Tan, Ying", title="Web-Based Platform for Systematic Reviews and Meta-Analyses of Traditional Chinese Medicine: Platform Development Study", journal="JMIR Form Res", year="2024", month="Nov", day="22", volume="8", pages="e49328", keywords="evidence-based medicine", keywords="information science", keywords="medical librarian", keywords="web development", keywords="web design", keywords="meta-analysis", keywords="traditional Chinese medicine", keywords="systematic review", keywords="review methodology", keywords="Chinese medicine", keywords="traditional medicine", abstract="Background: There are many problems associated with systematic reviews of traditional Chinese medicine (TCM), such as considering ``integrated traditional Chinese and Western medicine'' or treatment methods as intervention measures without considering the differences in drug use, disregarding dosage and courses of treatment, disregarding interindividual differences in control groups, etc. Classifying a large number of heterogeneous intervention measures into the same measure is easy but results in inaccurate results. In April 2023, Cochrane launched RevMan Web to digitalize systematic reviews and meta-analyses. We believe that this web-based working model helps solve the abovementioned problems. Objective: This study aims to (1) develop a web-based platform that is more suitable for systematic review and meta-analysis of TCM and (2) explore the characteristics and future development directions of this work through the testing of digital workflow. Methods: We developed TCMeta (Traditional Chinese Medicine Meta-analysis)---a platform focused on systematic reviews of TCM types. All systematic review--related work can be completed on the web, including creating topics, writing protocols, arranging personnel, obtaining literature, screening literature, inputting and analyzing data, and designing illustrations. The platform was developed using the latest internet technology and can be continuously modified and updated based on user feedback. When screening the literature on the platform, in addition to the traditional manual screening mode, the platform also creatively provides a query mode where users input keywords and click on Search to find literature with the same characteristics; this better reflects the objectivity of the screening with higher efficiency. Productivity can be improved by analyzing data and generating graphs digitally. Results: We used some test data in TCMeta to simulate data processing in a systematic review. In the literature screening stage, researchers could rapidly screen 19 sources of literature from among multiple sources with the manual screening mode. This traditional method could result in bias due to differences in the researchers' cognitive levels. The query mode is much more complex and involves inputting of data regarding drug compatibility, dosage, syndrome type, etc; different query methods can yield very different results, thus increasing the stringency of screening. We integrated data analysis tools on the platform and used third-party software to generate graphs. Conclusions: TCMeta has shown great potential in improving the quality of systematic reviews of TCM types in simulation tests. Several indicators show that this web-based mode of working is superior to the traditional way. Future research is required to focus on validating and refining the performance of TCMeta, emphasizing the ability to handle complex data. The system has good scalability and adaptability, and it has the potential to have a positive impact on the field of evidence-based medicine. ", doi="10.2196/49328", url="https://formative.jmir.org/2024/1/e49328" } @Article{info:doi/10.2196/60334, author="Tang, Jian and Huang, Zikun and Xu, Hongzhen and Zhang, Hao and Huang, Hailing and Tang, Minqiong and Luo, Pengsheng and Qin, Dong", title="Chinese Clinical Named Entity Recognition With Segmentation Synonym Sentence Synthesis Mechanism: Algorithm Development and Validation", journal="JMIR Med Inform", year="2024", month="Nov", day="21", volume="12", pages="e60334", keywords="clinical named entity recognition", keywords="word embedding", keywords="Chinese electronic medical records", keywords="RoBERTa", keywords="entity recognition", keywords="segmentation", keywords="natural language processing", keywords="AI", keywords="artificial intelligence", keywords="dataset", keywords="dataset augmentation", keywords="algorithm", keywords="entity", keywords="EMR", abstract="Background: Clinical named entity recognition (CNER) is a fundamental task in natural language processing used to extract named entities from electronic medical record texts. In recent years, with the continuous development of machine learning, deep learning models have replaced traditional machine learning and template-based methods, becoming widely applied in the CNER field. However, due to the complexity of clinical texts, the diversity and large quantity of named entity types, and the unclear boundaries between different entities, existing advanced methods rely to some extent on annotated databases and the scale of embedded dictionaries. Objective: This study aims to address the issues of data scarcity and labeling difficulties in CNER tasks by proposing a dataset augmentation algorithm based on proximity word calculation. Methods: We propose a Segmentation Synonym Sentence Synthesis (SSSS) algorithm based on neighboring vocabulary, which leverages existing public knowledge without the need for manual expansion of specialized domain dictionaries. Through lexical segmentation, the algorithm replaces new synonymous vocabulary by recombining from vast natural language data, achieving nearby expansion expressions of the dataset. We applied the SSSS algorithm to the Robustly Optimized Bidirectional Encoder Representations from Transformers Pretraining Approach (RoBERTa) + conditional random field (CRF) and RoBERTa + Bidirectional Long Short-Term Memory (BiLSTM) + CRF models and evaluated our models (SSSS + RoBERTa + CRF; SSSS + RoBERTa + BiLSTM + CRF) on the China Conference on Knowledge Graph and Semantic Computing (CCKS) 2017 and 2019 datasets. Results: Our experiments demonstrated that the models SSSS + RoBERTa + CRF and SSSS + RoBERTa + BiLSTM + CRF achieved F1-scores of 91.30\% and 91.35\% on the CCKS-2017 dataset, respectively. They also achieved F1-scores of 83.21\% and 83.01\% on the CCKS-2019 dataset, respectively. Conclusions: The experimental results indicated that our proposed method successfully expanded the dataset and remarkably improved the performance of the model, effectively addressing the challenges of data acquisition, annotation difficulties, and insufficient model generalization performance. ", doi="10.2196/60334", url="https://medinform.jmir.org/2024/1/e60334" } @Article{info:doi/10.2196/58088, author="Patel, Mohammed Ahmed and Baxter, Weston and Porat, Talya", title="Toward Guidelines for Designing Holistic Integrated Information Visualizations for Time-Critical Contexts: Systematic Review", journal="J Med Internet Res", year="2024", month="Nov", day="20", volume="26", pages="e58088", keywords="visualization", keywords="design", keywords="holistic", keywords="integrated", keywords="time-critical", keywords="guidelines", keywords="pre-attentive processing", keywords="gestalt theory", keywords="situation awareness", keywords="decision-making", keywords="mobile phone", abstract="Background: With the extensive volume of information from various and diverse data sources, it is essential to present information in a way that allows for quick understanding and interpretation. This is particularly crucial in health care, where timely insights into a patient's condition can be lifesaving. Holistic visualizations that integrate multiple data variables into a single visual representation can enhance rapid situational awareness and support informed decision-making. However, despite the existence of numerous guidelines for different types of visualizations, this study reveals that there are currently no specific guidelines or principles for designing holistic integrated information visualizations that enable quick processing and comprehensive understanding of multidimensional data in time-critical contexts. Addressing this gap is essential for enhancing decision-making in time-critical scenarios across various domains, particularly in health care. Objective: This study aims to establish a theoretical foundation supporting the argument that holistic integrated visualizations are a distinct type of visualization for time-critical contexts and identify applicable design principles and guidelines that can be used to design for such cases. Methods: We systematically searched the literature for peer-reviewed research on visualization strategies, guidelines, and taxonomies. The literature selection followed the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. The search was conducted across 6 databases: ACM Digital Library, Google Scholar, IEEE Xplore, PubMed, Scopus, and Web of Science. The search was conducted up to August 2024 using the terms (``visualisations'' OR ``visualizations'') AND (``guidelines'' OR ``taxonomy'' OR ``taxonomies''), with studies restricted to the English language. Results: Of 936 papers, 46 (4.9\%) were included in the final review. In total, 48\% (22/46) related to providing a holistic understanding and overview of multidimensional data; 28\% (13/46) focused on integrated presentation, that is, integrating or combining multidimensional data into a single visual representation; and 35\% (16/46) pertained to time and designing for rapid information processing. In total, 65\% (30/46) of the papers presented general information visualization or visual communication guidelines and principles. No specific guidelines or principles were found that addressed all the characteristics of holistic, integrated visualizations in time-critical contexts. A summary of the key guidelines and principles from the 46 papers was extracted, collated, and categorized into 60 guidelines that could aid in designing holistic integrated visualizations. These were grouped according to different characteristics identified in the systematic review (eg, gestalt principles, reduction, organization, abstraction, and task complexity) and further condensed into 5 main proposed guidelines. Conclusions: Holistic integrated information visualizations in time-critical domains are a unique use case requiring a unique set of design guidelines. Our proposed 5 main guidelines, derived from existing design theories and guidelines, can serve as a starting point to enable both holistic and rapid processing of information, facilitating better-informed decisions in time-critical contexts. ", doi="10.2196/58088", url="https://www.jmir.org/2024/1/e58088" } @Article{info:doi/10.2196/53781, author="Leblanc, Victor and Hamroun, Aghiles and Bentegeac, Rapha{\"e}l and Le Guellec, Bastien and Lenain, R{\'e}mi and Chazard, Emmanuel", title="Added Value of Medical Subject Headings Terms in Search Strategies of Systematic Reviews: Comparative Study", journal="J Med Internet Res", year="2024", month="Nov", day="19", volume="26", pages="e53781", keywords="Medical Subject Headings", keywords="MeSH", keywords="MeSH thesaurus", keywords="systematic review", keywords="PubMed", keywords="search strategy", keywords="comparative analysis", keywords="literature review", keywords="positive predictive value", keywords="PPV", keywords="review", keywords="scientific knowledge", keywords="medical knowledge", keywords="utility", keywords="systematic literature review", abstract="Background: The massive increase in the number of published scientific articles enhances knowledge but makes it more complicated to summarize results. The Medical Subject Headings (MeSH) thesaurus was created in the mid-20th century with the aim of systematizing article indexing and facilitating their retrieval. Despite the advent of search engines, few studies have questioned the relevance of the MeSH thesaurus, and none have done so systematically. Objective: The objective of this study was to estimate the added value of using MeSH terms in PubMed queries for systematic reviews (SRs). Methods: SRs published in 4 high-impact medical journals in general medicine over the past 10 years were selected. Only SRs for which a PubMed query was provided were included. Each query was transformed to obtain 3 versions: the original query (V1), the query with free-text terms only (V2), and the query with MeSH terms only (V3). These 3 queries were compared with each other based on their sensitivity and positive predictive values. Results: In total, 59 SRs were included. The suppression of MeSH terms had an impact on the number of relevant articles retrieved for 24 (41\%) out of 59 SRs. The median (IQR) sensitivities of queries V1 and V2 were 77.8\% (62.1\%-95.2\%) and 71.4\% (42.6\%-90\%), respectively. V1 queries provided an average of 2.62 additional relevant papers per SR compared with V2 queries. However, an additional 820.29 papers had to be screened. The cost of screening an additional collected paper was therefore 313.09, which was slightly more than triple the mean reading cost associated with V2 queries (88.67). Conclusions: Our results revealed that removing MeSH terms from a query decreases sensitivity while slightly increasing the positive predictive value. Queries containing both MeSH and free-text terms yielded more relevant articles but required screening many additional papers. Despite this additional workload, MeSH terms remain indispensable for SRs. ", doi="10.2196/53781", url="https://www.jmir.org/2024/1/e53781" } @Article{info:doi/10.2196/58041, author="Wang, Dingqiao and Liang, Jiangbo and Ye, Jinguo and Li, Jingni and Li, Jingpeng and Zhang, Qikai and Hu, Qiuling and Pan, Caineng and Wang, Dongliang and Liu, Zhong and Shi, Wen and Shi, Danli and Li, Fei and Qu, Bo and Zheng, Yingfeng", title="Enhancement of the Performance of Large Language Models in Diabetes Education through Retrieval-Augmented Generation: Comparative Study", journal="J Med Internet Res", year="2024", month="Nov", day="8", volume="26", pages="e58041", keywords="large language models", keywords="LLMs", keywords="retrieval-augmented generation", keywords="RAG", keywords="GPT-4.0", keywords="Claude-2", keywords="Google Bard", keywords="diabetes education", abstract="Background: Large language models (LLMs) demonstrated advanced performance in processing clinical information. However, commercially available LLMs lack specialized medical knowledge and remain susceptible to generating inaccurate information. Given the need for self-management in diabetes, patients commonly seek information online. We introduce the Retrieval-augmented Information System for Enhancement (RISE) framework and evaluate its performance in enhancing LLMs to provide accurate responses to diabetes-related inquiries. Objective: This study aimed to evaluate the potential of the RISE framework, an information retrieval and augmentation tool, to improve the LLM's performance to accurately and safely respond to diabetes-related inquiries. Methods: The RISE, an innovative retrieval augmentation framework, comprises 4 steps: rewriting query, information retrieval, summarization, and execution. Using a set of 43 common diabetes-related questions, we evaluated 3 base LLMs (GPT-4, Anthropic Claude 2, Google Bard) and their RISE-enhanced versions respectively. Assessments were conducted by clinicians for accuracy and comprehensiveness and by patients for understandability. Results: The integration of RISE significantly improved the accuracy and comprehensiveness of responses from all 3 base LLMs. On average, the percentage of accurate responses increased by 12\% (15/129) with RISE. Specifically, the rates of accurate responses increased by 7\% (3/43) for GPT-4, 19\% (8/43) for Claude 2, and 9\% (4/43) for Google Bard. The framework also enhanced response comprehensiveness, with mean scores improving by 0.44 (SD 0.10). Understandability was also enhanced by 0.19 (SD 0.13) on average. Data collection was conducted from September 30, 2023 to February 5, 2024. Conclusions: The RISE significantly improves LLMs' performance in responding to diabetes-related inquiries, enhancing accuracy, comprehensiveness, and understandability. These improvements have crucial implications for RISE's future role in patient education and chronic illness self-management, which contributes to relieving medical resource pressures and raising public awareness of medical knowledge. ", doi="10.2196/58041", url="https://www.jmir.org/2024/1/e58041" } @Article{info:doi/10.2196/64221, author="Nachman, Sophie and Ortiz-Prado, Esteban and Tucker, D. Joseph", title="Video Abstracts in Research", journal="J Med Internet Res", year="2024", month="Nov", day="4", volume="26", pages="e64221", keywords="video abstract", keywords="abstract", keywords="dissemination", keywords="public engagement", keywords="online", keywords="videos", keywords="public audience", keywords="communication", keywords="infographics", keywords="health literacy", keywords="patient education", keywords="public health", doi="10.2196/64221", url="https://www.jmir.org/2024/1/e64221" } @Article{info:doi/10.2196/51095, author="Zeng, Jiaqi and Zou, Xiaoyi and Li, Shirong and Tang, Yao and Teng, Sisi and Li, Huanhuan and Wang, Changyu and Wu, Yuxuan and Zhang, Luyao and Zhong, Yunheng and Liu, Jialin and Liu, Siru", title="Assessing the Role of the Generative Pretrained Transformer (GPT) in Alzheimer's Disease Management: Comparative Study of Neurologist- and Artificial Intelligence--Generated Responses", journal="J Med Internet Res", year="2024", month="Oct", day="31", volume="26", pages="e51095", keywords="Alzheimer's disease", keywords="artificial intelligence", keywords="AI", keywords="large language model", keywords="LLM", keywords="Generative Pretrained Transformer", keywords="GPT", keywords="ChatGPT", keywords="patient information", abstract="Background: Alzheimer's disease (AD) is a progressive neurodegenerative disorder posing challenges to patients, caregivers, and society. Accessible and accurate information is crucial for effective AD management. Objective: This study aimed to evaluate the accuracy, comprehensibility, clarity, and usefulness of the Generative Pretrained Transformer's (GPT) answers concerning the management and caregiving of patients with AD. Methods: In total, 14 questions related to the prevention, treatment, and care of AD were identified and posed to GPT-3.5 and GPT-4 in Chinese and English, respectively, and 4 respondent neurologists were asked to answer them. We generated 8 sets of responses (total 112) and randomly coded them in answer sheets. Next, 5 evaluator neurologists and 5 family members of patients were asked to rate the 112 responses using separate 5-point Likert scales. We evaluated the quality of the responses using a set of 8 questions rated on a 5-point Likert scale. To gauge comprehensibility and participant satisfaction, we included 3 questions dedicated to each aspect within the same set of 8 questions. Results: As of April 10, 2023, the 5 evaluator neurologists and 5 family members of patients with AD rated the 112 responses: GPT-3.5: n=28, 25\%, responses; GPT-4: n=28, 25\%, responses; respondent neurologists: 56 (50\%) responses. The top 5 (4.5\%) responses rated by evaluator neurologists had 4 (80\%) GPT (GPT-3.5+GPT-4) responses and 1 (20\%) respondent neurologist's response. For the top 5 (4.5\%) responses rated by patients' family members, all but the third response were GPT responses. Based on the evaluation by neurologists, the neurologist-generated responses achieved a mean score of 3.9 (SD 0.7), while the GPT-generated responses scored significantly higher (mean 4.4, SD 0.6; P<.001). Language and model analyses revealed no significant differences in response quality between the GPT-3.5 and GPT-4 models (GPT-3.5: mean 4.3, SD 0.7; GPT-4: mean 4.4, SD 0.5; P=.51). However, English responses outperformed Chinese responses in terms of comprehensibility (Chinese responses: mean 4.1, SD 0.7; English responses: mean 4.6, SD 0.5; P=.005) and participant satisfaction (Chinese responses: mean 4.2, SD 0.8; English responses: mean 4.5, SD 0.5; P=.04). According to the evaluator neurologists' review, Chinese responses had a mean score of 4.4 (SD 0.6), whereas English responses had a mean score of 4.5 (SD 0.5; P=.002). As for the family members of patients with AD, no significant differences were observed between GPT and neurologists, GPT-3.5 and GPT-4, or Chinese and English responses. Conclusions: GPT can provide patient education materials on AD for patients, their families and caregivers, nurses, and neurologists. This capability can contribute to the effective health care management of patients with AD, leading to enhanced patient outcomes. ", doi="10.2196/51095", url="https://www.jmir.org/2024/1/e51095" } @Article{info:doi/10.2196/60939, author="Joshi, Saubhagya and Ha, Eunbin and Amaya, Andee and Mendoza, Melissa and Rivera, Yonaira and Singh, K. Vivek", title="Ensuring Accuracy and Equity in Vaccination Information From ChatGPT and CDC: Mixed-Methods Cross-Language Evaluation", journal="JMIR Form Res", year="2024", month="Oct", day="30", volume="8", pages="e60939", keywords="vaccination", keywords="health equity", keywords="multilingualism", keywords="language equity", keywords="health literacy", keywords="online health information", keywords="conversational agents", keywords="artificial intelligence", keywords="large language models", keywords="health information", keywords="public health", abstract="Background: In the digital age, large language models (LLMs) like ChatGPT have emerged as important sources of health care information. Their interactive capabilities offer promise for enhancing health access, particularly for groups facing traditional barriers such as insurance and language constraints. Despite their growing public health use, with millions of medical queries processed weekly, the quality of LLM-provided information remains inconsistent. Previous studies have predominantly assessed ChatGPT's English responses, overlooking the needs of non--English speakers in the United States. This study addresses this gap by evaluating the quality and linguistic parity of vaccination information from ChatGPT and the Centers for Disease Control and Prevention (CDC), emphasizing health equity. Objective: This study aims to assess the quality and language equity of vaccination information provided by ChatGPT and the CDC in English and Spanish. It highlights the critical need for cross-language evaluation to ensure equitable health information access for all linguistic groups. Methods: We conducted a comparative analysis of ChatGPT's and CDC's responses to frequently asked vaccination-related questions in both languages. The evaluation encompassed quantitative and qualitative assessments of accuracy, readability, and understandability. Accuracy was gauged by the perceived level of misinformation; readability, by the Flesch-Kincaid grade level and readability score; and understandability, by items from the National Institutes of Health's Patient?Education?Materials Assessment Tool (PEMAT) instrument. Results: The study found that both ChatGPT and CDC provided mostly accurate and understandable (eg, scores over 95 out of 100) responses. However, Flesch-Kincaid grade levels often exceeded the American Medical Association's recommended levels, particularly in English (eg, average grade level in English for ChatGPT=12.84, Spanish=7.93, recommended=6). CDC responses outperformed ChatGPT in readability across both languages. Notably, some Spanish responses appeared to be direct translations from English, leading to unnatural phrasing. The findings underscore the potential and challenges of using ChatGPT for health care access. Conclusions: ChatGPT holds potential as a health information resource but requires improvements in readability and linguistic equity to be truly effective for diverse populations. Crucially, the default user experience with ChatGPT, typically encountered by those without advanced language and prompting skills, can significantly shape health perceptions. This is vital from a public health standpoint, as the majority of users will interact with LLMs in their most accessible form. Ensuring that default responses are accurate, understandable, and equitable is imperative for fostering informed health decisions across diverse communities. ", doi="10.2196/60939", url="https://formative.jmir.org/2024/1/e60939" } @Article{info:doi/10.2196/53636, author="Bardhan, Jayetri and Roberts, Kirk and Wang, Zhe Daisy", title="Question Answering for Electronic Health Records: Scoping Review of Datasets and Models", journal="J Med Internet Res", year="2024", month="Oct", day="30", volume="26", pages="e53636", keywords="medical question answering", keywords="electronic health record", keywords="EHR", keywords="electronic medical records", keywords="EMR", keywords="relational database", keywords="knowledge graph", abstract="Background: Question answering (QA) systems for patient-related data can assist both clinicians and patients. They can, for example, assist clinicians in decision-making and enable patients to have a better understanding of their medical history. Substantial amounts of patient data are stored in electronic health records (EHRs), making EHR QA an important research area. Because of the differences in data format and modality, this differs greatly from other medical QA tasks that use medical websites or scientific papers to retrieve answers, making it critical to research EHR QA. Objective: This study aims to provide a methodological review of existing works on QA for EHRs. The objectives of this study were to identify the existing EHR QA datasets and analyze them, study the state-of-the-art methodologies used in this task, compare the different evaluation metrics used by these state-of-the-art models, and finally elicit the various challenges and the ongoing issues in EHR QA. Methods: We searched for articles from January 1, 2005, to September 30, 2023, in 4 digital sources, including Google Scholar, ACL Anthology, ACM Digital Library, and PubMed, to collect relevant publications on EHR QA. Our systematic screening process followed PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. A total of 4111 papers were identified for our study, and after screening based on our inclusion criteria, we obtained 47 papers for further study. The selected studies were then classified into 2 non--mutually exclusive categories depending on their scope: ``EHR QA datasets'' and ``EHR QA models.'' Results: A systematic screening process obtained 47 papers on EHR QA for final review. Out of the 47 papers, 53\% (n=25) were about EHR QA datasets, and 79\% (n=37) papers were about EHR QA models. It was observed that QA on EHRs is relatively new and unexplored. Most of the works are fairly recent. In addition, it was observed that emrQA is by far the most popular EHR QA dataset, both in terms of citations and usage in other papers. We have classified the EHR QA datasets based on their modality, and we have inferred that Medical Information Mart for Intensive Care (MIMIC-III) and the National Natural Language Processing Clinical Challenges datasets (ie, n2c2 datasets) are the most popular EHR databases and corpuses used in EHR QA. Furthermore, we identified the different models used in EHR QA along with the evaluation metrics used for these models. Conclusions: EHR QA research faces multiple challenges, such as the limited availability of clinical annotations, concept normalization in EHR QA, and challenges faced in generating realistic EHR QA datasets. There are still many gaps in research that motivate further work. This study will assist future researchers in focusing on areas of EHR QA that have possible future research directions. ", doi="10.2196/53636", url="https://www.jmir.org/2024/1/e53636" } @Article{info:doi/10.2196/53938, author="Kim, Minji and Vassey, Julia and Li, Dongmei and Galimov, Artur and Han, Eileen and Kirkpatrick, G. Matthew and Stanton, A. Cassandra and Ozga, E. Jenny and Lee, Sarah and Unger, B. Jennifer", title="Discussion of Heated Tobacco Products on Twitter Following IQOS's Modified-Risk Tobacco Product Authorization and US Import Ban: Content Analysis", journal="J Med Internet Res", year="2024", month="Oct", day="24", volume="26", pages="e53938", keywords="heated tobacco products", keywords="IQOS", keywords="social media", keywords="Twitter", keywords="tobacco control", keywords="modified-risk tobacco product authorization", keywords="MRTP authorization", keywords="tobacco regulatory science", keywords="import ban", keywords="observational study", keywords="public opinion", keywords="content analysis", abstract="Background: Understanding public opinions about emerging tobacco products is important to inform future interventions and regulatory decisions. Heated tobacco products (HTPs) are an emerging tobacco product category promoted by the tobacco industry as a ``better alternative'' to combustible cigarettes. Philip Morris International's IQOS is leading the global HTP market and recently has been subject to important policy events, including the US Food and Drug Administration's (FDA) modified-risk tobacco product (MRTP) authorization (July 2020) and the US import ban (November 2021). Although limited in their legal implications outside the United States, these policy events have been quoted in global news outlets and Philip Morris International's promotional communications, showing how they may potentially impact global tobacco regulation. Given the impending return of IQOS to the US market, understanding how the policy events were received through social media discourse will provide valuable insights to inform global tobacco control policy. Objective: This study aims to examine HTP-related social media discourse around important policy events. Methods: We analyzed HTP-related posts on Twitter during the time period that included IQOS's MRTP authorization in the United States and the US import ban, examining personal testimonial, news/information, and direct marketing/retail tweets separately. We also examined how the tweets discussed health and policy. A total of 10,454 public English tweets (posted from June 2020 to December 2021) were collected using HTP-related keywords. We randomly sampled 2796 (26.7\%) tweets and conducted a content analysis. We used pairwise co-occurrence analyses to evaluate connections across themes. Results: Tweet volumes peaked around IQOS-related policy events. Among all tweets, personal testimonials were the most common (1613/2796, 57.7\%), followed by news/information (862/2796, 30.8\%) and direct marketing/retail (321/2796, 11\%). Among personal testimonials, more tweets were positive (495/1613, 30.7\%) than negative (372/1613, 23.1\%), often comparing the health risks of HTPs with cigarettes (402/1613, 24.9\%) or vaping products (252/1613, 15.6\%). Approximately 10\% (31/321) of the direct marketing/retail tweets promoted international delivery, suggesting cross-border promotion. More than a quarter of tweets (809/2796, 28.9\%) discussed US and global policy, including misinterpretation about IQOS being a ``safer'' tobacco product after the US FDA's MRTP authorization. Neutral testimonials mentioning the IQOS brand (634/1613, 39.3\%) and discussing policy (378/1613, 23.4\%) showed the largest pairwise co-occurrence. Conclusions: Results suggest the need for careful communication about the meaning of MRTP authorizations and relative risks of tobacco products. Many tweets expressed HTP-favorable opinions referring to reduced health risks, even though the US FDA has denied marketing of the HTP with reduced risk claims. The popularity of social media as an information source with global reach poses unique challenges in health communication and health policies. While many countries restrict tobacco marketing via the web, our results suggest that retailers may circumvent such regulations by operating overseas. ", doi="10.2196/53938", url="https://www.jmir.org/2024/1/e53938", url="http://www.ncbi.nlm.nih.gov/pubmed/39446431" } @Article{info:doi/10.2196/54653, author="Manion, J. Frank and Du, Jingcheng and Wang, Dong and He, Long and Lin, Bin and Wang, Jingqi and Wang, Siwei and Eckels, David and Cervenka, Jan and Fiduccia, C. Peter and Cossrow, Nicole and Yao, Lixia", title="Accelerating Evidence Synthesis in Observational Studies: Development of a Living Natural Language Processing--Assisted Intelligent Systematic Literature Review System", journal="JMIR Med Inform", year="2024", month="Oct", day="23", volume="12", pages="e54653", keywords="machine learning", keywords="deep learning", keywords="natural language processing", keywords="systematic literature review", keywords="artificial intelligence", keywords="software development", keywords="data extraction", keywords="epidemiology", abstract="Background: Systematic literature review (SLR), a robust method to identify and summarize evidence from published sources, is considered to be a complex, time-consuming, labor-intensive, and expensive task. Objective: This study aimed to present a solution based on natural language processing (NLP) that accelerates and streamlines the SLR process for observational studies using real-world data. Methods: We followed an agile software development and iterative software engineering methodology to build a customized intelligent end-to-end living NLP-assisted solution for observational SLR tasks. Multiple machine learning--based NLP algorithms were adopted to automate article screening and data element extraction processes. The NLP prediction results can be further reviewed and verified by domain experts, following the human-in-the-loop design. The system integrates explainable articificial intelligence to provide evidence for NLP algorithms and add transparency to extracted literature data elements. The system was developed based on 3 existing SLR projects of observational studies, including the epidemiology studies of human papillomavirus--associated diseases, the disease burden of pneumococcal diseases, and cost-effectiveness studies on pneumococcal vaccines. Results: Our Intelligent SLR Platform covers major SLR steps, including study protocol setting, literature retrieval, abstract screening, full-text screening, data element extraction from full-text articles, results summary, and data visualization. The NLP algorithms achieved accuracy scores of 0.86-0.90 on article screening tasks (framed as text classification tasks) and macroaverage F1 scores of 0.57-0.89 on data element extraction tasks (framed as named entity recognition tasks). Conclusions: Cutting-edge NLP algorithms expedite SLR for observational studies, thus allowing scientists to have more time to focus on the quality of data and the synthesis of evidence in observational studies. Aligning the living SLR concept, the system has the potential to update literature data and enable scientists to easily stay current with the literature related to observational studies prospectively and continuously. ", doi="10.2196/54653", url="https://medinform.jmir.org/2024/1/e54653" } @Article{info:doi/10.2196/50099, author="Acero, Nicole and Herrero, Emma and Foncham, Juanita and McIlvaine, Jamie and Kayaalp, Emre and Figueora, Melissa and Oladipo, Francis Antonia", title="Accuracy, Quality, and Misinformation of YouTube Abortion Procedural Videos: Cross-Sectional Study", journal="J Med Internet Res", year="2024", month="Oct", day="22", volume="26", pages="e50099", keywords="abortion", keywords="YouTube", keywords="social media", keywords="accuracy", keywords="quality", keywords="misinformation", keywords="reliability", keywords="obstetrics", keywords="women's health", keywords="reproductive", keywords="patient education", keywords="health information", keywords="prochoice", abstract="Background: The internet is often the first source patients turn to for medical information. YouTube is a commonly used internet-based resource for patients seeking to learn about medical procedures, including their risks, benefits, and safety profile. Abortion is a common yet polarizing medical procedure. People interested in obtaining an abortion are likely to use the internet to learn more about abortion procedures and may encounter misinformed and biased information. This is troubling as information found on the internet can significantly alter perceptions and understanding of these procedures. There is no current research that evaluates the accuracy, quality, and misinformation of instructional abortion videos available to patients. Objective: The purpose of this study was to assess if any given video can deliver accurate and quality information about this topic in an unbiased manner and to assess the level of factually incorrect, distorted, or medically irrelevant information in any given video. Methods: Procedural methods of abortion were queried on YouTube on August 22, 2022. The videos were screened with strict exclusion criteria. Videos were categorized into ``video slants'' based on the language and attitudes expressed in each video. Video accuracy was calculated using the Surgical Curriculum in Obstetrics and Gynecology (SCOG) checklist for each corresponding procedure. Video quality was calculated using the Laparoscopic Surgery Video Educational Guidelines (LAP-VEGaS) criteria. The level of misinformation was assessed with the evidence-based Anti-Choice Rubric, which scores the amount of factually incorrect, distorted, or medically irrelevant information in each video. Results: A total of 32 videos were analyzed and categorized into 3 ``video slant'' groups: neutral (n=23, 72\%), antichoice (n=4, 12\%), and prochoice (n=5, 16\%). Using the SCOG checklist, neutral videos had the highest median accuracy (45.9\%), followed by antichoice videos (24.6\%) and prochoice videos (18.5\%). None of the videos met the LAP-VEGaS quality control criteria, (score>11, indicating adequate quality). Neutral videos had a median score of 8.8 out of 18, with antichoice videos scoring 10.75 and prochoice videos scoring 6.2. Using the Anti-Choice Rubric, neutral videos mentioned only 1 factually incorrect piece of information. Antichoice videos mentioned 12 factually incorrect pieces of information, 8 distortions, and 3 medically irrelevant pieces of information. Prochoice videos did not mention any of the 3 themes. Conclusions: Using the SCOG checklist, the accuracy of instructional videos were inconsistent across the 3 identified ``video slants.'' Using LAP-VEGaS criteria, the quality of educational videos were also inconsistent across the 3 ``video slants.'' Prochoice videos had the lowest level of misinformation, with no mentions of any of the 3 themes. Antichoice videos had the highest levels of misinformation, with mentions in all 3 themes. Health care professionals should consider this when counseling patients who may watch YouTube videos for information regarding abortion procedures. ", doi="10.2196/50099", url="https://www.jmir.org/2024/1/e50099" } @Article{info:doi/10.2196/60164, author="Nunes, Miguel and Bone, Joao and Ferreira, C. Joao and Elvas, B. Luis", title="Health Care Language Models and Their Fine-Tuning for Information Extraction: Scoping Review", journal="JMIR Med Inform", year="2024", month="Oct", day="21", volume="12", pages="e60164", keywords="language model", keywords="information extraction", keywords="healthcare", keywords="PRISMA-ScR", keywords="scoping literature review", keywords="transformers", keywords="natural language processing", keywords="European Portuguese", abstract="Background: In response to the intricate language, specialized terminology outside everyday life, and the frequent presence of abbreviations and acronyms inherent in health care text data, domain adaptation techniques have emerged as crucial to transformer-based models. This refinement in the knowledge of the language models (LMs) allows for a better understanding of the medical textual data, which results in an improvement in medical downstream tasks, such as information extraction (IE). We have identified a gap in the literature regarding health care LMs. Therefore, this study presents a scoping literature review investigating domain adaptation methods for transformers in health care, differentiating between English and non-English languages, focusing on Portuguese. Most specifically, we investigated the development of health care LMs, with the aim of comparing Portuguese with other more developed languages to guide the path of a non--English-language with fewer resources. Objective: This study aimed to research health care IE models, regardless of language, to understand the efficacy of transformers and what are the medical entities most commonly extracted. Methods: This scoping review was conducted using the PRISMA-ScR (Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews) methodology on Scopus and Web of Science Core Collection databases. Only studies that mentioned the creation of health care LMs or health care IE models were included, while large language models (LLMs) were excluded. The latest were not included since we wanted to research LMs and not LLMs, which are architecturally different and have distinct purposes. Results: Our search query retrieved 137 studies, 60 of which met the inclusion criteria, and none of them were systematic literature reviews. English and Chinese are the languages with the most health care LMs developed. These languages already have disease-specific LMs, while others only have general--health care LMs. European Portuguese does not have any public health care LM and should take examples from other languages to develop, first, general-health care LMs and then, in an advanced phase, disease-specific LMs. Regarding IE models, transformers were the most commonly used method, and named entity recognition was the most popular topic, with only a few studies mentioning Assertion Status or addressing medical lexical problems. The most extracted entities were diagnosis, posology, and symptoms. Conclusions: The findings indicate that domain adaptation is beneficial, achieving better results in downstream tasks. Our analysis allowed us to understand that the use of transformers is more developed for the English and Chinese languages. European Portuguese lacks relevant studies and should draw examples from other non-English languages to develop these models and drive progress in AI. Health care professionals could benefit from highlighting medically relevant information and optimizing the reading of the textual data, or this information could be used to create patient medical timelines, allowing for profiling. ", doi="10.2196/60164", url="https://medinform.jmir.org/2024/1/e60164", url="http://www.ncbi.nlm.nih.gov/pubmed/39432345" } @Article{info:doi/10.2196/48154, author="Van Oirschot, Garett and Pomphrey, Amanda and Dunne, Caoimhe and Murphy, Kate and Blood, Karina and Doherty, Cailbhe", title="An Evaluation of the Design of Multimedia Patient Education Materials in Musculoskeletal Health Care: Systematic Review", journal="JMIR Rehabil Assist Technol", year="2024", month="Oct", day="15", volume="11", pages="e48154", keywords="health education", keywords="patient education", keywords="patient education materials", keywords="multimedia", keywords="musculoskeletal diseases", keywords="musculoskeletal pain", keywords="eHealth", keywords="self-management", abstract="Background: Educational multimedia is a cost-effective and straightforward way to administer large-scale information interventions to patient populations in musculoskeletal health care. While an abundance of health research informs the content of these interventions, less guidance exists about optimizing their design. Objective: This study aims to identify randomized controlled trials of patient populations with musculoskeletal conditions that used multimedia-based patient educational materials (PEMs) and examine how design was reported and impacted patients' knowledge and rehabilitation outcomes. Design was evaluated using principles from the cognitive theory of multimedia learning (CTML). Methods: PubMed, CINAHL, PsycINFO, and Embase were searched from inception to September 2023 for studies examining adult patients with musculoskeletal conditions receiving multimedia PEMs compared to any other interventions. The primary outcome was knowledge retention measured via test scores. Secondary outcomes were any patient-reported measures. Retrievability was noted, and PEMs were sourced through search, purchase, and author communication. Results: A total of 160 randomized controlled trials were eligible for inclusion: 13 (8.1\%) included their educational materials and 31 (19.4\%) required a web search, purchase, or direct requests for educational materials. Of these 44 (27.5\%) studies, none fully optimized the design of their educational materials, particularly lacking in the CTML principles of coherence, redundancy, modality, and generative activities for the learner. Of the 160 studies, the remaining 116 (72.5\%) contained interventions that could not be retrieved or appraised. Learning was evaluated in 5 (3.1\%) studies. Conclusions: Musculoskeletal studies should use open science principles and provide their PEMs wherever possible. The link between providing multimedia PEMs and patient learning is largely unexamined, but engagement potential may be maximized when considering design principles such as the CTML. ", doi="10.2196/48154", url="https://rehab.jmir.org/2024/1/e48154", url="http://www.ncbi.nlm.nih.gov/pubmed/39162239" } @Article{info:doi/10.2196/60695, author="Hung, W. Tony K. and Kuperman, J. Gilad and Sherman, J. Eric and Ho, L. Alan and Weng, Chunhua and Pfister, G. David and Mao, J. Jun", title="Performance of Retrieval-Augmented Large Language Models to Recommend Head and Neck Cancer Clinical Trials", journal="J Med Internet Res", year="2024", month="Oct", day="15", volume="26", pages="e60695", keywords="large language model", keywords="LLM", keywords="ChatGPT", keywords="GPT-4", keywords="artificial intelligence", keywords="AI", keywords="clinical trials", keywords="decision support", keywords="LookUpTrials", keywords="cancer care delivery", keywords="head and neck oncology", keywords="head and neck cancer", keywords="retrieval augmented generation", doi="10.2196/60695", url="https://www.jmir.org/2024/1/e60695" } @Article{info:doi/10.2196/56354, author="Zenone, Marco and van Schalkwyk, May and Hartwell, Greg and Caulfield, Timothy and Maani, Nason", title="Selling Misleading ``Cancer Cure'' Books on Amazon: Systematic Search on Amazon.com and Thematic Analysis", journal="J Med Internet Res", year="2024", month="Oct", day="8", volume="26", pages="e56354", keywords="cancer", keywords="Amazon", keywords="misinformation", keywords="e-commerce", keywords="cancer cure", keywords="cancer misinformation", keywords="misleading", keywords="cancer information", keywords="treatment", keywords="cancer treatment", keywords="thematic analysis", keywords="online information", abstract="Background: While the evidence base on web-based cancer misinformation continues to develop, relatively little is known about the extent of such information on the world's largest e-commerce website, Amazon. Multiple media reports indicate that Amazon may host on its platform questionable cancer-related products for sale, such as books on purported cancer cures. This context suggests an urgent need to evaluate Amazon.com for cancer misinformation. Objective: This study sought to (1) examine to what extent are misleading cancer cure books for sale on Amazon.com and (2) determine how cancer cure books on Amazon.com provide misleading cancer information. Methods: We searched ``cancer cure'' on Amazon.com and retrieved the top 1000 English-language book search results. We reviewed the books' descriptions and titles to determine whether the books provided misleading cancer cure or treatment information. We considered a book to be misleading if it suggested scientifically unsupported cancer treatment approaches to cure or meaningfully treat cancer. Among books coded as misleading, we conducted an inductive latent thematic analysis to determine the informational value the books sought to offer. Results: Nearly half (494/1000, 49.4\%) of the sampled ``cancer cure'' books for sale on Amazon.com appeared to contain misleading cancer treatment and cure information. Overall, 17 (51.5\%) out of 33 Amazon.com results pages had 50\% or more of the books coded as misleading. The first search result page had the highest percentage of misleading books (23/33, 69.7\%). Misleading books (n=494) contained eight themes: (1) claims of efficacious cancer cure strategies (n=451, 91.3\%), (2) oversimplifying cancer and cancer treatment (n=194, 39.3\%), (3) falsely justifying ineffective treatments as science based (n=189, 38.3\%), (4) discrediting conventional cancer treatments (n=169, 34.2\%), (5) finding the true cause of cancer (n=133, 26.9\%), (6) homogenizing cancer (n=132, 26.7\%), (7) discovery of new cancer treatments (n=119, 24.1\%), and (8) cancer cure suppression (n=82, 16.6\%). Conclusions: The results demonstrate that misleading cancer cure books are for sale, visible, and prevalent on Amazon.com, with prominence in initial search hits. These misleading books for sale on Amazon can be conceived of as forming part of a wider, cross-platform, web-based information environment in which misleading cancer cures are often given prominence. Our results suggest that greater enforcement is needed from Amazon and that cancer-focused organizations should engage in preemptive misinformation debunking. ", doi="10.2196/56354", url="https://www.jmir.org/2024/1/e56354" } @Article{info:doi/10.2196/58831, author="Armbruster, Jonas and Bussmann, Florian and Rothhaas, Catharina and Titze, Nadine and Gr{\"u}tzner, Alfred Paul and Freischmidt, Holger", title="``Doctor ChatGPT, Can You Help Me?'' The Patient's Perspective: Cross-Sectional Study", journal="J Med Internet Res", year="2024", month="Oct", day="1", volume="26", pages="e58831", keywords="artificial intelligence", keywords="AI", keywords="large language models", keywords="LLM", keywords="ChatGPT", keywords="patient education", keywords="patient information", keywords="patient perceptions", keywords="chatbot", keywords="chatbots", keywords="empathy", abstract="Background: Artificial intelligence and the language models derived from it, such as ChatGPT, offer immense possibilities, particularly in the field of medicine. It is already evident that ChatGPT can provide adequate and, in some cases, expert-level responses to health-related queries and advice for patients. However, it is currently unknown how patients perceive these capabilities, whether they can derive benefit from them, and whether potential risks, such as harmful suggestions, are detected by patients. Objective: This study aims to clarify whether patients can get useful and safe health care advice from an artificial intelligence chatbot assistant. Methods: This cross-sectional study was conducted using 100 publicly available health-related questions from 5 medical specialties (trauma, general surgery, otolaryngology, pediatrics, and internal medicine) from a web-based platform for patients. Responses generated by ChatGPT-4.0 and by an expert panel (EP) of experienced physicians from the aforementioned web-based platform were packed into 10 sets consisting of 10 questions each. The blinded evaluation was carried out by patients regarding empathy and usefulness (assessed through the question: ``Would this answer have helped you?'') on a scale from 1 to 5. As a control, evaluation was also performed by 3 physicians in each respective medical specialty, who were additionally asked about the potential harm of the response and its correctness. Results: In total, 200 sets of questions were submitted by 64 patients (mean 45.7, SD 15.9 years; 29/64, 45.3\% male), resulting in 2000 evaluated answers of ChatGPT and the EP each. ChatGPT scored higher in terms of empathy (4.18 vs 2.7; P<.001) and usefulness (4.04 vs 2.98; P<.001). Subanalysis revealed a small bias in terms of levels of empathy given by women in comparison with men (4.46 vs 4.14; P=.049). Ratings of ChatGPT were high regardless of the participant's age. The same highly significant results were observed in the evaluation of the respective specialist physicians. ChatGPT outperformed significantly in correctness (4.51 vs 3.55; P<.001). Specialists rated the usefulness (3.93 vs 4.59) and correctness (4.62 vs 3.84) significantly lower in potentially harmful responses from ChatGPT (P<.001). This was not the case among patients. Conclusions: The results indicate that ChatGPT is capable of supporting patients in health-related queries better than physicians, at least in terms of written advice through a web-based platform. In this study, ChatGPT's responses had a lower percentage of potentially harmful advice than the web-based EP. However, it is crucial to note that this finding is based on a specific study design and may not generalize to all health care settings. Alarmingly, patients are not able to independently recognize these potential dangers. ", doi="10.2196/58831", url="https://www.jmir.org/2024/1/e58831" } @Article{info:doi/10.2196/49387, author="MacNeill, Luke A. and MacNeill, Lillian and Luke, Alison and Doucet, Shelley", title="Health Professionals' Views on the Use of Conversational Agents for Health Care: Qualitative Descriptive Study", journal="J Med Internet Res", year="2024", month="Sep", day="25", volume="26", pages="e49387", keywords="conversational agents", keywords="chatbots", keywords="health care", keywords="health professionals", keywords="health personnel", keywords="qualitative", keywords="interview", abstract="Background: In recent years, there has been an increase in the use of conversational agents for health promotion and service delivery. To date, health professionals' views on the use of this technology have received limited attention in the literature. Objective: The purpose of this study was to gain a better understanding of how health professionals view the use of conversational agents for health care. Methods: Physicians, nurses, and regulated mental health professionals were recruited using various web-based methods. Participants were interviewed individually using the Zoom (Zoom Video Communications, Inc) videoconferencing platform. Interview questions focused on the potential benefits and risks of using conversational agents for health care, as well as the best way to integrate conversational agents into the health care system. Interviews were transcribed verbatim and uploaded to NVivo (version 12; QSR International, Inc) for thematic analysis. Results: A total of 24 health professionals participated in the study (19 women, 5 men; mean age 42.75, SD 10.71 years). Participants said that the use of conversational agents for health care could have certain benefits, such as greater access to care for patients or clients and workload support for health professionals. They also discussed potential drawbacks, such as an added burden on health professionals (eg, program familiarization) and the limited capabilities of these programs. Participants said that conversational agents could be used for routine or basic tasks, such as screening and assessment, providing information and education, and supporting individuals between appointments. They also said that health professionals should have some oversight in terms of the development and implementation of these programs. Conclusions: The results of this study provide insight into health professionals' views on the use of conversational agents for health care, particularly in terms of the benefits and drawbacks of these programs and how they should be integrated into the health care system. These collective findings offer useful information and guidance to stakeholders who have an interest in the development and implementation of this technology. ", doi="10.2196/49387", url="https://www.jmir.org/2024/1/e49387", url="http://www.ncbi.nlm.nih.gov/pubmed/39320936" } @Article{info:doi/10.2196/57926, author="Prakash, Ravi and Dupre, E. Matthew and {\O}stbye, Truls and Xu, Hanzhang", title="Extracting Critical Information from Unstructured Clinicians' Notes Data to Identify Dementia Severity Using a Rule-Based Approach: Feasibility Study", journal="JMIR Aging", year="2024", month="Sep", day="24", volume="7", pages="e57926", keywords="electronic health record", keywords="EHR", keywords="electric medical record", keywords="EMR", keywords="patient record", keywords="health record", keywords="personal health record", keywords="PHR", keywords="unstructured data", keywords="rule based analysis", keywords="artificial intelligence", keywords="AI", keywords="large language model", keywords="LLM", keywords="natural language processing", keywords="NLP", keywords="deep learning", keywords="Alzheimer's disease and related dementias", keywords="AD", keywords="ADRD", keywords="Alzheimer's disease", keywords="dementia", keywords="geriatric syndromes", abstract="Background: The severity of Alzheimer disease and related dementias (ADRD) is rarely documented in structured data fields in electronic health records (EHRs). Although this information is important for clinical monitoring and decision-making, it is often undocumented or ``hidden'' in unstructured text fields and not readily available for clinicians to act upon. Objective: We aimed to assess the feasibility and potential bias in using keywords and rule-based matching for obtaining information about the severity of ADRD from EHR data. Methods: We used EHR data from a large academic health care system that included patients with a primary discharge diagnosis of ADRD based on ICD-9 (International Classification of Diseases, Ninth Revision) and ICD-10 (International Statistical Classification of Diseases, Tenth Revision) codes between 2014 and 2019. We first assessed the presence of ADRD severity information and then the severity of ADRD in the EHR. Clinicians' notes were used to determine the severity of ADRD based on two criteria: (1) scores from the Mini Mental State Examination and Montreal Cognitive Assessment and (2) explicit terms for ADRD severity (eg, ``mild dementia'' and ``advanced Alzheimer disease''). We compiled a list of common ADRD symptoms, cognitive test names, and disease severity terms, refining it iteratively based on previous literature and clinical expertise. Subsequently, we used rule-based matching in Python using standard open-source data analysis libraries to identify the context in which specific words or phrases were mentioned. We estimated the prevalence of documented ADRD severity and assessed the performance of our rule-based algorithm. Results: We included 9115 eligible patients with over 65,000 notes from the providers. Overall, 22.93\% (2090/9115) of patients were documented with mild ADRD, 20.87\% (1902/9115) were documented with moderate or severe ADRD, and 56.20\% (5123/9115) did not have any documentation of the severity of their ADRD. For the task of determining the presence of any ADRD severity information, our algorithm achieved an accuracy of >95\%, specificity of >95\%, sensitivity of >90\%, and an F1-score of >83\%. For the specific task of identifying the actual severity of ADRD, the algorithm performed well with an accuracy of >91\%, specificity of >80\%, sensitivity of >88\%, and F1-score of >92\%. Comparing patients with mild ADRD to those with more advanced ADRD, the latter group tended to contain older, more likely female, and Black patients, and having received their diagnoses in primary care or in-hospital settings. Relative to patients with undocumented ADRD severity, those with documented ADRD severity had a similar distribution in terms of sex, race, and rural or urban residence. Conclusions: Our study demonstrates the feasibility of using a rule-based matching algorithm to identify ADRD severity from unstructured EHR report data. However, it is essential to acknowledge potential biases arising from differences in documentation practices across various health care systems. ", doi="10.2196/57926", url="https://aging.jmir.org/2024/1/e57926", url="http://www.ncbi.nlm.nih.gov/pubmed/39316421" } @Article{info:doi/10.2196/55182, author="Mess, Veronica Elisabeth and Kramer, Frank and Krumme, Julia and Kanelakis, Nico and Teynor, Alexandra", title="Use of Creative Frameworks in Health Care to Solve Data and Information Problems: Scoping Review", journal="JMIR Hum Factors", year="2024", month="Sep", day="13", volume="11", pages="e55182", keywords="creative frameworks", keywords="data and information problems", keywords="data collection", keywords="data processing", keywords="data provision", keywords="health care", keywords="information visualization", keywords="interdisciplinary teams", keywords="user-centered design", keywords="user-centered data design", keywords="user-centric development", abstract="Background: Digitization is vital for data management, especially in health care. However, problems still hinder health care stakeholders in their daily work while collecting, processing, and providing health data or information. Data are missing, incorrect, cannot be collected, or information is inadequately presented. These problems can be seen as data or information problems. A proven way to elicit requirements for (software) systems is by using creative frameworks (eg, user-centered design, design thinking, lean UX [user experience], or service design) or creative methods (eg, mind mapping, storyboarding, 6 thinking hats, or interaction room). However, to what extent they are used to solve data or information-related problems in health care is unclear. Objective: The primary objective of this scoping review is to investigate the use of creative frameworks in addressing data and information problems in health care. Methods: Following JBI guidelines and the PRISMA-ScR framework, this paper analyzes selected papers, answering whether creative frameworks addressed health care data or information problems. Focusing on data problems (elicitation or collection, processing) and information problems (provision or visualization), the review examined German and English papers published between 2018 and 2022 using keywords related to ``data,'' ``design,'' and ``user-centered.'' The database SCOPUS was used. Results: Of the 898 query results, only 23 papers described a data or information problem and a creative method to solve it. These were included in the follow-up analysis and divided into different problem categories: data collection (n=7), data processing (n=1), information visualization (n=11), and mixed problems meaning data and information problem present (n=4). The analysis showed that most identified problems fall into the information visualization category. This could indicate that creative frameworks are particularly suitable for solving information or visualization problems and less for other, more abstract areas such as data problems. The results also showed that most researchers applied a creative framework after they knew what specific (data or information) problem they had (n=21). Only a minority chose a creative framework to identify a problem and realize it was a data or information problem (n=2). In response to these findings, the paper discusses the need for a new approach that addresses health care data and information challenges by promoting collaboration, iterative feedback, and user-centered development. Conclusions: Although the potential of creative frameworks is undisputed, applying these in solving data and information problems is a minority. To harness this potential, a suitable method needs to be developed to support health care system stakeholders. This method could be the User-Centered Data Approach. ", doi="10.2196/55182", url="https://humanfactors.jmir.org/2024/1/e55182" } @Article{info:doi/10.2196/47562, author="Fari{\v c}, Nu{\vs}a and Potts, WW Henry and Heilman, M. James", title="Quality of Male and Female Medical Content on English-Language Wikipedia: Quantitative Content Analysis", journal="J Med Internet Res", year="2024", month="Sep", day="12", volume="26", pages="e47562", keywords="Wikipedia", keywords="wikis", keywords="writing", keywords="internet", keywords="health information", keywords="sex", keywords="sex bias", keywords="consumer health information", keywords="health communication", keywords="public education", keywords="social media", abstract="Background: Wikipedia is the largest free online encyclopedia and the seventh most visited website worldwide, containing >45,000 freely accessible English-language medical articles accessed nearly 1.6 billion times annually. Concerns have been expressed about the balance of content related to biological sex on Wikipedia. Objective: This study aims to categorize the top 1000 most-read (most popular) English-language Wikipedia health articles for June 2019 according to the relevance of the article topic to each sex and quality. Methods: In the first step, Wikipedia articles were identified using WikiProject Medicine Popular Pages. These were analyzed on 13 factors, including total views, article quality, and total number of references. In the second step, 2 general medical textbooks were used as comparators to assess whether Wikipedia's spread of articles was typical compared to the general medical coverage. According to the article's content, we proposed criteria with 5 categories: 1=``exclusively female,'' 2=``predominantly female but can also affect male individuals,'' 3=``not sex specific or neutral,'' 4=predominantly male but can affect female individuals,'' and 5=``exclusively male.'' Results: Of the 1000 Wikipedia health articles, 933 (93.3\%) were not sex specific and 67 (6.7\%) were sex specific. There was no statistically significant difference in the number of reads per month between the sex-specific and non--sex-specific articles (P=.29). Coverage of female topics was higher (50/1000, 5\%) than male topics (17/1000, 1.7\%; this difference was also observed for the 2 medical textbooks, in which 90.2\% (2330/2584) of content was not sex specific, female topics accounted for 8.1\% (209/2584), and male topics for accounted for 1.7\% (45/2584; statistically significant difference; Fisher exact test P=.03). Female-category articles were ranked higher on the Wikipedia medical topic importance list (top, high, or mid importance) than male-category articles (borderline statistical significance; Fisher exact test P=.05). Female articles had a higher number of total and unique references; a slightly higher number of page watchers, pictures, and available languages; and lower number of edits than male articles (all were statistically nonsignificant). Conclusions: Across several metrics, a sample of popular Wikipedia health-related articles for both sexes had comparable quality. Wikipedia had a lower number of female articles and a higher number of neutral articles relative to the 2 medical textbooks. These differences were small, but statistically significant. Higher exclusively female coverage, compared to exclusively male coverage, in Wikipedia articles was similar to the 2 medical textbooks and can be explained by inclusion of sections on obstetrics and gynecology. This is unlike the imbalance seen among biographies of living people, in which approximately 77.6\% pertain to male individuals. Although this study included a small sample of articles, the spread of Wikipedia articles may reflect the readership and the population's content consumption at a given time. Further study of a larger sample of Wikipedia articles would be valuable. ", doi="10.2196/47562", url="https://www.jmir.org/2024/1/e47562" } @Article{info:doi/10.2196/58705, author="Manuilova, Iryna and Bossenz, Jan and Weise, Bianka Annemarie and Boehm, Dominik and Strantz, Cosima and Unberath, Philipp and Reimer, Niklas and Metzger, Patrick and Pauli, Thomas and Werle, D. Silke and Schulze, Susann and Hiemer, Sonja and Ustjanzew, Arsenij and Kestler, A. Hans and Busch, Hauke and Brors, Benedikt and Christoph, Jan", title="Identifications of Similarity Metrics for Patients With Cancer: Protocol for a Scoping Review", journal="JMIR Res Protoc", year="2024", month="Sep", day="4", volume="13", pages="e58705", keywords="patient similarity", keywords="cancer research", keywords="patient similarity applications", keywords="precision medicine", keywords="cancer similarity metrics", keywords="scoping review protocol", abstract="Background: Understanding the similarities of patients with cancer is essential to advancing personalized medicine, improving patient outcomes, and developing more effective and individualized treatments. It enables researchers to discover important patterns, biomarkers, and treatment strategies that can have a significant impact on cancer research and oncology. In addition, the identification of previously successfully treated patients supports oncologists in making treatment decisions for a new patient who is clinically or molecularly similar to the previous patient. Objective: The planned review aims to systematically summarize, map, and describe existing evidence to understand how patient similarity is defined and used in cancer research and clinical care. Methods: To systematically identify relevant studies and to ensure reproducibility and transparency of the review process, a comprehensive literature search will be conducted in several bibliographic databases, including Web of Science, PubMed, LIVIVIVO, and MEDLINE, covering the period from 1998 to February 2024. After the initial duplicate deletion phase, a study selection phase will be applied using Rayyan, which consists of 3 distinct steps: title and abstract screening, disagreement resolution, and full-text screening. To ensure the integrity and quality of the selection process, each of these steps is preceded by a pilot testing phase. This methodological process will culminate in the presentation of the final research results in a structured form according to the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) flowchart. The protocol has been registered in the Journal of Medical Internet Research. Results: This protocol outlines the methodologies used in conducting the scoping review. A search of the specified electronic databases and after removing duplicates resulted in 1183 unique records. As of March 2024, the review process has moved to the full-text evaluation phase. At this stage, data extraction will be conducted using a pretested chart template. Conclusions: The scoping review protocol, centered on these main concepts, aims to systematically map the available evidence on patient similarity among patients with cancer. By defining the types of data sources, approaches, and methods used in the field, and aligning these with the research questions, the review will provide a foundation for future research and clinical application in personalized cancer care. This protocol will guide the literature search, data extraction, and synthesis of findings to achieve the review's objectives. International Registered Report Identifier (IRRID): DERR1-10.2196/58705 ", doi="10.2196/58705", url="https://www.researchprotocols.org/2024/1/e58705", url="http://www.ncbi.nlm.nih.gov/pubmed/39230952" } @Article{info:doi/10.2196/55352, author="Marshall, Zack and Bhattacharjee, Maushumi and Wang, Meng and Cadri, Abdul and James, Hannah and Asghari, Shabnam and Peltekian, Rene and Benz, Veronica and Finley-Roy, Vanessa and Childs, Brynna and Asaad, Lauren and Swab, Michelle and Welch, Vivian and Brunger, Fern and Kaposy, Chris", title="Finding Medical Photographs of Patients Online: Randomized, Cross-Sectional Study", journal="J Med Internet Res", year="2024", month="Jun", day="24", volume="26", pages="e55352", keywords="patient photographs", keywords="privacy", keywords="informed consent", keywords="publication ethics", keywords="case reports", abstract="Background: Photographs from medical case reports published in academic journals have previously been found in online image search results. This means that patient photographs circulate beyond the original journal website and can be freely accessed online. While this raises ethical and legal concerns, no systematic study has documented how often this occurs. Objective: The aim of this cross-sectional study was to provide systematic evidence that patient photographs from case reports published in medical journals appear in Google Images search results. Research questions included the following: (1) what percentage of patient medical photographs published in case reports were found in Google Images search results? (2) what was the relationship between open access publication status and image availability? and (3) did the odds of finding patient photographs on third-party websites differ between searches conducted in 2020 and 2022? Methods: The main outcome measure assessed whether at least 1 photograph from each case report was found on Google Images when using a structured search. Secondary outcome variables included the image source and the availability of images on third-party websites over time. The characteristics of medical images were described using summary statistics. The association between the source of full-text availability and image availability on Google Images was tested using logistic regressions. Finally, we examined the trend of finding patient photographs using generalized estimating equations. Results: From a random sample of 585 case reports indexed in PubMed, 186 contained patient photographs, for a total of 598 distinct images. For 142 (76.3\%) out of 186 case reports, at least 1 photograph was found in Google Images search results. A total of 18.3\% (110/598) of photographs included eye, face, or full body, including 10.9\% (65/598) that could potentially identify the patient. The odds of finding an image from the case report online were higher if the full-text paper was available on ResearchGate (odds ratio [OR] 9.16, 95\% CI 2.71-31.02), PubMed Central (OR 7.90, 95\% CI 2.33-26.77), or Google Scholar (OR 6.07, 95\% CI 2.77-13.29) than if the full-text was available solely through an open access journal (OR 5.33, 95\% CI 2.31-12.28). However, all factors contributed to an increased risk of locating patient images online. Compared with the search in 2020, patient photographs were less likely to be found on third-party websites based on the 2022 search results (OR 0.61, 95\% Cl 0.43-0.87). Conclusions: A high proportion of medical photographs from case reports was found on Google Images, raising ethical concerns with policy and practice implications. Journal publishers and corporations such as Google are best positioned to develop an effective remedy. Until then, it is crucial that patients are adequately informed about the potential risks and benefits of providing consent for clinicians to publish their images in medical journals. ", doi="10.2196/55352", url="https://www.jmir.org/2024/1/e55352", url="http://www.ncbi.nlm.nih.gov/pubmed/38913416" } @Article{info:doi/10.2196/52655, author="Invernici, Francesco and Bernasconi, Anna and Ceri, Stefano", title="Searching COVID-19 Clinical Research Using Graph Queries: Algorithm Development and Validation", journal="J Med Internet Res", year="2024", month="May", day="30", volume="26", pages="e52655", keywords="big data corpus", keywords="clinical research", keywords="co-occurrence network", keywords="COVID-19 Open Research Dataset", keywords="CORD-19", keywords="graph search", keywords="Named Entity Recognition", keywords="Neo4j", keywords="text mining", abstract="Background: Since the beginning of the COVID-19 pandemic, >1 million studies have been collected within the COVID-19 Open Research Dataset, a corpus of manuscripts created to accelerate research against the disease. Their related abstracts hold a wealth of information that remains largely unexplored and difficult to search due to its unstructured nature. Keyword-based search is the standard approach, which allows users to retrieve the documents of a corpus that contain (all or some of) the words in a target list. This type of search, however, does not provide visual support to the task and is not suited to expressing complex queries or compensating for missing specifications. Objective: This study aims to consider small graphs of concepts and exploit them for expressing graph searches over existing COVID-19--related literature, leveraging the increasing use of graphs to represent and query scientific knowledge and providing a user-friendly search and exploration experience. Methods: We considered the COVID-19 Open Research Dataset corpus and summarized its content by annotating the publications' abstracts using terms selected from the Unified Medical Language System and the Ontology of Coronavirus Infectious Disease. Then, we built a co-occurrence network that includes all relevant concepts mentioned in the corpus, establishing connections when their mutual information is relevant. A sophisticated graph query engine was built to allow the identification of the best matches of graph queries on the network. It also supports partial matches and suggests potential query completions using shortest paths. Results: We built a large co-occurrence network, consisting of 128,249 entities and 47,198,965 relationships; the GRAPH-SEARCH interface allows users to explore the network by formulating or adapting graph queries; it produces a bibliography of publications, which are globally ranked; and each publication is further associated with the specific parts of the query that it explains, thereby allowing the user to understand each aspect of the matching. Conclusions: Our approach supports the process of query formulation and evidence search upon a large text corpus; it can be reapplied to any scientific domain where documents corpora and curated ontologies are made available. ", doi="10.2196/52655", url="https://www.jmir.org/2024/1/e52655", url="http://www.ncbi.nlm.nih.gov/pubmed/38814687" } @Article{info:doi/10.2196/49928, author="Romeiser, L. Jamie and Jusko, Nicole and Williams, A. Augusta", title="Emerging Trends in Information-Seeking Behavior for Alpha-Gal Syndrome: Infodemiology Study Using Time Series and Content Analysis", journal="J Med Internet Res", year="2024", month="May", day="8", volume="26", pages="e49928", keywords="alpha-gal", keywords="alpha gal", keywords="alpha-gal syndrome", keywords="lone star tick", keywords="infodemiology", keywords="time series", keywords="content analysis", keywords="Google Trends", keywords="allergy", keywords="allergic", keywords="immune", keywords="immunology", keywords="immunological", keywords="information behavior", keywords="information behaviour", keywords="information seeking", keywords="geographic", abstract="Background: Alpha-gal syndrome is an emerging allergy characterized by an immune reaction to the carbohydrate molecule alpha-gal found in red meat. This unique food allergy is likely triggered by a tick bite. Cases of the allergy are on the rise, but prevalence estimates do not currently exist. Furthermore, varying symptoms and limited awareness of the allergy among health care providers contribute to delayed diagnosis, leading individuals to seek out their own information and potentially self-diagnose. Objective: The study aimed to (1) describe the volume and patterns of information-seeking related to alpha-gal, (2) explore correlations between alpha-gal and lone star ticks, and (3) identify specific areas of interest that individuals are searching for in relation to alpha-gal. Methods: Google Trends Supercharged-Glimpse, a new extension of Google Trends, provides estimates of the absolute volume of searches and related search queries. This extension was used to assess trends in searches for alpha-gal and lone star ticks (lone star tick, alpha gal, and meat allergy, as well as food allergy for comparison) in the United States. Time series analyses were used to examine search volume trends over time, and Spearman correlation matrices and choropleth maps were used to explore geographic and temporal correlations between alpha-gal and lone star tick searches. Content analysis was performed on related search queries to identify themes and subcategories that are of interest to information seekers. Results: Time series analysis revealed a rapidly increasing trend in search volumes for alpha-gal beginning in 2015. After adjusting for long-term trends, seasonal trends, and media coverage, from 2015 to 2022, the predicted adjusted average annual percent change in search volume for alpha-gal was 33.78\%. The estimated overall change in average search volume was 627\%. In comparison, the average annual percent change was 9.23\% for lone star tick, 7.34\% for meat allergy, and 2.45\% for food allergy during this time. Geographic analysis showed strong significant correlations between alpha-gal and lone star tick searches especially in recent years ($\rho$=0.80; P<.001), with primary overlap and highest search rates found in the southeastern region of the United States. Content analysis identified 10 themes of primary interest: diet, diagnosis or testing, treatment, medications or contraindications of medications, symptoms, tick related, specific sources of information and locations, general education information, alternative words for alpha-gal, and unrelated or other. Conclusions: The study provides insights into the changing information-seeking patterns for alpha-gal, indicating growing awareness and interest. Alpha-gal search volume is increasing at a rapid rate. Understanding specific questions and concerns can help health care providers and public health educators to tailor communication strategies. The Google Trends Supercharged-Glimpse tool offers enhanced features for analyzing information-seeking behavior and can be valuable for infodemiology research. Further research is needed to explore the evolving prevalence and impact of alpha-gal syndrome. ", doi="10.2196/49928", url="https://www.jmir.org/2024/1/e49928", url="http://www.ncbi.nlm.nih.gov/pubmed/38717813" } @Article{info:doi/10.2196/53646, author="Glayzer, E. Jennifer and Bray, C. Bethany and Kobak, H. William and Steffen, D. Alana and Schlaeger, M. Judith", title="Lack of Diversity in Research on Females with Ehlers-Danlos Syndromes: Recruitment Protocol for a Quantitative Online Survey", journal="JMIR Res Protoc", year="2024", month="May", day="2", volume="13", pages="e53646", keywords="Ehlers-Danlos syndrome", keywords="hypermobility", keywords="social media", keywords="recruitment", keywords="Facebook", keywords="hereditary disease", keywords="connective tissue disorders", keywords="racial", keywords="ethnic", keywords="diversity", keywords="challenges", keywords="strategies", keywords="strategy", keywords="online", keywords="information seeking", keywords="cross-sectional survey", keywords="dyspareunia", keywords="painful sex", keywords="United States", abstract="Background: Ehlers-Danlos syndromes (EDS) are a group of connective tissue disorders caused by fragile lax collagen. Current EDS research lacks racial and ethnic diversity. The lack of diversity may be associated with the complexities of conducting a large international study on an underdiagnosed condition and a lack of EDS health care providers who diagnose and conduct research outside of the United States and Europe. Social media may be the key to recruiting a large diverse EDS sample. However, studies that have used social media to recruit have not been able to recruit diverse samples. Objective: This study aims to discuss challenges, strategies, outcomes, and lessons learned from using social media to recruit a large sample of females with EDS. Methods: Recruitment on social media for a cross-sectional survey examining dyspareunia (painful sexual intercourse) in females was examined. Inclusion criteria were (1) older than 18 years of age, (2) assigned female at birth, and (3) diagnosed with EDS. Recruitment took place on Facebook and Twitter (now X), from June 1 to June 25, 2019. Results: A total of 1178 females with EDS were recruited from Facebook (n=1174) and X (n=4). On Facebook, participants were recruited via support groups. A total of 166 EDS support groups were identified, 104 permitted the principal investigator to join, 90 approved posting, and the survey was posted in 54 groups. Among them, 30 of the support groups posted in were globally focused and not tied to any specific country or region, 21 were for people in the United States, and 3 were for people outside of the United States. Recruitment materials were posted on X with the hashtag \#EDS. A total of 1599 people accessed the survey and 1178 people were eligible and consented. The average age of participants was 38.6 (SD 11.7) years. Participants were predominantly White (n=1063, 93\%) and non-Hispanic (n=1046, 92\%). Participants were recruited from 29 countries, with 900 (79\%) from the United States and 124 (11\%) from Great Britain. Conclusions: Our recruitment method was successful at recruiting a large sample. The sample was predominantly White and from North America and Europe. More research needs to be conducted on how to recruit a diverse sample. Areas to investigate may include connecting with more support groups from outside the United States and Europe, researching which platforms are popular in different countries, and translating study materials into different languages. A larger obstacle to recruiting diverse samples may be the lack of health care providers that diagnose EDS outside the United States and Europe, making the pool of potential participants small. There needs to be more health care providers that diagnose and treat EDS in countries that are predominantly made up of people of color as well as research that specifically focuses on these populations. International Registered Report Identifier (IRRID): RR1-10.2196/53646 ", doi="10.2196/53646", url="https://www.researchprotocols.org/2024/1/e53646", url="http://www.ncbi.nlm.nih.gov/pubmed/38696252" } @Article{info:doi/10.2196/50518, author="Park, Daemin and Kim, Dasom and Park, Ah-hyun", title="Agendas on Nursing in South Korea Media: Natural Language Processing and Network Analysis of News From 2005 to 2022", journal="J Med Internet Res", year="2024", month="Mar", day="19", volume="26", pages="e50518", keywords="nurses", keywords="news", keywords="South Korea", keywords="natural language processing", keywords="NLP", keywords="network analysis", keywords="politicization", abstract="Background: In recent years, Korean society has increasingly recognized the importance of nurses in the context of population aging and infectious disease control. However, nurses still face difficulties with regard to policy activities that are aimed at improving the nursing workforce structure and working environment. Media coverage plays an important role in public awareness of a particular issue and can be an important strategy in policy activities. Objective: This study analyzed data from 18 years of news coverage on nursing-related issues. The focus of this study was to examine the drivers of the social, local, economic, and political agendas that were emphasized in the media by the analysis of main sources and their quotes. This analysis revealed which nursing media agendas were emphasized (eg, social aspects), neglected (eg, policy aspects), and negotiated. Methods: Descriptive analysis, natural language processing, and semantic network analysis were applied to analyze data collected from 2005 to 2022. BigKinds were used for the collection of data, automatic multi-categorization of news, named entity recognition of news sources, and extraction and topic modeling of quotes. The main news sources were identified by conducting a 1-mode network analysis with SNAnalyzer. The main agendas of nursing-related news coverage were examined through the qualitative analysis of major sources' quotes by section. The common and individual interests of the top-ranked sources were analyzed through a 2-mode network analysis using UCINET. Results: In total, 128,339 articles from 54 media outlets on nursing-related issues were analyzed. Descriptive analysis showed that nursing-related news was mainly covered in social (99,868/128,339, 77.82\%) and local (48,056/128,339, 48.56\%) sections, whereas it was rarely covered in economic (9439/128,339, 7.35\%) and political (7301/128,339, 5.69\%) sections. Furthermore, 445 sources that had made the top 20 list at least once by year and section were analyzed. Other than ``nurse,'' the main sources for each section were ``labor union,'' ``local resident,'' ``government,'' and ``Moon Jae-in.'' ``Nursing Bill'' emerged as a common interest among nurses and doctors, although the topic did not garner considerable attention from the Ministry of Health and Welfare. Analyzing quotes showed that nurses were portrayed as heroes, laborers, survivors of abuse, and perpetrators. The economic section focused on employment of youth and women in nursing. In the political section, conflicts between nurses and doctors, which may have caused policy confusion, were highlighted. Policy formulation processes were not adequately reported. Media coverage of the enactment of nursing laws tended to relate to confrontations between political parties. Conclusions: The media plays a crucial role in highlighting various aspects of nursing practice. However, policy formulation processes to solve nursing issues were not adequately reported in South Korea. This study suggests that nurses should secure policy compliance by persuading the public to understand their professional perspectives. ", doi="10.2196/50518", url="https://www.jmir.org/2024/1/e50518", url="http://www.ncbi.nlm.nih.gov/pubmed/38393293" } @Article{info:doi/10.2196/42849, author="Ling, Eunice and de Pieri, Domenico and Loh, Evenne and Scott, M. Karen and Li, H. Stephen C. and Medbury, J. Heather", title="Evaluation of the Accuracy, Credibility, and Readability of Statin-Related Websites: Cross-Sectional Study", journal="Interact J Med Res", year="2024", month="Mar", day="14", volume="13", pages="e42849", keywords="statins", keywords="consumer health information", keywords="readability", keywords="credibility", keywords="accuracy", keywords="digital health, health information seeking", keywords="cardiovascular", keywords="mortality", keywords="management", keywords="pharmacotherapy", keywords="risk", keywords="medication", abstract="Background: Cardiovascular disease (CVD) represents the greatest burden of mortality worldwide, and statins are the most commonly prescribed drug in its management. A wealth of information pertaining to statins and their side effects is on the internet; however, to date, no assessment of the accuracy, credibility, and readability of this information has been undertaken. Objective: This study aimed to evaluate the quality (accuracy, credibility, and readability) of websites likely to be visited by the general public undertaking a Google search of the side effects and use of statin medications. Methods: Following a Google web search, we reviewed the top 20 consumer-focused websites with statin information. Website accuracy, credibility, and readability were assessed based on website category (commercial, not-for-profit, and media), website rank, and the presence or absence of the Health on the Net Code of Conduct (HONcode) seal. Accuracy and credibility were assessed following the development of checklists (with 20 and 13 items, respectively). Readability was assessed using the Simple Measure of Gobbledegook scores. Results: Overall, the accuracy score was low (mean 14.35 out of 20). While side effects were comprehensively covered by 18 websites, there was little information about statin use in primary and secondary prevention. None of the websites met all criteria on the credibility checklist (mean 7.8 out of 13). The median Simple Measure of Gobbledegook score was 9.65 (IQR 8.825-10.85), with none of the websites meeting the recommended reading grade of 6, even the media websites. A website bearing the HONcode seal did not mean that the website was more comprehensive or readable. Conclusions: The quality of statin-related websites tended to be poor. Although the information contained was accurate, it was not comprehensive and was presented at a reading level that was too difficult for an average reader to fully comprehend. As such, consumers risk being uninformed about this pharmacotherapy. ", doi="10.2196/42849", url="https://www.i-jmr.org/2024/1/e42849", url="http://www.ncbi.nlm.nih.gov/pubmed/38483461" } @Article{info:doi/10.2196/53372, author="Bevens, William and Davenport, Rebekah and Neate, Sandra and Yu, Maggie and Jelinek, Pia and Jelinek, Alexander George and Reece, Jeanette", title="Web-Based Health Information Seeking by People Living With Multiple Sclerosis: Qualitative Investigation of the Multiple Sclerosis Online Course", journal="J Med Internet Res", year="2024", month="Feb", day="9", volume="26", pages="e53372", keywords="information-seeking behavior", keywords="self-management", keywords="lifestyle", keywords="digital health", abstract="Background: Digital technologies have afforded people living with multiple sclerosis (MS) access to telehealth consultations, diagnostic tools, and monitoring. Although health care professionals remain the most trusted source of information, the internet has emerged as a valuable resource for providing MS-related information, particularly during the COVID-19 pandemic. Notably, people living with MS are increasingly seeking educational content for a range of topics related to the self-management of MS; however, web-based information seeking remains largely underevaluated. To address this gap and ensure that web-based health-related information is accessible and engaging, this study used qualitative methods to analyze the reflections from participants of web-based educational programs for people living with MS. Objective: This study aimed to explore the motivations, behaviors, and expectations of web-based health information seeking for people living with MS. Methods: We conducted semistructured interviews for 38 people living with MS 1 month after they completed the novel MS Online Course, which provided information on modifiable lifestyle-related risk factors for people living with MS. Of the 38 participants, 22 (58\%) completed the intervention course and 16 (42\%) completed the standard care course. Inductive thematic analysis was used within a qualitative paradigm, and 2 authors coded each interview separately and arrived at themes with consensus. Results: We identified 2 themes: motivation to learn and MS information on the web. The diagnosis of MS was described as a pivotal moment for precipitating web-based information seeking. People living with MS sought lifestyle-related information to facilitate self-management and increase control of their MS. Although social media sites and MS websites were considered useful for providing both support and information, discretion was needed to critically appraise information. Recognizable institutions were frequently accessed because of their trustworthiness. Conclusions: This study provided novel insights into the motivations of people living with MS for seeking web-based health information. Furthermore, their preferences for the content and format of the web-based information accessed and their experiences and reactions to this information were explored. These findings may guide educators, researchers, and clinicians involved in MS care to optimize the engagement and processing of web-based health information seeking by people living with MS. ", doi="10.2196/53372", url="https://www.jmir.org/2024/1/e53372", url="http://www.ncbi.nlm.nih.gov/pubmed/38335016" } @Article{info:doi/10.2196/48599, author="Moens, Maarten and Van Doorslaer, Leen and Billot, Maxime and Eeckman, Edgard and Roulaud, Manuel and Rigoard, Philippe and Fobelets, Maaike and Goudman, Lisa", title="Examining the Type, Quality, and Content of Web-Based Information for People With Chronic Pain Interested in Spinal Cord Stimulation: Social Listening Study", journal="J Med Internet Res", year="2024", month="Jan", day="30", volume="26", pages="e48599", keywords="online information", keywords="social listening", keywords="neuromodulation", keywords="patient care", keywords="chronic pain", keywords="web-based data", abstract="Background: The increased availability of web-based medical information has encouraged patients with chronic pain to seek health care information from multiple sources, such as consultation with health care providers combined with web-based information. The type and quality of information that is available on the web is very heterogeneous, in terms of content, reliability, and trustworthiness. To date, no studies have evaluated what information is available about neuromodulation on the web for patients with chronic pain. Objective: This study aims to explore the type, quality, and content of web-based information regarding spinal cord stimulation (SCS) for chronic pain that is freely available and targeted at health care consumers. Methods: The social listening tool Awario was used to search Facebook (Meta Platforms, Inc), Twitter (Twitter, Inc), YouTube (Google LLC), Instagram (Meta Platforms, Inc), blogs, and the web for suitable hits with ``pain'' and ``neuromodulation'' as keywords. Quality appraisal of the extracted information was performed using the DISCERN instrument. A thematic analysis through inductive coding was conducted. Results: The initial search identified 2174 entries, of which 630 (28.98\%) entries were eventually withheld, which could be categorized as web pages, including news and blogs (114/630, 18.1\%); Reddit (Reddit, Inc) posts (32/630, 5.1\%); Vimeo (Vimeo, Inc) hits (38/630, 6\%); or YouTube (Google LLC) hits (446/630, 70.8\%). Most posts originated in the United States (519/630, 82.4\%). Regarding the content of information, 66.2\% (383/579) of the entries discussed (fully discussed or partially discussed) how SCS works. In total, 55.6\% (322/579) of the entries did not elaborate on the fact that there may be >1 potential treatment choice and 47.7\% (276/579) did not discuss the influence of SCS on the overall quality of life. The inductive coding revealed 4 main themes. The first theme of pain and the burden of pain (1274/8886, 14.34\% coding references) explained about pain, pain management, individual impact of pain, and patient experiences. The second theme included neuromodulation as a treatment approach (3258/8886, 36.66\% coding references), incorporating the background on neuromodulation, patient-centered care, SCS therapy, and risks. Third, several device-related aspects (1722/8886, 19.38\% coding references) were presented. As a final theme, patient benefits and testimonials of treatment with SCS (2632/8886, 29.62\% coding references) were revealed with subthemes regarding patient benefits, eligibility, and testimonials and expectations. Conclusions: Health care consumers have access to web-based information about SCS, where details about the surgical procedures, the type of material, working mechanisms, risks, patient expectations, testimonials, and the potential benefits of this therapy are discussed. The reliability, trustworthiness, and correctness of web-based sources should be carefully considered before automatically relying on the content. ", doi="10.2196/48599", url="https://www.jmir.org/2024/1/e48599", url="http://www.ncbi.nlm.nih.gov/pubmed/38289645" } @Article{info:doi/10.2196/57779, author="Graham, Scott S. and Shiva, Jade and Sharma, Nandini and Barbour, B. Joshua and Majdik, P. Zoltan and Rousseau, F. Justin", title="Conflicts of Interest Publication Disclosures: Descriptive Study", journal="JMIR Data", year="2024", month="Oct", day="31", volume="5", pages="e57779", keywords="conflicts of interest", keywords="biomedical publishing", keywords="research integrity", keywords="dataset", keywords="COI", keywords="ethical", keywords="ethics", keywords="publishing", keywords="drugs", keywords="pharmacies", keywords="pharmacology", keywords="pharmacotherapy", keywords="pharmaceuticals", keywords="medication", keywords="disclosure", keywords="information science", keywords="library science", keywords="open data", abstract="Background: Multiple lines of previous research have documented that author conflicts of interest (COI) can compromise the integrity of the biomedical research enterprise. However, continuing research that would investigate why, how, and in what circumstances COI is most risky is stymied by the difficulty in accessing disclosure statements, which are not widely represented in available databases. Objective: In this study, we describe a new open access dataset of COI disclosures extracted from published biomedical journal papers. Methods: To develop the dataset, we used ClinCalc's Top 300 drugs lists for 2017 and 2018 to identify 319 of the most commonly used drugs. Search strategies for each product were developed using the National Library of Medicine's and MeSH (Medical Subject Headings) browser and deployed using the eUtilities application programming interface in April 2021. We identified the 150 most relevant papers for each product and extracted COI disclosure statements from PubMed, PubMed Central, or retrieved papers as necessary. Results: Conflicts of Interest Publication Disclosures (COIPonD) is a new dataset that captures author-reported COI disclosures for biomedical research papers published in a wide range of journals and subspecialties. COIPonD captures author-reported disclosure information (including lack of disclosure) for over 38,000 PubMed-indexed papers published between 1949 and 2022. The collected papers are indexed by discussed drug products with a focus on the 319 most commonly used drugs in the United States. Conclusions: COIPonD should accelerate research efforts to understand the effects of COI on the biomedical research enterprise. In particular, this dataset should facilitate new studies of COI effects across disciplines and subspecialties. ", doi="10.2196/57779", url="https://data.jmir.org/2024/1/e57779" } @Article{info:doi/10.2196/42619, author="Taira, Kazuya and Itaya, Takahiro and Fujita, Sumio", title="Predicting Smoking Prevalence in Japan Using Search Volumes in an Internet Search Engine: Infodemiology Study", journal="J Med Internet Res", year="2022", month="Dec", day="14", volume="24", number="12", pages="e42619", keywords="health policy", keywords="internet use", keywords="quality indicators", keywords="search engine", keywords="smoking", keywords="tobacco use", keywords="public health", keywords="infodemiology", keywords="smoking trend", keywords="health indicator", keywords="health promotion", abstract="Background: Tobacco smoking is an important public health issue and a core indicator of public health policy worldwide. However, global pandemics and natural disasters have prevented surveys from being conducted. Objective: The purpose of this study was to predict smoking prevalence by prefecture and sex in Japan using Internet search trends. Methods: This study used the infodemiology approach. The outcome variable was smoking prevalence by prefecture, obtained from national surveys. The predictor variables were the search volumes on Yahoo! Japan Search. We collected the search volumes for queries related to terms from the thesaurus of the Japanese medical article database Ichu-shi. Predictor variables were converted to per capita values and standardized as z scores. For smoking prevalence, the values for 2016 and 2019 were used, and for search volume, the values for the April 1 to March 31 fiscal year (FY) 1 year prior to the survey (ie, FY 2015 and FY 2018) were used. Partial correlation coefficients, adjusted for data year, were calculated between smoking prevalence and search volume, and a regression analysis using a generalized linear mixed model with random effects was conducted for each prefecture. Several models were tested, including a model that included all search queries, a variable reduction method, and one that excluded cigarette product names. The best model was selected with the Akaike information criterion corrected (AICC) for small sample size and the Bayesian information criterion (BIC). We compared the predicted and actual smoking prevalence in 2016 and 2019 based on the best model and predicted the smoking prevalence in 2022. Results: The partial correlation coefficients for men showed that 9 search queries had significant correlations with smoking prevalence, including cigarette (r=--0.417, P<.001), cigar in kanji (r=--0.412, P<.001), and cigar in katakana (r=-0.399, P<.001). For women, five search queries had significant correlations, including vape (r=0.335, P=.001), quitting smoking (r=0.288, P=.005), and cigar (r=0.286, P=.006). The models with all search queries were the best models for both AICC and BIC scores. Scatter plots of actual and estimated smoking prevalence in 2016 and 2019 confirmed a relatively high degree of agreement. The average estimated smoking prevalence in 2022 in the 47 prefectures for the total sample was 23.492\% (95\% CI 21.617\%-25.367\%), showing an increasing trend, with an average of 29.024\% (95\% CI 27.218\%-30.830\%) for men and 8.793\% (95\% CI 7.531\%-10.054\%) for women. Conclusions: This study suggests that the search volume of tobacco-related queries in internet search engines can predict smoking prevalence by prefecture and sex in Japan. These findings will enable the development of low-cost, timely, and crisis-resistant health indicators that will enable the evaluation of health measures and contribute to improved public health. ", doi="10.2196/42619", url="https://www.jmir.org/2022/12/e42619", url="http://www.ncbi.nlm.nih.gov/pubmed/36515993" } @Article{info:doi/10.2196/41219, author="?ulea, M. Cristina and N?d??an, Valentin and Ursachi, Tatiana and Toboltoc, Paul-C?t?lin and Benedek, Theodora", title="What Patients Find on the Internet When Looking for Information About Percutaneous Coronary Intervention: Multilanguage Cross-sectional Assessment", journal="J Med Internet Res", year="2022", month="Dec", day="6", volume="24", number="12", pages="e41219", keywords="percutaneous coronary intervention", keywords="consumer health informatics", keywords="internet", keywords="health education", keywords="health information", keywords="quality", keywords="reliability", keywords="informed decision-making", keywords="credibility", keywords="content quality", keywords="medical information", abstract="Background: The internet provides general users with wide access to medical information. However, regulating and controlling the quality and reliability of the considerable volume of available data is challenging, thus generating concerns about the consequences of inaccurate health care--related documentation. Several tools have been proposed to increase the transparency and overall trustworthiness of medical information present on the web. Objective: We aimed to analyze and compare the quality and reliability of information about percutaneous coronary intervention on English, German, Hungarian, Romanian, and Russian language websites. Methods: Following a rigorous protocol, 125 websites were selected, 25 for each language sub-sample. The websites were assessed concerning their general characteristics, compliance with a set of eEurope 2002 credibility criteria, and quality of the informational content (namely completeness and accuracy), based on a topic-specific benchmark. Completeness and accuracy were graded independently by 2 evaluators. Scores were reported on a scale from 0 to 10. The 5 language subsamples were compared regarding credibility, completeness, and accuracy. Correlations between credibility scores on the one hand, and completeness and accuracy scores, on the other hand, were tested within each language subsample. Results: The websites' compliance with credibility criteria was average at best with scores between 3.0 and 6.0. In terms of completeness and accuracy, the website subsets qualified as poor or average, with scores ranging from 2.4 to 4.6 and 3.6 to 5.3, respectively. English language websites scored significantly higher in all 3 aspects, followed by German and Hungarian language websites. Only German language websites showed a significant correlation between credibility and information quality. Conclusions: The quality of websites in English, German, Hungarian, Romanian, and Russian languages about percutaneous coronary intervention was rather inadequate and may raise concerns regarding their impact on informed decision-making. Using credibility criteria as indicators of information quality may not be warranted, as credibility scores were only exceptionally correlated with content quality. The study brings valuable descriptive data on the quality of web-based information regarding percutaneous coronary intervention in multiple languages and raises awareness about the need for responsible use of health-related web resources. ", doi="10.2196/41219", url="https://www.jmir.org/2022/12/e41219", url="http://www.ncbi.nlm.nih.gov/pubmed/36472906" } @Article{info:doi/10.2196/41527, author="Patton, Thomas and Abramovitz, Daniela and Johnson, Derek and Leas, Eric and Nobles, Alicia and Caputi, Theodore and Ayers, John and Strathdee, Steffanie and B{\'o}rquez, Annick", title="Characterizing Help-Seeking Searches for Substance Use Treatment From Google Trends and Assessing Their Use for Infoveillance: Longitudinal Descriptive and Validation Statistical Analysis", journal="J Med Internet Res", year="2022", month="Dec", day="1", volume="24", number="12", pages="e41527", keywords="internet", keywords="search", keywords="help-seeking", keywords="substance use treatment", keywords="surveillance", keywords="infoveillance", keywords="google trends", abstract="Background: There is no recognized gold standard method for estimating the number of individuals with substance use disorders (SUDs) seeking help within a given geographical area. This presents a challenge to policy makers in the effective deployment of resources for the treatment of SUDs. Internet search queries related to help seeking for SUDs using Google Trends may represent a low-cost, real-time, and data-driven infoveillance tool to address this shortfall in information. Objective: This paper assesses the feasibility of using search query data related to help seeking for SUDs as an indicator of unmet treatment needs, demand for treatment, and predictor of the health harms related to unmet treatment needs. We explore a continuum of hypotheses to account for different outcomes that might be expected to occur depending on the demand for treatment relative to the system capacity and the timing of help seeking in relation to trajectories of substance use and behavior change. Methods: We used negative binomial regression models to examine temporal trends in the annual SUD help-seeking internet search queries from Google Trends by US state for cocaine, methamphetamine, opioids, cannabis, and alcohol from 2010 to 2020. To validate the value of these data for surveillance purposes, we then used negative binomial regression models to investigate the relationship between SUD help-seeking searches and state-level outcomes across the continuum of care (including lack of care). We started by looking at associations with self-reported treatment need using data from the National Survey on Drug Use and Health, a national survey of the US general population. Next, we explored associations with treatment admission rates from the Treatment Episode Data Set, a national data system on SUD treatment facilities. Finally, we studied associations with state-level rates of people experiencing and dying from an opioid overdose, using data from the Agency for Healthcare Research and Quality and the CDC WONDER database. Results: Statistically significant differences in help-seeking searches were observed over time between 2010 and 2020 (based on P<.05 for the corresponding Wald tests). We were able to identify outlier states for each drug over time (eg, West Virginia for both opioids and methamphetamine), indicating significantly higher help-seeking behaviors compared to national trends. Results from our validation analyses across different outcomes showed positive, statistically significant associations for the models relating to treatment need for alcohol use, treatment admissions for opioid and methamphetamine use, emergency department visits related to opioid use, and opioid overdose mortality data (based on regression coefficients having P?.05). Conclusions: This study demonstrates the clear potential for using internet search queries from Google Trends as an infoveillance tool to predict the demand for substance use treatment spatially and temporally, especially for opioid use disorders. ", doi="10.2196/41527", url="https://www.jmir.org/2022/12/e41527", url="http://www.ncbi.nlm.nih.gov/pubmed/36454620" } @Article{info:doi/10.2196/38425, author="Ackleh-Tingle, V. Jonathan and Jordan, M. Natalie and Onwubiko, N. Udodirim and Chandra, Christina and Harton, E. Paige and Rentmeester, T. Shelby and Chamberlain, T. Allison", title="Prevalence and Correlates of COVID-19 Vaccine Information on Family Medicine Practices' Websites in the United States: Cross-sectional Website Content Analysis", journal="JMIR Form Res", year="2022", month="Nov", day="17", volume="6", number="11", pages="e38425", keywords="primary care", keywords="vaccine hesitancy", keywords="COVID-19", keywords="health communications", keywords="health information", keywords="health website", keywords="family practice", keywords="vaccine information", keywords="online health", keywords="health platform", keywords="online information", abstract="Background: Primary care providers are regarded as trustworthy sources of information about COVID-19 vaccines. Although primary care practices often provide information about common medical and public health topics on their practice websites, little is known about whether they also provide information about COVID-19 vaccines on their practice websites. Objective: This study aimed to investigate the prevalence and correlates of COVID-19 vaccine information on family medicine practices' website home pages in the United States. Methods: We used the Centers for Medicare and Medicaid National Provider Identifier records to create a sampling frame of all family medicine providers based in the United States, from which we constructed a nationally representative random sample of 964 family medicine providers. Between September 20 and October 8, 2021, we manually examined the practice websites of these providers and extracted data on the availability of COVID-19 vaccine information, and we implemented a 10\% cross-review quality control measure to resolve discordances in data abstraction. We estimated the prevalence of COVID-19 vaccine information on practice websites and website home pages and used Poisson regression with robust error variances to estimate crude and adjusted prevalence ratios for correlates of COVID-19 vaccine information, including practice size, practice region, university affiliation, and presence of information about seasonal influenza vaccines. Additionally, we performed sensitivity analyses to account for multiple comparisons. Results: Of the 964 included family medicine practices, most (n=509, 52.8\%) had ?10 distinct locations, were unaffiliated with a university (n=838, 87.2\%), and mentioned seasonal influenza vaccines on their websites (n=540, 56.1\%). In total, 550 (57.1\%) practices mentioned COVID-19 vaccines on their practices' website home page, specifically, and 726 (75.3\%) mentioned COVID-19 vaccines anywhere on their practice website. As practice size increased, the likelihood of finding COVID-19 vaccine information on the home page increased (n=66, 27.7\% among single-location practices, n=114, 52.5\% among practices with 2-9 locations, n=66, 56.4\% among practices with 10-19 locations, and n=304, 77.6\% among practices with 20 or more locations, P<.001 for trend). Compared to clinics in the Northeast, those in the West and Midwest United States had a similar prevalence of COVID-19 vaccine information on website home pages, but clinics in the south had a lower prevalence (adjusted prevalence ratio 0.8, 95\% CI 0.7 to 1.0; P=.02). Our results were largely unchanged in sensitivity analyses accounting for multiple comparisons. Conclusions: Given the ongoing COVID-19 pandemic, primary care practitioners who promote and provide vaccines should strongly consider utilizing their existing practice websites to share COVID-19 vaccine information. These existing platforms have the potential to serve as an extension of providers' influence on established and prospective patients who search the internet for information about COVID-19 vaccines. ", doi="10.2196/38425", url="https://formative.jmir.org/2022/11/e38425", url="http://www.ncbi.nlm.nih.gov/pubmed/36343211" } @Article{info:doi/10.2196/42447, author="Zhao, Chris Yuxiang and Zhao, Mengyuan and Song, Shijie", title="Online Health Information Seeking Among Patients With Chronic Conditions: Integrating the Health Belief Model and Social Support Theory", journal="J Med Internet Res", year="2022", month="Nov", day="2", volume="24", number="11", pages="e42447", keywords="health information seeking", keywords="patients with chronic conditions", keywords="health belief model, social support", keywords="critical health literacy", abstract="Background: Chronic diseases are the leading causes of death and disability. With the growing patient population and climbing health care expenditures, researchers and policy makers are seeking new approaches to improve the accessibility of health information on chronic diseases while lowering costs. Online health information sources can play a substantial role in effective patient education and health communication. However, some contradictory evidence suggests that patients with chronic conditions may not necessarily seek online health information. Objective: This study aims to integrate 2 theories (ie, the health belief model and social support theory) and a critical health literacy perspective to understand online health information seeking (OHIS) among patients with chronic conditions. Methods: We used the survey method to collect data from online chronic disease communities and groups on social media platforms. Eligible participants were consumers with at least 1 chronic condition and those who have experience with OHIS. A total of 390 valid questionnaires were collected. The partial least squares approach to structural equation modeling was employed to analyze the data. Results: The results suggested that perceived risk (t=3.989, P<.001) and perceived benefits (t=3.632, P<.001) significantly affected patients' OHIS. Perceived susceptibility (t=7.743, P<.001) and perceived severity (t=8.852, P<.001) were found to influence the perceived risk of chronic diseases significantly. Informational support (t=5.761, P<.001) and emotional support (t=5.748, P<.001) also impacted the perceived benefits of online sources for patients. In addition, moderation analysis showed that critical health literacy significantly moderated the link between perceived risk and OHIS (t=3.097, P=.002) but not the relationship between perceived benefits and OHIS (t=0.288, P=.774). Conclusions: This study shows that the health belief model, when combined with social support theory, can predict patients' OHIS. The perceived susceptibility and severity can effectively explain perceived risk, further predicting patients' OHIS. Informational support and emotional support can contribute to perceived benefits, thereby positively affecting patients' OHIS. This study also demonstrated the important negative moderating effects of critical health literacy on the association between perceived risk and OHIS. ", doi="10.2196/42447", url="https://www.jmir.org/2022/11/e42447", url="http://www.ncbi.nlm.nih.gov/pubmed/36322124" } @Article{info:doi/10.2196/39946, author="Lin, Michelle and Phipps, Mina and Yilmaz, Yusuf and Nash, J. Christopher and Gisondi, A. Michael and Chan, M. Teresa", title="A Fork in the Road for Emergency Medicine and Critical Care Blogs and Podcasts: Cross-sectional Study", journal="JMIR Med Educ", year="2022", month="Oct", day="28", volume="8", number="4", pages="e39946", keywords="open educational resource", keywords="free open-access meducation", keywords="FOAM", keywords="meducation", keywords="open-access", keywords="internet based", keywords="web based", keywords="website", keywords="social media", keywords="medical education", keywords="disruptive innovation", keywords="blog", keywords="podcast", keywords="emergency", keywords="critical care", abstract="Background: Free open-access meducation (FOAM) refers to open-access, web-based learning resources in medicine. It includes all formats of digital products, including blogs and podcasts. The number of FOAM blog and podcast sites in emergency medicine and critical care increased dramatically from 2002 to 2013, and physicians began to rely on the availability of these resources. The current landscape of these FOAM sites is unknown. Objective: This study aims to (1) estimate the current number of active, open-access blogs and podcasts in emergency medicine and critical care and (2) describe observed and anticipated trends in the FOAM movement using the Theory of Disruptive Innovation by Christensen as a theoretical framework. Methods: The authors used multiple resources and sampling strategies to identify active, open-access blogs and podcasts between April 25, 2022, and May 8, 2022, and classified these websites as blogs, podcasts, or blogs+podcasts. For each category, they reported the following outcome measures using descriptive statistics: age, funding, affiliations, and team composition. Based on these findings, the authors projected trends in the number of active sites using a positivist paradigm and the Theory of Disruptive Innovation as a theoretical framework. Results: The authors identified 109 emergency medicine and critical care websites, which comprised 45.9\% (n=50) blogs, 22.9\% (n=25) podcasts, and 31.2\% (n=34) blogs+podcasts. Ages ranged from 0 to 18 years; 27.5\% (n=30) sold products, 18.3\% (n=20) used advertisements, 44.0\% (n=48) had institutional funding, and 27.5\% (n=30) had no affiliation or external funding sources. Team sizes ranged from 1 (n=26, 23.9\%) to ?5 (n=60, 55\%) individuals. Conclusions: There was a sharp decline in the number of emergency medicine and critical care blogs and podcasts in the last decade, dropping 40.4\% since 2013. The initial growth of FOAM and its subsequent downturn align with principles in the Theory of Disruptive Innovation by Christensen. These findings have important implications for the field of medical education. ", doi="10.2196/39946", url="https://mededu.jmir.org/2022/4/e39946", url="http://www.ncbi.nlm.nih.gov/pubmed/36306167" } @Article{info:doi/10.2196/37845, author="van Deursen, M. Alexander J. A.", title="General Health Statuses as Indicators of Digital Inequality and the Moderating Effects of Age and Education: Cross-sectional Study", journal="J Med Internet Res", year="2022", month="Oct", day="21", volume="24", number="10", pages="e37845", keywords="digital inequality", keywords="health", keywords="MOS", keywords="eHealth", keywords="digital health", keywords="online health", keywords="age", keywords="education", keywords="survey", keywords="digital divide", keywords="attitude", keywords="health outcome", keywords="patient outcome", keywords="internet access", keywords="internet skill", keywords="technology skill", abstract="Background: Considerable effort has been directed to offering online health information and services aimed at the general population. Such efforts potentially support people to obtain improved health outcomes. However, when health information and services are moved online, issues of equality need to be considered. In this study, we focus on the general population and take as a point of departure how health statuses (physical functioning, social functioning, mental health, perceived health, and physical pain) are linked to internet access (spanning internet attitude, material access, internet skills, and health-related internet use). Objective: This study aims to reveal to what extent (1) internet access is important for online health outcomes, (2) different health statuses are important for obtaining internet access and outcomes, and (3) age and education moderate the contribution of health statuses to internet access. Methods: A sequence of 2 online surveys drawing upon a sample collected in the Netherlands was used, and a data set with 1730 respondents over the age of 18 years was obtained. Results: Internet attitude contributes positively to material access, internet skills, and health outcomes and negatively to health-related internet use. Material access contributes positively to internet skills and health-related internet use and outcomes. Internet skills contribute positively to health-related internet use and outcomes. Physical functioning contributes positively to internet attitude, material access, and internet skills but negatively to internet health use. Social functioning contributes negatively to internet attitude and positively to internet skills and internet health use. Mental health contributes positively to internet attitude and negatively to material access and internet health use. Perceived health positively contributes to material access, internet skills, and internet health use. Physical pain contributes positively to internet attitude and material access and indirectly to internet skills and internet health use. Finally, most contributions are moderated by age (<65 and ?65 years) and education (low and high). Conclusions: To make online health care attainable for the general population, interventions should focus simultaneously on internet attitude, material access, internet skills, and internet health use. However, issues of equality need to be considered. In this respect, digital inequality research benefits from considering health as a predictor of all 4 access stages. Furthermore, studies should go beyond single self-reported measures of health. Physical functioning, social functioning, mental health, perceived health, and physical pain all show unique contributions to the different internet access stages. Further complicating this issue, online health-related interventions for people with different health statuses should also consider age and the educational level of attainment. ", doi="10.2196/37845", url="https://www.jmir.org/2022/10/e37845", url="http://www.ncbi.nlm.nih.gov/pubmed/36269664" } @Article{info:doi/10.2196/39555, author="Alexander, Shelley and Seenan, Chris", title="Credibility, Accuracy, and Comprehensiveness of Readily Available Internet-Based Information on Treatment and Management of Peripheral Artery Disease and Intermittent Claudication: Review", journal="J Med Internet Res", year="2022", month="Oct", day="17", volume="24", number="10", pages="e39555", keywords="peripheral artery disease", keywords="intermittent claudication", keywords="health information", keywords="education", keywords="internet", keywords="eHealth", keywords="digital health", abstract="Background: Peripheral artery disease (PAD) affects millions of people worldwide, and a core component of management of the condition is self-management. The internet is an important source of health information for many people. However, the content of websites regarding treatment recommendations for PAD has not been fully evaluated. Objective: This study aimed to assess the credibility, accuracy, and comprehensiveness of websites found via a common search engine, by comparing the content to current guidelines for treatment and management of PAD and intermittent claudication (IC). Methods: A review of websites from hospitals, universities, governments, consumer organizations, and professional associations in the United States and the United Kingdom was conducted. Website recommendations for the treatment of PAD and IC were coded in accordance with the guidelines of the National Institute for Health and Care Excellence (NICE) and the American Heart Association (AHA). Primary outcomes were website credibility (4-item Journal of the American Medical Association benchmark), website accuracy (in terms of the percentage of accurate recommendations), and comprehensiveness of website recommendations (in terms of the percentage of guideline recommendations that were appropriately covered). Secondary outcomes were readability (Flesch--Kincaid grade level) and website quality (Health On the Net Foundation's code of conduct). Results: After screening, 62 websites were included in this analysis. Only 45\% (28/62) of websites met the credibility requirement by stating they were updated after the NICE guidelines were published. Declaration of authorship and funding and the presence of reference lists were less commonly reported. Regarding accuracy, 81\% (556/685) of website recommendations were deemed accurate on following NICE's and the AHA's recommendations. Comprehensiveness was low, with an average of 40\% (25/62) of guideline treatment recommendations being appropriately covered by websites. In most cases, readability scores revealed that the websites were too complex for web-based consumer health information. Conclusions: Web-based information from reputable sources about the treatment and management of PAD and IC are generally accurate but have low comprehensiveness, credibility, and readability. ", doi="10.2196/39555", url="https://www.jmir.org/2022/10/e39555", url="http://www.ncbi.nlm.nih.gov/pubmed/36251363" } @Article{info:doi/10.2196/41012, author="Dub{\'e}, Eve and MacDonald, E. Shannon and Manca, Terra and Bettinger, A. Julie and Driedger, Michelle S. and Graham, Janice and Greyson, Devon and MacDonald, E. Noni and Meyer, Samantha and Roch, Genevi{\`e}ve and Vivion, Maryline and Aylsworth, Laura and Witteman, O. Holly and G{\'e}linas-Gascon, F{\'e}lix and Marques Sathler Guimaraes, Lucas and Hakim, Hina and Gagnon, Dominique and B{\'e}chard, Beno{\^i}t and Gramaccia, A. Julie and Khoury, Richard and Tremblay, S{\'e}bastien", title="Understanding the Influence of Web-Based Information, Misinformation, Disinformation, and Reinformation on COVID-19 Vaccine Acceptance: Protocol for a Multicomponent Study", journal="JMIR Res Protoc", year="2022", month="Oct", day="17", volume="11", number="10", pages="e41012", keywords="vaccine hesitancy", keywords="COVID-19", keywords="misinformation", keywords="vaccine decisions", keywords="disinformation", keywords="online", keywords="vaccine", keywords="vaccination", abstract="Background: The COVID-19 pandemic has generated an explosion in the amount of information shared on the internet, including false and misleading information on SARS-CoV-2 and recommended protective behaviors. Prior to the pandemic, web-based misinformation and disinformation were already identified as having an impact on people's decision to refuse or delay recommended vaccination for themselves or their children. Objective: The overall aims of our study are to better understand the influence of web-based misinformation and disinformation on COVID-19 vaccine decisions and investigate potential solutions to reduce the impact of web-based misinformation and disinformation about vaccines. Methods: Based on different research approaches, the study will involve (1) the use of artificial intelligence techniques, (2) a web-based survey, (3) interviews, and (4) a scoping review and an environmental scan of the literature. Results: As of September 1, 2022, data collection has been completed for all objectives. The analysis is being conducted, and results should be disseminated in the upcoming months. Conclusions: The findings from this study will help with understanding the underlying determinants of vaccine hesitancy among Canadian individuals and identifying effective, tailored interventions to improve vaccine acceptance among them. International Registered Report Identifier (IRRID): DERR1-10.2196/41012 ", doi="10.2196/41012", url="https://www.researchprotocols.org/2022/10/e41012", url="http://www.ncbi.nlm.nih.gov/pubmed/36191171" } @Article{info:doi/10.2196/38641, author="Jawad, Danielle and Cheng, Heilok and Wen, Ming Li and Rissel, Chris and Baur, Louise and Mihrshahi, Seema and Taki, Sarah", title="Interactivity, Quality, and Content of Websites Promoting Health Behaviors During Infancy: 6-Year Update of the Systematic Assessment", journal="J Med Internet Res", year="2022", month="Oct", day="7", volume="24", number="10", pages="e38641", keywords="breastfeeding", keywords="bottle feeding", keywords="websites", keywords="web-based platform", keywords="infant food", keywords="readability", keywords="accuracy", keywords="consumer", keywords="health information", keywords="interactivity", keywords="solid food", keywords="quality", keywords="grading", keywords="comprehensibility", keywords="infant", keywords="baby", keywords="babies", keywords="feeding", keywords="food", keywords="eating", keywords="nutrition", keywords="health behavior", keywords="web-based information", keywords="health website", keywords="sleep", keywords="screen time", keywords="rating", abstract="Background: As of 2021, 89\% of the?Australian population are active internet users. Although the internet is widely used, there are concerns about the quality, accuracy, and credibility of health-related websites. A 2015 systematic assessment of infant feeding websites and apps available in Australia found that 61\% of websites were of poor quality and readability, with minimal coverage of infant feeding topics and lack of author credibility. Objective: We aimed to systematically assess the quality, interactivity, readability, and comprehensibility of information targeting infant health behaviors on websites globally and provide an update of the 2015 systematic assessment. Methods: Keywords related to infant milk feeding behaviors, solid feeding behaviors, active play, screen time, and sleep were used to identify websites targeting infant health behaviors on the Google search engine on Safari. The websites were assessed by a subset of the authors using predetermined criteria between July 2021 and February 2022 and assessed for information content based on the Australian Infant Feeding Guidelines and National Physical Activity Recommendations. The Suitability Assessment of Materials, Quality Component Scoring System, the Health-Related Website Evaluation Form, and the adherence to the Health on the Net code were used to evaluate the suitability and quality of information. Readability was assessed using 3 web-based readability tools. Results: Of the 450 websites screened, 66 were included based on the selection criteria and evaluated. Overall, the quality of websites was mostly adequate. Media-related sources, nongovernmental organizations, hospitals, and privately owned websites had the highest median quality scores, whereas university websites received the lowest median score (35\%). The information covered within the websites was predominantly poor: 91\% (60/66) of the websites received an overall score of ?74\% (mean 53\%, SD 18\%). The suitability of health information was mostly rated adequate for literacy demand, layout, and learning and motivation of readers. The median readability score for the websites was grade 8.5, which is higher than the government recommendations (