Search Articles

View query in Help articles search

Search Results (1 to 10 of 13 Results)

Download search results: CSV END BibTex RIS


Public Attitudes Toward Violence Against Doctors: Sentiment Analysis of Chinese Users

Public Attitudes Toward Violence Against Doctors: Sentiment Analysis of Chinese Users

It is calculated as follows: The TF-IDF weights are ultimately formed by the product of TF and IDF: LDA is a probabilistic graphical model to identify hidden thematic structures within text collections. The model was proposed by Blei et al [38] in 2003 and has become a classic method for topic modeling in text mining. The LDA model extracts potential topics from documents or corpora using a bag-of-words approach.

Yuwen Zheng, Meirong Tian, Jingjing Chen, Lei Zhang, Jia Gao, Xiang Li, Jin Wen, Xing Qu

JMIR Med Inform 2025;13:e63772

Analyzing Themes, Sentiments, and Coping Strategies Regarding Online News Coverage of Depression in Hong Kong: Mixed Methods Study

Analyzing Themes, Sentiments, and Coping Strategies Regarding Online News Coverage of Depression in Hong Kong: Mixed Methods Study

To uncover the subthemes underneath the online news for RQ1, LDA was first performed on the combined dataset to investigate the “hidden” thematic topics. The distribution of these topics will be then examined in each dataset separately to allow a more nuanced understanding.

Sihui Chen, Cindy Sing Bik Ngai, Cecilia Cheng, Yangna Hu

J Med Internet Res 2025;27:e66696

Cross-Cultural Sense-Making of Global Health Crises: A Text Mining Study of Public Opinions on Social Media Related to the COVID-19 Pandemic in Developed and Developing Economies

Cross-Cultural Sense-Making of Global Health Crises: A Text Mining Study of Public Opinions on Social Media Related to the COVID-19 Pandemic in Developed and Developing Economies

One of the most important types of these algorithms is latent Dirichlet allocation (LDA) [25]. The LDA model is a generative model used in natural language study, which allows for extracting topics from a set of source documents. Each document is considered a group of words that form one or more latent topics when combined [26]. A particular distribution of terms characterizes each topic. LDA's generative process is based on analyzing the text's data (text mining).

Adham Kahlawi, Firas Masri, Wasim Ahmed, Josep Vidal-Alaball

J Med Internet Res 2025;27:e58656

Integrating Patient-Generated Digital Data Into Mental Health Therapy: Mixed Methods Analysis of User Experience

Integrating Patient-Generated Digital Data Into Mental Health Therapy: Mixed Methods Analysis of User Experience

Open-ended questions relating to dashboard experience were also analyzed using latent Dirichlet allocation (LDA) [14] to identify themes and keywords. LDA is a probabilistic topic modeling technique used to identify clusters of co-occurring words in text data. We used MALLET implementation [15] of LDA to generate topics in an unsupervised manner.

Lauren Southwick, Meghana Sharma, Sunny Rai, Rinad S Beidas, David S Mandell, David A Asch, Brenda Curtis, Sharath Chandra Guntuku, Raina M Merchant

JMIR Ment Health 2024;11:e59785

Chinese Public Attitudes and Opinions on Health Policies During Public Health Emergencies: Sentiment and Topic Analysis

Chinese Public Attitudes and Opinions on Health Policies During Public Health Emergencies: Sentiment and Topic Analysis

This study used the Latent Dirichlet Allocation (LDA) model for topic modeling. LDA is the most widely used topic model at the time and could be used to identify the most common topics across social media platforms [30]. LDA is an unsupervised learning algorithm that does not require prelabeled datasets, making it suitable for handling large and diverse text data such as Weibo posts [31].

Min Liu, Shuo Yuan, Bingyan Li, Yuxi Zhang, Jia Liu, Cuixia Guan, Qingqing Chen, Jiayi Ruan, Lunfang Xie

J Med Internet Res 2024;26:e58518

Mapping the Bibliometrics Landscape of AI in Medicine: Methodological Study

Mapping the Bibliometrics Landscape of AI in Medicine: Methodological Study

Drawing from the metadata of publications retrieved from Pub Med, we used latent Dirichlet allocation (LDA) topic modeling—an established method in various academic research fields, including technology management, computer science, and biomedicine [32-34]—to discern shifts in research areas within individual AI domains. Our LDA topic modeling, conducted in Python using the Gensim library, used unsupervised machine learning to analyze vast quantities of unstructured data.

Jin Shi, David Bendig, Horst Christian Vollmar, Peter Rasche

J Med Internet Res 2023;25:e45815

Results and Methodological Implications of the Digital Epidemiology of Prescription Drug References Among Twitter Users: Latent Dirichlet Allocation (LDA) Analyses

Results and Methodological Implications of the Digital Epidemiology of Prescription Drug References Among Twitter Users: Latent Dirichlet Allocation (LDA) Analyses

Since LDA identifies patterns via co-occurrences of different words, we are especially interested in words that have at least one independent semantic meaning. Thus, words that are punctuation marks, stop words, and hyperlinks were removed from each document. After those words were removed, we ran the LDA topic model to find the underlying topic model structure for each corpus. More formally, LDA is defined as a generative probabilistic model of a corpus [24].

Maria A Parker, Danny Valdez, Varun K Rao, Katherine S Eddens, Jon Agley

J Med Internet Res 2023;25:e48405

Evolution of Public Attitudes and Opinions Regarding COVID-19 Vaccination During the Vaccine Campaign in China: Year-Long Infodemiology Study of Weibo Posts

Evolution of Public Attitudes and Opinions Regarding COVID-19 Vaccination During the Vaccine Campaign in China: Year-Long Infodemiology Study of Weibo Posts

We used latent Dirichlet allocation (LDA), a popular topic modeling algorithm proposed by Blei et al in 2003 [38], to extract public opinions from posts. LDA is an unsupervised text classification algorithm based on a 3-layer Bayesian structure that maps the relationship between topics, documents (posts in this study), and words [39]. First, we sorted all the posts chronologically to show how public opinion on COVID-19 vaccination evolved.

Yimin Hong, Fang Xie, Xinyu An, Xue Lan, Chunhe Liu, Lei Yan, Han Zhang

J Med Internet Res 2023;25:e42671

Uncovering the Reasons Behind COVID-19 Vaccine Hesitancy in Serbia: Sentiment-Based Topic Modeling

Uncovering the Reasons Behind COVID-19 Vaccine Hesitancy in Serbia: Sentiment-Based Topic Modeling

Compared to LDA, which gives more general descriptions of broader topics [46], the architecture of NMF enables it to find more detailed, clear-cut, and coherent topics [37,46,47]. Chen et al [18] even claim that NMF can learn from data similarly to the way humans do, which makes its results more easily interpretable than in the case of LDA.

Adela Ljajić, Nikola Prodanović, Darija Medvecki, Bojana Bašaragin, Jelena Mitrović

J Med Internet Res 2022;24(11):e42261

Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review

Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review

Several practical guides have been published [14-17] that broadly outline several different ways to approach LDA, using a variety of packages. Broadly, training an LDA model involves 3 major steps: data selection, data preprocessing, and data analysis (Figure 1). However, these are not prescriptive, and individual applications of LDA may involve iterations of these steps. Summary of latent Dirichlet allocation (LDA) data selection, preprocessing, and analysis steps.

Lauryn J Hagg, Stephanie S Merkouris, Gypsy A O’Dea, Lauren M Francis, Christopher J Greenwood, Matthew Fuller-Tyszkiewicz, Elizabeth M Westrupp, Jacqui A Macdonald, George J Youssef

J Med Internet Res 2022;24(11):e33166