TY - JOUR AU - Lin, Shuo-Yu AU - Cheng, Xiaolu AU - Zhang, Jun AU - Yannam, Jaya Sindhu AU - Barnes, Andrew J AU - Koch, J Randy AU - Hayes, Rashelle AU - Gimm, Gilbert AU - Zhao, Xiaoquan AU - Purohit, Hemant AU - Xue, Hong PY - 2023 DA - 2023/2/13 TI - Social Media Data Mining of Antitobacco Campaign Messages: Machine Learning Analysis of Facebook Posts JO - J Med Internet Res SP - e42863 VL - 25 KW - tobacco control KW - social media campaign KW - content analysis KW - natural language processing KW - topic modeling KW - social media KW - public health KW - tobacco KW - youth KW - Facebook KW - engagement KW - use KW - smoking AB - Background: Social media platforms provide a valuable source of public health information, as one-third of US adults seek specific health information online. Many antitobacco campaigns have recognized such trends among youth and have shifted their advertising time and effort toward digital platforms. Timely evidence is needed to inform the adaptation of antitobacco campaigns to changing social media platforms. Objective: In this study, we conducted a content analysis of major antitobacco campaigns on Facebook using machine learning and natural language processing (NLP) methods, as well as a traditional approach, to investigate the factors that may influence effective antismoking information dissemination and user engagement. Methods: We collected 3515 posts and 28,125 associated comments from 7 large national and local antitobacco campaigns on Facebook between 2018 and 2021, including the Real Cost, Truth, CDC Tobacco Free (formally known as Tips from Former Smokers, where “CDC” refers to the Centers for Disease Control and Prevention), the Tobacco Prevention Toolkit, Behind the Haze VA, the Campaign for Tobacco-Free Kids, and Smoke Free US campaigns. NLP methods were used for content analysis, including parsimonious rule–based models for sentiment analysis and topic modeling. Logistic regression models were fitted to examine the relationship of antismoking message-framing strategies and viewer responses and engagement. Results: We found that large campaigns from government and nonprofit organizations had more user engagements compared to local and smaller campaigns. Facebook users were more likely to engage in negatively framed campaign posts. Negative posts tended to receive more negative comments (odds ratio [OR] 1.40, 95% CI 1.20-1.65). Positively framed posts generated more negative comments (OR 1.41, 95% CI 1.19-1.66) as well as positive comments (OR 1.29, 95% CI 1.13-1.48). Our content analysis and topic modeling uncovered that the most popular campaign posts tended to be informational (ie, providing new information), where the key phrases included talking about harmful chemicals (n=43, 43%) as well as the risk to pets (n=17, 17%). Conclusions: Facebook users tend to engage more in antitobacco educational campaigns that are framed negatively. The most popular campaign posts are those providing new information, with key phrases and topics discussing harmful chemicals and risks of secondhand smoke for pets. Educational campaign designers can use such insights to increase the reach of antismoking campaigns and promote behavioral changes. SN - 1438-8871 UR - https://www.jmir.org/2023/1/e42863 UR - https://doi.org/10.2196/42863 UR - http://www.ncbi.nlm.nih.gov/pubmed/36780224 DO - 10.2196/42863 ID - info:doi/10.2196/42863 ER -