Published on in Vol 27 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/60948, first published .
Perceptions in 3.6 Million Web-Based Posts of Online Communities on the Use of Cancer Immunotherapy: Data Mining Using BERTopic

Perceptions in 3.6 Million Web-Based Posts of Online Communities on the Use of Cancer Immunotherapy: Data Mining Using BERTopic

Perceptions in 3.6 Million Web-Based Posts of Online Communities on the Use of Cancer Immunotherapy: Data Mining Using BERTopic

Original Paper

1School of Pharmacy, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR, China

2Department of Clinical Oncology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR, China

3Data Science Program, Worcester Polytechnic Institute, Worcester, MA, United States

*these authors contributed equally

Corresponding Author:

Yin Ting Cheung, PhD

School of Pharmacy, Faculty of Medicine

The Chinese University of Hong Kong

8th Floor, Lo Kwee-Seong Integrated Biomedical Sciences Building, Area 39

Shatin, N.T.

Hong Kong SAR

China

Phone: 852 3943 6833

Email: yinting.cheung@cuhk.edu.hk


Background: Immunotherapy has become a game changer in cancer treatment. The internet has been used by patients as a platform to share personal experiences and seek medical guidance. Despite the increased utilization of immunotherapy in clinical practice, few studies have investigated the perceptions about its use by analyzing social media data.

Objective: This study aims to use BERTopic (a topic modeling technique that is an extension of the Bidirectional Encoder Representation from Transformers machine learning model) to explore the perceptions of online cancer communities regarding immunotherapy.

Methods: A total of 4.9 million posts were extracted from Facebook, Twitter, Reddit, and 16 online cancer-related forums. The textual data were preprocessed by natural language processing. BERTopic modeling was performed to identify topics from the posts. The effectiveness of isolating topics from the posts was evaluated using 3 metrics: topic diversity, coherence, and quality. Sentiment analysis was performed to determine the polarity of each topic and categorize them as positive or negative. Based on the topics generated through topic modeling, thematic analysis was conducted to identify themes associated with immunotherapy.

Results: After data cleaning, 3.6 million posts remained for modeling. The highest overall topic quality achieved by BERTopic was 70.47% (topic diversity: 87.86%; topic coherence: 80.21%). BERTopic generated 14 topics related to the perceptions of immunotherapy. The sentiment score of around 0.3 across the 14 topics suggested generally positive sentiments toward immunotherapy within the online communities. Six themes were identified, primarily covering (1) hopeful prospects offered by immunotherapy, (2) perceived effectiveness of immunotherapy, (3) complementary therapies or self-treatments, (4) financial and mental impact of undergoing immunotherapy, (5) impact on lifestyle and time schedules, and (6) side effects due to treatment.

Conclusions: This study provides an overview of the multifaceted considerations essential for the application of immunotherapy as a therapeutic intervention. The topics and themes identified can serve as supporting information to facilitate physician-patient communication and the decision-making process. Furthermore, this study also demonstrates the effectiveness of BERTopic in analyzing large amounts of data to identify perceptions underlying social media and online communities.

J Med Internet Res 2025;27:e60948

doi:10.2196/60948

Keywords



In recent years, immunotherapy has emerged as one of the most promising therapeutic approaches for treating cancer. It works by activating the innate immune system to identify and attack cancer cells. Immunotherapy encompasses various strategies, including immune checkpoint inhibitors, T-cell transfer therapy, monoclonal antibodies, treatment vaccines, and immune system modulators [Immunotherapy to treat cancer. National Cancer Institute. 2019. URL: https://www.cancer.gov/about-cancer/treatment/types/immunotherapy [accessed 2024-03-04] 1]. These immunotherapeutic strategies have received approval for the treatment of several cancer types such as lung cancer, prostate cancer, chronic lymphocytic leukemia, and non-Hodgkin lymphoma [Miller KD, Nogueira L, Devasia T, Mariotto AB, Yabroff KR, Jemal A, et al. Cancer treatment and survivorship statistics, 2022. CA Cancer J Clin. 2022;72(5):409-436. [FREE Full text] [CrossRef] [Medline]2]. Immunotherapy offers substantial benefits in terms of precision, specificity, and long-term survival improvements, representing a significant breakthrough in cancer treatment [Tan S, Li D, Zhu X. Cancer immunotherapy: Pros, cons and beyond. Biomed Pharmacother. 2020;124:109821. [FREE Full text] [CrossRef] [Medline]3].

Immunotherapy has made remarkable progress and demonstrated clinical value. However, there are drawbacks that may hinder its clinical use and acceptance by patients with cancer. One notable limitation is the variability in individuals’ responses to immunotherapy. Although the treatment may be effective in some patients with specific types of cancers, it may not be as effective for others with the same cancer types [Chen DS, Mellman I. Elements of cancer immunity and the cancer-immune set point. Nature. 2017;541(7637):321-330. [CrossRef] [Medline]4]. Besides, immunotherapy-related adverse events (irAEs) have been observed with the increasing frequency and duration of immunotherapy usage. Patients’ decisions to undergo immunotherapy are influenced by a range of factors, including their perceptions of its efficacy, side effects, procedural aspects, costs, their levels of knowledge about the treatment, and the comprehensiveness of advice provided by health care providers [Bien DR, Danner M, Vennedey V, Civello D, Evers SM, Hiligsmann M. Patients' preferences for outcome, process and cost attributes in cancer treatment: a systematic review of discrete choice experiments. Patient. 2017;10(5):553-565. [FREE Full text] [CrossRef] [Medline]5,Ihrig A, Richter J, Bugaj TJ, Friederich H, Maatouk I. Between hope and reality: How oncology physicians and information providers of a cancer information service manage patients' expectations for and experiences with immunotherapies. Patient Educ Couns. 2023;109:107622. [CrossRef] [Medline]6]. These findings highlight the importance of understanding patients’ perspectives regarding immunotherapy to manage the uncertainties faced by patients and to make informed decisions on whether to proceed with immunotherapy.

The internet has become an indispensable source of health information for patients [Hesse BW, Greenberg AJ, Rutten LJF. The role of internet resources in clinical oncology: promises and challenges. Nat Rev Clin Oncol. 2016;13(12):767-776. [CrossRef] [Medline]7]. Many patients scour the internet for medical guidance, share their personal experiences, and interact on social media platforms [Iftikhar R, Abaalkhail B. Health-seeking influence reflected by online health-related messages received on social media: cross-sectional survey. J Med Internet Res. 2017;19(11):e382. [FREE Full text] [CrossRef] [Medline]8]. The internet, therefore, provides a valuable platform for capturing diverse perspectives. Several studies have explored these perspectives by collecting data from social media platforms and forums [Jenei K, Burgess M, Peacock S, Raymakers AJN. Experiences and perspectives of individuals accessing CAR-T cell therapy: A qualitative analysis of online Reddit discussions. J Cancer Policy. 2021;30:100303. [CrossRef] [Medline]9-Chang A, Xian X, Liu MT, Zhao X. Health communication through positive and solidarity messages amid the COVID-19 pandemic: automated content analysis of Facebook uses. Int J Environ Res Public Health. 2022;19(10):6159. [FREE Full text] [CrossRef] [Medline]13]. For example, through analyzing posts from social media, a study revealed that pain and fatigue were the most commonly discussed symptoms among patients with non–small-cell lung cancer regarding the use of immunotherapy [Booth A, Manson S, Halhol S, Merinopoulou E, Raluy-Callado M, Hareendran A, et al. Using health-related social media to understand the experiences of adults with lung cancer in the era of immuno-oncology and targeted therapies: observational study. JMIR Cancer. 2023;9:e45707. [FREE Full text] [CrossRef] [Medline]10]. Another study found that 55% of patients’ posts and 37% of caregivers’ posts expressed positive perceptions of immunotherapy for treating advanced bladder cancer [Renner S, Loussikian P, Foulquié P, Marrel A, Barbier V, Mebarki A, et al. Patient and caregiver perceptions of advanced bladder cancer systemic treatments: infodemiology study based on social media data. JMIR Cancer. 2023;9:e45011. [FREE Full text] [CrossRef] [Medline]11]. Notably, these previous studies usually extracted posts from a single site and analyzed only a small portion of the available posts. Moreover, these studies primarily focused on patients with specific cancer types, and thus, the findings may not be generalizable to patients with other cancer types for which immunotherapy has been approved [Rodrigues A, Chauhan J, Sagkriotis A, Aasaithambi S, Montrone M. Understanding the lived experience of lung cancer: a European social media listening study. BMC Cancer. 2022;22(1):475. [FREE Full text] [CrossRef] [Medline]12].

One challenge in analyzing data from the internet and social media posts is dealing with vast amounts of unstructured text, which necessitates extensive preprocessing measures prior to analysis. Different machine learning techniques have been explored to increase the efficiency of online text analysis. BERTopic, a topic modeling technique that is an extension of the Bidirectional Encoder Representation from Transformers (BERT) machine learning model, has been used to identify the underlying public perceptions within social media posts. It has demonstrated the capability to unveil latent patterns and extract topics from large datasets [Grootendorst M. BERTopic: Neural topic modeling with a class-based TF-IDF procedure. ArXiv. 2022:1. [CrossRef]14-Ng QX, Yau CE, Lim YL, Wong LKT, Liew TM. Public sentiment on the global outbreak of monkeypox: an unsupervised machine learning analysis of 352,182 twitter posts. Public Health. 2022;213:1-4. [FREE Full text] [CrossRef] [Medline]16]. Compared to other models, BERT models provide a stronger understanding of the contextual meaning of each word in document representations, which is attributable to their bidirectional training of transformers [Devlin J, Chang M, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. Human Language Technologies; 2019. Presented at: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies; June 2-7; Minneapolis, Minnesota. URL: https://aclanthology.org/N19-1423/17]. This enables the generation of more accurate topics than those generated using traditional statistical models. Moreover, the BERT model is continuously trained and updated by researchers, ensuring ongoing improvements [Nayak P. Understanding searches better than ever before. Google. URL: https://blog.google/products/search/search-language-understanding-bert/ [accessed 2024-03-25] 18]. However, no study has employed this technique to explore the viewpoints of the online cancer community on immunotherapy.

With the objective of understanding the perspectives of online cancer communities regarding immunotherapy, this study used BERTopic to analyze a large number of posts extracted from multiple social media and online forums. The findings of this study may offer valuable insights for clinicians, patients, and researchers to enhance decision-making processes when considering immunotherapy as a treatment option.


Overview

This was a retrospective study that involved analyzing texts collected from social media platforms and online forums. The study proceeded through 4 primary methodological phases: (1) collecting textual data, (2) cleaning and processing the extracted texts, (3) performing and optimizing topic modeling, and (4) conducting thematic analysis. The study process workflow is illustrated in Figure 1.

Figure 1. Flowchart depicting the data mining process. BERTopic: a topic modeling technique that is an extension of the Bidirectional Encoder Representation from Transformers; cTF-IDF: class-based term frequency-inverse document frequency.

Ethics Approval

This study exclusively utilizes anonymized data from publicly available sources. The quotes presented in this report are modified or paraphrased to prevent any potential identification and direct linkage to the original posts/quotes. All identifiers or pseudo identifiers have been removed from the report. This study was approved by the Survey and Behavioral Research Ethics Committee of the Chinese University of Hong Kong (reference SBRE-20-864).

Data Sources and Collection

For this study, textual data were collected from online forums and social media platforms. For consistency and minimal translation concerns, only English-language texts were included. The posts from these platforms dated before November 15, 2022, were included, with varying start dates. For social media platforms, posts were extracted from Facebook, Twitter, and Reddit. These platforms were chosen for their popularity and the substantial textual content they contain [Wagner J. The most popular social networks of 2021. Ignite Social Media. URL: https://www.ignitesocialmedia.com/twitter-marketing/the-most-popular-social-networks-of-2021/ [accessed 2023-09-28] 19]. For cancer-related online forums, the selection criteria have been detailed in a previous study [Lam CS, Zhou K, Loong HH, Chung VC, Ngan C, Cheung YT. The use of traditional, complementary, and integrative medicine in cancer: data-mining study of 1 million web-based posts from health forums and social media platforms. J Med Internet Res. 2023;25:e45408. [FREE Full text] [CrossRef] [Medline]20]. To describe briefly, we first used the top search engines (Google, Microsoft Bing, Yahoo, and Yandex) to search for health forums by using different combinations of search terms (cancer, tumor/tumour, patient, and forum) [Search engine market share worldwide. Statcounter GlobalStats. URL: https://gs.statcounter.com/search-engine-market-share [accessed 2024-03-14] 21]. The search results were extracted and examined by the investigators. Web forums were included if they (1) appeared in more than one search result, (2) were open access (ie, no membership or passwords were needed to access the messages), (3) were active for at least the past 5 years, (4) had at least 10 messages posted to the group within the past 30 days from the date of the search, and (5) enabled web scraping of posts or feeds using Python (Python Software Foundation) or R (R Foundation for Statistical Computing). Finally, 16 different online cancer-related forums, including Cancer Chat, Cancer Survivors Network, Cancer Council Online Community, and other forums, were included for the analysis (Table 1). Search terms related to immunotherapy were identified from multiple authoritative websites, including the American Cancer Society, Cancer Research UK, and the National Cancer Institute (Table S1 in

Multimedia Appendix 1

Additional data.

DOCX File , 27 KBMultimedia Appendix 1). Site-provided search engines were utilized to identify related posts.

Table 1. List of the online health forums selected for data extraction [Cancer chat. Cancer Research UK. URL: https://www.cancerresearchuk.org/about-cancer/cancer-chat [accessed 2024-01-13] 22-HealthUnlocked. URL: https://healthunlocked.com/search/communities?query=cancer [accessed 2024-01-13] 37].
Name of web forumForum start datea
Cancer Chat [Cancer chat. Cancer Research UK. URL: https://www.cancerresearchuk.org/about-cancer/cancer-chat [accessed 2024-01-13] 22]2008
Jo\'s Cervical Cancer Trust Forum [Jo's Cervical Cancer Trust Forum. URL: https://forumstaging.jostrust.org.uk [accessed 2024-01-13] 23]2004
Prostate Cancer UK [Prostate Cancer UK. URL: https://community.prostatecanceruk.org/alltopics [accessed 2024-01-13] 24]2014
Bowel Cancer UK Community [Bowel Cancer UK. URL: https://community.bowelcanceruk.org.uk/forum/ [accessed 2024-01-13] 25]2015
Cancer Survivors Network [Cancer Survivors Network. URL: https://csn.cancer.org [accessed 2024-01-13] 26]2000
Breastcancer.org community [The Breastcancer.org Community Forum. URL: https://community.breastcancer.org/ [accessed 2024-01-13] 27]2004
Pancreatic Cancer UK Forum [Pancreatic Cancer UK. URL: https://forum.pancreaticcancer.org.uk/ [accessed 2024-01-13] 28]2007
Breast Cancer Now Forum [Breast Cancer Now Forum. URL: https://forum.breastcancernow.org/ [accessed 2024-01-13] 29]2005
Cancer Council Online Community [Cancer Council Online Community. URL: https://onlinecommunity.cancercouncil.com.au/ [accessed 2024-01-13] 30]2009
Irish Cancer Society Community [Irish Cancer Society Community. URL: https://www.cancer.ie/community [accessed 2024-01-13] 31]2008
Navigating Care [Navigating Care. URL: https://www.navigatingcare.com [accessed 2024-01-13] 32]2009
TC Cancer.com [Testicular Cancer Society. URL: http://www.tc-cancer.com/forum/# [accessed 2024-01-13] 33]2004
HealthBoards (cancer) [HealthBoards (cancer). URL: https://www.healthboards.com/boards/cancers/ [accessed 2024-01-13] 34]2000
Melanoma Patient Forum [Melanoma Patient Forum. URL: https://forum.melanoma.org/ [accessed 2024-01-13] 35]2010
Macmillan Cancer Support [Macmillan Cancer Support. URL: https://community.macmillan.org.uk/g [accessed 2024-01-13] 36]2008
HealthUnlocked [HealthUnlocked. URL: https://healthunlocked.com/search/communities?query=cancer [accessed 2024-01-13] 37] 2013

aIn cases where the start dates of certain online health forums were not explicitly provided, an estimation was made based on the published date of the first post available.

Data Preprocessing

Spark NLP, a natural language processing library for Python built on top of Apache Spark, was utilized to preprocess the textual data [Kocaman V, Talby D. Spark NLP: Natural language understanding at scale. Software Impacts. 2021;8:100058. [CrossRef]38]. The texts extracted from the forums and social media sites were subjected to a thorough cleaning process to enhance their suitability for further analysis. The cleaning process involved multiple steps, including checking and correcting spelling errors, lemmatization, removal of stop words, and elimination of short entries (Figure 1). Each text message was first spell-checked, and the vocabulary was reduced to its base root form by using Spark NLP. After that, each word vocabulary was reduced to alphanumeric characters that contained only letters from “a to z,” “A to Z,” and “0 to 9.” For a precise and semantically meaningful text corpus, any message composed of fewer than 4 words as well as commonly occurring words (eg, the, a, and, in) that carried little or no meaning in the message were removed. Finally, the cleaned text messages were processed using the medical language models from Healthcare Spark NLP, a pretrained pipeline to recognize medical terminologies within the messages [Health care NLP state of the art medical language models. John Snow Labs. URL: https://www.johnsnowlabs.com/health [accessed 2024-03-06] 39].

Topic Modeling

The next step involved the extraction of topics from the data. Topic modeling, an unsupervised machine-learning approach, was employed to unveil hidden semantic structures and extract distinct topics from the extracted textual data [Saracco BH. Data science and predictive analytics: biomedical and health applications using R. Journal of the Medical Library Association. 2020;108(2):334. [CrossRef]40]. In this study, we first evaluated 10 common topic modeling techniques, namely, Latent Dirichlet Allocation, Latent Semantic Analysis, Non-Negative Matrix Factorization, Principal Component Analysis, Random Project, K-Means, Top2Vec, BERT + K-Means, BERT, and Latent Dirichlet Allocation + BERT to determine the appropriate topic modeling approach. Topic diversity (indicating the model’s ability to capture differences between generated topics), coherence (the frequency of descriptive words of the topic within each cluster), and quality (topic diversity multiplied by topic coherence) were used as metrics to evaluate the effectiveness of isolating topics from the texts [Dieng AB, Ruiz FJR, Blei DM. Topic modeling in embedding spaces. Transactions of the Association for Computational Linguistics. 2020;8:439-453. [CrossRef]41,Mimno D, Wallach H, Talley E, Leenders M, McCallum A. Optimizing semantic coherence in topic models. 2011. Presented at: Proceedings of the conference on empirical methods in natural language processing; July 27-31; Edinburgh, Scotland, UK. URL: https://aclanthology.org/D11-102442]. Finally, we used BERTopic to extract quality topics from the data based on the modelling performance. All texts were passed through the BERT model to create embeddings that converted text messages into numerical representations by using BERTopic. The dimensions of the embedding vectors were reduced due to the “Curse of Dimensionality” [Grootendorst M. BERTopic. 2022. URL: https://github.com/MaartenGr/BERTopic [accessed 2023-09-01] 43]. The bisecting K-means algorithm was used to group the reduced-dimension embedding vectors into clusters until optimal silhouette scores were achieved [Krishna BSV, Satheesh P, SuneelKumar R. Comparative study of k-means and bisecting k-means techniques in wordnet based document clustering. Semantic Scholar. 2012. URL: https://api.semanticscholar.org/CorpusID:212499046 [accessed 2024-03-13] 44]. Various numbers of topics, ranging from 5 to 25, were tested to determine the optimal metrics.

Following the identification of clusters with the highest topic quality, the 20 most frequent words from each topic were extracted. BERTopic modeling was conducted using the Worcester Polytechnic Institute’s Turing cluster, which has high computing power for large volumes of data processing by using multiple physical computers simultaneously. The Turing cluster consists of a 4-node hyperconverged head that controls 79 compute nodes and is located at Worcester Polytechnic Institute. The total central processing unit/random access memory/graphics processing unit counts across all computer nodes were 5224, 49 TB, and 84, respectively [High performance computing. Worcester Polytechnic Institute. URL: https://arc.wpi.edu/computing/hpc-clusters/ [accessed 2024-06-17] 45].

Sentiment and Thematic Analysis

Sentiment analysis was performed using 2 popular libraries, VADER (Valence Aware Dictionary and Sentiment Reasoner) and TextBlob [Bonta V, Kumaresh N, Janardhan N. A comprehensive study on lexicon based approaches for sentiment analysis. AJCST. 2019;8(S2):1-6. [CrossRef]46,Aljedaani W, Rustam F, Mkaouer MW, Ghallab A, Rupapara V, Washington PB, et al. Sentiment analysis on Twitter data integrating TextBlob and deep learning models: The case of US airline industry. Knowledge-Based Systems. 2022;255:109780. [CrossRef]47]. Both libraries were used to enhance the accuracy of the results and minimize potential biases that may arise from relying on a single library. The sentiment scores obtained from each library were then averaged to derive a final sentiment score, which ranged from –1 (indicating a very negative sentiment) to 1 (indicating a very positive sentiment). Based on the topics generated through BERTopic modeling, thematic analysis was conducted by 2 investigators (XW and CSL) to identify themes associated with immunotherapy. The thematic analysis in this study involved the following steps: familiarization with the posts, inductive analysis, theme generation, and theme review among domain experts in the research team (including oncologists and oncology pharmacists) [Gonzalez G, Vaculik K, Khalil C, Zektser Y, Arnold C, Almario CV, et al. Using large-scale social media analytics to understand patient perspectives about urinary tract infections: thematic analysis. J Med Internet Res. 2022;24(1):e26781. [CrossRef]48]. First, 2 investigators (XW and CSL) familiarized with the posts and encoded them independently. Then, the codes were compared, the same codes were merged, and any discrepancies were addressed through discussion with a third investigator (YTC). The themes were generated through the identification and refinement of the codes, which were grouped by perceived thematic similarities. The final themes were identified by achieving consensus among domain experts. The consensus process involved multiple rounds. The experts rated their level of agreement for each theme: themes with 75% agreement were retained, and themes without consensus were discussed, adjusted, and rated in the next round until 75% agreement was reached.


Overview

A total of 4.9 million posts were gathered from online forums and social media platforms. After data cleaning, 3.6 million posts remained for further analysis (Figure 1).

Number and Quality of the Topics

In comparison with other topic modeling techniques, BERTopic demonstrated the most satisfactory performance, with a topic diversity score of 81.11%, a topic coherence score of 80.48%, and a topic quality score of 65.28% (Table S2 in

Multimedia Appendix 1

Additional data.

DOCX File , 27 KBMultimedia Appendix 1). Table S3 in

Multimedia Appendix 1

Additional data.

DOCX File , 27 KB
Multimedia Appendix 1
presents BERTopic’s performance across different numbers of topics to determine the optimal value. Fifteen topics achieved the highest overall topic quality, scoring 70.47% (with the highest topic diversity at 87.86% and a high topic coherence of 80.21%). We also used these metrics to assess the quality of individual topics generated by the model, yielding topic quality scores ranging from 78% to 88% (Figure S1 in

Multimedia Appendix 2

The distribution of topic quality scores.

PNG File , 249 KB
Multimedia Appendix 2
). One topic (Topic 15) was excluded due to the mismatch between its extracted words and the topics related to immunotherapy. Based on the sentiment analysis results, the average sentiment score of around 0.3 across the 14 topics preliminarily suggested generally positive sentiments toward immunotherapy within the online community (Figure 2). A more comprehensive interpretation of the specific topics is presented in the subsequent sections.

Figure 2. Sentiment analysis results. Sentiment score (a measure of how positive or negative the tone of the quotes is) was analyzed for each topic. The sentiment score ranged from –1 (indicating a very negative sentiment) to 1 (indicating a very positive sentiment). N/A: not applicable; T: topic.

Themes and Topics

A total of 6 themes were identified (Table 2). The quotes in Table 2 have been modified to prevent any potential identification and direct linkage to the original posts. Theme 1 centered on the feelings of hope and positivity among patients treated with immunotherapy. This theme highlighted the belief that immunotherapy offers hope for patients with cancer to lead normal lives and provides a potential cure for cancer with minimal side effects. Theme 2 described immunotherapy as a more effective treatment option after other treatments have failed. Patients may have tried other cancer treatments such as chemotherapy and radiotherapy but only found successful results with immunotherapy. Besides, patients believe combining immunotherapy with other conventional therapies, particularly chemotherapy, can lead to higher efficacy in treating cancer than using a single treatment approach. Theme 3 suggested that patients receiving immunotherapy also explore diets or complementary medicines to manage their symptoms or side effects from treatment or the cancer itself. For example, patients may employ mind-body practices such as massage and meditation or use herbal medicines to alleviate certain physical and psychological side effects caused by immunotherapy. Probiotics were frequently mentioned by users as a potential way to boost the effectiveness of immunotherapy. In Theme 4, the potential ineffectiveness, worse outcomes, and high cost of treatment were frequently mentioned among patients, which acted as hindrances to the use of immunotherapy. Theme 5 suggested that immunotherapy can disrupt patients’ normal daily lives, particularly for those enrolled in immunotherapy trials. Additionally, the treatment may negatively impact their activities of daily living, such as taste changes, dietary restrictions, and time consumption. Theme 6 indicated that immunotherapy may cause physical and psychological symptoms and side effects such as rash, inflammation, and nephrotoxicity. These adverse effects may sometimes pose greater challenges compared to those arising from other treatments or the cancer itself. Some patients acknowledged that these symptoms may be unavoidable alongside the curative effects of immunotherapy.

Table 2. Topics and selected posts under each theme.
Theme, topics and related keywordsSamples of modified quotesa
Theme 1: Hopeful prospects offered by immunotherapy

Topic 1: good, time, hope, love, great, feel, work, lot, treatment, start
  • …help not to feel alone during treatment.
  • …hope stop the growth of cancer quickly.
  • …the new treatment offers new hope.

Topic 3: fight, hope, positive, news, treatment, glad, beat, good, feel, life
  • …immunotherapy through clinical trials has shown positive results.
  • …wish to continue with immunotherapy after completing treatment.
  • …use the inert immune system to combat cancer cells.
  • …reduced damage compared with chemotherapy or radiotherapy.

Topic 5: hope, day, love, morning, huge, satisfy, glad, feel, good, love
  • …hope of one day curing all cancers without side effects.
  • …develop immunotherapy tailored to the individual.
  • …the day when immunotherapy will be available to all.
  • …eradicate unique cancer cells

Topic 6: saint, faith, family, love, god, strength, miracle, wonderful, hope, glad
  • …a miracle for patients to keep the cancer at bay.
  • …a miracle to help someone with very advanced diseases.
Theme 2: Perceived effectiveness of immunotherapy

Topic 4: gallbladder, liver, bile, symptom, surgery, scan, remove, problem, ultrasound, test
  • …a tremendous response with significant improvements.
  • …the tumors on my adrenal glands, lungs and liver have shrunk after immunotherapy.
  • …tried immunotherapy as one last hail Mary pass and it seems to be working.

Topic 13: risk, study, benefit, favorable, evidence, effect, patient, data, high, treat
  • …immunotherapy appears to be successful in that the CT scan shows the cancer has shrunk.
  • …have a favorable response against the melanoma.
  • …immunotherapy has given lives back.

Topic 8: antioxidant, radiation, chemo, protect, treatment, dose, effect, radical, kill, interfere
  • …better results with less side effects.
  • …combine chemo with immunotherapy gives the potential for long term remission and even a cure for advanced cancers.
  • …very effective in treating liver tumors.
Theme 3: Complementary therapies or self-treatments

Topic 2: acupuncture, pain, feel, needle, massage, neuropathy, flash, hot, session, side
  • …encouraged to get massage and acupuncture.
  • …had acupuncture to help with side effects of immunotherapy, such as abdominal pain, muscle pain, and severe neuropathy.

Topic 10: keto, diet, food, organic, probiotic, supplement, eat, weight, sugar, body
  • …dependent on good gut bacteria.
  • …anyone undergoing immunotherapy is recommended to take probiotics concurrently.
  • …tried anything else, eg, diet, nutrition, supplements, etc.
Theme 4: Financial and mental impact of immunotherapy

Topic 7: diagnosis, feel, anxiety, traumatic, stress, cost, experience, time, depression, therapist
  • …expensive drugs with nasty side effects.
  • …immunotherapy costs a lot that can’t afford it.
  • …dealing with progression is profoundly stressful.
  • …more depressed and weaker when waiting long for treatment.
Theme 5: Impact on lifestyle and time schedules

Topic 9: simulation, schedule, treatment, appointment, ill, week, start, time, plan, session
  • …involve an eight-week program after the previous treatments.
  • …take a break and/or reduce dosages rather than quit altogether.
  • …spent 5 weeks to decide on a new treatment plan.
  • …don’t have too much flexibility.
Theme 6: Side effects due to treatment

Topic 11: ill, swell, weak, surgery, compression, feel, wear, inflame, tight, back
  • …surprised at how the side effects returned so quickly.
  • …resulted in inflammation of lungs.
  • …the treatments can result in worst side effects than the disease itself.

Topic 12: immune, damage, skin, fever, burn, kill, system, therapy, tissue, body
  • …developed an acute kidney injury.
  • …had a very hard time with auto immune reaction
  • …had several other side effects, but not a rash.

Topic 14: pain, symptom, hope, painful, colitis, arthritis, response, issue, experience, costochondritis
  • …back pain mentioned as a side effect.
  • …have a rash on the back of neck and serious neuropathy.
  • …found a cream that helps along with pain meds.

aThe quotes have been modified or paraphrased to prevent any potential identification and direct linkage to the original posts.


Principal Findings

We analyzed a large dataset comprising 3.6 million posts from 3 social media platforms and 16 online cancer forums, providing a comprehensive overview of perceptions regarding immunotherapy within English-speaking online cancer communities. A cutting-edge natural language processing technique, BERTopic, was used to generate interpretable topics, ensuring robust data analysis. The identification of different themes underscores the diverse attitudes toward immunotherapy across various aspects. The posts show the positive attitudes of the patients toward immunotherapy and its perceived effectiveness and benefits. However, there are also concerns about the side effects of immunotherapy and potential disruptions to patients’ lifestyle. These themes may offer valuable insights into the facilitators and barriers to the adoption of immunotherapy. In addition, the results showed that using machine learning techniques to analyze patient-generated online textual data can enhance our understanding of perspectives on newer cancer treatments, and such information may facilitate communication between patients and clinicians when discussing immunotherapy as a therapeutic option.

Comparison to Current Evidence

The positive factors that encourage the utilization of immunotherapy included its comparatively mild side effect profile and the potential to instill hope in patients by offering a viable treatment option when other methods have failed. Our findings also indicated that online cancer communities seem to be well-informed, with some patients well-versed in the benefits of immunotherapy. Laypeople now access medical information on the internet independently and acquire firsthand information about advances in cancer treatment. Access to online health information assists patients in making health-related decisions, leading to more professional consultation, improved treatment compliance, and better self-care [Thapa DK, Visentin DC, Kornhaber R, West S, Cleary M. The influence of online health information on health decisions: A systematic review. Patient Educ Couns. 2021;104(4):770-784. [CrossRef] [Medline]49]. However, the internet should not be viewed as a replacement for professional health information sources [Jacobs W, Amuta AO, Jeon KC. Health information seeking in the digital age: An analysis of health information seeking behavior among US adults. Cogent Social Sciences. 2017;3(1):1302785. [CrossRef]50]. The guidance and advice of health care professionals remain indispensable in helping patients make complex medical decisions [Palanica A, Flaschner P, Thommandram A, Li M, Fossat Y. Physicians' perceptions of chatbots in health care: cross-sectional web-based survey. J Med Internet Res. 2019;21(4):e12887. [CrossRef] [Medline]51]. Online health information cannot provide detailed diagnoses or personalized treatment plans for patients; moreover, the presence of misinformation on websites might mislead patients about treatment [Suarez-Lledo V, Alvarez-Galvez J. Prevalence of health misinformation on social media: systematic review. J Med Internet Res. 2021;23(1):e17187. [FREE Full text] [CrossRef] [Medline]52]. Taken together, our results highlight the rising health literacy of online cancer communities, especially those involving younger generations. Future efforts should focus on enhancing the accuracy of online health information by using reliable rating tools, effective search engine ranking, and progress in crowdsourcing websites.

Although immunotherapy has shown clinical benefits in several trials, there are still potential barriers to its utilization. The high cost associated with immunotherapy emerges as a frequently discussed concern among patients with cancer, emphasizing the need for revising drug valuations and reimbursement models [Tran G, Zafar SY. Financial toxicity and implications for cancer care in the era of molecular and immune therapies. Ann Transl Med. 2018;6(9):166. [FREE Full text] [CrossRef] [Medline]53]. Many quotes were related to patients’ descriptions of their physical symptoms induced by immunotherapy and how it has impeded their activities of daily living and disrupted their normal lives. In addition to physical irAEs, the long-term use of immunotherapy has also been associated with psychiatric side effects such as fatigue, insomnia, anxiety, and depression [Kovacs D, Kovacs P, Eszlari N, Gonda X, Juhasz G. Psychological side effects of immune therapies: symptoms and pathomechanism. Curr Opin Pharmacol. 2016;29:97-103. [FREE Full text] [CrossRef] [Medline]54,Lacouture M, Sibaud V. Toxic side effects of targeted therapies and immunotherapies affecting the skin, oral mucosa, hair, and nails. Am J Clin Dermatol. 2018;19(S1):31-39. [CrossRef]55]. Hence, it is important for oncology practitioners to manage patients’ expectations and communicate with patients regarding the potential side effects of immunotherapy before initiating treatment. This proactive approach allows patients to better understand the benefits and risks of the treatment, which may prevent them from discontinuing treatment because of known adverse reactions.

Relevance to Clinical Practice

Several clinical practice guidelines have been proposed to address the evaluation and management of immunotherapy-induced toxicity [Haanen J, Obeid M, Spain L, Carbonnel F, Wang Y, Robert C, et al. ESMO Guidelines Committee. Electronic address: clinicalguidelines@esmo.org. Management of toxicities from immunotherapy: ESMO Clinical Practice Guideline for diagnosis, treatment and follow-up. Ann Oncol. 2022;33(12):1217-1238. [FREE Full text] [CrossRef] [Medline]56] and the diagnosis and treatment of toxicity associated with immune checkpoint inhibitor therapy in specific organ systems [Brahmer JR, Lacchetti C, Schneider BJ, Atkins MB, Brassil KJ, Caterino JM, et al. National Comprehensive Cancer Network. Management of immune-related adverse events in patients treated with immune checkpoint inhibitor therapy: American Society of Clinical Oncology Clinical Practice Guideline. J Clin Oncol. 2018;36(17):1714-1768. [FREE Full text] [CrossRef] [Medline]57]. The provision of high-quality supportive care from a multidisciplinary team of health care professionals can help identify and manage irAEs [Rahman MM, Behl T, Islam MR, Alam MN, Islam MM, Albarrati A, et al. Emerging management approach for the adverse events of immunotherapy of cancer. Molecules. 2022;27(12):3798. [FREE Full text] [CrossRef] [Medline]58]. In practice, patients may turn to self-management strategies to cope with these symptoms. We found that some patients mentioned in online posts that they used traditional, complementary, and integrative medicine modalities such as probiotics, herbal remedies, and meditation to self-manage the toxicity induced by immunotherapy. Recently, some studies have evaluated how complementary medicines may enhance the efficacy of immunotherapy and alleviate immune toxicity [Zhang N, Xiao X. Integrative medicine in the era of cancer immunotherapy: Challenges and opportunities. J Integr Med. 2021;19(4):291-294. [CrossRef] [Medline]59,Jia W, Wang L. Using Traditional Chinese Medicine to treat hepatocellular carcinoma by targeting tumor immunity. Evid Based Complement Alternat Med. 2020;2020:9843486. [FREE Full text] [CrossRef] [Medline]60]. For example, the treatment of irAEs often necessitates the long-term use of high-dose corticosteroids [Schneider BJ, Naidoo J, Santomasso BD, Lacchetti C, Adkins S, Anadkat M, et al. Management of immune-related adverse events in patients treated with immune checkpoint inhibitor therapy: ASCO guideline update. JCO. 2021;39(36):4073-4126. [CrossRef]61]. Integrating complementary modalities into the management of irAEs may reduce corticosteroid use. Our group has previously reported that patients with cancer frequently seek advice on the use of complementary medicines on social media platforms [Lam CS, Zhou K, Loong HH, Chung VC, Ngan C, Cheung YT. The use of traditional, complementary, and integrative medicine in cancer: data-mining study of 1 million web-based posts from health forums and social media platforms. J Med Internet Res. 2023;25:e45408. [FREE Full text] [CrossRef] [Medline]20]. At this juncture, clinicians should focus on initiating effective communication regarding the use of complementary medicines and help patients establish realistic expectations while being well-informed about the limited evidence supporting these approaches.

Limitations

Our study has several limitations. First, the data used in this research were collected from English-language online cancer communities, which may limit the generalizability of the findings to data posted in other languages. Social media and cancer forums may be used distinctly in different countries, and the types of immunotherapeutic approaches may also vary among regions. Future studies could conduct a thorough investigation into whether distinct concerns regarding immunotherapy exist among patients from different countries and regions of practice. Second, perceptions regarding immunotherapy on social media and cancer forums are dynamic and may be influenced by current news or events. Therefore, future research can consider exploring the fluctuations in posts on these platforms over varying time periods. Third, although the performance metrics (topic diversity and topic coherence scores) quantified from our models suggest that BERTopic is effective at unveiling latent patterns within large datasets, this approach requires validation. Nevertheless, our study demonstrates that applying artificial intelligence to analyze social media data can facilitate our understanding of patients’ perspectives as well as the effectiveness of machine-learning techniques in processing substantial volumes of data retrieved from social media platforms and online forums.

Conclusions

This study captures comprehensive insights and perspectives of online communities regarding immunotherapy through BERTopic modeling. The identified themes and topics may facilitate the understanding of immunotherapy utilization and serve as valuable supporting information when making treatment decisions. The posts have shown the positive attitudes of the patients and their perceived effectiveness and benefits of immunotherapy. Although immunotherapy presents a beacon of hope and a viable treatment option, its side effects and the accompanying potential lifestyle disruptions cannot be ignored and are major concerns perceived by the online communities. It is essential for clinicians to inform patients of the undesirable effects of immunotherapy and establish realistic expectations, thereby enabling patients to make an informed decision considering the benefits and risks associated with the treatment, which may ultimately lead to more optimal treatment outcomes with immunotherapy.

Acknowledgments

We would like to thank Worcester Polytechnic Institute’s Turing cluster hosted by the Worcester Polytechnic Institute for the implementation of the computing process.

Data Availability

The dataset used in this study is accessible via the following link [Cheung YT. Perception of online communities towards the use of cancer immunotherapy: a data mining study of 3.6 million web-based posts from social media platforms using BERTopic. The Chinese University of Hong Kong. URL: https://researchdata.cuhk.edu.hk/dataset.xhtml?persistentId=doi:10.48668/0NPBOK [accessed 2025-01-21] 62].

Authors' Contributions

XW and CSL conceptualized this study and contributed to the methodology, formal analysis, writing the original draft, review, and editing. KHH, HHFL, and KRZ contributed to conceptualization and review and editing of this paper. Ngan CK contributed to conceptualization, formal analysis, review, and editing. YTC contributed to conceptualization, formal analysis, supervision, review, and editing.

Conflicts of Interest

None declared.

Multimedia Appendix 1

Additional data.

DOCX File , 27 KB

Multimedia Appendix 2

The distribution of topic quality scores.

PNG File , 249 KB

  1. Immunotherapy to treat cancer. National Cancer Institute. 2019. URL: https://www.cancer.gov/about-cancer/treatment/types/immunotherapy [accessed 2024-03-04]
  2. Miller KD, Nogueira L, Devasia T, Mariotto AB, Yabroff KR, Jemal A, et al. Cancer treatment and survivorship statistics, 2022. CA Cancer J Clin. 2022;72(5):409-436. [FREE Full text] [CrossRef] [Medline]
  3. Tan S, Li D, Zhu X. Cancer immunotherapy: Pros, cons and beyond. Biomed Pharmacother. 2020;124:109821. [FREE Full text] [CrossRef] [Medline]
  4. Chen DS, Mellman I. Elements of cancer immunity and the cancer-immune set point. Nature. 2017;541(7637):321-330. [CrossRef] [Medline]
  5. Bien DR, Danner M, Vennedey V, Civello D, Evers SM, Hiligsmann M. Patients' preferences for outcome, process and cost attributes in cancer treatment: a systematic review of discrete choice experiments. Patient. 2017;10(5):553-565. [FREE Full text] [CrossRef] [Medline]
  6. Ihrig A, Richter J, Bugaj TJ, Friederich H, Maatouk I. Between hope and reality: How oncology physicians and information providers of a cancer information service manage patients' expectations for and experiences with immunotherapies. Patient Educ Couns. 2023;109:107622. [CrossRef] [Medline]
  7. Hesse BW, Greenberg AJ, Rutten LJF. The role of internet resources in clinical oncology: promises and challenges. Nat Rev Clin Oncol. 2016;13(12):767-776. [CrossRef] [Medline]
  8. Iftikhar R, Abaalkhail B. Health-seeking influence reflected by online health-related messages received on social media: cross-sectional survey. J Med Internet Res. 2017;19(11):e382. [FREE Full text] [CrossRef] [Medline]
  9. Jenei K, Burgess M, Peacock S, Raymakers AJN. Experiences and perspectives of individuals accessing CAR-T cell therapy: A qualitative analysis of online Reddit discussions. J Cancer Policy. 2021;30:100303. [CrossRef] [Medline]
  10. Booth A, Manson S, Halhol S, Merinopoulou E, Raluy-Callado M, Hareendran A, et al. Using health-related social media to understand the experiences of adults with lung cancer in the era of immuno-oncology and targeted therapies: observational study. JMIR Cancer. 2023;9:e45707. [FREE Full text] [CrossRef] [Medline]
  11. Renner S, Loussikian P, Foulquié P, Marrel A, Barbier V, Mebarki A, et al. Patient and caregiver perceptions of advanced bladder cancer systemic treatments: infodemiology study based on social media data. JMIR Cancer. 2023;9:e45011. [FREE Full text] [CrossRef] [Medline]
  12. Rodrigues A, Chauhan J, Sagkriotis A, Aasaithambi S, Montrone M. Understanding the lived experience of lung cancer: a European social media listening study. BMC Cancer. 2022;22(1):475. [FREE Full text] [CrossRef] [Medline]
  13. Chang A, Xian X, Liu MT, Zhao X. Health communication through positive and solidarity messages amid the COVID-19 pandemic: automated content analysis of Facebook uses. Int J Environ Res Public Health. 2022;19(10):6159. [FREE Full text] [CrossRef] [Medline]
  14. Grootendorst M. BERTopic: Neural topic modeling with a class-based TF-IDF procedure. ArXiv. 2022:1. [CrossRef]
  15. Ng QX, Lee DYX, Yau CE, Lim YL, Liew TM. Public perception on 'healthy ageing' in the past decade: An unsupervised machine learning of 63,809 Twitter posts. Heliyon. 2023;9(2):e13118. [FREE Full text] [CrossRef] [Medline]
  16. Ng QX, Yau CE, Lim YL, Wong LKT, Liew TM. Public sentiment on the global outbreak of monkeypox: an unsupervised machine learning analysis of 352,182 twitter posts. Public Health. 2022;213:1-4. [FREE Full text] [CrossRef] [Medline]
  17. Devlin J, Chang M, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. Human Language Technologies; 2019. Presented at: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies; June 2-7; Minneapolis, Minnesota. URL: https://aclanthology.org/N19-1423/
  18. Nayak P. Understanding searches better than ever before. Google. URL: https://blog.google/products/search/search-language-understanding-bert/ [accessed 2024-03-25]
  19. Wagner J. The most popular social networks of 2021. Ignite Social Media. URL: https://www.ignitesocialmedia.com/twitter-marketing/the-most-popular-social-networks-of-2021/ [accessed 2023-09-28]
  20. Lam CS, Zhou K, Loong HH, Chung VC, Ngan C, Cheung YT. The use of traditional, complementary, and integrative medicine in cancer: data-mining study of 1 million web-based posts from health forums and social media platforms. J Med Internet Res. 2023;25:e45408. [FREE Full text] [CrossRef] [Medline]
  21. Search engine market share worldwide. Statcounter GlobalStats. URL: https://gs.statcounter.com/search-engine-market-share [accessed 2024-03-14]
  22. Cancer chat. Cancer Research UK. URL: https://www.cancerresearchuk.org/about-cancer/cancer-chat [accessed 2024-01-13]
  23. Jo's Cervical Cancer Trust Forum. URL: https://forumstaging.jostrust.org.uk [accessed 2024-01-13]
  24. Prostate Cancer UK. URL: https://community.prostatecanceruk.org/alltopics [accessed 2024-01-13]
  25. Bowel Cancer UK. URL: https://community.bowelcanceruk.org.uk/forum/ [accessed 2024-01-13]
  26. Cancer Survivors Network. URL: https://csn.cancer.org [accessed 2024-01-13]
  27. The Breastcancer.org Community Forum. URL: https://community.breastcancer.org/ [accessed 2024-01-13]
  28. Pancreatic Cancer UK. URL: https://forum.pancreaticcancer.org.uk/ [accessed 2024-01-13]
  29. Breast Cancer Now Forum. URL: https://forum.breastcancernow.org/ [accessed 2024-01-13]
  30. Cancer Council Online Community. URL: https://onlinecommunity.cancercouncil.com.au/ [accessed 2024-01-13]
  31. Irish Cancer Society Community. URL: https://www.cancer.ie/community [accessed 2024-01-13]
  32. Navigating Care. URL: https://www.navigatingcare.com [accessed 2024-01-13]
  33. Testicular Cancer Society. URL: http://www.tc-cancer.com/forum/# [accessed 2024-01-13]
  34. HealthBoards (cancer). URL: https://www.healthboards.com/boards/cancers/ [accessed 2024-01-13]
  35. Melanoma Patient Forum. URL: https://forum.melanoma.org/ [accessed 2024-01-13]
  36. Macmillan Cancer Support. URL: https://community.macmillan.org.uk/g [accessed 2024-01-13]
  37. HealthUnlocked. URL: https://healthunlocked.com/search/communities?query=cancer [accessed 2024-01-13]
  38. Kocaman V, Talby D. Spark NLP: Natural language understanding at scale. Software Impacts. 2021;8:100058. [CrossRef]
  39. Health care NLP state of the art medical language models. John Snow Labs. URL: https://www.johnsnowlabs.com/health [accessed 2024-03-06]
  40. Saracco BH. Data science and predictive analytics: biomedical and health applications using R. Journal of the Medical Library Association. 2020;108(2):334. [CrossRef]
  41. Dieng AB, Ruiz FJR, Blei DM. Topic modeling in embedding spaces. Transactions of the Association for Computational Linguistics. 2020;8:439-453. [CrossRef]
  42. Mimno D, Wallach H, Talley E, Leenders M, McCallum A. Optimizing semantic coherence in topic models. 2011. Presented at: Proceedings of the conference on empirical methods in natural language processing; July 27-31; Edinburgh, Scotland, UK. URL: https://aclanthology.org/D11-1024
  43. Grootendorst M. BERTopic. 2022. URL: https://github.com/MaartenGr/BERTopic [accessed 2023-09-01]
  44. Krishna BSV, Satheesh P, SuneelKumar R. Comparative study of k-means and bisecting k-means techniques in wordnet based document clustering. Semantic Scholar. 2012. URL: https://api.semanticscholar.org/CorpusID:212499046 [accessed 2024-03-13]
  45. High performance computing. Worcester Polytechnic Institute. URL: https://arc.wpi.edu/computing/hpc-clusters/ [accessed 2024-06-17]
  46. Bonta V, Kumaresh N, Janardhan N. A comprehensive study on lexicon based approaches for sentiment analysis. AJCST. 2019;8(S2):1-6. [CrossRef]
  47. Aljedaani W, Rustam F, Mkaouer MW, Ghallab A, Rupapara V, Washington PB, et al. Sentiment analysis on Twitter data integrating TextBlob and deep learning models: The case of US airline industry. Knowledge-Based Systems. 2022;255:109780. [CrossRef]
  48. Gonzalez G, Vaculik K, Khalil C, Zektser Y, Arnold C, Almario CV, et al. Using large-scale social media analytics to understand patient perspectives about urinary tract infections: thematic analysis. J Med Internet Res. 2022;24(1):e26781. [CrossRef]
  49. Thapa DK, Visentin DC, Kornhaber R, West S, Cleary M. The influence of online health information on health decisions: A systematic review. Patient Educ Couns. 2021;104(4):770-784. [CrossRef] [Medline]
  50. Jacobs W, Amuta AO, Jeon KC. Health information seeking in the digital age: An analysis of health information seeking behavior among US adults. Cogent Social Sciences. 2017;3(1):1302785. [CrossRef]
  51. Palanica A, Flaschner P, Thommandram A, Li M, Fossat Y. Physicians' perceptions of chatbots in health care: cross-sectional web-based survey. J Med Internet Res. 2019;21(4):e12887. [CrossRef] [Medline]
  52. Suarez-Lledo V, Alvarez-Galvez J. Prevalence of health misinformation on social media: systematic review. J Med Internet Res. 2021;23(1):e17187. [FREE Full text] [CrossRef] [Medline]
  53. Tran G, Zafar SY. Financial toxicity and implications for cancer care in the era of molecular and immune therapies. Ann Transl Med. 2018;6(9):166. [FREE Full text] [CrossRef] [Medline]
  54. Kovacs D, Kovacs P, Eszlari N, Gonda X, Juhasz G. Psychological side effects of immune therapies: symptoms and pathomechanism. Curr Opin Pharmacol. 2016;29:97-103. [FREE Full text] [CrossRef] [Medline]
  55. Lacouture M, Sibaud V. Toxic side effects of targeted therapies and immunotherapies affecting the skin, oral mucosa, hair, and nails. Am J Clin Dermatol. 2018;19(S1):31-39. [CrossRef]
  56. Haanen J, Obeid M, Spain L, Carbonnel F, Wang Y, Robert C, et al. ESMO Guidelines Committee. Electronic address: clinicalguidelines@esmo.org. Management of toxicities from immunotherapy: ESMO Clinical Practice Guideline for diagnosis, treatment and follow-up. Ann Oncol. 2022;33(12):1217-1238. [FREE Full text] [CrossRef] [Medline]
  57. Brahmer JR, Lacchetti C, Schneider BJ, Atkins MB, Brassil KJ, Caterino JM, et al. National Comprehensive Cancer Network. Management of immune-related adverse events in patients treated with immune checkpoint inhibitor therapy: American Society of Clinical Oncology Clinical Practice Guideline. J Clin Oncol. 2018;36(17):1714-1768. [FREE Full text] [CrossRef] [Medline]
  58. Rahman MM, Behl T, Islam MR, Alam MN, Islam MM, Albarrati A, et al. Emerging management approach for the adverse events of immunotherapy of cancer. Molecules. 2022;27(12):3798. [FREE Full text] [CrossRef] [Medline]
  59. Zhang N, Xiao X. Integrative medicine in the era of cancer immunotherapy: Challenges and opportunities. J Integr Med. 2021;19(4):291-294. [CrossRef] [Medline]
  60. Jia W, Wang L. Using Traditional Chinese Medicine to treat hepatocellular carcinoma by targeting tumor immunity. Evid Based Complement Alternat Med. 2020;2020:9843486. [FREE Full text] [CrossRef] [Medline]
  61. Schneider BJ, Naidoo J, Santomasso BD, Lacchetti C, Adkins S, Anadkat M, et al. Management of immune-related adverse events in patients treated with immune checkpoint inhibitor therapy: ASCO guideline update. JCO. 2021;39(36):4073-4126. [CrossRef]
  62. Cheung YT. Perception of online communities towards the use of cancer immunotherapy: a data mining study of 3.6 million web-based posts from social media platforms using BERTopic. The Chinese University of Hong Kong. URL: https://researchdata.cuhk.edu.hk/dataset.xhtml?persistentId=doi:10.48668/0NPBOK [accessed 2025-01-21]


BERT: Bidirectional Encoder Representation from Transformers
BERTopic: a topic modeling technique that is an extension of the Bidirectional Encoder Representation from Transformers
irAE: immunotherapy-related adverse event
VADER: Valence Aware Dictionary and Sentiment Reasoner


Edited by N Cahill; submitted 27.05.24; peer-reviewed by D Carvalho, L Wreyford; comments to author 27.09.24; revised version received 17.10.24; accepted 30.12.24; published 10.02.25.

Copyright

©Xingyue Wu, Chun Sing Lam, Ka Ho Hui, Herbert Ho-fung Loong, Keary Rui Zhou, Chun-Kit Ngan, Yin Ting Cheung. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 10.02.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.