Original Paper
Abstract
Background: The global Mpox (formerly, Monkeypox) outbreak is disproportionately affecting the gay and bisexual men having sex with men community.
Objective: The aim of this study is to use social media to study country-level variations in topics and sentiments toward Mpox and Two-Spirit, Lesbian, Gay, Bisexual, Transgender, Queer or Questioning, Intersex, Asexual (2SLGBTQIAP+)–related topics. Previous infectious outbreaks have shown that stigma intensifies an outbreak. This work helps health officials control fear and stop discrimination.
Methods: In total, 125,424 Twitter and Facebook posts related to Mpox and the 2SLGBTQIAP+ community were extracted from May 1 to December 25, 2022, using Twitter application programming interface academic accounts and Facebook-scraper tools. The tweets’ main topics were discovered using Latent Dirichlet Allocation in the sklearn library. The pysentimiento package was used to find the sentiments of English and Spanish posts, and the CamemBERT package was used to recognize the sentiments of French posts. The languages of the tweets and Facebook posts were identified using the Twitter application programming interface platform and the pycld3 library, respectively. Using ArcGIS Online, the hot spots of the geotagged tweets were identified. Mann-Whitney U, ANOVA, and Dunn tests were used to compare the sentiment polarity of different topics and countries.
Results: The number of Mpox posts and the number of posts with Mpox and 2SLGBTQIAP+ keywords were 85% correlated (P<.001). Interestingly, the number of posts with Mpox and 2SLGBTQIAP+ keywords had a higher correlation with the number of Mpox cases (correlation=0.36, P<.001) than the number of posts on Mpox (correlation=0.24, P<.001). Of the 10 topics, 8 were aimed at stigmatizing the 2SLGBTQIAP+ community, 3 of which had a significantly lower sentiment score than other topics (ANOVA P<.001). The Mann-Whitney U test shows that negative sentiments have a lower intensity than neutral and positive sentiments (P<.001) and neutral sentiments have a lower intensity than positive sentiments (P<.001). In addition, English sentiments have a higher negative and lower neutral and positive intensities than Spanish and French sentiments (P<.001), and Spanish sentiments have a higher negative and lower positive intensities than French sentiments (P<.001). The hot spots of the tweets with Mpox and 2SLGBTQIAP+ keywords were recognized as the United States, the United Kingdom, Canada, Spain, Portugal, India, Ireland, and Italy. Canada was identified as having more tweets with negative polarity and a lower sentiment score (P<.04).
Conclusions: The 2SLGBTQIAP+ community is being widely stigmatized for spreading the Mpox virus on social media. This turns the community into a highly vulnerable population, widens the disparities, increases discrimination, and accelerates the spread of the virus. By identifying the hot spots and key topics of the related tweets, this work helps decision makers and health officials inform more targeted policies.
doi:10.2196/45108
Keywords
Introduction
While smallpox, an infection caused by the Variola virus (Poxviridae family, Chordopoxvirinae subfamily, and Orthopoxvirus genus), was declared fully eradicated in 1980 [
], Mpox (formerly, Monkeypox), a similar but milder communicable disease caused by another Orthopoxvirus (Monkeypox virus, MPXV), is still circulating [ , ]. This disease has been endemic in a number of African countries since 1970, when the first known human case of Mpox was reported in a 9-month-old child [ ], after the virus had been isolated in 1958 [ ]. A complex interplay of various variables, including geographical, environmental, historical, and socioeconomic factors such as political instability, population mobility, and poverty, as well as the waning of the immunity conferred by vaccinia vaccination, may have contributed to its spread in Africa [ - ]. Two major MPXV clades have been identified, namely, the Central African (Congo Basin) Clade, now referred to as Clade one (I), and the Western African Clade, renamed Clade two (II) [ ], consisting of 2 subclades (Clade IIa and Clade IIb) [ ]. The ecoregion extending from southeastern Nigeria to Cameroon, between the Cross and Sanaga rivers, may have acted as a biogeographic barrier, splitting the virus into these 2 genomic variants [ , - ].
Until 2003, Mpox had never been reported outside of the West and Central African countries where it is endemic. In 2003, a cluster of 47 cases was announced in the United States, linked to imported African small mammals [
]. Since then, Mpox outbreaks have been reported in non-African countries. Israel reported its first case in 2018 [ ]. The United Kingdom reported 2 major clusters in 2018 and 1 in 2019 [ , ], and Singapore announced its first case in 2019 [ ].
The ongoing global Mpox outbreak started on May 6, 2022, in the United Kingdom, with one case being a British resident traveling back from Nigeria, but with other case clusters of unclear epidemiological origin [
]. Since then, an increasing number of countries have progressively announced new cases, with some hospitalizations [ ] and a few fatalities [ ].
As of November 10, 2022, according to the US Centers for Disease Control and Prevention (CDC), globally, 110 countries have been affected by Mpox cases, 103 of which have not historically reported Mpox [
], with Clade IIb being the MPXV viral strain currently circulating [ ]. The current outbreak is characterized by an emerging transmission route, which is to say, it is spread by close physical or sexual contact [ ]. Since 93%-98% [ , ] of reported Mpox cases affected gay and bisexual men having sex with men [ ], public and global health responses have mainly targeted the Two-Spirit, Lesbian, Gay, Bisexual, Transgender, Queer or Questioning, Intersex, Asexual (2SLGBTQIAP+) community, which has been heavily and disproportionately impacted by the ongoing global outbreak [ , ]. With this particular focus on this specific population, despite the fact that the infectious agent can be spread and acquired through multiple pathways, the 2SLGBTQIAP+ community risks being stigmatized and blamed for transmitting the virus [ , ]. Stigmatization represents a serious public health concern in that, during infectious outbreaks, it can cause more new cases and aggravate underlying health vulnerabilities [ , ]. Past experiences have shown that blaming marginalized and minority populations for spreading the disease can increase the risk of individuals not seeking health care upon observing their symptoms due to the fear of stigma [ , ]. Moreover, stigma causes depression, mental health conditions, psychological damage, and increases substance use [ - ]. Previous works have studied the negative impact of stigmatizing minority groups and communities during various infectious outbreaks, for example, HIV/AIDS and hepatitis B and C [ - ]. For instance, Nyblade [ ] conducted a survey to assess the impact of HIV/AIDS-related stigma and public opinion on the spread of the virus. The results showed that at the beginning, stigma contributed to fueling the virus transmission, with discrimination gradually decreasing and allowing more patients to seek help. 
Grossman and Stangl [ ] described how to devise strategies and interventions aimed at reducing HIV/AIDS stigma and counteracting and mitigating its effects, whereas Shen et al [ ] assessed the effectiveness of a crowdsourced intervention for decreasing hepatitis B–related stigma in the men having sex with men community.
As such, given its implications in terms of public and global health, it is of paramount importance to measure stigma. Currently, there exist various tools and techniques to do so; for instance, some authors [
] have designed a questionnaire using the EPI (CDC) software and built a stigma score. The results showed a direct connection between stigma and depression. They also found that stigma differs across cultures and populations. Holzemer et al [ ] had 726 patients infected with HIV from Africa, Puerto Rico, and the United States fill out a questionnaire designed for assessing the quality of life. The results showed that the patients were highly scared of stigma as a discrediting social label and were highly reluctant and hesitant to be tested. They displayed a low quality of life, and many of them endured depression. It was shown in [ ] that stigmatization and marginalization of minority populations are associated with increased alcohol use. Furthermore, a study [ ], based on questionnaires and focus group discussions, was conducted to analyze stigma among African American rural adolescents. The results showed that participants had an average level of HIV/AIDS knowledge and that stigma played a major role in the risk of contracting HIV and developing AIDS. Moreover, they found that nurses and other health care professionals can play a key role in addressing HIV/AIDS–related stigma and misconceptions among the lay population.
However, questionnaires, even though specifically designed for measuring stigma, have some major shortcomings. “Novel and unconventional data streams” [
, ], including social media and social networks, offer unprecedented opportunities to quantify the levels of stigma and track them in real time. Some authors [ ] used the stigmatization term frequency of Twitter posts to assess stigma for different debilitating conditions affecting physical and mental health, including HIV/AIDS. The results showed that people with mental health conditions are more stigmatized than people with physical health issues, and, in turn, people living with HIV/AIDS are more stigmatized than people with mental health conditions. A survey [ ] conducted among African-American and Latino men having sex with men showed that stigma was positively associated with the number of hours spent on social media. In other words, individuals who had a higher sense of stigma were more likely to spend their time on social media platforms such as Facebook. Veinot et al [ ] concluded that reducing stigma improves information-seeking and sharing behavior, helps individuals have better access to information on sexually transmitted diseases, particularly HIV and AIDS, and decreases the risk of infection.
On the other hand, social media can also reduce stigma and mitigate discrimination [
, ]. In more detail, three ways to counteract stigma in social media have been identified [ ], namely, (1) protest, (2) education, and (3) contact. Concerning protest, direct messages or hashtags opposing stigmatization, such as “Stop the Stigma,” can be propagated as much as possible. Concerning education, true, verified, and accurate information can be posted in response to false and misleading information. Concerning contact, a stigmatized individual comes into contact with a stigmatizing user in order for them to hear and understand the opposite side. However, before taking any action and implementing a solution, the stigma should be investigated in depth in terms of topics of interest and sentiments. Natural language processing (NLP) techniques, like topic modeling and sentiment analysis [ ], coupled with geospatial software, can help achieve this aim. Moreover, authors in [ - ] found that using web-based testing, message-based surveys, and mobile health interventions instead of in-person questionnaires reduces concern over privacy and stigmatization and helps marginalized populations share truthful information for better health services.
Specifically concerning Mpox, a few works have focused on the stigmatization of the 2SLGBTQIAP+ community during the 2022 global Mpox outbreak. Some authors [
], given the global burden imposed by stigma, have warned of the consequences of ignoring stigmatization as it happened during previous infectious outbreaks, such as HIV/AIDS, and advised public health officials to proactively address Mpox-related stigmatization. Other authors [ ] have discussed guidelines for taking care of and helping minority communities, emphasizing the importance of raising awareness about the risk of Mpox and the implications of stigmatizing sexual and gender minority populations [ ].
Even though the abovementioned studies provided rich information and insightful comments regarding the consequences of blaming minority populations for spreading Mpox, as well as suggestions for mitigating these issues, only a few of them have adopted a social media perspective [
- ]. Social media is a web-based environment where people can share their thoughts, ideas, and beliefs. Novel information, as well as fabricated and misleading information, and fake news, can be massively propagated using these platforms every day [ , ].
Previous works have exploited social media for various purposes, including opinion mining [
], hot spot detection [ ], and surveillance [ ]. In [ ], 137 TikTok videos were manually screened and categorized. Roughly 12% of the videos came from the Lesbian, Gay, Bisexual, Transgender, Queer (LGBTQ) category. However, the stigmatization of the LGBTQ community was not studied. Dsouza et al [ ] analyzed the sentiments of tweets to study the LGBTQ stigmatization for spreading Mpox. However, they only considered English tweets, did not perform cross-country analysis, and did not extract and analyze discussed topics on social media.
In this paper, we fill this gap by studying social media in more depth and leveraging Twitter and Facebook to better understand Mpox-related stigmatization by assessing relevant popular discussions and conversations regarding Mpox, identifying stigmatization sources, their hot spots, and their sentiments. A data set was built and analyzed by gathering relevant posts from Twitter and Facebook using keywords related to Mpox. Two NLP techniques, namely, topic modeling and sentiment analysis, were performed on the posts. ArcGIS Online [
, ] was used to visualize the geotagged tweets and find their hot spots. The result of our work may have practical implications in that it could be used by public health officials to determine the direction of their policies and inform them in a data-driven fashion.
Methods
Gathering the Data Set
The data set for this work was gathered from 2 of the most popular social media platforms: Twitter and Facebook. By using the full-archive search of the Twitter Academic Researcher Application Programming Interface, all the tweets posted since 2006 can be retrieved for a given query [
, ]. Using keywords related to Mpox, a query was built ((monkeypox OR “monkey pox” OR smallpox OR “viruela dei mono” OR “variole du singe” OR “variola do macaco”) -is:retweet) to gather all the tweets except the retweets, from May 1 to December 25, 2022. The tweets were then cleaned: URLs, addresses with the “@” sign, and hashtag signs “#” were removed, and punctuation was corrected. After cleaning, 2,333,496 tweets remained. Of these, 124,712 tweets related to the 2SLGBTQIAP+ community were extracted using the following keywords: lgbtq, lgbtq+, gay, homosexual, homosexuality, lesbian, intersex, transsexual, transgender, bisexual, queer, “men having sex with men,” “men who have sex with men,” lgbt, lgbtqi, lgbt+, lgbtqi+, and lgbtq+ [ ]. All the posts in 30 Mpox-related public Facebook groups from May 1 to December 25, 2022, were gathered using the Facebook_Scraper library [ ]. Of the 16,114 retrieved posts, 712 had the 2SLGBTQIAP+ keywords after cleaning and were selected for analysis. All Facebook posts are public and gathered from public groups. The Twitter and Facebook data sets gathered with Mpox- and Mpox plus 2SLGBTQIAP+-related keywords or groups were combined and visualized by means of a word cloud ( ).
The language of the tweets was retrieved using the Twitter application programming interface. Moreover, the language of Facebook posts was recognized using the pycld3 library [
]. The posts were in 102 different languages; English, French, and Spanish were the most frequent, with 1,972,637, 124,008, and 33,547 posts, respectively.
Ethical Considerations
We have gathered public posts from public pages and public groups on Facebook, accessible by anyone through Facebook. We share the group IDs, page IDs, and post IDs of our data set [
], in compliance with the Association of Internet Researchers ethics [ ] and the International Chamber of Commerce/European Society for Opinion and Marketing Research code of conduct [ ]. Moreover, the Twitter data set, which is available at [ ], includes only tweet IDs and user IDs and is used and shared under Twitter’s privacy policy agreement [ ]. Since social media posts are passively analyzed in this research, informed consent from individuals is waived [ ].
NLP Techniques
Topic modeling, a text mining tool for automatic discovery and extraction of hidden topics and semantic structures occurring within a text body or in a collection of documents, was done using the Latent Dirichlet Allocation model available in the sklearn package of Python (version 3.8.8; Python Software Foundation). Topic analysis was performed only on English posts. The optimal number of topics was calculated by maximization of the coherence and minimization of the Jaccard similarity scores [
]. Posts belonging to each topic with a probability higher than 0.7 were studied, and the main subject of concern for each topic was inferred.
In this paper, topic modeling was coupled with sentiment analysis, which is an NLP procedure that classifies a text based on its sentiment. Most sentiment analysis models classify a text into 3 classes: positive, neutral, and negative. However, some models classify a text into 2 classes: positive and negative. The sentiment score is a number between –1 and 1, which indicates the intensity of the sentiment. Generally, models that classify a text into 3 classes have a score close to 1, 0, and –1 for positive, neutral, and negative sentiments, respectively. Moreover, models that classify a text into 2 classes provide a negative score for negative sentiments and a positive score for positive sentiments. In this work, sentiment analysis was performed on English and Spanish posts using the pysentimiento package available on the Hugging Face website. It is estimated that this model, which classifies text into 3 classes, has a macro F1 score of 0.705 [
- ]. Sentiment analysis was performed on French posts using CamemBERT, which classifies text into 2 classes and is estimated to have 94.55% accuracy [ ]. Geotagged tweets were used to study topics and sentiments regarding the 2SLGBTQIAP+ community across different countries. Sentiments on different topics were compared using Mann-Whitney U, ANOVA, and Dunn tests, and studied across different countries using the Mann-Whitney U test.
Results
Trends in the Posts
The temporal trend of the number of the gathered posts related to Mpox and its epidemiology in terms of Mpox cases is depicted in
. From May 18, when Mpox began to emerge as an outbreak, the volume of the posts significantly increased until May 24, when it started falling. Since May 28, the volume of posts has stayed more or less steady. In more detail, the number of Mpox posts peaked on May 20 and on May 23, 2022, while the number of Mpox and the 2SLGBTQIAP+ community posts peaked on May 24, 2022, just 2 days after the Joint United Nations Programme on HIV and AIDS (ie, UNAIDS) urged media outlets, as well as institutional actors, including governments and communities, to respond to the outbreak with an evidence-based, data-driven, and, at the same time, inclusive and rights-based approach, avoiding attaching a stigma to the 2SLGBTQIAP+ community.
The number of posts concerning Mpox and the number specifically focusing on the relationship between Mpox and the 2SLGBTQIAP+ community were highly correlated, as expected (correlation coefficient of 0.85, P<.001). Interestingly, the correlation between the number of Mpox cases and the number of posts related to the 2SLGBTQIAP+ community (correlation coefficient of 0.36, P<.001) was higher than the correlation between the Mpox cases and the total number of posts on Mpox (correlation coefficient of 0.24, P<.001). This shows how closely the discussions regarding Mpox on social media are related to the 2SLGBTQIAP+ community.
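The comparison of correlation coefficients described above can be illustrated with a minimal sketch. It assumes a Pearson coefficient computed on daily counts, which the text does not specify; the function name `pearson_r` and all of the numbers below are hypothetical and invented purely for illustration, not taken from the study's data set.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of cross-deviations and sums of squared deviations
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical daily counts (invented for illustration only)
mpox_posts = [120, 340, 900, 760, 410, 380, 300]
mpox_lgbtq_posts = [10, 25, 80, 95, 40, 33, 28]
daily_cases = [2, 5, 9, 14, 20, 26, 31]

r_posts = pearson_r(mpox_posts, mpox_lgbtq_posts)
r_cases_all = pearson_r(daily_cases, mpox_posts)
r_cases_subset = pearson_r(daily_cases, mpox_lgbtq_posts)

print(f"all posts vs. keyword-subset posts: r = {r_posts:.2f}")
print(f"cases vs. all Mpox posts:           r = {r_cases_all:.2f}")
print(f"cases vs. keyword-subset posts:     r = {r_cases_subset:.2f}")
```

The sketch reports only the coefficients; the P values quoted in the text would require an additional significance test, which is omitted here.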
Topic Modeling
A total of 10 different topics were extracted from posts related to Mpox and 2SLGBTQIAP+. The topics indicate that the 2SLGBTQIAP+ population is heavily stigmatized for spreading Mpox. Table S1 in
shows the identified keywords and the percentage of tweets on each topic that have them. The first 17 keywords shaded in gray are essentially related to Mpox and 2SLGBTQIAP+ and are common among almost all of the topics. Each of the remaining keywords belongs predominantly to one of the topics. The same pattern could be observed in the word clouds created for each topic ( ). This indicates that the topics are well separated and do not overlap. This is the result of maximizing coherence while minimizing the Jaccard similarity score. In other words, posts inside each topic are very much related, and posts from different topics are far from each other. By studying the posts that belong to each topic with a probability higher than 0.7, a subject was identified for each topic; the subjects are listed in .
Identified subjects.
On studying the posts belonging to each topic (probability>0.7), the subjects identified for each topic are as follows:
- Topic #1: lesbian, gay, bisexual, transgender, queer pride
- Topic #2: What World Health Organization/public health/health officials say about Mpox; Mpox is/is not a gay disease
- Topic #3: Mpox does/does not spread through gays/gay orgies/queers
- Topic #4: Mpox is an airborne bioweapon targeting gays
- Topic #5: Reporting number of cases in different countries; Condition of having rash or lesion on skin
- Topic #6: gay bathhouse/homosexuality/heterosexual; Centers for Disease Control and Prevention and CNN (Cable News Network) news
- Topic #7: Mpox spreads through gay/homosexual sex
- Topic #8: Mpox outbreak linked to gay sauna/gay bars/Grindr/fetish festival; Avoid gay sex to protect yourself
- Topic #9: Mpox is a stigma against gays/African gays; stigmatizing gays/African gays
- Topic #10: Mpox particularly concentrates on gay and bisexual men, however, anyone could be at risk
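The two selection steps mentioned above (keeping only posts assigned to a topic with probability higher than 0.7, and checking keyword overlap between topics with the Jaccard similarity) can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes a document-topic probability matrix of the kind returned by scikit-learn's `LatentDirichletAllocation.fit_transform`, and the helper names, the matrix values, and the keyword lists are all hypothetical.

```python
from itertools import combinations


def confident_assignments(doc_topic, threshold=0.7):
    """Map each topic index to the documents whose most probable topic
    has a probability strictly greater than `threshold`."""
    selected = {}
    for doc_idx, probs in enumerate(doc_topic):
        top_topic = max(range(len(probs)), key=probs.__getitem__)
        if probs[top_topic] > threshold:
            selected.setdefault(top_topic, []).append(doc_idx)
    return selected


def jaccard(a, b):
    """Jaccard similarity of two keyword collections: |A & B| / |A | B|."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def mean_pairwise_jaccard(topic_keywords):
    """Average Jaccard similarity over all pairs of topics (lower means
    the topics share fewer top keywords)."""
    pairs = list(combinations(topic_keywords, 2))
    if not pairs:
        return 0.0
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Hypothetical document-topic probabilities (rows sum to 1), 3 topics
doc_topic = [
    [0.85, 0.10, 0.05],  # confidently topic 0
    [0.40, 0.35, 0.25],  # ambiguous, dropped
    [0.05, 0.15, 0.80],  # confidently topic 2
    [0.72, 0.20, 0.08],  # confidently topic 0
]
# Hypothetical top keywords per topic
topic_keywords = [
    ["mpox", "gay", "pride"],
    ["mpox", "gay", "cases"],
    ["mpox", "vaccine", "who"],
]

print(confident_assignments(doc_topic))
print(round(mean_pairwise_jaccard(topic_keywords), 3))
```

In a model-selection loop, one would compute the mean pairwise Jaccard score (together with a coherence score, not shown here) for each candidate number of topics and prefer the setting with high coherence and low overlap.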
Sentiment Analysis
Topic modeling indicated that the 2SLGBTQIAP+ population is highly stigmatized for spreading Mpox. Sentiment analysis shows that most sentiments are negative, then neutral, and only a few are positive. Since the posts are related to an outbreak after a pandemic and the stigmatization of minority populations, the sentiments are expected to be mostly negative. English posts have the greatest number of negative polarities. Spanish posts have the fewest negative polarities, and a higher neutral polarity compared to English posts. Moreover, the negative polarity of French posts is significantly higher than the positive polarity (
). The P value of the Mann-Whitney U test indicates that the intensity of negative polarity is significantly higher than that of neutral and positive polarities (P<.001), and the intensity of neutral polarity is significantly higher than that of positive polarity (P<.001; ). The P value of the Mann-Whitney U test also indicates that English posts have a significantly higher negative intensity and lower neutral and positive intensities compared to Spanish and French posts (P<.001; ).
All the topics have a higher negative, then neutral, and finally positive polarity (
). Additionally, the Mann-Whitney U test shows that all the topics have significantly higher negative, then neutral, and finally positive intensities ( ). However, the ANOVA test indicates that the sentiment scores differ significantly across topics (P<.001; ). The Dunn test shows that 3 topics that are strongly related to the stigmatization of the 2SLGBTQIAP+ community, namely, “gay bathhouse/homosexuality/heterosexual; CDC and CNN news,” “Mpox outbreak linked to gay sauna/gay bars/Grindr/fetish festival; Avoid gay sex to protect yourself,” and “Mpox is a stigma against gays/African gays; Stigmatizing gays/African gays” have a significantly lower sentiment score compared to other topics ( ).
Hot Spots
The visualization of the geotagged tweets gathered on Mpox and Mpox plus LGBTQ keywords shows that countries that have the greatest number of tweets include the United States, the United Kingdom, Canada, Ireland, France, the Netherlands, Switzerland, Spain, Portugal, Germany, Mexico, Brazil, South Africa, Nigeria, Kenya, Pakistan, and India (
). The tweets extracted for the 2SLGBTQIAP+ community were mostly concentrated in the United States, the United Kingdom, Canada, Spain, Portugal, India, Ireland, and Italy. However, topic modeling of tweets related to Mpox and the 2SLGBTQIAP+ community was performed only on English tweets, which were mainly from the United States, the United Kingdom, Canada, and India.
After topics 5 and 6, which concern the news and the reporting of the number of Mpox cases, topics 3, 4, 8, and 9, all of which relate to the 2SLGBTQIAP+ population being stigmatized for spreading Mpox, are the most popular across countries (
). Since the posts were in English, it was possible to find the popularity of each topic only in 4 different countries: the United States, the United Kingdom, Canada, and India.
Sentiment polarities were found for English tweets in the United States, the United Kingdom, Canada, and India. Moreover, the sentiment polarities of the Spanish and French tweets were found only for Spain and France, respectively, since the volume was low for other countries. The 3 countries with the highest negative polarity are Canada, the United States, and the United Kingdom (
). The P value of the Mann-Whitney U test indicates that the distributions of sentiment scores differ little across countries ( ). However, among the countries that were studied for English tweets, Canada has the lowest sentiment score (P<.04).
Discussion
Principal Findings
On July 23, 2022, the World Health Organization (WHO) declared Mpox a public health emergency of international concern [
]. Ever since then, the number of cases around the globe has been increasing. Previously, minority populations such as gay and bisexual communities were blamed for spreading different diseases like HIV/AIDS and hepatitis B and C. The result of such stigmatization was more new cases, depression, mental health problems, and substance use [ ]. The same trend is observable with the novel Mpox outbreak that is spreading around the world. This work aims to understand Mpox stigmatization of the 2SLGBTQIAP+ community using Twitter and Facebook.
Social media is becoming increasingly popular among people to share their opinions, ideas, and experiences. People are sometimes more honest on social media than in their real lives. Therefore, social media can serve as a reflection of the real world. As a result, it is used in many different areas of research, such as the economy [
], marketing [ ], and health care [ ].
A few studies have applied social media mining to Mpox. Ng et al [
] extracted from Twitter a body of 352,182 original tweets containing the terms “monkeypox,” “monkey pox,” or “monkey_pox,” in the English language, from May 6, 2022, to July 23, 2022, using Bidirectional Encoder Representations from Transformers named entity recognition. The authors identified 5 topics clustered into three major themes: (1) safety concerns, (2) sexual and gender minority stigmatization, and (3) a general lack of faith in public institutions. Tweets displayed high levels of partisanship and personal health anxiety.
In line with these findings, in this paper, tweets and Facebook posts are used to discover the popular discussions regarding Mpox and the stigmatization of 2SLGBTQIAP+ communities, the hot spots, and the sentiments of different topics and countries. The results of this study could be used by health officials to combat stigmatization.
Strengths and Limitations
This investigation has a number of strengths, including its methodological rigor, transparency, and novelty, as well as its main focus on the 2SLGBTQIAP+ community. On the other hand, it is not without limitations, which should be properly acknowledged. Sentiment analysis was performed on English, French, and Spanish tweets and Facebook posts, which had the highest frequency among all the different languages. Moreover, topic modeling was performed on English posts, which made up more than 81% of the posts. As a result, the discussions and sentiments of the majority of the posts have been extracted and analyzed. However, there are countries in which people do not speak English, French, or Spanish. The findings of this study cannot be generalized to countries whose official languages are different.
Conclusions
The number of posts with Mpox and 2SLGBTQIAP+ keywords had a higher correlation with the number of Mpox cases (correlation coefficient of 0.36, P<.001) compared to the number of posts on Mpox (correlation coefficient of 0.24, P<.001). This indicates that social media discussions on Mpox are tightly related to the 2SLGBTQIAP+ community. Out of the 10 topics related to Mpox and LGBTQ, 8 were directly focused on blaming the gay community for spreading Mpox. The sentiments on all topics were very negative. Three of the topics that were strongly related to the stigmatization of the 2SLGBTQIAP+ community had a significantly lower sentiment score compared to other topics (ANOVA P<.001). The sentiment of posts from all the 3 languages, English, Spanish, and French, had a higher negative intensity, then neutral, and then positive (P<.001). Canada had the lowest sentiment score compared to other countries (P<.04). Stigmatization of a minority community on this scale will cause seclusion of people and increase hesitancy for seeking help upon realization of the symptoms. Stigmatization of the gay community, especially in countries where the sentiment polarity is very negative (ie, Canada, the United States, and the United Kingdom), must be prevented in order to contain Mpox and control the disease.
In future work, NLP tools could be used to study the sentiments and topics of posts regarding Mpox and the 2SLGBTQIAP+ community in languages and regions other than the ones studied in this manuscript.
Acknowledgments
This research is funded by Canada’s International Development Research Centre (IDRC) and the Swedish International Development Cooperation Agency (SIDA) (109559-001). NLB and JDK acknowledge support from IDRC (109981). JDK equally acknowledges support from the NSERC Discovery Grant (RGPIN-2022-04559), the NSERC Discovery Launch Supplement (DGECR-2022-00454), and the New Frontier in Research Fund-Exploratory (NFRFE-2021-00879).
Data Availability
The Facebook data set generated during or analyzed during this study is available on GitHub [
]. The Twitter data set generated during or analyzed during this study is available from Movahedi et al [ ].
Conflicts of Interest
None declared.
The Twitter and Facebook data sets with (A) Mpox and (B) Mpox plus Two-Spirit, Lesbian, Gay, Bisexual, Transgender, Queer and/or Questioning, Intersex, Asexual (2SLGBTQIAP+) keywords and groups shown by word cloud.
PNG File, 336 KB
The most prominent keywords of each topic and the percentage of their contribution in building that topic for posts on Mpox plus 2SLGBTQIAP+.
PDF File (Adobe PDF File), 308 KB
Visualizing each topic using a word cloud.
PNG File, 1005 KB
References
- Pauli G, Blümel J, Burger R, Drosten C, Gröner A, Gürtler L, et al. Orthopox viruses: infections in humans. Transfus Med Hemother 2010;37(6):351-364.
- El Eid R, Allaw F, Haddad SF, Kanj SS. Human monkeypox: a review of the literature. PLoS Pathog 2022 Sep;18(9):e1010768.
- Hasan S, Saeed S. Monkeypox disease: an emerging public health concern in the shadow of COVID-19 pandemic: an update. Trop Med Infect Dis 2022 Oct 03;7(10):283.
- Bragazzi NL, Kong JD, Mahroum N, Tsigalou C, Khamisy-Farah R, Converti M, et al. Epidemiological trends and clinical features of the ongoing monkeypox epidemic: a preliminary pooled data analysis and literature review. J Med Virol 2023 Jan;95(1):e27931.
- Bragazzi NL, Kong JD, Wu J. Integrated epidemiological, clinical, and molecular evidence points to an earlier origin of the current monkeypox outbreak and a complex route of exposure. J Med Virol 2023 Jan;95(1):e28244.
- Giacomelli A, Moschese D, Pozza G, Casalini G, Cossu MV, Rizzardini G, et al. Route of monkeypox viral inoculum as a determinant of atypical clinical presentation. J Med Virol 2023 Jan;95(1):e28112.
- Simpson K, Heymann D, Brown CS, Edmunds WJ, Elsgaard J, Fine P, et al. Human monkeypox - after 40 years, an unintended consequence of smallpox eradication. Vaccine 2020 Jul 14;38(33):5077-5081.
- Bunge EM, Hoet B, Chen L, Lienert F, Weidenthaler H, Baer LR, et al. The changing epidemiology of human monkeypox-a potential threat? A systematic review. PLoS Negl Trop Dis 2022 Feb;16(2):e0010141.
- Happi C, Adetifa I, Mbala P, Njouom R, Nakoune E, Happi A, O'Toole, et al. Urgent need for a non-discriminatory and non-stigmatizing nomenclature for monkeypox virus. PLoS Biol 2022 Aug;20(8):e3001769.
- Nakazawa Y, Mauldin MR, Emerson GL, Reynolds MG, Lash RR, Gao J, et al. A phylogeographic investigation of African monkeypox. Viruses 2015 Apr 22;7(4):2168-2184.
- Centers for Disease Control and Prevention. Update: multistate outbreak of monkeypox—Illinois, Indiana, Kansas, Missouri, Ohio, and Wisconsin, 2003. MMWR Morb Mortal Wkly Rep 2003 Jul 11;52(27):642-646.
- Erez N, Achdout H, Milrot E, Schwartz Y, Wiener-Well Y, Paran N, et al. Diagnosis of imported monkeypox, Israel, 2018. Emerg Infect Dis 2019 May;25(5):980-983.
- Vaughan A, Aarons E, Astbury J, Balasegaram S, Beadsworth M, Beck CR, et al. Two cases of monkeypox imported to the United Kingdom, September 2018. Euro Surveill 2018 Sep;23(38):1800509.
- Hobson G, Adamson J, Adler H, Firth R, Gould S, Houlihan C, et al. Family cluster of three cases of monkeypox imported from Nigeria to the United Kingdom, May 2021. Euro Surveill 2021 Aug;26(32):2100745.
- Ng OT, Lee V, Marimuthu K, Vasoo S, Chan G, Lin RTP, et al. A case of imported monkeypox in Singapore. Lancet Infect Dis 2019 Nov;19(11):1166.
- Vivancos R, Anderson C, Blomquist P, Balasegaram S, Bell A, Bishop L, UKHSA Monkeypox Incident Management team, Monkeypox Incident Management Team. Community transmission of monkeypox in the United Kingdom, April to May 2022. Euro Surveill 2022 Jun;27(22):2200422.
- DeWitt ME, Polk C, Williamson J, Shetty AK, Passaretti CL, McNeil CJ, et al. Global monkeypox case hospitalisation rates: a rapid systematic review and meta-analysis. EClinicalMedicine 2022 Dec;54:101710.
- 2022 Mpox outbreak global map. Centers for Disease Control and Prevention. URL: https://www.cdc.gov/poxvirus/monkeypox/response/2022/world-map.html [accessed 2023-04-11]
- Thornhill JP, Barkati S, Walmsley S, Rockstroh J, Antinori A, Harrison LB, SHARE-net Clinical Group. Monkeypox virus infection in humans across 16 countries—April-June 2022. N Engl J Med 2022 Aug 25;387(8):679-691.
- Martínez JI, Montalbán EG, Bueno SJ, Martínez FM, Juliá AN, Díaz JS, et al. Monkeypox outbreak predominantly affecting men who have sex with men, Madrid, Spain, 26 April to 16 June 2022. Euro Surveill 2022 Jul;27(27):2200471.
- Bragazzi NL, Kong JD, Wu J. Is monkeypox a new, emerging sexually transmitted disease? A rapid review of the literature. J Med Virol 2023 Jan;95(1):e28145.
- März JW, Holm S, Biller-Andorno N. Monkeypox, stigma and public health. Lancet Reg Health Eur 2022 Dec;23:100536.
- Bragazzi NL, Khamisy-Farah R, Tsigalou C, Mahroum N, Converti M. Attaching a stigma to the LGBTQI+ community should be avoided during the monkeypox epidemic. J Med Virol 2023 Jan;95(1):e27913.
- Xu J, Yu Y, Hu Q, Yan H, Wang Z, Lu L, et al. Treatment-seeking behaviour and barriers to service access for sexually transmitted diseases among men who have sex with men in China: a multicentre cross-sectional survey. Infect Dis Poverty 2017 Jan 18;6(1):15.
- Saeed F, Mihan R, Mousavi SZ, Reniers RL, Bateni FS, Alikhani R, et al. A narrative review of stigma related to infectious disease outbreaks: what can be learned in the face of the COVID-19 pandemic? Front Psychiatry 2020;11:565919.
- Earnshaw VA, Watson RJ, Eaton LA, Brousseau NM, Laurenceau J, Fox AB. Integrating time into stigma and health research. Nat Rev Psychol 2022;1(4):236-247.
- English D, Rendina HJ, Parsons JT. The effects of intersecting stigma: a longitudinal examination of minority stress, mental health, and substance use among Black, Latino, and Multiracial Gay and Bisexual Men. Psychol Violence 2018 Nov;8(6):669-679.
- Link BG, Struening EL, Rahav M, Phelan JC, Nuttbrock L. On stigma and its consequences: evidence from a longitudinal study of men with dual diagnoses of mental illness and substance abuse. J Health Soc Behav 1997 Jun;38(2):177-190.
- Nyblade LC. Measuring HIV stigma: existing knowledge and gaps. Psychol Health Med 2006 Aug;11(3):335-345.
- Grossman CI, Stangl AL. Editorial: global action to reduce HIV stigma and discrimination. J Int AIDS Soc 2013 Nov 13;16(3 Suppl 2):18881.
- Tu T. Stigma: a major barrier to hepatitis B elimination. Nat Rev Gastroenterol Hepatol 2022 Oct;19(10):622.
- Shen K, Yang NS, Huang W, Fitzpatrick TS, Tang W, Zhao Y, et al. A crowdsourced intervention to decrease hepatitis B stigma in men who have sex with men in China: a cohort study. J Viral Hepat 2020 Feb;27(2):135-142.
- Jeyaseelan L, Kumar S, Mohanraj R, Rebekah G, Rao D, Manhart LE. Assessing HIV/AIDS stigma in south India: validation and abridgement of the Berger HIV Stigma scale. AIDS Behav 2013 Jan;17(1):434-443.
- Holzemer WL, Human S, Arudo J, Rosa ME, Hamilton MJ, Corless I, et al. Exploring HIV stigma and quality of life for persons living with HIV infection. J Assoc Nurses AIDS Care 2009;20(3):161-168.
- Heron KE, Lewis RJ, Shappie AT, Dawson CA, Amerson R, Braitman AL, et al. Rationale and design of a remote web-based daily diary study examining sexual minority stress, relationship factors, and alcohol use in same-sex female couples across the United States: study protocol of project relate. JMIR Res Protoc 2019 Feb 04;8(2):e11718.
- Piper K, Enah C, Daniel M. Black southern rural adolescents' HIV stigma, denial, and misconceptions and implications for HIV prevention. J Psychosoc Nurs Ment Health Serv 2014 Jun;52(6):50-56.
- Althouse BM, Scarpino SV, Meyers LA, Ayers JW, Bargsten M, Baumbach J, et al. Enhancing disease surveillance with novel data streams: challenges and opportunities. EPJ Data Sci 2015;4(1):17.
- Bragazzi NL, Dini G, Toletone A, Brigo F, Durando P. Leveraging Big Data for exploring occupational diseases-related interest at the level of scientific community, media coverage and novel data streams: the example of silicosis as a pilot study. PLoS One 2016;11(11):e0166051.
- Robinson P, Turk D, Jilka S, Cella M. Measuring attitudes towards mental health using social media: investigating stigma and trivialisation. Soc Psychiatry Psychiatr Epidemiol 2019 Jan;54(1):51-58.
- Garett R, Smith J, Chiu J, Young SD. HIV/AIDS stigma among a sample of primarily African-American and Latino men who have sex with men social media users. AIDS Care 2016;28(6):731-735.
- Veinot TC, Meadowbrooke CC, Loveluck J, Hickok A, Bauermeister JA. How "community" matters for how people interact with information: mixed methods study of young men who have sex with other men. J Med Internet Res 2013 Feb 21;15(2):e33.
- Betton V, Borschmann R, Docherty M, Coleman S, Brown M, Henderson C. The role of social media in reducing stigma and discrimination. Br J Psychiatry 2015 Jun;206(6):443-444.
- Parrott S, Billings AC, Hakim SD, Gentile P. From #endthestigma to #realman: stigma-challenging social media responses to NBA players’ mental health disclosures. Commun Rep 2020 Aug 30;33(3):148-160.
- Rana TA, Cheah Y, Letchmunan S. Topic modeling in sentiment analysis: a systematic review. J ICT Res Appl 2016 Oct 1;10(1):76-93.
- Hsiang E, Offer C, Prescott M, Rodriguez A, Behar E, Matheson T, et al. Bridging the digital divide among racial and ethnic minority men who have sex with men to reduce substance use and HIV risk: mixed methods feasibility study. JMIR mHealth uHealth 2020 Apr 29;8(4):e15282.
- Maksut JL, Eaton LA, Siembida EJ, Driffin DD, Baldwin R. A test of concept study of at-home, self-administered HIV testing with web-based peer counseling via video chat for men who have sex with men. JMIR Public Health Surveill 2016 Dec 14;2(2):e170.
- Gilbert M, Haag D, Hottes TS, Bondyra M, Elliot E, Chabot C, et al. Get checked… where? The development of a comprehensive, integrated internet-based testing program for sexually transmitted and blood-borne infections in British Columbia, Canada. JMIR Res Protoc 2016 Sep 20;5(3):e186.
- Raheel H, Raheel M, Ali Fahim MA, Naeem U. Monkeypox and spillover effects: stigmas, solutions and strategies. Ann Med Surg 2022 Sep;81:104346.
- de Sousa ÁFL, de Sousa ARD, Fronteira I. Monkeypox: between precision public health and stigma risk. Rev Bras Enferm 2022 Aug 01;75(5):e750501.
- Islam MR, Hasan M, Rahman MS, Rahman MA. Monkeypox outbreak - no panic and stigma; only awareness and preventive measures can halt the pandemic turn of this epidemic infection. Int J Health Plann Manage 2022 Sep;37(5):3008-3011.
- Ortiz-Martínez Y, Sarmiento J, Bonilla-Aldana DK, Rodríguez-Morales AJ. Monkeypox goes viral: measuring the misinformation outbreak on Twitter. J Infect Dev Ctries 2022 Jul 28;16(7):1218-1220.
- Ng QX, Yau CE, Lim YL, Wong LKT, Liew TM. Public sentiment on the global outbreak of monkeypox: an unsupervised machine learning analysis of 352,182 Twitter posts. Public Health 2022 Dec;213:1-4.
- Xu J, Ross NA. Monkeypox Twitter activity: public understanding of transmission dynamics. Skinmed 2022;20(5):394-395.
- Zhang X, Ghorbani AA. An overview of online fake news: characterization, detection, and discussion. Inf Process Manag 2020 Mar;57(2):102025.
- Aldwairi M, Alwahedi A. Detecting fake news in social media networks. Procedia Comput Sci 2018;141:215-222.
- Păvăloaia V, Teodor E, Fotache D, Danileţ M. Opinion mining on social media data: sentiment analysis of user preferences. Sustainability 2019 Aug 17;11(16):4459.
- Butt UM, Letchmunan S, Hassan FH, Ali M, Baqir A, Sherazi HHR. Spatio-temporal crime hotspot detection and prediction: a systematic literature review. IEEE Access 2020;8:166553-166574.
- Aiello AE, Renson A, Zivich PN. Social media- and internet-based disease surveillance for public health. Annu Rev Public Health 2020 Apr 02;41:101-118.
- Ji-Xu A, Htet KZ, Leslie KS. Monkeypox content on TikTok: cross-sectional analysis. J Med Internet Res 2023 Jan 17;25:e44697.
- Dsouza VS, Rajkhowa P, Mallya BR, Raksha D, Mrinalini V, Cauvery K, et al. A sentiment and content analysis of tweets on monkeypox stigma among the LGBTQ+ community: a cue to risk communication plan. Dialogues Health 2023 Dec;2:100095.
- ArcGIS Online. URL: https://www.arcgis.com/index.html [accessed 2022-07-14]
- Rogers DJ, Randolph SE. Studying the global distribution of infectious diseases using GIS and RS. Nat Rev Microbiol 2003 Dec;1(3):231-237.
- Getting started with premium Search Tweets: full-archive API. Twitter Developer Platform. URL: https://developer.twitter.com/en/docs/twitter-api/premium/search-api/quick-start/premium-full-archive [accessed 2022-06-01]
- Choosing a historical API. Twitter Developer Platform. URL: https://developer.twitter.com/en/docs/tutorials/choosing-historical-api [accessed 2022-07-14]
- Monkeypox stigmatization. ACADIC. URL: http://acadic.org/monkeypox-stigmatization [accessed 2022-07-14]
- Facebook Scraper. GitHub. URL: https://github.com/kevinzg/facebook-scraper [accessed 2023-04-14]
- Baiter J, Myers E, Bolt W, Luque A, Wisesight, Nogales R, et al. pycld3. GitHub. URL: https://github.com/bsolomon1124/pycld3 [accessed 2023-03-14]
- Nia ZM, Bragazzi NL, Asgary A, Orbinski J, Wu J, Kong JD. Facebook Mpox Data. GitHub. URL: https://github.com/Jdkong/Facebook_Mpox_Data [accessed 2023-04-11]
- Ethics. AOIR. URL: https://aoir.org/ethics/ [accessed 2023-03-17]
- Codes and guidelines. ESOMAR. URL: https://esomar.org/codes-and-guidelines [accessed 2023-03-17]
- Movahedi NZ, Bragazzi N, Kong J, Wu J. A Twitter dataset for Monkeypox, May 2022. Data in Brief 2023:109118.
- Developer agreement and policy. Twitter Developer Platform. URL: https://developer.twitter.com/en/developer-terms/agreement-and-policy [accessed 2023-03-14]
- Eysenbach G, Till JE. Ethical issues in qualitative research on internet communities. BMJ 2001 Nov 10;323(7321):1103-1105.
- Yan C, Law M, Nguyen S, Cheung J, Kong J. Comparing public sentiment toward COVID-19 vaccines across Canadian cities: analysis of comments on Reddit. J Med Internet Res 2021 Sep 24;23(9):e32685.
- Martins-Filho PR. Increase in interest in sexually transmitted infections on YouTube during the monkeypox outbreak in 2022: a global infodemiology study. Int J Surg 2022 Nov;107:106970.
- Pérez JM, Giudici JC, Luque F. pysentimiento: a Python toolkit for sentiment analysis and SocialNLP tasks. ArXiv Preprint posted online on June 17, 2021.
- Pérez JM, Furman DA, Alemany LA, Luque F. RoBERTuito: a pre-trained language model for social media text in Spanish. ArXiv Preprint posted online on November 18, 2021.
- robertuito-sentiment-analysis. Hugging Face. URL: https://huggingface.co/pysentimiento/robertuito-sentiment-analysis [accessed 2022-07-14]
- Blard T. French sentiment analysis with BERT. GitHub. URL: https://github.com/TheophileBlard/french-sentiment-analysis-with-bert [accessed 2023-04-11]
- Singla RK, Singla S, Shen B. Biased studies and sampling from LGBTQ communities created a next-level social stigma in monkeypox: a public health emergency of international concern (PHEIC). IGJPS 2022;12:205-208.
- Olalla J, de Lomas JMG, Márquez E, González FJ, Del Arco A, De La Torre J, et al. Experience of using an app in HIV patients older than 60 years: pilot program. JMIR mHealth uHealth 2019 Mar 06;7(3):e9904.
- Wu B, Wang L, Wang S, Zeng YR. Forecasting the US oil markets based on social media information during the COVID-19 pandemic. Energy 2021 Jul 01;226:120403.
- Ansari S, Ansari G, Ghori MU, Kazi AG. Impact of brand awareness and social media content marketing on consumer purchase decision. JPVAI 2019 Jul 30;2(2):5-10.
- Chun A, Panchmatia R, Doan Q, Meckler G, Narayan B. Twitter as a knowledge translation tool to increase awareness of the OpenHEARTSMAP psychosocial assessment and management tool in the field of pediatric emergency mental health. Cureus 2022 Aug 02;14(8):e27597.
Abbreviations
2SLGBTQIAP+: Two-Spirit, Lesbian, Gay, Bisexual, Transgender, Queer and/or Questioning, Intersex, Asexual
CDC: Centers for Disease Control and Prevention
LGBTQ: Lesbian, Gay, Bisexual, Transgender, Queer
MPXV: Monkeypox virus
NLP: natural language processing
Edited by A Mavragani; submitted 15.12.22; peer-reviewed by A Ramadona, G Mboowa; comments to author 17.01.23; revised version received 28.03.23; accepted 30.03.23; published 01.05.23
Copyright © Zahra Movahedi Nia, Nicola Bragazzi, Ali Asgary, James Orbinski, Jianhong Wu, Jude Kong. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 01.05.2023.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.