Published on in Vol 27 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/76309, first published .
Information Pathways and Voids in Critical German Online Communities During the COVID-19 Vaccination Discourse: Cross-Platform and Mixed Methods Analysis

Information Pathways and Voids in Critical German Online Communities During the COVID-19 Vaccination Discourse: Cross-Platform and Mixed Methods Analysis

Information Pathways and Voids in Critical German Online Communities During the COVID-19 Vaccination Discourse: Cross-Platform and Mixed Methods Analysis

1Centre for Artificial Intelligence in Public Health Research, Robert Koch Institute, Nordufer 20, Berlin, Germany

2Medical Research Council Centre for Global Infectious Disease Analysis, Imperial College London, London, United Kingdom

3Department of Infectious Diseases, Public Health Service of Amsterdam, Amsterdam, The Netherlands

4Department of Health Sciences, Faculty of Science, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands

5Department of Internal Medicine, Amsterdam UMC Location University of Amsterdam, Amsterdam, The Netherlands

6Amsterdam Institute for Immunology and Infectious Diseases (AI&I), Amsterdam, The Netherlands

7Amsterdam Public Health research institute (APH), Amsterdam, The Netherlands

8Digital Health - Connected Healthcare, Hasso Plattner Institute, University of Potsdam, Potsdam, Germany

Corresponding Author:

Silvan Wehrli, MSc


Background: In Germany, the messaging app Telegram (Telegram FZ-LLC) served as a tool to organize protests against public health measures during the COVID-19 pandemic. A community of diverse groups formed around these protests, which used Telegram to discuss and share views outside of the general public discourse and mainstream information ecosystem. This increasingly included conspiracy theories and extremist content, propagated by sources that opposed the mainstream positions of the government and traditional media. While the use of such sources has been thoroughly investigated, the role of mainstream information in these communities remains largely unclear.

Objective: We aimed to better understand the use of mainstream information, that is, from government actors and established media outlets, within critical Telegram communities in the context of the COVID-19 pandemic in Germany. We focused on the discourse about the COVID-19 vaccination, a key public health measure. As a central element of this study, we compared the Telegram discourse with the discourse on X (formerly Twitter, X Corp) and in online news—this cross-platform analysis aimed to put the results into a broader societal context.

Methods: We analyzed Telegram, X, and news data between 2019 and 2023 for popular topics related to the COVID-19 vaccination discourse. We used a mixed methods approach, including text clustering for the exploration of popular topics, a 2-stage keyword filtering scheme for multitopic classification, link sharing analysis for assessing the prevalence of mainstream information, correlation-based time series analysis for measuring the similarity of discourse dynamics, and thematic analysis to examine the reasons for sharing information.

Results: We identified 5 popular vaccination-related topics that were discussed on both Telegram and X, namely death, long COVID, measures in schools, mandatory vaccination, and virus variants. On average per topic, 58% (SD 5.2%) of Telegram posts and 21% (SD 4.9%) of X posts contained an external link. Of these posts containing external links, 11%‐35% of Telegram posts and 44%‐60% of X posts contained a mainstream link per topic. The correlations for week-to-week changes in discourse intensity between Telegram, X, and online mainstream news ranged from no positive association (coefficient <0.2) to strong positive relationships (coefficient >0.6) per topic. Finally, the thematic analysis resulted in 5 themes describing the usage patterns of mainstream information on Telegram and X. The identified themes are observing news, news comments, directed accusations, participation, and reference in discussion (only X).

Conclusions: Mainstream information sources were part of the information mix within the analyzed critical Telegram communities. However, the role and prevalence of these sources varied. We argue that differences between platforms may indicate the existence of information voids, which pose a particular challenge in managing infodemics. These insights emphasize the importance of contextualized cross-platform analyses for understanding complex information pathways and their potential for targeted crisis communication.

J Med Internet Res 2025;27:e76309

doi:10.2196/76309

Keywords



The COVID-19 pandemic highlighted the critical role of information during public health crises. The spread of the virus, the uncertainty about the outcome of the pandemic, and the rapid changes in everyday life led to a high demand for information worldwide [1,2]. At the same time, this demand was met with a high supply of information of varying quality—a situation referred to as an infodemic [3]. Infodemics may pose significant health threats, such as when individuals become overwhelmed by the sheer volume of information and struggle to make informed health decisions [4]. In addition, the belief in false health information, including both unintentional misinformation and intentional disinformation [3,5], can undermine public health measures like the COVID-19 vaccination, which has saved countless lives in Europe [6,7]. Even though the necessity to understand and tackle infodemics has been widely acknowledged [8,9], it still remains a difficult challenge to quantify and understand infodemics [10]. As global pandemics could become more likely in the future [11,12] and with them public health infodemics, understanding infodemics remains relevant beyond current and past events. The information ecosystem is central to this, that is, the supply and demand of information by various actors and their interplay [13]. In such an ecosystem, different actors may be in (possibly reciprocal) relationships with each other. These relationships form information pathways, that is, the information flow between actors in the ecosystem [14].

During the COVID-19 pandemic, social media was an important information source alongside official announcements from government institutions and traditional media news outlets [2]. However, social media also became the focus of attention for spreading misinformation and disinformation as a medium with the ability to quickly spread information, possibly outside the established information ecosystem from authorities and others [9,15]. This proliferation may be facilitated by information voids, referring to situations where there is insufficient reliable information available on topics of public interest, creating opportunities for unreliable or misleading content to fill the gap [3,16]. Such gaps can arise, for example, when official sources are unable to meet the demand for information, especially in rapidly evolving crises, such as pandemics [3]. Apart from large social media platforms (eg, Facebook, Meta Platforms Inc or X, formerly Twitter, X Corp) [17], the messaging app Telegram (Telegram FZ-LLC) has shown particular potential for spreading false health information through a combination of circumstances [18-20]. Telegram’s channel function allowed individuals to address a large, public audience. In such channels, creators can post content that is accessible to a potentially unlimited number of subscribers, who can join freely [21]. In addition, Telegram did not remove users who had been removed from other platforms, such as Facebook for violating their platform rules [22]. In this context, Telegram is frequently referred to as a fringe platform [20,23-25], that is, a medium that allows almost unrestricted sharing of content, particularly extremist views and opinions that are marginalized by the rest of the public discourse [24]. Mainstream is used as a distinction from such fringe platforms, that is, for the dominant discourse outside of fringe groups [20,24] and social media platforms that restrict and remove extremist content [22,25]. We use this term similarly in this work, with mainstream representing opinions and views widely accepted by the general population. However, the concept of fringe platforms is not clearly defined [24] (other terms such as “alternative social media” are also used for similar concepts [22]). Furthermore, the use of Telegram does not necessarily imply a fringe discourse limited to extreme views outside the mainstream [26,27]. Ultimately, the use of Telegram may also be motivated by its variety of features that users find practical, such as the possibility to have private chats and consume news in 1 app [28].

In Germany, Telegram provided a platform for the organization of the Querdenken (literally “lateral thinking”) protests [20,29,30]. The protesters’ criticism was directed against the public health measures, which were generally supported by the broad public and leading media outlets [31,32], and not being allowed to express their critical positions during the COVID-19 pandemic [33]. The positions held by the protesters thus emerged as an alternative to the dominant public opinion in established, major media and from authorities [29]. The protests expanded across cities in Germany, with offshoots in Austria and Switzerland [20]. An active discourse formed around the protests on Telegram, consisting of a network of politically diverse groups, united by criticism of the pandemic response [33-36]. The discourse on Telegram increasingly attracted attention due to the proliferation of conspiracy theories about COVID-19 and politically extreme content (eg, from the far-right), supported by online sources outside of the mainstream information ecosystem [18,20,24,34,37]. These developments may suggest a general movement away from the mainstream discourse and information ecosystem. At the same time, however, the use of different information sources varied within different Telegram communities [35]. In addition, frequent sharing of articles from major German newspapers within Telegram channels [20,29,34,35] suggests a heterogeneous level of participation in the mainstream discourse. This asks for a differentiated and quantitative consideration of the heterogeneous communities’ interplay, which formed around the Querdenken protests. However, the role and impact of the mainstream information ecosystem on these communities and their relationship to broader societal conversations have remained unclear so far. Understanding this intricate relationship is crucial for the design and implementation of pandemic or infodemic risk communication, given the relevance of mainstream sources to the general public [2]. Ultimately, effective risk communication should reach the entire public.

Using a key example of the COVID-19 vaccination debate, we analyzed the information pathways between the Telegram communities around the Querdenken protests and the mainstream discourse. We focused on the role of information from governmental authorities and large media outlets, as these actors played a central role in the dissemination of information within the mainstream information ecosystem [2,32]. We considered 3 perspectives. First, we compared the prevalence of information from these mainstream sources on Telegram with the discourse on the social media platform X for popular vaccination-related topics, using techniques from natural language processing. Second, we applied time series analysis to the found discourse dynamics on the 2 platforms with regard to the publication of mainstream online news. Third, we examined the role of the shared mainstream information in the discourse on Telegram and X through qualitative data analysis. During these analyses, X served as a contrast to the Telegram discourse: X/Twitter actively tried to moderate misinformation and disinformation on the platform during most of the pandemic and the analyzed time period [22,25,38], respectively. In addition, the COVID-19 discourse on X in the German language was rather balanced, reflecting a broader spectrum of opinion in society [39-41].

The combination of multiple data sources and quantitative and qualitative methods revealed a complex relationship between the discourse on Telegram, on X, and in the mainstream information ecosystem, extending to communities that are particularly critical of the general public discourse. Our study thus emphasizes the potential of cross-platform analytics for understanding information pathways and information behavior during infodemics.


Data

Overview

We included data from public posts on Telegram, posts on X, and news headlines from the database of the Global Data on Events, Location and Tone (GDELT) project [42], all in the German language, between April 2019 and February 2023. The selection of the time period was based on the X data available to us, which covered the shortest period of time.

Telegram

For Telegram, we collected posts from public channels using the Telegram application programming interface, based on the work of Zehring and Domahidi [20], who analyzed a large number of Telegram channels linked to the Querdenken protests. Based on existing research, they first created an initial list of channels associated with the Querdenken protests or the far right. Then, they expanded this list to include channels whose content was frequently forwarded, that is, reposted, in the channels from the first selection. We note that, at the time of data collection (July 2024), 66 of the 558 channels were not accessible anymore (eg, due to deletion of the channel). We preprocessed the data similarly to Zehring and Domahidi [20] for their text analysis to obtain a dataset that was as similar as possible. Specifically, we removed non-German (as classified by fastText [43]) or empty posts and used the same final channel selection. The collected dataset contained original posts (without a reference to another Telegram post) and forwarded posts (or simply, forwards).

X (formerly Twitter)

The X data contained 1% of all public posts in the German language. This 1% sample could be streamed in real time directly from an application programming interface provided by X for the entire post volume on X (ie, for any language). Since this offer was accessible to a broad research community before it was terminated in February 2023 [44], it has served as the basis for numerous research projects [45]. We performed minimal text preprocessing and only refined the data quality through further language identification (using fastText [43]). The dataset contained original posts, replies, quotes, and reposts (formerly retweets).

We present the quantitative analyses below using different approaches to examine content-sharing mechanisms on Telegram and X. For Telegram, we included both original posts and forwards, as forwards represent a core mechanism for providing information within channels to their subscribers [46]. The Telegram dataset was specifically constructed by identifying channels through forward relationships, making forwards integral to understanding the information ecosystem in the studied dataset. For X, we excluded reposts from our analyses, similar to other studies [47,48]. X/Twitter content was susceptible to artificial amplification, for example, through reposts by automated accounts [49,50], as well as algorithmic content selection [51,52]. Excluding reposts allowed us to examine authentic content creation without the potential distortion introduced by artificial or algorithmic amplification. Conceptually, this corresponds to the scope of the study, as we focused on the information pathways, that is, the information flow, from mainstream actors to online platforms and communities, respectively.

News Headlines

The GDELT project [42] tracks news from a comprehensive set of online news sources worldwide to monitor and analyze the global news coverage. The project updates its databases continuously (every 15 minutes), using a dedicated server and crawler infrastructure in collaboration with a range of organizations, including Google News [53-55]. GDELT provides metadata on the crawled news and further analyses (such as sentiment analysis) through different datasets, which have been used by various studies in different scientific fields [54]. We used data from the Global Embedded Metadata Graph, which provides news article metadata on all news articles tracked by GDELT since 2018 [56]. Specifically, we extracted all news headlines in German from this dataset and their associated URLs and timestamps. We note that the extracted dataset contained duplicate URLs and duplicate news headlines (ie, from the same domain but published under different URLs), which we removed for the further analyses.

Analytical Approach

Overview

We adopted an explanatory sequential mixed methods design [57]. Initially, we conducted quantitative data analysis, and the results from this phase informed the subsequent qualitative analysis. We focused on a selection of 5 popular topics related to the vaccination discourse on Telegram and X platforms, as described below.

Quantitative Analyses
Initial Content Filtering

In the first step, we filtered all datasets (Telegram, X, and news headlines) for vaccination-related content. We included all texts containing German words related to vaccination, names of COVID-19 vaccines, or their manufacturers (Table 1). We filtered the posts using a regular expression, which is provided in Listing S1 in Multimedia Appendix 1. We note that this filter considered compound words, different word forms, and hashtags. Based on our observation that both Telegram and X posts often contained inconsistent use of upper- and lowercase letters (eg, only lowercase letters), the filter did not distinguish between different capitalizations. In the course of the further study, we then only used these vaccination-related Telegram, X, and news headline datasets.

Table 1. Keywords used for filtering vaccination-related subtopics and the resulting sizes of data subsets.
DatasetKeywordsTelegram posts (3,489,421)X (formerly Twitter) posts (114,774,512)Mainstream news headlines (6,134,824)
First-level filter (share in all data, %)
VaccinationImpfung, Vakzin (vaccine, vaccination), impfen, vakzinieren (to vaccinate); Bimervax, Comirnaty, Jcovden, Nuvaxovid, Spikevax, Sputnik V, Vaxzevria; AstraZeneca, BioNTech, CanSino, CureVac, Janssen, Johnson & Johnson, Moderna, Novavax, Pfizer, Sinopharm426,142 (12.2)2,135,912 (1.9)129,812 (2.1)
Second-level filter (share in vaccination, %)
DeathTod (death), tot (dead), sterben (to die)61,956 (14.5)116,379 (5.4)1642 (1.3)
Long COVIDLong-COVID, Post-COVID, Post-Vac, Langzeitfolge, Spätfolge (long-term consequence), Langzeitschaden (long-term damage)4017 (0.9)26,106 (1.2)207 (0.2)
Mandatory vaccinationImpfpflicht (mandatory vaccination), Freiheit (freedom), Zwang (compulsion)88,312 (20.7)245,826 (11.5)11,156 (8.6)
Measures in schoolsSchule (school), Schüler (student), Lehrer, Lehrperson (teacher)13,427 (3.2)40,369 (1.9)1949 (1.5)
Virus variantsAlpha, Delta, Omikron (Omicron)9242 (2.2)43,748 (2)2308 (1.8)
Topic Exploration

As a second step, we used the text clustering framework BERTopic [58] to explore popular topics on Telegram and X. BERTopic uses numerical representations of text, derived from language models based on machine learning, as input to unsupervised clustering algorithms. These algorithms then form groups of semantically similar text documents into topics. We manually analyzed random samples to gain a deeper understanding of the topics’ content. We used BERTopic independently for both datasets to avoid skewing the clustering process toward popular X topics, given the different dataset sizes (Table 1). The implementation details regarding BERTopic, such as the choice of language model and other BERTopic model parameters, are provided in Multimedia Appendix 1. At this stage, we also expanded all short links in Telegram and X posts (as preprocessing for the text clustering and in preparation for further analyses), which is also described in Multimedia Appendix 1 (Section 2, with relevant sample code in Listings S2 and S3). Based on the topic exploration, we selected 5 topics from the 10 largest topics on both platforms for further analysis. Table S1 and Figure S1 in Multimedia Appendix 1 provide an overview of the 10 largest topics for both datasets.

Topic Filtering

We observed that posts, particularly on Telegram, frequently contained multiple vaccination-related topics within single posts. To address this, we developed a keyword-based classification scheme that could assign multiple topics to individual texts in a straightforward and interpretable manner. We selected keywords from BERTopic’s automatically generated topic descriptions, supplemented by manual review of random post samples per topic, prioritizing those that would capture the same topical content consistently across all datasets. This 2-stage keyword filter (following initial filtering for vaccination-related content) is comparable to the approach used by Purnat et al [59], who developed a keyword-based taxonomy to classify social media posts from various sources. We have presented the selected keywords in Table 1. We have provided the specific filter implementation, based on regular expressions, in Listing S4 in Multimedia Appendix 1. Again, the filter considered compound words, different word forms, hashtags, and different capitalizations. (Tables S2 and S3 in Multimedia Appendix 1 provide an overview of the overlap between different topics for Telegram and X posts). As a quality measure, we calculated the precision and recall of correctly classified Telegram posts, X posts, and news headlines based on random samples from each of the datasets. We have presented this analysis in Section 3, respectively, Table S4 in Multimedia Appendix 1. For the following analyses, we filtered the Telegram, X, and news headlines datasets using these topic-specific filters.

External Link Sharing

To investigate the prevalence of information from the mainstream discourse on Telegram and X, we measured the frequency of shared links from websites of governmental authorities and large media outlets in posts. To this end, we compiled a list of relevant website domains, with which we could then filter Telegram and X posts. For governmental authorities, we focused on the websites of state and federal governmental institutions. For news outlets, we prioritized nationally recognized privately and publicly owned media based on survey results and user metrics. This selection approximated the mainstream discourse by selecting sources with a particularly high reach in the general population. We provide a detailed explanation of our approach in Listing S1 in Multimedia Appendix 2, including sample code for extracting domains from links. Finally, we used the resulting list to limit the dataset of collected news headlines to the selected mainstream sources.

Time Series Analysis

To further explore the relationship between mainstream information and the discourses on Telegram and X, we analyzed the weekly frequency of posts and news headlines over time (ie, time series) for the selected subtopics. The aim of this analysis was to examine the relative importance of the subtopics on Telegram, X, and in mainstream news over time. The news headlines on specific subtopics served as a proxy for the extent of news reporting and mainstream information supply. This supplemented the previous analysis that did not provide any information about the temporal relationship between the platform-specific discourses. Specifically, we calculated the Pearson correlation coefficients between these time series (cross-correlation), which we selected for its intuitive interpretation and its invariance to scaling (as the analyzed datasets were of different sizes). We used the first-order differences (the changes of post and news volume from 1 week to the next week) to detrend the time series. Conceptually, this cross-correlation analysis allowed us to identify if an increase in the frequency of news headlines, that is, a particularly active discourse in mainstream sources, coincided with an increase in posts on Telegram or X for each of the selected subtopics.

Thematic Analysis

In the final step, we qualitatively analyzed a sample of Telegram and X posts to better understand the reasons for sharing mainstream information. To this end, we used thematic analysis based on Braun and Clarke’s [60,61] framework. Thematic analysis is a qualitative research method used to organize and describe unstructured data through patterns (themes). These patterns reveal meaningful relationships and interpretations within the specific research context. In the context of this study, this meant the contextualization of mainstream information within the platform-specific discourse on Telegram and X. Particularly in connection with the Querdenken protests and the discourse on Telegram, previous research suggested a rather critical to negative view toward information from mainstream sources [29,34]. The aim of our analysis was to gain a more nuanced understanding of this view, also in contrast to the discourse on another platform (X). Thematic analysis thus provided a suitable complement to the quantitative analyses, which provided insights into the how and when of information behavior, but not into the why. Specifically, we randomly sampled equal numbers of unique posts with links to government, public media, and private media websites for each subtopic from the Telegram and X datasets. We analyzed a total of 1200 posts, which is within the range of similar studies in terms of quantity [41,62]. Since the X dataset is significantly larger, we sampled 3 times as many X posts. The initial data analysis was performed by the first author, who read the sampled posts repeatedly, noted interesting observations, and gradually abstracted similar observations into codes. The first author then structured these codes into initial themes, which were successively refined in consultation with the other authors.

Ethical Considerations

All data used in this study were publicly accessible. Public posts on X and Telegram constitute public communication [63], given their specific orientation toward large audiences and unrestricted accessibility [21,64]. We did not analyze any information related to the authors of posts (eg, user IDs or user mentions). Importantly, all results refer to aggregated analyses, that is, were performed on the basis of groups of posts, and no conclusions were drawn about individual users, their behavior, or specific posts. Finally, while we do not publicly share any of the analyzed data, we outline the availability of the used data sources in the “Data Availability” section.


Overview

For the following analyses, we selected 5 popular vaccination-related subtopics based on the initial exploration of topics on Telegram and X. To this end, we filtered all posts using vaccination-related keywords. Then, we used text clustering for topic exploration and developed a keyword filtering scheme for each selected subtopic, with which we filtered the vaccination-filtered datasets for a second time (cf Methods). The selected subtopics, their size, and the associated keywords used for the classification are summarized in Table 1. The keyword filtering considered compound words, different word forms (eg, different declinations), and hashtags. The filtering was case-insensitive. The implemented filter is specified in Multimedia Appendix 1. For Telegram, posts include original posts and forwards. For X, posts include original posts, replies, and quotes.

On Telegram, the subtopic of death was particularly present, with posts discussing deaths as a result of vaccination. By comparison, on X, deaths were also discussed as fatal consequences of COVID-19, with vaccination providing effective protection. Another prominent subtopic on both platforms was mandatory vaccination, in particular, posts discussing the likelihood of its implementation. This topic also included posts discussing the different rights of vaccinated and unvaccinated people as a result of measures relating to vaccination. Telegram posts expressed strong opposition to any form of mandatory vaccination, while X posts also expressed approval for such mandates. These opposing views on vaccination, as a health risk and a threat to personal freedom or as a protective measure and a necessity, were present in all of the analyzed subtopics. While posts on Telegram had an exclusively negative view on vaccinations, the analyzed X posts overall showed a more balanced view between perceived risks and benefits. This tendency was supported by the higher co-occurrence of death and mandatory vaccination with the other subtopics on Telegram compared to X, with rates up to 5 times higher (Tables S2 and S3 in Multimedia Appendix 1). The subtopic of long COVID discussed potential long-term health damages after a COVID-19 infection or COVID-19 vaccine. Telegram posts exclusively linked these effects to vaccination while X discussions also recognized vaccines as a protective measure. Similarly, for the subtopic of virus variants, which was addressed, they questioned the effectiveness of vaccines against emerging variants. Finally, the subtopic of measures in schools discussed the health and social effects of vaccinations, especially the necessity of measures and the relationship between vaccinated and unvaccinated individuals in schools.

Prevalence of Mainstream Information

To assess the relationship of information between the mainstream discourse, that is, the prevalent discourse in the public debate, Telegram, and X, we analyzed the frequency of shared links from 63 domains of government institutions, 12 public media outlets (state ownership), and 47 private media outlets (private ownership) from Germany, Austria, and Switzerland. We selected these sources because of their wide reach among the general population. This included, among others, the websites of public health institutes (eg, Robert Koch Institute), of public broadcasters (eg, Tagesschau), and of newspapers and news magazines (eg, Der Spiegel). We provide a full list of all included sources in Table S1 in Multimedia Appendix 2.

On average, 58% (SD 5.2%) of Telegram posts and 21% (SD 4.9%) of X posts per topic contained at least 1 external link, which is a link that leads to a website outside of the platform from where it was shared. Considering only posts with external links, the prevalence of the examined sources varied per topic (Figure 1). The sum of percentage points in Figure 1 for a specific topic and platform may exceed 100% as Telegram and X posts may contain multiple links. For Telegram, the shares of the examined sources varied from roughly every tenth post (death) to every third post (virus variants). For X, the shares were consistently higher, and for some topics (eg, virus variants), at least half of the posts contained at least one of the examined links. The majority of these shared links pointed to private media outlets, followed by links to public media and government institutions, similarly for Telegram and X. An overview of the most shared domains is provided in Figure S1 in Multimedia Appendix 2 (along with an overview of the most shared top-level domains in Figure S2 in Multimedia Appendix 2). We note that, while posts may contain multiple links, only 0.5% (22/4056) of Telegram posts and 0.2% (20/8348) of X posts linked to 2 or more of the examined categories.

Figure 1. The percentage of posts that contain at least 1 link to information from governmental authorities, public or private media, or to some other source, relative to the number of all posts for Telegram (T) and X with at least one external link (a link that leads to a website outside the source platform).

Discourse Dynamics

We examined how the discourse intensity varied across Telegram, X, and mainstream news over time by calculating Pearson cross-correlations between the weekly number of Telegram and X posts, respectively, and news headlines. The news time series was restricted to headlines from the mainstream sources discussed in the section “Prevalence of Mainstream Information”. We provide detailed results in Tables S1 and S2 in Multimedia Appendix 3. We classified correlations as follows: no association (0.0-0.20), weak (0.20-0.40), moderate (0.40-0.60), and strong (>0.60).

Long COVID showed no association to weak relationships across all datasets, while death-related content correlated weakly. The topic measures in schools showed weak to moderate relationships, and virus variants showed moderate to strong correlations. Mandatory vaccination showed strong associations across all 3 datasets. Notably, correlations were consistently positive and stronger between Telegram and X than between either platform or mainstream news.

Visual inspection of the time series provides insight into the observed correlation patterns (Figure 2). To enable comparison between time series of different magnitudes and focus on relative importance over time, weekly numbers are min-max normalized in Figure 2 (showing values relative to each series’ maximum). The visualization shows a section of the entire time series (April 2019-February 2023). Death-related discourse showed some synchronized peaks in early 2021 across all platforms, consistent with the weak correlations found, as the synchronized activity was concentrated in a relatively brief period. The long COVID discourse appeared scattered and varied across platforms throughout the observation period, with almost no synchronized peaks, explaining the weak correlations observed for this subtopic. The mandatory vaccination discourse remained low in intensity until late 2021, when sudden increases in discourse intensity occurred across all platforms, directly corresponding to the strong correlations found for this topic. Measures in schools displayed periodic synchronized activity primarily in 2021, aligning with the weak to moderate correlations observed. Virus variants showed distinct, sharp synchronized peaks concentrated in mid-2021 and early 2022, with all platforms showing simultaneous activity during these periods, which accounts for the moderate to strong correlations found for this subtopic.

Figure 2. Weekly number of posts, respectively, mainstream news headlines.

In a second set of calculations, we analyzed intraplatform correlations for subtopics on Telegram and X, comparing all posts with those linked to mainstream sources, that is, measuring how mainstream-linked posts aligned with the broader platform discourse. Correlations were positive for all subtopics. Telegram showed a weak relationship for long COVID, moderate relationships for death and measures in schools, and strong relationships for mandatory vaccination and virus variants. Except for long COVID, X showed strong correlations for all subtopics.

Purpose of Sharing Mainstream Information

Using thematic analysis (cf Methods), we developed 5 themes describing the purpose of sharing mainstream information in the Telegram and X discourses (Figure 3). The position on the x-axis in Figure 3 indicates the varying engagement with the shared source for the different usage patterns in the examined posts. Adjacent boxes reflect conceptually related themes. The reported percentages in the boxes report the frequency of the themes in the analyzed data. Percentages may not add up to 100 due to rounding errors and uncategorized posts.

Figure 3. Different themes (rectangular boxes) show the purpose of sharing mainstream information in Telegram and X posts, derived through thematic analysis.
Observing News

It describes a pattern where only the link to an information source is shared. Sometimes, the shared links are accompanied by an excerpt of the shared source (eg, the title, a quote, or a summary); however, the post does not contain a user comment on the shared content.

News Comments

In contrast, news comments describe sharing links along with user-written comments, which refer to the shared information. These comments may be critical and express disapproval of the vaccination and public measures. However, the criticism is not directed at the source and does not explicitly question the source’s credibility. Instead, users may offer personal interpretations, views, or predictions based on the shared information, for instance, on the likelihood of mandatory COVID-19 vaccination. Such comments may also serve to draw attention to specific developments, such as new COVID-19 cases.

Directed Accusations

It refers to posts that accuse the media or governmental authorities and their political representatives of inappropriate news coverage or behavior. A key difference from news comments is that links are shared specifically to criticize the source of the information. For news articles, accusations may be made against the publishing media outlet or the individuals or groups featured in the news reports. We observed a wide range of accusations, motivated by different reasoning. This included conspiracy theories (eg, that the pandemic was planned) and the suspicion of bad intentions (eg, the deliberate attempt to limit personal freedom). But reasoning also included the uncoordinated and thus (seemingly) contradictory behavior of policymakers (eg, different implementation of scientific recommendations), the withholding of information in media reporting to avoid addressing the risks of vaccination, and more generally speaking, the neglect of issues that are perceived as important. For X, accusations also involved demands for more strict public measures.

Participation

Participation, a fourth theme, describes posts that call for active participation in an event or an online petition. Links are mainly used to share additional information or to register in some way. Events included protests and, on X, also information about available vaccination appointments, while online petitions mainly related to petitions against the introduction of mandatory vaccination.

Reference in Discussion

As the last theme, this is exclusive to X. It describes the use of sources for claims or supporting remarks in a thread, that is, as part of a discussion between different users, which is impossible in Telegram channels, as only selected users can post. Examples include news reports on the effectiveness of vaccinations or statistics on case numbers provided online by public authorities.

Finally, we note that it was not always possible to clearly assign each of the analyzed posts to 1 of the 5 themes. In some cases, especially with X posts, the statement made in a post was ambiguous because the context of the posts was unclear. We excluded such posts from the analysis.


Principal Findings

This study compared the German discourse on the COVID-19 vaccination in critical Telegram communities, known for opposing the COVID-19 vaccination, with the discourse on X (formerly Twitter) and in online news. The study focused on the importance of mainstream information, represented by sources from governmental authorities and large media outlets. More precisely, we analyzed information prevalence, discourse dynamics, and the role of shared information for 5 popular vaccination-related subtopics (death, long COVID, mandatory vaccination, measures in schools, and virus variants). Our results show that mainstream sources were part of the Telegram communities’ information mix, although such sources were shared more often on X. The intensity of the discourse across Telegram, X, and online news showed varying degrees of similarity for the different subtopics, ranging from no association to high positive correlation between week-to-week changes. Mainstream information was shared both as credible information within discussions and, especially on Telegram, for criticizing public health measures and accusing the media and government, indicating an ambivalent relationship.

Comparison With Prior Work

Overall, we found that opinions on vaccination in the X discourse were fairly balanced between posts from both vaccination advocates and opponents. This impression is consistent with other studies of X posts in the German language that drew similar conclusions regarding the general vaccination discourse [41] and mandatory vaccination in particular [39]. Studies using English X data found similar tendencies, with vaccination mandates emerging as an important topic in the context of the vaccination-critical discourse, aligning with our observations [48,65].

Regarding the Telegram discourse, we found only posts that criticize or reject vaccinations. Again, this agrees with the findings of other studies [20,34]. This one-sidedness was a consequence of the focus on particularly critical Telegram communities.

With regard to the sharing of mainstream information sources, we found that these played a dominant role in vaccination-related discourse on X (Figure 1). This agrees with a study analyzing information sharing in the German vaccination discourse, where 11 out of the 15 most shared links on X were news sources that were included in our study [40]. When it comes to sharing mainstream information on Telegram, the picture is again similar to that found in other studies. The mainstream sources were frequently shared, but the influence of other information sources was relatively greater [20,29,34], especially in comparison with the results for X.

So far, our results are consistent with previous research. However, other studies focused on the analysis of individual platforms (ie, Telegram or X) and, in the case of Telegram, primarily on the analysis of misinformation and disinformation (eg, conspiracy theories). Our study thus provides new insights, particularly with regard to the role of mainstream information in the analyzed Telegram communities. Specifically, in contrast to other studies [29,34], we found that the use of mainstream sources was not limited to expressing rejection of public health measures or trust only in sources that were critical of these measures. Instead, we found an interconnectedness between mainstream information use and platform-specific discussions on Telegram and X that emphasized the need for information during a pandemic that affected daily life.

This can be seen, in particular, for mandatory vaccination. Here, we observed that both posts on Telegram and on X used mainstream information to track developments regarding the potential introduction of mandatory vaccination. In Germany, the debate on general mandatory vaccination gained momentum, particularly from the end of November 2021, after government politicians spoke out in favor of mandatory vaccination in political discussions and interviews with mainstream media (eg, [66,67]). In April 2022, a draft law was put to a vote with the aim of introducing a mandate, which was rejected [68], bringing the debate to an end. This coincides with the period of highest discourse intensity on Telegram, X, and mainstream online news (Figure 2, high correlations between time series).

However, other topics, particularly long COVID, showed a significantly different dynamic and a continuously ambiguous discourse intensity (Figure 2, low correlations between time series). Here, we observed that Telegram posts were particularly critical of mainstream information sources, which, according to these posts, generally did not address long-term consequences associated with vaccination. Posts on X also expressed concerns about long COVID, but also in connection with COVID-19, instead of vaccination effects.

In this context, our findings suggest that persistent disjointed topic dynamics across platforms may be linked to information voids. When information needs (such as concerns about the consequences of vaccination on Telegram) are not being met by the information in mainstream sources, different communities may turn to different alternative information sources, potentially leading to the observed disjointed discourse.

In the case of long COVID, surveys conducted at the end of 2021 and in April 2022 found that around half of respondents said they were poorly informed about long COVID or would like more information [69,70]. Such information voids are problematic because they can promote the spread of misinformation (ie, by filling the gaps) [3,16]. Similar discrepancies can be observed in other (international) contexts. News coverage on vaccination in English-speaking countries increased with the rollout of COVID-19 vaccines from the end of 2020, along with an increased discourse on X/Twitter in the English language, although the discussed vaccination-related topics only partially overlapped [65,71-73].

These findings highlight the complexity of information engagement patterns and suggest that understanding vaccine hesitancy requires examining not just what information is consumed, but how it is interpreted and used within specific community contexts. Combining previous studies and our results, we suggest that different attitudes toward mainstream information sources existed within the examined Telegram communities. Similarly, vaccine acceptance itself is increasingly recognized as a spectrum encompassing a range of attitudes, often varying among individuals in complex and context-specific ways [74,75]. These beliefs can range from apathy to conspiracy theories, and similar attitudes are often clustered in online discourse [75,76], in line with our findings.

Though vaccine-hesitant groups have been associated with lower-quality information online [77], our results suggest that actors from the mainstream information ecosystem can play an active role in shaping opinions in critical online communities. For example, the influence of media on the public discourse on vaccines was especially evident with the European suspension of AstraZeneca’s vaccine. One study suggested that this decision and subsequent news discourse resulted in a decline in vaccine acceptance across Europe [78].

Trust continues to remain the most important predictor of vaccine uptake, meaning that improving attitudes across media relies on balanced reporting in which accurate information promotes both risks and benefits [74,79]. Overall, the goal must be to strengthen these groups’ trust in the information disseminated within the general public debate [74,79]. Critically, the spectrum of hesitant individuals requires differing and tailored strategies to improve overall vaccine acceptance, including strategies, such as tailoring to cultural backgrounds, health literacy levels, or nationality [74,79]. This raises the question of how to consider group-specific information needs within the general public debate while maintaining a balanced reporting that does not jeopardize the general acceptance of vaccination or other public safety measures.

Limitations and Future Directions

This study has several limitations that may form the basis for further research. First, while we focused on popular vaccination-related topics, the vaccination discourse was complex and included more topics than we investigated. Moreover, we regarded the Telegram communities analytically as a single network, without relating findings to its subcommunities. Future work could, therefore, investigate a broader spectrum of topics in combination with network analysis, which would allow for an even more nuanced breakdown of the relationship of specific Telegram communities to the mainstream discourse [35]. Furthermore, we approximated definitions of mainstream information via source popularity. There are far more media sources than we have included—for instance, in Germany, there are over 300 daily newspapers [80]—and media extends to other digital information sources, such as television. One possibility could be to include national evening television news programs, such as the Tagesschau in Germany [81]. While our selection represents the most popular media sources, a more differentiated examination of the media landscape, in particular how political positions within the mainstream information ecosystem differ, would increase the robustness of the results. Since our study highlighted the benefits of cross-platform analyses, our framework should be extended to other social media platforms to obtain a full picture of the online information behavior, which is essential for the design of effective risk communication. This includes the pathways within social media platforms [14], that is, how information is propagated and amplified within communities, as this study focused mainly on the information flow from mainstream information actors to online communities.


Our study demonstrates the potential of combining different methods to create a multiperspective analysis, revealing a complex information sharing behavior across platforms with different attitudes toward the COVID-19 vaccination. In the context of infodemics, it is essential to understand the pathways and reach of validated as well as potentially harmful information within an information ecosystem. Demand and supply of information may influence each other; for example, a lack of available information in credible sources (information voids) can favor misinformation. The presented cross-platform approach shows how such potential voids can be detected by comparing discourse patterns across platforms. Our results suggest that critical communities may engage with mainstream discourse in nuanced ways, neither fully rejecting nor accepting mainstream sources, but rather contextualizing them in their own discourse. Consequently, effective risk communication must move beyond simple information dissemination to understand how different communities process and use mainstream information. Future pandemic communication strategies should consider these diverse engagement patterns to ensure that risk communication reaches all parts of the public, building trust through targeted approaches that acknowledge varying information needs and contexts.

Acknowledgments

We thank Heiner Stuke for his valuable inputs on the time series analysis. We also thank Julia Neufeld and Nora Katharina Schmid-Küpke for their critical review of the results and their discussion. Finally, we acknowledge the use of the Claude chatbot (Claude 3.7 Sonnet and Claude Sonnet 4, from Anthropic, 2025) for copyediting tasks during the scientific writing process. Specifically, we used Claude to check for grammatical correctness and to improve the clarity of the manuscript. The Robert Koch Institute is the national public health institute of Germany and is an institute within the portfolio of the Federal Ministry of Health. The funder had no involvement in the study design, data collection, analysis, interpretation, or the writing of the manuscript.

AMH acknowledges additional funding from the Bill and Melinda Gates Foundation (grant number INV-034281), previously (OPP1157270/INV-009125) and the Wellcome Trust (Grant Number 226727 Z 22 Z), both via the Vaccine Impact Modelling Consortium. AMH also receives funding from Gavi, the Vaccine Alliance, and the Medical Research Council Centre for Global Infectious Disease Analysis (reference MR/R015600/1), which is jointly funded by the UK Medical Research Council and the UK Foreign, Commonwealth, and Development Office under a concordant agreement, and is also part of the European and Developing Countries Clinical Trials

Data Availability

The X (formerly Twitter) dataset analyzed during this study is available upon request in the Zenodo repository [82,83]. The Telegram dataset collected and analyzed as part of this study is based on channels published by Zehring and Domahidi [20], and interested researchers can access the publicly available data through the Telegram application programming interface (for example, using Telethon [84]). The GDELT dataset analyzed in this study is available on Google BigQuery [85]. We provide the query to extract the relevant data in Multimedia Appendix 4.

Authors' Contributions

SW and CI contributed to conceptualization and validation. SW designed the methodology and conducted all formal analyses. SW, AMH, and CI were responsible for writing the original draft. All authors performed writing–review and editing. CI, BA, and TSB supervised this work.

Conflicts of Interest

None declared.

Multimedia Appendix 1

Keyword filtering and text clustering.

PDF File, 204 KB

Multimedia Appendix 2

Selection of mainstream information sources.

PDF File, 167 KB

Multimedia Appendix 3

Time series analysis.

PDF File, 72 KB

Multimedia Appendix 4

Query for retrieving global data on events, location, and tone data.

PDF File, 58 KB

  1. Nielsen R, Fletcher R, Newman N, Brennen J, Howard P. Navigating the ‘infodemic’: how people in six countries access and rate news and information about coronavirus. 2020. [CrossRef]
  2. Most used sources of reliable news and information for updates on the coronavirus worldwide as of march 2020, by country. Statista. 2020. URL: https://www.statista.com/statistics/1104365/coronavirus-news-sources-worldwide/ [Accessed 2025-01-02]
  3. Briand S, Hess S, Nguyen T, Purnat TD. Infodemic management in the twenty-first century. In: Purnat TD, Nguyen T, Briand S, editors. Managing Infodemics in the 21st Century. Springer; 2023:1-16. [CrossRef] ISBN: 978-3-031-27789-4
  4. Tangcharoensathien V, Calleja N, Nguyen T, et al. Framework for managing the COVID-19 infodemic: methods and results of an online, crowdsourced WHO technical consultation. J Med Internet Res. Jun 26, 2020;22(6):e19659. [CrossRef] [Medline]
  5. Disinformation and public health. WHO. 2024. URL: https://www.who.int/news-room/questions-and-answers/item/disinformation-and-public-health [Accessed 2024-11-27]
  6. Roozenbeek J, Schneider CR, Dryhurst S, et al. Susceptibility to misinformation about COVID-19 around the world. R Soc Open Sci. Oct 2020;7(10):201199. [CrossRef] [Medline]
  7. Watson OJ, Hogan AB. Impact of COVID-19 vaccination programmes in Europe: lives saved and lessons learned. Lancet Respir Med. Sep 2024;12(9):663-664. [CrossRef] [Medline]
  8. Managing the COVID-19 infodemic: promoting healthy behaviours and mitigating the harm from misinformation and disinformation. WHO. 2020. URL: https:/​/www.​who.int/​news/​item/​23-09-2020-managing-the-covid-19-infodemic-promoting-healthy-behaviours-and-mitigating-the-harm-from-misinformation-and-disinformation [Accessed 2025-09-29]
  9. Zarocostas J. How to fight an infodemic. Lancet. Feb 29, 2020;395(10225):676. [CrossRef] [Medline]
  10. Wilhelm E, Ballalai I, Belanger ME, et al. Measuring the burden of infodemics: summary of the methods and results of the fifth WHO infodemic management conference. JMIR Infodemiology. 2023;3:e44207. [CrossRef] [Medline]
  11. Marani M, Katul GG, Pan WK, Parolari AJ. Intensity and frequency of extreme novel epidemics. Proc Natl Acad Sci U S A. Aug 31, 2021;118(35):e2105482118. [CrossRef] [Medline]
  12. Mora C, McKenzie T, Gaw IM, et al. Over half of known human pathogenic diseases can be aggravated by climate change. Nat Clim Chang. 2022;12(9):869-875. [CrossRef] [Medline]
  13. Ishizumi A, Kolis J, Abad N, et al. Beyond misinformation: developing a public health prevention framework for managing information ecosystems. Lancet Public Health. Jun 2024;9(6):e397-e406. [CrossRef] [Medline]
  14. Garibay OO, Yousefi N, Aslett K, et al. Entropy-based characterization of influence pathways in traditional and social media. Presented at: 2022 IEEE 8th International Conference on Collaboration and Internet Computing (CIC); Dec 14-16, 2022:38-44; Atlanta, GA. [CrossRef]
  15. Cinelli M, Quattrociocchi W, Galeazzi A, et al. The COVID-19 social media infodemic. Sci Rep. Oct 6, 2020;10(1):16598. [CrossRef] [Medline]
  16. Aslett K, Sanderson Z, Godel W, Persily N, Nagler J, Tucker JA. Online searches to evaluate misinformation can increase its perceived veracity. Nature New Biol. Jan 2024;625(7995):548-556. [CrossRef] [Medline]
  17. Skafle I, Nordahl-Hansen A, Quintana DS, Wynn R, Gabarron E. Misinformation about COVID-19 vaccines on social media: rapid review. J Med Internet Res. Aug 4, 2022;24(8):e37367. [CrossRef] [Medline]
  18. Hoseini M, Melo P, Benevenuto F, Feldmann A, Zannettou S. On the globalization of the qanon conspiracy theory through telegram. Presented at: WebSci ’23: Proceedings of the 15th ACM Web Science Conference 2023; Apr 30 to May 1, 2023:75-85; Austin, TX. URL: https://dl.acm.org/doi/proceedings/10.1145/3578503 [CrossRef]
  19. Walther S, McCoy A. US extremism on Telegram: dueling disinformation, conspiracy theories, and accelerationism. Perspectives on Terrorism. 2021;15:100-124. URL: https://pt.icct.nl/sites/default/files/2023-04/Article%206_2.pdf [Accessed 2024-10-10]
  20. Zehring M, Domahidi E. German corona protest mobilizers on Telegram and their relations to the far right: a network and topic analysis. Soc Media Soc. Jan 2023;9(1). [CrossRef]
  21. Channels FAQ. Telegram. 2020. URL: https://telegram.org/faq_channels [Accessed 2024-11-26]
  22. Rogers R. Deplatforming: following extreme internet celebrities to Telegram and alternative social media. Eur J Commun. Jun 2020;35(3):213-229. [CrossRef]
  23. Horta Ribeiro M, Hosseinmardi H, West R, Watts DJ. Deplatforming did not decrease Parler users’ activity on fringe social media. PNAS Nexus. Mar 2023;2(3):gad035. [CrossRef] [Medline]
  24. Schulze H, Hohner J, Greipl S, Girgnhuber M, Desta I, Rieger D. Far-right conspiracy groups on fringe platforms: a longitudinal analysis of radicalization dynamics on Telegram. Convergence: The International Journal of Research into New Media Technologies. Aug 2022;28(4):1103-1126. [CrossRef]
  25. Van Dijck J, de Winkel T, Schäfer MT. Deplatformization and the governance of the platform ecosystem. New Media Soc. Dec 2023;25(12):3438-3454. [CrossRef]
  26. Iqbal MZ, Alradhi HI, Alhumaidi AA, et al. Telegram as a tool to supplement online medical education during COVID-19 crisis. Acta Inform Med. Jun 2020;28(2):94-97. [CrossRef] [Medline]
  27. Ng LHX, Loke JY. Analyzing public opinion and misinformation in a COVID-19 Telegram group chat. IEEE Internet Comput. Mar 2021;25(2):84-91. [CrossRef] [Medline]
  28. Lou C, Tandoc Jr. EC, Hong LX, Pong X(, Lye W(, Sng N(. When motivations meet affordances: news consumption on Telegram. Journal Stud. May 19, 2021;22(7):934-952. [CrossRef]
  29. Holzer B. Between protest and parody: structures of “Querdenken” communication on Telegram (and elsewhere). SocArXiv. Preprint posted online on Apr 10, 2021. [CrossRef]
  30. Plümper T, Neumayer E, Pfaff KG. The strategy of protest against COVID-19 containment policies in Germany. Soc Sci Q. Sep 2021;102(5):2236-2250. [CrossRef] [Medline]
  31. In your opinion, are the current Corona measures in Germany appropriate overall, do they go too far, or do they not go far enough? Statista. 2023. URL: https:/​/de.​statista.com/​statistik/​daten/​studie/​1233852/​umfrage/​umfrage-zu-corona-massnahmen-deutschlandtrend/​ [Accessed 2024-12-02]
  32. Maurer M, Gutenberg J. An empirical study on the quality of journalistic reporting on the COVID-19 pandemic. Rudolf Augustein Stiftung; 2021. URL: https:/​/rudolf-augstein-stiftung.​de/​wp-content/​uploads/​2021/​11/​Studie-einseitig-unkritisch-regierungsnah-reinemann-rudolf-augstein-stiftung.​pdf [Accessed 2024-12-10]
  33. Frei N, Schäfer R, Nachtwey O. The protests against the COVID-19 measures. Forschungsjournal Soziale Bewegungen. 2021;34:249-258. [CrossRef]
  34. Almodt R. From criticism to conspiracies: the populist discourse of COVID-19 sceptics in Germany’s Querdenken community on Telegram. Discourse Soc. Jan 2024;35(1):3-26. [CrossRef]
  35. Buehling K, Heft A. Pandemic protesters on Telegram: how platform affordances and information ecosystems shape digital counterpublics. Soc Media Soc. Jul 2023;9(3):20563051231199430. [CrossRef]
  36. Koos S. Reichardt S, editor. Contours of a Heterogeneous “Community of Mistrust”: The Social Composition of the COVID-19 Protests and the Motives of Their Participants. Campus Verlag; 2021:67-90. ISBN: 978-3-593-51458-1
  37. Weigand M, Weber M, Gruber J. Conspiracy narratives in the protest movement against COVID-19 restrictions in Germany. A long-term content analysis of Telegram chat groups. In: Gruber J, editor. Presented at: Proceedings of the Fifth Workshop on Natural Language Processing and Computational Social Science (NLP+CSS); Nov 7, 2022; Abu Dhabi, UAE. [CrossRef]
  38. Twitter ends COVID misinformation policy under musk. BBC. 2022. URL: https://www.bbc.com/news/technology-63796832 [Accessed 2025-01-03]
  39. De Nicola G, Tuekam Mambou VH, Kauermann G. COVID-19 and social media: beyond polarization. PNAS Nexus. Aug 2023;2(8):gad246. [CrossRef] [Medline]
  40. Nogara G, Pierri F, Cresci S, Luceri L, Giordano S. Misinformation and polarization around COVID-19 vaccines in france, germany, and italy. In: Giordano S, editor. 2024. Presented at: Proceedings of the 16th ACM Web Science Conference; May 21-24, 2024:119-128; Stuttgart Germany. URL: https://dl.acm.org/doi/proceedings/10.1145/3614419 [CrossRef]
  41. Schmid-Petri H, Bürger M, Schlögl S, Schwind M, Mitrović J, Kühn R. The multilingual Twitter-discourse on vaccination in Germany during the COVID-19 pandemic. MaC. 2023;11(1):293-305. [CrossRef]
  42. Leetaru K, Schrodt PA. Gdelt: global data on events, location, and tone, 1979–2012. ISA annual convention. 2013. URL: http://data.gdeltproject.org/documentation/ISA.2013.GDELT.pdf [Accessed 2024-08-20]
  43. Joulin A, Grave E, Bojanowski P, Mikolov T. Bag of tricks for efficient text classification. 2017. Presented at: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers; Apr 3-7, 2017; Valencia, Spain. URL: http://aclweb.org/anthology/E17-2 [CrossRef]
  44. Starting february 9, we will no longer support free access to the twitter API, both v2 and v11 a paid basic tier will be available instead. XDevelopers. 2023. URL: https://x.com/XDevelopers/status/1621026986784337922 [Accessed 2024-12-11]
  45. Pfeffer J, Mooseder A, Lasser J, Hammer L, Stritzel O, Garcia D. This sample seems to be good enough! assessing coverage and temporal reliability of twitter’s academic API. Presented at: Proceedings of the Seventeenth International AAAI Conference on Web and Social Media; Jun 5-8, 2023:720-729; Limassol, Cyprus. [CrossRef]
  46. Hoseini M, Melo PDF, Benevenuto F, Feldmann A, Zannettou S. Characterizing information propagation in fringe communities on Telegram. Presented at: Proceedings of the Eighteenth International AAAI Conference on Web and Social Media; Jun 3-6, 2024:583-595; Buffalo, NY. [CrossRef]
  47. Liu S, Li J, Liu J. Leveraging transfer learning to analyze opinions, attitudes, and behavioral intentions toward covid-19 vaccines: social media content and temporal analysis. J Med Internet Res. Aug 10, 2021;23(8):e30251. [CrossRef] [Medline]
  48. Lindelöf G, Aledavood T, Keller B. Dynamics of the negative discourse toward COVID-19 vaccines: topic modeling study and an annotated data set of Twitter posts. J Med Internet Res. Apr 12, 2023;25:e41319. [CrossRef] [Medline]
  49. Shao C, Ciampaglia GL, Varol O, Yang KC, Flammini A, Menczer F. The spread of low-credibility content by social bots. Nat Commun. Nov 20, 2018;9(1):4787. [CrossRef] [Medline]
  50. Doppelganger strikes back: FIMI activities in the context of the EE24. European External Action Service. 2024. URL: https://euvsdisinfo.eu/uploads/2024/06/EEAS-TechnicalReport-DoppelgangerEE24_June2024.pdf [Accessed 2025-07-21]
  51. Dujeancourt E, Garz M. The effects of algorithmic content selection on user engagement with news on Twitter. Inf Soc. Oct 20, 2023;39(5):263-281. [CrossRef]
  52. Bandy J, Diakopoulos N. More accounts, fewer links: how algorithmic curation impacts media exposure in twitter timelines. Proc ACM Hum-Comput Interact. 5(CSCW1):1-28. [CrossRef]
  53. Querying, analyzing and downloading. The GDELT Project. URL: https://www.gdeltproject.org/data.html [Accessed 2025-07-21]
  54. Saz-Carranza A, Maturana P, Quer X. The empirical use of GDELT big data in academic research. Global Governance and the European Union: Future Trends and Scenarios; 2020. URL: https://blog.gdeltproject.org/more-lessons-from-gdelt-3-0s-crawler-architectural-reimagination/ [Accessed 2025-07-21]
  55. The GDELT story. The GDELT Project. URL: https://www.gdeltproject.org/about.html [Accessed 2025-07-21]
  56. Announcing the global embedded metadata graph. The GDELT Project. 2020. URL: https://blog.gdeltproject.org/announcing-the-global-embedded-metadata-graph/ [Accessed 2025-07-21]
  57. Snelson CL. Qualitative and mixed methods social media research: a review of the literature. Int J Qual Methods. 2016;15:1609406915624574. [CrossRef]
  58. Grootendorst M. BERTopic: neural topic modeling with a class-based TF-IDF procedure. arXiv. Preprint posted online on Mar 11, 2022. [CrossRef]
  59. Purnat TD, Vacca P, Czerniak C, et al. Infodemic signal detection during the COVID-19 pandemic: development of a methodology for identifying potential information voids in online conversations. JMIR Infodemiology. 2021;1(1):e30971. [CrossRef] [Medline]
  60. Braun V, Clarke V. Using thematic analysis in psychology. Qual Res Psychol. Jan 2006;3(2):77-101. [CrossRef]
  61. Braun V, Clarke V. Toward good practice in thematic analysis: avoiding common problems and be(com)ing a knowing researcher. Int J Transgend Health. 2023;24(1):1-6. [CrossRef] [Medline]
  62. Fieselmann J, Annac K, Erdsiek F, Yilmaz-Aslan Y, Brzoska P. What are the reasons for refusing a COVID-19 vaccine? A qualitative analysis of social media in Germany. BMC Public Health. Apr 27, 2022;22(1):846. [CrossRef] [Medline]
  63. Eysenbach G, Till JE. Ethical issues in qualitative research on internet communities. BMJ. Nov 10, 2001;323(7321):1103-1105. [CrossRef] [Medline]
  64. About public and protected posts 2025. X Help Center. URL: https://help.x.com/en/safety-and-security/public-and-protected-posts [Accessed 2025-07-19]
  65. Lyu JC, Han EL, Luli GK. COVID-19 vaccine-related discussion on Twitter: topic modeling and sentiment analysis. J Med Internet Res. Jun 29, 2021;23(6):e24435. [CrossRef] [Medline]
  66. The vaccination debate in politicians’ quotes. Der Spiegel. 2021. URL: https:/​/www.​spiegel.de/​politik/​deutschland/​corona-impfpflicht-wie-sich-die-aussagen-der-politik-geaendert-haben-a-75ab35b0-29c8-4547-bf3e-c40fe13dce38 [Accessed 2025-07-25]
  67. Lauterbach: We must start thinking about mandatory vaccination. Die Zeit. 2021. URL: https://www.zeit.de/news/2021-11/22/lauterbach-muessen-anfangen-ueber-impfpflicht-nachzudenken [Accessed 2025-07-25]
  68. Compulsory vaccination against SARS-CoV-2 (Baehrens/Janecek et al. group). Deutscher Bundestag. 2022. URL: https://www.bundestag.de/parlament/plenum/abstimmung/abstimmung?id=767 [Accessed 2025-07-25]
  69. COSMO COVID-19 Snapshot Monitoring. Long COVID. University of Erfurt. 2022. URL: https://projekte.uni-erfurt.de/cosmo2020/web/topic/risiko-emotionen-sorgen/40-longcovid/ [Accessed 2025-07-25]
  70. Betsch C, Wieler LH, Habersaat K, COSMO group. Monitoring behavioural insights related to COVID-19. Lancet. Apr 18, 2020;395(10232):1255-1256. [CrossRef] [Medline]
  71. Christensen B, Laydon D, Chelkowski T, et al. Quantifying changes in vaccine coverage in mainstream media as a result of the COVID-19 outbreak: text mining study. JMIR Infodemiology. 2022;2(2):e35121. [CrossRef] [Medline]
  72. Total COVID-19 vaccine doses administered. Our World in Data. 2024. URL: https:/​/ourworldindata.​org/​grapher/​cumulative-covid-vaccinations?stackMode=relative&country=Africa~Asia~Europe~North+America~Oceania~South+America [Accessed 2025-08-29]
  73. Son H, Park YE. Agenda-setting effects for covid-19 vaccination: insights from 10 million textual data from social media and news articles using BERTopic. Int J Inf Manage. Aug 2025;83:102907. [CrossRef]
  74. Ruggeri K, Vanderslott S, Yamada Y, et al. Behavioural interventions to reduce vaccine hesitancy driven by misinformation on social media. BMJ. Jan 16, 2024;384:e076542. [CrossRef] [Medline]
  75. Moore R, Purvis RS, Willis DE, et al. The vaccine hesitancy continuum among hesitant adopters of the COVID-19 vaccine. Clin Transl Sci. Dec 2022;15(12):2844-2857. [CrossRef] [Medline]
  76. Mønsted B, Lehmann S. Characterizing polarization in online vaccine discourse-A large-scale study. PLoS ONE. 2022;17(2):e0263746. [CrossRef] [Medline]
  77. Rathje S, He JK, Roozenbeek J, Van Bavel JJ, van der Linden S. Social media behavior is associated with vaccine hesitancy. PNAS Nexus. Sep 2022;1(4):gac207. [CrossRef] [Medline]
  78. Petersen MB, Jørgensen F, Lindholt MF. Did the European suspension of the AstraZeneca vaccine decrease vaccine acceptance during the COVID-19 pandemic? Vaccine (Auckl). Jan 28, 2022;40(4):558-561. [CrossRef] [Medline]
  79. Enria L, Dwyer H, Marchant M, et al. Political dimensions of misinformation, trust, and vaccine confidence in a digital age. BMJ. Jun 20, 2024;385:e079940. [CrossRef] [Medline]
  80. Newspapers and magazines. Facts about Germany. URL: https://www.tatsachen-ueber-deutschland.de/en/media-and-communication/newspapers-and-magazines [Accessed 2025-01-06]
  81. Schirmag T, Wedemeyer JH, Stechemesser A, Wenz L. Neural topic modeling reveals German television’s climate change coverage. Commun Earth Environ. 2025;6(1):441. [CrossRef]
  82. Kratzke N. Monthly samples of German tweets (2019 - 2022). Zenodo. 2023. URL: https://zenodo.org/records/7528718 [Accessed 2025-09-29]
  83. Kratzke N. Monthly samples of German tweets (2023). Zenodo; 2023. URL: https://doi.org/10.5281/zenodo.7708787 [Accessed 2025-09-29]
  84. Telethon. GitHub. URL: https://github.com/LonamiWebs/Telethon [Accessed 2024-06-15]
  85. The GDELT project (ID GDELT-bq.gdeltv2). Google BigQuery. 2024. URL: https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1sgdelt-bq!2sgdeltv2 [Accessed 2025-08-24]


GDELT: Global Data on Events, Location, and Tone


Edited by Amaryllis Mavragani; submitted 30.Apr.2025; peer-reviewed by Hyunsang Son, Markus Reiter-Haas; final revised version received 03.Sep.2025; accepted 06.Sep.2025; published 17.Oct.2025.

Copyright

© Silvan Wehrli, Anna-Maria Hartner, T Sonia Boender, Bert Arnrich, Christopher Irrgang. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 17.Oct.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.