Understanding the Engagement and Interaction of Superusers and Regular Users in UK Respiratory Online Health Communities: Deep Learning–Based Sentiment Analysis

doi:10.2196/56038

Original Paper

¹School of Business and Management, Queen Mary University of London, London, United Kingdom

²Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy

³School of Medicine, University of Nottingham, Nottingham, United Kingdom

⁴Wolfson Institute of Population Health, Asthma UK Centre for Applied Research, Queen Mary University of London, London, United Kingdom

⁵See Acknowledgments

*these authors contributed equally

Corresponding Author:

Xiancheng Li, PhD

School of Business and Management

Queen Mary University of London

Mile End Road, Bethnal Green

London, E14NS

United Kingdom

Phone: 44 2078825555

Email: x.l.li@qmul.ac.uk

Background: Online health communities (OHCs) enable people with long-term conditions (LTCs) to exchange peer self-management experiential information, advice, and support. Engagement of “superusers,” that is, highly active users, plays a key role in holding together the community and ensuring an effective exchange of support and information. Further studies are needed to explore regular users’ interactions with superusers, their sentiments during interactions, and their ultimate impact on the self-management of LTCs.

Objective: This study aims to gain a better understanding of sentiment distribution and the dynamic of sentiment of posts from 2 respiratory OHCs, focusing on regular users’ interaction with superusers.

Methods: We conducted sentiment analysis on anonymized data from 2 UK respiratory OHCs hosted by Asthma UK (AUK), and the British Lung Foundation (BLF) charities between 2006-2016 and 2012-2016, respectively, using the Bio-Bidirectional Encoder Representation from Transformers (BioBERT), a pretrained language representation model. Given the scarcity of health-related labeled datasets, BioBERT was fine-tuned on the COVID-19 Twitter Dataset. Positive, neutral, and negative sentiments were categorized as 1, 0, and –1, respectively. The average sentiment of aggregated posts by regular users and superusers was then calculated. Superusers were identified based on a definition already used in our previous work (ie, “the 1% users with the largest number of posts over the observation period”) and VoteRank, (ie, users with the best spreading ability). Sentiment analyses of posts by superusers defined with both approaches were conducted for correlation.

Results: The fine-tuned BioBERT model achieved an accuracy of 0.96. The sentiment of posts was predominantly positive (60% and 65% of overall posts in AUK and BLF, respectively), remaining stable over the years. Furthermore, there was a tendency for sentiment to become more positive over time. Overall, superusers tended to write shorter posts characterized by positive sentiment (63% and 67% of all posts in AUK and BLF, respectively). Superusers defined by posting activity or VoteRank largely overlapped (61% in AUK and 79% in BLF), showing that users who posted the most were also spreaders. Threads initiated by superusers typically encouraged regular users to reply with positive sentiments. Superusers tended to write positive replies in threads started by regular users whatever the type of sentiment of the starting post (ie, positive, neutral, or negative), compared to the replies by other regular users (62%, 51%, 61% versus 55%, 45%, 50% in AUK; 71%, 62%, 64% versus 65%, 56%, 57% in BLF, respectively; P<.001, except for neutral sentiment in AUK, where P=.36).

Conclusions: Network and sentiment analyses provide insight into the key sustaining role of superusers in respiratory OHCs, showing they tend to write and trigger regular users’ posts characterized by positive sentiment.

J Med Internet Res 2025;27:e56038

doi:10.2196/56038

Keywords

social media; online health communities; social network analysis; sentiment analysis; bio-bidirectional encoder representations from transformers; asthma; chronic obstructive pulmonary disease

Background

Online health communities (OHCs) have been increasingly explored in recent years as a means of enabling people with long-term conditions (LTCs) to exchange peer self-management support [1-3]. Such communities offer an easily accessible and cost-effective means of sharing experiences, exchanging information, and providing mutual support to one another [4,5]. Participation in OHCs for individuals living with LTCs could address part of the health care service demand and indirectly improve access to health care [6]. The analysis of the role of OHCs in health promotion and management of LTCs indicates that there might be a positive effect on patients’ perception of social support, health literacy, clinical outcomes, and behavior change [7,8]. Furthermore, the involvement of patients in these OHCs can improve their engagement with respect to their care and their ability to self-manage, their mental health outcomes [9], and contribute to health equity [3]. However, despite the growing popularity of OHCs, there is still much we do not know about how these communities function [10]. Moreover, the specific nature of regular users’ interaction with the so-called superusers—that is, individuals who frequently engage with the community—and the extent to which it supports self-management remains largely unknown [7].

Recent social network analysis performed on 2 active respiratory OHCs has suggested that superusers play a critical role in holding together the community and ensuring timely exchange of support and information [5,10]. These superusers have been shown to contribute more content to the community, initiate more interactions, and respond more often to other users’ queries than regular members [8,9]. From a topological point of view, their characteristics are similar to those of hubs, that is, nodes with a disproportionally large number of connections compared to other nodes in the network.

Across a variety of empirical domains, it has been documented that hubs are valuable resources that help facilitate the spread of information widely and amplify information cascades [11], for example, help design effective vaccination campaigns and selective immunization strategies against disease diffusion and epidemics [12,13] and help improve the system’s robustness and vulnerability to random failures [14]. However, some hub identification approaches can be very time-consuming and suffer from the possibility that spreaders are so close together that they overlap the sphere of influence. In this context, VoteRank is a simple iterative method to identify a set of decentralized spreaders with the best spreading ability [15]. In this approach, all nodes vote in a spreader in each turn, and the voting ability of neighbors of the elected spreader will be decreased in the subsequent turn. It is, therefore, an effective solution for identifying possible nonoverlapping superusers.

However, the analysis of the network topology alone is not sufficient to fully understand the interactions between regular users and superusers and their impact on the whole community. For this reason, it is necessary to analyze the content of posts and what relationships (if any) exist between the 2 groups with respect to how they react to each other’s content. Sentiment analysis (SA), that is, a subfield of natural language processing provides an understanding of the sentiment of posts and whether there is a cause-and-effect relationship between posts in a thread. This approach consists of analyzing digital text to determine its polarity, that is if the emotional tone of the message is positive, negative, or neutral. SA can create structured and actionable knowledge from unstructured text for decision makers [16] in different fields, from marketing to politics and health [17]. In particular, with respect to the health domain, a variety of works have used SA techniques (both lexicon-based and semantic-based) [18] in recent years for different health conditions, for example, assessing the degree of psychological distress linked to COVID-19 [19,20], evaluate the risk of alcoholism in particular categories of users [21], analyze the emotional state of users with diabetes [22], or the role digital platforms in mediating health-related support with respect to specific cancer drugs [23]. In most of these works, a distinction is not made between categories of users, their interactions, and their role in OHCs.

This work is part of a research program that will eventually test whether promoting engagement in OHC improves self-management and clinical outcomes [24]. The primary motivation of this study is to investigate the engagement patterns of different user types, particularly superusers and regular users, and how their interactions influence the overall sentiment of posts. Our hypothesis is that superusers play a pivotal role in community cohesion, offering immediate access to a support network for self-management, as well as emotional and illness-related support. By doing so they foster positive sentiment among regular users, which subsequently may mediate improvements in self-management behaviors [25,26]. By understanding these dynamics, we aim to provide insights that can enhance the effectiveness of OHCs.

Using a semantic approach, this study aims to explore the sentiment of posts in 2 dynamic and active respiratory OHCs; in doing this, regular user interactions with superusers are assessed, in order to shed light on the impact of such interactions on users’ sentiment and which may ultimately impact on the self-management of their LTCs. Specifically, we investigate the sentiments of both regular users and superusers expressed in these interactions as well as their patterns over time. Additionally, we aim to compare the sentiment of superusers’ posts, with superusers defined in 2 different ways, one with emphasis on high-posting activity and the other on high-spreading ability, to verify whether they display similar characteristics or represent indeed the same population.

By shedding light on these critical aspects, this study contributes to understanding the mechanisms underlying the effectiveness of OHCs as a tool to facilitate self-management and provides insights into how respiratory OHCs may meet the needs of their users.

Data

As described in our previous work [10], data were collected by HealthUnlocked [27], the platform provider of the Asthma UK (AUK) and British Lung Foundation (BLF) communities. In both communities, registered users can choose to either write posts publicly or send private posts to one another. In the latter case, posts are shared between 2 users only, whereas when posts are written publicly, other users can become connected through threads of posts. For this study, only posts that were shared publicly were considered. Our datasets were stored and analyzed in a Safe Haven space, that is, a secured database held by Queen Mary University. Anonymized user IDs were provided by HealthUnlocked, and no demographic information was available. The datasets included posts and their metadata including the date of posting, the hierarchical level of the post within the corresponding thread, and the dates in which the users joined and left the community. No data were collected on participants’ characteristics, though only people declaring themselves to be older than 16 years of age were permitted to create an account and take part in OHCs.

Six different types of data associated with the corresponding user actions were collected for each user including (1) posts followed, (2) users followed, (3) likes, (4) level-0 posts (ie, posts starting new threads), (5) level-1 replies (ie, replies to the level-0 posts), and (6) level-2+ replies (ie, replies to other replies beyond level 2). The original datasets consisted of 32,780 data items associated with 3345 users from 2006 to 2016 for AUK, and 875,151 data items associated with 19,837 users from 2012 to 2016 for BLF. Since in this study, we are interested in analyzing only the textual content associated with posts, and some of them turned out to be without any content, they were removed from the datasets. The final datasets, therefore, contained 12,413 and 369,224 posts for AUK and BLF, respectively. In 2015, HealthUnlocked took over the AUK forum, leading to substantial increases in posting activity and volume of users. Further details are provided in the posting activity section of our previous work [10].

Study Design

Superusers were first identified using two different methodological approaches: (1) their posting activity and (2) their spreading ability. Next, SA was applied first to all posts and then to interactions between superusers and regular users.

Identification of Superusers

Two ways of identifying superusers were considered. The first method was based on identifying the “top 1% of users characterized by the largest number of posts written in the community over the entire observation period,” as previously described in a study by Joglekar et al [10]. The second method approximates being a “spreader” to being a superuser, according to the VoteRank algorithm [15]. This algorithm is implemented in Python (Python Software Foundation) in the NewtworkX package [28]. The VoteRank algorithm finds the top-ranked nodes as spreaders according to an influence ranking. The idea behind VoteRank’s rank is to choose a set of spreaders one by one according to the voting scores of nodes obtained from the neighbors. The node that gets the most votes in each turn is selected as the most influential node. It is an iterative method where at the beginning all nodes take part in ranking their neighbors. However, when a node is identified as a spreader, it will no longer take part in subsequent iterations and neighboring nodes will have a penalty, so that nodes that are not significant in the transmission of information but exploit proximity to the influencing nodes are not considered as spreaders. To make a fair comparison, we identified and compared the same number of superusers according to the 2 definitions. This was achieved by picking the k top-ranked spreaders by VoteRank, where k is the number of superusers according to the “top 1%” definition.

Sentiment Analysis

SA was carried out by means of a semantic approach based on bidirectional encoder representations from transformers (BERT). BERT is a contextualized word representation pretrained language model [29,30]. Its architecture is a multilayer bidirectional transformer encoder based on the original transformer implementation. Vaswani et al [31] shows further details about the transformer architecture.

BERT is built in 2 steps: pretraining and fine-tuning. The model is trained first on unlabeled data over different pretraining tasks. Later, the model with the pretrained parameters is fine-tuned using labeled data from downstream tasks. Every task has a separate fine-tuned model. However, a unique characteristic of BERT is that it has a unified architecture across different tasks, so the difference between pretrained architecture and the final downstream is small. We used BioBERT (Bio-Bidirectional Encoder Representation from Transformers), which is a pretrained language representation model for the biomedical domain [32].

Fine-Tuning BioBERT for SA

BioBERT was fine-tuned using a COVID-19 Twitter Dataset [33] taken from Kaggle [34]. The choice was opportunistic as there were few open datasets related to health. The COVID-19 Twitter Dataset has a total of 143,903 usable records, with labels associated with neutral, positive, and negative posts. Numeric values have then been associated with sentiment: 0 for neutral, 1 for positive, and –1 for negative posts. Examples of posts with sentiment labels are given in the Multimedia Appendix 1. Normalization and stop-words removal were performed on both datasets.

The considered SA workflow performed on AUK and BLF is illustrated in Figure 1. The BioBERT model was initialized by using the standard configurations and weights from the Hugging Face repository [35]. As a first step, we converted the data to sequences, adding the tags to indicate where the sentence starts and its separator. Then, we tokenized the resulting sequences with the BioBERT tokenizer creating a tensor dataframe of 128 characters. The dataset was then split into training and validation sets, with 85% observations for training and 15% observations for validation. The Adam function was used as an optimizer [36], the sparse categorical cross-entropy as loss function [37], and the sparse categorical accuracy as a metric for training [38]. The training was performed on just 2 epochs with a batch size of 32. Google Colab Pro was used to fine-tune BioBERT [32]. After the fine-tuning, weights, and result configuration files were stored.

**Figure 1.** Sentiment analysis workflow. BioBert: Bio-Bidirectional Encoder Representation from Transformers.

SA on AUK and BLF

By using the earlier fine-tuned model, we performed SA on AUK and BLF data, including all posts (it is important to note that in this paper when we talk about posts, we also generally include replies; when we consider only replies, we refer to them directly). We performed the following analyses.

Average Sentiment

Average sentiment scores (AVSs) for both regular users’ and superusers’ posts have been computed separately. As sentiment labels are associated with numeric values (ie, 0 to neutral, 1 to positive, and –1 to negative), the average values of the sentiment of a set of posts can be computed and used to capture the general sentiment expressed in those posts. Specifically, the AVSs range in the [–1, 1] interval, where positive values and getting closer to 1 indicate increasing positive sentiment, while negative values and getting closer to –1 indicate increasing negative sentiment. We considered different aggregations of content types as follows: (1) , the AVS of all posts of a set of users in v; that is, either regular users (U) or superusers (S); (2) , the AVS of level-0 posts of users in v; (3) , the AVS of all replies given by users in v; (4) , the AVS of level-1 replies given by users in v; and (5) , the AVS of level-2+ replies given by users in v.

Along with the AVSs, we also computed the percentage of posts with different sentiments to show the distribution of sentiment expressed in users’ posts. To investigate the trend of the average sentiment over time, we sorted all posts based on their publication time and regrouped them into 15 bins with an equal volume, for which we computed AVSs. We did not aggregate posts based on the month and year of publication because the number of available data was too small in AUK before 2015 (when HealthUnlocked took over the forum), and because the period of analysis in the 2 communities was different. 2-tailed t tests were used for statistical significance when comparing 2 AVSs of different users.

User-Superuser Interaction Sentiment

These analyses investigated the interactions between regular users and superusers. We addressed the following.

Regular users’ and superusers’ sentiments when replying to each other: To do this, we first identified threads started by regular users and superusers, denoted as TU and TS, and calculated the AVS of replies written by the other group of users. Specifically, we computed , which denotes the average sentiment of regular users in reply to superusers' initiated threads; and , which denotes the average sentiment of superusers in reply to regular users' initiated threads.
Regular users’ sentiments when replying to other regular users and superusers: Here, we focused on regular users’ replies only and checked whether they acted differently when replying to other regular users or superusers. Specifically, we compared with respect to which denotes the average sentiment of regular users in replying to other regular users' threads. A similar approach to that introduced earlier was used to analyze the trend in the value of over time.
Superusers’ sentiments when replying to regular users: In this case, we only took into consideration the sentiment of the replies superusers give to regular users. Note that superusers also interact with each other, but the study of these interactions is beyond our research questions and is not shown here. We investigate the sentiment of superusers in reply to positive, negative, and neutral level-0 posts of regular users. To do this, we computed , which denotes the average sentiment of superusers in replying to regular users’ positive level-0 posts; , which denotes the average sentiment of superusers in replying to regular users' negative level-0 posts; and , which denotes the average sentiment of superusers in replying to regular users’ neutral level-0 posts.

Regular users’ replies to other regular users’ level-0 posts are used as a baseline and compared with AVSs of superusers. This analysis assesses in more detail the superusers’ tendency to act as help-givers [10], especially when regular users express negative sentiments. Along with the AVSs, we also computed the percentages of posts with different sentiments to show the distribution of sentiment expressed in users’ posts. The 2-sample z tests for proportions were used for statistical significance when comparing 2 percentages of different users.

A similar approach to that introduced earlier was used to analyze the trend in these percentages over time.

Ethical Considerations

The study was approved by the Queen Mary University Research Ethics Committee (QMERC 22.279). The research team did not have access to personally identifiable information. The data was anonymized to ensure user privacy, and no demographic information was included in the analysis. The posts analyzed were publicly available, with the users having consented to their use for analytical purposes by choosing to share them publicly. In addition, the research protocol was examined and permission to undertake the research was obtained by AUK and BLF charities, and as well as HealthUnlocked. To further protect the privacy of the users, no posts are directly quoted.

Overview

The ratio of distinct actions over all actions performed with respect to both OHCs is shown in Figure 2. In AUK, the most common action was to “follow” users, while for BLF was to “like” posts or replies. In both communities, the action of generating level-0 posts, that is, starting a new thread, was smaller than generating both level-1 and level-2+ replies, showing that users mostly communicated through replies.

**Figure 2.** Distribution of different actions in distinct communities. (A) Asthma UK (AUK); (B) British Lung Foundation (BLF).