Published on in Vol 19, No 3 (2017): March

Longitudinal Changes in Psychological States in Online Health Community Members: Understanding the Long-Term Effects of Participating in an Online Depression Community

Longitudinal Changes in Psychological States in Online Health Community Members: Understanding the Long-Term Effects of Participating in an Online Depression Community

Longitudinal Changes in Psychological States in Online Health Community Members: Understanding the Long-Term Effects of Participating in an Online Depression Community

Authors of this article:

Albert Park1 Author Orcid Image ;   Mike Conway1 Author Orcid Image

Original Paper

Department of Biomedical Informatics, School of Medicine, University of Utah, Salt Lake City, UT, United States

Corresponding Author:

Albert Park, PhD

Department of Biomedical Informatics

School of Medicine

University of Utah

421 Wakara Way #140

Salt Lake City, UT, 84108-3514

United States

Phone: 1 206 743 7843

Fax:1 801 581 4297

Email: alpark1216@gmail.com


Background: Major depression is a serious challenge at both the individual and population levels. Although online health communities have shown the potential to reduce the symptoms of depression, emotional contagion theory suggests that negative emotion can spread within a community, and prolonged interactions with other depressed individuals has potential to worsen the symptoms of depression.

Objective: The goals of our study were to investigate longitudinal changes in psychological states that are manifested through linguistic changes in depression community members who are interacting with other depressed individuals.

Methods: We examined emotion-related language usages using the Linguistic Inquiry and Word Count (LIWC) program for each member of a depression community from Reddit. To measure the changes, we applied linear least-squares regression to the LIWC scores against the interaction sequence for each member. We measured the differences in linguistic changes against three online health communities focusing on positive emotion, diabetes, and irritable bowel syndrome.

Results: On average, members of an online depression community showed improvement in 9 of 10 prespecified linguistic dimensions: “positive emotion,” “negative emotion,” “anxiety,” “anger,” “sadness,” “first person singular,” “negation,” “swear words,” and “death.” Moreover, these members improved either significantly or at least as much as members of other online health communities.

Conclusions: We provide new insights into the impact of prolonged participation in an online depression community and highlight the positive emotion change in members. The findings of this study should be interpreted with caution, because participating in an online depression community is not the sole factor for improvement or worsening of depressive symptoms. Still, the consistent statistical results including comparative analyses with different communities could indicate that the emotion-related language usage of depression community members are improving either significantly or at least as much as members of other online communities. On the basis of these findings, we contribute practical suggestions for designing online depression communities to enhance psychosocial benefit gains for members. We consider these results to be an important step toward a better understanding of the impact of prolonged participation in an online depression community, in addition to providing insights into the long-term psychosocial well-being of members.

J Med Internet Res 2017;19(3):e71

doi:10.2196/jmir.6826

Keywords



Social media, including online health communities, has become a popular platform for individuals to exchange social support and connect with others [1]. Research on the benefits of online health communities highlights psychosocial benefits—such as reduced depression [2,3], anxiety [3,4], stress [3], and negative mood [5]—as well as the development of a higher level of coping [6] and patient empowerment [2,6,7]. However, participants in these studies suffered from mental disorders (eg, depression, anxiety) as secondary symptoms and were contributing to health-related forums focusing on, for example, cancer. Although psychosocial benefits of online health communities have been widely studied, it is unclear whether the members of mental health communities would receive similar benefits. Thus, we studied the longitudinal changes in psychological states in members of a mental health community.

Online mental health communities can be a useful medium for individuals suffering from mental disorders [8], due to social stigma and discrimination surrounding such disorders [9-11]. However, in contrast to the substantial work focusing on the benefits of online health communities, emotional contagion theory [12] suggests that prolonged interactions with depressed individuals and their negative emotion can worsen the symptoms of depression. Emotional contagion theory is a type of emotional influence that describes the spread of one person’s emotion to other people during social encounters.

Scientific evidence in support of emotional contagion theory in online health communities can be illustrated in two concise assertions. First, individuals’ inner emotions have been shown to manifest in their choice of words in writing [13], including textual conversation within online health communities [14]. Second, both negative [15] and positive emotions [16] have been shown to spread even through text-based computer-mediated communication (eg, email, chat) as well as through a textual alert system in social media [17].

In the aforementioned study of spreading positive or negative emotion via a textual alert system in social media [17], findings suggest that individuals’ emotions had spread throughout their social community. Moreover, both short-lived moods evinced in response to outside stimuli and long-term emotional states (ie, general emotional state) of community members can affect the emotions of the community. For example, in an empirical study, members of mental health communities were shown to have significant increases in anxiety, anger, and negative emotions following reports of several celebrity suicides [18]; and both positive and negative long-term emotional states slowly spread across socially connected individuals in a social network [19]. Due to this effect, researchers have suggested that interaction with other depressed individuals is a potential risk factor for depression [20,21].

Negative sentiment has been shown to be highly prevalent and more common than positive sentiment in mental health communities, including a depression community [22]. Thus, members of depression communities are likely to be exposed to other members’ negative emotions. However, as members of online health communities, they are also likely to receive emotional support (ie, positive emotion) [23,24]. Thus, the extent to which one of the effects (ie, spreading of negative emotion or positive emotional support) has a greater impact on long-term participants in an online depression community is of interest. Moreover, understanding the impact of long-term participation in online mental health communities can suggest practical implications related to using, managing, designing, and researching online mental health communities that can improve the experience of online mental health community members who seek essential social support during times of difficulty. Despite the importance in clinical, practical, and public policy implications for mental health, to our knowledge, the investigation on the change of emotional states via linguistic change in depression community members has not been the focus of previous research on online mental health communities.

We aim to fill this gap in the literature with this study and answer two research questions (RQ):

RQ1. How does prolonged participation in an online depression community affect the emotional states of its members?

RQ2. How does this change in emotional states compare with other online health communities? As a control, we selected online health communities focusing on (1) positive emotion, (2) health conditions recognized for their needs for both informational and emotional support, and (3) chronic but non–life-threatening health issues that are not often related to mental disorders.


Community Platform

The data for this study consist of posts (ie, a submission that starts a conversation) and associated comments (ie, a submission that replies to posts or other comments) from several topical focused subreddits (ie, subcommunities). All subreddits found in this study are hosted on the popular social media platform, Reddit. Reddit is a social media platform that had 83 billion page views from more than 88,000 active subcommunities (subreddits) in 2015. Reddit content is accessible on the open Web. This can be contrasted with services like Facebook or specifically health-focused online community like PatientsLikeMe, where data are typically not available on the open Web, but requires user registration to access content. Members of Reddit made more than 73 million individual posts with over 725 million associated comments in the same year [25]. Reddit is highly active and the community allows posters to create submissions using throwaway and unidentifiable accounts. These features are suitable for stigmatized conditions and their discussions that may not be appropriate for identifiable accounts. Due to these two reasons, we examined posts and comments from Reddit for this study.

Subreddit Selection

The community r/depression, to our knowledge, is the largest and most active depression focused subreddit. It has been active for 7 years with 145,821 subscribers [26] in September 2016. Thus, we selected r/depression as the main community of interest for investigating the effects of prolonged participation in an online depression community to the emotional states of members (RQ1).

To understand the significance of the change of emotional states of r/depression members, we compared r/depression against 3 other subreddits (RQ2). We first selected r/happy, a subreddit focused on sharing happy thoughts and stories. r/happy was created to focus on happy thoughts and, to our knowledge, is the largest and most active positive emotion–focused subreddit that has been active for 8 years with 94,076 subscribers [27] in September 2016. Members who are participating in the positive emotion–focused subreddit should encounter less negative emotion than the r/depression members. Thus, such a community can provide an insight into the impact of prolonged participation in an online health community without much exposure to negative emotions.

Much research on online health communities focused on two types of social support emotional and informational support [24,28-30]. Both support types are important aspects of online health communities. For instance, symptom management is an important aspect for patients with diabetes. Thus, sharing informational support, such as practical information to control one’s diabetic symptoms [29] is a common practice in online diabetes communities. Also in online diabetes communities, sharing emotional support such as motivation and accountability [31] is a common practice to help members continuously manage their illnesses. To better understand the changes of emotional states in r/depression, we compared against r/diabetes, a community with abundant informational and emotional support. To our knowledge, r/diabetes is the largest and most active diabetes community on Reddit. The subreddit has been active for 8 years with 15,623 subscribers in September 2016 [32].

To better understand the effects of prolonged participation in online depression communities, we wanted to compare the depression community against online health communities that are not directly related to mental disorders in order to ensure that the changes of emotional states do not result from the secondary symptoms (ie, mental illnesses) that are associated with the primary illness. However, a confounding factor is that serious illnesses are frequently accompanied with mental illnesses, due to the distress of living with—or being diagnosed with—a serious condition. Although it may be impossible to select an online health community without any exchange of emotional support, one study investigating an online Irritable Bowel Syndrome (IBS) community characterized informational support including symptom interpretation and illness management as the main type of social support exchanged in the community [33]. Thus, we selected r/ibs, a subreddit focused on IBS, as a community focused on a chronic but non–life-threatening health issue that is not often related to mental disorders. In September 2016, r/ibs has been active for 6 years with 5251 subscribers [34] and is the largest and most active IBS community in Reddit, to our knowledge.

Data

First, we use a dataset [35] released by a Reddit member who collected the data from October 2007 to May 2015. The dataset has been used in a previous study [22]. Second, we extracted posts and associated comments from r/depression, r/happy, r/diabetes, and r/ibs. Third, we removed deleted posts or comments that are labeled as “[deleted]” from the dataset. The contents of posts or comments were deleted by members or the communities, thus we excluded them in our analyses. Fourth, to meet our study aims, we restricted our analysis to members (ie, unique user IDs) with 4 or more submissions (ie, posts and comments). In different studies [28,36], this threshold was used to determine lurkers, who are not yet a regularly contributing member of a community, thus we used the same threshold to determine regular members. Table 1 shows the overall characteristics of datasets that we examined.

Table 1. Characteristics of the analyzed datasets from r/depression, r/happy, r/diabetes, and r/ibs
Communityr/depressionr/happyr/diabetesr/ibs
Date of dataDec 2008-May 2015Jan 2008-May 2015Jan 2009-May 2015Aug 2012-May 2015
No. of Submissions571,86495,329167,91715,978
No. of Members36,74681304539923

The work described in this paper was granted an exemption from review by the University of Utah’s Institutional Review Board (IRB), ethics committee, [IRB 00076188] under Exemption 2 as defined in United States Federal Regulations 45 CFR 46.101(b).

Linguistic Analysis

To understand the change of emotional states, we examined emotion-related language using the Linguistic Inquiry and Word Count (LIWC) program [37]. LIWC analyzes text on a word-by-word basis, counts the predefined linguistic dimension words and then returns each linguistic dimension score as a proportion of the total number of words under analysis. The validity of LIWC’s performance has been established in a large number of studies, including studies measuring perceived positive and negative emotions in online mental health communities [18], online health communities [30], text-based online communication [15,16], and social media [17,38].

LIWC consists of 4 major dimensions: “psychological processes,” “linguistic processes,” “personal concerns,” and “spoken categories.” The most notable linguistic dimension related to emotion is “psychological processes,” which includes “positive emotion” and “negative emotion” as subdimensions that are hosted in “affective processes.” “Negative emotion” has 3 subdimensions: “anxiety,” “anger,” and “sadness.” Both “positive emotion” and “negative emotion” dimensions have been used in a series of studies to measure perceived positive and negative emotion in text-based communication [15-18,30,38,39], which Reddit uses to facilitate communication.

From “linguistic processes,” “negation,” and “swear words” have been used to predict depression in social media [38,39]. “Negation” has also been used to measure agreeableness in text-based online communication [15,16]. One of the most widely used dimensions for predicting depression is the “first person singular” [15,16,18,30,38,39]. The frequent use of “first person singular” indicates a heightened sense of self and is known to correlate with depression [40].

The subdimension, “death” from “personal concerns” has been used to measure suicidal thoughts and discussion in Reddit [18] and “assent” from “spoken categories” was used to measure agreeableness in text-based online communication [15,16] and to predict depression [39]. We made these linguistic dimension selections before the linguistic analysis without any information on the linguistic dimension frequency counts. LIWC was applied to individual posts or comments for each subreddit. A summary of the linguistic dimensions and dimensions’ example words for the current study is shown in Table 2.

Table 2. Summary of Linguistic Inquiry and Word Count (LIWC) linguistic dimensions used for this study and example vocabulary.
Linguistic dimensions
Example vocabulary
Psychological processes


Positive emotionhappy, love, sweet

Negative emotionagony, hurt, nasty

Anxietyafraid, fearful, nervous

Angerabuse, annoyed, hate

Sadnesscrying, grief, sad
Linguistic processes


First person singularI, me, mine

Negationno, not, never

Swear wordsass, damn, fuck
Personal concerns


Deathdeath, die, kill
Spoken categories


Assentagree, okay, yes

RQ1: Change of Emotional States Analysis and RQ2: Community Comparison Analysis

To measure the change of emotional states of r/depression members (RQ1), we first calculated the scores of prespecified LIWC dimensions. Then, we organized each LIWC score according to the posting time per-member basis for the community. To measure the changes of LIWC scores, we applied linear least-squares regression to the LIWC scores against the interaction sequence (ie, determined by the time of posting) for each member. We elected to perform linear the least-squares regression against the interaction sequence rather than time, because we were interested in the change caused by each interaction rather than time. For each dimension, we calculated mean, median, and 95% confidence interval (CI) of the community to reflect the overall linguistic changes in r/depression members. Then, we converted linguistic change to indicate improvement. Decrease of negative dimensions (ie, “negative emotion,” “anxiety,” “anger,” “sadness,” “first person singular,” “negation,” “swear words,” “death”) indicates improvement, although decrease of positive dimensions (ie, “positive emotion,” “assent”) indicates worsening. To be consistent with figures (Figures 1 and 2), we show improvement for each dimension. Then, we present an overview of improvement for each linguistic dimension along with the statistical significance against the null hypothesis (ie, prolonged participation in an online depression community has no effect on the emotional states of members).

To compare linguistic changes in communities (RQ2), we first carried out the procedure for measuring the change of emotional states for the other three communities. Then, we applied Mann-Whitney U tests [41] with the prespecified false discovery rate procedure [42] to P values. The comparison of 10 linguistic dimensions raises the need to control for Type I errors (ie, false positives); thus, we applied the false discovery rate procedure. Given the large sample of members, we followed suggestions of a previous study [43] and reported 95% CI and effect sizes (r) for the community comparisons using rank-biserial correlation [44]. We applied nonparametric tests because our data were not normally distributed.

Figure 1. An overview of improvement of each linguistic dimension for r/depression members.
View this figure
Figure 2. An overview of difference in improvement compared with r/depression.
View this figure

RQ 1. How Does Prolonged Participation in an Online Depression Community Affect the Emotional States of its Members?

We captured the average changes of emotional states that are manifest in linguistic changes for r/depression members. Table 3 summarizes the linguistic changes in r/depression members. On average, r/depression members showed improvement in all linguistic dimensions with the exception of “assent.”

Table 3. Linguistic changes in r/depression members. Variables are reported as the mean, median, and 95% CIs of word usage change of each dimension for all members.
Linguistic dimensions
r/depression mean, median (CI)
n=36,746
Psychological processes


Positive emotion0.01, 0.01 (0.004 to 0.01)

Negative emotion-0.04, -0.03 (-0.05 to -0.04)

Anxiety-0.18, -0.11 (-0.20 to -0.15)

Anger-0.10, -0.06 (-0.12 to -0.09)

Sadness-0.16, -0.13 (-0.17 to -0.14)
Linguistic processes


First person singular-0.03, -0.03 (-0.04 to -0.03)

Negation-0.06, -0.05 (-0.07 to -0.05)

Swear words-0.16, 0.09 (-0.20 to -0.12)
Personal concerns


Death-0.41, -0.21 (-0.47 to -0.35)
Spoken categories


Assent-0.02, 0.05 (-0.05 to 0.02)

Then, we converted linguistic changes to indicate improvement and showed an overview of improvement of each linguistic dimension (Figure 1). On the x-axis, less than 0 indicates worsening (ie, decreasing use of positive words or increasing use of negative words), whereas more than 0 indicates an improvement (ie, increasing use of positive words or decreasing use of negative words). The dot indicates the mean of improvement values and the range shows 95% CI. If the range includes 0.0 (eg, “assent”), it suggests that the null hypothesis of prolonged participation in an online depression community has no effect on the emotional states of members cannot be rejected. Our analyses suggested that with the exception of “assent” dimension, members of r/depression showed significant improvement in linguistic changes. No significant change was found in the “assent” dimension.

RQ 2. How Does This Change in Emotional States Compare With Other Online Communities?

To compare the linguistic changes evident in r/depression to r/happy, r/diabetes, and r/ibs, we first calculated the mean, median, and CIs of word usage change for each linguistic dimension for each subreddit (Table 4).

Table 4. Linguistic changes in members from r/happy, r/ibs, and r/diabetes. Variables are reported as the mean, median, and 95% CIs of word usage change of each dimension for all members.
Linguistic dimensions
r/happy
mean, median (CI)
n=8130
r/diabetes
mean, median (CI)
n=4539
r/ibs
mean, median (CI)
n=923
Psychological processes




Positive emotion0.005, 0.004 (0.0005-0.01)0.01, 0.01 (-0.01 to 0.02)0.02, 0.02 (-0.005 to 0.05)

Negative emotion-0.01, -0.01 (-0.03 to 0.001)-0.02, -0.02 (-0.04 to 0.003)-0.005, -0.02 (-0.05 to 0.03)

Anxiety0.002, -0.003 (-0.06 to 0.06)-0.26, -0.15 (-0.35 to -0.17)-0.18, -0.08 (-0.31 to -0.04)

Anger-0.02, -0.02 (-0.06 to 0.02)-0.11, -0.04 (-0.18 to -0.03)-0.13, -0.06 (-0.27 to 0.01)

Sadness0.01, -0.003 (-0.04 to 0.07)-0.18, -0.10 (-0.25 to -0.10)-0.15, -0.11 (-0.33 to 0.04)
Linguistic processes




First person singular-0.01, -0.01 (-0.02 to -0.01)-0.09, -0.04 (-0.11 to -0.07)-0.03, -0.03 (-0.06 to -0.008)

Negation-0.02, -0.01 (-0.04 to 0.01)0.02, 0.002 (-0.02 to 0.05)-0.008, -0.01 (-0.08 to 0.06)

Swear words-0.03, -0.02 (-0.09 to 0.03)-0.01, 0.005 (-0.20 to 0.18)-0.06, -0.01 (0.44 to 0.33)
Personal concerns




Death-0.52, -0.17 (-0.69 to -0.36)-0.93, -0.22 (-1.35 to -0.51)-0.98, -0.56 (-1.73 to -0.23)
Spoken categories




Assent0.02, 0.01 (-0.01 to 0.05)-0.03, 0.05 (-0.10 to 0.05)0.04, 0.02 (-0.09 to 0.18)

On average, members of these 3 subreddits showed significant improvement in linguistic changes with increasing number of interactions. Four exceptions are “anxiety” and “sadness” from r/happy and “negation” and “assent” from r/diabetes. Then, we compared the 3 subreddits to r/depression (Table 5).

Table 5. Community comparison of average linguistic changes against r/depression.
Linguistic dimensions
versus r/happy

versus r/diabetesversus r/ibs
Psychological processes




Positive emotionr/depression
Ua=151428562
P=3.9e-05
Padjb=9.8e-05
CIc=0.005-0.015
r=0.04
r/depression
Ua =83095368.5
P=.63
Padjb =.70
CIc =-0.01 to 0.01
r=0.09
r/ibs
Ua =16588110
P=.43
Padjb =.70
CIc =-0.03 to 0.01
r=0.02

Negative emotionr/depression
Ua =118117817
P=2.7e-05
Padjb =8.9 e-05
CIc =-0.03 to -0.01
r=0.21
r/depression
Ua =77894854
P=.16
Padjb =.31
CIc =-0.03 to 0.004
r=0.14
r/depression
Ua =16023844.5
P=.37
Padjb =.70
CIc =-0.05 to 0.02
r=0.05

Anxietyr/depression
Ua =42155753
P=3.1e-04
Padjb =6.2e-04
CIc =-0.15 to -0.05
r=0.72
r/depression
Ua = 49336254
P=.44
Padjb =.55
CIc =-0.04 to 0.09
r=0.47
r/depression
Ua =11317589.5
P=.62
Padjb =.78
CIc =-0.13 to 0.08
r=0.33

Angerr/depression
Ua =72410492.5
P=.02
Padjb =.03
CIc =-0.06 to -0.004
r=0.52
r/depression
Ua =54287645.5
P=.30
Padjb =.50
CIc =-0.07 to 0.02
r=0.40
r/depression
Ua =11059329.5
P=.82
Padjb =.82
CIc =-0.11 to 0.08
r=0.35

Sadnessr/depression
Ua =54288775
P=1.8e-11
Padjb =1.8e-10
CIc =-0.16 to -0.09
r=0.64
r/depression
Ua =59564322.5
P=.07
Padjb =.17
CIc =-0.08 to 0.002
r=0.34
r/depression
Ua =11862037.5
P=.45
Padjb =.70
CIc =-0.13 to 0.06
r=0.3
Linguistic processes




First person singularr/depression
Ua =137694516.5
P=5.6e-07
Padjb =2.8e-06
CIc =-0.02 to -0.009
r=0.08
r/diabetesd
Ua =84591685.5
P=.01
Padjb =.03
CIc =0.004 to 0.02
r=0.11
r/ibs
Ua =16945136
P=.72
Padjb =.80
CIc =-0.02 to 0.02
r=0.01

Negationr/depression
Ua =96382143.5
P=.003
Padjb =4.7e-03
CIc =-0.05 to 0.01
r=0.35
r/depressione
Ua =71632553
P=.0002
Padjb =.002
CIc =-0.08 to -0.02
r=0.21
r/depression
Ua =14855011
P=.14
Padjb =.70
CIc =-0.10 to 0.01
r=0.12

Swear wordsr/depression
Ua =40086953
P=.08
Padjb =.09
CIc =-0.08 to 0.003
r=0.73
r/depression
Ua =30918210.5
P=.02
Padjb =.07
CIc =-0.17 to -0.01
r=0.66
r/ibs
Ua =5505668
P=.40
Padjb =.70
CIc =-0.25 to 0.10
r=0.67
Personal concerns




Deathr/happy
Ua =14379783
P=.95
Padjb =.95
CIc =-0.11 to 0.11
r=0.9
r/depression
Ua =15714429
P=.40
Padjb =.55
CIc =-0.28 to 0.11
r=0.83
r/ibs
Ua =2476626
P=.37
Padjb =.70
CIc =-0.23 to 0.64
r=0.86
Spoken categories




Assentr/depression
Ua =70286713.5
P=.001
Padjb =.002
CIc =0.01-0.05
r=0.56
r/depression
Ua =45944192
P=.89
Padjb =.89
CIc = -0.04 to 0.03
r=0.49
r/depression
Ua =8873209.5
P=.49
Padjb =.70
CIc =-0.05 to 0.12
r=0.49

aU: Mann-Whitney U test values.

bPadj: adjusted P values.

c95% CI values.

Subreddit comparisons indicate that members of r/depression improved either significantly or at least as much as (ie, no significant difference) members of other online health communities. In “positive emotion,” “negative emotion,” “anxiety,” “anger,” “sadness,” “first person singular,” “negation,” and “assent,” r/depression showed significantly more improvement than r/happy. “Swear words” and “death” showed no significant difference.

The difference in improvements between r/depression and r/diabetes were not significant, with 2 exceptions. The subreddit r/diabetes improved significantly as compared with r/depression in “first person singular,” although r/depression improved significantly as compared with r/diabetes in “negation.” There were no significant differences between r/ depression and r/ibs for all dimensions.

At this point, we converted linguistic changes to indicate improvement compared with r/depression and showed an overview of the differences with respect to each linguistic dimension (Figure 2). The dot indicates the sample estimate and the range shows 95% CI. If the range includes 0.0, it indicates that the difference between the means is not significant.


Principal Findings on Change of Emotional States for Members of an Online Depression Community

This study has been deeply informed by recent literature focusing on the psychosocial benefits of online health communities [1-7]. We have also utilized emotional contagion theory [12] and empirical studies on emotional contagion theory [15-17,19,20] to develop this study. We address gaps in the previous literature by shedding light on the question of whether members of an online depression community experience positive or negative emotional changes after prolonged participation.

The linguistic changes have been shown to occur when emotional states are shifting [38]. In the online depression community we analyzed, we found that members of the depression community generally showed an increased use of positive words and decreased use of negative words as the number of interactions increased. Despite prolonged interactions with other depressed individuals, r/depression members’ emotional states were found to have become more positive. For r/depression members, on average all linguistic dimensions were improving with the exception of “assent.” The change in usage of “assent” was found to be insignificant; and as “assent” belongs to “spoken categories,” we believe many vocabularies in “assent” were used in a communicative sense, such as answering questions, a typical type of communication in online health communities [45].

Consistent with previous studies [1-7], we found a similar trend—more use of positive emotion words and less use of negative emotion words—in 3 other Reddit communities. These Reddit communities are different in size (ie, number of members), length of active years, and level of activity, while focusing on different types of health issues. Moreover, we chose these different communities on the basis of the literature that suggested a different composition of social support in these communities. Yet, in all 4 communities, we found members generally used more positive emotion words and less negative emotion words as the number of interactions increased.

The comparison of communities indicates that members of r/depression improved either significantly or at least as much as (ie, no significant difference) members of other online health communities. One unusual finding was the lack of improvement in r/happy, especially when compared with other online health communities. We believe because r/happy members were encouraged to share only positive thoughts and stories from the start, they had limited capacity for positive linguistic changes. However, further investigation is needed for a better understanding of this phenomenon.

Practical Implications for Online Community Use, Research, and Design

Many online communities utilize the efforts of moderators and even community members to monitor their communities. However, detecting changes of emotional states in members can be difficult with such manual efforts, given that the knowledge of members’ (1) previous emotional states and (2) style of writing are required, as well as (3) linguistic changes can be subtle. An adaptation of our linguistic analysis method can be a basis for detecting subtle linguistic changes in individuals in massive scale networks.

Moderators of online health communities have mentally demanding tasks [46]. An automated detection of linguistic changes could reduce the mental burden of moderators and improve the quality of moderation. For example, such a system could alert moderators and members of any undesirable linguistic changes and allow them to provide timely support. Furthermore, a similar system can assist individual members and the whole community. For individual members, a similar system could be used to raise self-awareness of their changes of linguistic or emotional state. As for communities, a similar system could alert when a community-wide emotional shift occurs. Alerting of a negative shift can allow timely intervention for the whole community, whereas alerting of a positive shift provides opportunities for researchers and community managers to study the positive change. As studies on emotional contagion theory [15-17,19,20] demonstrated, mental health communities carry a potential for a community-wide spread of positive and negative emotions.

User Privacy

Although Reddit data (like Twitter) are public, and research using public social media is typically granted exemption from review by IRBs in the US context, ethical considerations—particularly with respect to privacy—remain [47-49]. In this paper, in order to protect user anonymity, we did not present user identifiable information (eg, direct quotations, usernames).

Limitations and Future Directions

One of the limitations in our study is that we had a dataset with 4 communities within the Reddit platform. Although Reddit is a widely used platform, it is more frequently used by young males [50,51]. Therefore, results may not be generalizable to other communities. Also, selection bias of our subjects exists. Members who choose to participate in r/depression are not necessarily representative of the depression population. Similarly, we do not have any evidence that members of r/depression are clinically diagnosed with depression. Thus, we cannot conclude that participating in an online depression community is beneficial for all depression patients. However, r/depression is worth studying given the size of r/depression, the prevalence of depressive disorder and the increasing popularity of online mental health communities.

Although beyond the scope of this study, an investigation with respect to changes of emotion in users of different systems such as Internet-based interventions for depression [52] or text-based crisis counselor system [53] is important for individuals suffering from depression. It is unlikely that any of these avenues could reach the entire population of depressed individuals [9]; thus, understanding the extent to which emotional change occurs within demographic segment in each of these different avenues is important.

Another limitation is the performance of LIWC in measuring emotional states. Although LIWC has been validated and used in many studies [15-18,30,38-40] including a study using Reddit data [18], it is important to note LIWC’s limitations, such as missing potentially important misspellings and slang, which are common in online communities [54]. The findings of this study should be interpreted with caution, especially when leveraging the findings in the clinical context. Furthermore, participation in an online depression community is not the sole factor for improvement or worsening of depressive symptoms. It is also unclear why members stay or leave in this particular online depression community and how their engagement affects the results. Although a previous study suggested that emotional support (ie, positive emotion) helps members stay with an online health community [30], we cannot conclude that participating in online health communities directly caused an improvement in emotional states of members without directly asking the participants. However, this is a general limitation of any secondary analyses. Still, the consistent statistical results could indicate that depressive symptoms of members of an online depression community were improving. Future work could bolster the current study’s findings by using mixed methods such as surveys and interviews and ask the members of online depression communities about the changes in their depressive symptoms and measure depression using standardized methods like the patient health questionnaire or primary care evaluation of mental disorders [55].

We considered word usages as independent observations; nevertheless, correlations could exist in the word usages among members depending on the topic of conversation. For instance, in a conversation about death, members who are in the conversation are more likely to use words related to death regardless of their emotional states. Conversational topics in online communities, however, are known to change [56]. Thus, examining the effects of emotional states in changing topics and their associated sentiments could support our findings. Moreover, individuals could participate in multiple subreddits and use multiple user IDs, both of which violate our independent variable assumption. We also acknowledge that our large sample size could have inflated the statistical significance levels. Thus, we reported 95% CIs and effect sizes as well as compared with other online health communities to aid the interpretation of results.

Finally, our dataset did not capture passive activities like lurking. Although it has been suggested that lurkers experience fewer benefits than active participants [3], it is unclear if lurkers have less emotional changes and how their changes of emotion can affect the community when they become active participants. Understanding emotional changes in lurkers could deepen our understanding in future work.

Conclusions

We provide new insights into the impact of prolonged participation in an online depression community and highlight the positive emotion change in members. Our findings suggest that 3 other Reddit communities focusing on positive emotion, diabetes, and IBS also have a similar positive emotion change in members. The comparison of communities indicates that the emotion-related language usage of depression community members improved either significantly or at least as much as (ie, no significant difference) members of other online communities. On the basis of these findings, we contribute practical suggestions for designing online depression communities to enhance psychosocial gains for members. We consider these results to be an important step toward developing a better understanding of the impact of prolonged participation in an online depression community on emotional health, in addition to providing insights into the long-term psychosocial well-being of members.

Acknowledgments

We restricted our analysis to publicly available discussion content. The study was exempted from review by the University of Utah’s Institutional Review Board (Ethics Committee) [IRB 00076188]. AP’s contribution to this research was supported by the National Library of Medicine of the National Institutes of Health under training grant T15 LM007124. MC’s contribution to this research was supported by the National Library of Medicine of the National Institutes of Health under award numbers R00LM011393 & K99LM011393. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The authors thank Mr Gregory Stoddard at the University of Utah for his advice and comments on statistical methods during the preparation of this article.

Conflicts of Interest

None declared.

  1. Fox S, Duggan M. Pew Research Center. 2013 Jan 15. Health online 2013: 35% of U.S. adults have gone online to figure out a medical condition   URL: http://www.pewinternet.org/2013/01/15/health-online-2013/; [WebCite Cache]
  2. van Uden-Kraan CF, Drossaert CH, Taal E, Seydel ER, van de Laar MA. Participation in online patient support groups endorses patients' empowerment. Patient Educ Couns 2009 Jan;74(1):61-69. [CrossRef] [Medline]
  3. Setoyama Y, Yamazaki Y, Namayama K. Benefits of peer support in online Japanese breast cancer communities: differences between lurkers and posters. J Med Internet Res 2011 Dec 29;13(4):e122 [FREE Full text] [CrossRef] [Medline]
  4. Høybye MT, Dalton S, Deltour I, Bidstrup P, Frederiksen K, Johansen C. Effect of Internet peer-support groups on psychosocial adjustment to cancer: a randomised study. Br J Cancer 2010 Apr 27;102(9):1348-1354 [FREE Full text] [CrossRef] [Medline]
  5. Shaw B, Hawkins R, McTavish F. Effects of insightful disclosure within computer mediated support groups on women with breast cancer. Health Commun Internet 2006;19(2):133-142 [FREE Full text] [CrossRef]
  6. Mo PK, Coulson NS. Developing a model for online support group use, empowering processes and psychosocial outcomes for individuals living with HIV/AIDS. Psychol Health 2012;27(4):445-459. [CrossRef] [Medline]
  7. Wentzer HS, Bygholm A. Narratives of empowerment and compliance: studies of communication in online patient support groups. Int J Med Inform 2013 Dec;82(12):e386-e394. [CrossRef] [Medline]
  8. Conway M, O'Connor D. Social media, big data, and mental health: current advances and ethical implications. Curr Opin Psychol 2016 Jun;9:77-82. [CrossRef] [Medline]
  9. Corrigan P. How stigma interferes with mental health care. Am Psychol Internet 2004;59(7):614-625 [FREE Full text] [CrossRef]
  10. De Choudhury M, De S. Mental health discourse on Reddit: self-disclosure, social support, and anonymity. 2014 Jun 01 Presented at: Proceedings of ICWSM, AAAI; 2014; Ann Arbor, Michigan, USA p. 71-80.
  11. Barney LJ, Griffiths KM, Banfield MA. Explicit and implicit information needs of people with depression: a qualitative investigation of problems reported on an online depression support forum. BMC Psychiatry 2011 May 18;11:88 [FREE Full text] [CrossRef] [Medline]
  12. Hatfield E, Cacioppo J, Rapson R. Emotional Contagion. Curr Dir Psychol Sci 1993 Jun;2(3):96-100 [FREE Full text] [CrossRef]
  13. Pennebaker JW, Mehl MR, Niederhoffer KG. Psychological aspects of natural language. use: our words, our selves. Annu Rev Psychol 2003;54:547-577. [CrossRef] [Medline]
  14. De Choudhury M, Kiciman E, Dredze M, Coppersmith G, Kumar M. Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media. New York, New York, USA: ACM Press; 2016 May 07 Presented at: 2016 CHI Conference on Human Factors in Computing Systems; May 07 - 12, 2016; Santa Clara, California, USA p. 2098-2110   URL: http://dl.acm.org/citation.cfm?id=2858207 [CrossRef]
  15. Hancock J, Gee K, Ciaccio K, Lin J. I'm sad you're sad: emotional contagion in CMC. New York, New York, USA: ACM Press; 2008 Nov 08 Presented at: 2008 ACM conference on Computer supported cooperative work; November 08 - 12, 2008; San Diego, CA, USA p. 295-298   URL: http://dl.acm.org/citation.cfm?id=1460611 [CrossRef]
  16. Hancock J, Landrigan C, Silver C. Expressing emotion in text-based communication. New York, New York, USA: ACM Press; 2007 Apr 28 Presented at: SIGCHI Conference on Human Factors in Computing Systems; April 28 - May 03, 2007; San Jose, California, USA p. 929-932   URL: http://portal.acm.org/citation.cfm?doid=1240624.1240764 [CrossRef]
  17. Kramer AD, Guillory JE, Hancock JT. Experimental evidence of massive-scale emotional contagion through social networks. Proc Natl Acad Sci U S A 2014 Jun 17;111(24):8788-8790 [FREE Full text] [CrossRef] [Medline]
  18. Kumar M, Dredze M, Coppersmith G, De Choudhury M. Detecting Changes in Suicide Content Manifested in Social Media Following Celebrity Suicides. New York, NY, USA: ACM Press; 2015 Sep 01 Presented at: 26th ACM Conference on Hypertext & Social Media; September 01 - 04, 2015; Guzelyurt, Northern Cyprus p. 85-94   URL: http://dl.acm.org/citation.cfm?id=2791026&CFID=905650047&CFTOKEN=12908080 [CrossRef]
  19. Hill AL, Rand DG, Nowak MA, Christakis NA. Emotions as infectious diseases in a large social network: the SISa model. Proc Biol Sci 2010 Dec 22;277(1701):3827-3835 [FREE Full text] [CrossRef] [Medline]
  20. Rosenquist JN, Fowler JH, Christakis NA. Social network determinants of depression. Mol Psychiatry 2011 Mar;16(3):273-281 [FREE Full text] [CrossRef] [Medline]
  21. Bastiampillai T, Allison S, Chan S. Is depression contagious? The importance of social networks and the implications of contagion theory. Aust N Z J Psychiatry 2013 Apr;47(4):299-303. [CrossRef] [Medline]
  22. Gkotsis G, Oellrich A, Hubbard T, Dobson R, Liakata M, Velupillai S. The language of mental health problems in social media. : Association for Computational Linguistics; 2016 Presented at: Proceedings of the 3rd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality; June 16, 2016; San Diego, California p. 63-73   URL: https:/​/www.​researchgate.net/​profile/​George_Gkotsis/​publication/​303365632_The_language_of_mental_health_problems_in_social_media/​links/​573e2d0408ae9f741b3009ff.​pdf
  23. Griffiths KM, Calear AL, Banfield M. Systematic review on Internet Support Groups (ISGs) and depression (1): do ISGs reduce depressive symptoms? J Med Internet Res 2009 Sep 30;11(3):e40 [FREE Full text] [CrossRef] [Medline]
  24. Hwang KO, Ottenbacher AJ, Green AP, Cannon-Diehl MR, Richardson O, Bernstam EV, et al. Social support in an Internet weight loss community. Int J Med Inform 2010 Jan;79(1):5-13 [FREE Full text] [CrossRef] [Medline]
  25. Reddit. 2015. Reddit in 2015 Internet   URL: http://www.redditblog.com/2015/12/reddit-in-2015.html [accessed 2016-01-12] [WebCite Cache]
  26. Reddit. 2016. r/depression in Sept 2016   URL: https://www.reddit.com/r/depression/search?q=depression [accessed 2016-09-21] [WebCite Cache]
  27. Reddit. 2016. r/happy in Sept 2016   URL: https://www.reddit.com/search?q=happy [accessed 2016-09-25] [WebCite Cache]
  28. Park A, Hartzler AL, Huh J, McDonald DW, Pratt W. Homophily of vocabulary usage: beneficial effects of vocabulary similarity on online health communities participation. AMIA Annu Symp Proc 2015;2015:1024-1033 [FREE Full text] [Medline]
  29. Huh J, Ackerman M. Collaborative help in chronic disease management: supporting individualized problems. New York, NY, USA: ACM Press; 2012 Feb 11 Presented at: ACM 2012 conference on Computer Supported Cooperative Work; February 11 - 15, 2012; Seattle, Washington, USA p. 853-862   URL: http://dl.acm.org/citation.cfm?id=2145331 [CrossRef]
  30. Wang Y, Kraut R, Levine J. To stay or leave? The relationship of emotional and informational support to commitment in online health support groups. New York, NY, USA: ACM Press; 2012 Feb 11 Presented at: ACM 2012 conference on Computer Supported Cooperative Work; February 11 - 15, 2012; Seattle, Washington, USA p. 833-842   URL: http://dl.acm.org/citation.cfm?id=2145329&CFID=905650047&CFTOKEN=12908080 [CrossRef]
  31. Newman M, Lauterbach D, Munson S, Resnick P, Morris M. It's not that i don't have problems, i'm just not putting them on facebook: challenges and opportunities in using online social networks for health. New York, New York, USA: ACM Press; 2011 Mar 19 Presented at: ACM 2011 conference on Computer supported cooperative work; March 19 - 23, 2011; Hangzhou, China p. 341-350   URL: http://dl.acm.org/citation.cfm?id=1958824.1958876 [CrossRef]
  32. Reddit. 2016. r/diabetes in Sept 2016   URL: https://www.reddit.com/search?q=diabetes [accessed 2016-09-30] [WebCite Cache]
  33. Coulson NS. Receiving social support online: an analysis of a computer-mediated support group for individuals living with irritable bowel syndrome. Cyberpsychol Behav 2005 Dec;8(6):580-584. [CrossRef] [Medline]
  34. Reddit. 2016. r/ibs in Sept 2016   URL: https://www.reddit.com/r/ibs/search?q=ibs [accessed 2016-09-30] [WebCite Cache]
  35. Reddit_Member. Reddit. I have every publicly available Reddit comment for research   URL: https://www.reddit.com/r/datasets/comments/3bxlg7/i_have_every_publicly_available_reddit_comment/; [WebCite Cache]
  36. Nonnecke B, Preece J. Lurker demographics: counting the silent. New York, New York, USA: ACM Press; 2000 Presented at: SIGCHI conference on Human Factors in Computing Systems; April 01 - 06, 2000; The Hague, The Netherlands p. 73-80   URL: http://dl.acm.org/citation.cfm?id=332409 [CrossRef]
  37. Pennebaker J, Booth J, Francis ME. LIWC. Austin, Texas; 2007. LIWC2007: Linguistic inquiry and word count [software program for text analysis]   URL: http://liwc.wpengine.com/ [accessed 2017-02-27] [WebCite Cache]
  38. De Choudhury M, Gamon M, Counts S, Horvitz E. Predicting depression via social media. 2013 Jul 08 Presented at: 7th International AAAI Conference on Weblogs and Social Media (ICWSM); July 8–11, 2013; Boston, USA   URL: http://www.aaai.org/ocs/index.php/ICWSM/ICWSM13/paper/viewFile/6124/6351
  39. De Choudhury M, Counts S, Horvitz E, Hoff A. Characterizing and predicting postpartum depression from shared facebook data. New York, New York, USA: ACM Press; 2014 Feb 15 Presented at: 17th ACM conference on Computer supported cooperative work & social computing - CSCW '14; February 15 - 19, 2014; Baltimore, Maryland, USA p. 626-638   URL: http://dl.acm.org/citation.cfm?doid=2531602.2531675 [CrossRef]
  40. Rude S, Gortner E, Pennebaker J. Language use of depressed and depression-vulnerable college students. Cogn Emot 2004 Dec;18(8):1121-1133 [FREE Full text] [CrossRef]
  41. Hollander M, Wolfe D, Chicken E. Nonparametric statistical methods 3rd Edition. Hoboken, New Jersey, USA: Wiley; 173.
  42. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B Stat Methodol 1995;57(1):289-300 [FREE Full text] [CrossRef]
  43. Lin M, Lucas H, Shmueli G. Too big to fail: large samples and the p-value problem. Inf Syst Res 2013 Dec;24(4):906-917. [CrossRef]
  44. Cureton E. Rank-biserial correlation. Psychometrika 1956 Sep;21(3):287-290 [FREE Full text] [CrossRef]
  45. Arguello J, Butler B, Joyce E, Kraut R, Ling K, Rose C, et al. Talk to me: foundations for successful individual-group interactions in online communities. New York, NY, USA: ACM Press; 2006 Presented at: SIGCHI conference on Human Factors in computing systems - CHI '06; April 22 - 27, 2006; Montréal, Québec, Canada p. 959-968   URL: http://dl.acm.org/citation.cfm?id=1124916 [CrossRef]
  46. Calvo R, Hussain M, Milne D, Nordbo K, Hickie I, Danckwerts P. Augmenting Online Mental Health Support Services. In: Integrating Technology in Positive Psychology Practice. Hershey, Pennsylvania, USA: IGI Global; 2016:82-103.
  47. Conway M. Ethical issues in using Twitter for public health surveillance and research: developing a taxonomy of ethical concepts from the research literature. J Med Internet Res 2014 Dec 22;16(12):e290 [FREE Full text] [CrossRef]
  48. Vayena E, Salathé M, Madoff LC, Brownstein JS. Ethical challenges of big data in public health. PLoS Comput Biol 2015 Feb;11(2):e1003904 [FREE Full text] [CrossRef] [Medline]
  49. Mikal J, Hurst S, Conway M. Ethical issues in using Twitter for population-level depression monitoring: a qualitative study. BMC Med Ethics 2016 Apr 14;17:22 [FREE Full text] [CrossRef] [Medline]
  50. Duggan M, Smith A. Pew Research Center. 2013. 6% of online adults are Reddit users   URL: http://www.pewinternet.org/2013/07/03/6-of-online-adults-are-reddit-users/ [accessed 2016-02-11] [WebCite Cache]
  51. Bogers T, Wernersen R. How `social' are social news sites? exploring the motivations for using Reddit.com. : iSchools; 2014 Mar 01 Presented at: iConference 2014 Proceedings; 2014; Berlin, Germany p. 329-344   URL: http://hdl.handle.net/2142/47295 [CrossRef]
  52. Griffiths K, Farrer L, Christensen H. The efficacy of internet interventions for depression and anxiety disorders: a review of randomised controlled trials. Med J Aust 2010 Jun 07;192(11 Suppl):S4-11. [Medline]
  53. Dinakar K, Chen J, Lieberman H, Picard R, Filbin R. Mixed-Initiative Real-Time Topic Modeling & Visualization for Crisis Counseling. New York, New York, USA: ACM Press; 2015 Presented at: 20th International Conference on Intelligent User Interfaces - IUI '15; March 29 - April 01, 2015; Atlanta, Georgia, USA p. 417-426   URL: http://dl.acm.org/citation.cfm?id=2678025.2701395 [CrossRef]
  54. Park A, Hartzler AL, Huh J, McDonald DW, Pratt W. Automatically detecting failures in natural language processing tools for online community text. J Med Internet Res 2015 Aug 31;17(8):e212 [FREE Full text] [CrossRef] [Medline]
  55. Spitzer RL, Kroenke K, Williams JB. Validation and utility of a self-report version of PRIME-MD: the PHQ primary care study. Primary Care Evaluation of Mental Disorders. Patient Health Questionnaire. J Am Med Assoc 1999 Nov 10;282(18):1737-1744. [Medline]
  56. Park A, Hartzler AL, Huh J, Hsieh G, McDonald DW, Pratt W. “How did we get here?”: topic drift in online health discussions. J Med Internet Res 2016 Nov 02;18(11):e284 [FREE Full text] [CrossRef] [Medline]


IBS: irritable bowel syndrome
LIWC: Linguistic Inquiry and Word Count
CI: Confidence Interval


Edited by R Calvo; submitted 15.10.16; peer-reviewed by D Milne, S Hussain, A Rosen; comments to author 22.11.16; revised version received 03.01.17; accepted 08.02.17; published 20.03.17

Copyright

©Albert Park, Mike Conway. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 20.03.2017.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.