Published on in Vol 25 (2023)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/46655, first published .
Effect of Recent Abortion Legislation on Twitter User Engagement, Sentiment, and Expressions of Trust in Clinicians and Privacy of Health Information: Content Analysis

Effect of Recent Abortion Legislation on Twitter User Engagement, Sentiment, and Expressions of Trust in Clinicians and Privacy of Health Information: Content Analysis

Effect of Recent Abortion Legislation on Twitter User Engagement, Sentiment, and Expressions of Trust in Clinicians and Privacy of Health Information: Content Analysis

Original Paper

1Department of Medicine, University of California San Francisco, San Franicsco, CA, United States

2Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, United States

3Department of Biomedical and Health Informatics, University of Pennsylvania, Philadelphia, PA, United States

4School of Public Health, University of California, Berkeley, CA, United States

5Department of Laboratory Medicine, University of California San Francisco, San Francisco, CA, United States

*all authors contributed equally

Corresponding Author:

Simone Arvisais-Anhalt, MD

Department of Laboratory Medicine

University of California San Francisco

185 Berry Street, Suite 100

San Francisco, CA, 94143-0134

United States

Phone: 1 (415) 353 1667

Email: simone.arvisais-anhalt@ucsf.edu


Background: The Supreme Court ruling in Dobbs v Jackson Women’s Health Organization (Dobbs) overrules precedents established by Roe v Wade and Planned Parenthood v Casey and allows states to individually regulate access to abortion care services. While many states have passed laws to protect access to abortion services since the ruling, the ruling has also triggered the enforcement of existing laws and the creation of new ones that ban or restrict abortion. In addition to denying patients the full spectrum of reproductive health care, one major concern in the medical community is how the ruling will undermine trust in the patient-clinician relationship by influencing perceptions of the privacy of patient health information.

Objective: This study aimed to study the effect of recent abortion legislation on Twitter user engagement, sentiment, expressions of trust in clinicians, and privacy of health information.

Methods: We scraped tweets containing keywords of interest between January 1, 2020, and October 17, 2022, to capture tweets posted before and after the leak of the Supreme Court decision. We then trained a Latent Dirichlet Allocation model to select tweets pertinent to the topic of interest and performed a sentiment analysis using Robustly Optimized Bidirectional Encoder Representations from Transformers Pre-training Approach model and a causal impact time series analysis to examine engagement and sentiment. In addition, we used a Word2Vec model to study the terms of interest against a latent trust dimension to capture how expressions of trust for our terms of interest changed over time and used term frequency, inverse-document frequency to measure the volume of tweets before and after the decision with respect to the negative and positive sentiments that map to our terms of interest.

Results: Our study revealed (1) a transient increase in the number of daily users by 576.86% (95% CI 545.34%-607.92%; P<.001), tweeting about abortion, health care, and privacy of health information postdecision leak; (2) a sustained and statistically significant decrease in the average daily sentiment on these topics by 19.81% (95% CI −22.98% to −16.59%; P=.001) postdecision leak; (3) a decrease in the association of the latent dimension of trust across most clinician-related and health information–related terms of interest; (4) an increased frequency of tweets with these clinician-related and health information–related terms and concomitant negative sentiment in the postdecision leak period.

Conclusions: The study suggests that the Dobbs ruling has consequences for health systems and reproductive health care that extend beyond denying patients access to the full spectrum of reproductive health services. The finding of a decrease in the expression of trust in clinicians and health information–related terms provides evidence to support advocacy and initiatives that proactively address concerns of trust in health systems and services.

J Med Internet Res 2023;25:e46655

doi:10.2196/46655

Keywords



In May 2022, Politico released [1] a leak of the United States Supreme Court’s draft majority opinion of Dobbs v Jackson Women’s Health Organization (Dobbs), which was later confirmed in the final decision on June 24, 2022 [2]. The decision overruled the precedents established by Roe v Wade [3] and Planned Parenthood v Casey [4], which guaranteed a pregnant person’s (referred to as “woman” from here on) right to abortion. The Dobbs decision allows states to individually regulate access to abortion care services. While many states have passed laws to protect abortion since Dobbs [5], this decision has also triggered the enforcement of existing laws and the creation of new ones that ban or restrict abortion. To date, 35 states have legally restricted abortion access [6], creating a national landscape of abortion care that is unclear, uneven across states, and constantly changing. These restrictions limit abortion care delivery and are projected to reduce access to abortion care by 40% [7]. Women who are denied access to desired abortion care have reported worse physical health [8], mental health [9], and socioeconomic outcomes [10] compared to women who are able to access abortion care.

In addition to the far-reaching implications of the Dobbs ruling on many aspects of peoples’ lives, one major concern in the medical community is how Dobbs will undermine trust in the patient-clinician relationship [11], by influencing perceptions of the privacy of patient health information. We refer to trust as the attitudes that patients have toward their relationship with clinicians. Since the Dobbs ruling, there has been increased awareness surrounding how reproductive health data in both traditional health care settings and beyond can be used against patients. For example, reports have detailed the known limitations of major federal regulation, such as the Health Insurance Portability and Accountability Act (HIPAA), in protecting patients’ reproductive health data in the context of patient care [12-15] and how the massive exchange of health care data across organizations and state lines could jeopardize patients who receive abortion care [13,16,17]. Additionally, outside traditional health care settings, reports have detailed how mobile apps collecting data about users’ health, sexual behavior, and menstrual cycles have shared user data without the user’s knowledge or consent [18] and how these data could be used to prosecute women who receive abortion care [19].

While there is limited research exploring patients’ historical trust in clinicians performing abortion, it is known that patients’ trust in reproductive health care clinicians and the recommendations these clinicians give are compromised for patients with marginalized identities and those with trauma experiences [20]. Without trust in clinicians, clinics, or health systems to protect reproductive health data, patients may choose not to disclose information about pregnancy-related vital health information such as their last menstrual period, prior abortions, or previous obstetric complications. To date, the impact of the Dobbs decision on social trust in health care systems, clinicians, and privacy of health data, particularly surrounding women’s health, has not been well characterized.

Social media, specifically the web-based microblogging platform Twitter, can be one avenue to explore public sentiment and changes in social trust. With over 235 million daily users reported in 2022 [21], Twitter users represent a sample of public discourse on the Dobbs decision and social trust in the health care community and privacy of health data. Prior work has demonstrated the use of Twitter as a tool for health research [22] and to understand trends in social concerns in the context of current events and public health, such as the COVID-19 pandemic [22-26], mental health [27,28], and Affordable Care Act marketplace enrollment [29]. Researchers have also used Twitter to analyze users’ backlash to Georgia’s 2019 abortion ban [30], and the temporal, geographical, and sentiment patterns in the public’s reaction to the overturning of Roe v Wade with further stratification between pro-life and pro-choice tweets [31]. Beyond the analysis of unstructured text on social media platforms, advanced text analysis is also becoming increasingly used in the health care domain [32]. In this study, we build upon the existing literature and analyze tweets to better characterize (1) Twitter users’ engagement and sentiment surrounding federal abortion legislation as it relates to abortion, health care, and privacy of health information over time and (2) the Dobbs decisions’ impact on the cultural connotations of expressions of trust in clinicians and the privacy of health information.


Ethics Approval

This study was not reviewed by an Institutional Review Board as it was performed on an open and publicly available dataset.

Overview

We analyzed changes in Twitter engagement and sentiment regarding abortion, health care, and privacy of health information before and after the Dobbs decision was leaked to the public. Furthermore, we sought to quantify the cultural connotation of expressions of trust for several reproductive health care and health information–related terms of interest for the predecision leak and postdecision leak periods. The predecision leak period is January 1, 2020, to May 1, 2022, and the postdecision leak period is May 2 to October 17, 2022. These periods correspond to before and after Politico published the draft majority opinion [1]. The postleak period includes the final decision released on June 24, 2022. We will refer to these distinct time periods as preleak and postleak periods henceforth. Figure 1 is a flowchart illustrating each step in our methods (Figure 1).

Figure 1. A flowchart illustrating each step in our methods starting from scraping twitter. Cylindrical shapes indicate a dataset and square boxes represent data processing or analysis. LDA: Latent Dirichlet Allocation; RoBERTa: Robustly Optimized Bidirectional Encoder Representations from Transformers Pre-training Approach; TFIDF: term frequency, inverse-document frequency .

Data Collection, Cleaning, and Preprocessing

We used the Python package snscrape (JustAnotherArchivist) [33] to web scrape Twitter for tweets that included the following terms: “Dobbs,” “Jackson,” “Dobbs” & “Jackson,” “Roe,” “Roe” & “Wade,” and the following permutations and combinations of terms (“Dobbs” or “Jackson” or “Roe” or “Wade” or “Abortion”) & (“privacy” or “share” or “HIPAA” or “private” or “disclosure” or “PHI”). The scrape included Tweets posted between January 1, 2020, and October 17, 2022. Duplicate tweets were filtered.

To select the tweets pertaining to the topic of interest, we used Python (version 3.8) and gensim (version 4.3.0) to train a Latent Dirichlet Allocation (LDA) model [34], which clustered our data into 20 topic areas to evaluate for and remove potential confounding topics (eg, Michael Jackson), given that manual filtering was not feasible for the size of our data set. This LDA model was then used to assign probabilities to a given tweet belonging to a certain topic. We selected tweets with a 95% or greater probability of belonging to topics that contained salient terms of interest.

Determining Preleak and Postleak Twitter Engagement and Sentiment

We computed Twitter engagement and sentiment on the LDA-filtered Tweets in the preleak and postleak periods. Engagement was defined as the average number of unique users per day with tweets including our terms of interest post deduplication and LDA topic filtering. Each tweet’s sentiment was calculated by a Robustly Optimized Bidirectional Encoder Representations from Transformers Pre-training Approach (also known as “RoBERTa”) model, which has been validated in the literature and fine-tuned on Twitter data [35-37]. Sentiment is defined as the model’s highest probability score for each tweet for negative, neutral, and positive sentiment. This model allowed us to encode a sentiment score for each tweet as negative (0), neutral (0.5), and positive (1.0). The encoded sentiment score was then averaged across tweets per day.

We also used the Causal Impact package (version 0.0.13) [38] to fit a Bayesian structural model on the preleak and postleak engagement and sentiment data. This model calculates posterior distributions for both the real-world data and a predicted counterfactual as a quasi-control. We used the posterior distribution to determine if a significant change had occurred in user engagement and sentiment across the preleak and postleak periods compared to the counterfactual quasi-control.

Cultural Connotations of Expressions of Trust at the Unigram Term Level

We sought to quantify the effect of the Dobbs decision on the cultural connotations of expressions of trust, particularly surrounding clinicians and privacy of health information, at the individual word—otherwise known as the unigram term. To do this, we adapted methods from Arseniev-Koehler et al [39]. We first generated word embeddings for these terms by training a Word2Vec [40] skip-gram model on corpora from both preleak and postleak periods. We then extracted a latent semantic dimension of trust from the embeddings. This dimension is the average vector of pairwise synonyms and antonyms of a trust anchor word set. The anchor words for the positive vector space are “trust,” “trustworthy,” “faith,” “reliable,” “confidence,” “certainty,” and “conviction.” The anchor words for the negative vector space are “mistrust,” “untrustworthy,” “distrust,” “unreliable,” “skepticism,” “uncertainty,” and “doubt.” After the dimension was extracted, we projected a target word vector, such as “HIPAA,” onto the dimension and computed its cosine similarity as a scalar, producing a similarity score. The higher the similarity score a term has to the trust dimension, the more likely the term is considered to have a trust connotation and vice versa. The specific unigram level target terms of interest are grouped into clinician terms: “doctor,” “provider,” “physician,” “nurse,” “ob,” “gyn,” “obgyn,” “pcp,” and health information terms: “app,” “information,” “periodtracking,” “tracking,” “PHI” (protected health information), “HIPAA,” “data.” Of note, the term “healthcare” was included in the clinician term set as a measure of trust in health care.

Term Frequency, Inverse-Document Frequency Volume, and Tweets Containing Terms of Interest for Trust Dimension

We sought to investigate volume changes and the specific example tweets that contain clinician terms and health information terms used in our cultural connotation of expression of trust analysis. To perform this, we first performed the term frequency, inverse-document frequency (TFIDF) [41] on our LDA-filtered data set for tweets that contain our terms. The purpose of the TFIDF is to weight documents by important terms. We were able to leverage TFIDF after we split the data set into tweets with a negative or positive sentiment score and mapped those sets of tweets to each term via TFIDF for the preleak and postleak periods. For visualization purposes, we normalized the volume of tweets on a standard score scale. Finally, we selected tweets that (1) surpassed the TFIDF’s weighted threshold per term and (2) did not contain article headlines and links. We then performed a granular manual review of tweets that matched these criteria and heuristically evaluated if their content applied to the theme of trust in clinician and health care information terms. The volume of tweets identified by the TFIDF for manual review was variable by term, ranging from 6 tweets per term to over 100, depending upon whatever the TFIDF deemed was the most important tweet per term.


Data Set and Filtering

We collected 14,912,707 tweets with our terms of interest. After removing duplicates, we had 10,360,328 unique tweets. Figure 2 shows the word counts and weights for the most salient terms for LDA topics, labeled “1” and “5” by the LDA model and referred to throughout as “Policy and Legality,” and “Health Privacy,” respectively (Figure 2). Topic 5 had the terms “privacy,” “right,” “abortion,” “woman,” and “please,” as the top 5 most salient terms. Topic 1 had “roe,” “abortion,” “state,” “law,” and “constitution” as its top 5 most salient terms. An interactive version of the topics from our LDA model can be viewed on the internet [42]. After filtering for tweets that only belonged to these topics, our final data set included 1,380,166 tweets.

Figure 2. The top 10 weighted terms for Topics 1 and 5 are shown. Tweet membership to these topics served as the inclusion criteria for our final dataset. The weights are specified as a dark bar and word count as a larger light bar.

Time Series Causal Impact Analysis—Engagement Results

The number of total unique daily users tweeting on topics in our LDA-filtered data set increased in the postleak period (Figure 3). According to the counterfactual estimate based on data from the preleak period, had the Dobbs decision not happened, we would have expected 110,552 (95% CI 76,195.37-145,356.23) total unique users and 650.13 (95% CI 448.21-855.04) unique daily users to tweet in the postleak time period. Instead, we observed 748,081 unique users and 4400.48 unique daily users during the postleak time period. The number of unique daily users increased by 576.86% (95% CI 545.34%-607.92%). The probability of obtaining this effect with a Bayesian one-sided tail area probability is P<.001. However, this increase was not sustained, as can be seen in (Figure 3A) where within several months post leak, the volume of actual unique daily users matches the predicted volume of daily users.

Figure 3. (A) The average number of unique users per day in black and the counterfactual estimate in blue. (B) The point difference in the predicted counterfactual and the actual number of unique users per day, and (C) the cumulative difference in blue, with the CI for all graphs being the blue shadow surrounding the line.

Time Series Causal Impact Analysis—Sentiment Results

The average daily sentiment for tweets on topics in our LDA-filtered data set decreased and became more negative in the postleak period and was persistently negative for several months post leak, as seen in Figure 4. From our counterfactual estimate from the preleak period, we would have expected an average daily sentiment of 0.44 (95% CI 0.43-0.45) or near neutral, had the decision not happened. The observed average daily sentiment was 0.35, a decrease of 19.81% (95% CI −22.98% to −16.59%). The probability of obtaining this effect by chance is minuscule with a Bayesian one-sided tail-area probability (P<.001).

Figure 4. (A) The average sentiment per day in black and the counterfactual estimate in blue, (B) the point difference in the predicted counterfactual and the actual average sentiment per day, and (C) the cumulative difference in blue, with the CI for all graphs being the blue shadow surrounding the line.

Word2Vec Analysis—Cultural Connotations Results

Groups of clinician-related and health information–related terms all show a net decrease in cosine similarity to the trust dimension (Figures 5 and 6). Many terms in the clinician group were already negatively associated with the trust dimension in the preleak period, such as “healthcare,” “provider,” “nurse,” “gyn,” “obgyn,” but these still had an overall reduction in association to the trust dimension in the postleak period. The terms “doctor,” “physician,” and “ob” from the clinician group were initially positively associated with the trust dimension in the preleak period; however, in the postleak period, all terms became negatively associated. Similarly, in the health information term group “tracking,” “PHI,” and “HIPAA” were also negatively associated with the trust dimension in the preleak period and became further negative in the postleak periods. The terms “app” and “information” were positively associated with the trust dimension in the preleak period; however, in the postleak period, these terms became negatively associated. The term “data” remained positive overall but showed a reduction in the postleak period as well. The terms “pcp” from the clinician terms and “periodtracking” from the health information terms did not appear in the preleak period data sets at all; however, in the postleak period, they both had negative associations with the trust dimension.

Figure 5. Plot of the terms of interest for the clinician related groups of terms projected onto the trust dimension of our Word2vec models. Blue is "pre" period and orange is the "post" period.
Figure 6. Plot of the terms of interest for the health information related groups of terms projected onto the trust dimension of our Word2vec models. Blue is the "pre" period and orange is the "post" period.

TFIDF Results

In the postleak period, there is a net increase in tweets that have a negative sentiment and map to our terms of interest, except for the term “PHI” which showed a decrease in volume (Figure 7). This differs from tweets with a positive sentiment, where there is no identifiable pattern for an increase or decrease in our terms of interest. Our investigation of tweets with the highest TFIDF score per term for the negative sentiment postleak period is listed in (Textbox 1).

Figure 7. The volume of tweets for a given term of interest for the pre and post periods, split by negative and positive sentiment. In the negative sentiment dataset there is a trend of increased utilization of terms of interest during the post Dobbs period. Where as this phenomena is not observed in the positive sentiment dataset. The volumes were normalized on a standard scale for visualization purposes.
Example tweets selected via term frequency, inverse-document frequency for our terms of interest.

Healthcare

  • “@user_redacted Overturning Roe gives strangers control of a woman's healthcare.”

Doctor

  • And this is why we can’t have abortion bans. Because we can’t trust our doctors to not turn us over to the state over private health matters. It’s not just student athletes who get asked about their period during their physicals.

Provider

  • “This makes no sense. The provider based this choice off the over turning of Roe v Wade, but where is the law that states they can’t prescribe that MEDICATION to anyone? Providers are causing the issue themselves.”

Physician

  • “@user_redacted @user_redacted @user_redacted @user_redacted @user_redacted You have no idea what what the convos looked like between the physicians and patient, because it’s not your health ans it’s private. The physicians could have said no to protect their job, and that will happen more and more bc of the overturning of Roe, regardless of the laws.”

Nurse

  • “I have seen too many nurses praise the over turning of roe v wade and that’s???? Super concerning????”

Ob

  • “@user_redacted Yesterday, a recently retired Ob/Gyn told me that before Roe, the Ob/Gyns were the specialists in sepsis because they saw septic abortions every night they were on call. Guess they should brush up on those skills...”

Gyn

  • “I don’t know what the future of OB-Gyn looks like without the protections of Roe V Wade. Frankly I’m scared.”

ObGyn

  • “@user_redacted @user_redacted @user_redacted @user_redacted @user_redacted It was for me until roe was overturned . Now it’s my front burner. Seeing these tweets from OBGYN’s who can’t even perform medical abortions now in some states is just not right. They need lawyers just to proceed with care because the laws are so poorly written. It’s horrific”

PCP

  • #RoeVWade My wife will not survive a pregnancy. She's on birth control but when 45 was elected I immediately booked a vasectomy because of exactly this.\n\nWhen I got the referral from my pcp she asked why and i said the fascist coup.

App

  • @user_redacted Roe v Wade overturned in US today making abortion illegal in many states. If you get an abortion for any reason, you could be prosecuted & info in your period tracking app can be used as evidence against you. Contact CS for your app and ask that your data be wiped.

Information

  • @user_redacted @user_redacted Collateral damage from overturning Roe. When a state can pass laws governing women’s reproductive systems, sharing information on menstrual cycles and birth control usage with a dr could put you in legal jeopardy. Withholding such information from dr could be life threatening.😤'

Tracking

  • '@user_redacted I hear you. All those states banning abortion, tracking who’s buying contraception, tracking folk going to planned parenthood… yeah them justices giving them judicial approval to do that stuff don’t get to be private any longer”

PHI

  • “@user_redacted Except that HIPAA makes an exception for disclosure PHI required by other (state) laws. A state could make abortion - even miscarriage - a mandated reporting incident, just like gunshot wounds.”

HIPAA

  • “@user_redacted @user_redacted @user_redacted menstruation is not protected under HIPAA now that abortion is illegal. you can be investigated for miscarriage, stillbirths, etc which can all be calculated by the dates of a period. knowing if it’s regular is more than enough info”

Data

  • “Don't get it twisted! Cellphone tracking data will absolutely “be used to target both patients and abortion providers if Roe v. Wade is overturned.” #BansOffOurBodies #AbortionIsHealthcare #AbortionIsARight
Textbox 1. Example tweets selected via term frequency, inverse-document frequency for our terms of interest.

Principal Findings

This study analyzed tweets to characterize Twitter users’ engagement and sentiment in the context of patient trust surrounding the leak of the Supreme Court decision in Dobbs. Our study revealed (1) a transient increase in the number of daily users tweeting about abortion, health care, and privacy of health information; (2) a sustained and statistically significant decrease in the average daily sentiment on these topics post leak; (3) a decrease in the association of the latent dimension of trust across most clinician-related and health information–related terms; (4) an increased frequency of tweets with these clinician-related and health information–related terms and concomitant negative sentiment in the postleak period. These study findings are supported by a review of individual tweets from our TFIDF analysis for each term where it is apparent in the postleak period data set that expressions of trust regarding clinicians and the privacy of health information are lacking. Taken together, these results suggest that the Dobbs decision’s impact may extend beyond its direct medicolegal implications and could contribute to an insidious erosion of trust in the clinician-patient relationship.

A study by Mane et al [31] similarly assessed the public’s reaction on Twitter following Dobbs and found a global increase in conversation about Roe v Wade and abortion-related topics. In contrast to our study, Mane et al [31] stratified tweets as “Pro-Life” and “Pro-Choice,” revealing an increase in Pro-Life tweets that persisted longer than Pro-Choice tweets, although there was more negative sentiment overall following the Dobbs decision. We replicate and expand on many of Mane et al’s [31] findings, including the increase in Twitter activity discussing abortion surrounding the time of the Dobbs decision leak and demonstrate this link through our posterior distribution analysis. Our analysis extends the literature by introducing a novel analysis of cultural connotations of tweets for expressions of trust in various clinician-related and health information–related terms and shows an alarming decrease in trust associated with these terms.

We find that after the Dobbs decision, most clinician-related terms suffered a large decrease in association with trust. Notably, the clinician term “nurse” suffered the largest decrease in trust between the preleak and postleak period. This is interesting in the context of how, for the past 20 years, the field of nursing has been considered the profession with the highest rating of honesty and ethical standards in public polling [43]. These changes in perception can have real-world impacts on the delivery of care. Trust in clinicians and health systems, and in the sanctity of the patient-clinician relationship, have been shown to play a role in a patient's access to and use of medical care, adherence to medications and treatment plans, continuity of care, and even self-reported health status [44,45]. Furthermore, similar historical events, like the Tuskegee syphilis study, have shown that these erosions in trust can have long-lasting impacts on patients [46].

We also find a negative impact on the expressions of trust in the privacy of health information. Among clinicians, the medical literature has highlighted the risks associated with how health information, in the traditional care setting or otherwise, can be used to prosecute patients, clinicians, and others who in any form aid in the access to abortion care [13-15,17]. Among the general public, there has been increased attention to and awareness of how apps can collect and misuse data [18,19,47]. Additionally, a recent study of patient perspectives around data privacy revealed that nearly 75% of people are concerned about protecting the privacy of their health data [48]. Our findings support and focus these privacy concerns specifically to the Dobbs decision, as patients may be less willing to trust clinicians with reproductive health information and therefore inclined to withhold relevant health information. The clinical implications of missing and incomplete information have been studied in the context of interoperability of health data. Studies have found that the lack of data sharing across health systems and clinical teams hinders and even endangers appropriate clinical care [49-51]. Patients unwilling to share complete and relevant health information due to this erosion of trust may have similar impacts on care.

From a technical perspective, this study demonstrates a different approach to analyzing Twitter activity using natural language processing modeling techniques compared to prior work that queried tweets based on a hashtag for abortion legislation and manually labeled tweet content and themes [30]. We used an unsupervised machine learning technique, LDA, to identify common topics among a cohort of tweets, allowing us to capture tweets that may not have a hashtag associated with them. In fact, in our manual evaluation of tweet contents related to our terms of interest illustrated in Textbox 1, only 1 out of the 15 tweets would have been captured by a hashtag. A similar LDA approach has been applied to COVID-19–related tweets, describing the changes in content and sentiment during the early stages of the pandemic [24]. This study also uses a novel approach to analyzing expressions of trust. This is the first study to our knowledge that studied the cultural connotations of expressions of trust by analyzing the content of tweets at the unigram term level. Moving forward, this analytic technique could be used as a tool to analyze expressions of trust in other forms of writing or speech.

There are several potential limitations to internal validity and generalizability: (1) the list of keywords to start our scrape of Twitter data was selected to be comprehensive, but it still may have missed alternative terminology or misspellings. This may have introduced some selection bias in the tweets related to abortion and privacy that we identified. Additionally, scraping is not meant to be thorough or include a full representative set of tweets; therefore, some relevant tweets may have been excluded from our sampling. (2) There may be topics other than privacy related to abortion and health care that have increased engagement. We mitigate this by filtering our data set via LDA probability to ensure that we are analyzing tweets most related to these topics. (3) The results of the sentiment analysis help show a change in sentiment, as opposed to absolute sentiment, as the absolute sentiment on Twitter is likely to be biased toward a negative sentiment. (4) The measurement of public interest and sentiment toward abortion and privacy based on Twitter is limited by capturing a subset of populations that are present on Twitter. Therefore, it may fail to represent the engagement and sentiments of social groups who are disproportionately not on social media platforms, such as those of lower socioeconomic status with less access to personal devices, or any other population of persons who are otherwise not on social media. (5) In our LDA filtering, tweets that were not written in English were grouped into their own topics, and therefore excluded from our study, so our conclusions may not be generalizable to populations whose preferred language is not English.

Conclusions

Beyond obstructing access to the full spectrum of reproductive health care and compromising reproductive autonomy, the Dobbs ruling has broader consequences for health systems and reproductive health care. This study revealed a statistically significant increase in the number of daily users tweeting about abortion, health care, and privacy of health information and a statistically significant decrease in the average daily sentiment surrounding these topics that were persistent throughout the postleak period. The study also revealed the negative impact of the Dobbs decision on the cultural connotation of expressions of trust in clinicians and health information–related terms. The finding of a decrease in the expression of trust in clinicians and health information–related terms provides evidence to support advocacy and initiatives that proactively address concerns of trust in health systems and services.

Authors' Contributions

KS, AR, BW, SS, EP, and SA-A contributed to the study design, writing, and editing of the manuscript. KS performed the analyses.

Conflicts of Interest

None declared.

  1. Supreme Court has voted to overturn abortion rights, draft opinion shows - POLITICO.   URL: https://www.politico.com/news/2022/05/02/supreme-court-abortion-draft-opinion-00029473 [accessed 2022-11-17]
  2. Dobbs v. Jackson Women's Health Organization. Supreme Court of the United States. 2022.   URL: https://www.supremecourt.gov/opinions/21pdf/19-1392_6j37.pdf [accessed 2023-04-20]
  3. Roe v. Wade, 410 U.S. 113 (1973). Justia US Supreme Court Center.   URL: https://supreme.justia.com/cases/federal/us/410/113/ [accessed 2022-11-17]
  4. Planned Parenthood of Southeastern Pa. v. Casey, 505 U.S. 833 (1992). Justia US Supreme Court Center.   URL: https://supreme.justia.com/cases/federal/us/505/833/ [accessed 2022-11-17]
  5. Abortion rights supporters score midterm victories in at least 4 states. CBS News.   URL: https:/​/www.​cbsnews.com/​live-updates/​abortion-2022-election-results-ballot-initiatives-states-voting/​ [accessed 2022-11-27]
  6. Post-Dobbs State Abortion Restrictions and Protections. Center for Public Health Law Research.   URL: https://lawatlas.org/datasets/post-dobbs-state-abortion-restrictions-and-protections [accessed 2022-11-27]
  7. Myers C, Jones R, Upadhyay U. Predicted changes in abortion access and incidence in a post-Roe world. Contraception 2019;100(5):367-373. [CrossRef] [Medline]
  8. Ralph LJ, Schwarz EB, Grossman D, Foster DG. Self-reported physical health of women who did and did not terminate pregnancy after seeking abortion services: a cohort study. Ann Intern Med 2019;171(4):238-247. [CrossRef] [Medline]
  9. Biggs MA, Upadhyay UD, McCulloch CE, Foster DG. Women's mental health and well-being 5 years after receiving or being denied an abortion: a prospective, longitudinal cohort study. JAMA Psychiatry 2017;74(2):169-178. [CrossRef] [Medline]
  10. Foster DG, Biggs MA, Ralph L, Gerdts C, Roberts S, Glymour MM. Socioeconomic outcomes of women who receive and women who are denied wanted abortions in the united states. Am J Public Health 2018;108(3):407-413. [CrossRef] [Medline]
  11. Lagu T, DeJong C, Delk C. Abortion bans and implications for physician-patient trust. J Hosp Med 2022;17(6):499-501. [CrossRef] [Medline]
  12. HIPAA privacy rule and disclosures of information relating to reproductive health care. HHS.gov.   URL: https://www.hhs.gov/hipaa/for-professionals/privacy/guidance/phi-reproductive-health/index.html [accessed 2022-07-18]
  13. Khanna RR, Murray SG, Wen T, Salmeen K, Illangasekare T, Benfield N, et al. Protecting reproductive health information in the post-Roe era: interoperability strategies for healthcare institutions. J Am Med Inform Assoc 2022;30(1):161-166. [CrossRef] [Medline]
  14. Clayton EW, Embí PJ, Malin BA. Dobbs and the future of health data privacy for patients and healthcare organizations. J Am Med Inform Assoc 2022;30(1):155-160 [FREE Full text] [CrossRef] [Medline]
  15. Arvisais-Anhalt S, Ravi A, Weia B, Aarts J, Ahmad HB, Araj E, et al. Paging the clinical informatics community: respond STAT to Dobbs v. Jackson's women's health organization. Appl Clin Inform 2023;14(1):164-171 [FREE Full text] [CrossRef] [Medline]
  16. Zubrzycki C. Abortion’s interoperability trap: how the law of medical records will facilitate interstate persecution of contested medical procedures, and what to do about it. SSRN J. Preprint posted online on June 29, 2022 2022;197:197-227. [CrossRef]
  17. Walker DM, Hoffman S, Adler-Milstein J. Interoperability in a post-roe era: sustaining progress while protecting reproductive health information. JAMA 2022;328(17):1703-1704. [CrossRef] [Medline]
  18. Developer of popular women's fertility-tracking app settles FTC allegations that it misled consumers about the disclosure of their health data. Federal Trade Commission.   URL: https:/​/www.​ftc.gov/​news-events/​news/​press-releases/​2021/​01/​developer-popular-womens-fertility-tracking-app-settles-ftc-allegations-it-misled-consumers-about [accessed 2022-08-02]
  19. In a post-roe world, the future of digital privacy looks even grimmer. The New York Times.   URL: https://www.nytimes.com/2022/07/13/technology/personaltech/abortion-privacy-roe-surveillance.html [accessed 2022-07-18]
  20. Cahill EP, Doyle A. Trauma-informed abortion care. Curr Opin Obstet Gynecol 2021;33(6):453-457. [CrossRef] [Medline]
  21. Operational and Financial Highlights. Twitter Announces Second Quarter 2022 Results.   URL: https://s22.q4cdn.com/826641620/files/doc_financials/2022/q2/Final_Q2'22_Earnings_Release.pdf [accessed 2022-07-22]
  22. Sinnenberg L, Buttenheim AM, Padrez K, Mancheno C, Ungar L, Merchant RM. Twitter as a tool for health research: a systematic review. Am J Public Health 2017;107(1):e1-e8. [CrossRef] [Medline]
  23. Medford RJ, Saleh SN, Sumarsono A, Perl TM, Lehmann CU. An "Infodemic": leveraging high-volume Twitter data to understand early public sentiment for the coronavirus disease 2019 outbreak. Open Forum Infect Dis 2020;7(7):ofaa258 [FREE Full text] [CrossRef] [Medline]
  24. Lanier HD, Diaz MI, Saleh SN, Lehmann CU, Medford RJ. Analyzing COVID-19 disinformation on Twitter using the hashtags #scamdemic and #plandemic: retrospective study. PLoS One 2022;17(6):e0268409 [FREE Full text] [CrossRef] [Medline]
  25. Saleh SN, Lehmann CU, McDonald SA, Basit MA, Medford RJ. Understanding public perception of coronavirus disease 2019 (COVID-19) social distancing on Twitter. Infect Control Hosp Epidemiol 2021;42(2):131-138 [FREE Full text] [CrossRef] [Medline]
  26. Saleh SN, McDonald SA, Basit MA, Kumar S, Arasaratnam RJ, Perl TM, et al. Public perception of COVID-19 vaccines through analysis of Twitter content and users. medRxiv. Preprint posted online on October 18, 2021 2021.
  27. Guntuku SC, Preotiuc-Pietro D, Eichstaedt JC, Ungar LH. What Twitter profile and posted images reveal about depression and anxiety. ICWSM 2019:236-246. [CrossRef]
  28. Andy A, Sherman G, Guntuku SC. Understanding the expression of loneliness on Twitter across age groups and genders. PLoS One 2022;17(9):e0273636 [FREE Full text] [CrossRef] [Medline]
  29. Wong CA, Sap M, Schwartz A, Town R, Baker T, Ungar L, et al. Twitter sentiment predicts affordable care act marketplace enrollment. J Med Internet Res 2015;17(2):e51 [FREE Full text] [CrossRef] [Medline]
  30. Doan AE, Bogen KW, Higgins E, Orchowski LM. A content analysis of twitter backlash to Georgia's abortion ban. Sex Reprod Healthc 2022;31:100689. [CrossRef] [Medline]
  31. Mane H, Yue X, Yu W, Doig AC, Wei H, Delcid N, et al. Examination of the public's reaction on Twitter to the over-turning of Roe v Wade and abortion bans. Healthcare (Basel) 2022;10(12):2390 [FREE Full text] [CrossRef] [Medline]
  32. Elbattah M, Arnaud E, Gignon M, Dequen G. The role of text analytics in healthcare: a review of recent developments and applications. : SciTePress; 2021 Presented at: Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies; February 11-13, 2021; Vienna, Austria p. 825-832. [CrossRef]
  33. JustAnotherArchivist/snscrape: a social networking service scraper in Python. GitHub.   URL: https://github.com/JustAnotherArchivist/snscrape [accessed 2023-01-13]
  34. Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. J Mach Learn Res 2003;3:993-1022.
  35. Barbieri F, Camacho-Collados J, Anke LE, Neves L. Tweeteval: unified benchmark and comparative evaluation for tweet classification. In: Findings of the Association for Computational Linguistics: EMNLP 2020. Toronto, Canada: Association for Computational Linguistics; 2020:1644-1650.
  36. Maghsoudi A, Nowakowski S, Agrawal R, Sharafkhaneh A, Kunik ME, Naik AD, et al. Sentiment analysis of insomnia-related tweets via a combination of transformers using Dempster-Shafer theory: pre- and peri-COVID-19 pandemic retrospective study. J Med Internet Res 2022;24(12):e41517 [FREE Full text] [CrossRef] [Medline]
  37. Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, et al. HuggingFace's transformers: state-of-the-art natural language processing. arXiv. Published posted on online 9 October 2019 2019. [CrossRef]
  38. Martín Cervantes PA, Cruz Rambaud S. An empirical approach to the "Trump effect" on US financial markets with causal-impact bayesian analysis. Heliyon 2020;6(8):e04760 [FREE Full text] [CrossRef] [Medline]
  39. Arseniev-Koehler A, Foster JG. Machine learning as a model for cultural learning: teaching an algorithm what it means to be fat. Sociol Methods Res 2022;51(4):1484-1539. [CrossRef]
  40. Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. ArXiv. Preprint posted online on January 16,2013 2013. [CrossRef]
  41. Sparck Jones K. A statistical interpretation of term specificity and its application in retrieval. J Doc 1972;28(1):11-21. [CrossRef]
  42. Intertopic distance map. twitterdobbsci.tiiny.site.   URL: https://twitterdobbsci.tiiny.site/ [accessed 2023-02-20]
  43. Saad L. Military brass, judges among professions at new image lows. Gallup. 2022.   URL: https://news.gallup.com/poll/388649/military-brass-judges-among-professions-new-image-lows.aspx [accessed 2023-01-04]
  44. Ozawa S, Sripad P. How do you measure trust in the health system? A systematic review of the literature. Soc Sci Med 2013;91:10-14. [CrossRef] [Medline]
  45. Fuertes JN, Toporovsky A, Reyes M, Osborne JB. The physician-patient working alliance: theory, research, and future possibilities. Patient Educ Couns 2017;100(4):610-615. [CrossRef] [Medline]
  46. Scharff DP, Mathews KJ, Jackson P, Hoffsuemmer J, Martin E, Edwards D. More than Tuskegee: understanding mistrust about research participation. J Health Care Poor Underserved 2010;21(3):879-897 [FREE Full text] [CrossRef] [Medline]
  47. Cox J. Data broker is selling location data of people who visit abortion clinics. Motherboard:Tech by Vice. 2022.   URL: https://www.vice.com/en/article/m7vzjb/location-data-abortion-clinics-safegraph-planned-parenthood [accessed 2022-07-21]
  48. AMA health data privacy framework. American Medical Association.   URL: https://www.ama-assn.org/delivering-care/patient-support-advocacy/ama-health-data-privacy-framework [accessed 2022-10-03]
  49. Adams KT, Howe JL, Fong A, Puthumana JS, Kellogg KM, Gaunt M, et al. An analysis of patient safety incident reports associated with electronic health record interoperability. Appl Clin Inform 2017;8(2):593-602 [FREE Full text] [CrossRef] [Medline]
  50. Li E, Clarke J, Ashrafian H, Darzi A, Neves AL. The impact of electronic health record interoperability on safety and quality of care in high-income countries: systematic review. J Med Internet Res 2022;24(9):e38144 [FREE Full text] [CrossRef] [Medline]
  51. Howe JL, Adams KT, Hettinger AZ, Ratwani RM. Electronic health record usability issues and potential contribution to patient harm. JAMA 2018;319(12):1276-1278 [FREE Full text] [CrossRef] [Medline]


Dobbs: Dobbs v Jackson Women’s Health Organization
HIPAA: Health Insurance Portability and Accountability Act
LDA: Latent Dirichlet Allocation
PHI: Protected Health Information
TFIDF: term frequency, inverse-document frequency


Edited by T Leung, K Williams; submitted 20.02.23; peer-reviewed by L Dodge, M Elbattah; comments to author 30.03.23; revised version received 06.04.23; accepted 11.04.23; published 12.05.23

Copyright

©Karl Swanson, Akshay Ravi, Sameh Saleh, Benjamin Weia, Elizabeth Pleasants, Simone Arvisais-Anhalt. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 12.05.2023.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.