Original Paper
Abstract
Background: Large racial and ethnic disparities in adverse birth outcomes persist. Increasing evidence points to the potential role of racism in creating and perpetuating these disparities. Valid measures of area-level racial attitudes and bias remain elusive, but capture an important and underexplored form of racism that may help explain these disparities. Cultural values and attitudes expressed through social media reflect and shape public norms and subsequent behaviors. Few studies have quantified attitudes toward different racial groups using social media with the aim of examining associations with birth outcomes.
Objective: We used Twitter data to measure state-level racial sentiments and investigate associations with preterm birth (PTB) and low birth weight (LBW) in a multiracial or ethnic sample of mothers in the United States.
Methods: A random 1% sample of publicly available tweets from January 1, 2011, to December 31, 2021, was collected using Twitter’s Academic Application Programming Interface (N=56,400,097). Analyses were on English-language tweets from the United States that used one or more race-related keywords. We assessed the sentiment of each tweet using support vector machine, a supervised machine learning model. We used 5-fold cross-validation to assess model performance and achieved high accuracy for negative sentiment classification (91%) and a high F1 score (84%). For each year, the state-level racial sentiment was merged with birth data during that year (~3 million births per year). We estimated incidence ratios for LBW and PTB using log binomial regression models, among all mothers, Black mothers, racially minoritized mothers (Asian, Black, or Latina mothers), and White mothers. Models were controlled for individual-level maternal characteristics and state-level demographics.
Results: Mothers living in states in the highest tertile of negative racial sentiment for tweets referencing racial and ethnic minoritized groups had an 8% higher (95% CI 3%-13%) incidence of LBW and 5% higher (95% CI 0%-11%) incidence of PTB compared to mothers living in the lowest tertile. Negative racial sentiment referencing racially minoritized groups was associated with adverse birth outcomes in the total population, among minoritized mothers, and White mothers. Black mothers living in states in the highest tertile of negative Black sentiment had 6% (95% CI 1%-11%) and 7% (95% CI 2%-13%) higher incidence of LBW and PTB, respectively, compared to mothers living in the lowest tertile. Negative Latinx sentiment was associated with a 6% (95% CI 1%-11%) and 3% (95% CI 0%-6%) higher incidence of LBW and PTB among Latina mothers, respectively.
Conclusions: Twitter-derived negative state-level racial sentiment toward racially minoritized groups was associated with a higher risk of adverse birth outcomes among the total population and racially minoritized groups. Policies and supports establishing an inclusive environment accepting of all races and cultures may decrease the overall risk of adverse birth outcomes and reduce racial birth outcome disparities.
doi:10.2196/44990
Keywords
Introduction
Low birth weight (LBW) and preterm birth (PTB) are widely used indicators of reproductive health [
, ] and are associated with an increased risk of infant mortality [ ], developmental delays [ ], and cardiometabolic disorders in adulthood [ , ]. Large racial disparities in these birth outcomes persist in the United States [ , ]. Black mothers in particular have substantially higher rates of LBW and PTB, infant mortality, and maternal morbidity compared to White mothers. For example, pregnancy-related mortality is over 3 times higher among Black compared to White women, and LBW rates in 2020 were 14.2% for Black mothers, but 6.8% for White mothers [ ]. These disparities cannot be fully accounted for by sociodemographic and individual-level factors [ - ], and there is increasing evidence for the role of racism in creating and perpetuating race-based disparities in birth outcomes [ , ].Racism is a well-established social determinant of health that operates at and across multiple levels. This includes the internalized, personally mediated (also referred to as individual or interpersonal), institutional, structural, and cultural dimensions of racism [
]. Internalized racism refers to the acceptance of negative beliefs about their own race by individuals of racially stigmatized groups [ ]. Personally mediated racism refers to racial prejudice (attitudes) and stereotypes (beliefs and assumptions) according to race and discrimination (differential treatment based on race) enacted between individuals [ ]. Institutional racism refers to laws, policies, and practices of particular institutions in providing advantages and disadvantages, differentially, according to race [ ]. Structural racism involves the coordination and interaction of multiple institutions and systems, such as those involved in housing, prison, banking, and education, to provide differential access and resources according to racial group identity [ ]. Research has found experiences of racism to be associated with a wide variety of health outcomes [ , ]. However, most of this research has examined how racism operates at the individual level, whereas work investigating how racism operates at other levels has been relatively limited.Exposure to racism is also a psychosocial stressor that, when experienced chronically, has demonstrated negative health effects that contribute to birth outcome disparities [
, ]. Maternal stress can alter neuroendocrine function, impact immune and inflammatory responses, and affect the vascular system. Racism stress, in particular, is associated with increases in the systemic corticotropin-releasing hormone, which stimulates the release of prostaglandins from the placenta and facilitates oxytocin’s role in initiating contractions earlier in pregnancy [ ]. Stress can prompt vasoconstriction, increasing blood pressure and reducing blood flow to the fetus [ ]. Stress also impacts the immune system, hindering its ability to fight infections that linger in the body longer, and is associated with an increased likelihood of preterm labor [ ]. Racism-related stress can also lead to maladaptive coping behaviors, such as smoking and alcohol use. Additionally, women who are born with LBW are more likely to give birth to children with LBWs [ ], perpetuating the impacts of racism intergenerationally. Racism may also influence birth outcomes by limiting access to resources and opportunities, such as education, employment, health care, and housing [ , ]. For example, historical practices like redlining and discriminatory home loan lending led to racial residential segregation and disinvestment in communities of color, which continue to have measurable effects on health disparities today, including racial birth outcome disparities [ ].An emerging body of research has revealed the foundational role of cultural racism in perpetuating race-based disparities. Cultural racism is the infusion of the ideology of racial hierarchy into the values, language, imagery, symbols, and unstated assumptions of the larger society [
]. It is displayed through the media, stereotyping, and norms within society and its institutions [ ]. Importantly, cultural racism produces a context that supports the other dimensions of racism to maintain and reinforce health inequities [ ]. Cultural racism, or the accepted norms, values, and ideologies of a racialized society, becomes realized in policies and practices within and across institutions [ ]. In this way, cultural and structural racism are mutually reinforcing. One way of tapping into cultural racism is to assess community members’ attitudes toward other racial groups [ , ]. A promising line of work has used such approaches to assess the impact of area-level measures of racial prejudice (also referred to as sentiment or animus) on health. A 2022 systematic review of the literature on area-level prejudice and health revealed that all studies found an association between higher area-level prejudice and worse health outcomes among racially minoritized groups, and 4 studies even showed the negative impact of prejudice among White samples [ ]. However, research on birth outcomes is limited as only 4 of the studies in the aforementioned review investigated birth outcomes [ ]. Moreover, measuring racial sentiment using traditional survey approaches can be costly, time-consuming, and subject to self-report biases [ - ]. Alternatively, social media data provide unique opportunities for assessing population-level racial sentiment, which can be used as an indicator of cultural racism.Twitter represents a rapid mode of communication, with millions of tweets sent daily by users across the globe, 80%-90% of whom have publicly accessible Twitter profile [
]. Cultural values and attitudes expressed on the web through social media both reflect and shape public norms, beliefs, and subsequent behaviors [ , ]. Few studies have attempted to quantify attitudes toward different racial groups using Twitter data with the aim of examining how it relates to birth outcomes. Using state-level data from 2015 to 2017, our previous study demonstrated that state-level racial sentiment was associated with implicit and explicit racial bias [ ], and exhibited associations with birth outcomes of different groups of minority women [ ]. This study aims to extend the findings of previous work by (1) examining a wider time frame (2011-2021) to assess whether the associations between Twitter-derived racial sentiment and birth outcomes are persistent across the years, and (2) through a closer examination of the distribution of attitudes toward different racial groups at the state- and county-level to see how they related to birth outcomes within and across racial or ethnic groups.Methods
A random sample comprising 1% of publicly available tweets from January 1, 2011 to December 31, 2021 was collected using Twitter’s Academic Application Programming Interface. We restricted our analyses to tweets that were in English, from the United States, and used one or more race-related keywords (
, Table S1). These race-related keywords were constructed based racial and ethnic terms from the US Census, previous studies examining race-related web-based conversations [ ], and a web-based database of racial slurs [ ]. We included tweets with a unique tweet id. We dropped duplicate tweets according to their “tweet_id.” Retweets and quoted tweets are included in this data. Our analytic sample included 56,400,097 tweets from 3,699,646 unique users.Sentiment Analysis
We assessed the sentiment of each tweet using a support vector machine (SVM), a supervised machine learning model. A full description of our model has been previously described by Nguyen and colleagues [
]. Our training data was comprised of manually labeled tweets obtained from Sentiment140 (n=498) [ ], Kaggle (n=7086) [ ], Sanders (n=5113) [ ], as well as 6481 tweets labeled by our research group. Sentiment140, Sanders, and Kaggle datasets are all publicly available training datasets specifically labeled for sentiment analysis. We used 5-fold cross-validation to assess model performance and achieved a high level of accuracy for the negative sentiment classification (91%) and a high F1 score (84%). Accuracy is measured as the number of posts with the correct prediction divided by the total number of tweets in the testing data set. The F1 score is a measure that balances precision (positive predicted value and recall [sensitivity]); a high F1 score suggests a model is robust in predicting posts that are labeled as 1. The trained SVM model was used to analyze our Twitter data set for negative sentiment classification. Please see the code for the data collection and sentiment analysis model in .To assess historical trends in sentiment, average negative sentiment scores for each racial and ethnic group along with the sentiment scores referencing racially minoritized groups were plotted as a function of time. The line chart was plotted using the “Matplotlib” library in Python [
]. We visualized the average negative sentiment scores with geographic plots using the Tableau software. The sentiment scores were obtained from the SVM model and grouped using the state Federal Information Processing Standards (FIPS) codes.Individual-Level Birth Outcomes Data
Individual birth outcomes data through 2011-2021 were obtained from restricted US natality files that included geographic identifiers. The latest year the natality files are currently available is 2021. We chose 2011 to examine trends in the last decade. In addition, Twitter introduced Twitter “Places” for geotagging tweets in June 2010 [
]. The natality files include all births in the United States during this time period. The data come from birth certificates filed in each state. The analysis was restricted to singleton births with no congenital abnormalities. These exclusion criteria helped ensure that associations with our birth outcomes were not due to congenital abnormalities [ ] or twins, triplets, and other higher order multiple births, which are known to increase the risk for LBW and PTB [ ]. The primary outcomes were LBW (defined as birth weight <2499 g) and PTB (defined as gestational age <37 weeks).Covariates
We adjusted for potential confounders at the individual- and area-level when assessing the association between racial sentiment and birth outcomes. Individual-level covariates in our models included birth year, maternal age (linear spline with knots at 19, 25, 29, 33, and 38 years), race (non-Hispanic White, non-Hispanic Black, non-Hispanic Asian), Hispanic ethnicity, marital status (married or unmarried), education (less than high school, high school or General Education Development, some college, bachelor’s degree, master’s degree, or doctorate), and first birth (yes or no), and prenatal care initiation during the first trimester (yes or no). We also adjusted for state-level characteristics, including proportions of non-Hispanic Black and Latinx individuals, population density (per square mile), and economic disadvantage (standardized factor score) [
, ] summarizing the following variables (%): unemployed; some college education, high school diploma, children in poverty, single parent household, and median household income) to account for state-level compositional differences in demographic and economic characteristics. Use of the factor score has been previously established [ ]. State-level covariates were derived from 2011 to 2021 from the American Community Survey [ ].Statistical Analyses
For each year, the state-level racial sentiment was merged with birth data during that year. The racial sentiment was coded in tertiles for analysis assessing associations with birth outcomes. We estimated incidence ratios for LBW and PTB (separately) using log binomial regression models with state-level racial sentiment as our independent variable of interest, and controlled for individual-level maternal characteristics and state-level demographic characteristics. We evaluated statistical significance at P<.05. Our primary models examined the association between state-level sentiment toward all racially minoritized groups on birth outcomes among the full sample, racially minoritized mothers, and White mothers. Follow-up analyses examined associations between state-level sentiment toward individual racial groups (Asian, Black, Latinx, and White) and birth outcomes for those groups specifically. Sensitivity analyses were conducted to examine the association between county-level racial sentiment and birth outcomes. Although all tweets collected had place information that permitted the identification of the state, only 60% (n=33,840,058) of tweets collected had place information to identify the county. Thus, we consider the county-level analyses to be supplementary. Sensitivity analyses also excluded tweets from users who tweeted more than 1000 times per year, which represented less than 1% (n=1066) of all tweets. Stata MP 16 [
] was used for statistical analysis.Ethical Considerations
This study was determined exempt by the University of Maryland College Park Institutional Review Board (1797788-1).
Results
Among the terms assessed, the top 20 terms by year were present in 86% (n=48,374,805) of all tweets concerning a racial or ethnic minority group (
, Table S2). The proportion of tweets referencing Black people, Latinx, and White people that were negative increased over time, peaking in 2019. For tweets referencing White people and Latinx, we saw a plateauing or declining trend in 2020-2021. The proportion of tweets referencing Asians that were negative climbed until 2020 and declined slightly in 2021 ( ). Black people experienced the highest negative sentiment of all groups. After a brief period of decline in 2020, the proportion of tweets referencing Black people that were negative began to rise again. Higher proportions of negative tweets referencing Black Americans were found in the Southern and Northeastern US regions. Geographic distribution of tweets referencing racially minoritized groups, Latinx, and Asians are presented in (Figures S1-3).Approximately 55% of births were to White mothers; 22% of the samples were Latinx. Black and Asian mothers made up 15% (n=5,152,595) and 6% (n=2,116,785) of the sample, respectively. The majority of women were US born (78%, n=27,630,726), and 31% (n=11,088,533) completed college or beyond.
The state-level negative racial sentiment was consistently associated with adverse birth outcomes controlling for individual maternal characteristics and state-level sociodemographic characteristics (
and ). For the total population, mothers living in states in the highest (third) tertile of negative racial sentiment for tweets referencing racial and ethnic minoritized groups have an 8% higher (95% CI 3%-13%) incidence of LBW and a 5% higher (95% CI 0%-11%) incidence of PTB compared to mothers living in the lowest (first) tertile of negative racial sentiment. Moreover, associations were found for mothers living in states in the second vs lowest tertile of negative racial sentiment. Negative racial sentiment referencing racially minoritized groups was associated with birth outcomes in the total population, among minoritized mothers and White mothers. Associations are slightly higher for racially minoritized mothers in all but one of the models where racially minoritized and White mothers show similar associations with racial sentiment ( ).When examining race- and ethnic-specific sentiment and birth outcomes, the strongest and most consistent associations were found for negative sentiment in tweets referencing Black people and birth outcomes among Black mothers. Black mothers living in states in the third tertile of negative Black sentiment had a 6% higher incidence of LBW and a 7% higher incidence of PTB compared to mothers living in the lowest tertile states. Negative sentiment in tweets referencing Latinx individuals was associated with a 6% and 3% higher incidence of LBW and PTB, respectively, comparing Latinx mothers living in the highest compared to the lowest tertile states. Negative sentiments referencing White individuals was not associated with birth outcomes among White mothers (
).tables present these associations by year (Tables S3-6). Associations between state-level negative racial sentiment for tweets referencing minoritized groups tended to be more strongly associated with birth outcomes among minoritized mothers compared to the total population of birthing people or of White mothers. There were small fluctuations in estimates across the years. The overall negative state-level racial sentiment was associated with adverse birth outcomes for most years. Negative sentiment referencing Black Americans was consistently associated with an elevated risk of LBW and PTB among Black mothers. Negative sentiment in tweets referencing Latinx and Asian individuals was associated with adverse birth outcomes among Latinx and Asian mothers for selected years.
Sensitivity analyses excluded tweets from users who tweeted more than 1000 per year. The estimates changed slightly, but all main conclusions remain the same (
, Tables S7 and S8). Sensitivity analyses examining the associations between county-level racial sentiment and birth outcomes ( , Tables S9-14) found attenuated associations compared to state-level analyses. The restricted natality files only included state- or county-geographic identifiers, so more granular analyses were not possible with that data. Compared to state-level analyses, small increases in risks of PTB and LBW among minoritized women were observed when comparing mothers living in the highest versus lowest tertile counties (see combined county-level in , Tables S6-7).Characteristic | Valuea | |
Age (years), mean (SD) | 28.50 (5.85) | |
Married, n (%) | 21,068,841 (59.74) | |
White, n (%) | 19,236,693 (54.55) | |
Black, n (%) | 5,152,595 (14.61) | |
Asian, n (%) | 2,116,785 (6.00) | |
Latinx, n (%) | 7,932,703 (22.49) | |
US born, n (%) | 27,630,726 (78.35) | |
Education, n (%) | ||
Less than high school | 5,018,290 (14.23) | |
High school | 8,999,298 (25.61) | |
Some college | 10,161,059 (28.81) | |
College | 7,014,807 (19.89) | |
Master’s or doctorate | 4,073,726 (11.55) | |
First birth, n (%) | 11,318,052 (32.09) | |
Prenatal care during first trimester, n (%) | 26,832,305 (76.09) | |
Birth outcomes, n (%) | ||
Low birth weight | 2,215,586 (6.28) | |
Preterm birth | 2,770,462 (7.86) |
aData source: US natality files from 2011 to 2021.
Data | Total, incidence rate ratio (95% CI) | Minoritized groups, incidence rate ratio (95% CI) | White mothers, incidence rate ratio (95% CI) | |
Low birth weight | ||||
Second tertile vs first (lowest) | 1.07 (1.02-1.13) | 1.08 (1.02-1.15) | 1.07 (1.01-1.13) | |
Third tertile vs first (lowest) | 1.08 (1.03-1.13) | 1.09 (1.03-1.15) | 1.08 (1.03-1.14) | |
Sample, n | 35,267,177 | 17,190,739 | 19,236,695 | |
Preterm | ||||
Second tertile vs first (lowest) | 1.06 (1.01-1.11) | 1.07 (1.02-1.12) | 1.05 (1.0-1.11) | |
Third tertile vs first (lowest) | 1.05 (1.00-1.11) | 1.06 (1.01-1.11) | 1.06 (0.99-1.13) | |
Sample, n | 35,288,014 | 17,200,907 | 19,247,910 |
aData source: US natality files and Twitter data from 2011 to 2021.
Data | Negative Black sentiment, birth outcomes among Black mothers, incidence rate ratio (95% CI) | Negative Asian sentiment, birth outcomes among Asian mothers, incidence rate ratio (95% CI) | Negative Latinx sentiment, birth outcomes among Black mothers, incidence rate ratio (95% CI) | Negative White sentiment, birth outcomes among White mothers, incidence rate ratio (95% CI) | |
Low birth weight | |||||
Second tertile vs first (lowest) | 1.08 (1.03-1.13) | 0.99 (0.96-1.03) | 1.00 (0.97-1.02) | 1.00 (0.97-1.04) | |
Third tertile vs first (lowest) | 1.06 (1.01-1.11) | 1.01 (0.96-1.05) | 1.06 (1.01-1.11) | 1.00 (0.97-1.04) | |
Sample, n | 5,152,595 | 2,116,785 | 7,932,703 | 19,236,695 | |
Preterm | |||||
Second tertile vs first (lowest) | 1.08 (1.03-1.14) | 1.01 (0.97-1.05) | 0.99 (0.98-1.0) | 1.01 (0.98-1.04) | |
third tertile vs first (lowest) | 1.07 (1.02-1.13) | 1.03 (0.99-1.07) | 1.03 (1.00-1.06) | 0.99 (0.96-1.03) | |
Sample, n | 5,157,600 | 2,117,808 | 7,935,836 | 19,247,910 |
aData source: US natality files and Twitter data from 2011 to 2021.
Discussion
This study used a combination of social media data and comprehensive US birth records to assess how spatial patterns in attitudes toward different racial groups on Twitter related to birth outcomes. We calculated negative sentiment toward different racial groups at the state level and found that negative attitudes toward minoritized groups were associated with increased risk for low birth-weight babies and PTB for the total population, and this remained generally consistent across each of the years we analyzed. Furthermore, we found that negative sentiment toward minoritized groups was comparably associated with birth outcomes for each racial group, including White women when stratifying outcomes by race. In contrast, negative sentiment toward White people was not associated with adverse birth outcomes for White mothers. In fully stratified models wherein exposures and outcomes were stratified by race, negative Black sentiment was associated with adverse birth outcomes for Black mothers. Results were less consistent when examining associations between Latinx and Asian sentiment and birth outcomes for Latinx and Asian mothers, respectively, and null for White sentiment on White mothers’ birth outcomes. Together, our findings suggest that area-level sentiment toward minoritized groups is associated with negative birth outcomes for the population as a whole. Furthermore, the greatest and most consistent disparities were observed for Black mothers.
This study extends our understanding of racial health disparities. Racism is an organized system of racial hierarchy that structures risks, opportunities, and resources within a society [
]. Expressions of negativity toward minoritized groups demonstrating an ideology of racial inferiority in the values, language, imagery, symbols, and unstated assumptions of the larger society are particularly salient, working to uphold other forms of racism (eg, structural, institutional, personally mediated, and internalized). Our work suggests that negative sentiment of racial or ethnic minoritized groups explains birth outcomes among White mothers also. This aligns with research showing that policies attempting to limit the rights of a minority group have negative ramifications for the majority group also. For instance, McGhee [ ] outlines how the closing of public pools across America in the 1950s—policy decisions made to deny Black people access—resulted in the elimination of this public amenity for all people, including White people. Structurally racist policies disproportionately impact people of color but also have negative consequences for all, including health outcomes for White people [ - ]. For example, racial animus toward racially minoritized groups has led to opposition to social policies and programs, such as the expansion of the Affordable Care Act [ ] a policy that would also benefit a large proportion of White people. Previous research on structural racism shows that racial segregation is associated with adverse birth outcomes among racial minorities and White people [ - ], exemplifying Camara P Jones’ (former President of the American Public Health Association) [ ] statement that “racism saps the strength of the whole society.”Our findings add to the growing body of work that examines racism and health disparities for multiple racial or ethnic groups. Although the majority of the research has examined pregnancy-related health disparities for Black people, emerging research is beginning to tease apart commonalities and differences experienced by different racial groups.
Our study also advances research aiming to understand how the social and cultural environment can shape health. Social media represent a fertile space for stigmatizing language, stereotype representations, slurs, and hostile speech. As more social interactions take place on internet, it is imperative to track and monitor this space and its potential impact on health and well-being, particularly for minoritized and stigmatized racial groups. Over the last decade, we have seen a trend of increasing negative racial sentiment referencing racial and ethnic minorities. We have previously found that racialized events have led to shifts in expressed racial sentiment. For example, the killing of George Floyd was followed by a temporary decline in negative Black sentiment in 2020 [
], and negative Asian sentiment spiked during March 2020 with the emergence of the COVID-19 pandemic [ ]. These events, including the resurgence of the Black Lives Matter movement in 2020, changed the sentiment and volume of discussions related to race and racism on the web [ ]. These events were commonly characterized by a rapid change in racial sentiment, lasting a few weeks, followed by a return to near baseline levels [ - ]. In 2020, we saw associations between negative sentiment and adverse birth outcomes remaining comparable to other years.The current findings also extend our understanding of the geography of racial health disparities in the United States. Our analysis included area-level racial sentiment measured at the state level. States vary in social norms, laws, and policies. However, cultural racism can also vary at lower geographic levels. There may be a bidirectional influence of tweets and state-level policies such that policies may impact tweets and tweets may in turn influence the support of new policies. We conducted analyses at the county level but considered these sensitivity analyses because approximately 40% of the Twitter data had missing county location information. Furthermore, Twitter data for less populated counties were relatively sparse, which may lead to biased estimates of racial sentiment. The geographic identifiers in the US natality files only include state and county identifiers. A valuable future direction would be to examine associations at a more granular resolution for studies looking at cultural racism and health in other data.
A major strength of this study comes from the use of social media–derived racial sentiment. Traditional approaches to measuring racial attitudes, such as survey measures or experimental approaches, are often time-consuming and expensive to conduct. These approaches provide a time-limited snapshot of sentiment, unlike the temporally rich data that can be obtained from social media posts. This temporally and geographically rich data thus makes it possible to look at changes in health disparities as a function of local health policies as well as race-related events, such as Black Lives Matter protests [
, ].This study is not without limitations. For this paper, we used our trained SVM model for sentiment analysis, as we have used this model in previous publications [
, ], including our paper examining racial sentiment and birth outcomes for 2015-2017 [ ]. Changing the sentiment analysis model may change the findings. We have found a high level of accuracy and a high F1 score with this model. To be most comparable to our other papers, we used this sentiment model here. The keyword list is not exhaustive, but we attempted to be as comprehensive as possible by using words used by the US Census, a racial slurs database, and previous studies, while also balancing search constraints that limited the number of characters that can be used for searches to 1024 characters. Data collected is what people were willing to express on Twitter, where negative content tends to spread faster [ ] and, therefore, may be overestimated. Furthermore, we analyzed tweets referencing different racial groups for sentiment or emotional tone, which is different from previous work that has more directly focused on hate speech and racial slurs, for example [ - ]. However, we believe this approach to be sensitive to subtle expressions of racial attitudes that are more representative of the area-level culture and have previously found that such measures are associated with explicit and implicit forms of racial bias [ ]. Other limitations include our exclusive focus on English-language tweets. Future research can examine non-English language tweets referencing minoritized populations to evaluate how the racial sentiment of English and non-English tweets may differ. Approximately 1 in 5 US adults use Twitter [ ]. Racial or ethnic minoritized populations are slightly overrepresented on Twitter compared to the US general population. For example, 17% of adult Twitter users are Latinx compared to 15% of the US adult population [ ]. Twitter users skew younger than the general US population, with 42% of Twitter users being between the ages of 18 and 29 years [ ], whereas those aged 19-34 years account for only 20.8% of the US population [ ]. However, Twitter users have become more similar to the US population in terms of education over time. In 2016, Twitter users tended to be more educated than the US population [ ]. In 2021, 33% of adult Twitter users have a college or graduate degree [ ] compared to 38% of US adults 25 years and older with a college or graduate degree [ ]. Visitors tweeting from a location may express different sentiments compared to residents of places. Twitter users differ in their frequency of tweeting with the majority of Twitter users being less frequent users. However, in our data on race-related tweets, tweets from users tweeting more than 1000 times per year represent less than 1% (n=1066) of the data.Despite these limitations, the Twitter data provide a broad signal for the social context and the online environment people may be exposed to. The focus is not on individual users at one particular time point or place but rather on the patterns of associations with birth outcomes in the United States by using a continuous random sample of US tweets from 2011 to 2021.
The state-level aggregation of racial sentiment also makes it challenging to tease apart the contribution of state politics (structural) and sentiment (cultural) to birth outcomes. For instance, Medicaid is known to mitigate racial health disparities [
], but states vary in the degree to which they expanded Medicaid benefits to their constituents [ ]. Sentiment toward minority race groups is an important, yet hard-to-measure factor that may contribute to the instantiation of policies intended to impact these groups [ ]. Given that cultural and structural racism are mutually reinforcing factors, areas that are more welcoming to racial or ethnic minorities may have greater safety net programs for example. Future work can build on the current findings to further examine the mutually reinforcing nature of state-level sentiment and policies contributing to racial health disparities.Policies that support racial literacy and cultural competency training as well as policies and supports that promote racial equity more broadly (housing, education, criminal justice, media, and city planning) and establish an inclusive social environment accepting of all races and cultures may improve population health, decreasing the overall risk of adverse birth outcomes and reducing racial-birth outcome disparities. Additionally, our results suggest that policies and practices that ostracize, stigmatize, or otherwise isolate certain racial or ethnic groups and increase hostility directed toward them may exacerbate adverse birth outcomes, which would perpetuate the impact of racism or discrimination on future generations.
Conclusions
This study represents advancement in our ability to assess racial sentiment at a more fine-grained level as a means to provide more detailed explanations for racial disparities in birth outcomes. Social media provide a unique opportunity to examine racial sentiment across time, geographic location, and for different racial groups. By examining the relationship between sentiment and birth outcomes across racial groups, we are better able to assess the specificity and generalizability of cultural racism in the outcomes for different groups. Future work can build on our findings, and use this work to inform policy aimed at reducing racial disparities in health.
Acknowledgments
Research reported in this publication was supported by the National Institute on Minority Health and Health Disparities (R00MD012615 (TTN), R01MD015716 (TTN), R01MD016037 (QCN)), the National Library of Medicine (R01LM012849 (QCN)). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Data Availability
Twitter data were collected using Twitter’s Application Programming Interface for Academic Research. More information on applying for access can be found at [
].The paper also uses restricted US natality files with geographic identifiers. The files were obtained after submitting a research proposal to and obtaining approval for data access from the National Center for Health Statistics. More information on applying can be found at [
]. We have added example code to run the analyses in .Conflicts of Interest
None declared.
Supplementary figures, tables, and code.
DOCX File , 3769 KBReferences
- Centers for Disease Control and Prevention (CDC). Infant mortality and low birth weight among black and white infants–United States, 1980-2000. MMWR Morb Mortal Wkly Rep 2002;51(27):589-592 [FREE Full text] [Medline]
- US Department of Health and Human Services. Healthy People 2010: Understanding and Improving Health. Washington, DC: US Department of Health and Human Services; 2000.
- Kim D, Saada A. The social determinants of infant mortality and birth outcomes in Western developed nations: a cross-country systematic review. Int J Environ Res Public Health 2013;10(6):2296-2335 [FREE Full text] [CrossRef] [Medline]
- Hille ET, den Ouden AL, Bauer L, van den Oudenrijn C, Brand R, Verloove-Vanhorick SP. School performance at nine years of age in very premature and very low birth weight infants: perinatal risk factors and predictors at five years of age. Collaborative project on preterm and small for gestational age (POPS) infants in The Netherlands. J Pediatr 1994;125(3):426-434. [CrossRef] [Medline]
- Saigal S, Doyle LW. An overview of mortality and sequelae of preterm birth from infancy to adulthood. Lancet 2008;371(9608):261-269. [CrossRef] [Medline]
- Couzin J. Quirks of fetal environment felt decades later. Science 2002;296(5576):2167-2169. [CrossRef] [Medline]
- Artiga S, Pham O, Orgera K, Ranji U. Racial disparities in maternal and infant health: an overview - issue brief. KFF. 2020. URL: https://www.kff.org/report-section/racial-disparities-in-maternal-and-infant-health-an-overview-issue-brief/ [accessed 2022-09-16]
- Leonard SA, Main EK, Scott KA, Profit J, Carmichael SL. Racial and ethnic disparities in severe maternal morbidity prevalence and trends. Ann Epidemiol 2019;33:30-36 [FREE Full text] [CrossRef] [Medline]
- Almeida J, Bécares L, Erbetta K, Bettegowda VR, Ahluwalia IB. Racial/ethnic inequities in low birth weight and preterm birth: the role of multiple forms of stress. Matern Child Health J 2018;22(8):1154-1163. [CrossRef] [Medline]
- Alhusen JL, Bower KM, Epstein E, Sharps P. Racial discrimination and adverse birth outcomes: an integrative review. J Midwifery Womens Health 2016;61(6):707-720 [FREE Full text] [CrossRef] [Medline]
- Dominguez TP, Dunkel-Schetter C, Glynn LM, Hobel C, Sandman CA. Racial differences in birth outcomes: the role of general, pregnancy, and racism stress. Health Psychol 2008;27(2):194-203 [FREE Full text] [CrossRef] [Medline]
- Jones CP. Levels of racism: a theoretic framework and a gardener's tale. Am J Public Health 2000;90(8):1212-1215. [CrossRef] [Medline]
- David EJR, Schroeder TM, Fernandez J. Internalized racism: a systematic review of the psychological literature on racism's most insidious consequence. J Soc Issues 2019;75(4):1057-1086. [CrossRef]
- Williams O, Ovbiagele B. Stroking out while black-the complex role of racism. JAMA Neurol 2020;77(11):1343-1344. [CrossRef] [Medline]
- Elias A, Paradies Y. The costs of institutional racism and its ethical implications for healthcare. J Bioeth Inq 2021 Mar;18(1):45-58 [FREE Full text] [CrossRef] [Medline]
- Bailey ZD, Krieger N, Agénor M, Graves J, Linos N, Bassett MT. Structural racism and health inequities in the USA: evidence and interventions. Lancet 2017;389(10077):1453-1463. [CrossRef] [Medline]
- Williams DR, Lawrence JA, Davis BA. Racism and health: evidence and needed research. Annu Rev Public Health 2019;40:105-125 [FREE Full text] [CrossRef] [Medline]
- Paradies Y. A systematic review of empirical research on self-reported racism and health. Int J Epidemiol 2006;35(4):888-901. [CrossRef] [Medline]
- Giscombé CL, Lobel M. Explaining disproportionately high rates of adverse birth outcomes among African Americans: the impact of stress, racism, and related factors in pregnancy. Psychol Bull 2005;131(5):662-683 [FREE Full text] [CrossRef] [Medline]
- Wadhwa PD, Culhane JF, Rauh V, Barve SS. Stress and preterm birth: neuroendocrine, immune/inflammatory, and vascular mechanisms. Matern Child Health J 2001;5(2):119-125. [CrossRef] [Medline]
- Nepomnaschy PA, Sheiner E, Mastorakos G, Arck PC. Stress, immune function, and women's reproduction. Ann N Y Acad Sci 2007;1113:350-364. [CrossRef] [Medline]
- Swope CB, Hernández D, Cushing LJ. The relationship of historical redlining with present-day neighborhood environmental and health outcomes: a scoping review and conceptual model. J Urban Health 2022;99(6):959-983 [FREE Full text] [CrossRef] [Medline]
- Rodat S. Cultural racism: a conceptual framework. Revista de Științe Politice/ Revue des Sciences Politiques 2017(54):129-140 [FREE Full text]
- Hicken MT, Miles L, Haile S, Esposito M. Linking history to contemporary state-sanctioned slow violence through cultural and structural racism. Ann Am Acad Pol Soc Sci 2021;694(1):48-58 [FREE Full text] [CrossRef] [Medline]
- Axt JR. The best way to measure explicit racial attitudes is to ask about them. Soc Psychol Personal Sci 2017;9(8):896-906. [CrossRef]
- McConnell AR, Leibold JM. Relations among the implicit association test, discriminatory behavior, and explicit measures of racial attitudes. J Exp Soc Psychol 2001;37(5):435-442. [CrossRef]
- Michaels EK, Board C, Mujahid MS, Riddell CA, Chae DH, Johnson RC, et al. Area-level racial prejudice and health: a systematic review. Health Psychol 2022;41(3):211-224. [CrossRef] [Medline]
- Amodio DM, Harmon-Jones E, Devine PG. Individual differences in the activation and control of affective race bias as assessed by startle eyeblink response and self-report. J Pers Soc Psychol 2003;84(4):738-753. [CrossRef] [Medline]
- Quillian L. New approaches to understanding racial prejudice and discrimination. Annu Rev Sociol 2006;32(1):299-328. [CrossRef]
- Rosenman R, Tennekoon V, Hill LG. Measuring bias in self-reported data. Int J Behav Healthc Res 2011;2(4):320-332 [FREE Full text] [CrossRef] [Medline]
- Odabas M. 10 facts about Americans and Twitter. Pew Research Center. 2022. URL: https://www.pewresearch.org/fact-tank/2022/05/05/10-facts-about-americans-and-twitter/ [accessed 2022-11-26]
- Albarracin D, Shavitt S. Attitudes and attitude change. Annu Rev Psychol 2017;69(1):1-29. [CrossRef]
- Radwan M. Effect of social media usage on the cultural identity of rural people: a case study of Bamha village, Egypt. Humanit Soc Sci Commun 2022;9(1):258. [CrossRef]
- Nguyen TT, Adams N, Huang D, Glymour MM, Allen AM, Nguyen QC. The association between state-level racial attitudes assessed from Twitter data and adverse birth outcomes: observational study. JMIR Public Health Surveill 2020;6(3):e17103 [FREE Full text] [CrossRef] [Medline]
- Bartlett J, Reffin J, Rumball N, Williamson S. Anti-social media. DEMOS. 2014. URL: http://www.demos.co.uk/files/DEMOS_Anti-social_Media.pdf [accessed 2023-02-03]
- The Racial Slur Database. URL: http://www.rsdb.org/ [accessed 2023-02-03]
- Go A, Bhayani R, Huang L. Twitter sentiment classification using distant supervision. Sentiment140. 2009. URL: http://help.sentiment140.com/home [accessed 2022-11-28]
- UMICH SI650 - sentiment classification. Kaggle. 2018. URL: https://kaggle.com/competitions/si650winter11 [accessed 2022-11-26]
- Sanders N. Twitter sentiment corpus. Sanders Analytics. 2011. URL: https://github.com/zfz/twitter_corpus [accessed 2022-11-26]
- Hunter JD. Matplotlib: a 2D graphics environment. Comput Sci Eng 2007;9(3):90-95. [CrossRef]
- Tweet metadata timeline. Twitter Developer Platform. URL: https://developer.twitter.com/en/docs/twitter-api/enterprise/data-dictionary/tweet-timeline [accessed 2023-03-13]
- Dolan SM, Gross SJ, Merkatz IR, Faber V, Sullivan LM, Malone FD, et al. The contribution of birth defects to preterm birth and low birth weight. Obstet Gynecol 2007;110(2 Pt 1):318-324. [CrossRef] [Medline]
- Luke B, Brown MB. The changing risk of infant mortality by gestation, plurality, and race: 1989-1991 versus 1999-2001. Pediatrics 2006;118(6):2488-2497 [FREE Full text] [CrossRef] [Medline]
- DeVellis RF, Thorpe CT. Scale Development : Theory and Applications. Thousand Oaks, CA: SAGE Publications; 2021.
- Elmståhl S, Gullberg B. Bias in diet assessment methods--consequences of collinearity and measurement errors on power and observed relative risks. Int J Epidemiol 1997;26(5):1071-1079. [CrossRef] [Medline]
- Nguyen QC, Li D, Meng HW, Kath S, Nsoesie E, Li F, et al. Building a national neighborhood dataset from geotagged Twitter data for indicators of happiness, diet, and physical activity. JMIR Public Health Surveill 2016;2(2):e158 [FREE Full text] [CrossRef] [Medline]
- American community survey (ACS). US Census Bureau. URL: https://www.census.gov/programs-surveys/acs [accessed 2022-11-26]
- StataCorp LLC. Stata. URL: https://www.stata.com/company/ [accessed 2022-11-26]
- McGhee H. The Sum of Us : What Racism Costs Everyone and How We Can Prosper Together. New York: One World; 2022.
- Akala A. Cost Of racism: U.S. economy lost $16 trillion because of discrimination, bank says. NPR. 2020. URL: https://tinyurl.com/2s3th4ye [accessed 2022-11-03]
- Boyle P. Racial resentment hurts White people too, physician tells colleagues. AAMC. 2022. URL: https://www.aamc.org/news-insights/racial-resentment-hurts-white-people-too-physician-tells-colleagues [accessed 2022-11-26]
- Tesler M. The spillover of racialization into health care: how President Obama polarized public opinion by racial attitudes and race. Am J Pol Sci 2012;56(3):690-704. [CrossRef]
- Debbink MP, Bader MDM. Racial residential segregation and low birth weight in Michigan's metropolitan areas. Am J Public Health 2011;101(9):1714-1720. [CrossRef] [Medline]
- Williams AD, Wallace M, Nobles C, Mendola P. Racial residential segregation and racial disparities in stillbirth in the United States. Health Place 2018;51:208-216 [FREE Full text] [CrossRef] [Medline]
- no authors listed. The relationship of fetal and infant mortality to residential segregation: an inquiry into social epidemiology. 1949. Am J Public Health 2015;105(2):278-281. [CrossRef] [Medline]
- Jones CP. Toward the science and practice of anti-racism: launching a national campaign against racism. Ethn Dis 2018;28(suppl 1):231-234 [FREE Full text] [CrossRef] [Medline]
- Nguyen TT, Criss S, Michaels EK, Cross RI, Michaels JS, Dwivedi P, et al. Progress and push-back: how the killings of Ahmaud Arbery, Breonna Taylor, and George Floyd impacted public discourse on race and racism on Twitter. SSM Popul Health 2021;15:100922 [FREE Full text] [CrossRef] [Medline]
- Nguyen TT, Criss S, Dwivedi P, Huang D, Keralis J, Hsu E, et al. Exploring U.S. shifts in anti-asian sentiment with the emergence of COVID-19. Int J Environ Res Public Health 2020;17(19):7032 [FREE Full text] [CrossRef] [Medline]
- Criss S, Nguyen TT, Michaels EK, Gee GC, Kiang MV, Nguyen QC, et al. Solidarity and strife after the Atlanta spa shootings: a mixed methods study characterizing Twitter discussions by qualitative analysis and machine learning. Front Public Health 2023;11:952069 [FREE Full text] [CrossRef] [Medline]
- Bellovary AK, Young NA, Goldenberg A. Left- and right-leaning news organizations use negative emotional content and elicit user engagement similarly. Affect Sci 2021;2(4):391-396 [FREE Full text] [CrossRef] [Medline]
- Chae DH, Clouston S, Martz CD, Hatzenbuehler ML, Cooper HL, Turpin R, et al. Area racism and birth outcomes among Blacks in the United States. Soc Sci Med 2018;199:49-55 [FREE Full text] [CrossRef] [Medline]
- Chae DH, Clouston S, Hatzenbuehler ML, Kramer MR, Cooper HLF, Wilson SM, et al. Association between an internet-based measure of area racism and Black mortality. PLoS One 2015;10(4):e0122963 [FREE Full text] [CrossRef] [Medline]
- Arcila-Calderón C, Amores JJ, Sánchez-Holgado P, Vrysis L, Vryzas N, Oller Alonso M. How to detect online hate towards migrants and refugees? Developing and evaluating a classifier of racist and xenophobic hate speech using shallow and deep learning. Sustainability 2022;14(20):13094. [CrossRef]
- Nguyen TT, Huang D, Michaels EK, Glymour MM, Allen AM, Nguyen QC. Evaluating associations between area-level Twitter-expressed negative racial sentiment, hate crimes, and residents' racial prejudice in the United States. SSM Popul Health 2021;13:100750 [FREE Full text] [CrossRef] [Medline]
- Wojcik S, Hughes A. Sizing up Twitter users. Pew Research Center. URL: https://www.pewresearch.org/internet/2019/04/24/sizing-up-twitter-users/ [accessed 2023-03-13]
- Auxier B, Anderson M. Social Media Use in 2021. Pew Research Center. 2021. URL: https://www.pewresearch.org/internet/2021/04/07/social-media-use-in-2021/ [accessed 2022-11-28]
- Population distribution by age. KFF. 2022. URL: https://www.kff.org/other/state-indicator/distribution-by-age/ [accessed 2022-11-28]
- Greenwood S, Perrin A, Duggan M. Social media update 2016. Pew Research Center. 2016. URL: https://www.pewresearch.org/internet/2016/11/11/social-media-update-2016/ [accessed 2022-11-28]
- Annual social and economic supplements. US Census Bureau. 2021. URL: https://www.census.gov/data/datasets/time-series/demo/cps/cps-asec.html [accessed 2022-11-28]
- Guth M, Artiga S. Medicaid and racial health equity. KFF. 2022. URL: https://www.kff.org/medicaid/issue-brief/medicaid-and-racial-health-equity/ [accessed 2022-11-26]
- Cross-Call J. Medicaid expansion has helped narrow racial disparities in health coverage and access to care. Center on Budget and Policy Priorities. 2020. URL: https://www.cbpp.org/research/health/medicaid-expansion-has-helped-narrow-racial-disparities-in-health-coverage-and [accessed 2022-11-26]
- Preparing for the application. Twitter Developer Platform. URL: https://developer.twitter.com/en/products/twitter-api/academic-research/application-info [accessed 2023-04-20]
- Restricted-use vital statistics data. Centers for Disease Control and Prevention. URL: https://www.cdc.gov/nchs/nvss/nvss-restricted-data.htm [accessed 2023-04-20]
Abbreviations
FIPS: Federal Information Processing Standards |
LBW: low birth weight |
PTB: preterm birth |
SVM: support vector machine |
Edited by A Mavragani; submitted 12.12.22; peer-reviewed by R Gore, K Marchi; comments to author 08.03.23; revised version received 22.03.23; accepted 28.03.23; published 28.04.23
Copyright©Thu T Nguyen, Junaid S Merchant, Shaniece Criss, Katrina Makres, Krishik N Gowda, Heran Mane, Xiaohe Yue, Yulin Hswen, M Maria Glymour, Quynh C Nguyen, Amani M Allen. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 28.04.2023.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.