TY - JOUR AU - Han, Nuo AU - Li, Sijia AU - Huang, Feng AU - Wen, Yeye AU - Wang, Xiaoyang AU - Liu, Xiaoqian AU - Li, Linyan AU - Zhu, Tingshao PY - 2023 DA - 2023/1/31 TI - Sensing Psychological Well-being Using Social Media Language: Prediction Model Development Study JO - J Med Internet Res SP - e41823 VL - 25 KW - mental health KW - psychological well-being KW - social media KW - machine learning KW - domain knowledge KW - mental well being KW - mental wellbeing KW - linguistic KW - predict KW - model KW - ground truth KW - lexicon AB - Background: Positive mental health is arguably increasingly important and can be revealed, to some extent, in terms of psychological well-being (PWB). However, PWB is difficult to assess in real time on a large scale. The popularity and proliferation of social media make it possible to sense and monitor online users’ PWB in a nonintrusive way, and the objective of this study is to test the effectiveness of using social media language expression as a predictor of PWB. Objective: This study aims to investigate the predictive power of social media corresponding to ground truth well-being data in a psychological way. Methods: We recruited 1427 participants. Their well-being was evaluated using 6 dimensions of PWB. Their posts on social media were collected, and 6 psychological lexicons were used to extract linguistic features. A multiobjective prediction model was then built with the extracted linguistic features as input and PWB as the output. Further, the validity of the prediction model was confirmed by evaluating the model's discriminant validity, convergent validity, and criterion validity. The reliability of the model was also confirmed by evaluating the split-half reliability. Results: The correlation coefficients between the predicted PWB scores of social media users and the actual scores obtained using the linguistic prediction model of this study were between 0.49 and 0.54 (P<.001), which means that the model had good criterion validity. In terms of the model’s structural validity, it exhibited excellent convergent validity but less than satisfactory discriminant validity. The results also suggested that our model had good split-half reliability levels for every dimension (ranging from 0.65 to 0.85; P<.001). Conclusions: By confirming the availability and stability of the linguistic prediction model, this study verified the predictability of social media corresponding to ground truth well-being data from the perspective of PWB. Our study has positive implications for the use of social media to predict mental health in nonprofessional settings such as self-testing or a large-scale user study. SN - 1438-8871 UR - https://www.jmir.org/2023/1/e41823 UR - https://doi.org/10.2196/41823 UR - http://www.ncbi.nlm.nih.gov/pubmed/36719723 DO - 10.2196/41823 ID - info:doi/10.2196/41823 ER -