TY - JOUR AU - Birnbaum, Michael L AU - Ernala, Sindhu Kiranmai AU - Rizvi, Asra F AU - De Choudhury, Munmun AU - Kane, John M PY - 2017 DA - 2017/08/14 TI - A Collaborative Approach to Identifying Social Media Markers of Schizophrenia by Employing Machine Learning and Clinical Appraisals JO - J Med Internet Res SP - e289 VL - 19 IS - 8 KW - schizophrenia KW - psychotic disorders KW - online social networks KW - machine learning KW - linguistic analysis KW - Twitter AB - Background: Linguistic analysis of publicly available Twitter feeds have achieved success in differentiating individuals who self-disclose online as having schizophrenia from healthy controls. To date, limited efforts have included expert input to evaluate the authenticity of diagnostic self-disclosures. Objective: This study aims to move from noisy self-reports of schizophrenia on social media to more accurate identification of diagnoses by exploring a human-machine partnered approach, wherein computational linguistic analysis of shared content is combined with clinical appraisals. Methods: Twitter timeline data, extracted from 671 users with self-disclosed diagnoses of schizophrenia, was appraised for authenticity by expert clinicians. Data from disclosures deemed true were used to build a classifier aiming to distinguish users with schizophrenia from healthy controls. Results from the classifier were compared to expert appraisals on new, unseen Twitter users. Results: Significant linguistic differences were identified in the schizophrenia group including greater use of interpersonal pronouns (P<.001), decreased emphasis on friendship (P<.001), and greater emphasis on biological processes (P<.001). The resulting classifier distinguished users with disclosures of schizophrenia deemed genuine from control users with a mean accuracy of 88% using linguistic data alone. Compared to clinicians on new, unseen users, the classifier’s precision, recall, and accuracy measures were 0.27, 0.77, and 0.59, respectively. Conclusions: These data reinforce the need for ongoing collaborations integrating expertise from multiple fields to strengthen our ability to accurately identify and effectively engage individuals with mental illness online. These collaborations are crucial to overcome some of mental illnesses’ biggest challenges by using digital technology. SN - 1438-8871 UR - http://www.jmir.org/2017/8/e289/ UR - https://doi.org/10.2196/jmir.7956 UR - http://www.ncbi.nlm.nih.gov/pubmed/28807891 DO - 10.2196/jmir.7956 ID - info:doi/10.2196/jmir.7956 ER -