TY - JOUR AU - Xu, Chenjie AU - Cao, Zhi AU - Yang, Hongxi AU - Gao, Ying AU - Sun, Li AU - Hou, Yabing AU - Cao, Xinxi AU - Jia, Peng AU - Wang, Yaogang PY - 2020 DA - 2020/11/12 TI - Leveraging Internet Search Data to Improve the Prediction and Prevention of Noncommunicable Diseases: Retrospective Observational Study JO - J Med Internet Res SP - e18998 VL - 22 IS - 11 KW - noncommunicable diseases KW - internet searches KW - Google Trends KW - infodemiology KW - infoveillance KW - early warning model KW - United States AB - Background: As human society enters an era of vast and easily accessible social media, a growing number of people are exploiting the internet to search and exchange medical information. Because internet search data could reflect population interest in particular health topics, they provide a new way of understanding health concerns regarding noncommunicable diseases (NCDs) and the role they play in their prevention. Objective: We aimed to explore the association of internet search data for NCDs with published disease incidence and mortality rates in the United States and to grasp the health concerns toward NCDs. Methods: We tracked NCDs by examining the correlations among the incidence rates, mortality rates, and internet searches in the United States from 2004 to 2017, and we established forecast models based on the relationship between the disease rates and internet searches. Results: Incidence and mortality rates of 29 diseases in the United States were statistically significantly correlated with the relative search volumes (RSVs) of their search terms (P<.05). From the perspective of the goodness of fit of the multiple regression prediction models, the results were closest to 1 for diabetes mellitus, stroke, atrial fibrillation and flutter, Hodgkin lymphoma, and testicular cancer; the coefficients of determination of their linear regression models for predicting incidence were 80%, 88%, 96%, 80%, and 78%, respectively. Meanwhile, the coefficient of determination of their linear regression models for predicting mortality was 82%, 62%, 94%, 78%, and 62%, respectively. Conclusions: An advanced understanding of search behaviors could augment traditional epidemiologic surveillance and could be used as a reference to aid in disease prediction and prevention. SN - 1438-8871 UR - http://www.jmir.org/2020/11/e18998/ UR - https://doi.org/10.2196/18998 UR - http://www.ncbi.nlm.nih.gov/pubmed/33180022 DO - 10.2196/18998 ID - info:doi/10.2196/18998 ER -