Machine Learning Approaches for Predicting Psoriatic Arthritis Risk Using Electronic Medical Records: Population-Based Study

doi:10.2196/39972

Original Paper

¹Graduate Institute of Biomedical Informatics, Taipei Medical University, Taipei, Taiwan

²Graduate Institute of Clinical Medicine, Taipei Medical University, Taipei, Taiwan

³Department of Dermatology, Taipei Medical University Hospital, Taipei Medical University, Taipei, Taiwan

⁴Department of Dermatology, School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan

⁵Graduate Institute of Data Science, Taipei Medical University, Taipei, Taiwan

⁶Department of Healthcare Information and Management, Ming Chuan University, Taoyuan, Taiwan

⁷Department of Dermatology, Taipei Municipal Wanfang Hospital, Taipei Medical University, Taipei, Taiwan

Corresponding Author:

Yu-Chuan Jack Li, MD, PhD

Department of Dermatology

Taipei Municipal Wanfang Hospital

Taipei Medical University

No 111, Section 3

Hsing-Long Rd

Taipei, 116

Taiwan

Phone: 886 02 2930 7930

Email: jack@tmu.edu.tw

Background: Psoriasis (PsO) is a chronic, systemic, immune-mediated disease with multiorgan involvement. Psoriatic arthritis (PsA) is an inflammatory arthritis that is present in 6%-42% of patients with PsO. Approximately 15% of patients with PsO have undiagnosed PsA. Predicting patients with a risk of PsA is crucial for providing them with early examination and treatment that can prevent irreversible disease progression and function loss.

Objective: The aim of this study was to develop and validate a prediction model for PsA based on chronological large-scale and multidimensional electronic medical records using a machine learning algorithm.

Methods: This case-control study used Taiwan’s National Health Insurance Research Database from January 1, 1999, to December 31, 2013. The original data set was split into training and holdout data sets in an 80:20 ratio. A convolutional neural network was used to develop a prediction model. This model used 2.5-year diagnostic and medical records (inpatient and outpatient) with temporal-sequential information to predict the risk of PsA for a given patient within the next 6 months. The model was developed and cross-validated using the training data and was tested using the holdout data. An occlusion sensitivity analysis was performed to identify the important features of the model.

Results: The prediction model included a total of 443 patients with PsA with earlier diagnosis of PsO and 1772 patients with PsO without PsA for the control group. The 6-month PsA risk prediction model that uses sequential diagnostic and drug prescription information as a temporal phenomic map yielded an area under the receiver operating characteristic curve of 0.70 (95% CI 0.559-0.833), a mean sensitivity of 0.80 (SD 0.11), a mean specificity of 0.60 (SD 0.04), and a mean negative predictive value of 0.93 (SD 0.04).

Conclusions: The findings of this study suggest that the risk prediction model can identify patients with PsO at a high risk of PsA. This model may help health care professionals to prioritize treatment for target high-risk populations and prevent irreversible disease progression and functional loss.

J Med Internet Res 2023;25:e39972

doi:10.2196/39972

Keywords

convolutional neural network (43); deep learning, machine learning; prediction model (105); psoriasis (36); psoriatic arthritis (7); temporal phenomic map (1); electronic medical records (112)

Psoriasis (PsO) and psoriatic arthritis (PsA) are multiorgan inflammatory diseases with a similar pathophysiology. The prevalence of PsO ranges from 0.09% to 11.4% worldwide [Danielsen K, Olsen AO, Wilsgaard T, Furberg A. Is the prevalence of psoriasis increasing? A 30-year follow-up of a population-based cohort. Br J Dermatol 2013 Jun;168(6):1303-1310. [CrossRef] [Medline]1,Tsai T, Wang T, Hung S, Tsai PI, Schenkel B, Zhang M, et al. Epidemiology and comorbidities of psoriasis patients in a national database in Taiwan. J Dermatol Sci 2011 Jul;63(1):40-46. [CrossRef] [Medline]2], and PsA occurs in approximately 12.7%-30% of patients with PsO [Patrick MT, Stuart PE, Raja K, Gudjonsson JE, Tejasvi T, Yang J, et al. Genetic signature to provide robust risk assessment of psoriatic arthritis development in psoriasis patients. Nat Commun 2018 Oct 09;9(1):4178 [FREE Full text] [CrossRef] [Medline]3,Chiu H, Wang T, Chen P, Hsu S, Tsai Y, Tsai T. Psoriasis in Taiwan: from epidemiology to new treatments. Dermatologica Sinica 2018 Sep;36(3):115-123. [CrossRef]4]. Patients with PsO develop PsA in approximately 7-12 years [Tillett W, Charlton R, Nightingale A, Snowball J, Green A, Smith C, et al. Interval between onset of psoriasis and psoriatic arthritis comparing the UK Clinical Practice Research Datalink with a hospital-based cohort. Rheumatology (Oxford) 2017 Dec 01;56(12):2109-2113. [CrossRef] [Medline]5,Busse K, Liao W. Which psoriasis patients develop psoriatic arthritis? Psoriasis Forum 2010;16(4):17-25 [FREE Full text] [Medline]6]. Irreversible joint deformities develop within 5-10 years in 30%-40% of patients with PsA and adversely affect many aspects of patients’ lives. According to the meta-analysis of Villani et al [Villani AP, Rouzaud M, Sevrain M, Barnetche T, Paul C, Richard M, et al. Prevalence of undiagnosed psoriatic arthritis among psoriasis patients: systematic review and meta-analysis. J Am Acad Dermatol 2015 Aug;73(2):242-248. [CrossRef] [Medline]7], PsA was undiagnosed in 15.5% of patients with PsO. The prediction of PsA before irreversible joint or bone damage is crucial. Early control of the inflammatory burden can reduce the severity of comorbidities [Kerdel F, Don F. The importance of early treatment in psoriasis and management of disease progression. J Drugs Dermatol 2018 Jul 01;17(7):737-742. [Medline]8]. Therefore, a tool for predicting PsA could assist physicians in promptly intervening to reduce inflammation.

Machine learning (ML) algorithms have been used to diagnose diseases [Han SS, Kim MS, Lim W, Park GH, Park I, Chang SE. Classification of the clinical images for benign and malignant cutaneous tumors using a deep learning algorithm. J Invest Dermatol 2018 Jul;138(7):1529-1538 [FREE Full text] [CrossRef] [Medline]9], predict disease progression [Wang H, Wang Y, Liang C, Li Y. Assessment of deep learning using nonimaging information and sequential medical records to develop a prediction model for nonmelanoma skin cancer. JAMA Dermatol 2019 Sep 04:1277-1283. [CrossRef] [Medline]10], and evaluate responses to treatment [Wang R, Shao X, Zheng J, Saci A, Qian X, Pak I, et al. A machine-learning approach to identify a prognostic cytokine signature that is associated with nivolumab clearance in patients with advanced melanoma. Clin Pharmacol Ther 2020 Apr;107(4):978-987. [CrossRef] [Medline]11]. Convolutional neural networks (CNNs) are types of ML suitable for working with spatial data, such as images. CNNs can enhance diagnostic capabilities and support decision-making based on medical imaging in the fields of dermatology, pathology, and radiology because of their strong image recognition capabilities. The clinical applications of CNNs include lesion detection and evaluation as well as disease prediction by analyzing electronic medical records (EMRs) [Yasaka K, Akai H, Kunimatsu A, Kiryu S, Abe O. Deep learning with convolutional neural network in radiology. Jpn J Radiol 2018 May;36(4):257-272. [CrossRef] [Medline]12]. Cheng et al [Cheng Y, Wang F, Zhang P, Hu J. Risk prediction with electronic health records: a deep learning approach. In: Proceedings of the 2016 SIAM International Conference on Data Mining. 2016 Presented at: SDM 2016; May 5-7; Miami, FL p. 432-440. [CrossRef]13] used a temporal matrix with time as one dimension and diagnostic codes as the other and validated the effectiveness of the proposed CNN model in predicting congestive heart failure and chronic obstructive pulmonary disease using EMRs. Wang et al [Wang H, Wang Y, Liang C, Li Y. Assessment of deep learning using nonimaging information and sequential medical records to develop a prediction model for nonmelanoma skin cancer. JAMA Dermatol 2019 Sep 04:1277-1283. [CrossRef] [Medline]10] visualized EMRs by importing timeline information to increase the dimension of input data (diagnostic and prescription codes). A risk prediction model was established to predict incident nonmelanoma skin cancer.

ML has been increasingly applied to PsO and PsA, and its use is not limited to the diagnosis or differentiation of cutaneous psoriatic lesions in an image or the assessment of cutaneous lesion severity. Patrick et al [Patrick MT, Stuart PE, Raja K, Gudjonsson JE, Tejasvi T, Yang J, et al. Genetic signature to provide robust risk assessment of psoriatic arthritis development in psoriasis patients. Nat Commun 2018 Oct 09;9(1):4178 [FREE Full text] [CrossRef] [Medline]3] used a genetic signature to analyze the differences between PsA and cutaneous PsO and constructed a model to predict the risk of PsA in patients with PsO before the appearance of joint symptoms, with an area under the receiver operating characteristic curve (AUROC) of 0.82. They combined statistical methods and ML techniques to identify genetic differences between various PsO subtypes and conducted a personalized PsO subtype risk assessment. However, it requires performing genome-wide association studies, which are expensive and time-consuming for each patient.

No tool, however, is available to predict PsA in the preclinical stages, especially in cases without the typical cutaneous or nail presentation of PsO. Haroon et al [Haroon M, Gallagher P, FitzGerald O. Diagnostic delay of more than 6 months contributes to poor radiographic and functional outcome in psoriatic arthritis. Ann Rheum Dis 2015 Jul;74(6):1045-1050. [CrossRef] [Medline]14] demonstrated that a delay in the intervention of more than 6 months from symptom onset can result in severe joint damage and poor prognosis. This study proposed a chronological EMR-based deep learning algorithm to predict the risk of PsA in patients with PsO. The model was built to provide an effective tool for dermatologists to screen patients with PsO at a high risk of PsA and may help physicians prevent or delay disease progression in the early stages.

Data Source

Data were obtained from Taiwan’s National Health Insurance Research Database (NHIRD), which contains data on the 2 million beneficiaries enrolled in Taiwan’s National Health Insurance (NHI). The database contains extensive real-world data, such as original claims data for reimbursement and the registration files of beneficiaries and health care facilities. Data from January 1, 1999, to December 31, 2013, were collected. The NHIRD provides information regarding NHI usage and medical care, including demographic characteristics, diagnostic and procedure codes, as well as prescription details.

Ethics Approval

The research was reviewed and approved by the Institutional Review Board of Taipei Medical University (N201701027)

Study Population and Design

This study was a case-control study (ratio of cases to controls: 1:4) and enrolled patients aged between 5 and 99 years who had at least 3 years (ie, 156 weeks) of records between January 1, 2002, and December 31, 2013.

The purpose of the PsA model was to distinguish patients susceptible to developing PsA from PsO. In this model, PsA was defined as patients with PsO who had at least two International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) code 696.0 (psoriatic arthropathy) diagnoses from outpatient visits or at least one from an admission claim. Patients without a diagnosis of PsO (ICD-9-CM code 696.1 or 696.8) and with a first-time diagnosis of PsO after PsA were excluded. The control group included patients with PsO (ICD-9-CM code 696.1 or 696.8) and excluded patients with PsA (ICD-9-CM code 696.8; Figure S1 in

Multimedia Appendix 1

Supplementary materials.

DOCX File , 718 KB Multimedia Appendix 1).

The index date was the first date of PsA diagnosis. The index date of the control group was the last date for which medical records were available in the database. This study used 2.5 years (131 weeks) as the observation window and 0.5 years (25 weeks) as the prediction window. The medical information from the observation window was used to predict new-onset PsA 0.5 years in advance.

Prediction Model Construction

This study used age, sex, ICD-9-CM diagnostic codes, and the Anatomical Therapeutic Chemical (ATC) prescription codes of the World Health Organization during the observation window to establish features. This study used 1098 (999+99) ICD-9-CM codes to translate the diagnosis of diseases and other health problems from text to code. The codes (999) were divided into 17 chapters on the basis of the cause and anatomy of the system and supplemented with V codes (99). In the ATC classification system, drugs are classified on the basis of their target organs or systems and their chemical, pharmacological, and therapeutic properties. A total of 830 drug categories were included in this study. The first 3 digits of the ICD-9-CM codes were used as diagnostic information as well as the first 5 characters of most of the ATC codes and the first 7 characters for medications, with “x” as the fifth character, were used as prescription information.

Supervised CNN models were constructed to differentiate between the presence or absence of PsA as a binary classification problem. The input layer of the CNN model comprised EMR information arranged chronologically. Each patient had their own characteristic temporal phenomic map (TPM). The vertical axis was the diagnosis and drug records. The horizontal axis was the date of a hospital visit and drug prescription. Each dot in the TPM represented the number of times a certain diagnosis was established on a certain visit and the number of days a certain drug was prescribed; these values ranged from 0 to 1 after normalization (Figure 1A).

The original data set was divided into training and holdout sets at an 80:20 ratio (Figure 1B). Five-fold cross-validation was performed to train and evaluate the model, prevent problems such as overfitting, and decrease generalization error. Subsequently, 80% of the data were used for training, and 20% were used for testing; the process was executed 5 times to cover all the data. Finally, the holdout set was used to provide an unbiased evaluation of the final model fit on the training set. Figure 1C displays the CNN architecture. The hidden layers were (1) the convolutional layer, (2) the pooling layer, (3) the flatten layer, (4) the concatenation layer, (5) the dropout layer, and (6) the fully connected layer. The output was a normalized probability of PsA ranging from 0 to 1, which was then converted to crisp class labels. The CNN model consisted of several pairs of convolutional and pooling layers, which were converted to a flatten layer. Each convolutional layer was followed by an activation function that provided the nonlinear transformation capability required by the network. The flatten layer was concatenated with information regarding age and sex, and this was followed by the operation of the fully connected layers. The dropout layer was used for regularization to avoid overfitting. The activation function was applied to the last fully connected layer for classification. This study was performed on Keras (version 2.3.0; Google Inc) with the TensorFlow framework (backend; version 2.2; Google Inc) in a Python 3.7 environment (Google Colab; Google Inc).

Figure 1. An overview of the prediction model for psoriatic arthritis. (A) Schematic of the temporal phenomic map. (B) Schematic of the machine learning classification framework. (C) Structure of the convolutional neural network.

Model Evaluation

The AUROC, sensitivity, specificity, positive predictive value, and negative predictive value were calculated to evaluate the performance of the models [Thomsen K, Iversen L, Titlestad TL, Winther O. Systematic review of machine learning for diagnosis and prognosis in dermatology. J Dermatolog Treat 2020 Aug;31(5):496-510. [CrossRef] [Medline]15]. AUROC was used to evaluate the predictive value of the model. The optimal discrimination threshold for AUROC was the point at which both sensitivity and specificity were maximized (sensitivity+specificity).

Feature Importance

This study interpreted the deep learning algorithm by examining the relationship between features and model performance. An occlusion sensitivity analysis was performed to identify the most crucial parts of the TPM for the neural network’s classification [Zeiler M, Fergus R. Visualizing and understanding convolutional networks. In: ECCV 2014: Computer Vision – ECCV 2014.: SpringerLink; 2014 Presented at: European Conference on Computer Vision; Sep 6-12; Zurich, Switzerland p. 818-833. [CrossRef]16]. Stepwise elimination was performed to examine AUROC loss and identify the crucial factors of the model. Features or groups of features were eliminated one by one in a stepwise manner as described in the literature to determine AUROC loss (Figure S2 in

Multimedia Appendix 1

Supplementary materials.

DOCX File , 718 KB Multimedia Appendix 1). After the crucial features were identified, a logistic regression (LR) analysis was performed on the predictors and PsA. The dependence of PsA occurrence on the predictors was quantified using odds ratios.

Statistical Analysis

Data were reported as means with SDs for the parametric variables. This study performed an LR analysis of PsA and its predictors. P<.05 was considered statistically significant. Statistical analysis was performed using SAS Enterprise Guide (version 7.1) and Enterprise Miner (version 14.3; SAS Institute Inc).

Demographics of the Sampled Data Set

Table 1 lists the demographic information. The mean age was 42.66 (SD 17.21) years, and the PsA group comprised 266 (61.43%) men. In the randomly sampled control group, the mean age was 46.85 (SD 20.18) years, and the group comprised 989 (57.10%) men. The TPM of each patient consisted of 2.5 years of time-series medical records (ICD-9-CM codes for each visit and prescription medications obtained on each date). The annual number of ICD-9-CM codes multiplied by clinical visits per person was 32.6 in the PsA group and 40.1 in the control groups. The annual number of drugs multiplied by prescription days per person was 50.4 in the PsA group and 43.9 in the control groups.

Table 1. Demographics of the sampled data set.

Characteristics			Psoriatic arthritis group (n=443)		Control group (n=1772)
Age (years), mean (SD)^a			42.66 (17.21)		46.85 (20.18)
Age (years), range (min-max)			5-95		7-94
Sex, n (%)^a
	Male	266 (61.43)		989 (57.10)
	Female	177 (38.57)		783 (42.90)
Total diagnosis (ICD-9-CM^b), n (%)			520 (47.36)		695 (63.30)
Total medication (ATC^c), n (%)			415 (50)		516 (62.17)
Annual accumulation, n
	ICD-9-CM counts per person	32.6		40.1
	Medications per person	50.4		43.9

^aP<.001.

^bICD-9-CM: International Classification of Diseases, Ninth Revision, Clinical Modification.

^cATC: Anatomical Therapeutic Chemical.

The Architecture of the CNN Model

The model consisted of 8 hidden layers [Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE 1998;86(11):2278-2324. [CrossRef]17]. Figure S3 in

Multimedia Appendix 1

Supplementary materials.

DOCX File , 718 KB Multimedia Appendix 1 displays the CNN architecture. The first layer was a convolutional layer with 32 filters in a 1×131 shape, where 131 represents the total number of weeks in 2.5 years, which was the x-axis of the input TPM. The second layer was the average pooling layer, which calculated the average for each patch of the feature map. The purpose of the pooling layer was (1) to reduce the number of parameters and dimensions of data and the computational cost by subsampling the input image and (2) to maintain feature invariance. The average pooling size was 2×2. The third layer was a convolutional layer with 2 filters in a 1×131 shape. The fourth layer was the max pooling layer. The max pooling size was 1×3. The fifth layer was a flatten layer, and the sixth layer was concatenated with information regarding sex and age. The seventh layer was a fully connected layer with 128 neurons. The eighth layer was a dropout layer with a dropout rate of 0.3 for the overfitting problem. One neuron represented the possibility of risk in the output layer with sigmoid activation.

Regarding the hyperparameters of the CNN, the epoch was set to 20 to obtain the optimal AUROC on the basis of the experimental results, and the batch size was 64. The learning rate was optimized by Adam. The activation functions of the convolutional and fully connected layers were reLU and LeakyReLU, and the activation function of the output layer was sigmoid. All patients in the training data set were randomly divided, with 80% for training and 20% for 5-fold cross-validation. Approximately 60 to 120 minutes were required to complete the 5-fold cross-validation.

Performance of the CNN Model

The PsA model predicted PsA with a mean AUROC of 0.70 (SD 0.11; 95% CI 0.559-0.833), a mean sensitivity of 0.80 (SD 0.11), a mean specificity of 0.60 (SD 0.04), a mean positive predictive value of 0.32 (SD 0.05), and a mean negative predictive value of 0.93 (SD 0.04; Table S1 in

Multimedia Appendix 1

Supplementary materials.

DOCX File , 718 KB Multimedia Appendix 1). presents the receiver operating characteristic curve of the model. The risk probability score ranged from 0 (no disease) to 1 (disease). The optimal discrimination threshold was 0.429. Models with a case-control ratio of 1:10 and 10-fold cross-validation were used. However, their performance was not satisfactory, with a mean AUROC of 0.64 (SD 0.06; 95% CI 0.595-0.687; Figure S4 in ).

Figure 2. Receiver operating characteristic curve of the model using sequential medical records. AUROC: area under the receiver operating characteristic curve.

Crucial Features of the CNN Models

An occlusion sensitivity analysis was performed to interpret the CNN model. This analysis was performed to identify the most crucial parts of the TPM for model classification. The features were identified by evaluating AUROC loss through stepwise elimination.

The AUROC loss of the PsA model ranged from −0.001% to −3.54%. Table S2 in

Multimedia Appendix 1

Supplementary materials.

DOCX File , 718 KB Multimedia Appendix 1 presents the features with the strongest effects on the power of prediction (>2.02% loss). Age was the most crucial feature in the model, with an AUROC loss of −3.54%. Autoimmune connective tissue diseases (−2.20%); rheumatoid arthritis and other inflammatory polyarthritis (−2.10%); anxiety and depression (−2.07%); ankylosing spondylitis and other inflammatory spondylopathies (−2.07%); osteoporosis and pathologic fracture (−2.07%); atopic dermatitis (−2.04%); menopausal and postmenopausal disorders (−2.06%); as well as other chronic comorbidities, such as renal diseases (−2.10%), obesity and metabolic syndrome (−2.08%), dyslipidemia (−2.08%), cardiovascular disorders (−2.06%), diabetes (−2.03%), and hypertension (−2.03%), were identified as crucial features.

Medications such as disease-modifying antirheumatic drugs (DMARDs; −2.06%); methotrexate (−2.10%); azathioprine (−2.09%); acitretin (−2.09%); aminoquinolines like hydroxychloroquine and chloroquine (−2.06%); calcineurin inhibitors like ciclosporin (−2.06%); selective immunosuppressants like leflunomide and tofacitinib (−2.06%); and sulfasalazine (−2.02%) were identified as crucial features. Topical medications, such as tars (−2.82%), corticosteroids, and vitamin D analogues—calcipotriol (−2.77%) and calcitriol (−2.09%)—were identified as crucial features.

Principal Results

A CNN model was constructed to identify patients with PsO at a high risk of PsA 6 months in advance using chronological EMRs, with an AUROC of 0.70. Early intervention can prevent PsA, especially in patients with PsO. This model used prescription and medication EMR data for prediction and did not require other information, such as nail lesions, severity, area of cutaneous lesions, or family history. In addition, potential predictive features, such as comorbidities and medications, were analyzed. This PsA prediction model may help physicians to identify high-risk groups and intervene before irreversible disease progression and functional loss.

PsA is a severe and often irreversible comorbidity of PsO that substantially affects patients’ quality of life. Patrick et al [Patrick MT, Stuart PE, Raja K, Gudjonsson JE, Tejasvi T, Yang J, et al. Genetic signature to provide robust risk assessment of psoriatic arthritis development in psoriasis patients. Nat Commun 2018 Oct 09;9(1):4178 [FREE Full text] [CrossRef] [Medline]3] constructed an ML pipeline to distinguish patients with PsA from those with PsO on the basis of genetic background. The AUROC was 0.82 in the cross-validation and testing when 200 genetic markers were used. Although the AUROC of our PsA model was 0.70 for the test set, this study only used the easy-to-obtain and simple ICD-9-CM diagnostic and ATC medication codes in EMRs to construct the model rather than conducting a genome-wide association study for each patient, which is expensive and time-consuming.

Mease et al [Mease PJ, Gladman DD, Papp KA, Khraishi MM, Thaçi D, Behrens F, et al. Prevalence of rheumatologist-diagnosed psoriatic arthritis in patients with psoriasis in European/North American dermatology clinics. J Am Acad Dermatol 2013 Nov;69(5):729-735. [CrossRef] [Medline]18] indicated that 41% of patients with PsA had not received their diagnosis at dermatology clinics. Villani and colleagues [Villani AP, Rouzaud M, Sevrain M, Barnetche T, Paul C, Richard M, et al. Prevalence of undiagnosed psoriatic arthritis among psoriasis patients: systematic review and meta-analysis. J Am Acad Dermatol 2015 Aug;73(2):242-248. [CrossRef] [Medline]7] estimated that 15.5% of patients with PsO have undiagnosed PsA. The high prevalence of undiagnosed PsA in patients with PsO should remind dermatologists of the criticality of screening all patients with PsO for PsA. Mease et al [Mease PJ, Gladman DD, Helliwell P, Khraishi MM, Fuiman J, Bananis E, et al. Comparative performance of psoriatic arthritis screening tools in patients with psoriasis in European/North American dermatology clinics. J Am Acad Dermatol 2014 Oct;71(4):649-655. [CrossRef] [Medline]19] evaluated 3 PsA screening questionnaires: the Psoriasis and Arthritis Screening Questionnaire, the Psoriasis Epidemiology Screening Tool, and the Toronto Psoriatic Arthritis Screening Questionnaire. The negative predictive value of our PsA model (0.93) was higher than those of the screening tools in Mease et al (0.83-0.91) [Mease PJ, Gladman DD, Papp KA, Khraishi MM, Thaçi D, Behrens F, et al. Prevalence of rheumatologist-diagnosed psoriatic arthritis in patients with psoriasis in European/North American dermatology clinics. J Am Acad Dermatol 2013 Nov;69(5):729-735. [CrossRef] [Medline]18], but the sensitivity was similar (0.80 vs 0.67-0.84). The specificity of our PsA model (0.60) was also similar to that of Mease et al (0.64-0.75) [Mease PJ, Gladman DD, Papp KA, Khraishi MM, Thaçi D, Behrens F, et al. Prevalence of rheumatologist-diagnosed psoriatic arthritis in patients with psoriasis in European/North American dermatology clinics. J Am Acad Dermatol 2013 Nov;69(5):729-735. [CrossRef] [Medline]18]. Therefore, the prediction models in this study are useful screening tools for detecting probable or subclinical PsA.

An occlusion sensitivity analysis is a simple technique used to determine how a CNN makes a classification [Zeiler M, Fergus R. Visualizing and understanding convolutional networks. In: ECCV 2014: Computer Vision – ECCV 2014.: SpringerLink; 2014 Presented at: European Conference on Computer Vision; Sep 6-12; Zurich, Switzerland p. 818-833. [CrossRef]16]. The output of the model can be observed by occluding a part of the image to determine the blocks in the image that are more crucial for model classification. This study used a similar technique for the TPM-based ML model by removing one or more diagnostic or medication codes in a stepwise manner and observing the changes in the AUROC. When a crucial part of a TPM is removed, the AUROC of the prediction should decrease substantially. Most of the decisive factors in this study were consistent with those identified in the literature.

The comorbidities resulting in the greatest AUROC loss (>2.03%) when removed one code at a time were the diffuse diseases of systemic lupus erythematosus, rheumatoid arthritis, vitiligo, alopecia, diabetes, hypertension, cardiovascular disease, dyslipidemia, metabolic syndrome, kidney disease, and psychosis, which confirmed the multisystemic nature of PsA [Tsai T, Wang T, Hung S, Tsai PI, Schenkel B, Zhang M, et al. Epidemiology and comorbidities of psoriasis patients in a national database in Taiwan. J Dermatol Sci 2011 Jul;63(1):40-46. [CrossRef] [Medline]2,Chiu H, Wang T, Chen P, Hsu S, Tsai Y, Tsai T. Psoriasis in Taiwan: from epidemiology to new treatments. Dermatologica Sinica 2018 Sep;36(3):115-123. [CrossRef]4,Furue K, Ito T, Tsuji G, Kadono T, Nakahara T, Furue M. Autoimmunity and autoimmune co-morbidities in psoriasis. Immunology 2018 May;154(1):21-27 [FREE Full text] [CrossRef] [Medline]20].

The drugs resulting in the greatest AUROC loss (>2.02%) when removed one code at a time were topical corticosteroids, topical vitamin D analogs, tars, methotrexate, acitretin, ciclosporin, leflunomide, and sulfasalazine, indicating that the use of topical steroids and DMARDs for PsO strongly affected the prediction of the models. High-potency topical steroids and DMARDs are usually reserved for severe and refractory PsO, and these results indicate that more severe skin symptoms are associated with a higher risk of PsA [Tillett W, Charlton R, Nightingale A, Snowball J, Green A, Smith C, et al. Interval between onset of psoriasis and psoriatic arthritis comparing the UK Clinical Practice Research Datalink with a hospital-based cohort. Rheumatology (Oxford) 2017 Dec 01;56(12):2109-2113. [CrossRef] [Medline]5].

Because the results from ML are not necessarily based on clinical evidence, physician interpretation is crucial for practical implementation. Studies have performed regression analyses to identify the correlation between dependent and independent variables [Munger E, Choi H, Dey AK, Elnabawi YA, Groenendyk JW, Rodante J, et al. Application of machine learning to determine top predictors of noncalcified coronary burden in psoriasis: an observational cohort study. J Am Acad Dermatol 2020 Dec;83(6):1647-1653 [FREE Full text] [CrossRef] [Medline]21]. LR is performed to obtain ORs in cases with more than 1 explanatory variable to mitigate the effect of confounding factors, [Sperandei S. Understanding logistic regression analysis. Biochem Med (Zagreb) 2014;24(1):12-18 [FREE Full text] [CrossRef] [Medline]22] but correlation does not imply causation in LR. Deep learning techniques are particularly suited to complex data sets with nonlinear solutions, especially in high-dimensionality data sets. In this study, the CNN model weighted whole factors, including chronological sequence, at the same time for prediction; thus, it could not provide reliable statistical inferences of associations because of the black box nature. However, even a feature without a statistically significant correlation can function as a predictor in the model. Although ML can assist physicians in making diagnoses and treatment plans, it cannot replace physicians’ decision-making process, which is based on clinical evidence and experience [Chan S, Reddy V, Myers B, Thibodeaux Q, Brownstone N, Liao W. Machine learning in dermatology: current applications, opportunities, and limitations. Dermatol Ther (Heidelb) 2020 Jun;10(3):365-386 [FREE Full text] [CrossRef] [Medline]23].

Limitations

This study had several potential limitations. First, this study was retrospective in nature. Because of the lack of prospective and external validation data, the utility of these models in clinical scenarios for real-world applications must be evaluated. Second, the optimal structure and hyperparameters of CNNs trained on certain data and the number of cases required to train models varied from task to task [Rajkomar A, Dean J, Kohane I. Machine learning in medicine. N Engl J Med 2019 Apr 04;380(14):1347-1358. [CrossRef] [Medline]24]. Third, the data in this study were from 1999 to 2013; the absence of PsA in this 15-year period did not mean that the patients would not develop PsA in the future. Fourth, although Taiwan’s NHI claims data are particularly valuable because of their standardization, representativeness, and comprehensiveness, we should keep in mind that the claims data are intended for administrative purposes and lack data on examination results, disease severity, and health behavior [Hsieh C, Su C, Shao S, Sung S, Lin S, Kao Yang Y, et al. Taiwan's National Health Insurance Research Database: past and future. Clin Epidemiol 2019;11:349-358 [FREE Full text] [CrossRef] [Medline]25].

Conclusions

PsA causes irreversible joint damage that severely affects patients’ daily functions and quality of life and is a burden on medical resources. No simple and reliable predictive tools or criteria are available to help physicians intervene early before joint symptoms appear. In addition, PsA is often indistinguishable from other types of inflammatory arthritis, such as rheumatoid arthritis and ankylosing spondylitis, if no skin or nail symptoms are present. Because most skin symptoms of PsO appear before joint symptoms, dermatologists must be diligent in identifying and treating patients with PsA. This study visualized EMRs and temporal information as TPMs and created a computer vision model using CNNs. Our prediction model achieved good performance for predicting PsA risk using standardized and population-level claim data. The predictive models may be used as a screening tool to assist physicians in risk stratification and identifying psoriatic patients with a high risk of PsA.

Acknowledgments

We acknowledge the statistical support of the Research Center of Biostatistics, Taipei Medical University (TMU), Taiwan. The authors also acknowledge the academic and science graphic illustration service provided by TMU Office of Research and Development. This manuscript was edited by Wallace Academic Editing. This research was supported by grant 109TMUH-NE-08 from the TMU Hospital in Taiwan and NSTC 111-2622-8-038-006-IE from the National Science and Technology Council, Taiwan.

Conflicts of Interest

None declared.

‎

Multimedia Appendix 1

Supplementary materials.

DOCX File , 718 KB

Danielsen K, Olsen AO, Wilsgaard T, Furberg A. Is the prevalence of psoriasis increasing? A 30-year follow-up of a population-based cohort. Br J Dermatol 2013 Jun;168(6):1303-1310. [CrossRef] [Medline]
Tsai T, Wang T, Hung S, Tsai PI, Schenkel B, Zhang M, et al. Epidemiology and comorbidities of psoriasis patients in a national database in Taiwan. J Dermatol Sci 2011 Jul;63(1):40-46. [CrossRef] [Medline]
Patrick MT, Stuart PE, Raja K, Gudjonsson JE, Tejasvi T, Yang J, et al. Genetic signature to provide robust risk assessment of psoriatic arthritis development in psoriasis patients. Nat Commun 2018 Oct 09;9(1):4178 [FREE Full text] [CrossRef] [Medline]
Chiu H, Wang T, Chen P, Hsu S, Tsai Y, Tsai T. Psoriasis in Taiwan: from epidemiology to new treatments. Dermatologica Sinica 2018 Sep;36(3):115-123. [CrossRef]
Tillett W, Charlton R, Nightingale A, Snowball J, Green A, Smith C, et al. Interval between onset of psoriasis and psoriatic arthritis comparing the UK Clinical Practice Research Datalink with a hospital-based cohort. Rheumatology (Oxford) 2017 Dec 01;56(12):2109-2113. [CrossRef] [Medline]
Busse K, Liao W. Which psoriasis patients develop psoriatic arthritis? Psoriasis Forum 2010;16(4):17-25 [FREE Full text] [Medline]
Villani AP, Rouzaud M, Sevrain M, Barnetche T, Paul C, Richard M, et al. Prevalence of undiagnosed psoriatic arthritis among psoriasis patients: systematic review and meta-analysis. J Am Acad Dermatol 2015 Aug;73(2):242-248. [CrossRef] [Medline]
Kerdel F, Don F. The importance of early treatment in psoriasis and management of disease progression. J Drugs Dermatol 2018 Jul 01;17(7):737-742. [Medline]
Han SS, Kim MS, Lim W, Park GH, Park I, Chang SE. Classification of the clinical images for benign and malignant cutaneous tumors using a deep learning algorithm. J Invest Dermatol 2018 Jul;138(7):1529-1538 [FREE Full text] [CrossRef] [Medline]
Wang H, Wang Y, Liang C, Li Y. Assessment of deep learning using nonimaging information and sequential medical records to develop a prediction model for nonmelanoma skin cancer. JAMA Dermatol 2019 Sep 04:1277-1283. [CrossRef] [Medline]
Wang R, Shao X, Zheng J, Saci A, Qian X, Pak I, et al. A machine-learning approach to identify a prognostic cytokine signature that is associated with nivolumab clearance in patients with advanced melanoma. Clin Pharmacol Ther 2020 Apr;107(4):978-987. [CrossRef] [Medline]
Yasaka K, Akai H, Kunimatsu A, Kiryu S, Abe O. Deep learning with convolutional neural network in radiology. Jpn J Radiol 2018 May;36(4):257-272. [CrossRef] [Medline]
Cheng Y, Wang F, Zhang P, Hu J. Risk prediction with electronic health records: a deep learning approach. In: Proceedings of the 2016 SIAM International Conference on Data Mining. 2016 Presented at: SDM 2016; May 5-7; Miami, FL p. 432-440. [CrossRef]
Haroon M, Gallagher P, FitzGerald O. Diagnostic delay of more than 6 months contributes to poor radiographic and functional outcome in psoriatic arthritis. Ann Rheum Dis 2015 Jul;74(6):1045-1050. [CrossRef] [Medline]
Thomsen K, Iversen L, Titlestad TL, Winther O. Systematic review of machine learning for diagnosis and prognosis in dermatology. J Dermatolog Treat 2020 Aug;31(5):496-510. [CrossRef] [Medline]
Zeiler M, Fergus R. Visualizing and understanding convolutional networks. In: ECCV 2014: Computer Vision – ECCV 2014.: SpringerLink; 2014 Presented at: European Conference on Computer Vision; Sep 6-12; Zurich, Switzerland p. 818-833. [CrossRef]
Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE 1998;86(11):2278-2324. [CrossRef]
Mease PJ, Gladman DD, Papp KA, Khraishi MM, Thaçi D, Behrens F, et al. Prevalence of rheumatologist-diagnosed psoriatic arthritis in patients with psoriasis in European/North American dermatology clinics. J Am Acad Dermatol 2013 Nov;69(5):729-735. [CrossRef] [Medline]
Mease PJ, Gladman DD, Helliwell P, Khraishi MM, Fuiman J, Bananis E, et al. Comparative performance of psoriatic arthritis screening tools in patients with psoriasis in European/North American dermatology clinics. J Am Acad Dermatol 2014 Oct;71(4):649-655. [CrossRef] [Medline]
Furue K, Ito T, Tsuji G, Kadono T, Nakahara T, Furue M. Autoimmunity and autoimmune co-morbidities in psoriasis. Immunology 2018 May;154(1):21-27 [FREE Full text] [CrossRef] [Medline]
Munger E, Choi H, Dey AK, Elnabawi YA, Groenendyk JW, Rodante J, et al. Application of machine learning to determine top predictors of noncalcified coronary burden in psoriasis: an observational cohort study. J Am Acad Dermatol 2020 Dec;83(6):1647-1653 [FREE Full text] [CrossRef] [Medline]
Sperandei S. Understanding logistic regression analysis. Biochem Med (Zagreb) 2014;24(1):12-18 [FREE Full text] [CrossRef] [Medline]
Chan S, Reddy V, Myers B, Thibodeaux Q, Brownstone N, Liao W. Machine learning in dermatology: current applications, opportunities, and limitations. Dermatol Ther (Heidelb) 2020 Jun;10(3):365-386 [FREE Full text] [CrossRef] [Medline]
Rajkomar A, Dean J, Kohane I. Machine learning in medicine. N Engl J Med 2019 Apr 04;380(14):1347-1358. [CrossRef] [Medline]
Hsieh C, Su C, Shao S, Sung S, Lin S, Kao Yang Y, et al. Taiwan's National Health Insurance Research Database: past and future. Clin Epidemiol 2019;11:349-358 [FREE Full text] [CrossRef] [Medline]

‎

ATC: Anatomical Therapeutic Chemical

AUROC: area under the receiver operating characteristic curve

CNN: convolutional neural network

DMARD: disease-modifying antirheumatic drug

EMR: electronic medical record

ICD-9: International Classification of Disease, Ninth Revision

LR: logistic regression

ML: machine learning

PsA: psoriatic arthritis

PsO: psoriasis

TPM: temporal phenomic map

Edited by G Eysenbach; submitted 31.05.22; peer-reviewed by SJC Soerensen , S Sarejloo, J Liao, F Lai; comments to author 09.08.22; revised version received 21.08.22; accepted 07.03.23; published 28.03.23

©Leon Tsung-Ju Lee, Hsuan-Chia Yang, Phung Anh Nguyen, Muhammad Solihuddin Muhtar, Yu-Chuan Jack Li. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 28.03.2023.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

This paper is in the following e-collection/theme issue:

Machine Learning Approaches for Predicting Psoriatic Arthritis Risk Using Electronic Medical Records: Population-Based Study