Review
Abstract
Background: Although catheter ablation (CA) is currently the most effective clinical treatment for atrial fibrillation, its variable therapeutic effects among different patients present numerous problems. Machine learning (ML) shows promising potential in optimizing the management and clinical outcomes of patients undergoing atrial fibrillation CA (AFCA).
Objective: This scoping review aimed to evaluate the current scientific evidence on the application of ML for managing patients undergoing AFCA, compare the performance of various models across specific clinical tasks within AFCA, and summarize the strengths and limitations of ML in this field.
Methods: Adhering to the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines, relevant studies published up to October 7, 2023, were searched from PubMed, Web of Science, Embase, the Cochrane Library, and ScienceDirect. The final included studies were confirmed based on inclusion and exclusion criteria and manual review. The PROBAST (Prediction model Risk Of Bias Assessment Tool) and QUADAS-2 (Quality Assessment of Diagnostic Accuracy Studies-2) methodological quality assessment tools were used to review the included studies, and narrative data synthesis was performed on the modeled results provided by these studies.
Results: The analysis of 23 included studies showcased the contributions of ML in identifying potential ablation targets, improving ablation strategies, and predicting patient prognosis. The patient data used in these studies comprised demographics, clinical characteristics, various types of imaging (9/23, 39%), and electrophysiological signals (7/23, 30%). In terms of model type, deep learning, represented by convolutional neural networks, was most frequently applied (14/23, 61%). Compared with traditional clinical scoring models or human clinicians, the model performance reported in the included studies was generally satisfactory, but most models (14/23, 61%) showed a high risk of bias due to lack of external validation.
Conclusions: Our evidence-based findings suggest that ML is a promising tool for improving the effectiveness and efficiency of managing patients undergoing AFCA. While guiding data preparation and model selection for future studies, this review highlights the need to address prevalent limitations, including lack of external validation, and to further explore model generalization and interpretability.
doi:10.2196/60888
Keywords
Introduction
Background
Atrial fibrillation (AF) is the most common type of cardiac arrhythmia [
], and its incidence has risen steadily over the past 3 decades, making it an urgent public health problem worldwide [ , ]. While AF substantially impacts patients’ quality of life, it also increases the risk of serious medical conditions [ - ], such as stroke and thromboembolism. This contributes to long-term rises in both morbidity and mortality rates [ ] as well as a growing strain on health care costs [ ].Catheter ablation (CA) represents the most effective clinical intervention currently available for AF [
]. When patients receive their first CA treatment, the effectiveness rate can reach 60%-80% within 1 year [ , ]. However, the long-term efficacy is not optimal , and the success rate decreases to 50%-60% at 3 to 5 years [ - ]. The type and degree of AF, cardiac stroma, obesity, alcohol consumption, and obstructive sleep apnea syndrome are all known to be potential contributors to high rates of recurrence in the long term [ , ]. In addition, the surgical modality and ablation strategy used in treatment have an impact, particularly for those patients who require additional non–pulmonary vein (PV) ablations [ ]. Consequently, developing tailored management strategies that consider different patient conditions, optimizing the surgical approach and enhancing its long-term efficacy, is a challenge that needs to be resolved in this field.In clinical practice, to better manage patients, clinical risk scores (such as APPLE [
]) specifically for predicting postoperative recurrence of AF CA (AFCA) have emerged in the past decade, or attempts have been made to use other scores (such as CHADS2 [ ] and CHA2DS2-VASc [ ]) to evaluate patients’ postoperative rhythm outcomes after AFCA [ ]. These risk scores typically use a limited number of risk factors as predictors. Although this simplicity facilitates their adoption and use in clinical settings, it concurrently caps their predictive effectiveness. In addition, these scoring models predominantly rely on the evaluation of specific biomarkers in previous literature when selecting predictors. This approach may easily lead to the neglect of some potential predictors [ ], thus failing to truly capture the complexity of the AFCA recurrence mechanism and making it difficult to truly meet the pursuit of efficient and personalized patient management.Rapidly developing machine learning (ML) techniques hold significant promise as a formidable support in grappling with the complexities of this challenge. In recent years, the ability of ML to analyze large biomedical datasets, such as electronic health records (EHRs), medical imaging, and multiomics data, has made it increasingly popular in medical research [
, ]. ML models have become promising tools for enhancing the precision and efficiency of AF diagnosis and treatment. By analyzing complex, multidimensional datasets, they help identify individuals at high risk for AF, uncover risk factors and biomarkers associated with the disease’s progression, and guide personalized interventions for prognosis. One branch of ML, deep learning (DL), stands out due to its powerful capabilities in automatic feature recognition and prediction. In processing complex data types such as images and electrophysiological signals, DL has demonstrated performance comparable to that of human experts [ , ]. Several studies have attempted to apply ML technology to the management of patients undergoing AFCA. As the application prospects of ML technology in this field become increasingly clear, a comprehensive overview and evaluation of the existing relevant literature is needed.Objectives
The purpose of this scoping review is to systematically review the current progress in using ML technology in the management of patients undergoing AFCA. Specifically, the studies discussed in this review explore a wide range of ML models and multiple data modalities, including EHR, imaging omics, and cardiac electrophysiological signals. We will summarize how these studies prepare data, select models, and evaluate results across different clinical tasks. In addition, we will emphasize the advantages and limitations of ML technology in this field based on the integration and comparison of these study findings.
Methods
This scoping review was conducted following the PRISMA-ScR (Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews) guidelines [
].Information Sources and Search Strategies
A comprehensive search was conducted to identify relevant studies across 5 databases as of October 7, 2023: PubMed, Web of Science, Embase, the Cochrane Library, and ScienceDirect. The specific search strategies used for each database are detailed in Table S1 in
. In addition to obtaining literature through search formula, additional relevant articles (3/246, 1.2%) were manually identified by using a forward-backward snowballing approach.Study Inclusion and Exclusion Criteria
In line with the thematic focus and objectives of our review, the studies included in this scoping review were required to meet the eligibility criteria outlined in
. Studies were excluded from this scoping review if they met any of the exclusion criteria.Inclusion criteria
- Article type: articles (peer reviewed and formally published)
- Study topic: specializes in the management of patients undergoing catheter ablation for atrial fibrillation
- Study participants: real patients with atrial fibrillation
- Study design: ML techniques were clearly applied and described; results of the study and the ML model performance outcome metrics were explicitly presented
- Language of publication: English
Exclusion criteria
- Article type: reviews; meta-analyses; editorials; case reports; editorial materials; other types of publications
- Study topic: atrial fibrillation disease detection; risk prediction of new onset of atrial fibrillation
- Study participants: only used fictional patient data
- Study design: did not apply ML techniques; did not explicitly report on the model outcome metrics
- Language of publication: non-English
Study Selection
After removing duplicates, all retrieved studies were manually screened by title and abstract to exclude those clearly not aligned with the objectives of this review. Studies that passed this initial screening were then downloaded in full and assessed based on the inclusion and exclusion criteria during a full-text review. This process was independently conducted by 2 reviewers, with any disagreements resolved through consultation with a third reviewer to reach a consensus.
Data Extraction
Two reviewers independently extracted data from the selected studies, and a third reviewer was responsible for assessing and addressing any discrepancies and biases in data extraction. The following data items were extracted: author names and publication years, data sources, dataset sizes, ML models, model variables, model tasks (such as prediction or detection), and model results. If multiple ML models were used in a study, we will report only the one with the best performance.
Quality Assessment
We selected the widely recognized Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) [
] and Prediction model Risk Of Bias Assessment Tool (PROBAST) [ ] as methodological quality assessment tools. We divided the included studies into 2 subgroups for independent evaluation based on different research objectives, namely, subgroup A: studies that aim at image classification or segmentation, assessed using an adapted version of QUADAS-2; and subgroup B: with the goal of predicting patient prognosis, assessed using PROBAST. This assessment was independently conducted by 2 reviewers, and any disagreements or discrepancies that arise during the assessment process were resolved with the involvement of a third reviewer.During our review, we adjusted some signal questions in QUADAS-2 as needed. Tables S2 and S3 in
report the specific signal questions and assessment instructions for the tools used in the quality assessment process for this review.Results
Study Selection and Study Characterization
A total of 246 studies were identified from different databases using the method described in the Information Sources and Search Strategies section (
). After excluding duplicate publications (65/246, 26.4%), nonarticle publications (32/246, 13%), and studies that did not match the research objectives of this scoping review (115/246, 46.7%), 34 (13.8%) studies remained for full-text assessment. Ultimately, 23 studies were deemed eligible and were included for data extraction and synthesis. Table S4 in provides a detailed list of the studies excluded after full-text assessment, along with the reasons for their exclusion.
provides the basic characteristics of the studies included in this review. The publication of these studies was concentrated within the past 5 years, which can be attributed to the maturation of ML technologies and the growing interest in precision medicine among researchers in the field of AF. The included studies were conducted in 9 different countries, with the United States (7/23, 30%) having the largest number of studies, followed by China (5/23, 22%) and the United Kingdom (4/23, 17%). This diversity in publication countries reflects the global interest in using ML models for the management of patients undergoing AFCA. Regarding the types of models used, most of the included studies (14/23, 61%) used DL techniques.
Characteristic | Studies, n (%) | |
Country of study | ||
United States | 7 (30) | |
China | 5 (22) | |
United Kingdom | 4 (17) | |
Korea | 2 (9) | |
Spain | 1 (4) | |
The Netherlands | 1 (4) | |
Canada | 1 (4) | |
Italy | 1 (4) | |
Japan | 1 (4) | |
Year of publication | ||
2023 | 3 (13) | |
2022 | 7 (30) | |
2021 | 6 (26) | |
2020 | 6 (26) | |
2019 | 1 (4) | |
Type of models | ||
Machine learning | 23 (100) | |
Deep learning | 14 (61) | |
Key medical tasks | ||
Patient prognosis prediction | 16 (70) | |
Ablation targets identification | 3 (13) | |
Ablation strategy improvement | 4 (17) |
Regarding the application direction of ML in AFCA patient management, this review divides the included studies into 3 subgroups corresponding to 3 key medical tasks based on the purpose of the model. The first and most common focus is the development of prognostic models for patients undergoing AFCA (16/23, 70%). These studies aim to provide accurate, timely, and personalized postoperative risk predictions for patients undergoing AFCA [
- ]. Although these studies share similar objectives, they use diverse types of data, with EHRs being the primary data source. The second key task focuses on the identification of ablation targets (3/23, 13%), aiming to use the feature extraction capabilities of DL models to identify potential ablation targets, such as rotors [ , ] and focal sources [ ], from electrophysiological and medical imaging data. Third, some studies focus on using preoperative imaging data of patients to improve ablation strategies (4/23,17%), assisting physicians in recognizing interpatient differences to guide ablation treatments [ - ].Characteristics of the Data in the Included Studies
When developing ML models for patient management, key factors to consider include dataset size, data modality, and resource availability.
outlines the wide range of data modalities covered in this review, and we note that there is a wide variation in the choice of variables and the number of features across studies, which highlights the diversity and complexity of factors associated with influencing patient management in AFCA.Study | Patient cohort | Input data for the model | Data sources | |
Patients, n | Time frame | |||
Liu et al [ | ], 2020358 | October 2004 to December 2017 | PVCTa | Taipei Veterans General Hospital |
Firouznia et al [ | ], 2021203 | 2013 to 2016 | CTb, age, LAVc, AFd type, and catheter ablation technique | Cleveland Clinic |
Atta-Fosu et al [ | ], 202168 | July 2015 to November 2016 | CT, age, sex, LAV, LVEFe, BMI, sinus rhythm at the time of ablation, AF type, and catheter ablation technique | Cleveland Clinic |
Muizniece et al [ | ], 2021122 | 2011 to 2016 | LGE-MRIf | 2018 STACOM Split Challenge and St. Thomas Hospital |
Muffoletto et al [ | ], 2021122 | 2011 to 2016 | LGE-MRI | 2018 STACOM Split Challenge and St. Thomas Hospital |
Shade et al [ | ], 202032 | December 2011 to December 2015 | Personalized atrial computational modeling and LGE-MRI | Hopkins University Hospital |
Roney et al [ | ], 2022100 | NRg | Different types of fibrosis in the atrial matrix, fibrograms, AF induction protocols, ERPh values, PVIi size, visual fibrosis score, total surface area of left atrium, total surface area of the PVs, fibrotic area of PVs, LVEF, BMI, age, female, congestive HFj, hypertension, diabetes mellitus, stroke, coronary artery disease, and type of AF | St. Thomas Hospital |
Yang et al [ | ], 2020190 | 2011 to 2018 | LGE-MRI | Local regional ethics committee |
Hwang et al [ | ], 2020606 | July 2008 to July 2019 | Transthoracic echocardiography | Chang Gung Memorial Hospital |
Liao et al [ | ], 202178 | NR | uEGMk | Focal Source and Trigger randomized controlled trial (Chauhan et al [ | ])
Ríos-Muñoz et al [ | ], 202248 | NR | uEGM, bEGMl, and uLATm | Gregorio Marañón University General Hospital |
Alhusseini et al [ | ], 202035 | Up to December 2017 | EGMn | Comparison of Algorithms for Rotational Evaluation in Atrial Fibrillation Registry |
An et al [ | ], 202212 | January 2015 to January 2019 | 3D atrial endocardial unipolar and bipolar voltage map | the Dallas VA Medical Center |
Tang et al [ | ], 2022156 | 2015 to 2017 | EGM, ECGo, age, gender, height, weight, BMI, ethnicity and race, hypertension, hyperlipidemia, TIAp, stroke, CADq, diabetes mellitus, chronic kidney disease, congestive HF, OSAr, type of AF, previous history of AF ablation, LVEF, LADs, LAV, LASAt, and LASIu | Stanford University Tertiary Referral Center |
Li et al [ | ], 201914 | NR | BSPMv | West China Hospital of Sichuan University |
Saiz-Vivo et al [ | ], 202174 | October 2005 to July 2014 | Clinical data and heart rate variability characteristics extracted from ICMw | Reveal LINQ Usability Study and clinical data from a single center |
Jiang et al [ | ], 20231618 | January 2012 to May 2019 | 12-lead ECG | Guangdong Provincial People’s Hospital |
Lee et al [ | ], 2022177 | February 2017 to October 2020 | Age, sex, height, weight, hypertension, AF type, AF duration, LAD, LAMIx, LVEF, and eGFRy | Chungbuk National University Hospital |
Park et al [ | ], 20221872 | 2009 to 2018 | Age, female, AF type, BMI, heart failure, hypertension, diabetes mellitus, stroke or TIA, vascular disease, left atrium dimension, LVEF, EEMz, creatinine, hemoglobin, and preablation PR interval | Yonsei AF Ablation Cohort Database |
Hung et al [ | ], 202011,334 | January 2013 to December 2013 | Age, total hospital discharges, number of diagnoses, number of chronic diseases, length of hospitalization, number of surgeries, gender, diabetes, hypertension, hypothyroidism, COPDaa, renal failure, depression, peripheral vascular disease, obesity, size of hospital beds, type of hospital, and discharge status | National Readmission Database 2013 |
Saglietto et al [ | ], 20233128 | April 2012 to April 2015 | LVEDVab, estimated glomerular filtration rate, BMI, age, pre- and post–left atrium diameters, LVEF, CHAD2 DS2-VASc score, dyslipidemia, AFLac, type of ablation procedure, type of AF, structural heart disease, hypertension, baseline sinus rhythm, gender, abnormal ECG, heart failure, CAD, and smoking history | European Society of Cardiology-European Heart Rhythm Association Long-Term Registry for Atrial Fibrillation Ablation |
Ma et al [ | ], 2023471 | January 2018 to December 2020 | Age, male, BMI, diastolic blood pressure, systolic blood pressure, smoking history, CHA2DS2-VASc, HABLED score, AF duration, hypertension, CAD, type 2 diabetes mellitus, chronic HF, atrial septal defect, LAD, LVEF, and white blood cell counts | The First Affiliated Hospital of Air Force Medical University |
Zhou et al [ | ], 2022310 | Jun 2016 - Oct 2019 | NT-proBNPad, paroxysmal AF, LAAVae, and LAV | Toyo University Ohashi Medical Center |
aPVCT: pulmonary vein computed tomography.
bCT: computed tomography.
cLAV: left atrial volume.
dAF: atrial fibrillation.
eLVEF: left ventricular ejection fraction.
fLGE-MRI: late gadolinium-enhanced cardiac magnetic resonance.
gNR: not reported.
hERP: effective refractory period.
iPVI: pulmonary vein isolation.
jHF: heart failure.
kuEGM: unipolar endocardial electrogram.
lbEGM: bipolar endocardial electrogram.
muLAT: unipolar localized activation time.
nEGM: electrogram.
oECG: electrocardiogram.
pTIA: transient ischemic attack.
qCAD: coronary artery disease.
rOSA: obstructive sleep apnea.
sLAD: left atrial diameter.
tLASA: left atrial surface area.
uLASI: left atrial spherical index.
vBSPM: body surface potential map.
wICM: implantable cardiac monitor.
xLAMI: left atrial mass index.
yeGFR. estimated glomerular filtration rate.
zEEM: peak transmitral flow velocity (E), and tissue Doppler echocardiography of the peak septal mitral annular velocity (em).
aaCOPD: chronic obstructive pulmonary disease.
abLVEDV: left ventricular end-diastolic volume.
acAFL: atrial flutter.
adNT-proBNP: N-terminal pro-brain natriuretic peptide.
aeLAAV: left atrial appendage volume.
The characteristics of the data used in the included studies are provided in
, which summarizes dataset size, data modality, data source, and whether the data were multicenter. There was substantial variation among the included studies in the size of the dataset used to develop the ML model, with patient numbers ranging from 8 to 5872, but few studies adequately described their methodology or reasons for determining the sample size, in contrast to studies of randomized controlled trials based on statistical methods. In terms of data resource availability, only 17% (4/23) of the studies used publicly available datasets of patients with AF (2013 Nationwide Readmissions Database [ ], 2018 Atria Segmentation Data [ , ], and European Society of Cardiology-European Heart Rhythm Association Long-Term Registry for AF Ablation [ ]) for model development or validation, while most of the remaining studies (19/23, 83%) used private single-center retrospective datasets. In addition, 17% (4/23) of the studies used multicenter datasets in model development or external datasets for validation after model development [ , , , ].Characteristics of used data | Studies, n (%) | References | |
Size of dataset | |||
<50 | 5 (22) | [ | , , , , ]|
50-100 | 4 (17) | [ | , , , ]|
101-200 | 5 (22) | [ | , , - ]|
201-500 | 4 (17) | [ | , , , ]|
501-1000 | 1 (4) | [ | ]|
>1000 | 4 (17) | [ | , - ]|
Data modality | |||
Electronic health records | 10 (43) | [ | , , , , - ]|
Medical imaging | 9 (39) | [ | - , - ]|
Electrophysiological signal | 7 (30) | [ | - , , - ]|
Biomedical simulation | 2 (9) | [ | , ]|
Other | 1 (4) | [ | ]|
Multimodal input used | 4 (17) | [ | , , , ]|
Availability of data source | |||
Private | 19 (83) | [ | - , - , ]|
Public | 4 (17) | [ | , , , ]|
Multicenter data used | |||
Yes | 4 (17) | [ | , , , ]|
No | 19 (83) | [ | - , - , - , ]
To provide a more intuitive integration and comparison of the data modalities of the included studies, an evidence heat map was used to visualize the results (
), which highlights and reveals the medical imaging data and electrophysiological signal data involved in the included studies. In studies aimed at predicting patient prognosis, the most commonly used data type was the EHR (10/16, 63%), and it is worth noting that a small number of studies (4/16, 25%) in this segment also used multimodal data, combining either imaging data or electrocardiographic signals (or both) with clinical features to build model, and reported superior model performance relative to their constructed unimodal models. Intracardiac electrograms (EGMs) are a key data modality in studies focused on identifying ablation targets, whereas imaging data are commonly used in studies aimed at improving ablation strategies.
Modeling Approaches and Performance
Distinct medical tasks and research objectives necessitate the application of various ML techniques.
provides the ML models constructed in each study and the specific values of their performance indicators. It should be noted that if >1 ML model was constructed in 1 study at the same time, only the optimal model performance among them will be reported in this review, and the complete information of the ML models is provided in Table S2 in . In this review, if a study reports multiple ML models simultaneously, we will report only the optimal model performance among them. provides details on the used ML techniques, medical tasks, and model performances in the included studies.The classical ML models covered in this review are diverse, with random forest (6/23, 26%) and support vector machines (6/23, 26%) appearing most frequently, and in some studies, although they are not the best-performing models, they have been applied for feature selection or as comparative models. Of the 23 studies, 5 (22%) used the k-nearest neighbors algorithm, 3 (13%) used logistic regression, and 2 (9%) used the decision tree. In addition, 17% (4/23) of the studies used integrated learning models such as XGBoost and AdaBoost. It is important to note that DL models represented by convolutional neural network (CNN) and RestNet were the most frequently used as performance-optimized models in all the included studies, particularly in those studies that used image data and electrical signal data as model inputs, and 61% (14/23) of the studies reported performance-optimized models belonging to the DL type. Specifically, CNN and its variants were applied in 30% (7/23) of the studies, and ResNet was applied in 9% (2/23) of the studies.
In terms of model performance evaluation indicators, the main ones used were area under the receiver operating characteristic curve (AUC; 17/23, 74%), sensitivity and specificity (11/23, 48%), and accuracy (11/23, 48%), while some studies chose to assess the model performance with metrics such as F1-score. From the information in
and , it can be seen that the performance of ML models in the included studies was good overall. Among them, 74% (17/23) of the studies reported the AUC values of the ML models, and the values of this performance indicator ranged from 0.5 to 0.98. The average AUC for ML models predicting patient prognosis was 0.8 (SD=0.11), with a median of 0.83 (IQR 0.74-0.86), and 13% (3/23) of the studies had a model AUC >0.90. Furthermore, 48% (11/23) of the studies reported model sensitivity and specificity, with sensitivity ranging between 0.71 and 0.97 and specificity ranging from 0.79 to 0.99. In terms of accuracy, 48% (11/23) of the studies reported this statistic, with values ranging from 0.72 to 0.99. All models aimed at improving ablation strategies reported accuracy, with an average of 0.87.Medical tasks and proposed model | AUCa | Sensitivity | Specificity | Accuracy | Other | |
Patient prognosisprediction | ||||||
RFb [ | ]0.87 | NRc | NR | NR | NR | |
RF [ | ]0.965 | 0.96 | 0.89 | NR | NR | |
RF [ | ]0.721 | NR | NR | NR | NR | |
RF [ | ]0.667 | NR | NR | NR | NR | |
XGboost [ | ]0.78 | NR | NR | NR | NR | |
QDAd [ | ]0.82 | 0.82 | 0.89 | NR | NR | |
SVMe [ | ]0.85 | NR | NR | NR | F1-score=0.8 | |
CNNf [ | ]0.861 | 0.803 | 0.789 | 0.796 | NR | |
CNN [ | ]0.859 | NR | NR | NR | NR | |
CNN-SVM [ | ]NR | 0.88 | 0.96 | 0.96 | NR | |
CNN [ | ]0.84 | 0.723 | 0.95 | 0.92 | F1-score=0.71 | |
CNN [ | ]0.76 | NR | NR | NR | CIg=0.76 | |
KNNh [ | ]0.91 | 0.713 | 0.991 | 0.854 | PR=0.886 | |
OWVi [ | ]NR | 0.76 | 0.87 | 0.82 | NR | |
MLPj [ | ]0.5 | NR | NR | 0.875 | F1-score=0.933 | |
MLP [ | ]0.766 | NR | NR | NR | NR | |
Ablation targets identification | ||||||
ResNet [ | ]0.980 | 0.781 | 0.822 | NR | NR | |
CRNNk [ | ]0.81 | NR | NR | NR | MCCl=0.680 | |
CNN [ | ]NR | 0.97 | 0.93 | 0.95 | NR | |
Ablation strategy improvement | ||||||
ResNet [ | ]0.88 | 0.75 | 0.957 | 0.886 | NR | |
Reinforcement Q-learning [ | ]NR | NR | NR | 0.72 | NR | |
MVVTm [ | ]NR | 0.92 | 0.99 | 0.99 | DIn=93.11 | |
CNN [ | ]NR | NR | NR | 0.865 | NR |
aAUC: area under the curve.
bRF: random forest.
cNR: not reported.
dQDA: quadratic discriminant analysis.
eSVM: support vector machine.
fCNN: convolutional neural network.
gCI: C-index.
hKNN: K-nearest neighbor.
iOWV: optimal weighted voting.
jMLP: multilayer perceptron.
kCRNN: convolutional recurrent neural network.
lMCC: Matthews correlation coefficient.
mMVTT: fully automatic multiview dual-task recursive attention model.
nDI: Dice score.
Statistics | Studies, n (%) | Medical tasks involved (fraction) and corresponding study reference | |||
AUC (%a) | |||||
<70 | 2 (9) |
| |||
70-79 | 4 (17) |
| |||
80-85 | 4 (17) |
| |||
86-90 | 4 (17) |
| |||
>90 | 3 (13) |
| |||
Sensitivity (%b) | |||||
<85 | 7 (30) |
| |||
85-89 | 1 (4) |
| |||
90-95 | 1 (4) |
| |||
>95 | 2 (9) |
| |||
Specificity (%c) | |||||
<85 | 2 (9) |
| |||
85-89 | 3 (13) |
| |||
90-95 | 2 (9) |
| |||
>95 | 4 (17) |
| |||
Accuracy (%d) | |||||
<80 | 2 (9) |
| |||
80-86 | 2 (9) |
| |||
87-92 | 4 (17) |
| |||
>92 | 3 (13) |
|
aAUC: area under the curve; AUC values were mentioned in 17 included studies.
bSensitivity was mentioned in 11 included studies.
cSpecificity was mentioned in 11 included studies.
dAccuracy was mentioned in 11 included studies.
Findings of the Included Studies
Identifying AFCA Targets
Three studies discussed the application of ML techniques in identifying potential targets for CA in AF [
- ]. Clinically, electrophysiologists must resort to time-consuming and complex invasive cardiac electrophysiological examinations to confirm ablation targets; deep neural networks can help improve this process. Liao et al [ ] applied ResNet to classify raw intracardiac EGMs during AF, accurately identifying patients’ putative focal source target site with performance approximating that of human experts, obtaining an AUC of 0.98. Two other studies [ , ] applied CNN to detecting rotors in the intracardiac EGMs, with one of the them taking into account the interpretability of DL model and using gradient-weighted class-activation mapping (Grad-CAM) to explore how the model classifies AF electrical patterns [ ], and the other designed the convolutional recurrent neural network from the perspective of improving model, while experimenting with multiple types of input signals, resulting in improved classification performance of the model.Improving CA Strategies for AF
Due to varying degrees of individualization among patients, the development of a rational and effective strategy for AFCA relies on the judgment and surgical experience of the physician [
]. Notably, 17% (4/23) of the studies demonstrated the potential of DL to improve this process [ - ]. A common factor among these studies is that they were all based on imaging data. Liu et al [ ] used the patient’s preablation PV computed tomography (CT) images to construct a ResNet-based prediction model, which identified potential non-PV triggering factors before ablation treatment with an AUC of 0.88, thereby prompting physicians to perform necessary additional ablation on the basis of PV isolation. In 3 other studies, cardiac late gadolinium-enhanced cardiac magnetic resonance (LGE-MRI) was used [ - ]. Muffoletto et al [ ] and Muizniece et al [ ] used CNN and reinforcement learning (Q-Learning) algorithms, respectively, to directly engage in the process of formulating personalized ablation strategies for patients with AF and achieved success rates of 79% and 72%, respectively, in their respective test sets. Yang et al [ ] proposed a joint segmentation method based on the multiview two-task recursive attention model to provide clinicians with segmented left ventricular (LV) anatomical structures and LV scars directly and simultaneously from LGE-MRI, which can be used to provide valuable guidance for ablation treatment.Prognosis for Patients With AFCA
The high long-term recurrence rate of CA has been troubling patients and clinicians. Of the 23 included studies, 16 (70%) focused on the prognosis of patients after AFCA. Furthermore, 35% (8/23) of the studies [
, - ] attempted to predict the risk of postprocedural recurrence of AF using easily accessible preprocedural patient clinical characteristics, with commonly used variables including age, gender, BMI, comorbidities, and electrocardiogram (ECG), but their modeling performance varied widely, with reported AUC ranging from 0.667 to 0.965. Four studies [ , , , ] chose to fuse features from multiple modalities as the input of ML model for recurrence prediction of AF. For example, Tang et al [ ] proposed a CNN-based multimodal fusion framework to predict AF recurrence 1 year after CA using patients’ clinical features, EGMs, ECG, and their combinations. The comparison of multiple experimental results showed that the recurrence prediction AUC of the multimodal feature fusion model was improved by 13.2% (0.86 vs 0.76) compared with the single-modality model. Independent prediction of AF recurrence using various types of examination images is also a feasible approach, but the relative performance is not particularly impressive. In contrast, several other studies have shown that models combining radiomics data with common clinical features exhibit predictive performance with better results [ , , ].Quality Assessment
The risk of bias and applicability of the included studies in subgroup A (5/23, 22%; targeting patients classification and image segmentation) were confirmed by using the adapted QUADAS-2 tool. The results presented in
indicate that the “index test” domain is the most susceptible to bias. All studies exhibit a high risk of bias in this domain because they rely solely on single-center data for model development and lack external validation, which is essential to fully assess the model’s generalizability. The applicability of 60% (3/5) of the studies showed unclear or high risk of bias in different fields.For subgroup B (16/23, 70%; targeting patient prognosis prediction), the result indicated that the overall risk of bias and applicability was determined to be low in 25% (4/16) of the studies and high in 69% (11/16) of the studies, and the latter was also due to insufficient consideration of model generalization issues in the assessment of the “analysis” domain, resulting in a high risk of bias. Furthermore, 12% (2/16) of the studies were rated as uncertain risk in terms of applicability due to unclear inclusion criteria for the study participants. The results are shown in
.The detailed risk of bias and applicability assessment results for both subgroups are shown in Figure S1 and Table S5 in
.

Discussion
Principal Findings
ML has shown great advantages in processing and analyzing medical data and has been increasingly applied to solve various medical problems [
], including those in the field of AF. Compared with several existing ML-related reviews in this field [ - ], we focus more on the research of applying ML in the management of patients undergoing AFCA. We systematically evaluated the 23 highly relevant studies and found that ML performed well overall in AFCA patient management. Models constructed in several studies performed as well as human clinicians in identifying potential ablation targets (rotors and focal sources) and assisting in improving ablation strategies (left atrium scar segmentation) [ , , ]. It should be noted that comparisons between all included studies are hampered to some extent by the heterogeneity of ML results, and some findings may require discussion at a smaller granularity.General Overview of the Studies
The results of our review suggest that the application of ML techniques to the prognosis of patients undergoing AFCA seems promising and has received the most attention from researchers within the scope of this review (16/23, 70%). Compared with traditional prediction models, ML models have a better prediction effect on the long-term recurrence of AF after CA. In the past decade, the construction of AF recurrence prediction models was mainly based on 2 methods; the first is to directly use clinical risk scores, such as APPLE [
], CAAP-AF [ ], and MB-LATER [ ]. The second is to use some specific biomarkers of patients with AF (such as LA volume [ ] and LA sphericity index [ ]) or to combine the scoring system with clinical variables and biomarkers for statistical inference. Several different evaluation studies have shown that the AUC range for the prediction of AF recurrence of the models constructed using the abovementioned methods is 0.55 to 0.67 [ , , ], which is lower than the results of ML models reported in most of the included studies. Most of these ML models achieved performance values >0.8 in terms of AUC, accuracy, and F1-score indicators. The reason for the poor predictive performance of the traditional methods may lie in the limitations of the predictor dimensions, as scoring and statistical models usually rely on previous literature to select a limited number of clinical variables and specific markers due to the need to fully consider clinical ease of use and interpretability, which also leads to some extent to the neglect of the interactions between variables.In contrast, ML models are capable of handling much larger and diverse datasets, particularly including imaging and electrophysiologic signal data that were not directly amenable to statistical analysis in the past. Through comprehensive learning of complex data patterns and nonlinear relationships, ML models can achieve more accurate predictive results, which will have many positive impacts for effective intervention in the long-term health of patients undergoing AFCA. On the one hand, the predictive results provided by ML models will guide postprocedure medical management and expectations [
] and help clinicians to take interventions in advance, including medication, lifestyle modifications, or regular follow-ups; on the other hand, the application of ML models can also improve patient engagement in health management and treatment adherence, as in the case of the web calculator developed by Saglietto et al [ ], which allows the free personalized prediction of the probability of recurrence after 1 year of AFCA treatment, serving as a decision-support tool that facilitates patient-centered care and joint decision-making between physicians and patients.Similarly, optimizing CA strategies and identifying potential ablation targets could also benefit from the integration of ML. Because of the different mechanisms of AF triggering and maintenance among patients and the difficulty of identifying patients with one or more of several different mechanisms [
], physicians need to develop ablation strategies and identify ablation targets with the help of different means, such as imaging (including MRI and CT) or invasive electrophysiologic examinations. When dealing with some specific patient groups, such as patients with non–PV-triggered AF, this process not only tests the physician’s clinical experience but also raises the patient’s exposure time to radiation [ ]. Several studies in this review demonstrated that the DL models can identify potential ablation targets based on the patient’s EGM data, with 82.5% and 80.4% accuracy[ , ], or guide CA according to personalized appropriate strategies based on LGE-MRI images [ , ], which is conducive to guiding clinicians to implement targeted treatment and shorten the duration of radiation exposure. In addition, using ML technology and patient preoperative data to determine whether additional ablation is needed on the basis of PV isolation is also meaningful for improving patient selection and reducing the risk of postoperative recurrence in patients [ , ]. With the help of ML technology, physicians were given the opportunity to intuitively gain insights from these data into the mechanisms that initiate or maintain arrhythmias in patients, thereby efficiently locating ablation targets, reducing uncertainty in CA treatment and late recurrence, optimizing the allocation of medical resources, and ultimately improving patients’ clinical outcomes.In terms of model training, using multimodal data from patients with AF to develop ML models might be a promising approach to improve model performance. Models benefit from multimodal data that more fully reflect the demographic characteristics, clinical features, and actual cardiac condition with electrophysiological characteristics of patients with AF, as reported in some previous studies [
, , , ]. In traditional prognostic studies involving patients who have undergone ablation, factors such as AF type, ejection fraction values, LA volume [ ], and obstructive sleep apnea [ ] have been acknowledged as potential predictors that may be independently or jointly associated with recurrence. However, after comprehensively analyzing the cardiac morphological structure, electrophysiological structure, and clinical characteristics with the help of ML, factors that have been ignored, such as fractal characteristics [ , ], have attracted attention. This suggests that ML will reveal previously unknown associations and patterns that will drive new insights and innovations in the management of patients undergoing CA for AF. It is worth noting that the use of ML techniques to process multimodal data is common across the medical field (eg, oncology [ ] and lymphocytosis [ ]), but it needs to be further popularized in the field of AF, particularly for the management of patients undergoing AFCA.In addition, compared with classical ML methods, DL seems to have more obvious advantages in the field of AFCA patient management. As shown in
and , >60% (14/23, 61%) of the studies included in this scoping review used DL technology, and a notable characteristic of this subset of studies is that many involved data modalities beyond general clinical features, encompassing imaging data (CT, echocardiography, etc) or electrophysiological signal (ECG, EGM, etc). This is attributed to DL’s demonstrated prowess in complex feature learning, time series processing, and other aspects, contributing to its gradual emergence as a mainstream technology in the field of computer-aided detection and diagnosis [ ]. In a blinded, randomized controlled trial designed by He et al [ ] for DL algorithm, the initial assessment of LV ejection fraction by DL was even superior to that of sonographers. Therefore, we believe that in the future, there will be more data-driven instances of applying DL to AFCA patient management to promote substantive progress in this field, such as optimizing treatment strategies, reducing the risk of complications, and conducting early intervention in patient populations prone to recurrence.Existing Limitations of the ML Models
We also noted some general limitations to the current use of ML techniques in AFCA patient management, which have hindered its more comprehensive implementation. In using the adapted QUADAS-2 and PROBAST to assess the risk of bias for all included studies, we found that most of the studies (19/23, 83%) were conducted based on single-center, retrospective data and lacked external validation, which raises concerns as to whether ML models can be generalized to a high level. Limited by data availability, some ML models with excellent results will not be perfectly reproduced on other datasets. In addition, the absence of an external validation set may result in overestimating the model’s performance, preventing an accurate assessment of the ML model’s true capability [
]. Therefore, how to break through the limitations of single-center studies is an issue that has to be considered when further promoting the in-depth use of ML.In addition, there remain some difficulties in effectively comparing and evaluating the results of ML models applied to the management of patients undergoing AFCA. We found that even for the similar medical task (eg, patient prognosis prediction or ablation strategy improvement), existing studies still showed significant heterogeneity in the reporting and analysis of results [
, ], which hinders objective comparisons between the results of different ML models.It is also worth noting that when considering the use and introduction of ML techniques from a practical perspective, we should correctly recognize that there are differences in the level of trust and acceptance of ML among different individuals (both physicians and patients). This phenomenon is partly caused by the “black box” problem of some DL models [
], which was rarely discussed in the studies included in this scoping review. Without the ability to understand how models make decisions, physicians will be more likely to trust their own clinical experience and medical knowledge to confirm treatment decisions based on interviews, symptom judgments, and clinical examinations, as they may not be willing to take responsibility for suboptimal results of ML models [ ]. More critically, the “black box” prevents clinicians and researchers from fully understanding or explaining the principles behind model decisions [ ], which is a limitation that cannot be ignored for research with the original intention of explaining medical problems.Future Research
In future studies, the composition of the patient cohort and the construction of the dataset are primary issues that need to be considered. The challenges of conducting multicenter studies primarily stem from the legal, ethical, and privacy constraints associated with sharing raw AFCA patient data [
]. Distributed learning, represented by paradigms such as integrated learning and federated learning, would be a strategy for future studies to realize effective collaboration among multicenters [ ]. In a network of distributed learning nodes, individual institutions do not need to share patient data externally. Instead, they contribute to constructing the final model across multiple centers by updating the model (eg, adjusting the weighting parameters) [ ].In addition, efforts should be accelerated to develop standardized protocols and consensus guidelines for the evaluation of ML models, thus helping clinicians in selecting more appropriate ML techniques to address specific clinical tasks in the management of patients undergoing AFCA [
, ].To address the issue of limited clinical interpretability of ML models, commonly used techniques in medical research, such as Shapley Additive Explanation and attention mechanisms [
, - ], can generate input importance maps to highlight the features that contribute most to model detection. We believe that more research will be devoted to opening this “black box” in the future [ ]. Whether it is physicians, patients, or researchers, the real hope is that ML will be able to prompt as much human-understandable auxiliary information as possible to prove its usefulness as an additional decision point in the complex, changing, practice-based patient management environment of AFCA.Strengths and Limitations
To the best of the authors’ knowledge, this is the first scoping review of studies applying ML techniques to the management of patients undergoing AFCA. We summarized the various ML methods, variables, model results, and clinical application scenarios that have emerged; emphasized the huge potential for further in-depth application of ML and the limitations that need to be overcome; and provided guidance for future research in terms of data preparation and model selection. Moreover, some sources of evidence were critically evaluated using modified QUADAS-2 and PROBAST to make the conclusions of the review more reliable and convincing. This study has several limitations. First, the results of the included studies could not be synthesized into quantitative analysis. Although all studies reported model performance, different indicators were not directly comparable. Second, this review only focused on studies published in English and did not cover other languages and gray literature, which may have led to the omission of some important findings in this field.
Conclusions
The rapid development of ML has injected vitality into the transformation of AFCA patient management to become more precise and reliable. It could improve the identification of ablation targets, ablation strategies, and prediction of patient prognosis, which will effectively improve treatment effectiveness and patient management efficiency. Despite its broad prospects, the widespread and in-depth application of ML technology in AFCA patient management still requires consideration of issues from data and the model itself, such as data quality, model generalization, and model interpretability.
Acknowledgments
This study was supported by the Key Laboratory of Medical Information Research (Central South University), college of Hunan Province, as part of the project Clinical Research Center for Cardiovascular Intelligent Healthcare in Hunan Province (agreement 2021SK4005).
Data Availability
All data generated or analyzed during this study are included in this published article and
and .Authors' Contributions
AL and ZX contributed to the conception and design of the study. WC, HZ, WX, XC, and ZL developed the search strategy. WC, XC, ZL, and ZX contributed to the literature selection and data extraction. AL, WC, HZ, WX, and ZX contributed to risk of bias and applicability evaluation. AL, WC, HZ, WX, and ZX wrote the first draft of the manuscript and edited the manuscript. All authors contributed to critical revision of the manuscript. All authors have read and agreed to the published version of the manuscript.
Conflicts of Interest
None declared.
Multimedia Appendix 2
PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) checklist.
PDF File (Adobe PDF File), 516 KBReferences
- Brundel BJ, Ai X, Hills MT, Kuipers MF, Lip GY, de Groot NM. Atrial fibrillation. Nat Rev Dis Primers. Apr 07, 2022;8(1):21. [CrossRef] [Medline]
- Dong XJ, Wang BB, Hou FF, Jiao Y, Li HW, Lv SP, et al. Global burden of atrial fibrillation/atrial flutter and its attributable risk factors from 1990 to 2019. Europace. Mar 30, 2023;25(3):793-803. [FREE Full text] [CrossRef] [Medline]
- Elliott AD, Middeldorp ME, Van Gelder IC, Albert CM, Sanders P. Epidemiology and modifiable risk factors for atrial fibrillation. Nat Rev Cardiol. Jun 2023;20(6):404-417. [CrossRef] [Medline]
- January CT, Wann LS, Alpert JS, Calkins H, Cigarroa JE, Cleveland JC, et al. American College of Cardiology/American Heart Association Task Force on Practice Guidelines. 2014 AHA/ACC/HRS guideline for the management of patients with atrial fibrillation: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines and the Heart Rhythm Society. J Am Coll Cardiol. Dec 02, 2014;64(21):e1-76. [FREE Full text] [CrossRef] [Medline]
- Seiffge DJ, Werring DJ, Paciaroni M, Dawson J, Warach S, Milling TJ, et al. Timing of anticoagulation after recent ischaemic stroke in patients with atrial fibrillation. Lancet Neurol. Jan 2019;18(1):117-126. [FREE Full text] [CrossRef] [Medline]
- Pabel S, Knierim M, Stehle T, Alebrand F, Paulus M, Sieme M, et al. Effects of atrial fibrillation on the human ventricle. Circ Res. Apr 2022;130(7):994-1010. [CrossRef]
- Frederiksen TC, Dahm CC, Preis SR, Lin H, Trinquart L, Benjamin EJ, et al. The bidirectional association between atrial fibrillation and myocardial infarction. Nat Rev Cardiol. Sep 17, 2023;20(9):631-644. [CrossRef] [Medline]
- Zoni-Berisso M, Lercari F, Carazza T, Domenicucci S. Epidemiology of atrial fibrillation: European perspective. Clin Epidemiol. 2014;6:213-220. [FREE Full text] [CrossRef] [Medline]
- Burdett P, Lip GY. Atrial fibrillation in the UK: predicting costs of an emerging epidemic recognizing and forecasting the cost drivers of atrial fibrillation-related costs. Eur Heart J Qual Care Clin Outcomes. Mar 02, 2022;8(2):187-194. [CrossRef] [Medline]
- Dewire J, Calkins H. Update on atrial fibrillation catheter ablation technologies and techniques. Nat Rev Cardiol. Oct 27, 2013;10(10):599-612. [CrossRef] [Medline]
- Cappato R, Calkins H, Chen SA, Davies W, Iesaka Y, Kalman J, et al. Updated worldwide survey on the methods, efficacy, and safety of catheter ablation for human atrial fibrillation. Circ Arrhythm Electrophysiol. Feb 2010;3(1):32-38. [CrossRef]
- Wazni OM, Marrouche NF, Martin DO, Verma A, Bhargava M, Saliba W, et al. Radiofrequency ablation vs antiarrhythmic drugs as first-line treatment of symptomatic atrial fibrillation: a randomized trial. JAMA. Jun 01, 2005;293(21):2634-2640. [CrossRef] [Medline]
- Ouyang F, Tilz R, Chun J, Schmidt B, Wissner E, Zerm T, et al. Long-term results of catheter ablation in paroxysmal atrial fibrillation: lessons from a 5-year follow-up. Circulation. Dec 07, 2010;122(23):2368-2377. [CrossRef]
- Medi C, Sparks PB, Morton JB, Kistler PM, Halloran K, Rosso R, et al. Pulmonary vein antral isolation for paroxysmal atrial fibrillation: results from long-term follow-up. J Cardiovasc Electrophysiol. Feb 2011;22(2):137-141. [CrossRef] [Medline]
- Gaita F, Caponi D, Scaglione M, Montefusco A, Corleto A, Di Monte F, et al. Long-term clinical results of 2 different ablation strategies in patients with paroxysmal and persistent atrial fibrillation. Circ Arrhythm Electrophysiol. Oct 2008;1(4):269-275. [CrossRef]
- Latchamsetty R, Morady F. Atrial fibrillation ablation. Annu Rev Med. Jan 29, 2018;69(1):53-63. [CrossRef] [Medline]
- Parameswaran R, Al-Kaisey AM, Kalman JM. Catheter ablation for atrial fibrillation: current indications and evolving technologies. Nat Rev Cardiol. Mar 13, 2021;18(3):210-225. [CrossRef] [Medline]
- Lee KN, Roh SY, Baek YS, Park H, Ahn J, Kim D, et al. Long-term clinical comparison of procedural end points after pulmonary vein isolation in paroxysmal atrial fibrillation. Circ Arrhythm Electrophysiol. Feb 2018;11(2):e005019. [CrossRef]
- Kornej J, Hindricks G, Shoemaker MB, Husser D, Arya A, Sommer P, et al. The APPLE score: a novel and simple score for the prediction of rhythm outcomes after catheter ablation of atrial fibrillation. Clin Res Cardiol. Oct 17, 2015;104(10):871-876. [FREE Full text] [CrossRef] [Medline]
- Jacobs V, May HT, Bair TL, Crandall BG, Cutler M, Day JD, et al. The impact of risk score (CHADS2 versus CHA2DS2-VASc) on long-term outcomes after atrial fibrillation ablation. Heart Rhythm. Apr 2015;12(4):681-686. [CrossRef] [Medline]
- Kornej J, Hindricks G, Kosiuk J, Arya A, Sommer P, Husser D, et al. Comparison of CHADS2, R2CHADS2, and CHA2DS2-VASc scores for the prediction of rhythm outcomes after catheter ablation of atrial fibrillation: the Leipzig Heart Center AF ablation registry. Circ Arrhythm Electrophysiol. Apr 2014;7(2):281-287. [CrossRef]
- Mulder MJ, Kemme MJ, Hopman LH, Kuşgözoğlu E, Gülçiçek H, van de Ven PM, et al. Comparison of the predictive value of ten risk scores for outcomes of atrial fibrillation patients undergoing radiofrequency pulmonary vein isolation. Int J Cardiol. Dec 01, 2021;344:103-110. [FREE Full text] [CrossRef] [Medline]
- Mohsen F, Al-Absi HR, Yousri NA, El Hajj N, Shah Z. A scoping review of artificial intelligence-based methods for diabetes risk prediction. NPJ Digit Med. Oct 25, 2023;6(1):197. [FREE Full text] [CrossRef] [Medline]
- He J, Baxter SL, Xu J, Xu J, Zhou X, Zhang K. The practical implementation of artificial intelligence technologies in medicine. Nat Med. Jan 2019;25(1):30-36. [FREE Full text] [CrossRef] [Medline]
- Deo RC. Machine learning in medicine. Circulation. Nov 17, 2015;132(20):1920-1930. [FREE Full text] [CrossRef] [Medline]
- Gao Y, Zeng S, Xu X, Li H, Yao S, Song K, et al. Deep learning-enabled pelvic ultrasound images for accurate diagnosis of ovarian cancer in China: a retrospective, multicentre, diagnostic study. Lancet Digit Health. Mar 2022;4(3):e179-e187. [FREE Full text] [CrossRef] [Medline]
- Khurshid S, Friedman S, Reeder C, Di Achille P, Diamant N, Singh P, et al. ECG-based deep learning and clinical risk factors to predict atrial fibrillation. Circulation. Jan 11, 2022;145(2):122-133. [CrossRef]
- Tricco AC, Lillie E, Zarin W, O'Brien KK, Colquhoun H, Levac D, et al. PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. Oct 02, 2018;169(7):467-473. [FREE Full text] [CrossRef] [Medline]
- Whiting PF, Rutjes AW, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2 Group. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. Oct 18, 2011;155(8):529-536. [FREE Full text] [CrossRef] [Medline]
- Moons KG, Wolff RF, Riley RD, Whiting PF, Westwood M, Collins GS, et al. PROBAST: a tool to assess risk of bias and applicability of prediction model studies: explanation and elaboration. Ann Intern Med. Jan 01, 2019;170(1):W1-33. [FREE Full text] [CrossRef] [Medline]
- Firouznia M, Feeny AK, LaBarbera MA, McHale M, Cantlay C, Kalfas N, et al. Machine learning–derived fractal features of shape and texture of the left atrium and pulmonary veins from cardiac computed tomography scans are associated with risk of recurrence of atrial fibrillation postablation. Circ Arrhythm Electrophysiol. Mar 2021;14(3):e009265. [CrossRef]
- Atta-Fosu T, LaBarbera M, Ghose S, Schoenhagen P, Saliba W, Tchou PJ, et al. A new machine learning approach for predicting likelihood of recurrence following ablation for atrial fibrillation from CT. BMC Med Imaging. Mar 09, 2021;21(1):45. [FREE Full text] [CrossRef] [Medline]
- Shade JK, Ali RL, Basile D, Popescu D, Akhtar T, Marine JE, et al. Preprocedure application of machine learning and mechanistic simulations predicts likelihood of paroxysmal atrial fibrillation recurrence following pulmonary vein isolation. Circ Arrhythm Electrophysiol. Jul 2020;13(7):e008213. [CrossRef]
- Roney CH, Sim I, Yu J, Beach M, Mehta A, Alonso Solis-Lemus J, et al. Predicting atrial fibrillation recurrence by combining population data and virtual cohorts of patient-specific left atrial models. Circ Arrhythm Electrophysiol. Feb 2022;15(2):e010253. [CrossRef]
- Hwang YT, Lee HL, Lu CH, Chang P, Wo H, Liu H, et al. A novel approach for predicting atrial fibrillation recurrence after ablation using deep convolutional neural networks by assessing left atrial curved m-mode speckle-tracking images. Front Cardiovasc Med. Jan 22, 2020;7:605642. [FREE Full text] [CrossRef] [Medline]
- An Q, McBeth R, Zhou H, Lawlor B, Nguyen D, Jiang S, et al. Prediction of type and recurrence of atrial fibrillation after catheter ablation via left atrial electroanatomical voltage mapping registration and multilayer perceptron classification: a retrospective study. Sensors (Basel). May 27, 2022;22(11):4058. [FREE Full text] [CrossRef] [Medline]
- Tang S, Razeghi O, Kapoor R, Alhusseini MI, Fazal M, Rogers AJ, et al. Machine learning–enabled multimodal fusion of intra-atrial and body surface signals in prediction of atrial fibrillation ablation outcomes. Circ Arrhythm Electrophysiol. Aug 2022;15(8):e010850. [CrossRef]
- Li Z, Feng X, Wu Z, Yang C, Bai B, Yang Q. Classification of atrial fibrillation recurrence based on a convolution neural network with SVM architecture. IEEE Access. 2019;7:77849-77856. [CrossRef]
- Saiz-Vivo J, Corino VD, Hatala R, de Melis M, Mainardi LT. Heart rate variability and clinical features as predictors of atrial fibrillation recurrence after catheter ablation: a pilot study. Front Physiol. May 25, 2021;12:672896. [FREE Full text] [CrossRef] [Medline]
- Jiang J, Deng H, Liao H, Fang X, Zhan X, Wei W, et al. An artificial intelligence-enabled ECG algorithm for predicting the risk of recurrence in patients with paroxysmal atrial fibrillation after catheter ablation. J Clin Med. Mar 01, 2023;12(5):1933. [CrossRef] [Medline]
- Lee DI, Park MJ, Choi JW, Park S. Deep learning model for predicting rhythm outcomes after radiofrequency catheter ablation in patients with atrial fibrillation. J Healthc Eng. Sep 10, 2022;2022:2863495-2863410. [FREE Full text] [CrossRef] [Medline]
- Park JW, Kwon OS, Shim J, Hwang I, Kim YG, Yu HT, et al. Machine learning-predicted progression to permanent atrial fibrillation after catheter ablation. Front Cardiovasc Med. Feb 16, 2022;9:813914. [FREE Full text] [CrossRef] [Medline]
- Hung M, Lauren E, Hon E, Xu J, Ruiz-Negrón B, Rosales M, et al. Using machine learning to predict 30-day hospital readmissions in patients with atrial fibrillation undergoing catheter ablation. J Pers Med. Aug 09, 2020;10(3):82. [FREE Full text] [CrossRef] [Medline]
- Saglietto A, Gaita F, Blomstrom-Lundqvist C, Arbelo E, Dagres N, Brugada J, et al. AFA-Recur: an ESC EORP AFA-LT registry machine-learning web calculator predicting atrial fibrillation recurrence after ablation. Europace. Feb 08, 2023;25(1):92-100. [FREE Full text] [CrossRef] [Medline]
- Ma Y, Zhang D, Xu J, Pang H, Hu M, Li J, et al. Explainable machine learning model reveals its decision-making process in identifying patients with paroxysmal atrial fibrillation at high risk for recurrence after catheter ablation. BMC Cardiovasc Disord. Feb 17, 2023;23(1):91. [FREE Full text] [CrossRef] [Medline]
- Zhou X, Nakamura K, Sahara N, Takagi T, Toyoda Y, Enomoto Y, et al. Deep learning-based recurrence prediction of atrial fibrillation after catheter ablation. Circ J. Jan 25, 2022;86(2):299-308. [FREE Full text] [CrossRef] [Medline]
- Ríos-Muñoz GR, Fernández-Avilés F, Arenal Á. Convolutional neural networks for mechanistic driver detection in atrial fibrillation. Int J Mol Sci. Apr 11, 2022;23(8):4216. [FREE Full text] [CrossRef] [Medline]
- Alhusseini MI, Abuzaid F, Rogers AJ, Zaman JA, Baykaner T, Clopton P, et al. Machine learning to classify intracardiac electrical patterns during atrial fibrillation. Circ Arrhythm Electrophysiol. Aug 2020;13(8):e008160. [CrossRef]
- Liao S, Ragot D, Nayyar S, Suszko A, Zhang Z, Wang B, et al. Deep learning classification of unipolar electrograms in human atrial fibrillation: application in focal source mapping. Front Physiol. Jul 30, 2021;12:704122. [FREE Full text] [CrossRef] [Medline]
- Liu CM, Chang SL, Chen HH, Chen WS, Lin YJ, Lo LW, et al. The clinical application of the deep learning technique for predicting trigger origins in patients with paroxysmal atrial fibrillation with catheter ablation. Circ Arrhythm Electrophysiol. Nov 2020;13(11):e008518. [CrossRef] [Medline]
- Muizniece L, Bertagnoli A, Qureshi A, Zeidan A, Roy A, Muffoletto M, et al. Reinforcement learning to improve image-guidance of ablation therapy for atrial fibrillation. Front Physiol. Aug 25, 2021;12:733139. [CrossRef] [Medline]
- Muffoletto M, Qureshi A, Zeidan A, Muizniece L, Fu X, Zhao J, et al. Toward patient-specific prediction of ablation strategies for atrial fibrillation using deep learning. Front Physiol. May 26, 2021;12:674106. [FREE Full text] [CrossRef] [Medline]
- Yang G, Chen J, Gao Z, Li S, Ni H, Angelini E, et al. Simultaneous left atrium anatomy and scar segmentations via deep learning in multiview information with attention. Future Gener Comput Syst. Jun 2020;107:215-228. [FREE Full text] [CrossRef] [Medline]
- Chauhan VS, Verma A, Nayyar S, Timmerman N, Tomlinson G, Porta-Sanchez A, et al. Focal source and trigger mapping in atrial fibrillation: randomized controlled trial evaluating a novel adjunctive ablation strategy. Heart Rhythm. May 2020;17(5):683-691. [CrossRef] [Medline]
- Olier I, Ortega-Martorell S, Pieroni M, Lip GY. How machine learning is impacting research in atrial fibrillation: implications for risk prediction and future management. Cardiovasc Res. Jun 16, 2021;117(7):1700-1717. [FREE Full text] [CrossRef] [Medline]
- Wegner FK, Plagwitz L, Doldi F, Ellermann C, Willy K, Wolfes J, et al. Machine learning in the detection and management of atrial fibrillation. Clin Res Cardiol. Sep 2022;111(9):1010-1017. [FREE Full text] [CrossRef] [Medline]
- Siontis KC, Yao X, Pirruccello JP, Philippakis AA, Noseworthy PA. How will machine learning inform the clinical care of atrial fibrillation? Circ Res. Jun 19, 2020;127(1):155-169. [CrossRef]
- Winkle RA, Jarman JW, Mead RH, Engel G, Kong MH, Fleming W, et al. Predicting atrial fibrillation ablation outcome: the CAAP-AF score. Heart Rhythm. Nov 2016;13(11):2119-2125. [FREE Full text] [CrossRef] [Medline]
- Mujović N, Marinković M, Marković N, Shantsila A, Lip GY, Potpara TS. Prediction of very late arrhythmia recurrence after radiofrequency catheter ablation of atrial fibrillation: the MB-LATER clinical score. Sci Rep. Jan 20, 2017;7(1):40828. [FREE Full text] [CrossRef] [Medline]
- Chollet L, Iqbal SU, Wittmer S, Thalmann G, Madaffari A, Kozhuharov N, et al. Impact of atrial fibrillation phenotype and left atrial volume on outcome after pulmonary vein isolation. Europace. Mar 30, 2024;26(4):258. [FREE Full text] [CrossRef] [Medline]
- Shi J, Xu S, Chen L, Wu B, Yang K, Chen S, et al. Impact of left atrial sphericity index on the outcome of catheter ablation for atrial fibrillation. J Cardiovasc Transl Res. Oct 06, 2021;14(5):912-920. [CrossRef] [Medline]
- Della Rocca DG, Di Biase L, Mohanty S, Trivedi C, Gianni C, Romero J, et al. Targeting non-pulmonary vein triggers in persistent atrial fibrillation: results from a prospective, multicentre, observational registry. Europace. Dec 07, 2021;23(12):1939-1949. [CrossRef] [Medline]
- Balk EM, Garlitski AC, Alsheikh-Ali AA, Terasawa T, Chung M, Ip S. Predictors of atrial fibrillation recurrence after radiofrequency catheter ablation: a systematic review. J Cardiovasc Electrophysiol. Nov 29, 2010;21(11):1208-1216. [CrossRef] [Medline]
- Boehm KM, Khosravi P, Vanguri R, Gao J, Shah SP. Harnessing multimodal data integration to advance precision oncology. Nat Rev Cancer. Feb 18, 2022;22(2):114-126. [FREE Full text] [CrossRef] [Medline]
- Sahasrabudhe M, Sujobert P, Zacharaki EI, Maurin E, Grange B, Jallades L, et al. Deep multi-instance learning using multi-modal data for diagnosis of lymphocytosis. IEEE J Biomed Health Inform. Jun 2021;25(6):2125-2136. [CrossRef] [Medline]
- Feeny AK, Chung MK, Madabhushi A, Attia ZI, Cikes M, Firouznia M, et al. Artificial intelligence and machine learning in arrhythmias and cardiac electrophysiology. Circ Arrhythm Electrophysiol. Aug 2020;13(8):e007952. [FREE Full text] [CrossRef] [Medline]
- He B, Kwan AC, Cho JH, Yuan N, Pollick C, Shiota T, et al. Blinded, randomized trial of sonographer versus AI cardiac function assessment. Nature. Apr 05, 2023;616(7957):520-524. [FREE Full text] [CrossRef] [Medline]
- Ho SY, Phua K, Wong L, Bin Goh WW. Extensions of the external validation for checking learned model interpretability and generalizability. Patterns (N Y). Nov 13, 2020;1(8):100129. [FREE Full text] [CrossRef] [Medline]
- Rudin C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat Mach Intell. May 13, 2019;1(5):206-215. [FREE Full text] [CrossRef] [Medline]
- Rasheed K, Qayyum A, Ghaly M, Al-Fuqaha A, Razi A, Qadir J. Explainable, trustworthy, and ethical machine learning for healthcare: a survey. Comput Biol Med. Oct 2022;149:106043. [FREE Full text] [CrossRef] [Medline]
- Pesapane F, Volonté C, Codari M, Sardanelli F. Artificial intelligence as a medical device in radiology: ethical and regulatory issues in Europe and the United States. Insights Imaging. Oct 2018;9(5):745-753. [FREE Full text] [CrossRef] [Medline]
- Kirienko M, Sollini M, Ninatti G, Loiacono D, Giacomello E, Gozzi N, et al. Distributed learning: a reliable privacy-preserving strategy to change multicenter collaborations using AI. Eur J Nucl Med Mol Imaging. Nov 13, 2021;48(12):3791-3804. [FREE Full text] [CrossRef] [Medline]
- Rieke N, Hancox J, Li W, Milletarì F, Roth HR, Albarqouni S, et al. The future of digital health with federated learning. NPJ Digit Med. Sep 14, 2020;3(1):119. [FREE Full text] [CrossRef] [Medline]
- Chakraborty C, Bhattacharya M, Pal S, Lee S. From machine learning to deep learning: advances of the recent data-driven paradigm shift in medicine and healthcare. Curr Res Biotechnol. 2024;7:100164. [CrossRef]
- Rubinger L, Gazendam A, Ekhtiari S, Bhandari M. Machine learning and artificial intelligence in research and healthcare. Injury. May 2023;54 Suppl 3:S69-S73. [CrossRef] [Medline]
- Elul Y, Rosenberg AA, Schuster A, Bronstein AM, Yaniv Y. Meeting the unmet needs of clinicians from AI systems showcased for cardiology with deep-learning-based ECG analysis. Proc Natl Acad Sci U S A. Jun 15, 2021;118(24):e1541. [FREE Full text] [CrossRef] [Medline]
- Tseng PY, Chen YT, Wang CH, Chiu KM, Peng YS, Hsu SP, et al. Prediction of the development of acute kidney injury following cardiac surgery by machine learning. Crit Care. Jul 31, 2020;24(1):478. [FREE Full text] [CrossRef] [Medline]
- Zhao Y, Cao LY, Zhao YX, Wang F, Xie L, Xing H, et al. Medical record data-enabled machine learning can enhance prediction of left atrial appendage thrombosis in nonvalvular atrial fibrillation. Thromb Res. Mar 2023;223:174-183. [CrossRef]
- Holzinger A, Langs G, Denk H, Zatloukal K, Müller H. Causability and explainability of artificial intelligence in medicine. Wiley Interdiscip Rev Data Min Knowl Discov. Apr 02, 2019;9(4):e1312. [FREE Full text] [CrossRef] [Medline]
Abbreviations
AF: atrial fibrillation |
AFCA: atrial fibrillation catheter ablation |
AUC: area under the receiver operating characteristic curve |
CA: catheter ablation |
CNN: convolutional neural network |
CT: computed tomography |
DL: deep learning |
ECG: electrocardiogram |
EGM: electrogram |
EHR: electronic health record |
LGE-MRI: late gadolinium-enhanced cardiac magnetic resonance |
LV: left ventricular |
ML: machine learning |
PRISMA-ScR: Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews |
PROBAST: Prediction model Risk Of Bias Assessment Tool |
PV: pulmonary vein |
QUADAS-2: Quality Assessment of Diagnostic Accuracy Studies-2 |
Edited by A Mavragani; submitted 24.05.24; peer-reviewed by S Mao, W-C Lee, L Dall’Olio; comments to author 02.12.24; revised version received 21.12.24; accepted 30.12.24; published 10.02.25.
Copyright©Aijing Luo, Wei Chen, Hongtao Zhu, Wenzhao Xie, Xi Chen, Zhenjiang Liu, Zirui Xin. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 10.02.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.