%0 Journal Article %@ 1438-8871 %I JMIR Publications %V 27 %N %P e66491 %T Artificial Intelligence Models for Pediatric Lung Sound Analysis: Systematic Review and Meta-Analysis %A Park,Ji Soo %A Park,Sa-Yoon %A Moon,Jae Won %A Kim,Kwangsoo %A Suh,Dong In %+ Department of Pediatrics, Seoul National University College of Medicine, 101, Daehak-Ro Jongno-Gu, Seoul, 03080, Republic of Korea, 82 2 2072 362, dongins0@snu.ac.kr %K machine learning %K respiratory disease classification %K wheeze detection %K auscultation %K mel-spectrogram %K abnormal lung sound detection %K artificial intelligence %K pediatric %K lung sound analysis %K systematic review %K asthma %K pneumonia %K children %K morbidity %K mortality %K diagnostic %K respiratory pathology %D 2025 %7 18.4.2025 %9 Review %J J Med Internet Res %G English %X Background: Pediatric respiratory diseases, including asthma and pneumonia, are major causes of morbidity and mortality in children. Auscultation of lung sounds is a key diagnostic tool but is prone to subjective variability. The integration of artificial intelligence (AI) and machine learning (ML) with electronic stethoscopes offers a promising approach for automated and objective lung sound. Objective: This systematic review and meta-analysis assess the performance of ML models in pediatric lung sound analysis. The study evaluates the methodologies, model performance, and database characteristics while identifying limitations and future directions for clinical implementation. Methods: A systematic search was conducted in Medline via PubMed, Embase, Web of Science, OVID, and IEEE Xplore for studies published between January 1, 1990, and December 16, 2024. Inclusion criteria are as follows: studies developing ML models for pediatric lung sound classification with a defined database, physician-labeled reference standard, and reported performance metrics. Exclusion criteria are as follows: studies focusing on adults, cardiac auscultation, validation of existing models, or lacking performance metrics. Risk of bias was assessed using a modified Quality Assessment of Diagnostic Accuracy Studies (version 2) framework. Data were extracted on study design, dataset, ML methods, feature extraction, and classification tasks. Bivariate meta-analysis was performed for binary classification tasks, including wheezing and abnormal lung sound detection. Results: A total of 41 studies met the inclusion criteria. The most common classification task was binary detection of abnormal lung sounds, particularly wheezing. Pooled sensitivity and specificity for wheeze detection were 0.902 (95% CI 0.726-0.970) and 0.955 (95% CI 0.762-0.993), respectively. For abnormal lung sound detection, pooled sensitivity was 0.907 (95% CI 0.816-0.956) and specificity 0.877 (95% CI 0.813-0.921). The most frequently used feature extraction methods were Mel-spectrogram, Mel-frequency cepstral coefficients, and short-time Fourier transform. Convolutional neural networks were the predominant ML model, often combined with recurrent neural networks or residual network architectures. However, high heterogeneity in dataset size, annotation methods, and evaluation criteria were observed. Most studies relied on small, single-center datasets, limiting generalizability. Conclusions: ML models show high accuracy in pediatric lung sound analysis, but face limitations due to dataset heterogeneity, lack of standard guidelines, and limited external validation. Future research should focus on standardized protocols and the development of large-scale, multicenter datasets to improve model robustness and clinical implementation. %R 10.2196/66491 %U https://www.jmir.org/2025/1/e66491 %U https://doi.org/10.2196/66491