Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study

doi:10.2196/49324

Journals

Sallam M, Barakat M, Sallam M. A Preliminary Checklist (METRICS) to Standardize the Design and Reporting of Studies on Generative Artificial Intelligence–Based Models in Health Care Education and Practice: Development Study Involving a Literature Review. Interactive Journal of Medical Research 2024;13:e54704 View
Rudroff T. Revealing the Complexity of Fatigue: A Review of the Persistent Challenges and Promises of Artificial Intelligence. Brain Sciences 2024;14(2):186 View
Marchi F, Bellini E, Iandelli A, Sampieri C, Peretti G. Exploring the landscape of AI-assisted decision-making in head and neck cancer treatment: a comparative analysis of NCCN guidelines and ChatGPT responses. European Archives of Oto-Rhino-Laryngology 2024;281(4):2123 View
Berrezueta-Guzman S, Kandil M, Martín-Ruiz M, Pau de la Cruz I, Krusche S. Future of ADHD Care: Evaluating the Efficacy of ChatGPT in Therapy Enhancement. Healthcare 2024;12(6):683 View
Litvin A, Stoma I, Sharshakova T, Rumovskaya S, Kyovalev A. New possibilities of artificial intelligence in medicine: a narrative review. Health and Ecology Issues 2024;21(1):7 View
Omar M, Brin D, Glicksberg B, Klang E. Utilizing natural language processing and large language models in the diagnosis and prediction of infectious diseases: A systematic review. American Journal of Infection Control 2024;52(9):992 View
Bonnechère B. Unlocking the Black Box? A Comprehensive Exploration of Large Language Models in Rehabilitation. American Journal of Physical Medicine & Rehabilitation 2024;103(6):532 View
Naqvi W, Shaikh S, Mishra G. Large language models in physical therapy: time to adapt and adept. Frontiers in Public Health 2024;12 View
Leypold T, Lingens L, Beier J, Boos A. Integrating AI in Lipedema Management: Assessing the Efficacy of GPT-4 as a Consultation Assistant. Life 2024;14(5):646 View
Tailor P, D'Souza H, Li H, Starr M. Vision of the future: large language models in ophthalmology. Current Opinion in Ophthalmology 2024;35(5):391 View
Tan S, Xin X, Wu D. ChatGPT in medicine: prospects and challenges: a review article. International Journal of Surgery 2024;110(6):3701 View
Goktas P, Gulseren D, Tobin A. Large Language and Vision Assistant in dermatology: a game changer or just hype?. Clinical and Experimental Dermatology 2024;49(8):783 View
Letterie G. Moonshot. Long shot. Or sure shot. What needs to happen to realize the full potential of AI in the fertility sector?. Human Reproduction 2024;39(9):1863 View
Cong Y, LaCroix A, Lee J. Clinical efficacy of pre-trained large language models through the lens of aphasia. Scientific Reports 2024;14(1) View
Luo M, Pang J, Bi S, Lai Y, Zhao J, Shang Y, Cui T, Yang Y, Lin Z, Zhao L, Wu X, Lin D, Chen J, Lin H. Development and Evaluation of a Retrieval-Augmented Large Language Model Framework for Ophthalmology. JAMA Ophthalmology 2024;142(9):798 View
Leypold T, Schäfer B, Boos A, Beier J. Artificial Intelligence-Powered Hand Surgery Consultation: GPT-4 as an Assistant in a Hand Surgery Outpatient Clinic. The Journal of Hand Surgery 2024;49(11):1078 View
Yang Z, Wang D, Zhou F, Song D, Zhang Y, Jiang J, Kong K, Liu X, Qiao Y, Chang R, Han Y, Li F, Tham C, Zhang X. Understanding natural language: Potential application of large language models to ophthalmology. Asia-Pacific Journal of Ophthalmology 2024;13(4):100085 View
Labinsky H, Nagler L, Krusche M, Griewing S, Aries P, Kroiß A, Strunz P, Kuhn S, Schmalzing M, Gernert M, Knitza J. Vignette-based comparative analysis of ChatGPT and specialist treatment decisions for rheumatic patients: results of the Rheum2Guide study. Rheumatology International 2024;44(10):2043 View
Wang Y, Liang L, Li R, Wang Y, Hao C. Comparison of the Performance of ChatGPT, Claude and Bard in Support of Myopia Prevention and Control. Journal of Multidisciplinary Healthcare 2024;Volume 17:3917 View
Shapiro J, Lyakhovitsky A. Revolutionizing teledermatology: Exploring the integration of artificial intelligence, including Generative Pre-trained Transformer chatbots for artificial intelligence-driven anamnesis, diagnosis, and treatment plans. Clinics in Dermatology 2024;42(5):492 View
Zheng Y, Gan W, Chen Z, Qi Z, Liang Q, Yu P. Large language models for medicine: a survey. International Journal of Machine Learning and Cybernetics 2025;16(2):1015 View
Giacobbe D, Marelli C, Guastavino S, Signori A, Mora S, Rosso N, Campi C, Piana M, Murgia Y, Giacomini M, Bassetti M. Artificial intelligence and prescription of antibiotic therapy: present and future. Expert Review of Anti-infective Therapy 2024;22(10):819 View
Wang J, Shi R, Le Q, Shan K, Chen Z, Zhou X, He Y, Hong J. Evaluating the effectiveness of large language models in patient education for conjunctivitis. British Journal of Ophthalmology 2025;109(2):185 View
Merlino D, Brufau S, Saieed G, Van Abel K, Price D, Archibald D, Ator G, Carlson M. Comparative Assessment of Otolaryngology Knowledge Among Large Language Models. The Laryngoscope 2025;135(2):629 View
Tam T, Sivarajkumar S, Kapoor S, Stolyar A, Polanska K, McCarthy K, Osterhoudt H, Wu X, Visweswaran S, Fu S, Mathur P, Cacciamani G, Sun C, Peng Y, Wang Y. A framework for human evaluation of large language models in healthcare derived from literature review. npj Digital Medicine 2024;7(1) View
Goktas P, Grzybowski A. Assessing the Impact of ChatGPT in Dermatology: A Comprehensive Rapid Review. Journal of Clinical Medicine 2024;13(19):5909 View
Bedi S, Liu Y, Orr-Ewing L, Dash D, Koyejo S, Callahan A, Fries J, Wornow M, Swaminathan A, Lehmann L, Hong H, Kashyap M, Chaurasia A, Shah N, Singh K, Tazbaz T, Milstein A, Pfeffer M, Shah N. Testing and Evaluation of Health Care Applications of Large Language Models. JAMA 2025;333(4):319 View
Wu A. Chatting together: Using AI chatbots to improve diagnostic excellence. Journal of Patient Safety and Risk Management 2024;29(5):222 View
Zhou S, Luo X, Chen C, Jiang H, Yang C, Ran G, Yu J, Yin C. The performance of large language model-powered chatbots compared to oncology physicians on colorectal cancer queries. International Journal of Surgery 2024;110(10):6509 View
Al Khatib H, Neupane S, Kumar Manchukonda H, Golilarz N, Mittal S, Amirlatifi A, Rahimi S. Patient-centric knowledge graphs: a survey of current methods, challenges, and applications. Frontiers in Artificial Intelligence 2024;7 View
Reyhan A, Mutaf Ç, Uzun İ, Yüksekyayla F. A Performance Evaluation of Large Language Models in Keratoconus: A Comparative Study of ChatGPT-3.5, ChatGPT-4.0, Gemini, Copilot, Chatsonic, and Perplexity. Journal of Clinical Medicine 2024;13(21):6512 View
Zhang C, Liu S, Zhou X, Zhou S, Tian Y, Wang S, Xu N, Li W. Examining the Role of Large Language Models in Orthopedics: Systematic Review. Journal of Medical Internet Research 2024;26:e59607 View
Coskun Benlidayi I, Gupta L. Translation and Cross-Cultural Adaptation: A Critical Step in Multi-National Survey Studies. Journal of Korean Medical Science 2024;39(49) View
Slawaska-Eng D, Bourgeault-Gagnon Y, Cohen D, Pauyo T, Belzile E, Ayeni O. ChatGPT-3.5 and -4 provide mostly accurate information when answering patients’ questions relating to femoroacetabular impingement syndrome and arthroscopic hip surgery. Journal of ISAKOS 2025;10:100376 View
Ding Z, Wei R, Xia J, Mu Y, Wang J, Lin Y. Exploring the potential of large language model–based chatbots in challenges of ribosome profiling data analysis: a review. Briefings in Bioinformatics 2024;26(1) View
Bach T, Kaarstad M, Solberg E, Babic A. Insights into suggested Responsible AI (RAI) practices in real-world settings: a systematic literature review. AI and Ethics 2025;5(3):3185 View
Zhan Y, Chen X, Ye F, Wu Z, Usman M, Yuan Z, Wu H, Huang J, Yu H. Evaluating AI Chatbot Responses to Postkidney Transplant Inquiries. Transplantation Proceedings 2025;57(2):394 View
Ammo T, Guillaume V, Hofmann U, Ulmer N, Buenting N, Laenger F, Beier J, Leypold T. Evaluating ChatGPT-4o as a decision support tool in multidisciplinary sarcoma tumor boards: heterogeneous performance across various specialties. Frontiers in Oncology 2025;14 View
Beheshti M, Toubal I, Alaboud K, Almalaysha M, Ogundele O, Turabieh H, Abdalnabi N, Boren S, Scott G, Dahu B. Evaluating the Reliability of ChatGPT for Health-Related Questions: A Systematic Review. Informatics 2025;12(1):9 View
Flory J, Ancker J, Kim S, Kuperman G, Petrov A, Vickers A. Large Language Model GPT-4 Compared to Endocrinologist Responses on Initial Choice of Glucose-Lowering Medication Under Conditions of Clinical Uncertainty. Diabetes Care 2025;48(2):185 View
Waldock W, Lam G, Baptista A, Walls R, Sam A. Which curriculum components do medical students find most helpful for evaluating AI outputs?. BMC Medical Education 2025;25(1) View
Dillion D, Mondal D, Tandon N, Gray K. AI language model rivals expert ethicist in perceived moral expertise. Scientific Reports 2025;15(1) View
Wang X, Ye H, Zhang S, Yang M, Wang X. Evaluation of the Performance of Three Large Language Models in Clinical Decision Support: A Comparative Study Based on Actual Cases. Journal of Medical Systems 2025;49(1) View
Kleebayoon A, Wiwanitkit V. ChatGPT for responding to patient inquiries about otosclerosis: correspondence. European Archives of Oto-Rhino-Laryngology 2025;282(5):2785 View
Huang Y, Shi R, Chen C, Zhou X, Zhou X, Hong J, Chen Z. Evaluation of large language models for providing educational information in orthokeratology care. Contact Lens and Anterior Eye 2025;48(3):102384 View
Koss M, McLaughlin M, Switalla K, Falade I, Kim E. Exploring the role of ChatGPT in decision making for gender-affirming surgery. Artificial Intelligence Surgery 2025;5(1):116 View
Choo S, Yoo S, Endo K, Truong B, Son M. Advancing Clinical Chatbot Validation Using AI-Powered Evaluation With a New 3-Bot Evaluation System: Instrument Validation Study. JMIR Nursing 2025;8:e63058 View
Şişman A, Acar A. Artificial intelligence-based chatbot assistance in clinical decision-making for medically complex patients in oral surgery: a comparative study. BMC Oral Health 2025;25(1) View
Shool S, Adimi S, Saboori Amleshi R, Bitaraf E, Golpira R, Tara M. A systematic review of large language model (LLM) evaluations in clinical medicine. BMC Medical Informatics and Decision Making 2025;25(1) View
Rider N, Li Y, Chin A, DiGiacomo D, Dutmer C, Farmer J, Roberts K, Savova G, Ong M. Evaluating large language model performance to support the diagnosis and management of patients with primary immune disorders. Journal of Allergy and Clinical Immunology 2025;156(1):81 View
Leypold T, Bahm J, Beier J, Guillaume V, Ammo T, Lauer H, Kolbenschlag J, Schäfer B. Evaluating ChatGPT o1’s Capabilities in Peripheral Nerve Surgery: Advancing Artificial Intelligence in Clinical Practice. World Neurosurgery 2025;196:123753 View
Bhasuran B, Jin Q, Xie Y, Yang C, Hanna K, Costa J, Shavor C, Han W, Lu Z, He Z. Preliminary analysis of the impact of lab results on large language model generated differential diagnoses. npj Digital Medicine 2025;8(1) View
Li J, Chang C, Li Y, Cui S, Yuan F, Li Z, Wang X, Li K, Feng Y, Wang Z, Wei Z, Jian F. Large Language Models’ Responses to Spinal Cord Injury: A Comparative Study of Performance. Journal of Medical Systems 2025;49(1) View
Ao G, Chen M, Li J, Nie H, Zhang L, Chen Z. Comparative analysis of large language models on rare disease identification. Orphanet Journal of Rare Diseases 2025;20(1) View
Chen X, Xiang J, Lu S, Liu Y, He M, Shi D. Evaluating large language models and agents in healthcare: key challenges in clinical applications. Intelligent Medicine 2025;5(2):151 View
Kunze K, Gerhold C, Dave U, Abunnur N, Mamonov A, Nwachukwu B, Verma N, Chahla J. Large Language Model Use Cases in Health Care Research Are Redundant and Often Lack Appropriate Methodological Conduct: A Scoping Review and Call for Improved Practices. Arthroscopy: The Journal of Arthroscopic & Related Surgery 2025;41(11):4928 View
Saxena A, Rishi B. AI and human collaboration in tourism: a framework for scalable, authentic, and engaging content. Asia Pacific Journal of Tourism Research 2025;30(9):1226 View
Okenyi M, Ataguba G, Henry K, Anukem S, Orji R. Going vegan with ChatGPT: Towards designing LLMs for personalized lifestyle changes. Machine Learning with Applications 2025;20:100659 View
Giuffrè M, You K, Pang Z, Kresevic S, Chung S, Chen R, Ko Y, Chan C, Saarinen T, Ajcevic M, Crocè L, Garcia-Tsao G, Gralnek I, Sung J, Barkun A, Laine L, Sekhon J, Stadie B, Shung D. Expert of Experts Verification and Alignment (EVAL) Framework for Large Language Models Safety in Gastroenterology. npj Digital Medicine 2025;8(1) View
Li Y, Li Z, Li J, Liu L, Liu Y, Zhu B, shi K, Lu Y, Li Y, Zeng X, Feng Y, Wang X. The actual performance of large language models in providing liver cirrhosis-related information: A comparative study. International Journal of Medical Informatics 2025;201:105961 View
Othman A, Flaharty K, Ledgister Hanchard S, Hu P, Duong D, Waikel R, Solomon B. Assessing large language model performance related to aging in genetic conditions. npj Aging 2025;11(1) View
Qiang S, Zhang H, Liao Y, Zhang Y, Gu Y, Wang Y, Xu Z, Shi H, Han N, Yu H. Application of Large Language Models in Stroke Rehabilitation Health Education: 2-Phase Study. Journal of Medical Internet Research 2025;27:e73226 View
Zhou J, Cheng Y, He S, Chen Y, Chen H. Large Language Models for Transforming Healthcare: A Perspective on DeepSeek‐R1. MedComm – Future Medicine 2025;4(2) View
Borgonovo F, Matsuo T, Petri F, Amin Alavi S, Mazudie Ndjonko L, Gori A, Berbari E. Battle of the Bots: Solving Clinical Cases in Osteoarticular Infections With Large Language Models. Mayo Clinic Proceedings: Digital Health 2025;3(3):100230 View
Ejas F, Khan S, Mujahid A, AlJoker F, Mautong H, Alvarado-Villa G, Kashyap A, Yasir M, Nigatu K, Jain N, Iyer N, Sandhu A, Sharafat S, Yahya S, Ghaly M, Ibrar I, Singh A, Grewal H, Huespe I, Mehta P, Arshad Z, Kashyap R, Nawaz F. Medical Students’ Perceptions of Large Language Models in Healthcare: A Multinational Cross-Sectional Study. Journal of Medical Education and Curricular Development 2025;12 View
Su H, Sun Y, Li R, Zhang A, Yang Y, Xiao F, Duan Z, Chen J, Hu Q, Yang T, Xu B, Zhang Q, Zhao J, Li Y, Li H. Large Language Models in Medical Diagnostics: Scoping Review With Bibliometric Analysis. Journal of Medical Internet Research 2025;27:e72062 View
Alkalbani A, Alrawahi A, Salah A, Haghighi V, Zhang Y, Alkindi S, Sheng Q. A Systematic Review of Large Language Models in Medical Specialties: Applications, Challenges and Future Directions. Information 2025;16(6):489 View
See Y, Lim K, Au W, Chia S, Fan X, Li Z. The Use of Large Language Models in Ophthalmology: A Scoping Review on Current Use-Cases and Considerations for Future Works in This Field. Big Data and Cognitive Computing 2025;9(6):151 View
Angyal V, Bertalan Á, Domján P, Dinya E. Exploring the possibilities and limitations of customized large language model to support and improve cervical cancer screening. BMC Medical Informatics and Decision Making 2025;25(1) View
Shataer D, Cao S, Liu X, Aierken K, Bhattacharya P, Sinha A, Liu H. Application of Large Language Models in Traditional Chinese Medicine: A State-of-the-Art Review. The American Journal of Chinese Medicine 2025;53(04):973 View
Urda-Cîmpean A, Leucuța D, Drugan C, Duțu A, Călinici T, Drugan T. Assessing the Accuracy of Diagnostic Capabilities of Large Language Models. Diagnostics 2025;15(13):1657 View
Zhan L, Dang X, Xie Z, Zeng C, Wu W, Zhang X, Zhang L, Cai X. Evaluating GPT-4o in infectious disease diagnostics and management: A comparative study with residents and specialists on accuracy, completeness, and clinical support potential. DIGITAL HEALTH 2025;11 View
Yang H, Li M, Zhou H, Xiao Y, Fang Q, Zhou S, Zhang R. Large Language Model Synergy for Ensemble Learning in Medical Question Answering: Design and Evaluation Study. Journal of Medical Internet Research 2025;27:e70080 View
Báez J, Ahn E, Tamietti A, Victor B, Goldkind L. Clinical Social Workers’ Perceptions of Large Language Models in Practice: Resistance to Automation and Prospects for Integration. Journal of Evidence-Based Social Work 2026;23(1):42 View
Artiaga J, Guevarra M, Sosuan G, Agnihotri A, Nagel I, Kalaw F. Large language models in ophthalmology: a scoping review on their utility for clinicians, researchers, patients, and educators. Eye 2025;39(15):2752 View
Duan L, Li T, Li B, Li X, Fu D, Yang X, Cao K, Cai H. Application of large language models to natural language processing and image analysis tasks in dermatology: a systematic review. Intelligent Medicine 2025 View
Elangovan K, Ong J, Jin L, Seng B, Kwan Y, Ng L, Zhong R, Ma J, Ke Y, Liu N, Giacomini K, Ting D, Bui T. Development and evaluation of a lightweight large language model chatbot for medication enquiry. PLOS Digital Health 2025;4(9):e0000961 View
Salehin I, Tomal Ahmed Sajib M, Huda Badhon N, Sakibul Hassan Rifat M, Amin N, Nessa Moon N. Systematic Literature Review of LLM‐Large Language Model in Medical: Digital Health, Technology and Applications. Engineering Reports 2025;7(9) View
Wu S, Miao Y, Mei J, Xiong S. The Rise of Artificial Intelligence in Orthopedics: A Bibliometric and Visualization Analysis. Journal of Multidisciplinary Healthcare 2025;Volume 18:6037 View
Jaleel A, Aziz U, Farid G, Zahid Bashir M, Mirza T, Khizar Abbas S, Aslam S, Sikander R. Evaluating the Potential and Accuracy of ChatGPT-3.5 and 4.0 in Medical Licensing and In-Training Examinations: Systematic Review and Meta-Analysis. JMIR Medical Education 2025;11:e68070 View
Tian M, Li S, Du W, Yang S, Zhao X, Xiong H, Li H, Lu M, Ying Y, Zhang J, Liao Q, Yang D, Guo F. Novel Insights into the Application of Large Language Models in the Diagnosis and Treatment of Complex Cardiovascular Diseases: A Comparative Study. Journal of Medical Systems 2025;49(1) View
XUE T, BAI Y, ZHANG T. Intelligent technology-driven orthopedic rehabilitation: Progress and applications. SCIENTIA SINICA Technologica 2025;55(10):1659 View
Goh S, Mariappan R, Soo Woon Tan G, Yao J, Hew F, Yeo Y, Guan Wei Ow S, Koh W, Kumarakulasingh N, Tan T, Tai B, Hartman M, Ngiam K. Augmenting Large Language Models With National Comprehensive Cancer Network Guidelines for Improved and Standardized Adjuvant Therapy Recommendations in Postoperative Breast Cancer Cases. JCO Clinical Cancer Informatics 2025;(9) View
Büyükceran E, Seyfettin A, Babatürk A, Eskalen Z, Özkan M, Kaymaz E, Mersin H, Dönmez F. Text-based prediction of ımmunohistochemical biomarkers in breast cancer using a generative large language model: a retrospective study. Health Information Science and Systems 2025;14(1) View
Zhou H, Zhu Z, Oh K, Hong S. Empowering Informal Caregivers of Persons with Early-Stage Dementia by Large Language Models: Mixed Methods Evaluation (Preprint). JMIR Formative Research 2025 View
Pohlmann P, Glienke M, Sandkamp R, Gratzke C, Schmal H, Schoeb D, Fuchs A. Assessing the Efficacy of Ortho GPT: A Comparative Study with Medical Students and General LLMs on Orthopedic Examination Questions. Bioengineering 2025;12(12):1290 View
Sieciński K, Oliński M. A Multidisciplinary Bibliometric Analysis of Differences and Commonalities Between GenAI in Science. Publications 2025;13(4):67 View
Akkus Yildirim B, Tutun B, Durak G, Yildirim E, Uysal E, Erturk S, Bagci U. Large language models standardize the interpretation of complex oncology guidelines for brain metastases. Communications Medicine 2025;6(1) View
Kaczmarczyk R, Pieroh P, Koob S, Fröschen F, Scheidt S, Welle K, Martin R, Roos J. Application of Vision-Language Models in the Automatic Recognition of Bone Tumors on Radiographs: A Retrospective Study. AI 2025;6(12):327 View
Cui Z, Liu W, Tian X, You C, Meng X, Zhang H, Gong K, Wang X, Wu J. Performance of large language model in cross-specialty medical scenarios. Journal of Translational Medicine 2025 View
Coşkun Ü, Erten Tayşi A. The Use of Artificial Intelligence for Medication Support in Dentistry: A Reliability Assessment of Chatbots. Clinical and Experimental Health Sciences 2025;15(4):866 View
Wang E, Song S, Peng K, Liu T. Comparison between China-based DeepSeek and US-based major LLMs in answering social determinants of health questions in ophthalmology. Asia-Pacific Journal of Ophthalmology 2026;15(1):100276 View

Books/Policy Documents

Berlincioni L, Cultrera L, Becattini F, Bertini M, Del Bimbo A. Computer Vision – ECCV 2024 Workshops. View

Conference Proceedings

Mohammed H, Kiss G, Serrano J, Lindseth F. 2025 IEEE Symposium on Computational Intelligence in Health and Medicine (CIHM). Comparative Analysis and Evaluation of Well-Being Activity-Infused Fine-Tuned Language Models with Benchmark Models View
Zhao S, Wang J. Proceedings of the 34th ACM SIGSOFT International Symposium on Software Testing and Analysis. Best practice for supply chain in LLM-assisted medical applications View
Liu Z, Hu L, Zhou T, Tang Y, Cai Z. 2025 IEEE Symposium on Security and Privacy (SP). Prevalence Overshadows Concerns? Understanding Chinese Users' Privacy Awareness and Expectations Towards LLM-Based Healthcare Consultation View
Zhang T, Chung T, Dey A, Bae S. 2025 International Conference on Activity and Behavior Computing (ABC). AXAI-CDSS: An Affective Explainable AI-Driven Clinical Decision Support System for Cannabis Use View
Preiß N, Westner M. Proceedings of the 20th Conference on Computer Science and Intelligence Systems (FedCSIS). From Agents to Copilots: A Systematic Review of Digital Assistant Technology Adoption in Proprietary Productivity Software View

This paper is in the following e-collection/theme issue:

Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study

Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study

Journals

Books/Policy Documents

Conference Proceedings