Published on in Vol 25 (2023)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/49324, first published .
Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study

Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study

Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study

Journals

  1. Sallam M, Barakat M, Sallam M. A Preliminary Checklist (METRICS) to Standardize the Design and Reporting of Studies on Generative Artificial Intelligence–Based Models in Health Care Education and Practice: Development Study Involving a Literature Review. Interactive Journal of Medical Research 2024;13:e54704 View
  2. Rudroff T. Revealing the Complexity of Fatigue: A Review of the Persistent Challenges and Promises of Artificial Intelligence. Brain Sciences 2024;14(2):186 View
  3. Marchi F, Bellini E, Iandelli A, Sampieri C, Peretti G. Exploring the landscape of AI-assisted decision-making in head and neck cancer treatment: a comparative analysis of NCCN guidelines and ChatGPT responses. European Archives of Oto-Rhino-Laryngology 2024;281(4):2123 View
  4. Berrezueta-Guzman S, Kandil M, Martín-Ruiz M, Pau de la Cruz I, Krusche S. Future of ADHD Care: Evaluating the Efficacy of ChatGPT in Therapy Enhancement. Healthcare 2024;12(6):683 View
  5. Litvin A, Stoma I, Sharshakova T, Rumovskaya S, Kyovalev A. New possibilities of artificial intelligence in medicine: a narrative review. Health and Ecology Issues 2024;21(1):7 View
  6. Omar M, Brin D, Glicksberg B, Klang E. Utilizing natural language processing and large language models in the diagnosis and prediction of infectious diseases: A systematic review. American Journal of Infection Control 2024;52(9):992 View
  7. Bonnechère B. Unlocking the Black Box? A Comprehensive Exploration of Large Language Models in Rehabilitation. American Journal of Physical Medicine & Rehabilitation 2024 View
  8. Naqvi W, Shaikh S, Mishra G. Large language models in physical therapy: time to adapt and adept. Frontiers in Public Health 2024;12 View
  9. Leypold T, Lingens L, Beier J, Boos A. Integrating AI in Lipedema Management: Assessing the Efficacy of GPT-4 as a Consultation Assistant. Life 2024;14(5):646 View
  10. Tailor P, D'Souza H, Li H, Starr M. Vision of the future: large language models in ophthalmology. Current Opinion in Ophthalmology 2024;35(5):391 View
  11. Tan S, Xin X, Wu D. ChatGPT in medicine: prospects and challenges: a review article. International Journal of Surgery 2024;110(6):3701 View
  12. Goktas P, Gulseren D, Tobin A. Large Language and Vision Assistant in dermatology: a game changer or just hype?. Clinical and Experimental Dermatology 2024;49(8):783 View
  13. Letterie G. Moonshot. Long shot. Or sure shot. What needs to happen to realize the full potential of AI in the fertility sector?. Human Reproduction 2024;39(9):1863 View
  14. Cong Y, LaCroix A, Lee J. Clinical efficacy of pre-trained large language models through the lens of aphasia. Scientific Reports 2024;14(1) View
  15. Luo M, Pang J, Bi S, Lai Y, Zhao J, Shang Y, Cui T, Yang Y, Lin Z, Zhao L, Wu X, Lin D, Chen J, Lin H. Development and Evaluation of a Retrieval-Augmented Large Language Model Framework for Ophthalmology. JAMA Ophthalmology 2024;142(9):798 View
  16. Leypold T, Schäfer B, Boos A, Beier J. Artificial Intelligence-Powered Hand Surgery Consultation: GPT-4 as an Assistant in a Hand Surgery Outpatient Clinic. The Journal of Hand Surgery 2024;49(11):1078 View
  17. Yang Z, Wang D, Zhou F, Song D, Zhang Y, Jiang J, Kong K, Liu X, Qiao Y, Chang R, Han Y, Li F, Tham C, Zhang X. Understanding natural language: Potential application of large language models to ophthalmology. Asia-Pacific Journal of Ophthalmology 2024;13(4):100085 View
  18. Labinsky H, Nagler L, Krusche M, Griewing S, Aries P, Kroiß A, Strunz P, Kuhn S, Schmalzing M, Gernert M, Knitza J. Vignette-based comparative analysis of ChatGPT and specialist treatment decisions for rheumatic patients: results of the Rheum2Guide study. Rheumatology International 2024;44(10):2043 View
  19. Wang Y, Liang L, Li R, Wang Y, Hao C. Comparison of the Performance of ChatGPT, Claude and Bard in Support of Myopia Prevention and Control. Journal of Multidisciplinary Healthcare 2024;Volume 17:3917 View
  20. Shapiro J, Lyakhovitsky A. Revolutionizing teledermatology: Exploring the integration of artificial intelligence, including Generative Pre-trained Transformer chatbots for artificial intelligence-driven anamnesis, diagnosis, and treatment plans. Clinics in Dermatology 2024;42(5):492 View
  21. Zheng Y, Gan W, Chen Z, Qi Z, Liang Q, Yu P. Large language models for medicine: a survey. International Journal of Machine Learning and Cybernetics 2025;16(2):1015 View
  22. Giacobbe D, Marelli C, Guastavino S, Signori A, Mora S, Rosso N, Campi C, Piana M, Murgia Y, Giacomini M, Bassetti M. Artificial intelligence and prescription of antibiotic therapy: present and future. Expert Review of Anti-infective Therapy 2024;22(10):819 View
  23. Wang J, Shi R, Le Q, Shan K, Chen Z, Zhou X, He Y, Hong J. Evaluating the effectiveness of large language models in patient education for conjunctivitis. British Journal of Ophthalmology 2025;109(2):185 View
  24. Merlino D, Brufau S, Saieed G, Van Abel K, Price D, Archibald D, Ator G, Carlson M. Comparative Assessment of Otolaryngology Knowledge Among Large Language Models. The Laryngoscope 2025;135(2):629 View
  25. Tam T, Sivarajkumar S, Kapoor S, Stolyar A, Polanska K, McCarthy K, Osterhoudt H, Wu X, Visweswaran S, Fu S, Mathur P, Cacciamani G, Sun C, Peng Y, Wang Y. A framework for human evaluation of large language models in healthcare derived from literature review. npj Digital Medicine 2024;7(1) View
  26. Goktas P, Grzybowski A. Assessing the Impact of ChatGPT in Dermatology: A Comprehensive Rapid Review. Journal of Clinical Medicine 2024;13(19):5909 View
  27. Bedi S, Liu Y, Orr-Ewing L, Dash D, Koyejo S, Callahan A, Fries J, Wornow M, Swaminathan A, Lehmann L, Hong H, Kashyap M, Chaurasia A, Shah N, Singh K, Tazbaz T, Milstein A, Pfeffer M, Shah N. Testing and Evaluation of Health Care Applications of Large Language Models. JAMA 2025;333(4):319 View
  28. Wu A. Chatting together: Using AI chatbots to improve diagnostic excellence. Journal of Patient Safety and Risk Management 2024;29(5):222 View
  29. Zhou S, Luo X, Chen C, Jiang H, Yang C, Ran G, Yu J, Yin C. The performance of large language model-powered chatbots compared to oncology physicians on colorectal cancer queries. International Journal of Surgery 2024;110(10):6509 View
  30. Al Khatib H, Neupane S, Kumar Manchukonda H, Golilarz N, Mittal S, Amirlatifi A, Rahimi S. Patient-centric knowledge graphs: a survey of current methods, challenges, and applications. Frontiers in Artificial Intelligence 2024;7 View
  31. Reyhan A, Mutaf Ç, Uzun İ, Yüksekyayla F. A Performance Evaluation of Large Language Models in Keratoconus: A Comparative Study of ChatGPT-3.5, ChatGPT-4.0, Gemini, Copilot, Chatsonic, and Perplexity. Journal of Clinical Medicine 2024;13(21):6512 View
  32. Zhang C, Liu S, Zhou X, Zhou S, Tian Y, Wang S, Xu N, Li W. Examining the Role of Large Language Models in Orthopedics: Systematic Review. Journal of Medical Internet Research 2024;26:e59607 View
  33. Coskun Benlidayi I, Gupta L. Translation and Cross-Cultural Adaptation: A Critical Step in Multi-National Survey Studies. Journal of Korean Medical Science 2024;39(49) View
  34. Slawaska-Eng D, Bourgeault-Gagnon Y, Cohen D, Pauyo T, Belzile E, Ayeni O. ChatGPT-3.5 and -4 provide mostly accurate information when answering patients’ questions relating to femoroacetabular impingement syndrome and arthroscopic hip surgery. Journal of ISAKOS 2025;10:100376 View
  35. Ding Z, Wei R, Xia J, Mu Y, Wang J, Lin Y. Exploring the potential of large language model–based chatbots in challenges of ribosome profiling data analysis: a review. Briefings in Bioinformatics 2024;26(1) View
  36. Bach T, Kaarstad M, Solberg E, Babic A. Insights into suggested Responsible AI (RAI) practices in real-world settings: a systematic literature review. AI and Ethics 2025 View
  37. Zhan Y, Chen X, Ye F, Wu Z, Usman M, Yuan Z, Wu H, Huang J, Yu H. Evaluating AI Chatbot Responses to Postkidney Transplant Inquiries. Transplantation Proceedings 2025;57(2):394 View
  38. Ammo T, Guillaume V, Hofmann U, Ulmer N, Buenting N, Laenger F, Beier J, Leypold T. Evaluating ChatGPT-4o as a decision support tool in multidisciplinary sarcoma tumor boards: heterogeneous performance across various specialties. Frontiers in Oncology 2025;14 View
  39. Beheshti M, Toubal I, Alaboud K, Almalaysha M, Ogundele O, Turabieh H, Abdalnabi N, Boren S, Scott G, Dahu B. Evaluating the Reliability of ChatGPT for Health-Related Questions: A Systematic Review. Informatics 2025;12(1):9 View
  40. Flory J, Ancker J, Kim S, Kuperman G, Petrov A, Vickers A. Large Language Model GPT-4 Compared to Endocrinologist Responses on Initial Choice of Glucose-Lowering Medication Under Conditions of Clinical Uncertainty. Diabetes Care 2025;48(2):185 View
  41. Waldock W, Lam G, Baptista A, Walls R, Sam A. Which curriculum components do medical students find most helpful for evaluating AI outputs?. BMC Medical Education 2025;25(1) View
  42. Dillion D, Mondal D, Tandon N, Gray K. AI language model rivals expert ethicist in perceived moral expertise. Scientific Reports 2025;15(1) View
  43. Wang X, Ye H, Zhang S, Yang M, Wang X. Evaluation of the Performance of Three Large Language Models in Clinical Decision Support: A Comparative Study Based on Actual Cases. Journal of Medical Systems 2025;49(1) View
  44. Kleebayoon A, Wiwanitkit V. ChatGPT for responding to patient inquiries about otosclerosis: correspondence. European Archives of Oto-Rhino-Laryngology 2025 View
  45. Huang Y, Shi R, Chen C, Zhou X, Zhou X, Hong J, Chen Z. Evaluation of large language models for providing educational information in orthokeratology care. Contact Lens and Anterior Eye 2025:102384 View
  46. Koss M, McLaughlin M, Switalla K, Falade I, Kim E. Exploring the role of ChatGPT in decision making for gender-affirming surgery. Artificial Intelligence Surgery 2025;5(1):116 View
  47. Choo S, Yoo S, Endo K, Truong B, Son M. Advancing Clinical Chatbot Validation Using AI-Powered Evaluation With a New 3-Bot Evaluation System: Instrument Validation Study. JMIR Nursing 2025;8:e63058 View
  48. Şişman A, Acar A. Artificial intelligence-based chatbot assistance in clinical decision-making for medically complex patients in oral surgery: a comparative study. BMC Oral Health 2025;25(1) View
  49. Shool S, Adimi S, Saboori Amleshi R, Bitaraf E, Golpira R, Tara M. A systematic review of large language model (LLM) evaluations in clinical medicine. BMC Medical Informatics and Decision Making 2025;25(1) View
  50. Rider N, Li Y, Chin A, DiGiacomo D, Dutmer C, Farmer J, Roberts K, Savova G, Ong M. Evaluating large language model performance to support the diagnosis and management of patients with primary immune disorders. Journal of Allergy and Clinical Immunology 2025 View
  51. Leypold T, Bahm J, Beier J, Guillaume V, Ammo T, Lauer H, Kolbenschlag J, Schäfer B. Evaluating ChatGPT o1’s Capabilities in Peripheral Nerve Surgery: Advancing Artificial Intelligence in Clinical Practice. World Neurosurgery 2025;196:123753 View
  52. Bhasuran B, Jin Q, Xie Y, Yang C, Hanna K, Costa J, Shavor C, Han W, Lu Z, He Z. Preliminary analysis of the impact of lab results on large language model generated differential diagnoses. npj Digital Medicine 2025;8(1) View
  53. Li J, Chang C, Li Y, Cui S, Yuan F, Li Z, Wang X, Li K, Feng Y, Wang Z, Wei Z, Jian F. Large Language Models’ Responses to Spinal Cord Injury: A Comparative Study of Performance. Journal of Medical Systems 2025;49(1) View
  54. Ao G, Chen M, Li J, Nie H, Zhang L, Chen Z. Comparative analysis of large language models on rare disease identification. Orphanet Journal of Rare Diseases 2025;20(1) View
  55. Chen X, Xiang J, Lu S, Liu Y, He M, Shi D. Evaluating large language models and agents in healthcare: key challenges in clinical applications. Intelligent Medicine 2025 View
  56. Kunze K, Gerhold C, Dave U, Abunnur N, Mamonov A, Nwachukwu B, Verma N, Chahla J. Large Language Model Use Cases in Healthcare Research are Redundant and Often Lack Appropriate Methodological Conduct: A Scoping Review and Call for Improved Practices. Arthroscopy: The Journal of Arthroscopic & Related Surgery 2025 View