Published on in Vol 26 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/66114, first published .
Large Language Models in Worldwide Medical Exams: Platform Development and Comprehensive Analysis

Large Language Models in Worldwide Medical Exams: Platform Development and Comprehensive Analysis

Large Language Models in Worldwide Medical Exams: Platform Development and Comprehensive Analysis

Journals

  1. Azizoğlu M, Klyuev S. A Comparative Study on the Question-Answering Proficiency of Artificial Intelligence Models in Bladder-Related Conditions: An Evaluation of Gemini and ChatGPT 4.o. Medical Records 2025;7(1):201 View
  2. Wei Y, Zhang R, Zhang J, Qi D, Cui W. Research on Intelligent Grading of Physics Problems Based on Large Language Models. Education Sciences 2025;15(2):116 View
  3. Zeng J, Sun K, Qin P, Liu S. Enhancing ophthalmology students’ awareness of retinitis pigmentosa: assessing the efficacy of ChatGPT in AI-assisted teaching of rare diseases—a quasi-experimental study. Frontiers in Medicine 2025;12 View
  4. Acar A, Yanik E, Altin E, Kurtkaya Kocak O. Is artificial intelligence successful in the Turkish neurology board exam?. Neurological Research 2025;47(5):402 View
  5. Hasei J, Nakahara R, Takeuchi K, Yoshida A, Itano T, Fujiwara T, Nakata E, Kunisada T, Ozaki T. Comparative analysis of a standard (GPT-4o) and reasoning-enhanced (o1 pro) large language model on complex clinical questions from the Japanese orthopaedic board examination. Journal of Orthopaedic Science 2025;30(3):565 View
  6. Budler L, Chen H, Chen A, Topaz M, Tam W, Bian J, Stiglic G. A Brief Review on Benchmarking for Large Language Models Evaluation in Healthcare. WIREs Data Mining and Knowledge Discovery 2025;15(2) View
  7. Bi C, Zheng X, Zhang Y, Zhou S, Song J, Shang H, Shen B. NDDRF 2.0: An update and expansion of risk factor knowledge base for personalized prevention of neurodegenerative diseases. Alzheimer's & Dementia 2025;21(5) View
  8. Wu D, Liu N, Ma R, Wu P. Advancements in Herpes Zoster Diagnosis, Treatment, and Management: Systematic Review of Artificial Intelligence Applications. Journal of Medical Internet Research 2025;27:e71970 View
  9. Wei J, Wang X, Huang M, Xu Y, Yang W. Evaluating the Performance of ChatGPT on Board-Style Examination Questions in Ophthalmology: A Meta-Analysis. Journal of Medical Systems 2025;49(1) View
  10. Yan Z, Fan K, Zhang Q, Wu X, Chen Y, Wu X, Yu T, Su N, Zou Y, Chi H, Xia L, Cao Q. Comparative analysis of the performance of the large language models DeepSeek-V3, DeepSeek-R1, open AI-O3 mini and open AI-O3 mini high in urology. World Journal of Urology 2025;43(1) View
  11. Paruzel K, Ordak M. Assessment of ChatGPT-3.5 performance on the medical genetics specialist exam. Laboratory Medicine 2025 View
  12. Hu D, Guo Y, Zhou Y, Flores L, Zheng K. A systematic review of early evidence on generative AI for drafting responses to patient messages. npj Health Systems 2025;2(1) View

Books/Policy Documents

  1. Zong H, Tao L, Li Z, Wu C, Liu Y, Zhang X. Health Information Processing. Evaluation Track Papers. View