Published on in Vol 25 (2023)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/50865, first published .
Evaluation of GPT-4’s Chest X-Ray Impression Generation: A Reader Study on Performance and Perception

Evaluation of GPT-4’s Chest X-Ray Impression Generation: A Reader Study on Performance and Perception

Evaluation of GPT-4’s Chest X-Ray Impression Generation: A Reader Study on Performance and Perception

Journals

  1. Li J, Dada A, Puladi B, Kleesiek J, Egger J. ChatGPT in healthcare: A taxonomy and systematic review. Computer Methods and Programs in Biomedicine 2024;245:108013 View
  2. Wu Q, Wu Q, Li H, Wang Y, Bai Y, Wu Y, Yu X, Li X, Dong P, Xue J, Shen D, Wang M. Evaluating Large Language Models for Automated Reporting and Data Systems Categorization: Cross-Sectional Study. JMIR Medical Informatics 2024;12:e55799 View
  3. Shiraishi M, Miyamoto S, Takeishi H, Kurita D, Furuse K, Ohba J, Moriwaki Y, Fujisawa K, Okazaki M. The Potential of Chat-Based Artificial Intelligence Models in Differentiating Between Keloid and Hypertrophic Scars: A Pilot Study. Aesthetic Plastic Surgery 2024;48(24):5367 View
  4. Mukherjee P, Hou B, Suri A, Zhuang Y, Parnell C, Lee N, Stroie O, Jain R, Wang K, Sharma K, Summers R. Evaluation of GPT Large Language Model Performance on RSNA 2023 Case of the Day Questions. Radiology 2024;313(1) View
  5. Chang Y, Yin J, Li J, Liu C, Cao L, Lin S. Applications and Future Prospects of Medical LLMs: A Survey Based on the M-KAT Conceptual Framework. Journal of Medical Systems 2024;48(1) View
  6. Yang X, Li T, Su Q, Liu Y, Kang C, Lyu Y, Zhao L, Nie Y, Pan Y. Application of large language models in disease diagnosis and treatment. Chinese Medical Journal 2025;138(2):130 View
  7. Su Y, Yang S, Liu Y, Kai A, Chen L, Liu M. Knowledge discovery from porous organic cage literature using a large language model. Digital Discovery 2025;4(2):403 View
  8. Altalla’ B, Ahmad A, Bitar L, Al-Bssol M, Al Omari A, Sultan I, Sarkar S. Radiology Report Annotation Using Generative Large Language Models: Comparative Analysis. International Journal of Biomedical Imaging 2025;2025(1) View
  9. Zhou Z, Qin P, Cheng X, Shao M, Ren Z, Zhao Y, Li Q, Liu L. ChatGPT in Oncology Diagnosis and Treatment: Applications, Legal and Ethical Challenges. Current Oncology Reports 2025;27(4):336 View
  10. Shmilovitch A, Katson M, Cohen-Shelly M, Peretz S, Aran D, Shelly S. GPT-4 as a Clinical Decision Support Tool in Ischemic Stroke Management: Evaluation Study. JMIR AI 2025;4:e60391 View
  11. Jin G. Artificial intelligence in thoracic imaging—a new paradigm for diagnosing pulmonary diseases: a narrative review. Journal of the Korean Medical Association 2025;68(5):288 View
  12. Kim S, Schramm S, Wihl J, Raffler P, Tahedl M, Canisius J, Luiken I, Endrös L, Reischl S, Marka A, Walter R, Schillmaier M, Zimmer C, Wiestler B, Hedderich D. Boosting LLM-assisted diagnosis: 10-minute LLM tutorial elevates radiology residents’ performance in brain MRI interpretation. Neuroradiology 2025;67(8):2069 View
  13. Wihl J, Rosenkranz E, Schramm S, Berberich C, Griessmair M, Woźnicki P, Pinto F, Ziegelmayer S, Adams L, Bressem K, Kirschke J, Zimmer C, Wiestler B, Hedderich D, Kim S. Data extraction from free-text stroke CT reports using GPT-4o and Llama-3.3-70B: the impact of annotation guidelines. European Radiology Experimental 2025;9(1) View
  14. Hürsoy N, Kolluk H, Solak M, Budak K, Kaba E. Interpreting Chest X-ray with ChatGPT: Can It Serve as a Tool for Justifying Computed Tomography?. CERASUS JOURNAL OF MEDICINE 2025;2(2):118 View
  15. de Almeida J, Alberich L, Tsakou G, Marias K, Tsiknakis M, Lekadir K, Marti-Bonmati L, Papanikolaou N. Foundation models for radiology—the position of the AI for Health Imaging (AI4HI) network. Insights into Imaging 2025;16(1) View
  16. Han M, Liu Y. Evaluating generative artificial intelligence products using fuzzy social network multi-attribute decision-making model: User perspective. Applied Soft Computing 2025;183:113715 View
  17. Chetla N, Samayamanthula S, Chang J, Leigh A, Akosman S, Tandon M, Hage T, Cusick M. Assessing the Diagnostic Capabilities of ChatGPT-4 Omni in Grading Diabetic Retinopathy Fundoscopy Using Color Fundus Photographs. Clinical Ophthalmology 2025;Volume 19:3103 View

Conference Proceedings

  1. Mahmood R, Yan P, Reyes D, Wang G, Kalra M, Kaviani P, Wu J, Syeda-Mahmood T. 2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI). Evaluating Automated Radiology Report Quality Through Fine-Grained Phrasal Grounding of Clinical Findings View