Published on in Vol 26 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/67409, first published .
The Triage and Diagnostic Accuracy of Frontier Large Language Models: Updated Comparison to Physician Performance

The Triage and Diagnostic Accuracy of Frontier Large Language Models: Updated Comparison to Physician Performance

The Triage and Diagnostic Accuracy of Frontier Large Language Models: Updated Comparison to Physician Performance

Journals

  1. Menz B, Modi N, Abuhelwa A, Ruanglertboon W, Vitry A, Gao Y, Li L, Chhetri R, Chu B, Bacchi S, Kichenadasse G, Shahnam A, Rowland A, Sorich M, Hopkins A. Generative AI chatbots for reliable cancer information: Evaluating web-search, multilingual, and reference capabilities of emerging large language models. European Journal of Cancer 2025;218:115274 View
  2. Gao C, Satheakeerthy S, Guo C, Pradhan A, Booth A, Chan W, Kanjilal S, Roberts M, Kotton C, Bacchi S. Large language models for infectious diseases require evidence generation and regulation. Internal Medicine Journal 2025;55(7):1198 View
  3. Modi N, Menz B, Awaty A, Alex C, Logan J, McKinnon R, Rowland A, Bacchi S, Gradon K, Sorich M, Hopkins A. Assessing the System-Instruction Vulnerabilities of Large Language Models to Malicious Conversion Into Health Disinformation Chatbots. Annals of Internal Medicine 2025;178(8):1172 View
  4. Alomari L, Alshammari M, Arbaeen A, Alshehri R, Almalki H. Safety and accuracy of AI in triaging patients in the emergency department. International Journal of Emergency Medicine 2025;18(1) View
  5. Wang X, Wang Q, Ding G, Wang J, Tang Y, Feng Y. Artificial intelligence in multidisciplinary tumor boards enhancing decision making and clinical outcomes in oncology. iScience 2025;28(12):114082 View