Published in Vol 26 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/67409.
The Triage and Diagnostic Accuracy of Frontier Large Language Models: Updated Comparison to Physician Performance

Journals

  1. Menz B, Modi N, Abuhelwa A, Ruanglertboon W, Vitry A, Gao Y, Li L, Chhetri R, Chu B, Bacchi S, Kichenadasse G, Shahnam A, Rowland A, Sorich M, Hopkins A. Generative AI chatbots for reliable cancer information: Evaluating web-search, multilingual, and reference capabilities of emerging large language models. European Journal of Cancer 2025;218:115274
  2. Gao C, Satheakeerthy S, Guo C, Pradhan A, Booth A, Chan W, Kanjilal S, Roberts M, Kotton C, Bacchi S. Large language models for infectious diseases require evidence generation and regulation. Internal Medicine Journal 2025