Published on in Vol 27 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/69910, first published .
Assessing the Accuracy and Reliability of Large Language Models in Psychiatry Using Standardized Multiple-Choice Questions: Cross-Sectional Study

Assessing the Accuracy and Reliability of Large Language Models in Psychiatry Using Standardized Multiple-Choice Questions: Cross-Sectional Study

Assessing the Accuracy and Reliability of Large Language Models in Psychiatry Using Standardized Multiple-Choice Questions: Cross-Sectional Study

Kaitlin Hanss   1 , MD, MPH ;   Karthik V Sarma   1 , MD, PhD ;   Anne L Glowinski   1 , MD, MPE ;   Andrew Krystal   1 , MD ;   Ramotse Saunders   1 , MD ;   Andrew Halls   1 , MD ;   Sasha Gorrell   1 , PhD ;   Erin Reilly   1 , PhD

1 Department of Psychiatry and Behavioral Sciences, University of California, San Francisco, San Francisco, CA, United States

Corresponding Author:

  • Kaitlin Hanss, MD, MPH
  • Department of Psychiatry and Behavioral Sciences
  • University of California, San Francisco
  • 675 18th Street, Box 3134
  • San Francisco, CA 94143
  • United States
  • Phone: 1 415 476-7000
  • Fax: 1 415-502-6361
  • Email: Kaitlin.Hanss@ucsf.edu