Published in Vol 28 (2026)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/90061.

Benchmark Integrity and Reasoning-Trace Errors in Medical Question Answering With Large Language Models: Mixed Methods Study With Sparse Autoencoders

Authors of this article:

Jialin Liu1,2; Siru Liu3,4; Adam Wright3,5
