Accessibility settings

Published on in Vol 27 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/84120, first published .
Knowledge-Practice Performance Gap in Clinical Large Language Models: Systematic Review of 39 Benchmarks

Knowledge-Practice Performance Gap in Clinical Large Language Models: Systematic Review of 39 Benchmarks

Knowledge-Practice Performance Gap in Clinical Large Language Models: Systematic Review of 39 Benchmarks

Journals

  1. Lin X, Yang Y, Ren Y. Making Chatbots more human: deep reasoning large language models in ophthalmology. Frontiers in Medicine 2026;12 View
  2. Spieser J, Balapour A, Meller J, Patra K, Shamsaei B. A Review of Multi-Agent AI Systems for Biological and Clinical Data Analysis. Methods and Protocols 2026;9(2):33 View
  3. Zhu Q, Li Q, Zan Y, Lu Y, Xia L, Xia Y, Xu T. Patient-centered gastrointestinal function assessment technologies: a paradigm shift from traditional approaches to non-invasive innovations. Frontiers in Physiology 2026;17 View
  4. Prause M. No skin in the game: why agentic AI requires principal-agent governance. AI and Ethics 2026;6(2) View
  5. Eltaybani S. Knowledge Cut‐Off in Large Language Models: Implications for Critical Care Nursing. Nursing in Critical Care 2026;31(3) View

Conference Proceedings

  1. Kumar A, Joshi S, Sachdeva S. 2026 International Conference on Signal Processing and Electronics Design (ICSPED). JsonUtil: An Open-Source RESTful JSON-Based Dynamic Form Generation Framework validation with OpenEHR ORBDA Benchmarking Dataset View