Published on in Vol 27 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/70080, first published .
Large Language Model Synergy for Ensemble Learning in Medical Question Answering: Design and Evaluation Study


Journals

  1. Kurz C, Merzhevich T, Eskofier B, Kather J, Gmeiner B. Benchmarking vision-language models for diagnostics in emergency and critical care settings. npj Digital Medicine 2025;8(1)
  2. Zhang Y, Xie X, Xu Q. ChatGPT in Medical Education: Bibliometric and Visual Analysis. JMIR Medical Education 2025;11:e72356
  3. Liu M, Wang B. Integrating natural non-pharmaceutical therapies into medical tourism: a dynamic health portrait-driven model for proactive older adult health and public health services. Frontiers in Public Health 2025;13
  4. Koppula M, Madhulika F, Sreeramoju N, Kolimi P. AI-Powered Chatbot for FDA Drug Labeling Information Retrieval: OpenAI GPT for Grounded Question Answering. Analytics 2025;4(4):33
  5. Elshaer Z, Rashed E. CURE: Confidence-Driven Unified Reasoning Ensemble Framework for Medical Question Answering. Big Data and Cognitive Computing 2025;9(12):299
  6. Zhou S, Xie W, Li J, Zhan Z, Song M, Yang H, Espinoza C, Welton L, Mai X, Jin Y, Xu Z, Chung Y, Xing Y, Tsai M, Schaffer E, Shi Y, Liu N, Liu Z, Zhang R. Automating expert-level medical reasoning evaluation of large language models. npj Digital Medicine 2025;9(1)
  7. Mbah S, Matthew Fagbola T, Kumar Mishra B, Al Jaber T, Colin Thakur S, Althobaiti T. ICONQUER: A Transformer-Based Instruction-Finetuned Context-Aware Medical Question Answering Model With Knowledge Graph Augmentation. IEEE Access 2025;13:210950
  8. Ekingen E, Ucdal M. Comparative Performance of Multimodal and Unimodal Large Language Models Versus Multicenter Human Clinical Experts in Aortic Dissection Management. Diagnostics 2026;16(2):323
  9. Zhou H, Chow L, Harnack L, Panda S, Manoogian E, Li M, Xiao Y, Zhang R. NutriRAG: unleashing the power of large language models for food identification and classification through retrieval methods. Journal of the American Medical Informatics Association 2026
  10. Zhang X, Shang L, Hou S, Li J, Yang K, Lai F, Tian P, Zheng Y, Su G, Xu T, Hu K, Huang R. Large language models for predicting one-year major adverse cardiovascular events in acute coronary syndrome. iScience 2026;29(2):114644
  11. Wang B, Du Y, Jin X, Wang Z. Efficient Tuning Framework for Resource-Constrained Biomedical Question Answering. IEEE Transactions on Computational Biology and Bioinformatics 2026;23(1):528