ChatGPT and the clinical informatics board examination: the end of unproctored maintenance of certification?

J Am Med Inform Assoc. 2023 Aug 18;30(9):1558-1560. doi: 10.1093/jamia/ocad104.

Abstract

We aimed to assess ChatGPT's performance on the Clinical Informatics Board Examination and to discuss the implications of large language models (LLMs) for board certification and maintenance. We tested ChatGPT on 260 multiple-choice questions from Mankowitz's Clinical Informatics Board Review book, excluding 6 image-dependent questions. ChatGPT answered 190 (74%) of the 254 eligible questions correctly. Performance varied across the Clinical Informatics Core Content Areas, but the differences were not statistically significant. ChatGPT's performance raises concerns about the potential misuse of artificial intelligence (AI) in medical certification and about the validity of knowledge-assessment examinations. Because ChatGPT can answer multiple-choice questions accurately, candidates' unsupervised access to AI systems during examinations would compromise the credibility and validity of at-home assessments and undermine public trust. The advent of AI and LLMs threatens to upend existing processes of board certification and maintenance and necessitates new approaches to evaluating proficiency in medical education.
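To make the described analysis concrete, the sketch below shows how per-content-area accuracy could be compared with a chi-square test of independence, as in the reported comparison across Core Content Areas. The area labels and per-area counts are hypothetical placeholders (chosen only so the totals match the 190/254 overall figure), not the study's data; scipy is assumed to be available.

```python
# Sketch: compare LLM accuracy across exam content areas with a
# chi-square test of independence. Per-area counts are HYPOTHETICAL
# placeholders, not the figures reported in the study.
from scipy.stats import chi2_contingency

# area -> (correct, incorrect); placeholder counts summing to 254 questions
results_by_area = {
    "Area 1": (48, 14),
    "Area 2": (50, 18),
    "Area 3": (47, 17),
    "Area 4": (45, 15),
}

# Build the 4x2 contingency table and test whether accuracy is
# independent of content area.
table = [[correct, incorrect] for correct, incorrect in results_by_area.values()]
chi2, p, dof, expected = chi2_contingency(table)

total_correct = sum(c for c, _ in results_by_area.values())
total = sum(c + i for c, i in results_by_area.values())
print(f"Overall accuracy: {total_correct}/{total} = {total_correct/total:.0%}")
print(f"Chi-square across areas: chi2={chi2:.2f}, dof={dof}, p={p:.3f}")
```

With a non-significant p-value, as the abstract reports, the observed per-area differences would be consistent with chance variation rather than systematically uneven performance across content areas.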

Keywords: ChatGPT; Clinical Informatics Board Examination; artificial intelligence; large language models; medical education.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Artificial Intelligence
  • Certification
  • Education, Medical*
  • Language
  • Medical Informatics*