Evaluating Capabilities of Large Language Models: Performance of GPT4 on Surgical Knowledge Assessments

Brendin R Beaulieu-Jones; Sahaj Shah; Margaret T Berrigan; Jayson S Marwaha; Shuo-Lun Lai; Gabriel A Brat

Article Information

DOI

https://doi.org/10.1101/2023.07.16.23292743

History

July 24, 2023.

Article Versions

Version 1 (July 19, 2023 - 08:31).
Version 2 is the most recent version of this article.

The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.

Author Information

Brendin R Beaulieu-Jones, MD, MBA¹,²
Sahaj Shah, BS³
Margaret T Berrigan, MD¹
Jayson S Marwaha, MD, MBI⁴
Shuo-Lun Lai, MD⁴
Gabriel A Brat, MD, FACS, MPH¹,²,*

¹Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA
²Department of Biomedical Informatics, Harvard Medical School, Boston, MA
³Geisinger Commonwealth School of Medicine, Scranton, PA
⁴Division of Colorectal Surgery, National Taiwan University Hospital, Taipei, Taiwan

*Corresponding Author: Gabriel A Brat, MD, FACS, MPH; Department of Surgery, Beth Israel Deaconess Medical Center; Department of Biomedical Informatics, Harvard Medical School; 110 Francis Street, Suite 2G, Boston, MA 02215; gbrat{at}bidmc.harvard.edu