Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Spot the Difference: Can ChatGPT4-Vision Transform Radiology Artificial Intelligence?

View ORCID ProfileBrendan S Kelly, Sophie Duignan, Prateek Mathur, Henry Dillon, Edward H Lee, Kristen W Yeom, Pearse Keane, Aonghus Lawlor, Ronan P Killeen
doi: https://doi.org/10.1101/2023.11.15.23298499
Brendan S Kelly
1St Vincent’s University Hospital, Dublin, Ireland
2Insight Centre for Data Analytics, UCD, Dublin, Ireland
3Wellcome Trust – HRB, Irish Clinical Academic Training, Dublin, Ireland
4School of Medicine, University College Dublin, Dublin, Ireland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Brendan S Kelly
  • For correspondence: brendanskelly{at}me.com
Sophie Duignan
2Insight Centre for Data Analytics, UCD, Dublin, Ireland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Prateek Mathur
2Insight Centre for Data Analytics, UCD, Dublin, Ireland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Henry Dillon
1St Vincent’s University Hospital, Dublin, Ireland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Edward H Lee
5Lucille Packard Children’s Hospital at Stanford, Stanford, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kristen W Yeom
5Lucille Packard Children’s Hospital at Stanford, Stanford, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Pearse Keane
6Professor of Artificial Medical Intelligence, University College London
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Aonghus Lawlor
2Insight Centre for Data Analytics, UCD, Dublin, Ireland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ronan P Killeen
1St Vincent’s University Hospital, Dublin, Ireland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

OpenAI’s flagship Large Language Model ChatGPT can now accept image input (GPT4V). “Spot the Difference” and “Medical” have been suggested as emerging applications. The interpretation of medical images is a dynamic process not a static task. Diagnosis and treatment of Multiple Sclerosis is dependent on identification of radiologic change. We aimed to compare the zero-shot performance of GPT4V to a trained U-Net and Vision Transformer (ViT) for the identification of progression of MS on MRI.

170 patients were included. 100 unseen paired images were randomly used for testing. Both U-Net and ViT had 94% accuracy while GPT4V had 85%. GPT4V gave overly cautious non-answers in 6 cases. GPT4V had a precision, recall and F1 score of 0.896, 0.915, 0.905 compared to 1.0, 0.88 and 0.936 for U-Net and 0.94, 0.94, 0.94 for ViT.

The impressive performance compared to trained models and a no-code drag and drop interface suggest GPT4V has the potential to disrupt AI radiology research. However misclassified cases, hallucinations and overly cautious non-answers confirm that it is not ready for clinical use. GPT4V’s widespread availability and relatively high error rate highlight the need for caution and education for lay-users, especially those with limited access to expert healthcare.

Key points

  • Even without fine tuning and without the need for prior coding experience or additional hardware, GPT4V can perform a zero-shot radiologic change detection task with reasonable accuracy.

  • We find GPT4V does not match the performance of established state of the art computer vision models. GPT4V’s performance metrics are more similar to the vision transformers than the convolutional neural networks, giving some possible insight into its underlying architecture.

  • This is an exploratory experimental study and GPT4V is not intended for use as a medical device.

Summary statement GPT4V can identify radiologic progression of Multiple Sclerosis in a simplified experimental setting. However GPT4V is not a medical device and its widespread availability and relatively high error rate highlight the need for caution and education for lay-users, especially those with limited access to expert healthcare.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was performed within the Irish Clinical Academic Training (ICAT) Programme, supported by the Wellcome Trust and the Health Research Board (Grant No. 203930/B/16/Z), the Health Service Executive National Doctors Training and Planning and the Health and Social Care, Research and Development Division, Northern Ireland and the Faculty of Radiologists, Royal College of Surgeons in Ireland. This research was supported by Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289_P2 and by a Fulbright-HRB HealthImpact Scholarship.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Ethics committee/IRB of St Vincent's University hospital gave ethical approval for this work

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

All data produced in the present study are available upon reasonable request to the authors

  • Abbreviations

    (CNS)
    Central Nervous System
    (GPT4V)
    Chat Generative Pretrained Transformer 4 Vision
    (MRI)
    Magnetic Resonance Imaging
    (MS)
    Vision Transformers
    (ViT)
    Multiple sclerosis
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
    Back to top
    PreviousNext
    Posted November 18, 2023.
    Download PDF
    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Spot the Difference: Can ChatGPT4-Vision Transform Radiology Artificial Intelligence?
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Spot the Difference: Can ChatGPT4-Vision Transform Radiology Artificial Intelligence?
    Brendan S Kelly, Sophie Duignan, Prateek Mathur, Henry Dillon, Edward H Lee, Kristen W Yeom, Pearse Keane, Aonghus Lawlor, Ronan P Killeen
    medRxiv 2023.11.15.23298499; doi: https://doi.org/10.1101/2023.11.15.23298499
    Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    Spot the Difference: Can ChatGPT4-Vision Transform Radiology Artificial Intelligence?
    Brendan S Kelly, Sophie Duignan, Prateek Mathur, Henry Dillon, Edward H Lee, Kristen W Yeom, Pearse Keane, Aonghus Lawlor, Ronan P Killeen
    medRxiv 2023.11.15.23298499; doi: https://doi.org/10.1101/2023.11.15.23298499

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Radiology and Imaging
    Subject Areas
    All Articles
    • Addiction Medicine (349)
    • Allergy and Immunology (668)
    • Allergy and Immunology (668)
    • Anesthesia (181)
    • Cardiovascular Medicine (2648)
    • Dentistry and Oral Medicine (316)
    • Dermatology (223)
    • Emergency Medicine (399)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
    • Epidemiology (12228)
    • Forensic Medicine (10)
    • Gastroenterology (759)
    • Genetic and Genomic Medicine (4103)
    • Geriatric Medicine (387)
    • Health Economics (680)
    • Health Informatics (2657)
    • Health Policy (1005)
    • Health Systems and Quality Improvement (985)
    • Hematology (363)
    • HIV/AIDS (851)
    • Infectious Diseases (except HIV/AIDS) (13695)
    • Intensive Care and Critical Care Medicine (797)
    • Medical Education (399)
    • Medical Ethics (109)
    • Nephrology (436)
    • Neurology (3882)
    • Nursing (209)
    • Nutrition (577)
    • Obstetrics and Gynecology (739)
    • Occupational and Environmental Health (695)
    • Oncology (2030)
    • Ophthalmology (585)
    • Orthopedics (240)
    • Otolaryngology (306)
    • Pain Medicine (250)
    • Palliative Medicine (75)
    • Pathology (473)
    • Pediatrics (1115)
    • Pharmacology and Therapeutics (466)
    • Primary Care Research (452)
    • Psychiatry and Clinical Psychology (3432)
    • Public and Global Health (6527)
    • Radiology and Imaging (1403)
    • Rehabilitation Medicine and Physical Therapy (814)
    • Respiratory Medicine (871)
    • Rheumatology (409)
    • Sexual and Reproductive Health (410)
    • Sports Medicine (342)
    • Surgery (448)
    • Toxicology (53)
    • Transplantation (185)
    • Urology (165)