Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Evaluation of ChatGPT’s Usefulness and Accuracy in Diagnostic Surgical Pathology

View ORCID ProfileVincenzo Guastafierro, Devin Nicole Corbitt, View ORCID ProfileAlessandra Bressan, View ORCID ProfileBethania Fernandes, Ömer Mintemur, View ORCID ProfileFrancesca Magnoli, Susanna Ronchi, View ORCID ProfileStefano La Rosa, View ORCID ProfileSilvia Uccella, View ORCID ProfileSalvatore Lorenzo Renne
doi: https://doi.org/10.1101/2024.03.12.24304153
Vincenzo Guastafierro
1Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Milan, Italy;
2IRCCS Humanitas Research Hospital, via Manzoni 56, 20089 Rozzano, Milan, Italy;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Vincenzo Guastafierro
Devin Nicole Corbitt
1Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Milan, Italy;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alessandra Bressan
1Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Milan, Italy;
2IRCCS Humanitas Research Hospital, via Manzoni 56, 20089 Rozzano, Milan, Italy;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alessandra Bressan
Bethania Fernandes
2IRCCS Humanitas Research Hospital, via Manzoni 56, 20089 Rozzano, Milan, Italy;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Bethania Fernandes
Ömer Mintemur
2IRCCS Humanitas Research Hospital, via Manzoni 56, 20089 Rozzano, Milan, Italy;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Francesca Magnoli
3Unit of Pathology, Department of Oncology, ASST Sette Laghi, Varese, Italy;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Francesca Magnoli
Susanna Ronchi
3Unit of Pathology, Department of Oncology, ASST Sette Laghi, Varese, Italy;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stefano La Rosa
3Unit of Pathology, Department of Oncology, ASST Sette Laghi, Varese, Italy;
4Unit of Pathology, Department of Medicine and Technological Innovation, University of Insubria, Varese, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Stefano La Rosa
Silvia Uccella
1Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Milan, Italy;
2IRCCS Humanitas Research Hospital, via Manzoni 56, 20089 Rozzano, Milan, Italy;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Silvia Uccella
Salvatore Lorenzo Renne
1Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Milan, Italy;
2IRCCS Humanitas Research Hospital, via Manzoni 56, 20089 Rozzano, Milan, Italy;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Salvatore Lorenzo Renne
  • For correspondence: salvatore.renne{at}hunimed.eu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

ChatGPT is an artificial intelligence capable of processing and generating human-like language. ChatGPT’s role within clinical patient care and medical education has been explored; however, assessment of its potential in supporting histopathological diagnosis is lacking. In this study, we assessed ChatGPT’s reliability in addressing pathology-related diagnostic questions across 10 subspecialties, as well as its ability to provide scientific references. We created five clinico-pathological scenarios for each subspecialty, posed to ChatGPT as open-ended or multiple-choice questions. Each question either asked for scientific references or not. Outputs were assessed by six pathologists according to: 1) usefulness in supporting the diagnosis and 2) absolute number of errors. All references were manually verified. We used directed acyclic graphs and structural causal models to determine the effect of each scenario type, field, question modality and pathologist evaluation. Overall, we yielded 894 evaluations. ChatGPT provided useful answers in 62.2% of cases. 32.1% of outputs contained no errors, while the remaining contained at least one error (maximum 18). ChatGPT provided 214 bibliographic references: 70.1% were correct, 12.1% were inaccurate and 17.8% did not correspond to a publication. Scenario variability had the greatest impact on ratings, followed by prompting strategy. Finally, latent knowledge across the fields showed minimal variation. In conclusion, ChatGPT provided useful responses in one-third of cases, but the number of errors and variability highlight that it is not yet adequate for everyday diagnostic practice and should be used with discretion as a support tool. The lack of thoroughness in providing references also suggests caution should be employed even when used as a self-learning tool. It is essential to recognize the irreplaceable role of human experts in synthesizing images, clinical data and experience for the intricate task of histopathological diagnosis.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This study did not receive any funding.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

All data produced are available online at https://github.com/slrenne/PathGPT

https://github.com/slrenne/PathGPT

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.
Back to top
PreviousNext
Posted March 13, 2024.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Evaluation of ChatGPT’s Usefulness and Accuracy in Diagnostic Surgical Pathology
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Evaluation of ChatGPT’s Usefulness and Accuracy in Diagnostic Surgical Pathology
Vincenzo Guastafierro, Devin Nicole Corbitt, Alessandra Bressan, Bethania Fernandes, Ömer Mintemur, Francesca Magnoli, Susanna Ronchi, Stefano La Rosa, Silvia Uccella, Salvatore Lorenzo Renne
medRxiv 2024.03.12.24304153; doi: https://doi.org/10.1101/2024.03.12.24304153
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Evaluation of ChatGPT’s Usefulness and Accuracy in Diagnostic Surgical Pathology
Vincenzo Guastafierro, Devin Nicole Corbitt, Alessandra Bressan, Bethania Fernandes, Ömer Mintemur, Francesca Magnoli, Susanna Ronchi, Stefano La Rosa, Silvia Uccella, Salvatore Lorenzo Renne
medRxiv 2024.03.12.24304153; doi: https://doi.org/10.1101/2024.03.12.24304153

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Pathology
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)