Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Is ChatGPT Better Than Epileptologists at Interpreting Seizure Semiology?

Meng Jiao, Yaxi Luo, Neel Fotedar, Ioannis Karakis, Vikram R. Rao, Melissa Asmar, Xiaochen Xian, Orwa Aboud, Yuxin Wen, Jack J. Lin, Felix Rosenow, Hai Sun, View ORCID ProfileFeng Liu
doi: https://doi.org/10.1101/2024.04.13.24305773
Meng Jiao
1School of Systems and Enterprises, Stevens Institute of Technology
MS
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yaxi Luo
2Department of Computer Science, Schaefer School of Engineering & Science, Stevens Institute of Technology
BS
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Neel Fotedar
3Department of Neurology, University Hospitals Cleveland Medical Center, School of Medicine at Case Western Reserve University
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ioannis Karakis
4Department of Neurology, Emory University School of Medicine and Department of Neurology, University of Crete School of Medicine
MD, PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vikram R. Rao
5Department of Neurology and Weill Institute for Neurosciences, University of California San Francisco;
MD, PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Melissa Asmar
6Department of Neurology, University of California Davis
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiaochen Xian
7Department of Industrial & Systems Engineering, University of Florida
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Orwa Aboud
8Department of Neurology and Neurological Surgery, University of California Davis
MD, PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuxin Wen
9Fowler School of Engineering, Chapman University
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jack J. Lin
10Department of Neurology, University of California Davis
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Felix Rosenow
11Goethe-University Frankfurt, Epilepsy Center Frankfurt Rhine-Main, Department of Neurology
MD, MHBA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hai Sun
12Department of Neurosurgery, Rutgers Robert Wood Johnson Medical School, Rutgers University
MD, PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Feng Liu
13School of Systems and Enterprises, Stevens Institute of Technology.
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Feng Liu
  • For correspondence: fliu22{at}stevens.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Background Utilizing large language models (LLMs), primarily ChatGPT, to interpret the seizure semiology with focal epilepsy could yield valuable data for presurgical assessment. Assessing the reliability and comparability of LLM-generated responses with those from well-trained neurologists, especially epileptologists, is crucial for ascertaining the value of LLMs in the presurgical evaluation.

Methods A total of 865 descriptions of seizure semiology and validated epileptogenic zone (EZ) pairs were derived from 189 public papers. These semiology records were utilized as input of ChatGPT to generate responses on the most likely locations of EZ. Additionally, a panel of 5 epileptologists was recruited to complete an online survey by providing responses on EZ locations based on 100 well-defined semiology records. All responses from ChatGPT and epileptologists were graded for their reliability score (RS) and regional accuracy rate (RAR).

Results In evaluating responses to semiology queries, the highest RARs in each general region from ChatGPT-4.0 were 89.28% for the frontal lobe and 71.39% for the temporal lobe. However, the RAR was lower for the occipital lobe at 46.24%, the parietal lobe at 31.01%, the insular cortex at 8.51%, and the cingulate cortex at 2.78%. Comparatively, the RAR achieved by epileptologists was 82.76% for the frontal lobe, 58.33% for the temporal lobe, 68.42% for the occipital lobe, 50% for the parietal lobe, 60% for the insular cortex, and 28.57% for the cingulate cortex.

Conclusions In this study of seizure semiology interpretation, ChatGPT-4.0 outperformed epileptologists in interpreting seizure semiology originating in the frontal and temporal lobes, whereas epileptologists outperformed ChatGPT-4.0 in the occipital and parietal lobes, and significantly outperformed in the insular cortex and cingulate cortex. ChatGPT demonstrates the potential to assist in the preoperative assessment for epilepsy surgery. Presumably, with the continuous development of LLM, the reliability of ChatGPT will be strengthened in the foreseeable future.

Competing Interest Statement

Dr. Aboud has served on the advisory board for Servier and is supported in part by the UC Davis Paul Calabresi Career Development Award for Clinical Oncology as funded by the National Cancer Institute/National Institutes of Health through grant #2K12CA138464-11. Dr. Rosenow has received research support from the Federal State of Hesse, specifically at the Center for Personalized Translational Epilepsy Research from 2018 to 2022. Dr. Rosenow has received research support from Chaja-Foundation Frankfurt, focusing on establishing and evaluating the ketogenic diet in institution. Dr. Rosenow received research support from Reiss-Foundation Frankfurt, mainly for the research on the ketogenic diet in GLUT1-DS. Dr. Rosenow received research support from German Ministry of Education, focusing on the ERAPerMed Raise-Genic.

Funding Statement

This study did not receive any funding.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The data used in this paper were collected from published papers from multiple journals and publishers.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • Updated two brain plot figures.

Data Availability

All data produced in the present study are available upon reasonable request to the authors after the completion of the project.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted May 30, 2024.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Is ChatGPT Better Than Epileptologists at Interpreting Seizure Semiology?
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Is ChatGPT Better Than Epileptologists at Interpreting Seizure Semiology?
Meng Jiao, Yaxi Luo, Neel Fotedar, Ioannis Karakis, Vikram R. Rao, Melissa Asmar, Xiaochen Xian, Orwa Aboud, Yuxin Wen, Jack J. Lin, Felix Rosenow, Hai Sun, Feng Liu
medRxiv 2024.04.13.24305773; doi: https://doi.org/10.1101/2024.04.13.24305773
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Is ChatGPT Better Than Epileptologists at Interpreting Seizure Semiology?
Meng Jiao, Yaxi Luo, Neel Fotedar, Ioannis Karakis, Vikram R. Rao, Melissa Asmar, Xiaochen Xian, Orwa Aboud, Yuxin Wen, Jack J. Lin, Felix Rosenow, Hai Sun, Feng Liu
medRxiv 2024.04.13.24305773; doi: https://doi.org/10.1101/2024.04.13.24305773

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Neurology
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)