Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

ChatGPT for automating lung cancer staging: feasibility study on open radiology report dataset

View ORCID ProfileYuta Nakamura, View ORCID ProfileTomohiro Kikuchi, View ORCID ProfileYosuke Yamagishi, View ORCID ProfileShouhei Hanaoka, View ORCID ProfileTakahiro Nakao, View ORCID ProfileSoichiro Miki, View ORCID ProfileTakeharu Yoshikawa, View ORCID ProfileOsamu Abe
doi: https://doi.org/10.1101/2023.12.11.23299107
Yuta Nakamura
1Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital. 7-3-1 Hongo, Bunkyo, Tokyo, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yuta Nakamura
  • For correspondence: yutanakamura-tky{at}umin.ac.jp
Tomohiro Kikuchi
1Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital. 7-3-1 Hongo, Bunkyo, Tokyo, Japan
2Department of Radiology, Jichi Medical University, School of Medicine. 3311-1 Yakushiji, Shimot-suke, Tochigi, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Tomohiro Kikuchi
Yosuke Yamagishi
3Division of Radiology and Biomedical Engineering, Graduate School of Medicine, The University of Tokyo. 7-3-1 Hongo, Bunkyo, Tokyo, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yosuke Yamagishi
Shouhei Hanaoka
3Division of Radiology and Biomedical Engineering, Graduate School of Medicine, The University of Tokyo. 7-3-1 Hongo, Bunkyo, Tokyo, Japan
4Department of Radiology, The University of Tokyo Hospital. 7-3-1 Hongo, Bunkyo, Tokyo, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Shouhei Hanaoka
Takahiro Nakao
1Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital. 7-3-1 Hongo, Bunkyo, Tokyo, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Takahiro Nakao
Soichiro Miki
1Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital. 7-3-1 Hongo, Bunkyo, Tokyo, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Soichiro Miki
Takeharu Yoshikawa
1Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital. 7-3-1 Hongo, Bunkyo, Tokyo, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Takeharu Yoshikawa
Osamu Abe
3Division of Radiology and Biomedical Engineering, Graduate School of Medicine, The University of Tokyo. 7-3-1 Hongo, Bunkyo, Tokyo, Japan
4Department of Radiology, The University of Tokyo Hospital. 7-3-1 Hongo, Bunkyo, Tokyo, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Osamu Abe
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Objectives CT imaging is essential in the initial staging of lung cancer. However, free-text radiology reports do not always directly mention clinical TNM stages. We explored the capability of OpenAI’s ChatGPT to automate lung cancer staging from CT radiology reports.

Methods We used MedTxt-RR-JA, a public de-identified dataset of 135 CT radiology reports for lung cancer. Two board-certified radiologists assigned clinical TNM stage for each radiology report by consensus. We used a part of the dataset to empirically determine the optimal prompt to guide ChatGPT. Using the remaining part of the dataset, we (i) compared the performance of two ChatGPT models (GPT-3.5 Turbo and GPT-4), (ii) compared the performance when the TNM classification rule was or was not presented in the prompt, and (iii) performed subgroup analysis regarding the T category.

Results The best accuracy scores were achieved by GPT-4 when it was presented with the TNM classification rule (52.2%, 78.9%, and 86.7% for the T, N, and M categories). Most ChatGPT’s errors stemmed from challenges with numerical reasoning and insufficiency in anatomical or lexical knowledge.

Conclusions ChatGPT has the potential to become a valuable tool for automating lung cancer staging. It can be a good practice to use GPT-4 and incorporate the TNM classification rule into the prompt. Future improvement of ChatGPT would involve supporting numerical reasoning and complementing knowledge.

Clinical relevance statement ChatGPT’s performance for automating cancer staging still has room for enhancement, but further improvement would be helpful for individual patient care and secondary information usage for research purposes.

Key points

  • ChatGPT, especially GPT-4, has the potential to automatically assign clinical TNM stage of lung cancer based on CT radiology reports.

  • It was beneficial to present the TNM classification rule to ChatGPT to improve the performance.

  • ChatGPT would further benefit from supporting numerical reasoning or providing anatomical knowledge.

Figure
  • Download figure
  • Open in new tab

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This study did not receive any funding.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

All data produced are available online at https://sociocom.naist.jp/medtxt/rr/

https://sociocom.naist.jp/medtxt/rr/

  • Abbreviations and acronyms

    cTNM
    clinical TNM stage
    NLP
    natural language processing
    API
    application programming interface
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
    Back to top
    PreviousNext
    Posted December 13, 2023.
    Download PDF
    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    ChatGPT for automating lung cancer staging: feasibility study on open radiology report dataset
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    ChatGPT for automating lung cancer staging: feasibility study on open radiology report dataset
    Yuta Nakamura, Tomohiro Kikuchi, Yosuke Yamagishi, Shouhei Hanaoka, Takahiro Nakao, Soichiro Miki, Takeharu Yoshikawa, Osamu Abe
    medRxiv 2023.12.11.23299107; doi: https://doi.org/10.1101/2023.12.11.23299107
    Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    ChatGPT for automating lung cancer staging: feasibility study on open radiology report dataset
    Yuta Nakamura, Tomohiro Kikuchi, Yosuke Yamagishi, Shouhei Hanaoka, Takahiro Nakao, Soichiro Miki, Takeharu Yoshikawa, Osamu Abe
    medRxiv 2023.12.11.23299107; doi: https://doi.org/10.1101/2023.12.11.23299107

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Radiology and Imaging
    Subject Areas
    All Articles
    • Addiction Medicine (349)
    • Allergy and Immunology (668)
    • Allergy and Immunology (668)
    • Anesthesia (181)
    • Cardiovascular Medicine (2648)
    • Dentistry and Oral Medicine (316)
    • Dermatology (223)
    • Emergency Medicine (399)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
    • Epidemiology (12228)
    • Forensic Medicine (10)
    • Gastroenterology (759)
    • Genetic and Genomic Medicine (4103)
    • Geriatric Medicine (387)
    • Health Economics (680)
    • Health Informatics (2657)
    • Health Policy (1005)
    • Health Systems and Quality Improvement (985)
    • Hematology (363)
    • HIV/AIDS (851)
    • Infectious Diseases (except HIV/AIDS) (13695)
    • Intensive Care and Critical Care Medicine (797)
    • Medical Education (399)
    • Medical Ethics (109)
    • Nephrology (436)
    • Neurology (3882)
    • Nursing (209)
    • Nutrition (577)
    • Obstetrics and Gynecology (739)
    • Occupational and Environmental Health (695)
    • Oncology (2030)
    • Ophthalmology (585)
    • Orthopedics (240)
    • Otolaryngology (306)
    • Pain Medicine (250)
    • Palliative Medicine (75)
    • Pathology (473)
    • Pediatrics (1115)
    • Pharmacology and Therapeutics (466)
    • Primary Care Research (452)
    • Psychiatry and Clinical Psychology (3432)
    • Public and Global Health (6527)
    • Radiology and Imaging (1403)
    • Rehabilitation Medicine and Physical Therapy (814)
    • Respiratory Medicine (871)
    • Rheumatology (409)
    • Sexual and Reproductive Health (410)
    • Sports Medicine (342)
    • Surgery (448)
    • Toxicology (53)
    • Transplantation (185)
    • Urology (165)