medRxiv

Fact Check: Assessing the Response of ChatGPT to Alzheimer’s Disease Statements with Varying Degrees of Misinformation

Sean S. Huang, Qingyuan Song, Kimberly J. Beiting, Maria C. Duggan, Kristin Hines, Harvey Murff, Vania Leung, James Powers, T.S. Harvey, Bradley Malin, Zhijun Yin
doi: https://doi.org/10.1101/2023.09.04.23294917
Sean S. Huang
(a) Department of Medicine, Division of Geriatrics, Vanderbilt University Medical Center, Nashville, TN, USA
(b) Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
M.D.
  • For correspondence: sean.huang{at}vumc.org
Qingyuan Song
(c) Department of Computer Science, Vanderbilt University, Nashville, TN, USA
M.E.
Kimberly J. Beiting
(a) Department of Medicine, Division of Geriatrics, Vanderbilt University Medical Center, Nashville, TN, USA
M.D.
Maria C. Duggan
(a) Department of Medicine, Division of Geriatrics, Vanderbilt University Medical Center, Nashville, TN, USA
(d) Geriatric Research Education and Clinical Center (GRECC), Department of Veterans Affairs, Tennessee Valley Healthcare System, Nashville, TN, USA
M.D.
Kristin Hines
(a) Department of Medicine, Division of Geriatrics, Vanderbilt University Medical Center, Nashville, TN, USA
M.D.
Harvey Murff
(a) Department of Medicine, Division of Geriatrics, Vanderbilt University Medical Center, Nashville, TN, USA
M.D.
Vania Leung
(e) Department of Academic Internal Medicine and Geriatrics, University of Illinois at Chicago, Chicago, IL, USA
M.D.
James Powers
(a) Department of Medicine, Division of Geriatrics, Vanderbilt University Medical Center, Nashville, TN, USA
(d) Geriatric Research Education and Clinical Center (GRECC), Department of Veterans Affairs, Tennessee Valley Healthcare System, Nashville, TN, USA
M.D.
T.S. Harvey
(f) Department of Anthropology, Vanderbilt University, Nashville, TN, USA
Ph.D.
Bradley Malin
(b) Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
(c) Department of Computer Science, Vanderbilt University, Nashville, TN, USA
(g) Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
Ph.D.
Zhijun Yin
(b) Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
(c) Department of Computer Science, Vanderbilt University, Nashville, TN, USA
Ph.D.

Abstract

Background Many myths about Alzheimer’s disease (AD) circulate on the Internet, each carrying a different degree of accuracy, inaccuracy, and misinformation. Large language models (LLMs) such as ChatGPT may be useful tools for assessing the veracity of these myths. However, they can also introduce misinformation. The objective of this study is to assess ChatGPT’s ability to identify and address AD myths with reliable information.

Methods We conducted a cross-sectional study of clinicians’ evaluations of ChatGPT (GPT 4.0) responses to 20 selected AD myths. We prompted ChatGPT to express its opinion on each myth and then asked it to rephrase its explanation in simplified language that could be more readily understood by individuals with a middle school education. We implemented a survey in REDCap to determine the degree to which clinicians agreed with the accuracy of each of ChatGPT’s explanations and the degree to which the simplified rewriting was readable and retained the message of the original. We also collected their explanations for any disagreement with ChatGPT’s responses. We used a five-point Likert-type scale, with scores ranging from -2 to 2, to quantify clinicians’ agreement in each aspect of the evaluation.
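
The scoring described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' analysis code: the label-to-score mapping, the function names, and the example ratings are all hypothetical, and the sketch assumes that the summary statistic is the mean and sample standard deviation of the per-myth mean scores, which the abstract does not specify.

```python
from statistics import mean, stdev

# Hypothetical label-to-score mapping; the source states only the -2 to 2 range.
LIKERT_SCORES = {
    "strongly disagree": -2,
    "disagree": -1,
    "neutral": 0,
    "agree": 1,
    "strongly agree": 2,
}


def score_responses(labels):
    """Convert a list of Likert labels to numeric scores."""
    return [LIKERT_SCORES[label.lower()] for label in labels]


def summarize(ratings_by_myth):
    """Return per-myth mean scores plus the mean and sample SD of those means.

    ratings_by_myth maps a myth identifier to the list of clinician labels.
    """
    per_myth = {
        myth: mean(score_responses(labels))
        for myth, labels in ratings_by_myth.items()
    }
    values = list(per_myth.values())
    return per_myth, mean(values), stdev(values)


# Made-up ratings for three myths, for illustration only (not study data).
example = {
    "myth_1": ["agree", "strongly agree", "agree"],
    "myth_2": ["neutral", "agree", "disagree"],
    "myth_3": ["strongly agree", "strongly agree", "agree"],
}
per_myth, overall_mean, overall_sd = summarize(example)
print(f"mean (SD) across myths: {overall_mean:.1f} (±{overall_sd:.1f})")
```

A symmetric scale centered on zero makes the sign of a mean score directly interpretable: positive values indicate net agreement and negative values indicate net disagreement.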

Results The clinicians (n=11) were generally satisfied with ChatGPT’s explanations, with a mean (SD) score of 1.0 (±0.3) across the 20 myths. While ChatGPT correctly identified all 20 myths as inaccurate, some clinicians disagreed with its explanations for 7 of the myths.

Overall, 9 of the 11 professionals either agreed or strongly agreed that ChatGPT has the potential to provide meaningful explanations of certain myths.

Conclusions The majority of surveyed healthcare professionals acknowledged the potential value of ChatGPT in mitigating AD misinformation. However, they highlighted the need for more refined and detailed explanations of the disease’s mechanisms and treatments.

Impact Statement Many statements about the diagnosis, management, and treatment of Alzheimer’s disease (AD) circulate on the Internet, each carrying a different degree of accuracy, inaccuracy, and misinformation. Large language models (LLMs) are widely discussed, and many patients and caregivers may turn to LLMs such as ChatGPT to learn more about the disease. This study aims to assess ChatGPT’s ability to identify and address AD myths with reliable information. We certify that this work is novel.

Key Points

  • Geriatricians acknowledged the potential value of ChatGPT in mitigating misinformation about Alzheimer’s disease.

  • There remain nuanced cases where ChatGPT’s explanations are not as refined or appropriate.

  • Why does this matter? Large language models such as ChatGPT are now widely used, and patients and caregivers may often use them to learn about the disease. The paper seeks to determine whether ChatGPT does an appropriate job of moderating the understanding of Alzheimer’s disease myths.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

Research reported in this paper was supported by the National Institutes of Health under award number U54HG012510.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

IRB of Vanderbilt University gave ethical approval for this work.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • Funding Disclosure: Research reported in this paper was supported by the National Institutes of Health under award number U54HG012510.

  • Dr. Beiting is supported by the Health Resources and Services Administration (HRSA) of the U.S. Department of Health and Human Services (HHS) under grant number K01HP49070. This information or content and conclusions are those of the author and should not be construed as the official position or policy of, nor should any endorsements be inferred by HRSA, HHS or the U.S. Government.

Data Availability

All data produced in the present study are available upon reasonable request to the authors.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted September 07, 2023.

Subject Area

  • Geriatric Medicine