Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

From Answers to Insights: Unveiling the Strengths and Limitations of ChatGPT and Biomedical Knowledge Graphs

Yu Hou, Jeremy Yeung, Hua Xu, Chang Su, Fei Wang, View ORCID ProfileRui Zhang
doi: https://doi.org/10.1101/2023.06.09.23291208
Yu Hou
1Department of Surgery, University of Minnesota, Minneapolis, MN, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jeremy Yeung
1Department of Surgery, University of Minnesota, Minneapolis, MN, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hua Xu
2Section of Biomedical Informatics and Data Science, Yale University, New Haven, Connecticut, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chang Su
3Department of Health Service Administration and Policy, Temple University, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Fei Wang
4Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rui Zhang
1Department of Surgery, University of Minnesota, Minneapolis, MN, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Rui Zhang
  • For correspondence: zhan1386{at}umn.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Large Language Models (LLMs) have demonstrated exceptional performance in various natural language processing tasks, utilizing their language generation capabilities and knowledge acquisition potential from unstructured text. However, when applied to the biomedical domain, LLMs encounter limitations, resulting in erroneous and inconsistent answers. Knowledge Graphs (KGs) have emerged as valuable resources for structured information representation and organization. Specifically, Biomedical Knowledge Graphs (BKGs) have attracted significant interest in managing large-scale and heterogeneous biomedical knowledge. This study evaluates the capabilities of ChatGPT and existing BKGs in question answering, knowledge discovery, and reasoning. Results indicate that while ChatGPT with GPT-4.0 surpasses both GPT-3.5 and BKGs in providing existing information, BKGs demonstrate superior information reliability. Additionally, ChatGPT exhibits limitations in performing novel discoveries and reasoning, particularly in establishing structured links between entities compared to BKGs. To overcome these limitations, future research should focus on integrating LLMs and BKGs to leverage their respective strengths. Such an integrated approach would optimize task performance and mitigate potential risks, thereby advancing knowledge in the biomedical field and contributing to overall well-being.

Introduction

In recent years, Large Language Models (LLMs) have exhibited exceptional performance across a diverse range of natural language processing tasks1–3. LLMs, such as GPT-3.5 and GPT-4, are powerful models trained on vast amounts of textual data, enabling them to generate human-like text and perform various language-related tasks4. These models have found applications in diverse domains, including chatbots, question-answering systems, and language translation, among others. Their ability to understand and generate text has sparked interest in exploring their potential to replace traditional knowledge resources.

Knowledge Graphs (KGs) serve as valuable repositories of structured information, and have gained significant attention due to their ability to represent and organize knowledge in a structured manner. They facilitate knowledge discovery, entity linking, and semantic querying, making them essential for various applications, including information retrieval, recommendation systems, and semantic search. In recent years, the field of biomedicine has witnessed the emergence of Biomedical Knowledge Graphs (BKGs) as a novel paradigm for managing large-scale and heterogeneous biomedical knowledge, which have garnered considerable interest in the biomedical community5–10. A BKG is a multi-relational graph or network that integrates, harmonizes, and stores biomedical knowledge acquired from single or multiple expert-derived knowledge sources. Over the past decade, substantial efforts have been dedicated to constructing BKGs by integrating diverse expert-curated knowledge bases6,8,11–13 and extracting knowledge from literature using natural language processing (NLP) techniques14–16. Consequently, numerous distinct BKGs have been developed17–20.

LLMs exhibit impressive language generation capabilities and have the potential to acquire knowledge from vast amounts of unstructured text. They can generate responses to questions and provide valuable insights. However, LLMs face several limitations when confronted with the biomedical domain, leading to issues like erroneous and inconsistent answers21–23. This study aims to evaluate the ChatGPT (a popular LLM) and BKG through comprehensive assessments encompassing querying existing biomedical knowledge, discovering novel knowledge, and providing reasoning capabilities. We shed light on the strengths and limitations of ChatGPT and existing KG, providing insights into their complementary roles in knowledge representation and utilization. Our findings contribute to the ongoing discussions surrounding the synergies and potential collaborations between LLMs and KGs in enhancing knowledge-driven applications.

Methods

To evaluate the effectiveness of ChatGPT and Knowledge Graphs (KGs), we conducted a comprehensive assessment based on their performance in answering drug-related and dietary supplements (DS)-related questions, their knowledge discovery capabilities, and the comprehensiveness of the knowledge they provide. Specifically, we investigated ChatGPT’s ability to generate accurate and relevant responses to drug-related and DS-related queries and its potential for knowledge discovery by identifying hidden patterns and relationships.

Performance Evaluation of ChatGPT and BKG in Question-Answering

The question-answering (Q&A) dataset was obtained by extracting questions (including their titles and contents) from the “Alternative Medicine” sub-category in Yahoo! Answers24. The questions were grouped into categories such as Adverse Effects, Background, Contraindication, Effectiveness, Indication, Interaction, Safety, Uncertain, Unclassified, and Usage. Initially, we randomly selected 5 questions from each group, resulting in a total of 50 questions. To collect responses from ChatGPT, we input the questions as prompts and record the generated answers. For retrieving answers from iDISK, an integrated knowledge graph focused on DS25, we followed a two-step process. Firstly, we identified the unique identifier of the subject and its corresponding relationship based on the question’s description. Next, we linked the identified object identifiers to their respective names and translated them into natural language. To evaluate the responses, we followed the LiveQA Track guidelines26 and assigned judgment scores on a scale of 1 to 4. Two experts who have medical backgrounds were introduced for manual scoring. A score of 1 indicated an incorrect response, 2 represented an incorrect but related answer, 3 denoted a correct but incomplete response, and 4 indicated a correct and complete answer. These scores were then transformed into a range of 0-3, with 0 indicating a poor or unreadable response, 1 representing a fair response, 2 signifying a good response, and 3 designating an excellent response. Using this scale, we calculated two metrics. Firstly, we computed the average score, which evaluated the first retrieved answer for each test question by converting the 1-4 level grades to the 0-3 scores26,27. Secondly, we measured the succ@i + metric, which represents the ratio of the number of questions with a score of i or higher (where i ranges from 2 to 4) to the total number of questions. For example, succ@2 + quantifies the percentage of questions that were answered by the conversational agent (CA) with at least a fair grade26. To assess the statistical differences in the performance of the three systems, first, the QQ-plot was performed to look at the normality of the data. Then a t-test was employed. The analysis is conducted using R 1.1 with the package “car28”.

Performance Evaluation of ChatGPT and BKG in Knowledge Discovery

To test knowledge discovery capabilities between ChatGPT and existing KGs, we devised a prediction scenario that emulates the task of drug repurposing through link prediction within a knowledge graph. Our primary objective was to prompt ChatGPT to identify drugs or DSs that are not presently utilized for the treatment or prevention of Alzheimer’s Disease (AD) but possess the potential to be employed in such capacities. The specifically crafted prompts included:

  1. Please provide the approved drugs that are not currently used to treat Alzheimer’s disease but are potentially available for the treatment of AD. And please give your rationale. (Drug)

  2. Please provide which dietary supplements have the potential to treat/prevent Alzheimer’s disease. And please give your rationale. (DS)

Subsequently, we examined the answers generated by ChatGPT to determine if these answers met the following criteria: 1. whether they were already present in existing KGs (specifically, iBKH29 for drugs and ADInt30 for DSs); 2. whether they were documented in clinical trials; and 3. whether they were supported by existing literature.

Performance Evaluation of ChatGPT and BKG in Knowledge Reasoning

To assess the comprehensiveness of ChatGPT’s knowledge base, we conducted an experiment to examine its capability in establishing associations between the proposed drug and DS candidates and AD. In our previous study, we investigated potential pharmaceuticals and DS for the treatment or prevention of AD using link prediction techniques29,30. Building upon these previous findings, our objective was to evaluate ChatGPT’s knowledge base by examining the associations it provides between these hypothetical drug/DS candidates and AD, as well as the corresponding references it offers to support these hypotheses. To accomplish this, we formulated scenario-based inquiries as follows:

  1. Please show the association/linkage (direct link or indirect link) between Caryophyllus aromaticus and Alzheimer’s disease (AD) in a structured way (like a triplet). And please provide the reference for your finding.

  2. Please show the association/linkage (direct link or indirect link) between Loperamide and Alzheimer’s disease (AD) in a structured way (like a triplet). And please provide the reference for your finding.

Results

We conducted a comprehensive evaluation of the performance of three systems, namely GPT-4.0, iDISK, and GPT-3.5, in addressing a set of 43 questions. Table 1 demonstrated the performance of three systems in the Q&A task. To satisfy the assumptions of the t-test, we plotted the Q-Q Plot for the scores of each system and found all scores to be normally distributed. The comparison between ChatGPT (GPT-3.5) and KG revealed comparable performance (p-value: 0.20), indicating that both systems possess similar capabilities in answering questions. Notably, ChatGPT (GPT-4.0) demonstrated better performance than iDSIK and ChatGPT (GPT-3.5) (both p-value < 0.05) with an average score of 2.12, outperforming iDISK (average score: 1.64) and GPT-3.5 (average score: 1.44). In terms of providing references, iDISK surpassed ChatGPT (both GPT-4.0 and GPT-3.5) by offering the name of the database from which the data was retrieved. Conversely, ChatGPT (both GPT-4.0 and GPT-3.5) fell short in providing valid references, as the mentioned article names and/or authors were found to be fabricated.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1. Performance of three systems in the Q&A task

Table 2 presents the results obtained from the conversation questions using ChatGPT (model GPT-3.5) to simulate a scenario related to drug repositioning. Upon conducting ten iterations of the question, we observed that ChatGPT (GPT-3.5) returned several identical responses. Specifically, Levetiracetam was suggested in 6 out of the ten iterations, while Lithium and Ibuprofen were each suggested in 5 iterations. Moreover, the majority of drugs suggested by ChatGPT (GPT-3.5) have been investigated for potential associations with Alzheimer’s Disease treatment in ClinicalTrials.gov. And, all of the drugs have been investigated in one or more scientific literature that explores their potential association with Alzheimer’s Disease. Upon examining these drugs within a comprehensive biomedical knowledge graph (iBKH), it was observed that nearly all the drugs suggested by ChatGPT (GPT-3.5) are present in iBKH with direct links to AD. For those drugs lacking direct links, the shortest distance between them and Alzheimer’s Disease is only 2, with reasonable paths existing, such as (Drug) - [Target_DG] - (Gene) - [Associate_DiG] - (AD). Analogous outcomes are observed in the employment of ChatGPT (GPT-3.5) to simulate exploratory scenarios involving novel DS that may potentially treat or prevent AD. In addition, all of the DSs recommended by ChatGPT (GPT-3.5) have been included in AD clinical trials. These DSs have been extensively researched for their correlation with AD, with findings documented in relevant academic publications. Furthermore, direct connections between the suggested DSs and AD can be identified within ADInt, a comprehensive knowledge graph encompassing AD-related concepts and various potential interventions. The recorded pathway in ADInt is denoted as (DS) - [Treats/Prevents] - (AD). Additionally, we executed a parallel experimental manipulation employing ChatGPT (GPT-4.0). Despite the enhanced diversity of the recommended drugs, they remained present in established clinical trials and literature. Concurrently, the associated pathway records persisted within the extant knowledge graph. (The detailed information was shown in the supplementary table 1.)

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2.

Figure 1 presents the responses of ChatGPT (GPT-3.5) and ChatGPT (GPT-4.0) to the devised scenario query 1, respectively. An examination of their replies reveals that ChatGPT endeavored to establish a structured connection between the hypothetical DS (Caryophyllus aromaticus) and Alzheimer’s disease (AD). Nonetheless, upon verification of the references provided by ChatGPT, it becomes evident that these citations are not authentic. Similar outcomes were observed during the investigation of novel drugs in the designed scenario 2. Figure 2 illustrates the attempts made by ChatGPT (GPT-3.5) and ChatGPT (GPT-4.0) to present structured links between the hypothetical drug (Loperamide) and AD. The connection offered by GPT-3.5 continues to be unsuccessful in validating the authenticity of the provided reference. Moreover, the response generated by GPT-4.0 indicates that it could not establish any potential direct or indirect association between Loperamide and AD.

Figure 1.
  • Download figure
  • Open in new tab
Figure 1. Responses of ChatGPT (GPT-3.5 for Fig. a and GPT-4.0 for Fig. b) to the Knowledge Reasoning task (DS)
Figure 2.
  • Download figure
  • Open in new tab
Figure 2. Responses of ChatGPT (GPT-3.5 for Fig. a and GPT-4.0 for Fig. b) to the Knowledge Reasoning task (Drug)

Discussion

During the process of querying existing information, both ChatGPT (GPT-3.5) and BKG exhibited comparable capabilities, demonstrating their effectiveness in providing relevant information based on user queries. However, with the introduction of ChatGPT (GPT-4.0), a notable improvement in performance was observed compared to GPT-3 and BKG. The enhanced capabilities of ChatGPT (GPT-4.0) allowed for more accurate and comprehensive responses, surpassing the performance of both GPT-3 and KGs in this context. However, when it comes to the reliability of information sources, KG exhibited a clear advantage. KGs are built upon curated and structured knowledge from trusted sources, ensuring the reliability and accuracy of the information contained within them. This advantage stems from the rigorous data collection and validation processes employed in constructing BKG. On the other hand, ChatGPT’s responses are generated based on patterns and associations learned from a vast amount of text data, which may include both reliable and unreliable sources. As a result, while ChatGPT can provide quick responses, there may be a higher risk of encountering misinformation or inaccuracies compared to BKG. Therefore, when considering the reliability and trustworthiness of the information provided, KGs offer a more dependable and robust solution. Consequently, our findings highlight the potential benefits of integrating knowledge graph-based approaches with ChatGPT to enhance its domain-specific knowledge and overall performance in specialized applications. Further research is required to explore the feasibility of this integration and its implications on the efficacy of ChatGPT in diverse domains.

In addition, we discovered that ChatGPT is unable to perform the novel finding task based on existing knowledge, which is a critical limitation when considering its application in scientific research and discovery. In the second experiment, we attempted to simulate drug repurposing using ChatGPT as a means to generate innovative insights. The results of this experiment revealed that ChatGPT primarily provided outputs that were derived from pre-existing information. This information could either be directly queried within a knowledge graph or easily found in relevant resources, suggesting that ChatGPT’s capacity for generating truly novel findings is limited. These outcomes can be attributed to the underlying training data and architecture of ChatGPT, which is designed to draw upon its vast knowledge base to produce contextually relevant and coherent responses, rather than extrapolate new insights or hypothesize potential connections. This limitation highlights the need for developing advanced AI models that can not only process and comprehend existing knowledge but also deduce novel findings by identifying hidden patterns and relationships.

Our investigation revealed that ChatGPT exhibits limitations in its ability to establish a structured link between two entities based on existing knowledge. In our third experiment, we assessed the performance of ChatGPT in comparison to the BKG with respect to the establishment of relationships between entities. The results demonstrated that ChatGPT was unable to provide a structured link between two entities as effectively as the knowledge graph, underscoring its shortcomings in this specific task. Furthermore, the credibility of the results returned by ChatGPT emerged as a significant concern. Our findings indicated that the accuracy of its responses necessitates further verification, as the information provided by ChatGPT may not always be reliable. In the third experiment, it became evident that none of the references furnished by ChatGPT were genuine, casting doubt on the trustworthiness of the information it generated. These observations highlight the need for rigorous validation and verification mechanisms when employing ChatGPT for tasks that require high levels of accuracy and credibility. Consequently, future research should explore strategies to enhance the reliability of ChatGPT’s outputs, such as incorporating external validation sources or refining its training data to improve its capacity to provide accurate and credible information.

Our study assesses the capabilities of ChatGPT and existing BKGs in question answering, knowledge discovery, and knowledge reasoning. While ChatGPT with GPT-4.0 outperformed both GPT-3.5 and BKGs (both are comparable) to provide existing information; BKGs exhibited a clear advantage in terms of information reliability over both GPT models. Our findings revealed limitations in ChatGPT’s ability to perform novel discoveries based on existing knowledge. Furthermore, our investigation highlighted ChatGPT’s limitations in providing reasoning for knowledge discovery (e.g., establishing structured links between entities compared to BKGs). In conclusion, future investigations should prioritize the development of methodologies that integrate LLMs and BKGs, allowing researchers to harness the unique capabilities of each approach. This holistic approach would not only optimize task performance but also enable mitigating potential risks, thus advancing knowledge in the biomedical field and contributing to the overall well-being of individuals.

Data Availability

All data produced are available online

Funding

Research reported in this publication was supported by the National Institutes of Health (NIH)/National Institute On Aging (NIA) under Award Number R01AG078154 (PI: RZ) and the NIH/National Center For Complementary & Integrative Health (NCCIH) under Award Number R01AT00945 (PI: RZ).

Author Contributions

YH and RZ conceived the study design and wrote the initial draft of the manuscript. YH implemented the experiments of the study. All authors contributed to the production of the final manuscript. JY contributed to the statistics. RZ, HX, CS and FW advised this project. All authors contributed to the production of the final manuscript.

Reference

  1. 1.↵
    Fan, L. et al. A Bibliometric Review of Large Language Models Research from 2017 to 2023. (2023) doi:10.48550/ARXIV.2304.02020.
    OpenUrlCrossRef
  2. 2.
    De Angelis, L. et al. ChatGPT and the rise of large language models: the new AI-driven infodemic threat in public health. Front. Public Health 11, 1166120 (2023).
    OpenUrl
  3. 3.↵
    Birhane, A., Kasirzadeh, A., Leslie, D. & Wachter, S. Science in the age of large language models. Nat. Rev. Phys. 5, 277–280 (2023).
    OpenUrl
  4. 4.↵
    1. Larochelle, H.,
    2. Ranzato, M.,
    3. Hadsell, R.,
    4. Balcan, M. F. &
    5. Lin, H.
    Brown, T. et al./person-group>. Language Models are Few-Shot Learners. in Advances in Neural Information Processing Systems (eds. Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M. F. & Lin, H.) vol. 33 1877–1901 (Curran Associates, Inc., 2020).
    OpenUrl
  5. 5.↵
    Nicholson, D. N. & Greene, C. S. Constructing knowledge graphs and their biomedical applications. Comput. Struct. Biotechnol. J. 18, 1414–1428 (2020).
    OpenUrl
  6. 6.↵
    Santos, A. et al. A knowledge graph to interpret clinical proteomics data. Nat. Biotechnol. 40, 692–702 (2022).
    OpenUrl
  7. 7.
    Nelson, C. A., Butte, A. J. & Baranzini, S. E. Integrating biomedical research and electronic health records to create knowledge-based biologically meaningful machine-readable embeddings. Nat. Commun. 10, 3045 (2019).
    OpenUrl
  8. 8.↵
    Himmelstein, D. S. et al. Systematic integration of biomedical knowledge prioritizes drugs for repurposing. eLife 6, e26726 (2017).
    OpenUrlCrossRefPubMed
  9. 9.
    Sügis, E. et al. HENA, heterogeneous network-based data set for Alzheimer’s disease. Sci. Data 6, 151 (2019).
    OpenUrl
  10. 10.↵
    Rotmensch, M., Halpern, Y., Tlimat, A., Horng, S. & Sontag, D. Learning a Health Knowledge Graph from Electronic Medical Records. Sci. Rep. 7, 5994 (2017).
    OpenUrl
  11. 11.↵
    Zhu, Y. et al. Knowledge-driven drug repurposing using a comprehensive drug knowledge graph. Health Informatics J. 26, 2737–2750 (2020).
    OpenUrlCrossRef
  12. 12.
    Yu, Y. et al. PreMedKB: an integrated precision medicine knowledgebase for interpreting relationships between diseases, genes, variants and drugs. Nucleic Acids Res. 47, D1090–D1101 (2019).
    OpenUrlCrossRef
  13. 13.↵
    Li, N. et al. KGHC: a knowledge graph for hepatocellular carcinoma. BMC Med. Inform. Decis. Mak. 20, 135 (2020).
    OpenUrl
  14. 14.↵
    Percha, B. & Altman, R. B. A global network of biomedical relationships derived from text. Bioinformatics 34, 2614–2624 (2018).
    OpenUrl
  15. 15.
    Ernst, P., Siu, A. & Weikum, G. KnowLife: a versatile approach for constructing a large knowledge graph for biomedical sciences. BMC Bioinformatics 16, 157 (2015).
    OpenUrlCrossRef
  16. 16.↵
    Yuan, J. et al. Constructing biomedical domain-specific knowledge graph with minimum supervision. Knowl. Inf. Syst. 62, 317–336 (2020).
    OpenUrl
  17. 17.↵
    Rubin, D. L., Shah, N. H. & Noy, N. F. Biomedical ontologies: a functional perspective. Brief. Bioinform. 9, 75–90 (2007).
    OpenUrlCrossRefPubMedWeb of Science
  18. 18.
    Smith, B. et al. Relations in biomedical ontologies. Genome Biol. 6, R46 (2005).
    OpenUrlCrossRefPubMed
  19. 19.
    Zhu, Y., Elemento, O., Pathak, J. & Wang, F. Drug knowledge bases and their applications in biomedical informatics research. Brief. Bioinform. 20, 1308–1321 (2019).
    OpenUrl
  20. 20.↵
    Callahan, T. J., Tripodi, I. J., Pielke-Lombardo, H. & Hunter, L. E. Knowledge-Based Biomedical Data Science. Annu. Rev. Biomed. Data Sci. 3, 23–41 (2020).
    OpenUrl
  21. 21.↵
    Munoz-Zuluaga, C., Zhao, Z., Wang, F., Greenblatt, M. B. & Yang, H. S. Assessing the Accuracy and Clinical Utility of ChatGPT in Laboratory Medicine. Clin. Chem. hvad058 (2023) doi:10.1093/clinchem/hvad058.
    OpenUrlCrossRef
  22. 22.
    Omar, R., Mangukiya, O., Kalnis, P. & Mansour, E. ChatGPT versus Traditional Question Answering for Knowledge Graphs: Current Status and Future Directions Towards Knowledge Graph Chatbots. (2023) doi:10.48550/ARXIV.2302.06466.
    OpenUrlCrossRef
  23. 23.↵
    Mountantonakis, M. & Tzitzikas, Y. Using Multiple RDF Knowledge Graphs for Enriching ChatGPT Responses. (2023) doi:10.48550/ARXIV.2304.05774.
    OpenUrlCrossRef
  24. 24.↵
    Yahoo! Answers Homepage. https://answers.yahoo.com/ (2020).
  25. 25.↵
    Rizvi, R. F. et al. iDISK: the integrated DIetary Supplements Knowledge base. J. Am. Med. Inform. Assoc. 27, 539–548 (2020).
    OpenUrlCrossRef
  26. 26.↵
    Overview of the TREC 2015 LiveQA Track. https://trec.nist.gov/pubs/trec24/papers/Overview-QA.pdf.
  27. 27.↵
    Demner-Fushman, D., Mrabet, Y. & Ben Abacha, A. Consumer health information and question answering: helping consumers find answers to their health-related information needs. J. Am. Med. Inform. Assoc. 27, 194–201 (2020).
    OpenUrl
  28. 28.↵
    Fox, J. & Weisberg, S. An R companion to applied regression. (Sage publications, 2011).
  29. 29.↵
    Su, C. et al. Biomedical discovery through the integrative biomedical knowledge hub (iBKH). Iscience 26, (2023).
  30. 30.↵
    Xiao, Y. et al. Repurposing Non-pharmacological Interventions for Alzheimer’s Diseases through Link Prediction on Biomedical Literature. medRxiv 2023.05. 15.23290002 (2023).
  31. 31.
    Sanchez, P. E. et al. Levetiracetam suppresses neuronal network dysfunction and reverses synaptic and cognitive deficits in an Alzheimer’s disease model. Proc. Natl. Acad. Sci. 109, (2012).
  32. 32.
    Vossel, K. et al. Effect of Levetiracetam on Cognition in Patients With Alzheimer Disease With and Without Epileptiform Activity: A Randomized Clinical Trial. JAMA Neurol. 78, 1345 (2021).
    OpenUrl
  33. 33.
    Matsunaga, S. et al. Lithium as a Treatment for Alzheimer’s Disease: A Systematic Review and Meta-Analysis. J. Alzheimers Dis. 48, 403–410 (2015).
    OpenUrl
  34. 34.
    Lim, G. P. et al. Ibuprofen Suppresses Plaque Pathology and Inflammation in a Mouse Model for Alzheimer’s Disease. J. Neurosci. 20, 5709–5714 (2000).
    OpenUrlAbstract/FREE Full Text
  35. 35.
    McKee, A. C. et al. Ibuprofen reduces Aβ, hyperphosphorylated tau and memory deficits in Alzheimer mice. Brain Res. 1207, 225–236 (2008).
    OpenUrlCrossRefPubMedWeb of Science
  36. 36.
    Kaeberlein, M. & Galvan, V. Rapamycin and Alzheimer’s disease: Time for a clinical trial? Sci. Transl. Med. 11, eaar4289 (2019).
    OpenUrlFREE Full Text
  37. 37.
    Sato, T. et al. Efficacy of PPAR-γ agonist pioglitazone in mild Alzheimer disease. Neurobiol. Aging 32, 1626–1633 (2011).
    OpenUrlCrossRefPubMedWeb of Science
  38. 38.
    Turner, R. S. et al. Nilotinib Effects on Safety, Tolerability, and Biomarkers in Alzheimer’s Disease. Ann. Neurol. 88, 183–194 (2020).
    OpenUrl
  39. 39.
    Austin, S. A. & Katusic, Z. S. Abstract 3063: Nitroglycerin Treatment Attenuates APP and BACE1 Protein Levels In Cerebral Microcirculation. Stroke 43, (2012).
  40. 40.
    Qing, H. et al. Valproic acid inhibits Aβ production, neuritic plaque formation, and behavioral deficits in Alzheimer’s disease mouse models. J. Exp. Med. 205, 2781–2789 (2008).
    OpenUrlAbstract/FREE Full Text
  41. 41.
    Zhang, X.-Z., Li, X.-J. & Zhang, H.-Y. Valproic acid as a promising agent to combat Alzheimer’s disease. Brain Res. Bull. 81, 3–6 (2010).
    OpenUrlCrossRefPubMedWeb of Science
  42. 42.
    Wang, L. Y. et al. Prazosin for the Treatment of Behavioral Symptoms in Patients With Alzheimer Disease With Agitation and Aggression. Am. J. Geriatr. Psychiatry 17, 744–751 (2009).
    OpenUrlCrossRefPubMedWeb of Science
  43. 43.
    Craft, S. Insulin Resistance Syndrome and Alzheimer Disease: Pathophysiologic Mechanisms and Therapeutic Implications. Alzheimer Dis. Assoc. Disord. 20, 298–301 (2006).
    OpenUrlCrossRefPubMedWeb of Science
  44. 44.
    Aso, E. & Ferrer, I. Cannabinoids for treatment of Alzheimer’s disease: moving toward the clinic. Front. Pharmacol. 5, (2014).
  45. 45.
    Campbell, V. A. & Gowran, A. Alzheimer’s disease; taking the edge off with cannabinoids?: Cannabinoids and Alzheimer’s disease. Br. J. Pharmacol. 152, 655–662 (2007).
    OpenUrlPubMed
  46. 46.
    Camargos, E. F. et al. Trazodone Improves Sleep Parameters in Alzheimer Disease Patients: A Randomized, Double-Blind, and Placebo-Controlled Study. Am. J. Geriatr. Psychiatry 22, 1565–1574 (2014).
    OpenUrlCrossRef
  47. 47.
    Matthews, D. C. et al. Riluzole, a glutamate modulator, slows cerebral glucose metabolism decline in patients with Alzheimer’s disease. Brain 144, 3742–3755 (2021).
    OpenUrl
  48. 48.
    Vallée, A., Vallée, J.-N., Guillevin, R. & Lecarpentier, Y. Riluzole: a therapeutic strategy in Alzheimer’s disease by targeting the WNT/β-catenin pathway. Aging 12, 3095–3113 (2020).
    OpenUrl
  49. 49.
    Oz, M., Lorke, D. E. & Petroianu, G. A. Methylene blue and Alzheimer’s disease. Biochem. Pharmacol. 78, 927–932 (2009).
    OpenUrlCrossRefPubMed
  50. 50.
    Morris, M. C. Dietary niacin and the risk of incident Alzheimer’s disease and of cognitive decline. J. Neurol. Neurosurg. Psychiatry 75, 1093–1099 (2004).
    OpenUrlAbstract/FREE Full Text
  51. 51.
    Chumakov, I. et al. Combining two repurposed drugs as a promising approach for Alzheimer’s disease therapy. Sci. Rep. 5, 7608 (2015).
    OpenUrl
  52. 52.
    Gupta, S. & Dasmahapatra, A. K. Destabilization of Aβ fibrils by omega-3 polyunsaturated fatty acids: a molecular dynamics study. J. Biomol. Struct. Dyn. 41, 581–598 (2023).
    OpenUrl
  53. 53.
    Browne, D., McGuinness, B., Woodside, J. V. & McKay, G. J. Vitamin E and Alzheimer’s disease: what do we know so far? Clin. Interv. Aging Volume 14, 1303–1317 (2019).
    OpenUrl
  54. 54.
    Bland, A. R., Ashton, J. C., Kamal, M. A. & Sahebkar, A. The Current Evidence for the Therapeutic Role of Curcumin inAlzheimer’s Disease. CNS Neurol. Disord. - Drug Targets 22, 318–320 (2023).
    OpenUrl
  55. 55.
    Liao, Z. et al. Meta-analysis of Ginkgo biloba Preparation for the Treatment of Alzheimer’s Disease. Clin. Neuropharmacol. 43, 93–99 (2020).
    OpenUrl
  56. 56.
    Guo, J., Ni, S., Li, Q., Wang, J.-Z. & Yang, Y. Folate/Vitamin B Alleviates Hyperhomocysteinemia-Induced Alzheimer-Like Pathologies in Rat Retina. Neurosci. Bull. 35, 325–335 (2019).
    OpenUrl
  57. 57.
    Anastasiou, C. A., Yannakoulia, M. & Scarmeas, N. Vitamin D and Cognition: An Update of the Current Evidence. J. Alzheimers Dis. 42, S71–S80 (2014).
    OpenUrl
  58. 58.
    Tosatti, J. A. G., Fontes, A. F. D. S., Caramelli, P. & Gomes, K. B. Effects of Resveratrol Supplementation on the Cognitive Function of Patients with Alzheimer’s Disease: A Systematic Review of Randomized Controlled Trials. Drugs Aging 39, 285–295 (2022).
    OpenUrl
  59. 59.
    Luchsinger, J. A., Tang, M.-X., Miller, J., Green, R. & Mayeux, R. Relation of Higher Folate Intake to Lower Risk of Alzheimer Disease in the Elderly. Arch. Neurol. 64, 86 (2007).
    OpenUrlCrossRefPubMedWeb of Science
Back to top
PreviousNext
Posted June 12, 2023.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
From Answers to Insights: Unveiling the Strengths and Limitations of ChatGPT and Biomedical Knowledge Graphs
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
From Answers to Insights: Unveiling the Strengths and Limitations of ChatGPT and Biomedical Knowledge Graphs
Yu Hou, Jeremy Yeung, Hua Xu, Chang Su, Fei Wang, Rui Zhang
medRxiv 2023.06.09.23291208; doi: https://doi.org/10.1101/2023.06.09.23291208
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
From Answers to Insights: Unveiling the Strengths and Limitations of ChatGPT and Biomedical Knowledge Graphs
Yu Hou, Jeremy Yeung, Hua Xu, Chang Su, Fei Wang, Rui Zhang
medRxiv 2023.06.09.23291208; doi: https://doi.org/10.1101/2023.06.09.23291208

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)