medRxiv

Scoping review of knowledge graph applications in biomedical and healthcare sciences

Sanjay Budhdeo, Joe Zhang, Yusuf Abdulle, Paul M Agapow, Douglas GJ McKechnie, Matt Archer, Viraj Shah, Eugenia Forte, Ayush Noori, Marinka Zitnik, Hutan Ashrafian, Nikhil Sharma
doi: https://doi.org/10.1101/2023.12.13.23299844
Sanjay Budhdeo
1 Department of Clinical and Movement Neurosciences, University College London, London, UK
2 National Hospital for Neurology and Neurosurgery, London, UK
3 Broad Institute of MIT and Harvard, Cambridge, MA, USA
4 The Alan Turing Institute, London, United Kingdom
  • For correspondence: sanjay.budhdeo{at}doctors.org.uk

Joe Zhang
5 Institute for Global Health Innovation, Imperial College London, London, UK
6 Department of Critical Care, Guy’s and St Thomas’ NHS Foundation Trust, London, UK

Yusuf Abdulle
7 Institute of Health Informatics, University College London, London, United Kingdom

Paul M Agapow
8 GSK, Stevenage, UK

Douglas GJ McKechnie
9 UCL Research Department of Primary Care and Population Health, UCL Medical School (Royal Free Campus), London, UK

Matt Archer
10 Oxford University Hospitals NHS Foundation Trust, Oxford, UK

Viraj Shah
11 Imperial College Business School, London, UK

Eugenia Forte
12 GKT School of Medical Education, King’s College London, London, UK

Ayush Noori
13 Harvard College, Cambridge, MA, USA
14 Department of Biomedical Informatics, Harvard University, Boston, MA, USA

Marinka Zitnik
14 Department of Biomedical Informatics, Harvard University, Boston, MA, USA
15 Harvard Data Science Initiative, Cambridge, MA, USA
16 Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA, USA
3 Broad Institute of MIT and Harvard, Cambridge, MA, USA

Hutan Ashrafian
17 Department of Surgery and Cancer, Imperial College London, London, UK
18 Institute of Global Health Innovation, Imperial College London, London, UK

Nikhil Sharma
1 Department of Clinical and Movement Neurosciences, University College London, London, UK
19 BioCorteX Ltd, London, UK

Abstract

Introduction There is increasing use of knowledge graphs within medicine and healthcare, but a comprehensive survey of their applications in biomedical and healthcare sciences is lacking. Our primary aim is to systematically describe knowledge graph use cases, data characteristics, and research attributes in the academic literature. Our secondary objective is to assess the extent of real-world validation of findings from knowledge graph analysis.

Methods We conducted this review in accordance with the PRISMA extension for Scoping Reviews to characterize biomedical and healthcare uses of knowledge graphs. Using keyword-based searches, we identified relevant publications and preprints from the MEDLINE, EMBASE, medRxiv, arXiv, and bioRxiv databases. A final set of 255 articles was included in the analysis.

Results Although medical science insights and drug repurposing are the most common uses, there is a broad range of knowledge graph use cases. General graphs are more common than graphs specific to disease areas. Knowledge graphs are heterogeneous in size, with a median node count of 46 983 (IQR 6 415-460 948) and a median edge count of 906 737 (IQR 66 272-9 894 909). DrugBank is the most frequently used data source, cited in 46 manuscripts. Analysing node and edge classes within the graphs suggests delineation into two broad groups: biomedical and clinical. Querying is the most common analytic technique in the literature; however, more advanced machine learning techniques are also often used.

Discussion The variation in use case and disease area focus identifies areas of opportunity for knowledge graphs. Graph construction and validation methods are diverse. Translation of knowledge graphs into clinical practice remains a challenge. Critically assessing the success of deploying insights derived from graphs will help determine best practice in this area.

Competing Interest Statement

SB owns equity in Owkin Inc.; JZ is employed as a senior informatician by Arcturis Data; HA is the Chief Scientific Officer of the Preemptive Medicine and Health Security initiative at Flagship Pioneering UK; NS is cofounder and the Chief Executive of BioCorteX Inc.

Clinical Protocols

https://osf.io/etg6f

Funding Statement

SB acknowledges funding from the Wellcome Trust (102186/B/13/Z), from the Charlotte and Yule Bogue Research Fellowship in honour of Sir Charles Lovatt Evans and A.J. Clark from University College London, and from The Alan Turing Institute Enrichment Scheme. JZ acknowledges funding from the Wellcome Trust (203928/Z/16/Z) and support from the National Institute for Health Research Biomedical Research Centre based at Imperial College NHS Trust and Imperial College London. DGJM was supported by NIHR as an In-Practice Fellow (NIHR301988). AN gratefully acknowledges the support of the SPARK Fellowship from the Center for Public Service and Engaged Scholarship at Harvard College. MZ gratefully acknowledges the support of the National Institutes of Health under R01HD108794, U.S. Air Force under FA8702-15-D-0001, awards from Harvard Data Science Initiative, Amazon Faculty Research, Google Research Scholar Program, Bayer Early Excellence in Science, AstraZeneca Research, Roche Alliance with Distinguished Scientists, and Kempner Institute for the Study of Natural and Artificial Intelligence.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

All data produced in the present study are available upon reasonable request to the authors.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted December 14, 2023.

Subject Area

  • Health Informatics