medRxiv

The accuracy of large language models in labelling neurosurgical ‘case-control studies’ and risk of bias assessment: protocol for a study of interrater agreement with human reviewers

Joanne Igoli, Temidayo Osunronbi, Olatomiwa Olukoya, Jeremiah Oluwatomi Itodo Daniel, Hillary Alemenzohu, Alieu Kanu, Alex Mwangi Kihunyu, Ebuka Okeleke, Henry Oyoyo, Oluwatobi Shekoni, Damilola Jesuyajolu, Andrew F Alalade
doi: https://doi.org/10.1101/2024.08.11.24311830
Joanne Igoli
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
2Deanery of Clinical Sciences, The University of Edinburgh, Edinburgh, United Kingdom
  • For correspondence: s1408071{at}sms.ed.ac.uk
Temidayo Osunronbi
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
3Department of Neurosurgery, Salford Royal NHS Foundation Trust, Manchester, United Kingdom
Olatomiwa Olukoya
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
4The National Hospital for Neurology and Neurosurgery, London, United Kingdom
Jeremiah Oluwatomi Itodo Daniel
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
Hillary Alemenzohu
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
Alieu Kanu
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
Alex Mwangi Kihunyu
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
Ebuka Okeleke
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
Henry Oyoyo
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
Oluwatobi Shekoni
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
Damilola Jesuyajolu
1Neurosurgery section, Surgery Interest Group of Africa, Lagos, Nigeria
Andrew F Alalade
5Department of Neurosurgery, Royal Preston Hospital, Lancashire Teaching Hospitals NHS Foundation Trust, Preston, United Kingdom

Abstract

Introduction: Accurate identification of study designs and risk of bias (RoB) assessment are crucial for evidence synthesis in research. However, mislabelling of case-control studies (CCS) is prevalent, leading to a downgraded quality of evidence. Large Language Models (LLMs), a form of artificial intelligence, have shown impressive performance in various medical tasks. Still, their utility in categorising study designs and assessing RoB needs further exploration. This study will evaluate the performance of four publicly available LLMs (ChatGPT-3.5, ChatGPT-4, Claude 3 Sonnet, Claude 3 Opus) in accurately identifying CCS designs from the neurosurgical literature. Secondly, we will assess the human-LLM interrater agreement for RoB assessment of true CCS.

Methods: We identified thirty-four top-ranking neurosurgery-focused journals and searched them on PubMed/MEDLINE for manuscripts reported as CCS in the title/abstract. Human reviewers will independently assess study designs and RoB using the Newcastle-Ottawa Scale. The methods sections/full-text articles will be provided to LLMs to determine study designs and assess RoB. Cohen’s kappa will be used to evaluate human-human, human-LLM and LLM-LLM interrater agreement. Logistic regression will be used to assess study characteristics affecting performance. A p-value < 0.05 at a 95% confidence interval will be considered statistically significant.

Conclusion: If the human-LLM agreement is high, LLMs could become valuable teaching and quality assurance tools for critical appraisal in neurosurgery and other medical fields. This study will contribute to validating LLMs for specialised scientific tasks in evidence synthesis. This could lead to reduced review costs, faster completion, standardisation, and fewer errors in evidence synthesis.

Introduction

Observational studies, including cross-sectional, cohort, and case-control studies, are ideal for neurosurgery research when placebo or no-treatment groups are risky or ethically challenging or when randomised controlled trials are impractical due to logistical complexities or inadequacy for addressing clinical questions [1].

Cross-sectional studies concurrently evaluate exposure and outcome status at a single time point without longitudinal follow-up [2]. Cohort studies divide participants based on exposures or treatments and follow them over a period, either prospectively or retrospectively, to compare outcomes between the groups [2]. Case-control studies (CCS) compare individuals with (case) and without (control) a particular outcome, retrospectively examining differences in exposure risk factors [2].

Unlike other observational study designs, CCS are best suited for investigating rare outcomes or those with long latency periods, leading to their increasing use in neurosurgery [3]. However, they have limitations such as recall bias and the inability to determine incidence and absolute risk or establish temporality [1–3]. Previous research indicates a significant prevalence of misclassified ‘CCS’ in the neurosurgery literature, ranging from 41% to 63% [1–3]. Mislabelling of CCS is not unique to neurosurgery, with reported mislabelling rates of 30% to 97% in other fields [4–6]. Cohort studies are the design most frequently mislabelled as CCS, which downgrades the perceived quality of evidence, since cohort studies represent the highest level of evidence among observational studies [1–3].

Moreover, mislabelled CCS often report odds ratios instead of relative risks, leading to distorted effect size measurements, particularly in systematic reviews and meta-analyses [3]. Hence, accurate labelling of study designs is crucial for stakeholders, including readers, authors, and editors. In addition, assessing the risk of bias (RoB) is critical to systematic reviews. This process involves reviewing and understanding each eligible study, which relies on a solid grasp of study methods and RoB assessment tools. However, RoB assessment is labour-intensive and prone to human error, which may introduce biases in the conclusions of an evidence synthesis [7].

The recent upsurge in excitement about artificial intelligence (AI) has increased its impact on every aspect of healthcare [8]. Large Language Models (LLMs), a subset of AI, are trained on extensive amounts of text data to understand, generate, and process human-like language for various natural language processing tasks [8, 9]. Many healthcare professionals have begun to use LLMs such as ChatGPT and Claude as advanced search tools for complex medical information. These models exhibit emergent properties resembling human-level intelligence and have demonstrated impressive performance on various medical speciality exams, including neurosurgery [10, 11], and have even succeeded in challenging tests like the United States Medical Licensing Examination [9]. Additionally, some machine learning systems, such as the RobotReviewer, have shown high accuracy in evaluating the risk of bias in clinical trials [12]. However, the potential of LLMs, an advanced AI tool, in categorising study designs and assessing RoB in neurosurgery research still needs to be explored. Leveraging LLMs in these tasks may lead to reduced review costs, faster completion times, and decreased errors in the assessment process.

This study aims to evaluate the performance of four publicly available LLMs (ChatGPT-3.5 [OpenAI/Microsoft], ChatGPT-4 [OpenAI/Microsoft], Claude 3 Sonnet [Anthropic], and Claude 3 Opus [Anthropic]) in accurately identifying the design of ‘case-control studies’ in the neurosurgical literature. It also seeks to identify predictive study characteristics that affect LLM performance. Additionally, we will evaluate human-LLM agreement in overall and domain-level risk of bias (RoB) assessment using the Newcastle-Ottawa Scale for CCS [13].

Materials and Methods

Search strategy

No official list of all neurosurgical journals exists, partly because new journals are continually being introduced. We conducted an online Google search using the phrase ‘top neurosurgery journals’ to compile a speciality list of journals for neurosurgeons. This search yielded a Google Scholar [14] and a Welch Medical Library [15] list of neurosurgical journals, which we complemented with additional journals from a previously published article [3]. We excluded journals that have ceased publication and those with a nursing theme. Our search strategy featured 34 PubMed-indexed journals (Appendix 1).

A PubMed/MEDLINE search was performed for all the articles in these thirty-four indexed journals from database inception to 8 June 2024, using the search terms ‘case-control’, ‘case control’, ‘case controlled’, or ‘case-controlled’ in the title or abstract.
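The search described above could be assembled programmatically along the following lines. This is only a minimal sketch: the journal names are placeholders (the actual 34 journals are listed in Appendix 1), the start date standing in for "database inception" is an assumption, and the use of the PubMed field tags `[ta]` (journal), `[tiab]` (title/abstract) and `[dp]` (publication date) is one plausible way to express the strategy, not necessarily the exact string the authors ran.

```python
def build_pubmed_query(journals, terms, end_date, start_date="1800/01/01"):
    """Combine journal filters, title/abstract terms and a date range into one query.

    start_date is an assumed stand-in for "database inception".
    """
    journal_clause = " OR ".join(f'"{name}"[ta]' for name in journals)
    term_clause = " OR ".join(f'"{term}"[tiab]' for term in terms)
    date_clause = f"{start_date}:{end_date}[dp]"
    return f"({journal_clause}) AND ({term_clause}) AND ({date_clause})"


# Placeholder journals for illustration only; see Appendix 1 for the real list.
EXAMPLE_JOURNALS = ["J Neurosurg", "World Neurosurg"]
SEARCH_TERMS = ["case-control", "case control", "case controlled", "case-controlled"]

query = build_pubmed_query(EXAMPLE_JOURNALS, SEARCH_TERMS, "2024/06/08")
print(query)
```

The resulting string can be pasted into the PubMed search box; keeping the terms and journals in lists makes the strategy easy to audit against the appendix.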

The human reviewers

The assessment team will include a consultant/attending neurosurgeon (AFA), two neurosurgical trainees/residents with training in critical appraisal/ postgraduate certificate in health research and statistics/ Masters of Neurosurgery by Research (TO, OO), and nine medical students/ clinicians who will be trained on critical appraisal prior to commencing this study.

Eligibility criteria

Only original research articles reported as ‘case-control’ in the titles or abstracts will be included. Reviews, commentaries, letters, genetic studies, animal studies, and cost-effectiveness studies will be excluded. Similarly, articles will be excluded if they lack the term ‘case-control’/ ‘case control’/ ‘case controlled’/ ‘case-controlled’ in their abstract/title or if this term was used in reference to another study. Studies with ambiguous study design labels in their abstract/ title and/or those that use multiple study designs will be excluded (for example: ‘cross-sectional case-control study’, ‘case-control cohort study’, ‘systematic review/ meta-analysis and case-control study’). In addition, articles that are neurology-focused instead of neurosurgery-focused will be excluded.

The titles/abstracts will be screened independently by pairs of authors using the Rayyan software, with a third author (TO) resolving any discrepancies.

Data extraction

Data extraction from the eligible full texts will be performed by a pair of authors (TO and other authors), with a third author (OO) resolving any discrepancies. The following data will be extracted based on previous related publications [1, 3]:

  • Journal name (the journal results will be presented anonymously in the resulting publication).

  • Year of publication (<2008, 2008 - 2019, >2019). The STROBE statement was published in 2007, and the last publication on the mislabelling of case-control studies in neurosurgery was published in 2019 [2, 3]. This forms the rationale for the year categories.

  • Topic (spine, trauma, vascular, functional/epilepsy, neuro-oncology, paediatrics, skull base, pituitary, hydrocephalus and other).

  • Country of origin (based on where the study took place; the first author’s country will be used when the study location is not specified). Countries will be grouped by the number of case-control studies published (Group A: countries with >10 case-control studies; Group B: countries with 5 to 10 case-control studies; Group C: countries with <5 case-control studies). The countries will also be grouped by continents (Africa, Antarctica, Asia, Australia, Europe, North America, and South America).

  • Presence or acknowledgement of a case-control expert in the study (such as a statistician, epidemiologist, or one with a master’s degree or equivalent in public health).

  • Study design characteristics:

    • Aim of study (a): Outcome assessment.

    • Aim of study (b): Risk factors assessment.

    • Used logistic regression analysis.

    • Reported odds ratio (OR).

    • Used survival analysis/Kaplan-Meier curves.

  • Terminology of the study:

    • The word ‘cohort’ was used in the methods, results, or discussion sections.

    • The word ‘outcome’ was used in the results section.

    • The word ‘prospective’ or ‘prospectively’ was used in the methods section.

    • The word ‘retrospective’ or ‘retrospectively’ was used in the methods section.
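The year and country groupings in the list above can be expressed as simple rules. The sketch below is illustrative only: the function names and the example country counts are hypothetical, and the thresholds are taken directly from the categories stated in the data extraction list.

```python
from collections import Counter


def year_category(year):
    """Map a publication year to the protocol's three year categories."""
    if year < 2008:
        return "<2008"
    if year <= 2019:
        return "2008-2019"
    return ">2019"


def country_group(n_studies):
    """Group A: >10 studies; Group B: 5 to 10 studies; Group C: <5 studies."""
    if n_studies > 10:
        return "A"
    if n_studies >= 5:
        return "B"
    return "C"


def group_countries(countries):
    """Count studies per country, then assign each country to group A, B or C."""
    counts = Counter(countries)
    return {country: country_group(n) for country, n in counts.items()}


# Made-up example data, not results from the study.
example = ["UK"] * 11 + ["Nigeria"] * 5 + ["Kenya"]
print(group_countries(example))
```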

Assessment of study design and risk of bias by human reviewers

The assessment of the study design of the eligible full-text articles will be performed by a pair of authors (TO and other authors), with a third author (OO) resolving any discrepancies. The human assessors will classify the studies as ‘true case-control studies’ or ‘non-case-control studies’. A study will be deemed a true case-control study if it comprises three fundamental elements [1]: 1) it compares a group of patients with a disease or who have experienced an event with a control group lacking the disease or event; 2) it makes a retrospective evaluation from the time point of a known outcome; and 3) it focuses on identifying risk factors/associations/causality of the disease or event. The design of the ‘non-case-control studies’ will be specified as prospective cohort study, retrospective cohort study, cross-sectional study, case series, case report, randomised clinical trial, or other.

The Newcastle-Ottawa Scale (NOS) will be used to evaluate the risk of bias (RoB) in the true case-control studies [13]. The true case-control studies will be divided into five groups, and the RoB assessment will be performed by a pair of authors, with a third author (TO or OO) adjudicating any discrepancies. Studies with NOS scores of 0-3, 4-5, 6-7, and 8-9 will be considered unsatisfactory, satisfactory, good, and very good quality, respectively.
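The NOS quality bands stated above map directly onto a small lookup. The following is a minimal sketch with a hypothetical function name; the cut-offs are those given in the text.

```python
def nos_quality(score):
    """Return the quality band for an overall Newcastle-Ottawa Scale score (0-9)."""
    if not 0 <= score <= 9:
        raise ValueError("NOS overall score must be between 0 and 9")
    if score <= 3:
        return "unsatisfactory"
    if score <= 5:
        return "satisfactory"
    if score <= 7:
        return "good"
    return "very good"


for s in (3, 5, 7, 9):
    print(s, nos_quality(s))
```

Converting scores to bands in one place ensures that human and LLM assessments are compared on identical categories.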

Assessment of study design and risk of bias by LLMs

For each eligible article obtained from the abstract/title screening, the methods section will be copied and entered separately into each LLM (ChatGPT-3.5 [OpenAI/Microsoft], ChatGPT-4 [OpenAI/Microsoft], Claude 3 Sonnet [Anthropic], and Claude 3 Opus [Anthropic]), and each LLM will be prompted with this question: ‘Some authors may or may not correctly label their study design. Using the hierarchy of evidence, with a rationale, what is the actual specific study design in the text below?’ To facilitate the assessment of the LLM-LLM intrarater agreement, we will obtain LLM assessments in duplicate, i.e., two authors (TO and OO) will each use the LLMs independently to assess study design.

Subsequently, we will evaluate the LLMs’ RoB assessment for the author-labelled true CCS. The PDF files of the eligible papers will be uploaded separately as attachments into each LLM. The LLMs will be prompted with this question: ‘Given that studies with an overall Newcastle-Ottawa scale (NOS) scores of 0-3, 4-5, 6-7, and 8-9 are considered unsatisfactory, satisfactory, good, and very good quality, respectively, provide a domain-level and overall risk of bias assessment for the following study using the Newcastle-Ottawa scale for case-control studies.’ If we are unable to attach the PDF file or the LLM is unable to read the PDF file, we will copy the methods text and the patients/participants characteristics/demographics subsection of the results and enter these into the LLM. To facilitate the assessment of the LLM-LLM intrarater agreement, we will obtain LLM RoB assessments in duplicate, i.e., two authors (TO and OO) will each use the LLMs independently for the RoB assessment.
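To keep the wording identical across models and across the two assessors, the two prompts could be stored as fixed templates. The sketch below is an illustration only: the protocol describes entering text into the LLM interfaces by hand, so the helper `build_prompt` and the way the article text is appended after a blank line are assumptions, not part of the stated method. The prompt strings are reproduced verbatim from the text above.

```python
STUDY_DESIGN_PROMPT = (
    "Some authors may or may not correctly label their study design. "
    "Using the hierarchy of evidence, with a rationale, what is the actual "
    "specific study design in the text below?"
)

ROB_PROMPT = (
    "Given that studies with an overall Newcastle-Ottawa scale (NOS) scores of "
    "0-3, 4-5, 6-7, and 8-9 are considered unsatisfactory, satisfactory, good, "
    "and very good quality, respectively, provide a domain-level and overall "
    "risk of bias assessment for the following study using the "
    "Newcastle-Ottawa scale for case-control studies."
)


def build_prompt(question, article_text):
    """Append the copied article text to a fixed question (assumed layout)."""
    return f"{question}\n\n{article_text.strip()}"


# Hypothetical methods text, for illustration only.
print(build_prompt(STUDY_DESIGN_PROMPT, "We retrospectively reviewed ..."))
```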

Statistical analysis and reporting

Statistical analyses will be conducted on IBM SPSS Statistics 27 (Windows).

LLM-LLM and human-human interrater reliability for the study design and RoB assessments will be assessed using Cohen’s kappa (κ) for categorical data. In the event of LLM-LLM (for example, ChatGPT-3.5 - ChatGPT-3.5) discrepancies, we will reduce the duplicate assessments to a single assessment for each study by randomly choosing one of the assessments for each study.
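The reduction of duplicate LLM assessments to a single assessment per study could look like the sketch below. The function name and the fixed seed are assumptions added here for reproducibility; the protocol only states that one of the two assessments will be chosen at random when they disagree.

```python
import random


def collapse_duplicates(first, second, seed=0):
    """Reduce two runs of the same LLM to one label per study.

    Where the runs agree the shared label is kept; where they differ,
    one of the two labels is picked at random (seeded for reproducibility).
    """
    if len(first) != len(second):
        raise ValueError("both runs must cover the same studies")
    rng = random.Random(seed)
    return [a if a == b else rng.choice((a, b)) for a, b in zip(first, second)]


# Made-up labels from two independent runs of one model.
run_1 = ["case-control", "retrospective cohort", "case-control"]
run_2 = ["case-control", "case-control", "retrospective cohort"]
print(collapse_duplicates(run_1, run_2, seed=42))
```

Recording the seed alongside the results would let a reader regenerate exactly the same single-assessment dataset.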

We will calculate the proportion of articles labelled as ‘case-control’ in the title/abstract that are true case-control studies as identified by human reviewers. Furthermore, using the study design determined by human reviewers in this study as a reference, we will calculate the proportion of study designs correctly labelled by each LLM. Subsequently, LLM-human interrater reliability for the study design and RoB assessments will be assessed using Cohen’s kappa (κ) for categorical data. Kappa values will be interpreted as follows: values ≤ 0 (no agreement), 0.01–0.20 (slight agreement), 0.21–0.40 (fair agreement), 0.41–0.60 (moderate agreement), 0.61–0.80 (substantial agreement), and 0.81–1.00 (almost perfect agreement) [16].
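For readers unfamiliar with the statistic, Cohen’s kappa compares the observed agreement p_o with the agreement p_e expected by chance from each rater’s label frequencies: κ = (p_o − p_e) / (1 − p_e). The sketch below is a plain-Python illustration and not the SPSS procedure the study will use; the function names and example labels are hypothetical, and values falling between the published bands (for example 0.205) are assigned by upper bound, which is an assumption.

```python
import math
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two equal-length lists of categorical labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("raters must label the same non-empty set of studies")
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / n**2
    if expected == 1:
        return math.nan  # both raters used one identical category: kappa is undefined
    return (observed - expected) / (1 - expected)


def interpret_kappa(kappa):
    """Agreement bands from the text [16]; gaps between bands resolved by upper bound."""
    if kappa <= 0:
        return "no agreement"
    for upper, label in ((0.20, "slight"), (0.40, "fair"), (0.60, "moderate"), (0.80, "substantial")):
        if kappa <= upper:
            return f"{label} agreement"
    return "almost perfect agreement"


# Made-up labels: human reference versus one LLM.
human = ["CCS", "CCS", "cohort", "cohort"]
llm = ["CCS", "cohort", "cohort", "cohort"]
k = cohen_kappa(human, llm)
print(round(k, 2), interpret_kappa(k))
```

In this made-up example the raters agree on 3 of 4 studies (p_o = 0.75) while chance agreement is p_e = 0.5, giving κ = 0.5.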

Simple logistic regression analyses will be conducted to assess the associations between select study characteristics and whether a study was a true case-control (yes/no). These analyses will also be conducted for each LLM to assess the association between the select study characteristics and the accurate labelling of study designs by the LLM (yes/no). A p-value < 0.05 at a 95% confidence interval will be considered statistically significant.
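When the study characteristic is binary (for example, whether the word ‘cohort’ appears in the methods), a simple logistic regression reduces to a 2×2 table: the fitted odds ratio equals ad/bc and its Wald standard error on the log scale is √(1/a + 1/b + 1/c + 1/d). The sketch below illustrates this special case in plain Python; it is not the SPSS analysis the study will run, the function name is hypothetical, and the counts are made up.

```python
import math


def odds_ratio_wald(a, b, c, d):
    """Odds ratio, 95% Wald CI and two-sided Wald p-value from a 2x2 table.

    a: characteristic present, outcome yes   b: characteristic present, outcome no
    c: characteristic absent, outcome yes    d: characteristic absent, outcome no
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for this estimate")
    log_or = math.log((a * d) / (b * c))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    z = log_or / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    ci95 = (math.exp(log_or - 1.96 * se), math.exp(log_or + 1.96 * se))
    return {"odds_ratio": math.exp(log_or), "ci95": ci95, "p_value": p_value}


# Made-up counts: outcome = "LLM labelled the design correctly".
result = odds_ratio_wald(20, 10, 10, 20)
print(result)
print("significant at 0.05:", result["p_value"] < 0.05)
```

Predictors with more than two levels, such as topic or year category, need a full model fit and are outside the scope of this shortcut.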

Discussion

To our knowledge, this study will be the first to evaluate interrater agreement between human reviewers and LLMs in labelling study designs and assessing RoB in neurosurgical case-control studies.

If the human-LLM interrater agreement is almost perfect, then LLMs could become valuable tools for teaching and quality assurance in critical appraisal and identifying study designs in neurosurgery and other fields. This study is expected to make a significant early contribution to the research exploring the utilisation and validation of general-purpose LLMs trained on vast internet data for specialised scientific tasks. It is anticipated that this study will mark the beginning of a series focused on employing LLMs in evidence synthesis. The investigation into the application of LLMs, particularly for systematic reviews, is poised to bring about significant changes in how evidence synthesis tasks are conducted, who undertakes them, the speed and cost of completion, and the way primary studies are conducted and reported to enhance comprehensibility for artificial intelligence.

Limitations

This study will not include some non-neurosurgical-specific journals where the neurosurgical community may choose to publish. Thus, the representativeness of the selected articles as a sample of all neurosurgical case-control studies can be questioned. Based on our exclusion criteria, articles lacking explicit mention of “case control” in title or abstract will be excluded. However, the improper use of the term “case-control” might be more prevalent in these articles and missed in our search. Though unlikely, reverse mislabelling could occur, where true case-control studies may not have been labelled as such and thus missed in our search.

To evaluate LLMs’ ability in RoB assessment, we will provide only the methods and results section or full articles (where possible) to the LLMs. Human reviewers will have access to the entire text and supplementary materials where available, providing them with more information about each study than LLMs. As a result, the human-LLM interrater agreements we estimate are expected to be conservative estimates of what is achievable.

Data Availability

No datasets were generated or analysed during the current study. All relevant data from this study will be made available upon study completion.

Declarations

Authors’ contributions

Joanne Igoli, Temidayo Osunronbi, Olatomiwa Olukoya, Damilola Jesuyajolu, and Andrew F Alalade contributed to the study’s conception and design. The first draft of the paper was written by Temidayo Osunronbi, and all authors commented on the subsequent versions of the manuscript. All authors read and approved the final manuscript.

References

  1. Nesvick CL, Thompson CJ, Boop FA, Klimo P Jr. Case-control studies in neurosurgery. J Neurosurg. 2014;121(2):285–296. doi:10.3171/2014.5.JNS132329
  2. Kicielinski KP, Dupépé EB, Gordon AS, Mayo NE, Walters BC. What Isn’t a Case-Control Study? Neurosurgery. 2019;84(5):993–999. doi:10.1093/neuros/nyy591
  3. Esene IN, Mbuagbaw L, Dechambenoit G, Reda W, Kalangu KK. Misclassification of Case-Control Studies in Neurosurgery and Proposed Solutions. World Neurosurg. 2018;112:233–242. doi:10.1016/j.wneu.2018.01.171
  4. Grimes DA. “Case-control” confusion: mislabeled reports in obstetrics and gynecology journals. Obstet Gynecol. 2009;114(6):1284–1286. doi:10.1097/AOG.0b013e3181c03421
  5. Mayo NE, Goldberg MS. When is a case-control study a case-control study? J Rehabil Med. 2009;41(4):217–222. doi:10.2340/16501977-0341
  6. Mihailovic A, Bell CM, Urbach DR. Users’ guide to the surgical literature. Case-control studies in surgical journals. Can J Surg. 2005;48(2):148–151.
  7. Könsgen N, Barcot O, Heß S, et al. Inter-review agreement of risk-of-bias judgments varied in Cochrane reviews. J Clin Epidemiol. 2020;120:25–32. doi:10.1016/j.jclinepi.2019.12.016
  8. Sallam M. ChatGPT Utility in Healthcare Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns. Healthcare (Basel). 2023;11(6):887. doi:10.3390/healthcare11060887
  9. Kung TH, Cheatham M, Medenilla A, et al. Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models. PLOS Digit Health. 2023;2(2):e0000198. doi:10.1371/journal.pdig.0000198
  10. Hoch CC, Wollenberg B, Lüers JC, et al. ChatGPT’s quiz skills in different otolaryngology subspecialties: an analysis of 2576 single-choice and multiple-choice board certification preparation questions. Eur Arch Otorhinolaryngol. 2023;280(9):4271–4278. doi:10.1007/s00405-023-08051-4
  11. Ali R, Tang OY, Connolly ID, et al. Performance of ChatGPT, GPT-4, and Google Bard on a Neurosurgery Oral Boards Preparation Question Bank. Neurosurgery. 2023;93(5):1090–1098. doi:10.1227/neu.0000000000002551
  12. Marshall IJ, Kuiper J, Wallace BC. RobotReviewer: evaluation of a system for automatically assessing bias in clinical trials. J Am Med Inform Assoc. 2016;23(1):193–201. doi:10.1093/jamia/ocv044
  13. Wells G, Shea B, O’Connell D, Peterson J. The Newcastle-Ottawa Scale (NOS) for assessing the quality of nonrandomised studies in meta-analyses. Ottawa, ON: Ottawa Hospital Research Institute. https://www.ohri.ca/programs/clinical_epidemiology/oxford.asp [Accessed on 30 April 2023]
  14. Google Scholar. Top journals in neurosurgery. https://scholar.google.co.uk/citations?view_op=top_venues&hl=en&vq=med_neurosurgery [Accessed on 30 April 2024]
  15. Welch Medical Library. Journals by Subject: Neurosurgery. https://welch.jhmi.edu/journalsbysubject?s=Neurosurgery [Accessed on 30 April 2024]
  16. McHugh ML. Interrater reliability: the kappa statistic. Biochem Med (Zagreb). 2012;22(3):276–282.
Posted August 12, 2024.

Subject Area

  • Surgery