Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Evaluation of large language model performance on the Biomedical Language Understanding and Reasoning Benchmark

View ORCID ProfileHui Feng, View ORCID ProfileFrancesco Ronzano, Jude LaFleur, Matthew Garber, Rodrigo de Oliveira, View ORCID ProfileKathryn Rough, Katharine Roth, Jay Nanavati, Khaldoun Zine El Abidine, View ORCID ProfileChristina Mack
doi: https://doi.org/10.1101/2024.05.17.24307411
Hui Feng
1Real world solutions, IQVIA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Hui Feng
  • For correspondence: hui.feng{at}iqvia.com
Francesco Ronzano
1Real world solutions, IQVIA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Francesco Ronzano
Jude LaFleur
1Real world solutions, IQVIA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthew Garber
1Real world solutions, IQVIA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rodrigo de Oliveira
1Real world solutions, IQVIA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kathryn Rough
1Real world solutions, IQVIA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kathryn Rough
Katharine Roth
1Real world solutions, IQVIA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jay Nanavati
1Real world solutions, IQVIA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Khaldoun Zine El Abidine
1Real world solutions, IQVIA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Christina Mack
1Real world solutions, IQVIA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Christina Mack
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Article usage

Article usage: May 2024 to June 2025

AbstractFullPdf
May 202428024149
Jun 202418125133
Jul 202410817104
Aug 20242511889
Sep 202414233101
Oct 2024919183
Nov 202410614139
Dec 20241719143
Jan 202517319153
Feb 202515476158
Mar 2025148122253
Apr 20252171881308
May 2025263202534
Jun 20254429157
Back to top
PreviousNext
Posted May 17, 2024.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Evaluation of large language model performance on the Biomedical Language Understanding and Reasoning Benchmark
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Evaluation of large language model performance on the Biomedical Language Understanding and Reasoning Benchmark
Hui Feng, Francesco Ronzano, Jude LaFleur, Matthew Garber, Rodrigo de Oliveira, Kathryn Rough, Katharine Roth, Jay Nanavati, Khaldoun Zine El Abidine, Christina Mack
medRxiv 2024.05.17.24307411; doi: https://doi.org/10.1101/2024.05.17.24307411
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Evaluation of large language model performance on the Biomedical Language Understanding and Reasoning Benchmark
Hui Feng, Francesco Ronzano, Jude LaFleur, Matthew Garber, Rodrigo de Oliveira, Kathryn Rough, Katharine Roth, Jay Nanavati, Khaldoun Zine El Abidine, Christina Mack
medRxiv 2024.05.17.24307411; doi: https://doi.org/10.1101/2024.05.17.24307411

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)