Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Algorithms for the identification of prevalent diabetes in the All of Us Research Program validated using polygenic scores – a new resource for diabetes precision medicine

View ORCID ProfileLukasz Szczerbinski, View ORCID ProfileRavi Mandla, View ORCID ProfilePhilip Schroeder, Bianca C. Porneala, View ORCID ProfileJosephine H. Li, View ORCID ProfileJose C. Florez, View ORCID ProfileJosep M. Mercader, View ORCID ProfileAlisa K. Manning, View ORCID ProfileMiriam S. Udler
doi: https://doi.org/10.1101/2023.09.05.23295061
Lukasz Szczerbinski
1Department of Endocrinology, Diabetology and Internal Medicine, Medical University of Bialystok, Bialystok, Poland
2Clinical Research Centre, Medical University of Bialystok, Bialystok, Poland
3Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, USA
4Center for Genomic Medicine, Massachusetts General Hospital, Boston, USA
5Diabetes Unit, Department of Medicine, Massachusetts General Hospital, Boston, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lukasz Szczerbinski
Ravi Mandla
3Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, USA
4Center for Genomic Medicine, Massachusetts General Hospital, Boston, USA
5Diabetes Unit, Department of Medicine, Massachusetts General Hospital, Boston, USA
6Cardiology Division, Department of Medicine and Cardiovascular Research Institute, University of California, San Francisco, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ravi Mandla
Philip Schroeder
3Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, USA
4Center for Genomic Medicine, Massachusetts General Hospital, Boston, USA
5Diabetes Unit, Department of Medicine, Massachusetts General Hospital, Boston, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Philip Schroeder
Bianca C. Porneala
7Division of General Internal Medicine, Department of Medicine, Massachusetts General Hospital, Boston, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Josephine H. Li
3Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, USA
4Center for Genomic Medicine, Massachusetts General Hospital, Boston, USA
5Diabetes Unit, Department of Medicine, Massachusetts General Hospital, Boston, USA
8Department of Medicine, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Josephine H. Li
  • For correspondence: mercader{at}broadinstitute.org amanning{at}broadinstitute.org MUDLER{at}mgh.harvard.edu
Jose C. Florez
3Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, USA
4Center for Genomic Medicine, Massachusetts General Hospital, Boston, USA
5Diabetes Unit, Department of Medicine, Massachusetts General Hospital, Boston, USA
8Department of Medicine, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jose C. Florez
Josep M. Mercader
3Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, USA
4Center for Genomic Medicine, Massachusetts General Hospital, Boston, USA
5Diabetes Unit, Department of Medicine, Massachusetts General Hospital, Boston, USA
8Department of Medicine, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Josep M. Mercader
Alisa K. Manning
3Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, USA
4Center for Genomic Medicine, Massachusetts General Hospital, Boston, USA
8Department of Medicine, Harvard Medical School, Boston, MA, USA
9Clinical and Translational Epidemiology Unit, Department of Medicine, Massachusetts General Hospital, Boston, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alisa K. Manning
  • For correspondence: mercader{at}broadinstitute.org amanning{at}broadinstitute.org MUDLER{at}mgh.harvard.edu
Miriam S. Udler
3Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, USA
4Center for Genomic Medicine, Massachusetts General Hospital, Boston, USA
5Diabetes Unit, Department of Medicine, Massachusetts General Hospital, Boston, USA
8Department of Medicine, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Miriam S. Udler
  • For correspondence: mercader{at}broadinstitute.org amanning{at}broadinstitute.org MUDLER{at}mgh.harvard.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

OBJECTIVE The study aimed to develop and validate algorithms for identifying people with type 1 and type 2 diabetes in the All of Us Research Program (AoU) cohort, using electronic health record (EHR) and survey data.

RESEARCH DESIGN AND METHODS Two sets of algorithms were developed, one using only EHR data (EHR), and the other using a combination of EHR and survey data (EHR+). Their performance was evaluated by testing their association with polygenic scores for both type 1 and type 2 diabetes.

RESULTS For type 1 diabetes, the EHR-only algorithm showed a stronger association with T1D polygenic score (p=3×10−5) than the EHR+. For type 2 diabetes, the EHR+ algorithm outperformed both the EHR-only and the existing AoU definition, identifying additional cases (25.79% and 22.57% more, respectively) and showing stronger association with T2D polygenic score (DeLong p=0.03 and 1×10−4, respectively).

CONCLUSIONS We provide new validated definitions of type 1 and type 2 diabetes in AoU, and make them available for researchers. These algorithms, by ensuring consistent diabetes definitions, pave the way for high-quality diabetes research and future clinical discoveries.

Figure
  • Download figure
  • Open in new tab

Why did we undertake this study?This study was conducted to develop and validate algorithms for identifying type 1 and type 2 diabetes cases in the All of Us Research Program (AoU).

What is the specific question(s) we wanted to answer?Can accurate algorithms for type 1 and type 2 diabetes identification be developed and validated using AoU cohort Electronic Health Record (EHR) and survey data? Do the identified diabetes cases show association with polygenic scores in diverse populations?

What did we find?We developed a new validated type 1 diabetes definition and expanded upon the existing type 2 diabetes definition.

What are the implications of our findings?The developed algorithms can be universally implemented in AoU for identifying study participants for well-defined case-control diabetes studies.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

L.S. is supported by funds from the Ministry of Education and Science of Poland within the project ‘Excellence Initiative – Research University’, the Ministry of Health of Poland within the project ‘Center of Artificial Intelligence in Medicine at the Medical University of Bialystok’ and the American Diabetes Association grant 11–22–PDFPM–03. J.H.L. is supported by NIDDK K23 DK131345 and MGH ECOR Fund for Medical Discovery Clinical Research Award. J.C.F. is supported by NHLBI K24 HL157960. J.M.M. is supported by American Diabetes Association Innovative and Clinical Translational Award 1–19–ICTS–068, American Diabetes Association grant #11–22–ICTSPM–16 and by NHGRI U01HG011723. A.K.M. is supported by the Foundation for the National Institutes of Health with funding from AMP CMD RFP 2: GENERATION of New genetic, –omic, or biomarker data for Common Metabolic Diseases titled ‘Common metabolic disease genetic association analysis in the All of Us Research Program’ and by NHGRI U01HG011723. M.S.U. is supported by NIDDK K23DK114551, NIDDK R03DK131249, and Doris Duke Foundation Award 2022063.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

workbench.researchallofus.org

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • ↵# These authors jointly directed this work.

  • Twitter Summary “New study develops and validates type 1 and type 2 diabetes algorithms in the All of Us Research Program cohort, improving case identification for diabetes research. #diabetesresearch #AllOfUsResearchProgram”

Data Availability

All data produced in the present study are available upon reasonable request to the authors

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted September 05, 2023.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Algorithms for the identification of prevalent diabetes in the All of Us Research Program validated using polygenic scores – a new resource for diabetes precision medicine
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Algorithms for the identification of prevalent diabetes in the All of Us Research Program validated using polygenic scores – a new resource for diabetes precision medicine
Lukasz Szczerbinski, Ravi Mandla, Philip Schroeder, Bianca C. Porneala, Josephine H. Li, Jose C. Florez, Josep M. Mercader, Alisa K. Manning, Miriam S. Udler
medRxiv 2023.09.05.23295061; doi: https://doi.org/10.1101/2023.09.05.23295061
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Algorithms for the identification of prevalent diabetes in the All of Us Research Program validated using polygenic scores – a new resource for diabetes precision medicine
Lukasz Szczerbinski, Ravi Mandla, Philip Schroeder, Bianca C. Porneala, Josephine H. Li, Jose C. Florez, Josep M. Mercader, Alisa K. Manning, Miriam S. Udler
medRxiv 2023.09.05.23295061; doi: https://doi.org/10.1101/2023.09.05.23295061

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Endocrinology (including Diabetes Mellitus and Metabolic Disease)
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)