Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Clinical signatures of genetic epilepsy precede diagnosis in electronic medical records of 32,000 individuals

View ORCID ProfilePeter D. Galer, Shridhar Parthasarathy, Julie Xian, Jillian L. McKee, Sarah M. Ruggiero, Shiva Ganesan, David Lewis-Smith, View ORCID ProfileMichael C. Kaufman, Stacey R. Cohen, Scott Haag, Alexander K. Gonzalez, Olivia Wilmarth, Colin A. Ellis, Brian Litt, View ORCID ProfileIngo Helbig
doi: https://doi.org/10.1101/2022.12.08.22283226
Peter D. Galer
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
2Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
4University of Pennsylvania, Center for Neuroengineering and Therapeutics, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Peter D. Galer
Shridhar Parthasarathy
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
2Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Julie Xian
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
2Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jillian L. McKee
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
5Department of Neurology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sarah M. Ruggiero
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shiva Ganesan
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
2Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Lewis-Smith
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
2Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
6Translational and Clinical Research Institute, Newcastle University, UK
7Newcastle Upon Tyne Hospitals NHS Foundation Trust, Newcastle-upon-Tyne, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael C. Kaufman
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
2Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Michael C. Kaufman
Stacey R. Cohen
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Scott Haag
2Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexander K. Gonzalez
2Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Olivia Wilmarth
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Colin A. Ellis
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
5Department of Neurology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Brian Litt
4University of Pennsylvania, Center for Neuroengineering and Therapeutics, Philadelphia, PA, USA
5Department of Neurology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ingo Helbig
1Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
2Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
3The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
5Department of Neurology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ingo Helbig
  • For correspondence: helbigi{at}chop.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

An early genetic diagnosis can guide the time-sensitive treatment and care of individuals with genetic epilepsies. However, identification of a genetic cause often occurs long after onset of these disorders. Here, we aimed to identify early clinical features suggestive of genetic diagnoses in individuals with epilepsy by systematic large-scale analysis of clinical information from full-text patient notes in the electronic medical records (EMR).

From the EMR of 32,112 individuals with childhood epilepsy, we retrieved 4,572,783 clinical notes spanning 203,369 total patient-years. A subcohort of 1,925 individuals had a known or presumed genetic epilepsy with 738 genetic diagnoses spanning 271 genes. We employed a customized natural language processing (NLP) pipeline to extract 89 million time-stamped standardized clinical annotations from free text of the retrieved clinical notes. Our analyses identified 47,641 clinical associations with a genetic cause at distinct ages prior to diagnosis. Notable among these associations were: SCN1A with status epilepticus between 9 and 12 months of age (P<0.0001, 95% CI=8.10-133); STXBP1 with muscular hypotonia between 6 and 9 months (P=3.4×10−4, 95% CI=3.08-102); SCN2A with autism between 1.5 and 1.75 years (P<0.0001, 95% CI=11.1-Inf); DEPDC5 with focal-onset seizure between 5.75 and 6 years (P<0.0001, 95% CI=12.8-Inf); and IQSEC2 with myoclonic seizure between 2.75 and 3 years (P=2.5×10−4, 95% CI=11.3-1.15×104). We also identified associations between clinical terms and gene groups. Variants in ion channel gating mechanisms were associated with myoclonus between 3 and 6 months of age (P<0.0001, 95% CI=5.23-24.2), and variants in calcium channel genes were associated with neurodevelopmental delay between 1.75 and 2 years (P<0.0001, 95% CI=4.8-Inf). Cumulative longitudinal analysis revealed further associations, including KCNT1 with migrating focal seizures from at 0 to 1.75 years (P<0.0001, 95% CI=96.8-4.50×1015). A neurodevelopmental abnormality presenting between 6 and 9 months of age was strongly associated with an individual having any genetic diagnosis (P<0.0001, 95% CI=3.55-7.42). The earliest features associated with genetic diagnosis occurred a median of 3.6 years prior to the median age of diagnosis. Latency to diagnosis was greater in older individuals (P<0.0001) and those who initially underwent less comprehensive genetic testing (P=5.5×10−3, 95% CI=1.23-3.35).

In summary, we identified key clinical features that precede genetic diagnosis, leveraging EMR data at scale from a large cohort of individuals with genetic epilepsies. Our findings demonstrate that automated EMR analysis may assist clinical decision making, leading to earlier diagnosis and more precise prognostication and treatment of genetic epilepsies in the precision medicine era.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

IH is supported by a NINDS K award (K02 NS112600) and the Hartwell Foundation (Individual Biomedical Research Award). BL is supported by National Institute for Neurological Disorders and Stroke (DP1NS122038) and The Jonathan Rothberg Family Fund. DLS is supported by the Wellcome Trust [203914/Z/16/Z]. For the purpose of Open Access, the author has applied a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The IRB of Children's Hospital of Philadelphia gave ethical approval for this work.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

All data produced in the present study are available upon reasonable request to the authors.

https://eig.research.chop.edu/cube3/

  • Abbreviations

    CI
    Confidence Interval
    EGRP
    Epilepsy Genetics Research Project
    EMR
    Electronic Medical Records
    HPO
    Human Phenotype Ontology
    NLP
    Natural Language Processing
    OR
    Odds Ratio
    PELHS
    Pediatric Epilepsy Learning Health System
    PPV
    Positive Predictive Value
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.
    Back to top
    PreviousNext
    Posted December 09, 2022.
    Download PDF

    Supplementary Material

    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Clinical signatures of genetic epilepsy precede diagnosis in electronic medical records of 32,000 individuals
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Clinical signatures of genetic epilepsy precede diagnosis in electronic medical records of 32,000 individuals
    Peter D. Galer, Shridhar Parthasarathy, Julie Xian, Jillian L. McKee, Sarah M. Ruggiero, Shiva Ganesan, David Lewis-Smith, Michael C. Kaufman, Stacey R. Cohen, Scott Haag, Alexander K. Gonzalez, Olivia Wilmarth, Colin A. Ellis, Brian Litt, Ingo Helbig
    medRxiv 2022.12.08.22283226; doi: https://doi.org/10.1101/2022.12.08.22283226
    Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    Clinical signatures of genetic epilepsy precede diagnosis in electronic medical records of 32,000 individuals
    Peter D. Galer, Shridhar Parthasarathy, Julie Xian, Jillian L. McKee, Sarah M. Ruggiero, Shiva Ganesan, David Lewis-Smith, Michael C. Kaufman, Stacey R. Cohen, Scott Haag, Alexander K. Gonzalez, Olivia Wilmarth, Colin A. Ellis, Brian Litt, Ingo Helbig
    medRxiv 2022.12.08.22283226; doi: https://doi.org/10.1101/2022.12.08.22283226

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Genetic and Genomic Medicine
    Subject Areas
    All Articles
    • Addiction Medicine (349)
    • Allergy and Immunology (668)
    • Allergy and Immunology (668)
    • Anesthesia (181)
    • Cardiovascular Medicine (2648)
    • Dentistry and Oral Medicine (316)
    • Dermatology (223)
    • Emergency Medicine (399)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
    • Epidemiology (12228)
    • Forensic Medicine (10)
    • Gastroenterology (759)
    • Genetic and Genomic Medicine (4103)
    • Geriatric Medicine (387)
    • Health Economics (680)
    • Health Informatics (2657)
    • Health Policy (1005)
    • Health Systems and Quality Improvement (985)
    • Hematology (363)
    • HIV/AIDS (851)
    • Infectious Diseases (except HIV/AIDS) (13695)
    • Intensive Care and Critical Care Medicine (797)
    • Medical Education (399)
    • Medical Ethics (109)
    • Nephrology (436)
    • Neurology (3882)
    • Nursing (209)
    • Nutrition (577)
    • Obstetrics and Gynecology (739)
    • Occupational and Environmental Health (695)
    • Oncology (2030)
    • Ophthalmology (585)
    • Orthopedics (240)
    • Otolaryngology (306)
    • Pain Medicine (250)
    • Palliative Medicine (75)
    • Pathology (473)
    • Pediatrics (1115)
    • Pharmacology and Therapeutics (466)
    • Primary Care Research (452)
    • Psychiatry and Clinical Psychology (3432)
    • Public and Global Health (6527)
    • Radiology and Imaging (1403)
    • Rehabilitation Medicine and Physical Therapy (814)
    • Respiratory Medicine (871)
    • Rheumatology (409)
    • Sexual and Reproductive Health (410)
    • Sports Medicine (342)
    • Surgery (448)
    • Toxicology (53)
    • Transplantation (185)
    • Urology (165)