Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

PDIVAS: Pathogenicity predictor for Deep-Intronic Variants causing Aberrant Splicing

View ORCID ProfileRyo Kurosawa, View ORCID ProfileKei Iida, Masahiko Ajiro, View ORCID ProfileTomonari Awaya, Mamiko Yamada, View ORCID ProfileKenjiro Kosaki, Masatoshi Hagiwara
doi: https://doi.org/10.1101/2023.03.20.23287464
Ryo Kurosawa
1Department of Anatomy and Developmental Biology, Graduate school of Medicine, Kyoto University, Kyoto, 606-8501, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ryo Kurosawa
  • For correspondence: kurosawa.ryo.43r{at}st.kyoto-u.ac.jp hagiwara.masatoshi.8c{at}kyoto-u.ac.jp
Kei Iida
2Faculty of Science and Engineering, Kindai University, 3-4-1 Kowakae, Higashi-osaka, Osaka 577-8502, Japan
3Medical Research Support Center, Graduate School of Medicine, Kyoto University, Yoshida-Konoe-cho, Sakyo-ku, Kyoto 606-8501, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kei Iida
Masahiko Ajiro
1Department of Anatomy and Developmental Biology, Graduate school of Medicine, Kyoto University, Kyoto, 606-8501, Japan
4Department of Drug Discovery Medicine, Graduate School of Medicine, Kyoto University, Yoshida Konoe-cho, Sakyo-ku, Kyoto 606-8501, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tomonari Awaya
1Department of Anatomy and Developmental Biology, Graduate school of Medicine, Kyoto University, Kyoto, 606-8501, Japan
5Laboratory of Tumor Microenvironment and Immunity, Graduate school of Medicine, Kyoto University, Kyoto, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Tomonari Awaya
Mamiko Yamada
6Center for Medical Genetics, Keio University School of Medicine, Tokyo, 160-8582, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kenjiro Kosaki
6Center for Medical Genetics, Keio University School of Medicine, Tokyo, 160-8582, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kenjiro Kosaki
Masatoshi Hagiwara
1Department of Anatomy and Developmental Biology, Graduate school of Medicine, Kyoto University, Kyoto, 606-8501, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: kurosawa.ryo.43r{at}st.kyoto-u.ac.jp hagiwara.masatoshi.8c{at}kyoto-u.ac.jp
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Deep-intronic variants often cause genetic diseases by altering RNA splicing. However, these pathogenic variants are overlooked in whole-genome sequencing analyses, because they are quite difficult to segregate from a vast number of benign variants (approximately 1,500,000 deep-intronic variants per individual). Therefore, we developed the Pathogenicity predictor for Deep-Intronic Variants causing Aberrant Splicing (PDIVAS), an ensemble machine-learning model combining multiple splicing features and regional splicing constraint metrics. Using PDIVAS, around 27 pathogenic candidates were identified per individual with 95% sensitivity, and causative variants were more efficiently prioritized than previous predictors in simulated patient genome sequences. PDIVAS is available at https://github.com/shiro-kur/PDIVAS.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This study was supported by JSPS KAKENHI Grant Numbers 22J23899 and 19K07367. This study was also supported by AMED under Grant Number JP22gm4010013.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The study used ONLY openly available human data that were originally located at: http://hgdownload.cse.ucsc.edu/gbdb/hg19/1000Genomes/phase3/. and https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh37/variation_genotype/gnomad.genomes.r2.0.1.sites.noVEP.vcf.gz.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • Formats of figures were modified (no change in contents) and more detailed descriptions were added to captions.

Data Availability

The PDIVAS source code, command-line interface, and predictions for all rare deep-intronic SNVs, short insertion, and deletion within genes of Mendelian disease are available at https://github.com/shiro-kur/PDIVAS. ConSplice scores and precomputed scores of ConSpliceML are available at https://home.chpc.utah.edu/~u1138933/ConSplice/. Precomputed scores of CADD-Splice are available at https://krishna.gs.washington.edu/download/CADD/v1.6/GRCh37/. The pathogenic splice-altering variants from HGMD were downloaded from the HGMD website http://www.hgmd.cf.ac.uk/ under the HGMD commercial license. Due to HGMD commercial licensing, we are not allowed to share these variants publicly. 1000 Genomes Project variants are publically available at http://hgdownload.cse.ucsc.edu/gbdb/hg19/1000Genomes/phase3/. gnomAD variants are publically available at https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh37/variation_genotype/gnomad.genomes.r2.0.1.sites.noVEP.vcf.gz. Gene list from OMIM is available to users from academic institutions and non-profit organizations at https://www.omim.org/downloads. Gene lists from CGD are publically available at https://research.nhgri.nih.gov/CGD/download/.

https://github.com/shiro-kur/PDIVAS

  • Abbreviations

    PDIVAS
    Pathogenicity predictor for Deep-Intronic Variants causing Aberrant Splicing;
    SAV
    Splice-altering variant;
    HGMD
    Human Gene Mutation Database;
    SNV
    single nucleotide variant;
    OMIM
    Online Mendelian Inheritance in
    Man
    CGD Clinical Genomic Database;
    VEP
    Variant Effect Predictor;
    WGS
    whole-genome sequencing;
    PR
    Precision and Recall;
    MCC
    Matthews correlation coefficient
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
    Back to top
    PreviousNext
    Posted March 27, 2023.
    Download PDF

    Supplementary Material

    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    PDIVAS: Pathogenicity predictor for Deep-Intronic Variants causing Aberrant Splicing
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    PDIVAS: Pathogenicity predictor for Deep-Intronic Variants causing Aberrant Splicing
    Ryo Kurosawa, Kei Iida, Masahiko Ajiro, Tomonari Awaya, Mamiko Yamada, Kenjiro Kosaki, Masatoshi Hagiwara
    medRxiv 2023.03.20.23287464; doi: https://doi.org/10.1101/2023.03.20.23287464
    Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    PDIVAS: Pathogenicity predictor for Deep-Intronic Variants causing Aberrant Splicing
    Ryo Kurosawa, Kei Iida, Masahiko Ajiro, Tomonari Awaya, Mamiko Yamada, Kenjiro Kosaki, Masatoshi Hagiwara
    medRxiv 2023.03.20.23287464; doi: https://doi.org/10.1101/2023.03.20.23287464

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Genetic and Genomic Medicine
    Subject Areas
    All Articles
    • Addiction Medicine (349)
    • Allergy and Immunology (668)
    • Allergy and Immunology (668)
    • Anesthesia (181)
    • Cardiovascular Medicine (2648)
    • Dentistry and Oral Medicine (316)
    • Dermatology (223)
    • Emergency Medicine (399)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
    • Epidemiology (12228)
    • Forensic Medicine (10)
    • Gastroenterology (759)
    • Genetic and Genomic Medicine (4103)
    • Geriatric Medicine (387)
    • Health Economics (680)
    • Health Informatics (2657)
    • Health Policy (1005)
    • Health Systems and Quality Improvement (985)
    • Hematology (363)
    • HIV/AIDS (851)
    • Infectious Diseases (except HIV/AIDS) (13695)
    • Intensive Care and Critical Care Medicine (797)
    • Medical Education (399)
    • Medical Ethics (109)
    • Nephrology (436)
    • Neurology (3882)
    • Nursing (209)
    • Nutrition (577)
    • Obstetrics and Gynecology (739)
    • Occupational and Environmental Health (695)
    • Oncology (2030)
    • Ophthalmology (585)
    • Orthopedics (240)
    • Otolaryngology (306)
    • Pain Medicine (250)
    • Palliative Medicine (75)
    • Pathology (473)
    • Pediatrics (1115)
    • Pharmacology and Therapeutics (466)
    • Primary Care Research (452)
    • Psychiatry and Clinical Psychology (3432)
    • Public and Global Health (6527)
    • Radiology and Imaging (1403)
    • Rehabilitation Medicine and Physical Therapy (814)
    • Respiratory Medicine (871)
    • Rheumatology (409)
    • Sexual and Reproductive Health (410)
    • Sports Medicine (342)
    • Surgery (448)
    • Toxicology (53)
    • Transplantation (185)
    • Urology (165)