Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

A Short Plus Long-Amplicon Based Sequencing Approach Improves Genomic Coverage and Variant Detection In the SARS-CoV-2 Genome

Carlos Arana, Chaoying Liang, Matthew Brock, Bo Zhang, Jinchun Zhou, Li Chen, Brandi Cantarel, Jeffrey SoRelle, Lora V. Hooper, Prithvi Raj
doi: https://doi.org/10.1101/2021.06.16.21259029
Carlos Arana
1Department of Immunology, Microbiome and Genomics core, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chaoying Liang
1Department of Immunology, Microbiome and Genomics core, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthew Brock
1Department of Immunology, Microbiome and Genomics core, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bo Zhang
1Department of Immunology, Microbiome and Genomics core, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jinchun Zhou
1Department of Immunology, Microbiome and Genomics core, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Li Chen
2Department of Pathology, Microbiome and Genomics core, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Brandi Cantarel
3Department of Bioinformatics, Microbiome and Genomics core, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jeffrey SoRelle
2Department of Pathology, Microbiome and Genomics core, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lora V. Hooper
1Department of Immunology, Microbiome and Genomics core, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
4Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX 75390
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Prithvi Raj
1Department of Immunology, Microbiome and Genomics core, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: prithvi.raj{at}utsouthwestern.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

High viral transmission in the COVID-19 pandemic has enabled SARS-CoV-2 to acquire new mutations that impact genome sequencing methods. The ARTIC.v3 primer pool that amplifies short amplicons in a multiplex-PCR reaction is one of the most widely used methods for sequencing the SARS-CoV-2 genome. We observed that some genomic intervals are poorly captured with ARTIC primers. To improve the genomic coverage and variant detection across these intervals, we designed long amplicon primers and evaluated the performance of a short (ARTIC) plus long amplicon (MRL) sequencing approach. Sequencing assays were optimized on VR-1986D-ATCC RNA followed by sequencing of nasopharyngeal swab specimens from five COVID-19 positive patients. ARTIC data covered >90% of the virus genome fraction in the positive control and four of the five patient samples. Variant analysis in the ARTIC data detected 67 mutations, including 66 single nucleotide variants (SNVs) and one deletion in ORF10. Of 66 SNVs, five were present in the spike gene, including nt22093 (M177I), nt23042 (S494P), nt23403 (D614G), nt23604 (P681H), and nt23709 (T716I). The D614G mutation is a common variant that has been shown to alter the fitness of SARS-CoV-2. Two spike protein mutations, P681H and T716I, which are represented in the B.1.1.7 lineage of SARS-CoV-2, were also detected in one patient. Long-amplicon data detected 58 variants, of which 70% were concordant with ARTIC data. Combined analysis of ARTIC +MRL data revealed 22 mutations that were either ambiguous (17) or not called at all (5) in ARTIC data due to poor sequencing coverage. For example, a common mutation in the ORF3a gene at nt25907 (G172V) was missed by the ARTIC assay. Hybrid data analysis improved sequencing coverage overall and identified 59 high confidence mutations for phylogenetic analysis. Thus, we show that while the short amplicon (ARTIC) assay provides good genomic coverage with high throughput, complementation of poorly captured intervals with long amplicon data can significantly improve SARS-CoV-2 genomic coverage and variant detection.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

No specific funding for the present study.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Review waived by UT Southwestern Institutional Review Board as analyzed specimens were de-identified and were residual material.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Sequencing data (FASTQ files) from the present study have been deposited in the NCBI SRA database with accession ID PRJNA729878 for public access.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted June 20, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
A Short Plus Long-Amplicon Based Sequencing Approach Improves Genomic Coverage and Variant Detection In the SARS-CoV-2 Genome
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
A Short Plus Long-Amplicon Based Sequencing Approach Improves Genomic Coverage and Variant Detection In the SARS-CoV-2 Genome
Carlos Arana, Chaoying Liang, Matthew Brock, Bo Zhang, Jinchun Zhou, Li Chen, Brandi Cantarel, Jeffrey SoRelle, Lora V. Hooper, Prithvi Raj
medRxiv 2021.06.16.21259029; doi: https://doi.org/10.1101/2021.06.16.21259029
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
A Short Plus Long-Amplicon Based Sequencing Approach Improves Genomic Coverage and Variant Detection In the SARS-CoV-2 Genome
Carlos Arana, Chaoying Liang, Matthew Brock, Bo Zhang, Jinchun Zhou, Li Chen, Brandi Cantarel, Jeffrey SoRelle, Lora V. Hooper, Prithvi Raj
medRxiv 2021.06.16.21259029; doi: https://doi.org/10.1101/2021.06.16.21259029

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)