Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Evidence for SARS-CoV-2 Delta and Omicron co-infections and recombination

View ORCID ProfileAlexandre Bolze, Tracy Basler, Simon White, Andrew Dei Rossi, View ORCID ProfileDana Wyman, View ORCID ProfilePavitra Roychoudhury, View ORCID ProfileAlexander L. Greninger, Kathleen Hayashibara, Mark Beatty, Seema Shah, Sarah Stous, Eric Kil, Hang Dai, Tyler Cassens, Kevin Tsan, Jason Nguyen, Jimmy Ramirez, Scotty Carter, View ORCID ProfileElizabeth T. Cirulli, View ORCID ProfileKelly Schiabor Barrett, View ORCID ProfileNicole L. Washington, Pedro Belda-Ferre, Sharoni Jacobs, Efren Sandoval, David Becker, View ORCID ProfileJames T. Lu, Magnus Isaksson, View ORCID ProfileWilliam Lee, View ORCID ProfileShishi Luo
doi: https://doi.org/10.1101/2022.03.09.22272113
Alexandre Bolze
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alexandre Bolze
Tracy Basler
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Simon White
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andrew Dei Rossi
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dana Wyman
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Dana Wyman
Pavitra Roychoudhury
2Department of Laboratory Medicine and Pathology, University of Washington Medical Center, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Pavitra Roychoudhury
Alexander L. Greninger
2Department of Laboratory Medicine and Pathology, University of Washington Medical Center, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alexander L. Greninger
Kathleen Hayashibara
3Thermo Fisher Scientific, South San Francisco, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mark Beatty
4County of San Diego Health and Human Services, San Diego, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Seema Shah
4County of San Diego Health and Human Services, San Diego, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sarah Stous
4County of San Diego Health and Human Services, San Diego, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Eric Kil
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hang Dai
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tyler Cassens
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kevin Tsan
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jason Nguyen
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jimmy Ramirez
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Scotty Carter
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Elizabeth T. Cirulli
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Elizabeth T. Cirulli
Kelly Schiabor Barrett
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kelly Schiabor Barrett
Nicole L. Washington
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nicole L. Washington
Pedro Belda-Ferre
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sharoni Jacobs
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Efren Sandoval
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Becker
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
James T. Lu
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for James T. Lu
Magnus Isaksson
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
William Lee
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for William Lee
Shishi Luo
1Helix, San Mateo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Shishi Luo
  • For correspondence: shishi.luo{at}helix.com
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Between November 2021 and February 2022, SARS-CoV-2 Delta and Omicron variants co-circulated in the United States, allowing for co-infections and possible recombination events. We sequenced 29,719 positive samples during this period and analyzed the presence and fraction of reads supporting mutations specific to either the Delta or Omicron variant. We identified 18 co-infections, one of which displayed evidence of a low Delta-Omicron recombinant viral population. We also identified two independent cases of infection by a Delta-Omicron recombinant virus, where 100% of the viral RNA came from one clonal recombinant. In the three cases, the 5’-end of the viral genome was from the Delta genome, and the 3’-end from Omicron including the majority of the spike protein gene, though the breakpoints were different. Delta-Omicron recombinant viruses were rare, and there is currently no evidence that Delta-Omicron recombinant viruses are more transmissible between hosts compared to the circulating Omicron lineages.

Introduction

Recombination is one way a virus can evolve to acquire a new combination of mutations. Recombinations have played an important role in the evolution of the RNA viruses HIV-1 and SARS-CoV-2 (Fischer et al., 2021; Jackson et al., 2021). In humans, the analysis of the first 87,695 SARS-CoV-2 genomes shared on GISAID in 2021 identified 225 sequences of likely recombinant origins (Varabyou et al., 2021). However, tracking recombinations for SARS-CoV-2 remains challenging because of the relatively low diversity of the genomes. Moreover, without the underlying sequencing data or orthogonal confirmation, it is difficult to determine whether recombinant sequences are real or due to contamination, technical artifacts or naturally occuring mutations shared by multiple variants.

The Omicron variant, first detected by scientists in Botswana and South Africa in early November 2021 (Viana et al., 2022), has rapidly spread across the globe. The variant includes the three Pango lineages BA.1 (Nextstrain clade 21K), BA.2 (Nextstrain clade 21L) and BA.3, and is characterized by a large number of mutations in the spike protein (Viana et al., 2022). In the United States, the first Omicron case was reported on December 1 (Vaziri and Allday, 2021), when the Delta variant was dominant (Bolze et al., 2021; Earnest et al., 2021). We hypothesized that during the subsequent period when Delta and Omicron co-circulated, some cases of co-infections would occur and potentially result in the emergence of a new SARS-CoV-2 variant resulting from the recombination of a Delta variant and an Omicron variant. This, in turn, would result in a new combination of mutations with unknown properties.

The aims of this study were (i) to look for cases co-infected with Delta and Omicron variants, and (ii) to study SARS-CoV-2 recombination in humans by analyzing co-infection samples, as well as looking for clonal infections caused by a Delta and Omicron recombinant.

Results

Co-circulation of Delta and Omicron variants in the United States

We sequenced and assigned a lineage to 29,719 samples positive for SARS-CoV-2 collected by the Helix laboratory across the United States (Table S1) between 22 November 2021 and 13 February 2022. These samples came from anterior nasal swabs of different individuals, with one viral sequence assay performed per person infected (similar to a cross-sectional analysis). The large majority of samples were collected at a national retail pharmacy. Samples from San Diego County were collected as part of community testing organized by San Diego County. The individuals tested represented a diverse range of race and age groupings (details in Table S1). We observed that the Omicron variant quickly grew to explain >99% of cases as of the week of January 17 (Figure S1, Table S2). Delta and Omicron variants therefore co-circulated (each representing >1% of infections) from 6 December 2021 to 16 January 2022, represented by 14,214 sequences in our dataset (Figure S1, Table S2). During that time, the overall number of cases in the United States remained high, above 150,000 new cases per day and above a 7-day case rate of 250 per 100,000 individuals (CDC, 2020). The possibility of a co-infection by two distinct variants was therefore also high during this period.

Co-infection with Delta and Omicron variants

When a person is infected by two distinct variants/viruses, multiple copies of the full genome of each variant are present in the sample. A fraction (x%) of the total extracted SARS-CoV-2 RNA will come from variant A, and the remaining fraction (100-x%) of the RNA will come from variant B. Sequencing at a high enough coverage will lead to calling mutations that define both variant A and variant B, but each mutation will only be supported by a fraction of the reads overlapping the given position; the mutations specific to variant A should be called with ∼x% of the reads overlapping the position, whereas the mutations specific to variant B should be called with (100-x)% (Figure 1A). In order to identify this co-infection signature, we selected a list of mutations specific to the Delta variant and a list of mutations specific to the Omicron variant (Table S3). All mutations selected had a call (not an “N”) in >95% of the samples between November 2021 and February 2022 (Table S3).

Figure 1:
  • Download figure
  • Open in new tab
Figure 1: Co-infection with Delta and Omicron variants.

(A) Schematic of a co-infection and the impact on the sequencing output. (B) Three example graphs representing the alternate allele fraction for each mutation. Forty-seven mutations are plotted in order of their position on the SARS-CoV-2 genome from 5’ to 3’. Genes are separated by dashed vertical lines. Sixteen mutations specific to Delta are represented in orange. Nineteen mutations specific to Omicron and shared by all its sub-lineages are represented in blue. Five mutations specific to BA.1 (and BA.1.1) are in light blue. The defining BA.1.1 mutation is in black and 6 mutations specific to BA.2 are in magenta. The graphs for all 18 co-infections are in Figure S2. (C) The difference of Delta fraction and Omicron fraction is plotted for the initial sample and the replicate sequenced after a new RNA extraction. Dashed lines connect identical samples. Orange dots represent samples with a majority of Delta. Blue dots represent samples with a majority of Omicron. (D) The difference of Delta fraction and Omicron fraction for the initial sample is plotted against the collection month of the sample. Orange dots represent samples with a majority of Delta. Blue dots represent samples with a majority of Omicron.

We identified 21 samples that were very likely to be co-infected with Delta and Omicron variants by filtering for samples where the median alternate allele fraction is less than 0.85 (see Methods). For 19 samples (two samples were unable to be re-tested), we validated the results by RNA re-extraction and re-sequencing. For a few samples, we also validated the results using an orthogonal genotyping assay (see Methods). The results replicated in 18 of the 19 samples (Figure 1B, Figure S2, Table 1), including two (HMIX1 and HMIX2) that have already been reported in a separate study (currently under review). A detailed list of the allele depths for each mutation for all of these samples is in Table S4. Overall, we estimate that on average ∼1 in 800 (18 / 14,214; 95% CI: 1/540 to 1/1,470, assuming a binomial distribution) positive samples between 6 December 2021 and 16 January 2022 had a co-infection.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1: Samples with co-infections and Delta-Omicron infections

Given how quickly Omicron displaced Delta, we hypothesized that in cases of co-infections we would see on average more Omicron virions compared to Delta. Using the fraction of sequencing reads that mapped to mutations in either Delta or Omicron as a proxy, the fraction of Delta and Omicron virions in a given sample appeared similar (between 40 and 60%) in 8 out of 18 co-infections (Figure 1B, Figure S2, Table S5). The Delta variant was higher than the Omicron variant in five co-infection samples, while the Omicron variant was higher than the Delta variant in the remaining five (Figure 1B, Figure S2, Table S5). The fraction of Delta and Omicron virions in each sample were similar in the replicates (Figure 1C) despite re-extraction. These results did not support the hypothesis that the Omicron variant would outcompete the Delta variant when in the same host. There was also no correlation between the period of co-infection and the dominant variant (Figure 1D), or between the viral load in the nose and the dominant variant (Figure S3). We were not able to test the hypothesis that vaccination or prior infection by SARS-CoV-2 would better control Delta and lead to a higher fraction of Omicron in these co-infected samples. Our analysis is also limited by the fact that we do not have information whether the exposure and seed infection by the two distinct variants happened at the same time, or if they followed each other.

Within-host recombination of Delta and Omicron genomes

We hypothesized that a subset of host cells in a co-infection would inevitably contain both variants, and therefore have the potential to generate recombinants. If these recombinants were replication competent and replicated enough, then we would detect them in sequencing output, manifesting as a change in allele fraction of defining mutations near the recombination breakpoint.

Indeed, we find that HMIX16 (Figure 2A) exhibits precisely this characteristic. Alternative allele fractions for Delta mutations hover around 0.80 near the 5’ end of the genome, but drop to around 0.50 near the beginning of the S gene and remain at this level until the 3’ end of the genome. This profile suggests the presence of a Delta-Omicron recombinant with a breakpoint preceding the S:214EPEins. Upon examination of read-pairs sequenced from HMIX16 that spanned mutations unique for Delta and Omicron upstream of S:214EPEins, we found 4 read-pairs that supported a Delta-Omicron recombinant, 7 read-pairs that supported Delta only, and 10 that supported Omicron only (Figure 2B). The read pairs that supported a Delta-Omicron recombinant comprise the S:156/157del mutation of Delta on the 5’ end, and the S:212del of Omicron on the 3’ end. The existence of these three unique mutation profiles presents compelling evidence that a recombinant virus was generated during co-infection with a breakpoint region of 157 base pairs between positions 22,036 and position 22,193. We did not find read-pairs supporting Delta-Omicron recombination in the same interval in the other co-infection samples showing that these recombinations remain a rare event.

Figure 2:
  • Download figure
  • Open in new tab
Figure 2: Evidence of Delta, Omicron, and recombinant read pairs in HMIX16.

(A) Graph representing the Alternate Allele Fraction for each mutation of sample HMIX16. Forty-seven mutations are plotted in order of their position on the SARS-CoV-2 genome from 5’ to 3’. Genes are separated by dashed vertical lines. Sixteen mutations specific to Delta are represented in orange. Nineteen mutations specific to Omicron and shared by all its sub-lineages are represented in blue. Five mutations specific to BA.1 (and BA.1.1) are in light blue. One BA.1.1 mutation is in black and 6 mutations specific to BA.2 are in magenta. The gray box represents the region where the alternate allele fraction changes. (B) An IGV (Robinson et al., 2011) view of the alignments for HMIX16, subsampled to only include read pairs where the first in pair covers the S:156/157del position, and the second in pair covers the S:212del position. Read pairs representing three mutation profiles are present: (i) supporting Delta mutations only (7 read pairs), (ii) supporting Omicron mutations only (10 read pairs), and (iii) supporting a Delta/Omicron recombinant (4 read pairs) marked with a green arrow. Delta mutations in orange, Omicron mutations in blue. Read pairs that do not span these mutations are not shown.

Infection with clonal Delta-Omicron recombinants

Having established that co-infections occur and can generate replication competent virus, we then looked for samples that are composed entirely of recombinant virus. In such samples, we expect that all mutations called would be supported by ∼100% of the reads because the viral population in the sample is composed of multiple copies of the same variant, rather than a mixture of two. We first looked for recombinants with one breakpoint where all mutations identified on the 5’-end of the breakpoint should be characteristic of one variant (e.g., variant A), and all mutations on the 3’-end of the breakpoint should be characteristic of the other variant (e.g., variant B) (Figure 3A). We identified seven samples that had Delta-specific ORF1A:A1306S at the 5’-end of the genome, and Omicron-specific N:P13L at the 3’-end. One sample had Omicron-specific ORF1A:P3395H at the 5’-end and Delta-specific N:D63G at the 3’-end. Further analysis of these eight genomes showed that only two genomes, RECOMB1 and RECOMB2, had multiple consecutive Delta mutations at the 5’-end while the 3’-end of the genome had all of the Omicron mutations but none of the Delta mutations (Figure 3B). Four of the six other genomes had all (5’ to 3’ of the genome) Omicron-specific mutations and the additional Delta ORF1A:A1306S, which was probably acquired independently. The remaining genomes had all of the Delta-specific mutations with one containing an additional Omicron N:P13L, and the other containing Omicron ORF1A:P3395H. These were probably also acquired independently.

Figure 3:
  • Download figure
  • Open in new tab
Figure 3: Infection with Delta-Omicron recombinants.

(A) Schematic of the impact of an infection from a virus resulting from the recombination of Delta and Omicron on the sequencing output. (B) Graphs representing the alternate allele fraction for each mutation. Forty-seven mutations are plotted in order of their position on the SARS-CoV-2 genome from 5’ to 3’. Genes are separated by dashed vertical lines. Sixteen mutations specific to Delta are represented in orange. Nineteen mutations specific to Omicron and shared by all its sub-lineages are represented in blue. Five mutations specific to BA.1 (and BA.1.1) are in light blue. One BA.1.1 mutation is in black and 6 mutations specific to BA.2 are in magenta. The gray box represents the homologous region where the breakpoint of the recombination is.

Given that amplicon primer-based artifacts, in conjunction with laboratory contamination, have previously led to spurious signatures of recombination (Kreier, 2022), we were careful to check the quality of the two possible recombinant viruses. First, the library preparation method used by Helix’s viral sequencing protocol is hybridization-based capture, not amplicon. Hybrid capture is less susceptible to artifacts due to mutations at primer sites, which has been a recurring issue with possible recombinant viruses observed in GISAID (Sanderson and Barrett, 2021). We have also seen far less drop-out in sequences generated via hybrid capture compared to amplicon in our sequencing data (Figure S4). Second, the Cq values for these two infections were low Cq(RECOMB1) = 22.5 and Cq(RECOMB2) = 20.5, implying a high source viral load that would be less susceptible to contamination. Third, we replicated these results for RECOMB1 and RECOMB2 by re-extracting RNA from the collected sample and re-sequencing. In the initial and replicate samples, the median alternate allele fraction was 1, indicating that the majority of mutations were supported by 100% of the reads. Fourth, we performed a manual review of the alignments using IGV to make sure the reads supporting the mutations were of high quality. Fifth, we were able to validate our results for RECOMB1 by running a genotyping assay looking at Delta: C21618G (S:T19R at the protein level) and Omicron: G8393A, T13195C, C23202A (S:T547K at the protein level). The results showed the presence of Delta C21618G and Omicron C23202A in the sample, but the absence of Omicron: G8393A and T13195C. These confirm that the 5’-end of the genome was from Delta and the 3’-end from Omicron. Lastly, RECOMB1 sample was also sent to the US Centers for Disease Control and Prevention (CDC) laboratory and results were confirmed via another sequencing technology (Lacek et al., 2022). Together, these experiments provide evidence that the two independent infections were caused by viruses resulting from the recombination of Delta and Omicron.

The sequences of the two recombinant viruses differ slightly. The breakpoint region of RECOMB1 is 374 bases between position 22,204 and position 22,578, while the breakpoint region of RECOMB2 is 2,398 bases between position 19,220 and position 21,618 (Figure 3B). Of interest, there is a private mutation T19404C in RECOMB2 inside the breakpoint region. RECOMB1 is a recombination between Delta sublineage AY.119 and Omicron sublineage BA.1.1. The 5’ Delta end of RECOMB2 is too short for sublineage classification, but the 3’ end is Omicron sublineage BA.1. The full list of mutations including the presence of unlabeled mutations (not shared by a large fraction of genomes in the same lineage) are in Table S6 (RECOMB1) and Table S7 (RECOMB2) in a VCF-like format. These two samples were both collected in Massachusetts, but the difference in sequence suggests they are unrelated. Overall, infections from a recombinant Delta-Omicron virus remain rare: 2 out of 10,742 sequences between January 10 and February 13, 2022. Eight other sequences similar to RECOMB1 have been reported by the CDC from samples collected in the US from December 31, 2021 to February 12, 2022 (Lacek et al., 2022). We did not identify any recombinant with Omicron on the 5’-end and Delta on the 3’-end or any recombinant with two breakpoints in our dataset.

Discussion

In this study we identified and validated 18 cases of co-infection with the Delta and Omicron variants, and 2 cases infected by a virus resulting from the recombination of Delta and Omicron. While contamination could lead to the same output as a co-infection, several pieces of evidence discount contamination: (i) re-extraction and re-sequencing these samples led to the same results; (ii) the fraction of reads supporting each variant was high in all cases (at least 15%); (iii) samples that showed a co-infection were collected and processed on different days, and other samples sequenced on the same plates did not show co-infection; (iv) the collection dates of the co-infections and the infections by a recombinant virus followed a logical timeline; (v) in one of these co-infections, we found evidence of recombinant virus at a low but detectable frequency, consistent with template-switching during viral replication in a cell infected with two variants.

In the case of the two recombinants, our data again supports chimeric sequences being the cause, rather than technical artifacts. First, we were able to replicate the result for both samples after re-extracting RNA. Second, our sequencing protocol is based on hybrid capture and is less prone to amplicon-based artifacts (Figure S4). Third, the chimeric exhibited breakpoint regions on the 5’ end of the spike protein as well as within the spike protein. These were not events limited to the spike protein, which is where many amplicon-based primer artifacts have been detected (Kreier, 2022).

Our study demonstrates the existence of co-infections, the presence of a recombinant population in at least one of these co-infections, and the existence of two infections consisting almost entirely of multiple copies of a recombinant virus. However, the mechanism by which a recombinant virus comes to dominate an infection remains somewhat of a puzzle. One possibility is that the two infections that contain only recombinant virus were themselves seeded by a recombinant virus. This implies that in their respective ancestral co-infections, the two recombinant viruses each rose to a high enough fraction to be transmitted during an exposure and were able to establish an infection in a new host. Yet, despite transmitting to a new host at least once, the transmission chain was not sustained; neither RECOMB1 nor RECOMB2 have led to large clusters of cases. The other possibility is that these two infections began as co-infections and that the recombinant viral population then completely outcompeted the Delta and Omicron populations within the host. Yet, it seems unlikely that Delta and Omicron can be completely cleared from a host, while leaving the recombinant virus population intact. In either case, the recombinant did not appear to have an increased ability to transmit between hosts compared to co-circulating Omicron variants. There are parallels here with HIV-1, where chronic infection and host immune response leads to extensive within-host diversity of the virus, but the genotypes of the virus that ultimately seed new infections are from a much narrower set of viral types (Joseph et al., 2015; Sagar et al., 2009).

With more diversity in circulating SARS-CoV-2 genomes, it will now be possible to track recombinations, characterize the rate of recombination, and identify hot spots for breakpoints (de Klerk et al., 2021). One way to detect these recombinants is the strategy we used. Another is to review every instance where a sample has good sequencing metrics but where methods like Nextclade (Hadfield et al., 2018) or Pangolearn (Rambaut et al., 2020) have difficulty attributing a clade or a lineage to the sequence. Specialized methods have also been developed to detect recombination in viruses (Martin et al., 2015; Samson et al., 2021; Varabyou et al., 2021). With a better understanding of SARS-CoV-2 recombination, and by drawing parallels with recombination in other unsegmented positive-strand RNA viruses, as well as other viruses in general (Simon-Loriere and Holmes, 2011), we can be better prepared to anticipate new variants or combinations of mutations of SARS-CoV-2 that may arise in the future.

Methods

Samples

This paper is based on the study of SARS-CoV-2 found and collected from individuals in the United States. The detailed demographics about these individuals can be found in Table S1 of this paper.

The Helix data analyzed and presented here were obtained through IRB protocol WIRB#20203438, which grants a waiver of consent for a limited dataset for the purposes of public health under section 164.512(b) of the Privacy Rule (45 CFR § 164.512(b)). All samples were de-identified before receipt by the study investigators.

Helix COVID-19 test data and sample selection

All viral samples in this investigation were collected by Helix through its diagnostic testing laboratory. The Helix COVID-19 test is run on specimens collected across the US, and results are obtained as part of our standard test processing workflow using specimens from anterior nares swabs. The Helix COVID-19 Test is based on the Thermo Fisher TaqPath COVID-19 Flu A, Flu B Combo Kit, which targets three respiratory pathogens (SARS-CoV-2, Influenza A, and Influenza B). Swabs are transported in saline and sample tubes are heat inactivated upon receipt at the lab. Test results from positive cases, together with a limited amount of metadata (including sample collection date, state, and qRT-PCR Cq values for all targets), were used to build the research database used here.

SARS-CoV-2 sequencing and consensus sequence generation

Sequencing was performed by Helix as part of the SARS-CoV-2 genomic surveillance program in partnership with the Centers for Disease Control and Prevention (CDC). In the Helix workflow, RNA is extracted from 400 μL of patient anterior nares sample using the MagMAX Viral/Pathogen kit (ThermoScientific). Sequencing libraries were generated with total RNA library preparation (5μL RNA input volume) using the Rapid RNA Library Kit protocol (Swift Biosciences/Integrated DNA Technologies). SARS-CoV-2 genome capture was accomplished using hybridization kit xGen COVID-19 Capture Panel (Integrated DNA Technologies). Samples were sequenced using the NovaSeq 6000 Sequencing system S1 flow cell, with S1 Reagent Kit v1.5 (300 cycles). Note: Specimens identified as co-infections or recombinants through the surveillance program were verified by reprocessing from the original specimen. The process was replicated as described above; however, the hybridization probe panel was substituted. The IDT COVID-19 Capture Panel was replaced with the Respiratory Virus Research Panel (Twist Biosciences), while all other reagents remained the same.

Bioinformatic processing of this sequencing output is as follows. The flow cell output is demultiplexed with bcl2fastq (Illumina) into per-sample FASTQ sequences that are then run through the Helix fastagenerator pipeline to produce a sequence FASTA file. First, reads are aligned to a reference comprising the SARS-CoV-2 genome (NCBI accession NC_045512.2) and the human transcriptome (GENCODE v37) using BWA-MEM. Reads are then marked for duplicates before proceeding to variant calling using the Haplotyper algorithm (Sentieon, Inc). Finally, the per-base coverage from the alignment file (BAM) and per-variant allele depths from the variant call format (VCF) file are used to build a consensus sequence according to the following criteria: coverage from at least 5 unique reads is required with at least 80% of the reads supporting the allele. Otherwise, that base is considered uncertain, and an N is reported.

Alternate allele fraction is the number of reads supporting an alternate allele (i.e. a mutation) divided by the total number of reads covering the position. The median alternate allele fraction is calculated as the median value of alternate allele fractions at sites where at least 15% of the reads support a mutation.

Viral lineage designation

Viral sequences were assigned a Pango lineage (Rambaut et al., 2020) using pangoLEARN (https://github.com/cov-lineages/pangoLEARN). For this analysis, pangoLEARN version 2022-02-02 with Pangolin software version 3.1.11 was used. We sequenced and were able to attribute a lineage to 29,719 sequences from samples collected between November 22, 2021 and February 13, 2022 for genomic surveillance purposes.

Genotyping

The detailed genotyping method as well as the validation of the method used in this study are previously described (Lai et al., 2022). The four specific markers used were:

  • – Delta: C21618G

  • – Omicron: G8393A

  • – Omicron: T13195C

  • – Omicron: C23202A

Relative fraction of each variant in co-infections

Number of RNA copies and coverage does vary across the SARS-CoV-2 genome. The density of mutations specific to Delta or Omicron also varies. There are many more Omicron-specific mutations in the spike protein. To try to minimize some biases, we took 4 Delta-specific mutations and 4 Omicron-specific mutations spread across SARS-CoV-2 genome to calculate the mean Delta-allele fraction and the mean Omicron-allele fraction in each sample. The 4 Delta-specific mutations are: ORF1A:P2046L, ORF1B:P1000L, S:T19R, M:I82T. The 4 Omicron-specific mutations are: ORF1A:P3395H, ORF1B:I1566V, S:N969K, M:A63T.

The results are in Table S5. We considered Delta to be the dominant variant if the Delta fraction was above 60% and the Omicron fraction was below 40%. We considered Omicron to be the dominant variant if the Omicron fraction was above 60% and the Delta fraction was below 40%. Other samples were considered balanced. To decide which variant was dominant for each sample for Table 1, we used the results of the initial sequencing.

Statistics

The 95% Confidence Interval for the frequency of co-infections during co-circulation was calculated using the Python statsmodels.stats.proportion.proportion_confint package. ci_low, ci_up = sm.stats.proportion_confint(18, 14214, alpha=0.05, method=‘normal’)

Data Availability

All data produced in the present work are contained in the manuscript. SARS-CoV-2 sequences have been uploaded on GISAID. BAM files have been uploaded to SRA Bioproject: PRJNA804575

Data availability

All samples with a qc_status of pass were uploaded to GISAID.

GISAID identifiers for RECOMB1: hCoV-19/USA/MA-CDC-STM-HZEBR92XC/2022, EPI_ISL_9088187

GISAID identifiers for for RECOMB2: hCoV-19/USA/MA-CDC-STM-SP94WR2RW/2022, EPI_ISL_10114799

BAMs of co-infection samples and recombinant samples are available at SRA STUDY: PRJNA804575. Link: https://www.ncbi.nlm.nih.gov/Traces/study/?acc=PRJNA804575&o=acc_s%3Aa

SRA BioSample accessions:

HMIX16: SAMN26527328, bioproject: PRJNA804575

(https://www.ncbi.nlm.nih.gov/biosample/26527328)

RECOMB1: SAMN26527329, bioproject: PRJNA804575

(https://www.ncbi.nlm.nih.gov/biosample/26527329)

RECOMB2: SAMN26527330, bioproject: PRJNA804575

(https://www.ncbi.nlm.nih.gov/biosample/26527330)

Declarations of interest

A.B., T.B., S.W., A.D.R., D.W., E.K., H.D., T.C., K.T., J.N., J.R., S.C., E.T.C., K.S.B., N.L.W., P.B., S.J., E.S., D.B., J.T.L., M.I., W.L., and S.L. are all employees of Helix.

Supplementary Information

4 Supplementary figures and 7 supplementary tables

Table S1: Demographic information on the population where the samples were collected

Table S2: Counts and fraction of different lineages in the United States by week, related to Figure S1

Table S3: List of mutations used to identify and differentiate Delta and Omicron variants

Table S4: Reference and Alternate allele depth for each mutation for all HMIX (co-infections) and RECOMB (Delta Omicron recombinant) samples

Table S5: Fraction of Delta vs Omicron in each co-infection samples

Table S6: List of mutations identified in RECOMB1 sample

Table S7: List of mutations identified in RECOMB2 sample

Figure S1:
  • Download figure
  • Open in new tab
Figure S1: Co-circulation of Delta and Omicron variants in the United States.

Fraction of lineages sequenced per week in the United States. Delta includes B.1.617.2 and all lineages starting with AY. Detailed numbers for the most common AY lineages are in Table S2. Delta: orange. Omicron: light blue (BA.1), blue (BA.1.1), navy blue (BA.2). The week of collection of the sample is on the X-axis.

Figure S2:
  • Download figure
  • Open in new tab
Figure S2: Co-infection with Delta and Omicron variants, related to Figure 1.

Graphs representing the Alternate Allele Fraction for each mutation. 35 mutations are plotted in order of their position on the SARS-CoV-2 genome from 5’ to 3’. Genes are separated by dashed vertical lines. 18 mutations specific to Delta are represented in orange: ORF1A:A1306S, ORF1A:P2046L, ORF1A:P2287S, ORF1A:V2930L, ORF1A:T3646A, ORF1B:G662S, ORF1B:P1000L, ORF1B:1918V, S:T19R, S:L452R, S:P681R, S:D950N, ORF3A:S26L, M:I82T, ORF7A:V82A, ORF7A:T120I, N:D63G, N:R203M. 16 mutations specific to Omicron are represented in blue: ORF1A:P3395H, ORF1B:I1566V, S:G339D, S:S375F, S:E484A, S:Q498R, S:H655Y, S:N764K, S:D796Y, S:N969K, E:T9I, M:Q19E, M:A63T, N:P13L, N:DEL31-33, N:R203K. The S:R346K characteristic of Omicron BA.1.1 is in dark blue.

Figure S3:
  • Download figure
  • Open in new tab
Figure S3: Viral load in co-infections.

The Cq of each sample, grouped by the dominant variant. The first column are the samples (n=5) with a majority of Omicron; the second column are the samples (n=8) with a balanced mix of Delta and Omicron; the third column are the samples (n=5) with a majority of the Delta variant. Cq is based on the qRT-PCR assay performed on samples collected from the anterior nares of patients. Whiskers denote the median and 95% Confidence Interval. Of note, Omicron BA.1 and BA.1.1 samples have the S:69/70del, which interferes with one of the SARS-CoV-2 probes of the Thermo Fisher TaqPath COVID-19 Flu A, Flu B Combo Kit. Therefore 100% Omicron samples may be expected to have an elevated Cq compared to the Cq of a 100% Delta sample with the same viral load.

Figure S4:
  • Download figure
  • Open in new tab
Figure S4: Comparison of sequencing output based on Amplicon vs. Hybrid capture.

Fraction of sequences with an N by position in the SARS-CoV-2 genome. The shaded rectangle corresponds to the region that encodes the S glycoprotein. Fraction based on 2500 positive samples sequenced via the amplicon method and 2500 positive samples sequenced via hybrid capture. The 2500 samples sequenced via hybrid capture include 174 that were designated as Omicron. None of the 2500 samples sequenced via the amplicon method were designated Omicron.

Acknowledgements

We thank the employees of Helix, members of the CDC SPHERES consortium and California CovidNET for discussions and help with logistics. We thank Elizabeth Ohlsen and Temet McMichael for discussions and help with the epidemiology of co-infections. We thank the healthcare workers, frontline workers, and patients who made the collection of this SARS-CoV-2 dataset possible. Special thanks to Anand Pai for helpful discussions on the parallels with HIV-1. We thank the NIH RADx initiative, which funded a portion of this work. This work has been supported by the Centers for Disease Control and funded in part by CDC Contract 75D30121C12730 (Helix).

Footnotes

  • We added replication results for all co-infection samples and the second recombinant sample. All of these replications were done by re-extracting RNA and resequencing the samples. Replication was possible on 18 of the 20 samples in the first version. To make it clearer, we removed from the main figures the 2 samples with co-infections for whom we could not perform a replication experiment. Previous Figure 1 was moved to Supplementary Figure 1. Analysis results of possible recombination at the 5'end of the spike protein gene in all co-infection samples was added (just a couple of phrases in the text). We added as a reference a recent (after the version 1 of our paper was published) preprint (BioRxiv) from the CDC showing a cluster of 9 Delta-Omicron recombinants in the US, which includes RECOMB1.

References

  1. ↵
    Bolze, A., Cirulli, E.T., Luo, S., White, S., Wyman, D., Dei Rossi, A., Cassens, T., Jacobs, S., Nguyen, J., Ramirez, J.M., III., et al. (2021). Rapid displacement of SARS-CoV-2 variant B.1.1.7 by B.1.617.2 and P.1 in the United States (medRxiv).
  2. ↵
    CDC (2020). COVID data tracker.
  3. ↵
    Earnest, R., Uddin, R., Matluk, N., Renzette, N., Siddle, K.J., Loreth, C., Adams, G., Tomkins-Tinch, C.H., Petrone, M.E., Rothman, J.E., et al. (2021). Comparative transmissibility of SARS-CoV-2 variants Delta and Alpha in New England, USA. medRxiv.
  4. ↵
    Fischer, W., Giorgi, E.E., Chakraborty, S., Nguyen, K., Bhattacharya, T., Theiler, J., Goloboff, P.A., Yoon, H., Abfalterer, W., Foley, B.T., et al. (2021). HIV-1 and SARS-CoV-2: Patterns in the evolution of two pandemic pathogens. Cell Host Microbe 29, 1093–1110.
    OpenUrl
  5. ↵
    Hadfield, J., Megill, C., Bell, S.M., Huddleston, J., Potter, B., Callender, C., Sagulenko, P., Bedford, T., and Neher, R.A. (2018). Nextstrain: real-time tracking of pathogen evolution. Bioinformatics 34, 4121–4123.
    OpenUrlCrossRefPubMed
  6. ↵
    Jackson, B., Boni, M.F., Bull, M.J., Colleran, A., Colquhoun, R.M., Darby, A.C., Haldenby, S., Hill, V., Lucaci, A., McCrone, J.T., et al. (2021). Generation and transmission of interlineage recombinants in the SARS-CoV-2 pandemic. Cell 184, 5179–5188.e8.
    OpenUrl
  7. ↵
    Joseph, S.B., Swanstrom, R., Kashuba, A.D.M., and Cohen, M.S. (2015). Bottlenecks in HIV-1 transmission: insights from the study of founder viruses. Nat. Rev. Microbiol. 13, 414–425.
    OpenUrlCrossRefPubMed
  8. ↵
    de Klerk, A., Swanepoel, P., Lourens, R., Zondo, M., Abodunran, I., Lytras, S., MacLean, O.A., Robertson, D., Kosakovsky Pond, S.L., Zehr, J.D., et al. (2021). Conserved recombination patterns across coronavirus subgenera.
  9. ↵
    Kreier, F. (2022). Deltacron: the story of the variant that wasn’t. Nature 602, 19.
    OpenUrl
  10. ↵
    Lacek, K.A., Rambo-Martin, B.L., Batra, D., Zheng, X.-Y., Sakaguchi, H., Peacock, T., Keller, M., Wilson, M.M., Sheth, M., Davis, M.L., et al. (2022). Identification of a Novel SARS-CoV-2 Delta-Omicron Recombinant Virus in the United States.
  11. ↵
    Lai, E., Becker, D., Brzoska, P., Cassens, T., Davis-Turak, J., Diamond, E., Furtado, M., Gandhi, M., Gort, D., Greninger, A.L., et al. (2022). A method for variant agnostic detection of SARS-CoV-2, rapid monitoring of circulating variants, detection of mutations of biological significance, and early detection of emergent variants such as Omicron.
  12. ↵
    Martin, D.P., Murrell, B., Golden, M., Khoosal, A., and Muhire, B. (2015). RDP4: Detection and analysis of recombination patterns in virus genomes. Virus Evol 1, vev003.
    OpenUrlCrossRefPubMed
  13. ↵
    Rambaut, A., Holmes, E.C., O’Toole, Á., Hill, V., McCrone, J.T., Ruis, C., du Plessis, L., and Pybus, O.G. (2020). A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol 5, 1403–1407.
    OpenUrl
  14. ↵
    Robinson, J.T., Thorvaldsdóttir, H., Winckler, W., Guttman, M., Lander, E.S., Getz, G., and Mesirov, J.P. (2011). Integrative genomics viewer. Nature Biotechnology 29, 24–26.
    OpenUrlCrossRefPubMedWeb of Science
  15. ↵
    Sagar, M., Laeyendecker, O., Lee, S., Gamiel, J., Wawer, M.J., Gray, R.H., Serwadda, D., Sewankambo, N.K., Shepherd, J.C., Toma, J., et al. (2009). Selection of HIV variants with signature genotypic characteristics during heterosexual transmission. J. Infect. Dis. 199, 580–589.
    OpenUrlCrossRefPubMedWeb of Science
  16. ↵
    Samson, S., Lord, É., and Makarenkov, V. (2021). SimPlot++: a Python application for representing sequence similarity and detecting recombination.
  17. ↵
    Sanderson, T., and Barrett, J.C. (2021). Variation at Spike position 142 in SARS-CoV-2 Delta genomes is a technical artifact caused by dropout of a sequencing amplicon.
  18. ↵
    Simon-Loriere, E., and Holmes, E.C. (2011). Why do RNA viruses recombine? Nat. Rev. Microbiol. 9, 617–626.
    OpenUrlCrossRefPubMed
  19. ↵
    Varabyou, A., Pockrandt, C., Salzberg, S.L., and Pertea, M. (2021). Rapid detection of inter-clade recombination in SARS-CoV-2 with Bolotie. Genetics 218.
  20. ↵
    Vaziri, A., and Allday, E. (2021). First U.S. omicron case found in San Francisco. Here’s what we know. Chronicle.
  21. ↵
    Viana, R., Moyo, S., Amoako, D.G., Tegally, H., Scheepers, C., Althaus, C.L., Anyaneji, U.J., Bester, P.A., Boni, M.F., Chand, M., et al. (2022). Rapid epidemic expansion of the SARS-CoV-2 Omicron variant in southern Africa. Nature.
Back to top
PreviousNext
Posted March 28, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Evidence for SARS-CoV-2 Delta and Omicron co-infections and recombination
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Evidence for SARS-CoV-2 Delta and Omicron co-infections and recombination
Alexandre Bolze, Tracy Basler, Simon White, Andrew Dei Rossi, Dana Wyman, Pavitra Roychoudhury, Alexander L. Greninger, Kathleen Hayashibara, Mark Beatty, Seema Shah, Sarah Stous, Eric Kil, Hang Dai, Tyler Cassens, Kevin Tsan, Jason Nguyen, Jimmy Ramirez, Scotty Carter, Elizabeth T. Cirulli, Kelly Schiabor Barrett, Nicole L. Washington, Pedro Belda-Ferre, Sharoni Jacobs, Efren Sandoval, David Becker, James T. Lu, Magnus Isaksson, William Lee, Shishi Luo
medRxiv 2022.03.09.22272113; doi: https://doi.org/10.1101/2022.03.09.22272113
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Evidence for SARS-CoV-2 Delta and Omicron co-infections and recombination
Alexandre Bolze, Tracy Basler, Simon White, Andrew Dei Rossi, Dana Wyman, Pavitra Roychoudhury, Alexander L. Greninger, Kathleen Hayashibara, Mark Beatty, Seema Shah, Sarah Stous, Eric Kil, Hang Dai, Tyler Cassens, Kevin Tsan, Jason Nguyen, Jimmy Ramirez, Scotty Carter, Elizabeth T. Cirulli, Kelly Schiabor Barrett, Nicole L. Washington, Pedro Belda-Ferre, Sharoni Jacobs, Efren Sandoval, David Becker, James T. Lu, Magnus Isaksson, William Lee, Shishi Luo
medRxiv 2022.03.09.22272113; doi: https://doi.org/10.1101/2022.03.09.22272113

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Infectious Diseases (except HIV/AIDS)
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)