Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

SARS-CoV-2 VARIANT PREVALENCE ESTIMATION USING WASTEWATER SAMPLES

I. López-de-Ullibarri, L. Tomás, N. Trigo-Tasende, B. Freire, M. Vaamonde, P. Gallego-García, I. Barbeito, J.A. Vallejo, J. Tarrío-Saavedra, P. Alvariño, E. Beade, N. Estévez, S. Rumbo-Feal, K. Conde-Pérez, L. de Chiara, I. Iglesias-Corrás, M. Poza, S. Ladra, D. Posada, R. Cao
doi: https://doi.org/10.1101/2023.01.13.23284507
I. López-de-Ullibarri
aResearch Group MODES, Research Center for Information and Communication Technologies (CITIC), University of A Coruña (UDC), Campus de Elviña, 15071 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
L. Tomás
bCINBIO, Universidade de Vigo, 36310 Vigo, Spain
cGalicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, 36312 Vigo, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
N. Trigo-Tasende
da, As Xubias, 15006 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
B. Freire
eUniversity of A Coruña (UDC), Research Center for Information and Communication Technologies (CITIC), Database Laboratory, Campus de Elviña, 15071 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
M. Vaamonde
aResearch Group MODES, Research Center for Information and Communication Technologies (CITIC), University of A Coruña (UDC), Campus de Elviña, 15071 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
P. Gallego-García
bCINBIO, Universidade de Vigo, 36310 Vigo, Spain
cGalicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, 36312 Vigo, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
I. Barbeito
aResearch Group MODES, Research Center for Information and Communication Technologies (CITIC), University of A Coruña (UDC), Campus de Elviña, 15071 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
J.A. Vallejo
da, As Xubias, 15006 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
J. Tarrío-Saavedra
aResearch Group MODES, Research Center for Information and Communication Technologies (CITIC), University of A Coruña (UDC), Campus de Elviña, 15071 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
P. Alvariño
bCINBIO, Universidade de Vigo, 36310 Vigo, Spain
cGalicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, 36312 Vigo, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
E. Beade
eUniversity of A Coruña (UDC), Research Center for Information and Communication Technologies (CITIC), Database Laboratory, Campus de Elviña, 15071 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
N. Estévez
bCINBIO, Universidade de Vigo, 36310 Vigo, Spain
cGalicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, 36312 Vigo, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
S. Rumbo-Feal
da, As Xubias, 15006 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
K. Conde-Pérez
da, As Xubias, 15006 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
L. de Chiara
bCINBIO, Universidade de Vigo, 36310 Vigo, Spain
cGalicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, 36312 Vigo, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
I. Iglesias-Corrás
eUniversity of A Coruña (UDC), Research Center for Information and Communication Technologies (CITIC), Database Laboratory, Campus de Elviña, 15071 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
M. Poza
da, As Xubias, 15006 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
S. Ladra
eUniversity of A Coruña (UDC), Research Center for Information and Communication Technologies (CITIC), Database Laboratory, Campus de Elviña, 15071 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
D. Posada
bCINBIO, Universidade de Vigo, 36310 Vigo, Spain
cGalicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, 36312 Vigo, Spain
fDepartment of Biochemistry, Genetics, and Immunology, Universidade de Vigo, 36310 Vigo, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
R. Cao
aResearch Group MODES, Research Center for Information and Communication Technologies (CITIC), University of A Coruña (UDC), Campus de Elviña, 15071 A Coruña, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: rcao{at}udc.es
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

The present work describes a statistical model to account for sequencing information of SARS-CoV-2 variants in wastewater samples. The model expresses the joint probability distribution of the number of genomic reads corresponding to mutations and non-mutations in every locus in terms of the variant proportions and the joint mutation distribution within every variant. Since the variant joint mutation distribution can be estimated using GISAID data, the only unknown parameters in the model are the variant proportions. These are estimated using maximum likelihood. The method is applied to monitor the evolution of variant proportions using genomic data coming from wastewater samples collected in A Coruña (NW Spain) in the period May 2021 – March 2022. Although the procedure is applied assuming independence among the number of reads along the genome, it is also extended to account for Markovian dependence of counts along loci in the aggregated information coming from wastewater samples.

Motivation and background

During the last decade, wastewater-based epidemiological surveillance has emerged as a highly relevant discipline, with the potential to provide information by combining the use of analytical methods with the development of ad hoc modelling approaches. This surveillance has been widely used in recent years to accurately predict consumption patterns for numerous substances (EMCDDA, 2020). During the COVID-19 pandemic, processes for monitoring the viral load of SARS-CoV-2 in wastewater were developed for the first time in the Netherlands (Medema et al., 2020).

Around a third of the people primarily infected with SARS-CoV-2 in Spain were asymptomatic (Pollán et al., 2020). However, the percentage of asymptomatic cases depends on many factors, such as the average age and the degree of natural or artificial immunity in each population. In addition, a significant proportion of people infected with COVID-19, including symptomatic and asymptomatic, who were tested for fecal viral RNA tested positive from the initial steps of infection (Gupta et al., 2020) and tested positive persistently in rectal swabs even after nasopharyngeal testing was negative (Chen et al., 2020; Xing et al., 2020; Xu et al., 2020; Zhang et al., 2020; Cevik et al., 2021; Miura et al., 2021).

Due to all of the above, the genetic material of SARS-CoV-2 can be found in wastewater (Lodder and de Roda Husman, 2020), which has made the monitoring of the RNA viral load in wastewater an excellent tool for the epidemiological monitoring of the COVID-19 pandemic, as well as an efficient early warning method for the detection of outbreaks (Randazzo et al., 2020; Ahmed et al., 2020; Medema et al., 2020; Peccia et al., 2020; F Wu et al., 2020; Wurtzer et al., 2020). Likewise, the methods of massive sequencing of aggregate samples collected in wastewater treatment plants or in the sanitation network itself make it possible to obtain readings that include the mutations observed in the SARS-CoV-2 genome. With the help of appropriate statistical models and methods, estimates of the number of active cases of patients with COVID-19 can be obtained from the viral load quantification data at Wastewater Treatment Plants (WWTPs) (Vallejo et al. 2022).

On the other hand, as a result of the proliferation of SARS-CoV-2 variants, specific mutations have been monitored to study the evolution of variants (Bar-Or et al. 2021) and the total SARS-CoV-2 concentration (Radu et al. 2022). Recently, statistical methods have been proposed that make it possible to analyze the readings of mutation frequencies in the virus genome in order to obtain precise estimates of the proportions of variants (Barbeito et al. 2022, Gafurov et al. 2022, Karthikeyan et al. 2022, Radu et al. 2022, Valieris et al. 2022). In this paper, the joint mutation distribution is estimated using GISAID data and the variant proportions are estimated using maximum likelihood. The model can be formulated either assuming independence among the number of reads along the genome or allowing for Markovian dependence of counts along loci.

Methodology

Since the genetic material of the samples collected at the WWTP is degraded as a consequence of the passage of wastewater through the sanitation network, the genomes collected are remarkably fragmented. On the other hand, each sample corresponds to the genetic material of the thousands of infected human beings among the almost 400,000 inhabitants of the metropolitan area of A Coruña. As a consequence of all this and of the amplicon technology used for massive sequencing (see Section 4), the available information corresponds to counts of mutation reads throughout a number of positions (loci) in the virus genome.

In the case in which clinical samples could be taken from individual patients, it would be possible to observe the complete RNA strand (or at least very large fragments of it that could be juxtaposed), which means having observations of the vector variable that considers which type of mutation has occurred at each locus. However, for the samples obtained at the WWTP, it is only possible to observe the frequencies of mutations in each of these loci in an aggregated manner on the set of individuals that have excreted that genetic material. As a consequence, the statistical methods for estimating the proportions of variants have to be designed for the data-generating process, aggregated, in individuals, and marginal, in loci, that occurs in this setup. We will now formulate this data-generating process.

A viral haplotype can be expressed as a vector x = (x1, …, xl), l being the number of genomic positions or loci. The set of feasible values for locus xi is Ai = {0, …, ai}, where 0 refers to the reference allele and 1, …, ai are indices identifying the alternative alleles (i.e. different types of mutations at locus i=1,…,l). As a consequence, x ∈ H, H being the Cartesian product. A1 × … × Al. We denote by X and V, respectively, discrete random variables modeling a haplotype and a viral variant sampled at random from the viral genomes in wastewater. For r viral variants ν1, …, νr the quantities Embedded Image, for j = 1, …, r, are defined as P(X = x | V = νj). So Embedded Image, when x ∈ H, is just the haplotype distribution of variant νj. By the total probability law, Embedded Image, where πj = P (V = νj) is the unknown probability of the j-th variant. It is important to remark that, although the Embedded Image are also unknown, they can be estimated very easily without using the wastewater samples, e.g., from the viral genomes available at GISAID’s EpiCoV database.

If the viral genomic sequences could be fully observed in wastewater, the data would consist of a sample of haplotype vectors X1. …, Xn. Given this “ideal sample” (not observable in wastewater, just for clinical patients), the observed sample can be modeled as follows. Consider, for each locus k, for k = 1, …, l, the probability αk that the k-th locus of a viral genome selected at random is observed in the sample. The number of observations for locus k is Embedded Image, where Embedded Image is a binary random variable indicating whether the i-th “ideally observed” haplotype has been actually observed at locus k. It is natural to model Nk as a random variable with binomial distribution, B(n, αk) being the expected number of reads at locus k. Its mean n αk depends on the αk probabilities, which are strongly determined by the sequencing technology and may greatly differ across loci. Since the are observable, in the following we condition on their observed values.

Given, for k =1, …, l and assuming that the sequencing technology does not affect the marginal distribution of X, it is possible to derive the distribution of the observed allele frequencies for each locus in the sample, Y = (Y1, …, Yl) where, for k = 1, …, l, Embedded Image and Embedded Image. In the last expression, to avoid ambiguity, the superscript (k) is used to refer to the k-th component of Xi, and 1 (A) is the indicator of event A. Clearly, Embedded Image, and, conditionally on Nk, Yk has multinomial distribution M(Nk, qk) where Embedded Image, is a vector whose S-th component is Embedded Image.

Thus, the distribution of Y depends on the “known” haplotype probabilities within every viral variant estimated from available data Embedded Image, the number of reads at every locus (Nk, k = 1, …, l), and the unknown variant probabilities (πj, j = 1, …, r) in the population of viral genomes sampled. The πj can be estimated using available information and the observed allele frequencies in the wastewater sample. Assuming independence of the random variables, Yk, k = 1, …, l, and having observed the allele mutation frequencies collected in the vector y = (y1, …, yl) the likelihood (conditional on, Nk, k = 1, …, l) is: Embedded Image

Maximum conditional likelihood estimates of (π1, …, πr) are obtained by maximizing L (π1, …, πr) constrained to π1 ≥ 0, …, πr, ≥ 0, Embedded Imagee.g., using an augmented Lagrangian method.

Markovian dependence among loci

The independence assumption among the random variables, Yk, k = 1, …, l, can be relaxed by just assuming a Markovian condition for the random vector Y: Embedded Image

By assuming this condition, the likelihood becomes: Embedded Image which just requires to deal with the conditional probabilities of the form P(Yk = yk | Yk−1 = yk−1), for k = 2, … l. Without loss of generality and for simplifying the notation, we consider P(Y2 = y2 | Y1 = y1) and assume that a1 = a2 = 1, i.e. just one type of possible mutation at loci k = 1,2. As a consequence, the joint distribution of (Y1,Y2) = (Y1,0,Y1,1,Y2,0,Y2,1) can be expressed in terms of the random vector Z = (Z0,0 Z0,1 Z1,0 Z1,1), where the random variable Zi,j denotes the number of co-occurrences of mutation i in locus 1 and mutation j in locus 2. Indeed Embedded Image

Now, since the random vector Z has a multinomial distribution: M(N1,2,(p0,0,p0,1,p1,0,p1,1)), where N1,2 is the number of joint reads at loci 1 and 2 and (p0,0,p0,1,p1,0,p1,1) is the vector with the probability mass corresponding to mutations (0 or 1) at loci 1 and 2, the joint probability mass of (Y1,Y2) is then straightforward: Embedded Image where z ∈ C (y) in the sum means that the values of z ranges over all possibilities such that y1,0 = z0,0 + z0,1, y1,1 = z1,0 + z1,1, y2,0 = z0,0 + z1,0, y2,1 = z0,1 + y1,1. The marginal probability mass of Y1 is even simpler: Embedded Image

Using the definition of conditional probability, the conditional distribution becomes: Embedded Image where the co-occurrence probabilities can be easily expressed in terms of the variants bivariate haplotype distributions, Embedded Image, and the variant marginal distribution: Embedded Image

As a consequence, the likelihood in the Markovian dependence case can be written just in terms of the variants bivariate haplotype distributions and the unknown variant probabilities.

Simulations

Simulated data, as well as synthetic data coming from in vitro experiments, where the proportion of every variant is known, have been used to assess the quality of the method. We considered four scenarios.

Dataset #1 consists of simulated reads of 1 genome per variant without sequencing errors. The data were created from four different genomes from GISAID (consensus sequences), each genome corresponding to a different variant. A simulator of amplicon reads (with no sequencing errors) is applied based on the real coverage/depth profiles of ARCTIC protocol (obtained from real reads) and then those simulated reads are mixed in the percentages included in Table 1, which also contains the estimated percentages.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1:

Mixing variant percentages and their estimations for Dataset #1.

Dataset #2 also contains simulated reads without sequencing errors but of multiple genomes per variant. The data were created from four different genomes from GISAID (consensus sequences), each genome corresponding to a different variant. As for the previous dataset, a simulator of amplicon reads is applied based on the real coverage/depth profiles of ARCTIC protocol, obtained from real reads. The simulated reads are mixed in the percentages included in Table 2, which also contains the estimated percentages.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2:

Mixing variant percentages and their estimations for Dataset #2.

Dataset #3 consists of mixing clinical samples created from real genomes reads obtained in the project EPICOVIGAL. For each variant, just one dataset is used and then the reads were mixed according to the percentages presented in Table 3. This table also includes the estimated percentages.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 3:

Mixing variant percentages and their estimations for Dataset #3.

Dataset #4 was also constructed by mixing clinical samples. It was created from real genomes reads obtained in the project EPICOVIGAL mixed in the percentages collected in Table 4, which also includes the estimated percentages.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 4:

Mixing variant percentages and their estimations for Dataset #4.

The results in Tables 1-4 show that the estimation error of the variant percentages is always below 2.7% for all the variants in Datasets #1, #2 and #4. For Dataset #3, the largest estimation error is around 5.3%. This happens for B.1.1.7, with a real percentage of 45%. This implies a relative estimation error of around 1/9.

Monitoring the evolution of variant proportions

The method presented is applied to monitoring the evolution of variant proportions using genomic data coming from weekly wastewater samples collected in A Coruña (NW Spain) in the period May 2021 – March 2022. This monitoring was part of the COVIDBENS project. It was an initiative carried out from April 2020 to March 2022 and financed by the public company WWTP Bens S.A., responsible for managing the WWTP in charge of purifying wastewater from the municipalities of A Coruña, Arteixo, Cambre, Culleredo and Oleiros, which comprise a population of nearly 400,000 inhabitants of the metropolitan area of A Coruña (NW Spain). The main objective of the project was to monitor the SARS-CoV-2 coronavirus epidemic in the metropolitan area of A Coruña.

COVIDBENS served as an early warning against possible outbreaks, since it proved to be able to anticipate between 2 and 3 weeks in the beginning of the pandemic waves with respect to the data on active cases reported by the health system (Trigo-Tasende et al. 2022). In addition, using the amount of genetic material of the virus present in the wastewater, nonparametric statistical models were used to estimate the number of infected people in the population (Vallejo et al. 2022).

Since December 2020, complying with the recommendation of the European Commission (https://ec.europa.eu/environment/pdf/water/recommendation_covid19_monitoring_wastewaters.pdf), the COVIDBENS team has been in charge of monitoring the emergence of new mutations and variants of SARS-CoV-2 in the wastewater arriving at the Bens WWTP using massive sequencing technologies. With the collaboration of Aguas de Galicia and EDAR Bens S.A., this challenge was tackled using two different strategies: 1) amplicon sequencing and 2) shotgun sequencing with enrichment of human respiratory viruses. The results obtained by the COVIDBENS team showed that both technologies are effective for the detection of SARS-CoV-2 mutations. Amplicon sequencing works very effectively to specifically detect SARS-CoV-2 mutations and variants, while shotgun sequencing should be oriented towards the epidemiological monitoring of respiratory viruses in general (SARS-CoV-2, influenza, RSV, etc.). It should be noted that these techniques made it possible to retrospectively detect mutations of the Alfa variant in samples from the metropolitan area of A Coruña at the beginning of December, a month before that variant was detected in clinical samples, demonstrating the great potential of genome analysis of SARS-CoV-2 in wastewater for early epidemiological detection of variants. Once the methodology was fine-tuned and contrasted, it was decided to implement amplicon sequencing as a routine mutation tracking method. The genetic material was extracted and sequenced from samples obtained weekly. Data were analysed for surveillance mutations recommended by ECDC (European Center for Disease Prevention and Control), guidelines updated on March 11, 2022 (https://www.ecdc.europa.eu/en/covid-19/variants-concern).

In the period May 2021 – March 2020, the SARS-CoV-2 sequencing work in wastewater carried out by COVIDBENS enabled reporting on the evolution in the presence of mutations and variants in the metropolitan area of A Coruña on a weekly basis. The data obtained through sequencing and analysis of mutations and variants of the virus can be viewed at the link http://www.edarbens.es/covid19.

The statistical methods presented in the second section were used to estimate weekly the proportions of SARS-CoV-2 variants in the metropolitan area of A Coruña. For facilitating visual interpretation, the estimates of the proportions along time were smoothed with a local polynomial regression estimator. The smoothing parameters were selected using plug-in methods (see Loader, 1999).

Figure 1 contains the smoothed estimates of the SARS-CoV-2 variant proportions along time in the period May 2021 – March 2022. The decrease of the Alpha variant (B.1.1.7) is shown at the beginning of the time period under study. The irruption of the Delta variant (B.1.617.2), its subsequent predominance and final vanishing are observed during this period. In the time interval December 2021 – January 2022, the Omicron variant (B.1.1.529) appeared and abruptly increased, which was parallel to a sudden decrease of the Delta variant. The BA.2 Omicron subvariant also exhibits a sudden increase in February 2022.

Figure 1:
  • Download figure
  • Open in new tab
Figure 1:

Smooth estimation of the SARS-CoV-2 variant proportions along time in the metropolitan area of A Coruña in the period May 2021 – March 2020.

Data Availability

Data can be provided upon request.

References

  1. ↵
    Ahmed, W., Angel, N., Edson, J., Bibby, K., Bivins, A., O’Brien, J.W., Choi, P.M., Kitajima, M., Simpson, S.L., Li, J., et al., 2020. First confirmed detection of SARS-CoV-2 in untreated wastewater in Australia: a proof of concept for the wastewater surveillance of COVID-19 in the community. Sci. Total Environ. 728, 138764.
    OpenUrlCrossRefPubMed
  2. ↵
    Barbeito, I., Cao, R., Ladra, S., López de Ullibarri, I., Posada, D., Poza, M., Tarrío, J., Vaamonde, M., Vallejo, J.A., Freire, B., Gallego, P., Iglesias, I., Rumbo, S., Tomás, L., Trigo, N., Alvariño, P., Beade, E., de Chiara, L., Estévez, N., 2022. Wastewater-based epidemiological modelling of SARS-CoV-2 viral load and monitorization of genomic variants in urban metropolitan areas. 40th Annual Meeting of the Spanish Society for Epidemiology.
  3. ↵
    Bar-Or, I., Weil, M., Indenbaum, V., Bucris, E., Bar-Ilan, D., Elul, M., Levi, N., Aguvaev, I., Cohen, Z., Shirazi, R., Erster, O., Sela-Brown, A., Sofer, D., Mor, O., Mendelson, E., Zuckerman, N.S., 2021. Detection of SARS-CoV-2 variants by genomic analysis of wastewater samples in Israel. Sci. Total Environ., 789, 148002.
    OpenUrlCrossRefPubMed
  4. ↵
    Cevik, M., Tate, M., Lloyd, O., Maraolo, A.E., Schafers, J., Ho, A., 2021. SARS–CoV–2, SARS–CoV, and MERS–CoV viral load dynamics, duration of viral shedding, and infectiousness: a systematic review and meta-analysis. Lancet Microbe 2 (1), 13–22.
    OpenUrl
  5. ↵
    Chen, Y., Chen, L., Deng, Q., Zhang, G., Wu, K., Ni, L., Yang, Y., Liu, B., Wang, W., Wei, C., et al., 2020. The presence of SARS-CoV-2 RNA in the feces of COVID-19 patients. J. Med. Virol. 92 (7), 833–840.
    OpenUrlCrossRefPubMed
  6. ↵
    Emcdda, E.B., 2020. Wastewater Analysis and Drugs: A EuropeanMulti-city Study. European Monitoring Center for Drugs and Drug Addiction, pp. 1–14.
  7. ↵
    Gafurov, A., Baláž, A., Amman, F., Boršová, K., Čabanová, V., Klempa, B., Bergthaler, A., Vinař, T., Brejová, B., 2022. VirPool: model-based estimation of SARS-CoV-2 variant proportions in wastewater samples. BMC Bioinformatics, 19, 23 (1), 551.
    OpenUrl
  8. ↵
    Gupta, S., Parker, J., Smits, S., Underwood, J., Dolwani, S., 2020. Persistent viral shedding of SARS-CoV-2 in faeces–a rapid review. Color. Dis. 22 (6), 611–620.
    OpenUrl
  9. ↵
    Karthikeyan, S., Levy, J.I, De Hoff, P., Humphrey, G., Birmingham, A., Jepsen, K., Farmer, S., Tubb, H.M., Valles, T., Tribelhorn, C.E., Tsai, R., Aigner, S., Sathe, S., Moshiri, N., Henson, B., Mark, A.M., Hakim, A., Baer, N.A., Barber, T., Belda-Ferre, P., Chacón, M., Cheung, W., Cresini, E.S., Eisner, E.R., Lastrella, A.L., Lawrence, E.S., Marotz, C.A., Ngo, T.T., Ostrander, T., Plascencia, A., Salido, R.A., Seaver, Ph., Smoot, E.W., McDonald, D., Neuhard, R.M., Scioscia, A.L., Satterlund, A.M., Simmons, E.H., Abelman, D.B., Brenner, D., Bruner, J.C., Buckley, A., Ellison, M., Gattas, J., Gonias, S.L, Hale, M., Hawkins, F., Ikeda, L., Jhaveri, H., Johnson, T., Kellen, V., Kremer, B., Matthews, G., McLawhon, R.W, Ouillet, P., Park, D., Pradenas, A., Reed, S., Riggs, L., Sanders, A., Sollenberger, B., Song, A., White, B., Winbush, T., Aceves, C.M., Anderson, C., Gangavarapu, K., Hufbauer, E., Kurzban, E., Lee, J., Matteson N.L., Parker, E., Perkins, S.A., Ramesh, K.S., Robles-Sikisaka, R., Schwab, M.A., Spencer, E., Wohl, S., Nicholson, L., McHardy, I.H., Dimmock, D.P., Hobbs, C.A., Bakhtar, O., Harding, A., Mendoza, A., Bolze, A., Becker, D., Cirulli, E.T., Isaksson, M., Barrett, K.M.S., Washington, N.L., Malone, J.D., Schafer, A.M., Gurfield, N., Stous, S., Fielding-Miller, R., Garfein, R.S., Gaines, T., Anderson, C., Martin, N.K., Schooley, R., Austin, B., MacCannell, D.R., Kingsmore, S.F., Lee, W., Shah, S., McDonald, E., Yu, A.T., Zeller, M., Fisch, K.M., Longhurst, C., Maysent, P., Pride, D., Khosla, P.K., Laurent, L.C., Yeo, G.W., Andersen, K.G., Knight, R., 2022. Wastewater sequencing reveals early cryptic SARS-CoV-2 variant transmission. Nature, 609 (7925), 101–108.
    OpenUrl
  10. ↵
    Loader, C.R., 1999. Bandwidth selection: classical or plug-in?. Ann. Stat., 27 (2), 415–438.
    OpenUrl
  11. ↵
    Lodder, W., de Roda Husman, A.M., 2020. SARS-CoV-2 in wastewater: potential health risk, but also data source. Lancet Gastroenterol. Hepatol. 5 (6), 533–534.
    OpenUrl
  12. ↵
    Medema, G., Heijnen, L., Elsinga, G., Italiaander, R., Brouwer, A., 2020. Presence of SARSCoronavirus-2 RNA in sewage and correlation with reported COVID-19 prevalence in the early stage of the epidemic in the Netherlands. Environ. Sci. Technol. Lett. 7 (7), 511–516.
    OpenUrl
  13. ↵
    Miura, F., Kitajima, M., Omori, R., 2021. Duration of SARS–CoV–2 viral shedding in faeces as a parameter for wastewater-based epidemiology: re-analysis of patient data using a shedding dynamics model. Sci. Total Environ. 769, 144549.
    OpenUrl
  14. ↵
    Peccia, J., Zulli, A., Brackney, D.E., Grubaugh, N.D., Kaplan, E.H., Casanovas-Massana, A., Ko, A.I., Malik, A.A., Wang, D., Wang, M., et al., 2020. Measurement of SARS-CoV-2 RNA in wastewater tracks community infection dynamics. Nat. Biotechnol. 38 (10), 1164–1167.
    OpenUrlPubMed
  15. ↵
    Pollán, M., Pérez-Gómez, B., Pastor-Barriuso, R., Oteo, J., Hernán, M.A., Pérez-Olmeda, M., Sanmartín, J.L., Fernández-García, A., Cruz, I., de Larrea, N.F., et al., 2020. Prevalence of SARS–CoV–2 in Spain (ENE–COVID): a nationwide, population–based seroepidemiological study. Lancet 396 (10250), 535–544.
    OpenUrlCrossRefPubMed
  16. ↵
    Radu, E., Masseron, A., Amman, F., Schedl, A., Agerer, B., Endler, L., Penz, T., Bock, C., Bergthaler, A., Vierheilig, J., Hufnagl, P., Korschineck, I., Krampe, J., Kreuzinger, N., 2022. Emergence of SARS-CoV-2 Alpha lineage and its correlation with quantitative wastewater-based epidemiology data. Water Research 215, 118257.
    OpenUrl
  17. Randazzo, W., Cuevas-Ferrando, E., Sanjuán, R., Domingo-Calap, P., Sánchez, G., 2020a. Metropolitan wastewater analysis for COVID-19 epidemiological surveillance. Int. J. Hyg. Environ. Health 230, 113621.
    OpenUrlCrossRefPubMed
  18. ↵
    Trigo-Tasende, N. Vallejo, J.A., Rumbo-Feal, S., Conde-Pérez, K. Vaamonde, M. López-Oriona, A., Barbeito, I., Nasser-Ali, M., Reif, R., Rodiño-Janeiro, B.K., Fernández-Álvarez, E., Iglesias-Corrás, I., Freire, B. Tarrío-Saavedra, J., Tomás, L., Gallego-García, P., Posada, D., Bou, G., López-de-Ullibarri, I., Cao, R., Ladra, S., Poza, M., 2022. COVIDBENS: a multidisciplinary surveillance program for SARS-CoV-2 in wastewater in A Coruña, Spain. Submitted for possible publication.
  19. ↵
    Valieris, R., Drummond, R.D., Defelicibus, A., Dias-Neto, E., Rosales, R.A., Tojal da Silva, I., 2022. A mixture model for determining SARS-Cov-2 variant composition in pooled samples. Bioinformatics. 38 (7), 1809–1815.
    OpenUrl
  20. ↵
    Vallejo, J.A., Trigo-Tasende, N., Rumbo-Feal, S., Conde-Pérez, K., López-Oriona, Á., Barbeito, I., Vaamonde, M., Tarrío-Saavedra, J. Reif, R., Ladra, S., Rodiño-Janeiro, B.K., Nasser-Alia, M., Cid, Á., Veiga, M.C., Acevedo, A., Lamora, C., Bou, G., Cao, R., Poza, M. 2022. Modeling the number of people infected with SARS-COV-2 from wastewater viral load in Northwest Spain. Science of the Total Environment, 811, 152334.
    OpenUrl
  21. Wu, F., Zhang, J., Xiao, A., Gu, X., Lee, W.L., Armas, F., Kauffman, K., Hanage, W., Matus, M., Ghaeli, N., et al., 2020a. SARS-CoV-2 titers in wastewater are higher than expected from clinically confirmed cases. Msystems 5 (4).
  22. ↵
    Wurtzer, S., Marechal, V., Mouchel, J.M., Maday, Y., Teyssou, R., Richard, E., Almayrac, J.L., Moulin, L., 2020. Evaluation of lockdown impact on SARS-CoV-2 dynamics through viral genome quantification in Paris wastewaters. MedRxiv https://doi.org/10.1101/2020.04.12.20062679.
  23. ↵
    Xing, Y.H., Ni, W., Wu, Q., Li, W.J., Li, G.J., Wang, W.D., Tong, J.N., Song, X.F.,Wong, G.W.K., Xing, Q.S., 2020. Prolonged viral shedding in feces of pediatric patients with coronavirus disease 2019. J. Microbiol. Immunol. Infect. 53 (3), 473–480.
    OpenUrlPubMed
  24. ↵
    Xu, Y., Li, X., Zhu, B., Liang, H., Fang, C., Gong, Y., Guo, Q., Sun, X., Zhao, D., Shen, J., et al., 2020. Characteristics of pediatric SARS-CoV-2 infection and potential evidence for persistent fecal viral shedding. Nat. Med. 26 (4), 502–505.
    OpenUrlCrossRefPubMed
  25. ↵
    Zhang, T., Cui, X., Zhao, X., Wang, J., Zheng, J., Zheng, G., Guo, W., Cai, C., He, S., Xu, Y., 2020. Detectable SARS-CoV-2 viral RNA in feces of three children during recovery period of COVID-19 pneumonia. J. Med. Virol. 92 (7), 909–914.
    OpenUrlPubMed
Back to top
PreviousNext
Posted January 14, 2023.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
SARS-CoV-2 VARIANT PREVALENCE ESTIMATION USING WASTEWATER SAMPLES
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
SARS-CoV-2 VARIANT PREVALENCE ESTIMATION USING WASTEWATER SAMPLES
I. López-de-Ullibarri, L. Tomás, N. Trigo-Tasende, B. Freire, M. Vaamonde, P. Gallego-García, I. Barbeito, J.A. Vallejo, J. Tarrío-Saavedra, P. Alvariño, E. Beade, N. Estévez, S. Rumbo-Feal, K. Conde-Pérez, L. de Chiara, I. Iglesias-Corrás, M. Poza, S. Ladra, D. Posada, R. Cao
medRxiv 2023.01.13.23284507; doi: https://doi.org/10.1101/2023.01.13.23284507
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
SARS-CoV-2 VARIANT PREVALENCE ESTIMATION USING WASTEWATER SAMPLES
I. López-de-Ullibarri, L. Tomás, N. Trigo-Tasende, B. Freire, M. Vaamonde, P. Gallego-García, I. Barbeito, J.A. Vallejo, J. Tarrío-Saavedra, P. Alvariño, E. Beade, N. Estévez, S. Rumbo-Feal, K. Conde-Pérez, L. de Chiara, I. Iglesias-Corrás, M. Poza, S. Ladra, D. Posada, R. Cao
medRxiv 2023.01.13.23284507; doi: https://doi.org/10.1101/2023.01.13.23284507

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)