PT - JOURNAL ARTICLE
AU - Gafurov, Askar
AU - Baláž, Andrej
AU - Amman, Fabian
AU - Boršová, Kristína
AU - Čabanová, Viktória
AU - Klempa, Boris
AU - Bergthaler, Andreas
AU - Vinař, Tomáš
AU - Brejová, Broňa
TI - VirPool: Model-Based Estimation of SARS-CoV-2 Variant Proportions in Wastewater Samples
AID - 10.1101/2022.06.21.22276717
DP - 2022 Jan 01
TA - medRxiv
PG - 2022.06.21.22276717
4099 - http://medrxiv.org/content/early/2022/06/22/2022.06.21.22276717.short
4100 - http://medrxiv.org/content/early/2022/06/22/2022.06.21.22276717.full
AB - Background: The genomes of SARS-CoV-2 are classified into variants, some of which are monitored as variants of concern (e.g. the delta variant B.1.617.2 or omicron variant B.1.1.529). Proportions of these variants in a population are typically estimated by large-scale sequencing of individual patient samples. Sequencing a mixture of SARS-CoV-2 RNA molecules from wastewater provides a cost-effective alternative, but requires methods for estimating variant proportions in a mixed sample. Results: We propose a new method based on a probabilistic model of sequencing reads, capturing sequence diversity present within individual variants, as well as sequencing errors. The algorithm is implemented in an open source Python program called VirPool. We evaluated the accuracy of VirPool on several simulated and real sequencing data sets from both Illumina and nanopore sequencing platforms, including wastewater samples from Austria and France monitoring the onset of alpha and delta variants. Conclusions: VirPool is a versatile tool for wastewater and other mixed-sample analysis that can handle both short- and long-read sequencing data.
Our approach does not require pre-selection of characteristic mutations for variant profiles, can use the entire length of reads rather than only the most informative positions, and can also capture haplotype dependencies within a single read. Availability: VirPool is open-source software available at https://github.com/fmfi-compbio/virpool.
Competing Interest Statement: The authors have declared no competing interest.
Funding Statement: This research was supported by a grant from the Slovak Research and Development Agency APVV-18-0239, by grants from the Slovak grant agency VEGA 1/0463/20 (to BB) and 1/0538/22 (to TV), and by a grant from the Operational Program Integrated Infrastructure ITMS:313011ATL7. The research was also supported by the European Union Horizon 2020 Research and Innovation Staff Exchange programme under the Marie Skłodowska-Curie grant agreement No. 872539 (PANGAIA).
Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: Ethics Committee of Biomedical Research Center of the Slovak Academy of Sciences gave ethical approval for this work (statement no. EK/BmV-02/2020). I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov.
I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes.
All data produced in the present work are contained in the manuscript. https://github.com/fmfi-compbio/virpool
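
The abstract describes estimating variant proportions in a mixed sample from a probabilistic model of reads that accounts for within-variant diversity and sequencing errors. The following is a minimal sketch of that general idea, not VirPool's actual implementation: it assumes each variant is summarized by per-position base frequencies, treats positions within a read as independent (so it does not capture the haplotype dependencies mentioned above), and fits the mixture weights by plain expectation-maximization. All names, positions, frequencies, and the error rate are hypothetical and chosen only for illustration.

```python
import math
import random

BASES = "ACGT"


def read_log_likelihood(read, profile, error_rate):
    """Log-probability of one read under one (hypothetical) variant profile.

    read: list of (position, base) pairs covered by the read.
    profile: dict position -> dict base -> frequency of that base in the variant.
    """
    total = 0.0
    for pos, base in read:
        p = profile[pos].get(base, 0.0)
        # The observed base is either the true base read correctly, or one of
        # the three other bases produced by a sequencing error.
        q = (1.0 - error_rate) * p + (error_rate / 3.0) * (1.0 - p)
        total += math.log(q)
    return total


def estimate_proportions(reads, profiles, error_rate=0.01, iterations=200):
    """Estimate mixture weights of the variants with a basic EM loop."""
    k = len(profiles)
    log_lik = [[read_log_likelihood(r, prof, error_rate) for prof in profiles]
               for r in reads]
    weights = [1.0 / k] * k
    for _ in range(iterations):
        counts = [0.0] * k
        for row in log_lik:
            m = max(row)  # subtract the maximum for numerical stability
            joint = [w * math.exp(l - m) for w, l in zip(weights, row)]
            s = sum(joint)
            for j in range(k):
                counts[j] += joint[j] / s  # E-step: read-to-variant responsibility
        weights = [c / len(reads) for c in counts]  # M-step: average responsibility
    return weights


def simulate_read(profile, error_rate, rng):
    """Draw one read covering every position of the profile (toy simulator)."""
    read = []
    for pos in sorted(profile):
        bases, freqs = zip(*profile[pos].items())
        base = rng.choices(bases, weights=freqs)[0]
        if rng.random() < error_rate:
            base = rng.choice([b for b in BASES if b != base])
        read.append((pos, base))
    return read


def demo(seed=0, n_reads=2000, true_fraction_a=0.3, error_rate=0.01):
    """Simulate a two-variant mixture and return the estimated weights."""
    # Hypothetical profiles: the two variants differ at positions 100 and 300.
    variant_a = {100: {"A": 0.98, "G": 0.02}, 200: {"C": 1.0}, 300: {"T": 0.9, "C": 0.1}}
    variant_b = {100: {"G": 0.99, "A": 0.01}, 200: {"C": 1.0}, 300: {"C": 0.95, "T": 0.05}}
    rng = random.Random(seed)
    reads = []
    for _ in range(n_reads):
        source = variant_a if rng.random() < true_fraction_a else variant_b
        reads.append(simulate_read(source, error_rate, rng))
    return estimate_proportions(reads, [variant_a, variant_b], error_rate)


if __name__ == "__main__":
    print(demo())
```

Running the script prints two estimated weights that sum to one; with the simulated 30/70 mixture the first weight should land near 0.3. Working in log space and subtracting the per-read maximum keeps the responsibilities stable when reads cover many positions.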