Rapid evaluation of COVID-19 vaccine effectiveness against VOC/VOIs by genetic mismatch ======================================================================================= * Lirong Cao * Jingzhi Lou * Hong Zheng * Shi Zhao * Chris Ka Pun Mok * Renee Wan Yi Chan * Marc Ka Chun Chong * Zigui Chen * Eliza Lai Yi Wong * Paul Kay Sheung Chan * Benny Chung-Ying Zee * Eng Kiong Yeoh * Maggie Haitian Wang ## Abstract Timely evaluation of the protective effects of COVID-19 vaccines is challenging but urgently needed to inform the pandemic control planning. Based on vaccine efficacy/effectiveness (VE) data of 11 vaccine products and 297,055 SARS-CoV-2 sequences collected in 20 regions, we analyzed the relationship between genetic mismatch of circulating viruses against the vaccine strain and VE. Variations from technology platforms are controlled by a mixed-effects model. We found that the genetic mismatch measured on the RBD is highly predictive for vaccine protection and accounted for 72.0% (*p*-value < 0.01) of the VE change. The NTD and S protein also demonstrate significant but weaker per amino acid substitution association with VE (*p*-values < 0.01). The model is applied to predict vaccine protection of existing vaccines against new genetic variants and is validated by independent cohort studies. The estimated VE against the delta variant is 79.3% (95% prediction interval: 67.0 – 92.1) using the mRNA platform, and an independent survey reported a close match of 83.0%; against the beta variant (B.1.351) the predicted VE is 53.8% (95% prediction interval: 39.9 – 67.4) using the viral-vector vaccines, and an observational study reported a close match of 48.0%. Genetic mismatch provides an accurate prediction for vaccine protection and offers a rapid evaluation method against novel variants to facilitate vaccine deployment and public health responses. Keywords * COVID-19 * prediction * vaccine effectiveness * vaccine efficacy * genetic mismatch * sequence analysis ## Main Vaccination is a crucial measure to control the transmission scale and mitigate the impact of COVID-19 infections. To date, 19 vaccines against SARS-CoV-2 are in early use or have been fully approved for application in mass population1. However, protective effect of the various vaccine products is under the challenge of new genetic variants. Vaccine efficacy or effectiveness (VE) against COVID-19, which measures the relative reduction of risk a disease outcome in clinical trials or mass population, exhibited a wide range of variation from 10.4% to 97.2%2-5. A number of reasons contribute to the variation in VE that makes it difficult to directly interpret and inform the protective effect of vaccines. The notable factors include the technology platforms, the target population, differences in study protocols, background risk of COVID-19 and time of study. The various vaccine technology strategies generated non-identical immune correlates of protection for SARS-CoV-2 infection6. For instance, the LNP-mRNA vaccine (Moderna) induces S-specific IgG, high TH1 cell responses and low TH2 cell responses7,8, while the inactivated virus strategy (Sinovac) generates S, RBD and N-specific IgG, without obvious T cell responses9,10. Among all the influencing factors, emerging genetic variants relative to the vaccine strain play a critical role in affecting vaccine effectiveness. Many investigations showed that neutralizing activity in plasma or sera of vaccinated individuals against variants containing E484K and N501Y mutations decreased significantly11-13. Viral structure studies demonstrated that these amino acid substitutions on the S protein may alter virus-host cell interactions and reshape antigenic surfaces of the major neutralizing sites, thus leading to immune evasion14,15. While the mechanisms of immune escape caused by the new mutations are continuously being elucidated in experimental studies, an integrative framework to quantify the effect of genetic mismatch on VE would be instrumental for efficient evaluation of vaccine protection for any country in real-time. The genetic mismatch from vaccine strains due to evolution of the circulating strains occurred in different time periods and locations could provide a relatively compact approach to account for the spatial-temporal confounding factors for VE and facilitate the interpretation of vaccine protective effect. In this study, we drew the connection between genetic mismatch of circulating SARS-CoV-2 viruses and reported COVID-19 VE from population studies. Based on previous bioinformatics approach established for the influenza viruses16,17, we further advanced the VE estimation framework for COVID-19 by controlling the clustered random variation of technology platforms using a mixed-effects model. Through extensive analysis of worldwide VE studies and genetic sequences, we showed that a significant proportion of the change in VE could be explained by the genetic factor and provided an efficient framework to evaluate vaccine protection. ## Results ### VEs and genetic mismatch distributions by vaccine platforms VE and genetic mismatch of the four vaccine platforms are compared in **Figure 1**. Within each vaccine platform, the vaccine effectiveness is generally lower compared to the efficacy outcome (**Figure 1a**); while in terms of genetic mismatch (**Figure 1b, Supplementary Figure S3.1**), the vaccine effectiveness cohort encompasses larger genetic mismatch relative to the vaccine efficacy cohorts. This result indicates that genetic mismatch had increased during the mass vaccination phase compared to the earlier clinical trial periods. Across the technology platforms, vaccine protection (efficacy/effectiveness) shows significant difference (ANOVA *p*-value < 0.001, **Figure 1a**). The mRNA vaccines reported the highest mean VE of 89.2% (95% CI: 86.2 – 92.2, N=18), followed by the protein subunit vaccine 77.9% (range: 49.4 – 96.4, N=3), inactivated vaccine 72.3% (95% CI: 64.3 – 80.3, N=8), and viral-vectored vaccines 66.7% (95% CI: 57.5 – 75.6, N=15). Interestingly, the genetic mismatch of these platforms shows a perfect reverse trend, of which the mRNA vaccines cohorts correspond to the smallest mismatch, and the viral-vector the highest. The genetic mismatch summarizes the deviation of genetic variants with respect to the vaccine strains, accounting for time, location and multiple strain co-circulation, for vaccine evaluation at population level using sequencing data. ![Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/11/26/2021.04.22.21254079/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2021/11/26/2021.04.22.21254079/F1) Figure 1. Comparison of COVID-19 vaccine efficacy and effectiveness (VE) and genetic mismatch across vaccine platforms. Panel (a): distribution of the VE estimates for different platforms. The VE of mRNA vaccines is higher than other vaccines (ANOVA *p*-value < 0.001). Panels (b): distribution of genetic mismatch on RBD for different vaccine technologies. Genetic mismatch is the lowest for mRNA vaccines (ANOVA *p*-value < 0.05). ### Relationship between vaccine protection and genetic mismatch Next, we explored the statistical relationship between vaccine protection and genetic mismatch. Using a mixed-effects model, at most 72.0% of the variations in VE can be explained by the genetic mismatch measure, controlling for the random effect of technology platforms (**Figure 2, Supplementary Table S3.2**). Among the candidate genomic regions, genetic mismatch on the RBD region demonstrated the strongest influence on vaccine protection. For every residue substitution on the RBD, the VE would reduce by an average of 7.2% (95% CI: 3.8 – 10.7, *p*-value < 0.001); the reduction of VE due to one mutation on the NTD and S-protein are 5.4% (95% CI: 2.8 – 7.9) and 1.6% (95% CI: 0.6 – 2.6), respectively (**Supplementary Table S3.2**), while mismatch on ORF1ab, ORF3a, ORF8 and N proteins show no association to VE (**Supplementary Figures S3.3-3.4**). When no genetic mismatch is present, VE is the highest for the mRNA vaccine of an expected level of 94.4% (95% CI: 91.2 – 97.7), estimated by the RBD region; and the inactivated and viral-vector vaccines show a systematically lower VE by 16% and 18.6% relative to the mRNA vaccines. ![Figure 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/11/26/2021.04.22.21254079/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2021/11/26/2021.04.22.21254079/F2) Figure 2. The relationship between VE and genetic mismatch of the circulating SARS-CoV-2 strains to the vaccine strain on S protein. Panels (a-c): negative linear relationships between VE and genetic mismatch for RBD (*p*-value< 0.01, R-sq = 72.0%), NTD (*p*-value <0.001, R-sq = 68.8%), and full-length sequence (*p*-value < 0.01, R-sq= 69.0%), respectively. The dashed line was fitted by all data points. The colored lines were fitted by data points of each platform. The shaded area indicates 95% confidence interval. ### Independent validation and estimating VE against specific genetic variants In **Figure 3a**, the predicted and observed VEs for all independent datasets are summarized. Calibration plot (**Supplementary Figure S3.5**) shows a close matching, and the concordance correlation coefficient reaches a high level of 0.96 (95% CI: 0.88 – 0.99). These validation results demonstrated feasibility of using genetic mismatch to estimate vaccine performance. In Figure 3b, we further predicted VEs of the mRNA, inactivated and viral-vector vaccines for 15 different variants, including VOC and VOI based on the RBD mismatch (**Supplementary Table S1.3**). Among these variants, four of them have observed VE reported while most of the rest variants have not been surveyed for VE. Against the delta variant (B.1.617.2), the estimated VE is 79.3% (95% prediction interval: 67.0 – 92.1), 63.2% (95% prediction interval: 50.5 – 76.1) and 61.5% (95% prediction interval: 48.3 – 73.4) for the mRNA, inactivated, and viral-vector vaccines, respectively (**Figure 3a**). These estimates are supported by two independent epidemiological studies against the delta variant: the mRNA vaccine BNT162b2 and the viral-vector vaccine ChAdOx1 provided 83% and 61% protection, respectively18; and the inactivated vaccine BBV152 conferred 65.2% protection according to19. Furthermore, against the beta (B.1.351) and gamma (P.1) variant, the estimated VE for viral-vector vaccines is 53.8% (95% prediction interval: 39.9 – 67.4) and 54.1% (95% prediction interval: 40.0 – 67.7), respectively. An independent study of the viral-vector ChAdOx1-S vaccine reported a VE of 48.0% against these variants20. ![Figure 3.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/11/26/2021.04.22.21254079/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2021/11/26/2021.04.22.21254079/F3) Figure 3. Prediction of the VE based on the genetic distance. Panel (a): Validation outcome of estimated VE and observed VE in independent datasets. The predicted VEs against VOC are close to outcomes of cohort studies observations with concordance correlation coefficient 0.96 (95% CI: 0.88 – 0.99) (Supplementary Figure S3.5). Panel (b): Estimation of the variant-specific VE for mRNA vaccines (pink bar), inactivated vaccines (green bar) and viral-vector vaccines (blue bar). Panel (c): VEs in California were predicted at weekly intervals for different vaccine platforms. The plot indicates that the VE is declining at an accelerating speed. The surveyed VE from clinical trials or observational studies during the same period are overlaid on the trend curve as colored rectangles for reference, and only the mRNA and viral-vector platform vaccines are available. The shaded areas are 95% prediction interval. The dashed line marks the 50% efficacy threshold. ### Depicting trend of VE from the genetic mismatch VEs are predicted for the major vaccine platforms in California at weekly intervals (**Figure 3c**). In general, an accelerating decreasing trend of VE in California is depicted from the genetic mismatch. We showed that the model can be conveniently applied to track the continuous change of VE. The observed VEs from clinical trials conducted during the period are overlaid on the prediction outcomes for reference3,21-26. During February and March 2021, the predicted VE is 86.3% (95% prediction interval: 73.8 – 98.2) for the mRNA vaccines, and an independent survey in the US reported 91% protection for the same vaccine platform27. ## Discussion As novel variants of SARS-CoV-2 keep emerging in the ongoing pandemic, rapid assessment of vaccine performance in populations is crucial to inform public health and clinical responses. This study established an efficient computational framework to estimate COVID-19 VE using virus sequencing data. The predicted VEs against the VOCs are close to outcomes of independent cohort studies. The framework has several advantages. First, it enables prediction of VE against novel variants using existing virus surveillance network to derive a rapid estimate, thus it could inform timely hospital resource allocation and preparedness. Second, it provides an integrated measure to facilitate the interpretation of vaccine effects, which takes account of the confounding effect of time and location related to genetic evolution. Third, through mixed-effects modelling, the framework controls for the random effects in technology platforms, providing a consistent and adaptable prediction framework for inclusion of multiple vaccine platforms. Among the candidate genomic regions, the RBD and NTD regions exhibit the strongest statistical association with VE. These findings are also supported by biological evidence. The RBD is the major target for neutralizing antibodies that interfere with viral receptor binding28,29, and the NTD is reported to be the target of 5-20% of S-specific monoclonal antibodies from memory B cells against SARS-CoV-230,31. Recent studies have investigated the use of the neutralization titer as a predictor of vaccine efficacy32, however the neutralizing results against COVID-19 genetic variants showed varying outcomes. The vaccine protection against the B.1.351 variant reduced from 95.0%3 to 75.0%33 by BNT162b2. Due to lack of standardized neutralization assays and different protocols, one neutralization study showed that the titer against the B.1.351 variant is 7.6- and 9-fold lower compared to the early Wuhan-related Victoria variant in the BNT162b2 vaccine serum and ChAdOx1 vaccine serum, respectively12; while another experiment reported a 2.7-fold decrease in neutralization titers against the B.1.351 strain in the BNT162b2-elicited Serum34. The varying neutralization results increase the challenge of inferring vaccine performance solely by neutralization levels. In addition, the association of neutralization with protection across studies showed that neutralizing antibodies might not be deterministic in mediating protection, and the effect of other vaccine-induced immune responses also need to be quantified. This work uses an alternative angle to bridge the link between molecular activities and population level vaccine responses. Further investigations are needed to integrate potential correlates of vaccine protection and improve the existing framework. The global pandemic of COVID-19 and virus evolution have caused regions in the world to encompass diversified virus populations. We explored the possibility of developing region-specific vaccines and how well they would match the circulating virus profiles. We investigated optimal candidate vaccine strains for 13 regions, including the United Kingdom (UK), Germany, South Africa, Russia, India, Hong Kong (HK), Malaysia, Japan, California, New York, Mexico, Peru and Brazil. Based on the genetic mismatch between vaccine strains and observed viruses circulating in the region and period, hierarchical clustering of the regions was performed to show the similarity of vaccine mismatches (**Figure 4**). We found that no single strain can match to the epidemic viruses in all regions during March-April or May-Jun 2021. Particularly, for the new Moderna vaccine mRNA-1273.351 adopting the B.1.351 variant35, the mean genetic discrepancy to local circulating strains is wider compared to either the Wuhan strain or the dominating region strains. This result suggests that updating the vaccine compositions with a single genetic variant might not be sufficient. As manufacturing of region-specific vaccines may not economically feasible, a reconciling strategy might be to provide optimal vaccine candidates for country-clusters that share similar compositions of circulating viruses, or to provide multivalent vaccines. ![Figure 4.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/11/26/2021.04.22.21254079/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2021/11/26/2021.04.22.21254079/F4) Figure 4. Clustering of regions by circulating strains similarities to VOC/VOIs. Panel (a): Genetic mismatch of genetic variants to the local circulating virus during March and April 2021. The best candidate vaccine antigen for a region measured by genetic distance is highlighted by a green box. Rows: target regions; Columns: candidate vaccine antigens (VOC/VOIs). Panel (b): Genetic mismatch during May and June 2021. For example, in Panel (b), the dark blue of B.1.1.7 in Japan means that the average genetic mismatch between the circulating viruses to the B.1.1.7 is lowest compared to using other variants as vaccine strains, suggesting that the B.1.1.7 is the most optimal vaccine antigen in Japan during May-Jun 2021. The figure shows that no single strain can match to the epidemic viruses in all regions, and the solution might be to provide optimal vaccine candidates for country-clusters that share similar compositions of circulating viruses, or to develop multivalent vaccines. This study has several limitations. Although the current model reached good statistical significance, the complexity of the model is restricted by the sample size of the available VE studies. Thus, population characteristics and study design factors that may influence VE cannot be included. Secondly, the waned immunity in host was not accounted for in the current model. Thus, the current estimate only suggests the mean protection level within weeks since vaccination based on the data used for model training, and should be interpreted with caution of potentially optimistic estimates. Further study will be sought to consider penalization of the VE according to the time elapsed since last vaccination, as more longitudinal data of immune correlates are available. Thirdly, bias might occur if sequences in databases disproportionately represented regions with known circulation of a given variant. Enhanced efforts are needed to ensure better geographical representativeness of available SARS-CoV-2 sequences. Despite these limitations, the relationship of genetic mismatch and VE observed in multiple countries showed robust outcomes and were validated by independent data. The framework could further pool VE outcomes by various manufacturers using one additional layer of structured modelling, when enough data is available in the future. To conclude, this work developed a modeling framework integrating data from genetics and epidemiological studies for estimating COVID-19 vaccine effectiveness in a given period and region against a specific variant or for a particular cohort. Rapid assessment of VE before exposure to pathogens can be a useful instrument to inform the vaccine development, distribution and public health responses. ## Methods ### Vaccine Efficacy and Effectiveness Data Vaccine efficacy is the relative proportion of vaccine protection measured in clinical trials, and vaccine effectiveness is the quantity obtained from observational studies. Both quantities are calculated by (1–RR) ×100, where RR is the relative risk of a COVID-19 outcome in the vaccine group compared to the placebo group. VE reports before May 17, 2021 were collected from journal articles, documents of World Health Organization and Food and Drug Administration, and other government reports. Studies that related to human subjects and contained clear investigation time were included. A total number of 44 VEs were obtained for model building, among which 24 reported vaccine efficacy and 20 surveyed vaccine effectiveness. The vaccine efficacy studies contain thirteen Phase III trials, two Phase II trials, and two Phase II/III trials. Inclusion criteria for the vaccine effectiveness studies are: target population is all age group without special conditions; and the primary outcome is symptomatic COVID-19 infections or confirmed infections requiring medical care. We also extracted 14 VEs from subsequent independent research for validation study. The detailed information of VE for model building and validation is available in **Supplementary Table S1.1-1.2**. ### Genetic Sequences In the first part of analysis, relationship between genetic mismatch and VE was modelled. Human SARS-CoV-2 strains with collection dates ranging from April 23, 2020 to May 16, 2021 were retrieved from the global initiative on sharing all influenza data (GISAID) EpiCoV database36. All available sequences that matched to the period and location of the clinical trials or observational studies were downloaded. A total number of 297,055 full-length genome sequences were sampled from 20 geographical regions for model development. For model validation, a total of 331,116 complete SARS-CoV-2 genome sequences were retrieved from the GISAID. The source of all SARS-CoV-2 sequences involved in this study was acknowledged in the **Supplementary Acknowledgement Table**. Strains with duplicated names were removed. Multiple sequence alignment was performed using MAFFT (version 7). The ‘Wuhan-Hu-1’ genome (GenBank ‘[NC\_045512.2](http://medrxiv.org/lookup/external-ref?link\_type=GEN&access\_num=NC_045512.2&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom)’, or GISAID ‘EPI_ISL_402125’) was set as the reference sequence. The variants involved in this study were summarized in **Supplementary Tables S1.3-1.4**. Lineage classification for sequences was referenced from the GISAID. ### Statistical Methods Following the previous framework developed for influenza virus16, let *x* = {*xij*} denote the *i*-th sample from the GISAID database collected for a target population, where *i* =1,…, *n, j*=1,…, *J*; and v = {vj} denote the vaccine strain applied in the target population, where index *j* indicates the *j*-th codon position in the sequence. Denote the amino acids in a given genome region as W= {wk}, where *k* is the index for codon positions contained in the segment, *k* = 1, …, *K*, 0 ≤ *K* ≤ *J*. Suppose the Hamming distance is used as a basic measure of dissimilarity between two sequences, the vaccine genetic mismatch statistic (*d*) calculated for the target population is, ![Formula][1] Thus, the *d* summarized the average amino acids mismatch of circulating strains versus the vaccine strain based on a given genome segment in a target population. In this study, we considered a range of candidate *W*, including the receptor-binding domain (RBD), N-terminal domain (NTD), spike (S), nucleocapsid (N), ORF1ab, ORF3 and ORF8 proteins. A schematic representation of SARS-CoV-2 genome and the structure of the S protein are available in **Supplementary Figures S2.1-2.2**. All vaccine strains are based on the Wuhan strain isolated in January 2020. When the target population is composed of subjects infected with multiple co-circulating variants, the *d* captures the viral diversity in the cohort; while when the target population is composed of subjects infects by a specific genetic variant, the mismatch measures variant-specific distance. In view of the differences in vaccine platforms, a two-level mixed-effects model was adopted to account for the random effect associated with technology platform. We specified the following random-intercept model for a VE outcome (*Y**ij*) of technology *j* and study/trial *i*, ![Formula][2] The fixed intercept parameter *β*** represents the expected value of VE when genetic mismatch is zero, that is, the maximum protection of a vaccine. *β**1* represents the fixed effect of genetic mismatch; and *u**j* denotes the random effect associated with the intercept for platform *j*, which is assumed to follow a normal distribution with zero mean and constant variance ![Graphic][3]. The *εij* denotes the residual of observation from experiment *i* of platform *j*, which follows a normal distribution of zero mean and constant variant *σ**2*. The model was fitted using R package lmerTest37. The protein subunit vaccines were excluded in the mixed-effects model as the sample size for this platform is only three. One VE (10.4%) of viral vector vaccine was considered as an outlier and excluded, which is reported from a secondary analysis against the B.1.351 variant in a small-scale South African trial2. All analyses were performed using **R** statistical software (version 4.0.3). Statistical significance was declared if *p*-value < 0.05. Validation study compared the estimated VE of a given platform by using specific lineage sequences or sequences of circulating viruses in the respective regions and periods with fourteen VE outcomes pulling out from independent observational studies. As an application example, we predicted VE against major variants of concern (VOC) and variants of interest (VOI). We also estimated VE of all existing vaccines at weekly intervals from July 20 2020 to July 19 2021 in California to depict the trend of VE through time. The prediction interval of mixed-effects model was calculated using R package merTools38. ## Supporting information Supplementary Materials [[supplements/254079_file02.docx]](pending:yes) Supplementary Acknowledgement Table [[supplements/254079_file03.pdf]](pending:yes) ## Data Availability All data used in this work are publicly available. ## Data availability All data used in this study is publicly available. The detailed information of VE outcomes is available in the supplementary materials. Viral sequence data were downloaded from the global initiative on sharing all influenza data (GISAID) at [http://platform.gisaid.org/](http://platform.gisaid.org/) and the accession numbers were provided in the supplementary acknowledgment table. ## Code availability The code is available upon request from the corresponding author. ## Author Contributions M.H.W conceived the study, L.C and M.H.W wrote the manuscript. L.C and H.Z. collected data. L.C processed data, carried out the analysis and wrote the first draft. J.L., S.Z., C.K.P.M, R.W.Y.C., M.K.C.C., Z.C., E.L.Y.W., P.K.S.C., B.C.Y.Z. and E.K.Y. critically read and revised the manuscript and gave final approval for publication. ## Competing interests M.H.W and B.C.Y.Z are shareholders of Beth Bioinformatics Co., Ltd. B.C.Y.Z is a shareholder of Health View Bioanalytics Ltd. All other authors declare no competing interests. ## Acknowledgements A complete GISAID acknowledgement table could be found in online supplementary materials. We thank the contributions of all the health care workers and scientists, the GISAID team, and the submitting and the originating laboratories. This work was partially supported by the Health and Medical Research Fund, the Food and Health Bureau, the Government of the Hong Kong Special Administrative Region [COVID190103, INF-CUHK-1], the National Natural Science Foundation of China [31871340, 71974165], and the Chinese University of Hong Kong Grant [PIEF/Ph2/COVID/06, 4054600]. We appreciate the constructive comments from reviewers that improved this work. * Received April 22, 2021. * Revision received November 25, 2021. * Accepted November 26, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.COVID-19 VACCINE TRACKER. Vol. 2021. 2. 2.Madhi, S.A., et al. Efficacy of the ChAdOx1 nCoV-19 Covid-19 Vaccine against the B.1.351 Variant. N Engl J Med (2021). 3. 3.Polack, F.P., et al. Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine. N Engl J Med 383, 2603–2615 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa2034577&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 4. 4.Hitchings, M.D.T., et al. Effectiveness of CoronaVac in the setting of high SARS-CoV-2 P.1 variant transmission in Brazil: A test-negative case-control study. medRxiv, 2021.04.07.21255081 (2021). 5. 5.Haas, E.J., et al. Impact and effectiveness of mRNA BNT162b2 vaccine against SARS-CoV-2 infections and COVID-19 cases, hospitalisations, and deaths following a nationwide vaccination campaign in Israel: an observational study using national surveillance data. Lancet 397, 1819–1829 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s0140-6736(21)00947-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 6. 6.Dai, L. & Gao, G.F. Viral targets for vaccines against COVID-19. Nat Rev Immunol 21, 73–82 (2021). 7. 7.Jackson, L.A., et al. An mRNA Vaccine against SARS-CoV-2 - Preliminary Report. N Engl J Med 383, 1920–1931 (2020). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 8. 8.Corbett, K.S., et al. Evaluation of the mRNA-1273 Vaccine against SARS-CoV-2 in Nonhuman Primates. N Engl J Med 383, 1544–1555 (2020). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 9. 9.Zhang, Y., et al. Safety, tolerability, and immunogenicity of an inactivated SARS-CoV-2 vaccine in healthy adults aged 18-59 years: a randomised, double-blind, placebo-controlled, phase 1/2 clinical trial. Lancet Infect Dis 21, 181–192 (2021). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 10. 10.Wang, H., et al. Development of an Inactivated Vaccine Candidate, BBIBP-CorV, with Potent Protection against SARS-CoV-2. Cell 182, 713-721.e9 (2020). 11. 11.Wang, Z., et al. mRNA vaccine-elicited antibodies to SARS-CoV-2 and circulating variants. Nature (2021). 12. 12.Zhou, D., et al. Evidence of escape of SARS-CoV-2 variant B.1.351 from natural and vaccine-induced sera. Cell 184, 2348-2361.e6 (2021). 13. 13.Supasa, P., et al. Reduced neutralization of SARS-CoV-2 B.1.1.7 variant by convalescent and vaccine sera. Cell 184, 2201-2211.e7 (2021). 14. 14.Cai, Y., et al. Structural basis for enhanced infectivity and immune evasion of SARS-CoV-2 variants. Science 373, 642–648 (2021). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNzMvNjU1NS82NDIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8xMS8yNi8yMDIxLjA0LjIyLjIxMjU0MDc5LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 15. 15.Gobeil, S.M., et al. Effect of natural mutations of SARS-CoV-2 on spike structure, conformation, and antigenicity. Science 373(2021). 16. 16.Cao, L., et al. In silico prediction of influenza vaccine effectiveness by sequence analysis. Vaccine (2021). 17. 17.Cao, L.R., et al. Differential Influence of Age on the Relationship between Genetic Mismatch and A(H1N1)pdm09 Vaccine Effectiveness. Viruses-Basel 13(2021). 18. 18.Sheikh, A., McMenamin, J., Taylor, B. & Robertson, C. SARS-CoV-2 Delta VOC in Scotland: demographics, risk of hospital admission, and vaccine effectiveness. Lancet 397, 2461–2462 (2021). 19. 19.Ella, R., et al. Efficacy, safety, and lot to lot immunogenicity of an inactivated SARS-CoV-2 vaccine (BBV152): a, double-blind, randomised, controlled phase 3 trial. medRxiv, 2021.06.30.21259439 (2021). 20. 20.Nasreen, S., et al. Effectiveness of COVID-19 vaccines against variants of concern, Canada. medRxiv, 2021.06.28.21259420 (2021). 21. 21.Corchado-Garcia, J., et al. Real-world effectiveness of Ad26.COV2.S adenoviral vector vaccine for COVID-19. medRxiv, 2021.04.27.21256193 (2021). 22. 22.Andrejko, K., et al. Early evidence of COVID-19 vaccine effectiveness within the general population of California. medRxiv, 2021.04.08.21255135 (2021). 23. 23.Thompson, M.G., et al. Interim Estimates of Vaccine Effectiveness of BNT162b2 and mRNA-1273 COVID-19 Vaccines in Preventing SARS-CoV-2 Infection Among Health Care Personnel, First Responders, and Other Essential and Frontline Workers - Eight U.S. Locations, December 2020-March 2021. MMWR Morb Mortal Wkly Rep 70, 495–500 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.15585/MMWR.MM7013E3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 24. 24.CDC. Largest CDC COVID-19 Vaccine Effectiveness Study in Health Workers Shows mRNA Vaccines 94% Effective. (2021). 25. 25.Sadoff, J., et al. Safety and Efficacy of Single-Dose Ad26.COV2.S Vaccine against Covid-19. N Engl J Med 384, 2187–2201 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa2101544&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 26. 26.Baden, L.R., et al. Efficacy and Safety of the mRNA-1273 SARS-CoV-2 Vaccine. N Engl J Med 384, 403–416 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa2035389&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 27. 27.Kim, S.S., et al. mRNA Vaccine Effectiveness against COVID-19 among Symptomatic Outpatients Aged ≥16 Years in the United States, February – May 2021. medRxiv, 2021.07.20.21260647 (2021). 28. 28.Ju, B., et al. Human neutralizing antibodies elicited by SARS-CoV-2 infection. Nature 584, 115–119 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1101/2020.03.21.990770&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 29. 29.Piccoli, L., et al. Mapping Neutralizing and Immunodominant Sites on the SARS-CoV-2 Spike Receptor-Binding Domain by Structure-Guided High-Resolution Serology. Cell 183, 1024-1042.e21 (2020). 30. 30.McCallum, M., et al. N-terminal domain antigenic mapping reveals a site of vulnerability for SARS-CoV-2. Cell 184, 2332-2347.e16 (2021). 31. 31.McCallum, M., et al. SARS-CoV-2 immune evasion by the B.1.427/B.1.429 variant of concern. Science 373, 648–654 (2021). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNzMvNjU1NS82NDgiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8xMS8yNi8yMDIxLjA0LjIyLjIxMjU0MDc5LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 32. 32.Khoury, D.S., et al. Neutralizing antibody levels are highly predictive of immune protection from symptomatic SARS-CoV-2 infection. Nat Med 27, 1205–1211 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-021-01377-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 33. 33.Abu-Raddad, L.J., Chemaitelly, H. & Butt, A.A. Effectiveness of the BNT162b2 Covid-19 Vaccine against the B.1.1.7 and B.1.351 Variants. N Engl J Med 385, 187–189 (2021). 34. 34.Liu, Y., et al. Neutralizing Activity of BNT162b2-Elicited Serum. N Engl J Med 384, 1466–1468 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMc2102017&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33684280&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F11%2F26%2F2021.04.22.21254079.atom) 35. 35.Wu, K., et al. Variant SARS-CoV-2 mRNA vaccines confer broad neutralization as primary or booster series in mice. bioRxiv (2021). 36. 36.Shu, Y. & McCauley, J. GISAID: Global initiative on sharing all influenza data - from vision to reality. Euro Surveill 22(2017). 37. 37.Kuznetsova, A., Brockhoff, P.B. & Christensen, R.H. lmerTest package: tests in linear mixed effects models. Journal of statistical software 82, 1–26 (2017). 38. 38.Knowles, J.E., Frederick, C. & Knowles, M.J.E. Package ‘merTools’. (2016). [1]: /embed/graphic-5.gif [2]: /embed/graphic-6.gif [3]: /embed/inline-graphic-1.gif