Identification of host-pathogen-disease relationships using a scalable Multiplex Serology platform in UK Biobank ================================================================================================================ * Alexander J Mentzer * Nicole Brenner * Naomi Allen * Thomas J Littlejohns * Amanda Y Chong * Adrian Cortes * Rachael Almond * Michael Hill * Simon Sheard * Gil McVean * UKB Infection Advisory Board * Rory Collins * Adrian VS Hill * Tim Waterboer ## Abstract **Background** Certain infectious agents are recognised causes of cancer and potentially other chronic diseases. Identifying associations and understanding pathological mechanisms involving infectious agents and subsequent chronic disease risk will be possible through measuring exposure to multiple infectious agents in large-scale prospective cohorts such as UK Biobank. **Methods** Following expert consensus we designed a Multiplex Serology platform capable of simultaneously measuring quantitative antibody responses against 45 antigens from 20 infectious agents implicated in non-communicable diseases, including human herpes, hepatitis, polyoma, papilloma, and retroviruses, as well as *Chlamydia trachomatis, Helicobacter pylori* and *Toxoplasma gondii*. This panel was assayed in a random subset of UK Biobank participants (n=9,695) to test associations between infectious agents and recognised demographic and genetic risk factors and disease outcomes. **Findings** Seroprevalence estimates for each infectious agent were consistent with those expected from the literature. The data confirmed epidemiological associations of infectious agent antibody responses with sociodemographic characteristics (e.g. lifetime sexual partners with *C, trachomatis*; *P*=1·8×10−149), genetic variants (e.g. rs6927022 with Epstein-Barr virus (EBV) EBNA1 antibodies, *P*=9·5×10−91) and disease outcomes including human papillomavirus-16 seropositivity and cervical intraepithelial neoplasia (odds ratio 2·28, 95% confidence interval 1·38-3·63), and quantitative EBV viral capsid antigen responses and multiple sclerosis through genetic correlation (MHC rG=0·30, *P=*0·01). **Interpretation** This dataset, intended as a pilot study to demonstrate applicability of Multiplex Serology in epidemiological studies, is itself one of the largest studies to date covering diverse infectious agents in a prospective UK cohort including those traditionally under-represented in population cohorts such as human immunodeficiency virus-1 and *C. trachomatis*. Our results emphasise the validity of our Multiplex Serology approach in large-scale epidemiological studies opening up opportunities for improving our understanding of host-pathogen-disease relationships. These data are available to researchers interested in examining the relationship between infectious agents and human health. ## Introduction Some bacterial, viral and parasitic infectious agents are well established causal factors for certain types of cancer1 or have been linked to the development of inflammatory diseases such as multiple sclerosis2, and cardiovascular disease3,4. However, much of this evidence is derived from cross-sectional or case-control analyses that are unable to determine temporality, or from small nested case-control studies that have generally been small or have focused on a small number of infectious agents, leading to inconsistent findings5. UK Biobank (UKB) is a large prospective cohort study that has collected a large amount of genetic, lifestyle, and biomarker data alongside a wide range of health outcomes for over 500,000 adults in the United Kingdom6. The depth and breadth of data available in UKB alongside repeated biological sampling in random subsets every few years offer unique opportunities to explore the inter-relationships between risk factors and disease outcomes in very large numbers of individuals. The availability of serological data characterising lifetime exposure history for multiple infectious agents would allow the retesting of putative and identification of novel associations between infectious agents and multiple disease outcomes whilst accounting for temporal exposure and confounders. Moreover, since many infectious agents such as Epstein-Barr virus (EBV), cytomegalovirus (CMV) and human immunodeficiency virus (HIV-1) are known to have modulatory effects on the immune system, it is equally important to consider their exposure history when testing other non-infectious exposure and disease risk associations. Understanding the role of infectious agents in the development of non-communicable diseases (NCDs) could have major implications for guiding public health decisions such as targeted vaccination strategies for primary prevention7. Here, we describe the results from the first phase of an intended project to measure antibody responses against 45 antigens from 20 infectious agents using a validated Multiplex Serology panel for all 500,000 UKB participants. Using a randomly selected subset of 9,695 individuals, we demonstrate the applicability of the assay platform by confirming expected seroprevalence estimates, and reproducing previously reported epidemiological and genetic associations with infectious agent exposure, highlighting the significant value of these data in understanding the role of infectious agents in the development of NCDs. ## Methods ### Infectious agent selection A UK Biobank Infectious Disease Working Group was established to provide a consensus list of infectious agents deemed to be of significant importance to public health (**Supplementary Methods**), and to determine the most appropriate methodology for their measurement in a large-scale prospective study (**Supplementary Table 1**). The final Multiplex Serology panel measured antibody responses against 45 antigens spanning 20 infectious agents (**Supplementary Table 2**) including herpes simplex viruses 1 and 2 (HSV-1 and −2); varicella zoster virus (VZV); EBV; CMV; human herpesviruses 6A, 6B and 7 (HHV-6A, HHV-6B and HHV-7); Kaposi’s sarcoma-associated herpesvirus (KSHV); HIV-1; human T-lymphotropic virus-1 (HTLV-1); hepatitis B (HBV) and hepatitis C (HCV) viruses; human papillomaviruses 16 (HPV-16) and 18 (HPV-18); the JC (JCV), BK (BKV) and Merkel cell (MCV) polyomaviruses; *Helicobacter pylori* (*Hp*); *Chlamydia trachomatis* (*Ct*); and *Toxoplasma gondii* (*Tg*). View this table: [Table 1:](http://medrxiv.org/content/early/2019/08/29/19004960/T1) Table 1: Baseline characteristics of the 9,695 randomly selected individuals with Multiplex Serology data compared with the remainder of the UKB cohort. View this table: [Table 2:](http://medrxiv.org/content/early/2019/08/29/19004960/T2) Table 2: Unadjusted seroprevalence estimates and 95% confidence intervals for infectious agents tested using the Multiplex Serology platform in all 9,695 individuals, overall and stratified by sex. Estimates in men and women highlighted in bold are significantly different from each other (i.e. they do not have overlapping confidence intervals). Ranges of previously published seroprevalence estimates representing expectations are provided as a reference. ### UK Biobank study population The UKB recruitment process has been described previously8. Briefly, half a million men and women aged 40-69 years attended one of 22 UKB assessment centres located throughout England, Scotland and Wales between 2006 and 2010. All participants completed a touchscreen questionnaire, verbal interview and had a range of physical measurements and blood, urine and saliva samples taken for long-term storage. A subset of 20,000 individuals attended a repeat assessment between 2012 and 2013. For this study, serum samples from 9,695 UKB participants were selected at random and assayed using the final Multiplex Serology panel. For 277 (2·9%) of these participants, an additional sample from the repeat assessment was assayed to assess stability of seroconversion over a 4-5 year period (**Supplementary Methods**). ### Infectious agent assay validation and quality control Our assay was designed using the principles of an enzyme linked immunosorbent assay (ELISA) that measures levels of serum immunoglobulin G (IgG), which is a stable biological marker of lifetime exposure for a given infectious agent. Unlike ELISAs for individual infectious agents, we implemented a bead-based immunoassay protocol that allows simultaneous measurement of antibodies against multiple agent-specific antigens in the same reaction vessel hence reducing cost and workload. As opposed to many established standard ELISAs, which are predominantly based on inactivated multi-antigenic components of the target infectious agent, our assay is based on measurement of antibody responses against pre-defined recombinantly expressed agent-specific antigens creating opportunities for agent lifecycle-specific antigen characterisation. For each infectious agent tested, we measured antibody responses for up to six antigens. Where two or more antigens were used, we defined algorithms to combine the responses against multiple antigens and designate overall seropositivity or seronegativity against individual infectious agents (**Supplementary Methods**). The workflow for validation was in line with the criteria outlined in the STARD guidelines9 (**Supplementary Figure 1**). The description and validation of the assays for the individual infectious agents described here have been detailed elsewhere (**Supplementary Methods and Supplementary Table 2**)10-17. In brief, Multiplex Serology is a bead-based glutathione *S*-transferase (GST) capture assay incorporating glutathione-casein coated fluorescence-labelled polystyrene beads and pathogen-specific GST-X-tag fusion proteins as antigens17,18. Full length or fragment viral, bacterial or parasitic antigens were expressed in *E. coli* fused to an N-terminal GST domain and a C-terminal peptide tag for detection of full-length expression and stored as cleared lysate10-12,14,16,19. Each antigen was loaded onto one distinct bead set via *in situ* affinity purification of GST fusion proteins on glutathione-derivatised beads. Subsequently, the bead sets (i.e. antigens) were mixed and simultaneously presented to primary serum antibodies (at serum dilution 1:1000). Formed immunocomplexes were detected using a biotinylated goat-α-human IgG secondary antibody and quantified in a Luminex 200 flow cytometer via streptavidin-R-phycoerythrin as reporter dye. Per bead set, at least 100 beads were measured and Median Fluorescence Intensities (MFI) were calculated. ![Figure 1:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2019/08/29/19004960/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2019/08/29/19004960/F1) Figure 1: Crude seroprevalence estimates and 95% confidence intervals for multiple infectious agents in 9,695 UK Biobank participants stratified by age (A for females and B for males), self-reported ethnicity (C for females and D for males) and LSP (E for females and F for males). Individual infectious agent immunoassays were validated by comparing seropositivity and seronegativity estimates from the Multiplex Serology platform against gold standard assays using collections of anonymised, unlinked samples available as reference panels10-12,14,16. These panels were also used to ensure that the performance of the agent-specific Multiplex Serology assays conducted on non-magnetic beads were equivalent when combined in the final Multiplex Serology panel tested on magnetic beads (**Supplementary Methods, Supplementary Figure 2 and Supplementary Table 3)**. Magnetic beads are required for full scalability of the Multiplex Serology assay. View this table: [Table 3:](http://medrxiv.org/content/early/2019/08/29/19004960/T3) Table 3: Crude seroprevalence and adjusted odds ratios for HBV, HIV-1 and *Ct* in men reporting same sex intercourse in males. Odds ratios adjusted for age interval, ethnicity, TDI and number of reported lifetime sexual partners. ![Figure 2:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2019/08/29/19004960/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2019/08/29/19004960/F2) Figure 2: Association between HPV-16 L1 antigen seropositivity and risk of cervical cancer and cervical intraepithelial neoplasia in 9,695 UK Biobank participants. A) Unadjusted odds ratios of cervical cancer and cervical intraepithelial neoplasia (CIN) by HPV-16 L1 seropositivity are shown with 95% confidence intervals (*P*-values calculated with Fisher’s exact test). B) Odds ratio of CIN risk after adjustment for age, sex, ethnicity, LSP and TDI (*P*-value calculated using multivariable regression analysis). The serum samples were assayed in two equally sized batches, each assayed over a one week period, including blind-spiked duplicates for 107 (1·1%) individuals to quantify within- and between-batch variation (**Supplementary Table 4 and Supplementary Figure 3**). Consistency of serostatus between baseline and repeat assessment samples were tested for 277 individuals to estimate seroconversion and seroreversion rates (**Supplementary Methods and Supplementary Table 5**). Additionally, three reference control sera with known reactivity patterns for the infectious agent panel were tested on each plate to verify uniform plate handling **(Supplementary Methods)**. View this table: [Table 4:](http://medrxiv.org/content/early/2019/08/29/19004960/T4) Table 4: Replication of genetic variants previously associated with exposure to specific infectious agent antigens. ![Figure 3:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2019/08/29/19004960/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2019/08/29/19004960/F3) Figure 3: EBV EBNA1 and VCAp18 antibody responses and associations with multiple sclerosis. Log10 transformed antibody levels against EBNA1 (A) and VCAp18 (B) antigens were compared between groups in the UKB Multiplex Serology subset with or without self-reported diagnoses of multiple sclerosis (34 cases). **: *P*<0·01. C) The beta coefficients from the GWAS analyses of quantitative antibody responses against EBNA1 (blue) and VCAp18 (red) in the UKB subset (y-axis) were compared to the coefficients from the largest available case-control GWAS of MS (x-axis) for HLA alleles recognised to be associated with MS risk. The points are shaped by imputed HLA allele. ### Statistical analysis Statistical measures (sensitivity, specificity, Cohen’s *kappa*) for Multiplex Serology assays using reference panel samples10-12,14,16 including 95% confidence intervals (CI) were calculated using SAS 9·4. During Multiplex Serology validation **(Supplementary Figure 1)** we defined cut-off values distinguishing positive versus negative antibody responses against individual antigens based on previously published analyses10-12,14,16,20-22. These antigen-specific seropositivity results were subsequently combined using published algorithms to define overall seropositivity for infectious agents (as discussed in more detail on the UK Biobank Data Showcase: [https://biobank.ctsu.ox.ac.uk/showcase/refer.cgi?id=1348](https://biobank.ctsu.ox.ac.uk/showcase/refer.cgi?id=1348)). The seroprevalence estimates for each infectious agent were compared to those expected for European populations based on previously published data, determined through a PubMed search using the terms ‘seroprevalence’, ‘prevalence’, ‘IgG’ or ‘antibody’ in addition to the abbreviated or full name of the agent (**Supplementary Methods**). Analyses were performed that tested associations between seropositivity (for individual antigens) or overall infectious agent seropositivity (i.e. seroprevalence) as well as log10 transformed quantitative MFI antibody responses (i.e., seroreactivity) with environmental, demographic and genetic exposures and disease outcomes. The variables used were defined based on existing knowledge of such associations and are described in the **Supplementary Methods**. The variables included sex, age (by decade), ethnicity, Townsend deprivation index (TDI) quintiles, lifetime number of sexual partners (LSP in categories of ‘0’, ‘1’, ‘2-4’, ‘5-10’, ‘Greater than 10’) and self-report of ever same-sex intercourse (sameSI) using a univariate regression model tested against the null (outcome ∼ 1) using the likelihood ratio test (R version 3·5·1). A chi-squared test for trend was used to test for linear upward or downward trends of seroprevalence in the ordered quintiles of TDI. For those variables demonstrating any evidence of association (*P*<0·01), multivariable logistic regression was used to test for association whilst adjusting for other covariates with or without stratification by sex. Similarly, associations of seropositivity with disease prevalence was performed using either univariate logistic regression or, where sufficient power was present, multivariable regression accounting for relevant covariates. Disease outcomes included cervical cancer, multiple sclerosis (MS) and coeliac disease, selected owing to their known (cervical cancer with HPV-16), potential (MS with EBV and CMV exposure) and unlikely (coeliac) association with infection exposure. Disease outcomes were obtained through either self-report, or cancer registry data (details of which are provided in the **Supplementary Methods)**. ### Genetic analysis Details of the generation of the genotype data have been described elsewhere23 and further details are provided in the **Supplementary Methods**. To compare association statistics against genetic association signals for multiple sclerosis, genotypes and case-control phenotypes from the International Multiple Sclerosis dataset and GWAS analysis were used24. Human leukocyte antigen (HLA) alleles were imputed using genotype data available from across the extended MHC region only (chromosome 6 base pair positions 25,500,000-34,000,000 in genome build 37) using the SNP2HLA algorithm. Imputation and subsequent association analyses were performed in PLINK 1·9 using logistic regression including sex and age as fixed effect covariates, then statistics from individual countries were meta-analysed using a fixed effects model in METASOFT. Association and bivariate GREML analyses of genetic data were undertaken using the GCTA software (version 1·26·0) for directly genotyped variants. This uses a linear mixed model to control for cryptic relatedness and to allow inclusion of all individuals irrespective of ethnic origin through the calculation of a genetic relatedness matrix (GRM) incorporated as a random effect covariate. BOLT-LMM was an equivalent software used for imputed data. Age and sex were included as fixed effect covariates in all models. ### Data availability All data are available through the UKB Access Management System ([https://bbams.ndph.ox.ac.uk/ams/](https://bbams.ndph.ox.ac.uk/ams/)). ### Role of the funding source The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Data analysis and interpretation was performed by the authors independently of any funding source. All authors had full access to the study data and had the final responsibility for the decision to submit the paper for publication. ## Results ### UKB Sample Overview The baseline characteristics of the 9,695 UKB individuals included in the Multiplex Serology study compared with the rest of the cohort with serum available for future analyses are shown in **Table 1**. Of these individuals, 9,611 also had genetic data available for analyses following quality control. Seroprevalence estimates for each infectious agent, overall and in men and women separately, are provided in **Table 2**. For all infectious agents, seroprevalence estimates were within ranges consistent with those previously reported in European or North American populations. ### Associations between demographic factors and infectious agent seropositivity We observed differences in seroprevalence estimates between males and females for 11 infectious agents, with 5 found to be higher in women (EBV, HHV-7, HPV-16, HPV-18 and *Ct*) and 6 higher in men (VZV, HBV, HTLV-1, JCV and BKV and *Hp*). For all of the agents where data from multiple antigens were used to calculate total infectious agent seroprevalences, equivalent differences were observed in the same directions for the individual antigens (**Supplementary Table 6**). The crude differences observed by sex for these 11 infectious agents persisted following adjustment for age, TDI, self-reported ethnicity, LSP and sameSI (**Supplementary Table 7**). We also observed significant differences in seroprevalence by age (**Figures 1A and 1B** and **Supplementary Table 8**), being strongest for HSV-1, CMV and *Tg*. Age was inversely associated with HPV-16, JCV and BKV. For example, BKV seroprevalence was 97·0% (95% CI 96·1-97·9%) in men aged 40-49 years, and 93·0% (91·8-94·1%) in 60-69 year olds. We observed significant differences in seroprevalences of multiple infectious agents by self-reported ethnicity (**Figures 1C and 1D** and **Supplementary Table 9**). For example, there was a higher CMV seroprevalence in Asian (93·6%; 89·6-97·7%) compared to White individuals (56·5%; 55·5-57·5%) and a higher seroprevalence of HBV in Black (23·4%; 16·4-30·4%) compared to White individuals (1·7%; 1·4-1·9%). We also found a higher seroprevalence of several infectious agents known to be transmitted through sexual or close physical contact with increasing LSP (HSV-1, HSV-2, EBV, HPV-16 and HPV-18 and *Ct*; **Figures 1E and 1F** and **Supplementary Table 10**). We did not observe significant associations between LSP and rare sexually transmitted infections such as HIV-1, HBV, HCV or HTLV-1. We did however, observe statistically significant associations in men reporting same sex intercourse for two of these infections (HBV, HIV-1) as well as *Ct* (**Table 3**). We observed an association between higher socio-economic deprivation and many infectious agents including HSV-1, HSV-2, EBV, CMV, KSHV, HBV, *Tg, Hp* and *Ct* (**Supplementary Table 11**). This was particularly striking for *Hp* in women where 25·7% (22·2-29·2%) of women in the least deprived group had evidence of exposure compared to 42·8% (38·6-47·0%) in the most deprived group (*P**trend*=4·3×10−11). Similarly, 19·7% (17·4-22·1%) of women in the first quintile had evidence of exposure to *Ct* compared to 34·8% (31·9-37·7%) of women in the fifth quintile (*P**trend*=2·5×10−16). The same trends were also observed for males. However, with the exception of *Hp*, none of the associations remained significant after adjusting for age, ethnic group and LSP. ### Genetic associations with infectious agent antibody responses We replicated three previously reported associations between human genetic variants and antibody responses against infectious agents (**Supplementary Table 12**). All three associations are with variants in the class II region of the MHC locus in the human genome. Two variants (rs9269910 and rs9269268) were found to be associated with seropositivity for JCV (*P*=6·9 x 10−22) and MCV (*P*=1·4×10−38) respectively, and another variant (rs6927022) was associated with antibody levels against EBV EBNA1 antigen (*P*=9·5×10−91; **Table 4**). Furthermore, we discovered a novel, statistically significant signal of association between MHC variants (with the most significant association with the rs7197 variant) and antibody response against the EBV viral capsid antigen (VCAp18; beta 0·04; 95% CI 0·03-0·05; *P*=1·7×10−22, **Supplementary Figure 5**). ### Associations between infectious agent exposure and health outcomes We replicated the well-established association between HPV-16 L1 seropositivity (considered a marker for cumulative exposure) and the risk of cervical intraepithelial neoplasia (CIN; based on 131 cases; OR=2·65; 95% CI 1·51-4·66, *P=*0·001) and cervical cancer (18 cases, OR=4·37; 95% CI 1·26-15·15, *P*=0·04). The association with CIN remained significant after adjustment for age, ethnicity, LSP and TDI (OR=2·02; 95% CI 1·06-3·05; *P*=0·03; **Figure 2**). We found that all self-reported cases of MS disease in the UKB subset (n=34) were seropositive for EBV infection as determined using either our validated algorithm incorporating all four EBV antigens, or using VCAp18 seropositivity alone **(Figure 3A)** which remained significant after adjustment for age, sex, ethnic group and TDI (OR=5·3; 1·55-18·39; *P*=7·8×10−3). A similar, non-significant pattern of association was observed for VCAp18 antibody levels **(Figure 3B)**. We also observed an inverse association with CMV seroprevalence and risk of MS that persisted after adjustment for the same covariates (OR=0·39; 0·18-0·81; *P*=0·01). ### Genetic Correlation Analysis Using imputed HLA allele information we found that several alleles reported to be associated with MS disease risk had similar statistics of association compared to the alleles associated with antibody responses against the EBV antigens EBNA1 and VCAp18 **(Figure 3C)**. To determine the extent of association between EBV antigen response traits and MS risk we calculated the genetic correlation between quantitative VCAp18 and EBNA1 antibody response associations and MS case-control status using the largest available independent case-control analysis of MS. We found statistically significant evidence of a positive genetic correlation between MS risk and VCAp18 antibody response using variants across the MHC region alone (rG=0·30, *P*=0·01) and a similar level of correlation using variants across the entire genome (rG=0·21, *P*=0·02). We did not observe genetic correlation between EBNA1 antibody levels and MS (rG=0·09, *P*=0·41). Using data on coeliac disease (1,468 cases and 10,000 randomly selected controls) within UKB as a negative control we found no evidence of genetic correlation between VCAp18 antibody responses and coeliac disease when tested across the genome or across the MHC (rG = −0·09, *P* = 0·49). ## Discussion Seven viruses and one bacterial species are established causes of cancer. These and many other species have been hypothesised to contribute to a range of NCDs including cardiovascular and inflammatory conditions. Confirming and understanding the relationships underlying such associations could have significant implications for public health and facilitate the discovery of novel therapeutics. Here we present data from a Multiplex Serology platform applied to a subset of a very large prospective cohort study. Our findings replicate many established relationships, highlighting the validity of our data, and through these results alongside novel associations we demonstrate the potential of the dataset for addressing multiple questions relating to the epidemiology of infectious agents and biology underlying subsequent disease. Our seroprevalence estimates were consistent with previously published estimates in the UK and Europe. Some of our seroprevalence estimates were on the higher limit of expectations. For example, the *Ct* seroprevalence in women was approximately 25% in our study, and thus significantly higher than in other studies using the microimmunofluorescence (MIF) assay or major outer membrane protein (MOMP) peptide ELISA16,25. However, our seroprevalence estimate was consistent with recent data from England based on the same antigen (pGP3) in ELISA assays26. This highly immunogenic antigen is believed to be the most species-specific marker of *Ct* that may provide a more accurate estimate of Ct exposure. We have previously demonstrated high agreement of our Multiplex Serology pGP3 assay and the published pGP3 ELISA methods16. Our observations of demographic associations with infectious agent seroprevalence, such as those between LSP and sexually transmitted infections like *Ct* or HPV-16 further enhance the reliability of the seroprevalence estimates derived from our Multiplex Serology panel. Although we did not observe similarly significant associations between LSP and other infections known to be sexually transmitted (HBV, HCV, HIV-1 and HTLV-1), these infections are rare and the existing analysis may be underpowered to detect such associations. Instead, the comprehensive data availability from UKB participants including sensitive aspects of lifestyle meant that we could observe alternative relationships with sexual behaviour reports such as sameSI with HBV, HIV-1 and *Ct* serostatus, thus increasing confidence even in those infections that are rare within UKB. Many of the demographic associations we observed, such as those with sex, age or deprivation status, have been reported previously but are poorly understood; and such associations may occur as a result of confounding. UKB offers the opportunity to clarify such complex inter-relationships. For example, we were able to demonstrate clear associations with age and CMV27, HSV-128 and *Tg**29* that are likely to be independent and not confounded by other reported risk factors. Such infectious agents have repeatedly been shown to be transmitted through behaviours in adulthood including close physical and sexual contact, and animal exposure or diet as in the case of *Tg*. The associations of CMV, HBV and *Tg* we observed with ethnicity were particularly striking. Although these associations remained highly significant after adjustment for age, sex, LSP and TDI, it is likely that these relationships occur as a combination of environmental and genetic factors in addition to the increased likelihood of infectious agent exposure in other countries where participants of alternative ethnicities may have lived when younger. In contrast, although we observed significant associations between multiple infectious agents and deprivation, these associations were no longer significant after adjustment for age, ethnicity and LSP. Such findings could have significant implications for public health interventions for efforts to improve health equality across strata of society. The replication of a series of genetic associations with both seropositivity and quantitative antibody responses further increases confidence in the validity of the dataset. Furthermore, the discovery of a new signal of association with antibody responses against EBV VCAp18 highlights the potential of this dataset for novel discovery of genetic variant associations. Associations of pathogen-specific antibodies with the MHC locus are mechanistically highly plausible as variation in *HLA* genes may modulate an individual’s immune response to an infection influencing susceptibility for and the ability to clear infections, and the magnitude of antibody response. Such genetic associations offer opportunities to improve our understanding of biological mechanisms when interpreted alone. In addition, they can also be used to understand relationships between infections and other NCDs through testing for the presence of shared heritability. This is particularly relevant given the current availability of infectious agent data for only a subset of the existing UKB. Although we had reasonable power within the UKB subset to observe the well-recognised, causative association between HPV-16 seroprevalence and CIN and cervical cancer30, testing for associations with small to moderate effect sizes is difficult owing to the limited sample size. Nevertheless, when testing a hypothetical association that has long been speculated to be causal, we did still observe differences in the magnitude and seroprevalence of EBV antibody responses in patients with self-reported MS compared to those without MS in directions consistent with previous reports5. By comparing genetic association signals derived from the antibody response in the UKB subset against signals generated from an independently recruited case-control analysis of MS, we observed a significant degree of genetic correlation between EBV VCAp18 antibody responses and MS risk. Our estimation with an rG between 0·2 and 0·3 both across the MHC and the entire genome is in line with previous reports and has been interpreted as evidence in favour of a causal relationship between EBV and MS31,32. However, since the majority of this correlation will be driven by the MHC region well known to demonstrate significant levels of pleiotropy (i.e. a single genetic variant may be associated with multiple traits) it is not possible to definitively exclude other confounders that may better explain this relationship. These confounders may indeed include other infectious agents with similar associations across the MHC. Reassuringly we observed no association with coeliac disease that was included as a negative control. Although we have demonstrated a range of potential utilities for the data generated from the Multiplex Serology panel, it is important to recognise the limitations of our approach. Multiplex Serology is an epidemiological screening tool designed to detect cumulative (i.e. past or present) exposure and thus may be less sensitive or specific than assays used in clinical diagnosis. Moreover, the multiplex nature of the assay does not allow for the optimisation of assay conditions for each and every infectious agent as demonstrated by the variation in validation statistics (e.g. as seen for *Tg*). Nevertheless, even for *Tg*, we were still able to observe robust epidemiological associations consistent with the literature, reinforcing the high quality of the data for epidemiological analyses. Irrespective of all of the above, the availability of additional biomaterials (such as saliva and buffy coats) from baseline and follow-up sampling in the UKB participants offers significant future opportunities. For example, it may be possible to undertake complementary assays (such as nucleotide-based detection systems) to add data about acute infections in individuals identified as seropositive for either prior exposure or carriage using data from the Multiplex Serology panel. We show that our Multiplex Serology panel offers an attractive validated approach to identify novel and confirm previously reported associations between infectious agents and a range of demographic factors and genetic variants that may influence the risk of chronic disease. In time, with further cohort maturation and investigation by the scientific community, these data are likely to have significant impact on our understanding of the burden of infectious agents and their sequelae, which will in turn inform future public health strategies. ## Research in context ### Evidence before this study Multiple epidemiological studies have estimated the burden of prior exposure or chronic carriage with a variety of infectious agents. By searching PubMed using criteria as listed in the **Supplementary Methods** we found examples of such cross-sectional studies including the National Health and Nutrition Examination Survey (NHANES), the Detroit Neighbourhood Health Study, the Multi-Ethnic Study of Atherosclerosis and Add Health in the USA, and national surveillance systems implemented in various countries including the UK. Such studies have led to an increased appreciation of disease risk across communities and populations, and in some cases have contributed data in support of infection-disease associations. However, cross-sectional studies cannot determine directionality of such infection-disease associations and are therefore difficult to interpret. Seroepidemiology data from large, broadly characterised prospective cohort studies such as UK Biobank (UKB) and the European Prospective Investigation into Cancer and Nutrition (EPIC) offer opportunities to investigate the temporality of associations between past exposure to infectious agents and subsequent disease risk. Despite the value of such cohort study designs, large-scale serology testing has rarely been undertaken owing to financial and logistical barriers and a tendency to undertake nested case-control analyses to answer specific questions. Furthermore, very few studies have considered infections that are either very rare or very common in populations owing to consequences in terms of statistical power. Some infections, such as HIV-1, have additional ethical barriers to surveillance and detection that confer a risk of selection bias dependent on the populations amenable to study. ### Added value of this study This study demonstrates the utility of a scalable Multiplex Serology platform to measure antibodies against 45 antigens from 20 infectious agents in a random subset of 9,695 UKB participants. Our seroprevalence estimates fall within expected UK age-sex distributions for all tested infectious agents. The associations observed between seroprevalence and sex, ethnicity and sexual behaviour for multiple infectious agents not only serve to validate the utility of the panel but should also guide public health initiatives to reduce the burden of infections, and thus their immediate symptoms and subsequent disease sequelae. The associations with human genetic variation similarly offer reassurances in terms of cross-cohort reproducibility, and provides an opportunity to discover novel genetic associations, such as that observed between human leukocyte antigen complex variation and EBV VCAp18 antigen responses that offer biological insights into disease pathogenesis research and prevention. If, as planned, data on infectious agents were available for all 500,000 UK Biobank individuals, this resource would represent a unique opportunity to understand how exposure to a range of infectious agents influences subsequent disease risk and how host genetic factors influence these relationships. ### Implications of all the available evidence The work presented here is the first phase of infectious agent exposure characterisation in UKB and has generated comprehensive data that is now publically available. It represents an exemplar of the types of analyses possible using the wealth of data collected from all UKB participants. The seroprevalence estimates offer opportunities to understand differences by demographics highlighting, for example, the importance of sexual behaviour in disease risk, whilst explaining differences by deprivation status through combinations of age, ethnicity and sexual behaviour. Furthermore, the availability of quantitative seroresponses, as highlighted by the correlation between EBV antigen responses and MS, offers opportunities to estimate the magnitude of disease risk. These investigations will help inform public health initiatives, and generate novel hypotheses for future mechanistic studies. ## Data Availability All data are available through the UKB Access Management System ([https://bbams.ndph.ox.ac.uk/ams/](https://bbams.ndph.ox.ac.uk/ams/)) [https://biobank.ctsu.ox.ac.uk/showcase/refer.cgi?id=1348](https://biobank.ctsu.ox.ac.uk/showcase/refer.cgi?id=1348) ## Declaration of interests All authors declare no conflicts of interests. ## Author Contributions AJM, NA, RA, MH, SS, GM, the UKB Infection Advisory Board, RC, AVSH and TW conceptualised and designed the methods for the research project; AJM, NB, TJL, AYC, AC, and RA performed the analyses; AJM, NB, NA, TJL, RA, MH, SS, RC, and TW were responsible for generation of resources and curation of data; AJM, NA, GM, RC, AVSH, and TW were responsible for funding acquisition and supervision, AJM, NB, NA, TJL, RC, AVSH, and TW were responsible for writing the manuscript and all co-authors reviewed the manuscript. ## Acknowledgements We thank the UKB participants and all individuals involved in recruitment, data and sample collection, data curation and release. The authors would like to thank Drs James Gilchrist, Tom Parks and Kathryn Auckland, Mr Andres Kaufmann and Professor Andres Moreno-Estrada for advice and assistance during data analysis. AJM was supported by a Wellcome Trust Fellowship with reference 106289/Z/14/Z and the National Institute for Health Research (NIHR) Oxford Biomedical Research Centre (BRC). The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health. AYC was supported by the Research Councils UK Newton Fund Award with reference MR/N028937/1. * Received August 16, 2019. * Revision received August 16, 2019. * Accepted August 29, 2019. * © 2019, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.Bouvard V, Baan R, Straif K, et al. A review of human carcinogens-Part B: biological agents. Lancet Oncology 2009; 10(4): 321–2. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1470-2045(09)70096-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19350698&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000264812300014&link_type=ISI) 2. 2.Giovannoni G, Cutter GR, Lunemann J, et al. Infectious causes of multiple sclerosis. The Lancet Neurology 2006; 5(10): 887–94. 3. 3.Gkrania-Klotsas E, Langenberg C, Sharp SJ, Luben R, Khaw KT, Wareham NJ. Higher immunoglobulin G antibody levels against cytomegalovirus are associated with incident ischemic heart disease in the population-based EPIC-Norfolk cohort. The Journal of infectious diseases 2012; 206(12): 1897–903. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/infdis/jis620&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23045624&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 4. 4.Zhu J, Nieto FJ, Horne BD, Anderson JL, Muhlestein JB, Epstein SE. Prospective study of pathogen burden and risk of myocardial infarction or death. Circulation 2001; 103(1): 45–51. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjg6IjEwMy8xLzQ1IjtzOjQ6ImF0b20iO3M6Mzk6Ii9tZWRyeGl2L2Vhcmx5LzIwMTkvMDgvMjkvMTkwMDQ5NjAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 5. 5.Almohmeed YH, Avenell A, Aucott L, Vickers MA. Systematic review and meta- analysis of the sero-epidemiological association between Epstein Barr virus and multiple sclerosis. PloS one 2013; 8(4): e61110. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0061110&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23585874&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 6. 6.Littlejohns TJ, Sudlow C, Allen NE, Collins R. UK Biobank: opportunities for cardiovascular research. Eur Heart J 2019; 40(14): 1158–66. 7. 7.Metcalf CJ, Farrar J, Cutts FT, et al. Use of serological surveys to generate key insights into the changing global landscape of infectious disease. Lancet 2016; 388(10045): 728–30. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(16)30164-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27059886&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 8. 8.Sudlow C, Gallacher J, Allen N, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS medicine 2015; 12(3): e1001779. 9. 9.Bossuyt PM, Reitsma JB, Bruns DE, et al. STARD 2015: an updated list of essential items for reporting diagnostic accuracy studies. Bmj 2015; 351: h5527. [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE3OiIzNTEvb2N0MjhfMS9oNTUyNyI7czo0OiJhdG9tIjtzOjM5OiIvbWVkcnhpdi9lYXJseS8yMDE5LzA4LzI5LzE5MDA0OTYwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 10. 10.Brenner N, Mentzer AJ, Butt J, et al. Validation of Multiplex Serology for human hepatitis viruses B and C, human T-lymphotropic virus 1 and Toxoplasma gondii. PloS one 2019; 14(1): e0210407. 11. 11.Brenner N, Mentzer AJ, Butt J, et al. Validation of Multiplex Serology detecting human herpesviruses 1-5. PloS one 2018; 13(12): e0209379. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0209379&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30589867&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 12. 12.Dondog B, Schnitzler P, Michael KM, et al. Hepatitis C Virus Seroprevalence in Mongolian Women Assessed by a Novel Multiplex Antibody Detection Assay. Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology 2015; 24(9): 1360–5. 13. 13.Gossai A, Waterboer T, Nelson HH, et al. Prospective Study of Human Polyomaviruses and Risk of Cutaneous Squamous Cell Carcinoma in the United States. Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology 2016; 25(5): 736–44. 14. 14.Kranz LM, Gartner B, Michel A, Pawlita M, Waterboer T, Brenner N. Development and validation of HIV-1 Multiplex Serology. Journal of immunological methods 2019; 466: 47–51. 15. 15.Michel A, Waterboer T, Kist M, Pawlita M. Helicobacter pylori multiplex serology. Helicobacter 2009; 14(6): 525–35. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1523-5378.2009.00723.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19889070&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 16. 16.Trabert B, Waterboer T, Idahl A, et al. Antibodies Against Chlamydia trachomatis and Ovarian Cancer Risk in Two Independent Populations. Journal of the National Cancer Institute 2018: pii: 5001107. 17. 17.Waterboer T, Sehr P, Michael KM, et al. Multiplex human papillomavirus serology based on in situ-purified glutathione s-transferase fusion proteins. Clinical chemistry 2005; 51(10): 1845–53. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiY2xpbmNoZW0iO3M6NToicmVzaWQiO3M6MTA6IjUxLzEwLzE4NDUiO3M6NDoiYXRvbSI7czozOToiL21lZHJ4aXYvZWFybHkvMjAxOS8wOC8yOS8xOTAwNDk2MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 18. 18.Waterboer T, Sehr P, Pawlita M. Suppression of non-specific binding in serological Luminex assays. Journal of immunological methods 2006; 309(1-2): 200–4. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jim.2005.11.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16406059&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000235637600021&link_type=ISI) 19. 19.Sehr P, Zumbach K, Pawlita M. A generic capture ELISA for recombinant proteins fused to glutathione S-transferase: validation for HPV serology. Journal of immunological methods 2001; 253(1-2): 153–62. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0022-1759(01)00376-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11384677&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000169258800014&link_type=ISI) 20. 20.Carter JJ, Paulson KG, Wipf GC, et al. Association of Merkel cell polyomavirus- specific antibodies with Merkel cell carcinoma. Journal of the National Cancer Institute 2009; 101(21): 1510–22. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/jnci/djp332&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19776382&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000271575100011&link_type=ISI) 21. 21.Michael KM, Waterboer T, Sehr P, et al. Seroprevalence of 34 human papillomavirus types in the German general population. PLoS pathogens 2008; 4(6): e1000091. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.ppat.1000091&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18566657&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 22. 22.Migchelsen SJ, Martin DL, Southisombath K, et al. Defining Seropositivity Thresholds for Use in Trachoma Elimination Studies. PLoS Negl Trop Dis 2017; 11(1): e0005230. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pntd.0005230&link_type=DOI) 23. 23.Bycroft C, Freeman C, Petkova D, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 2018; 562(7726): 203–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-018-0579-z&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30305743&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 24. 24.he International Multiple Sclerosis Genetics C. Class II HLA interactions modulate genetic risk for multiple sclerosis. Nat Genet 2015; 47(10): 1107–13. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3395&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26343388&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 25. 25.Hulstein SH, Matser A, Alberts CJ, et al. Differences in Chlamydia trachomatis seroprevalence between ethnic groups cannot be fully explained by socioeconomic status, sexual healthcare seeking behavior or sexual risk behavior: a cross-sectional analysis in the HEalthy LIfe in an Urban Setting (HELIUS) study. BMC infectious diseases 2018; 18(1): 612. 26. 26.Woodhall SC, Wills GS, Horner PJ, et al. Chlamydia trachomatis Pgp3 Antibody Population Seroprevalence before and during an Era of Widespread Opportunistic Chlamydia Screening in England (1994-2012). PloS one 2017; 12(1): e0152810. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0152810&link_type=DOI) 27. 27.van Boven M, van de Kassteele J, Korndewal MJ, et al. Infectious reactivation of cytomegalovirus explaining age- and sex-specific patterns of seroprevalence. PLoS Comput Biol 2017; 13(9): e1005719. 28. 28.Smith JS, Robinson NJ. Age-specific prevalence of infection with herpes simplex virus types 2 and 1: a global review. The Journal of infectious diseases 2002; 186 Suppl 1: S3-28. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/343739&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12353183&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000178305700002&link_type=ISI) 29. 29.Burrells A, Opsteegh M, Pollock KG, et al. The prevalence and genotypic analysis of Toxoplasma gondii from individuals in Scotland, 2006-2012. Parasites & vectors 2016; 9(1): 324. 30. 30. Human papilomaviruses. Lyon: International Agency for Research on Cancer; 2007. 31. 31.Olsson T, Barcellos LF, Alfredsson L. Interactions between genetic, lifestyle and environmental risk factors for multiple sclerosis. Nat Rev Neurol 2017; 13(1): 25–36. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nrneurol.2016.187&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27934854&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 32. 32.Zhou Y, Zhu G, Charlesworth JC, et al. Genetic loci for Epstein-Barr virus nuclear antigen-1 are associated with risk of multiple sclerosis. Mult Scler 2016; 22(13): 1655–64. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/1352458515626598&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26819262&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 33. 33.Looker KJ, Magaret AS, May MT, et al. Global and Regional Estimates of Prevalent and Incident Herpes Simplex Virus Type 1 Infections in 2012. PloS one 2015; 10(10): e0140765. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0140765&link_type=DOI) 34. 34.Pebody RG, Andrews N, Brown D, et al. The seroepidemiology of herpes simplex virus type 1 and 2 in Europe. Sexually transmitted infections 2004; 80(3): 185–91. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoic2V4dHJhbnMiO3M6NToicmVzaWQiO3M6ODoiODAvMy8xODUiO3M6NDoiYXRvbSI7czozOToiL21lZHJ4aXYvZWFybHkvMjAxOS8wOC8yOS8xOTAwNDk2MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 35. 35.Aebi C, Fischer K, Gorgievski M, Matter L, Muhlemann K. Age-specific seroprevalence to varicella-zoster virus: study in Swiss children and analysis of European data. Vaccine 2001; 19(23-24): 3097–103. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0264-410X(01)00035-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11312004&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 36. 36.Nardone A, de Ory F, Carton M, et al. The comparative sero-epidemiology of varicella zoster virus in 11 countries in the European region. Vaccine 2007; 25(45): 7866–72. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.vaccine.2007.07.036&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17919788&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000251103600019&link_type=ISI) 37. 37.Pembrey L, Raynor P, Griffiths P, Chaytor S, Wright J, Hall AJ. Seroprevalence of cytomegalovirus, Epstein Barr virus and varicella zoster virus among pregnant women in Bradford: a cohort study. PloS one 2013; 8(11): e81881. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0081881&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24312372&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 38. 38.Dowd JB, Palermo T, Brite J, McDade TW, Aiello A. Seroprevalence of Epstein-Barr virus infection in U.S. children ages 6-19, 2003-2010. PloS one 2013; 8(5): e64921. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0064921&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23717674&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 39. 39.Morris MC, Edmunds WJ, Hesketh LM, et al. Sero-epidemiological patterns of Epstein-Barr and herpes simplex (HSV-1 and HSV-2) viruses in England and Wales. Journal of medical virology 2002; 67(4): 522–7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/jmv.10132&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12115998&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000176589900009&link_type=ISI) 40. 40.Bate SL, Dollard SC, Cannon MJ. Cytomegalovirus seroprevalence in the United States: the national health and nutrition examination surveys, 1988-2004. Clinical infectious diseases : an official publication of the Infectious Diseases Society of America 2010; 50(11): 1439–47. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/652438&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20426575&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000277194800002&link_type=ISI) 41. 41.Cannon MJ, Schmid DS, Hyde TB. Review of cytomegalovirus seroprevalence and demographic characteristics associated with infection. Reviews in medical virology 2010; 20(4): 202–13. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/rmv.655&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20564615&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 42. 42.Ablashi D, Agut H, Alvarez-Lafuente R, et al. Classification of HHV-6A and HHV- 6B as distinct viruses. Arch Virol 2014; 159(5): 863–70. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00705-013-1902-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24193951&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 43. 43.Arvin AM. Human herpesviruses : biology, therapy, and immunoprophylaxis. Cambridge; New York: Cambridge University Press; 2007. 44. 44.Ablashi DV, Berneman ZN, Kramarsky B, Whitman J, Jr.., Asano Y, Pearson GR. Human herpesvirus-7 (HHV-7): current status. Clin Diagn Virol 1995; 4(1): 1–13. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/0928-0197(95)00005-S&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15566823&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 45. 45.Ward KN. The natural history and laboratory diagnosis of human herpesviruses-6 and −7 infections in the immunocompetent. J Clin Virol 2005; 32(3): 183–93. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jcv.2004.11.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15722023&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000227693500001&link_type=ISI) 46. 46.Rohner E, Wyss N, Heg Z, et al. HIV and human herpesvirus 8 co-infection across the globe: Systematic review and meta-analysis. International journal of cancer Journal international du cancer 2016; 138(1): 45–54. 47. 47.Schatz O, Monini P, Bugarini R, et al. Kaposi’s sarcoma-associated herpesvirus serology in Europe and Uganda: multicentre study with multiple and novel assays. Journal of medical virology 2001; 65(1): 123–32. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/jmv.2011&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11505454&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000170257200019&link_type=ISI) 48. 48.Simpson GR, Schulz TF, Whitby D, et al. Prevalence of Kaposi’s sarcoma associated herpesvirus infection measured by antibodies to recombinant capsid protein and latent immunofluorescence antigen. Lancet 1996; 348(9035): 1133–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(96)07560-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8888167&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1996VP10600011&link_type=ISI) 49. 49.Brown AE, Ross DA, Simpson AJ, et al. Prevalence of markers for HIV, hepatitis B and hepatitis C infection in UK military recruits. Epidemiology and infection 2011; 139(8): 1166–71. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S0950268810002712&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21208514&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 50. 50.Hahne SJ, Veldhuijzen IK, Wiessing L, Lim TA, Salminen M, Laar M. Infection with hepatitis B and C virus in Europe: a systematic review of prevalence and cost-effectiveness of screening. BMC infectious diseases 2013; 13: 181. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2334-13-181&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23597411&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 51. 51.Ott JJ, Stevens GA, Groeger J, Wiersma ST. Global epidemiology of hepatitis B virus infection: new estimates of age-specific HBsAg seroprevalence and endemicity. Vaccine 2012; 30(12): 2212–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.vaccine.2011.12.116&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22273662&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000302662900031&link_type=ISI) 52. 52.Cortina-Borja M, Williams D, Peckham CS, Bailey H, Thorne C. Hepatitis C virus seroprevalence in pregnant women delivering live-born infants in North Thames, England in 2012. Epidemiology and infection 2016; 144(3): 627–34. 53. 53.Petruzziello A, Marigliano S, Loquercio G, Cacciapuoti C. Hepatitis C virus (HCV) genotypes distribution: an epidemiological up-date in Europe. Infect Agent Cancer 2016; 11: 53. 54. 54.England PH. HIV in the UK: 2016 report. London, United Kingdom; 2016. 55. 55.Lattimore S, Yin Z, Logan L, et al. Situation of HIV infections and STIs in the United Kingdom in 2007. Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin 2008; 13(49). 56. 56.Gessain A, Cassar O. Epidemiological Aspects and World Distribution of HTLV-1 Infection. Front Microbiol 2012; 3: 388. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fmicb.2012.00388&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23162541&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 57. 57.Ireland G, Croxford S, Tosswill J, et al. Human T-lymphotropic viruses (HTLV) in England and Wales, 2004 to 2013: testing and diagnoses. Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin 2017; 22(20). 58. 58.Anderson L, O’Rorke M, Jamison J, Wilson R, Gavin A, members HPVWG. Prevalence of human papillomavirus in women attending cervical screening in the UK and Ireland: new data from northern Ireland and a systematic review and meta-analysis. Journal of medical virology 2013; 85(2): 295–308. 59. 59.Scherpenisse M, Mollers M, Schepp RM, et al. Changes in antibody seroprevalence of seven high-risk HPV types between nationwide surveillance studies from 1995-96 and 2006- 07 in The Netherlands. PloS one 2012; 7(11): e48807. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23152809&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 60. 60.Lang Kuhs KA, Anantharaman D, Waterboer T, et al. Human Papillomavirus 16 E6 Antibodies in Individuals without Diagnosed Cancer: A Pooled Analysis. Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology 2015; 24(4): 683–9. 61. 61.Antonsson A, Green AC, Mallitt KA, et al. Prevalence and stability of antibodies to the BK and JC polyomaviruses: a long-term longitudinal study of Australians. J Gen Virol 2010; 91(Pt 7): 1849–53. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1099/vir.0.020115-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20219899&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000279747600025&link_type=ISI) 62. 62.Bozic C, Subramanyam M, Richman S, Plavina T, Zhang A, Ticho B. Anti-JC virus (JCV) antibody prevalence in the JCV Epidemiology in MS (JEMS) trial. Eur J Neurol 2014; 21(2): 299–304. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/ene.12304.&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24894998&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 63. 63.Hammer C, Begemann M, McLaren PJ, et al. Amino Acid Variation in HLA Class II Proteins Is a Major Determinant of Humoral Response to Common Viruses. Am J Hum Genet 2015; 97(5): 738–43. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2015.09.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26456283&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 64. 64.Rollison DE, Giuliano AR, Messina JL, et al. Case-control study of Merkel cell polyomavirus infection and cutaneous squamous cell carcinoma. Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology 2012; 21(1): 74–81. 65. 65.Schadendorf D, Lebbe C, Zur Hausen A, et al. Merkel cell carcinoma: Epidemiology, prognosis, therapy and unmet medical needs. European journal of cancer 2017; 71: 53–69. 66. 66.Jackson MH, Hutchison WM, Siim JC. A seroepidemiological survey of toxoplasmosis in Scotland and England. Ann Trop Med Parasitol 1987; 81(4): 359–65. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=3446026&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom) 67. 67.Hooi JKY, Lai WY, Ng WK, et al. Global Prevalence of Helicobacter pylori Infection: Systematic Review and Meta-Analysis. Gastroenterology 2017; 153(2): 420–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1053/j.gastro.2017.04.022&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2019%2F08%2F29%2F19004960.atom)