Epigenetic Signatures of Asthma: A Comprehensive Study of DNA Methylation and Clinical Markers ============================================================================================== * Austin J. Van Asselt * Jeffrey J. Beck * Brandon N. Johnson * Casey T. Finnicum * Noah Kallsen * Sarah Viet * Patricia Huizenga * Lannie Ligthart * Jouke-Jan Hottenga * René Pool * A.H Maitland-van der Zee * S.J. Vijverberg * Eco de Geus * Dorret I. Boomsma * Erik A. Ehli * Jenny van Dongen ## Abstract **Background** Asthma, a complex respiratory disease, presents with inflammatory symptoms in the lungs, blood, and other tissues. We investigated the relationship between DNA methylation and 35 clinical markers of asthma. The Illumina Infinium EPIC v1 methylation array was used to evaluate 742,442 CpGs in whole blood samples from 319 participants. They were part of the Netherlands Twin Register from families with at least one member suffering from severe asthma. Repeat blood samples were taken after 10 years from 182 of these individuals. Principal component analysis (PCA) on the clinical markers yielded ten principal components (PCs) that explained 92.8% of the total variance. We performed epigenome-wide association studies (EWAS) for each of the ten PCs correcting for familial structure and other covariates. **Results** 221 unique CpGs reached genome-wide significance at timepoint 1 (T1) after Bonferroni correction. PC7 accounted for the majority of associations (204), which correlated with loadings of eosinophil counts and immunoglobulin levels. Enrichment analysis via the EWAS Atlas identified 190 of these CpGs to be previously identified in EWASs of asthma and asthma-related traits. Proximity assessment to previously identified SNPs associated with asthma identified 17 unique SNPs within 1 MB of two of the 221 CpGs. EWAS in 182 individuals with epigenetic data at a second timepoint (T2) identified 49 significant CpGs. EWAS Atlas enrichment analysis indicated that 4 of the 49 were previously associated with asthma or asthma-related traits. Comparing the estimates of all the significant associations identified across the two time points (271 in total) yielded a correlation of 0.81. **Conclusion** We identified 270 unique CpGs that were associated with PC scores generated from 35 clinical markers of asthma, either cross-sectionally or 10 years later. A strong correlation was present between effect sizes at the 2 timepoints. Most associations were identified for PC7, which captured blood eosinophil counts and immunoglobulin levels and many of these CpGs have previous associations in earlier studies of asthma and asthma-related traits. The results point to using this robust DNA methylation profile as a new, stable biomarker for asthma. Keywords * Methylation * Asthma * Epigenetics * Epigenome-wide association * Microarrays ## Introduction Asthma affects approximately 262 million individuals worldwide and poses a significant health burden resulting in over 450,000 deaths annually [1, 2]. Often, the challenges associated with treating and diagnosing asthma stem from the complex nature of the disease [3] due to its multi-level heterogeneity with various clinical presentations, treatment responses, and disease trajectories [4]. Clinically, diagnosing asthma is done through the collection of several different clinical measurements and symptoms including a medical history assessment, a symptom and physical assessment, lung function tests, bronchial challenge tests, response to bronchodilators, and allergy testing [5]. These different diagnostic criteria, along with different individual phenotype characteristics, such as age of onset, have led to the identification of subtypes of asthma [6–8] and different endotypes with varying molecular underpinnings of the disease [6]. Several large-scale genomic studies have elucidated associations in multiple immune and regulatory genes for these endotypes, which may form the basis for polygenic scores [9]. Two of the largest genome-wide association studies (GWAS) of asthma were performed by Demenais et al. and Ferreira et al. The GWAS of asthma by Demenais et al. in 2018 included over 140,000 individuals and identified five novel, independent associations and confirmed several others that had previously been identified [10]. The study performed by Ferreira et al. in 2019, included over 300,000 individuals, identified 123 single nucleotide polymorphisms (SNPs) significantly associated with childhood asthma, 56 SNPs associated with adult-onset asthma, and 37 to be associated with both [11]. Collectively, Ferreira et. al. found 28 novel, independent associations between the two tested age groups [11]. Many of the associations that have been identified appear to be strongly linked to immune-related mechanisms [10, 11]. Though significant genetic associations were identified in both instances, the amount of total variance in relation to asthma they explained was limited (3.5-5.1%) [10, 11]. These outcomes reaffirm the notion that asthma, as a disease, is heavily influenced by environmental stimuli, which has led researchers to explore additional fields of study such as epigenetics and, more specifically, DNA methylation [12–14]. DNA methylation, as a molecular mechanism in humans, involves the addition of a methyl group to cytosine nucleotides, often in response to certain environmental queues [15–17]. The addition and removal of DNA methylation, specifically in regulatory regions of DNA, can lead to changes in gene expression and downstream cellular and tissue function [16, 17]. This entanglement with the external environment provides a basis for investigating the association of DNA methylation and asthma. The first studies of asthma and DNA methylation were conducted on candidate genes through techniques such as bisulfite pyrosequencing, which yielded moderate success highlighting CpG sites near *IFNy* as potential mediators of asthma [18, 19]. The candidate gene studies were followed by epigenome-wide association studies (EWAS) of asthma in different age cohorts, asthma subtypes, and tissue types [9], with most studies focusing on childhood, allergic asthma in whole blood and nasal epithelial samples [12, 13]. Studies of childhood asthma in nasal epithelial samples have identified several CpGs in both immune regulatory pathways and pathways of basic cell function [20–22]. Furthermore, a small number have shown to replicate in EWASs of whole blood, indicating some pan-tissue effects, though this effect appears to be limited [9]. Studies in adults found CpGs in genes relating to general inflammation, but the results show more variability in the CpGs that have been identified [23]. Though the majority of these results point towards immune-mediated pathways, the complete biological dynamic between these sites and their influence on asthmatic phenotypes has yet to be investigated [24]. Additionally, little is known regarding which measurable clinical markers associated with asthma may be contributing to these significantly associated CpGs, an area we explore here. More generally, the continued identification of genetic and epigenetic variants influencing asthma is valuable for personalized treatment approaches. Here, we examined the genome-wide, CpG-specific methylation association with 35 different clinical markers of asthma from whole blood samples measured on the Illumina EPIC v1 methylation array and assessed the proximity of significantly associated CpGs to SNPs previously associated with asthma. We included individuals from families enrolled in the Netherlands Twin Register where at least one member of a nuclear family (consisting of 4 or 5 individuals) was diagnosed with asthma. Blood samples were collected at two timepoints, with 341 individuals having a sample collected at the first timepoint in the 1990’s and 233 individuals a second timepoint approximately 10 years later in the 2000’s [25]. Clinical markers were assessed at timepoint 1, while epigenetic data was generated on DNA from blood specimens for both timepoint 1 and timepoint 2. ## Results ### Clinical marker summary and data reduction An initial set of 39 asthma-related clinical markers were measured. Markers that exhibited no variation between individuals were excluded (4 out of 39). Markers included measurements of lung function/capacity, skin prick test results measuring the reaction to common allergens, and others previously shown to be associated with asthma. Table 1 provides descriptions of the remaining 35 clinical markers. We performed an imputation step via the R package mice (v3.16.0) to supplement 12 of the 35 markers that had missing values. Distribution plots of these markers before and after imputation can be found in Supplemental Figure 1. We then performed principal component analysis on the 35 variables to reduce the data dimensionally using the r function robPCA (rospca version 1.0.4), which is specifically suited for non-normally distributed data [26]. The distributions of the principal component scores can be found in Supplemental Figure 2. View this table: [Table 1.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T1) Table 1. Descriptions/definitions for each clinical marker that was measured at T1. PCA investigating the multidimensional structure of the clinical marker data showed the first principal component to account for 35.27% of the variance. Principal component (PC) 1 was largely associated with methacholine challenge values, other measurements of lung capacity/functionality, and lung ailments such as coughing and wheezing. PCs 2, 3, and 4 accounted for 19.54%, 12.46%, and 9.75% of the variance, respectively, and were also mostly associated with functional lung measurements. PCs 5 and 6 accounted for 8.39% and 6.05% of the variance, respectively, and showed strong association with the FEV1/FVC ratio, also called modified Tiffeneau-Pinelli index. PC7 accounted for 4.28% of the variance and showed very strong associations to eosinophil counts and immunoglobulin levels. PCs 8, 9 and 10 accounted for 2.20%, 1.22%, and 0.84% of the variance and each associated with a variety of allergen response measurements (Figure 1). Supplemental Table 1 contains the raw eigen values and percentage of variance explained for each PC. ![Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/07/23/2024.07.22.24310829/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/F1) Figure 1. Heatmap showing correlations between the clinical asthma markers and principal components generated from the asthma marker data. An explanation of the variables is provided in Table 1. ### Timepoint 1 EWAS via 10 Generated PCs Epigenome-wide association studies of each of the ten principal components yielded multiple CpGs that reached genome-wide significance. In total, we identified 222 significantly associated CpGs (following a Bonferroni correction, α = 0.05 / 742,442). Of the 222, one, cg07329820, was associated with two PCs (PCs 9 and 10), resulting in a total of 221 unique CpGs that were significantly associated. Table 2 provides a summary of the covariate frequencies for the sample population. Table 3 summarizes the number of CpGs identified in each EWAS and Supplemental Figure 3 shows the results of each individual EWAS via Manhattan plots. Inflation values for each EWAS can be found in Supplemental Table 2. A complete list of all the CpGs significantly associated at T1 can be found in Supplemental Table 3, and complete summary statistics for each of the EWASs can be found in Supplemental Table 4. View this table: [Table 2.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T2) Table 2. Sample population covariate frequencies. View this table: [Table 3.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T3) Table 3. A summary of the genome-wide significant CpGs identified in each EWAS for time point 1 and timepoint 2. By far, the largest number of CpGs were associated with PC7 (which mostly captures variance from eosinophil counts and immunoglobulin levels). These 204 CpGs, largely, are composed of CpGs with a high average methylation level and almost exclusively show negative effect sizes, such that higher scores on PC7 correlate with lower methylation levels. The PC10 EWAS identified associations with 5 CpGs, followed by the PC2 EWAS with 4 significantly associated CpGs. EWASs for PCs 3, 5, and 9 showed significant associations with 2 CpGs. EWASs for PCs 1, 4, and 8 were significantly associated with 1 CpG, while the EWAS for PC6 was the only association study that did not identify any significantly associated CpGs. An enrichment analysis via the online EWAS Atlas ([https://ngdc.cncb.ac.cn/ewas/atlas/index](https://ngdc.cncb.ac.cn/ewas/atlas/index), accessed June 11, 2024) for the 221 unique CpGs was performed. This analysis identified that 71 (32.1%) and 173 (78.3%) of these CpGs had previously been identified to associate with allergic asthma and fractional exhaled nitric oxide (FENO), respectively (shown in Figure 2). Further, asthma related phenotypes such as childhood asthma, allergic sensitization, atopy, serum IgE levels, and others were shown to be previously associated with CpGs identified in our analysis. In total, 190 of the 221 CpGs (86.0%) were shown to have a previously identified association with asthma or some asthma marker, leaving 31 novel, unique associations. This enrichment analysis also highlighted significant overlap with the FoxO signalling pathway (7 CpGs previously associated). Enrichment analysis restricting to the 204 CpGs associated with PC7 recapitulated these findings (Supplemental Figure 4), showing that all of the previously associated CpGs were identified via this PC7 EWAS. We then performed an additional query of the eFORGE online database ([https://eforge.altiusinstitute.org/](https://eforge.altiusinstitute.org/), accessed July 2nd, 2024) with the 204 CpGs associated with PC7 to identify overlap with cell-type specific DNA regulatory elements. This identified significant associations with regulatory elements in multiple tissue sources, with the greatest number of significant findings (31 in total) associating with various subpopulations of white blood cells (Supplemental Figure 5). ![Figure 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/07/23/2024.07.22.24310829/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/F2) Figure 2. EWAS atlas enrichment analysis of the 221 unique CpGs identified at TP1 across the 10 EWASs. The plot shows the number of previously identified Differentially Methylated CpGs (DMC) that are present within the CpGs identified in our analysis. The EWAS Atlas job can be found via this job ID: ea5abe0e33f28b27ab3948e2ccd4044c. ### Analysis of overlap with previously identified SNPs An investigation of proximity with the 221 unique significantly associated CpGs was performed to identify SNPs that have been previously identified by Demenais et al. to associate with asthma (p< 5 × 10-8, N= 892 SNPs). This assessment was performed by searching for SNPs within 1 MB upstream and downstream of each CpG. We identified a total of 17 unique SNPs (1.9% of all asthma-associated SNPs) to reside within 1 MB of a CpG identified at time point 1. These 17 unique SNPs involved only two unique CpGs (cg02046836, cg23515090). A summary of these SNPs within proximity to each CpG is in Supplemental Table 5. Additionally, a query of the BBMRI methylation Quantitative Trait Loci (mQTL) database with these two CpGs was performed, which showed that the asthma GWAS SNPs are not mQTLs for cg02046836, cg23515090, and that these CpGs are also not associated with any other SNPs [27]. ### Timepoint 2 EWAS Of the initial 319 individuals who were included at the first timepoint, 182 individuals had a second whole blood sample collected approximately 10 years later. The DNA from these blood samples were also profiled on Illumina EPIC v1 methylation arrays (but no detailed asthma phenotyping was performed at follow-up). We repeated the EWASs for the ten PCs with DNA methylation data at T2. The EWASs of the ten PCs with DNA methylation data from samples collected ten years after asthma phenotyping identified 49 unique CpGs at epigenome-wide significance. While none of the CpGs identified at T1 were significant at T2, three CpGs identified at T2 were within 100 kb of a significant CpG identified at timepoint 1 (Supplemental Table 8). Furthermore, we assessed the correlation (Pearson’s) of the estimates for all 271 significant associations (222 at T1, 49 at T2) across the two time points, and we identified a strong correlation (r = 0.82). Table 3 outlines the number of CpGs identified in each time point 2 EWAS and Supplemental Figure 6 contains the Manhattan plots for these EWASs. Supplemental Table 6 contains all of the 49 significantly associated CpGs identified at T2, and Supplemental Table 7 states the inflation values for each of these 10 EWASs. ## Discussion Asthma is a complex chronic disease that has been shown to be influenced by multiple factors including genetics and epigenetics. Here, we have identified 221 unique CpGs that significantly associate with PC scores generated from an advanced, comprehensive panel of 35 clinical markers of asthma. Of these, 190 have previously been identified to associate with asthma and asthma-related phenotypes, while the other 31 CpGs are novel findings. Further, two of these CpGs reside within close proximity (1 MB) to 17 SNPs previously identified in a large GWAS of asthma (source). We then performed a longitudinal assessment using methylation data from whole blood samples collected approximately 10 years and identified 49 unique significantly associated CpGs. Finally, we yielded a correlation of the estimates across the two timepoints of 0.82 from the 271 (222 T1 + 49 T2) significant associations. Historically, asthma as a disease has been revealed to be extremely challenging to identify the root cause of and treat effectively [28]. The disease heterogeneity and mixed involvement of environmental stimuli and the genome have largely contributed to this difficulty. To avoid the complications that can come with this, we utilized measured clinical markers of asthma in young adults and their middle-aged parents from families with a history of asthma burden where at least one individual within the family had been diagnosed with asthma in some capacity. For each individual, an extensive number of clinical markers related to asthma were captured. This comprehensive dataset, while immensely valuable with its depth, is too overwhelming to glean the valuable nuances through individual assessments of each specific marker. Additionally, some markers that were collected lacked a normal distribution, which could reduce their performance in a traditional EWAS. For this reason, we utilized PCA to simplify the 35 different variables, and their accompanying variance, into 10 PCs with each successive PC capturing a lesser and different portion of the overall variance within the data set. This method reduced the total number of variables to investigate from 35 to 10, making comparisons of it with a rich DNA methylation dataset much more feasible. Additionally, the PCs maintained a much more normal distribution, making them ideal for EWAS. This method of phenotype data reduction and handling has been previously implemented in a GWAS of rice variants, where GWASs were successfully performed using the PC scores from multiple phenotype variables [29]. To our knowledge, this is the first instance of implementing it for epigenome-wide association studies. Through our EWASs of each of the 10 PCs, we identified 222 CpGs to be significantly associated with one of the PCs derived from the 35 clinical asthma markers. Of these 222 associations, only a single CpG was found to be replicated across multiple principal components, leaving 221 unique CpGs to be significantly associated. The single overlapping CPG, cg07329820, was identified with PCs 9 and 10. These two PCs align with asthma markers that are very similar (largely allergen response measures), which could explain why they identified the same CpG. A breakdown of the identified CpGs from each PC shows some intriguing details. PC7 had the greatest number of CpG associations with 204. While this PC accounts for only 4.3% of the total clinical marker variance, it almost exclusively represents the variance associated with blood eosinophil counts and blood immunoglobulin levels. This large number of associations could be indicative of the eosinophilic inflammatory response typically seen in allergic asthma. Previous literature has shown that methylation experiments in whole blood may capture more of the Th2-mediated immune response due to the nature of the cell population, making it the ideal sample source for identifying this type of association [30–32]. It is critical to note that because the proportions of cells themselves in a sample can drastically influence the measurement of methylation, we made sure to incorporate cell proportion estimates as a covariate in our analyses. Without this inclusion, it could be argued that the findings identified here are only due to differences in cellular composition. With this correction in mind, it may be the case that the differential methylation identified here is something global, across multiple immune cell types, either in response to or to promote the eosinophilic inflammatory pathway. A query of the EWAS Atlas of these 204 CpGs showed that, almost exclusively, the CpGs we identified have a previous association with asthma and measurements of asthma (Supplemental Figure 4). In total 190 of the 204 CpGs were shown to have some form of previous association (93.1%), leaving 14 novel associations. Since the majority of overlap of our findings with previous EWASs of asthma, asthma phenotypes, and symptoms are captured within this EWAS of PC7, we postulate that many of the previous findings generated in studies utilizing whole blood samples could be identifying associations with the underlying influences of increased eosinophil counts/activation and increased immunoglobulin levels. An additional investigation into these 204 CpGs via the eFORGE online database identified multiple significantly associated tissue-specific regulatory elements (Supplemental Figure 5). The greatest number of associations were from subpopulations of white blood cells in enhancers and weak transcriptional activators. This is to be expected considering our results were generated from samples of whole blood. Many of the subpopulations identified here are lymphocytes, potentially indicating that the methylation profiles of these adaptive immune cells may be altered due to the changes in eosinophil counts and immunoglobulin levels. The tissue group with the next largest number of significant associations was the digestive tract, with six significant associations to various subpopulations of cells. These associations could hint at the conserved role that all mucosal and epithelial barriers play in pathogen protection and the body’s response to the external environment. A KEGG pathway enrichment analysis (conducted via the EWAS Atlas) of the 221 unique CpGs highlighted the FoxO signaling pathway, showing that 7 of the CpGs (all found within the PC7 EWAS) have been previously identified to associate with this pathway. This pathway is of particular interest due its significant role in regulating apoptosis, glucose metabolism, oxidative stress, and longevity, which has led to findings associating its dysregulation with other illnesses such as Alzheimer’s disease, type 2 diabetes, and cancer [33, 34]. It has been previously shown that FoxO1 modulates IL-9 generating Th9 cells [35]. Further, overexpression of IL-9 in the lungs can result eosinophilic inflammatory infiltration and mucus secretion [35, 36]. Additionally, FoxO3 has been found to be expressed in airway epithelium playing a critical role in controlling the innate immune response to airway infections [36]. It could be the case, based on these intertangled findings of DNA methylation with eosinophilic presence and FoxO signaling, that epigenetic modulation of FoxO signaling plays a role in its contribution to asthma and airway inflammation. Principal component 1 identified a single significantly associated CpG, cg18182148. While this principal component does capture the largest amount of variance (35.3%), the variance it captures is largely from one specific subset of measurements, most notably those measuring general lung functionality such as vital capacity, peak flow, forced expiratory volume, and methacholine challenge response. This is continued with PC2, PC3, and PC4 which were significantly associated with 4, 2, and 1 CpGs, respectively. These PCs, like PC1, capture a significant amount of variance from these same lung function measurements. This may indicate that these measurements of lung function, though important in characterizing symptoms of asthma, may have limited identifiable associations with DNA methylation. The specific CpG significantly associated with PC1, cg18182148, is the only CpG of these identified here related to immune regulation. It has not been previously shown to associate with asthma phenotypes, though it has been previously identified in studies of various cancers of the prostate, liver, and colon (via the EWAS Atlas) [37–40]. These associations are likely due to its placement within the transcription start site (TSS) of GFI-1, a strong oncogenic and hematopoietic regulator. GFI-1 plays a critical role in lymphoid differentiation, which could be the reason for its association here due to the large influence the immune system can play on lung functionality and, more generally, asthma as a whole [41]. While the use of a whole blood sample, as done here, is logical when investigating an immune-mediated illness such as asthma, an investigation of these lung function measurements in association with DNA methylation measured via a lung epithelial sample may highlight additional findings not seen here and provide more information on the local cellular environment responsible for the specific differences in lung function. Principal components 5 and 6 capture variance associated with bronchial hyperreactivity, coughing, reversibility (of lung function following the administering of ventolin), and Tiffenau Index. PC5 was significantly associated with 2 CpGs, while PC6 did not have any significant associations. The two CpGs identified here, cg07454584 and cg18561513, have not been previously shown to have any associations with asthma or other immune-mediated illnesses (via the EWAS Atlas). Like the measurements captured in PCs 1-4, these asthma markers, while important for asthma diagnosis and the understanding of symptoms, may not have large quantities of associations with DNA methylation measured via whole blood. Principal components 8, 9, and 10 contain much of the variance from the numerous allergen tests included in this study (containing both skin prick test results and serum specific IgE measurements). These PCs were significantly associated with one, two, and 5 CpGs, respectively. Of the 8 CpGs identified here, all but one has not been previously shown to associate with chronic illnesses or immune-mediated diseases. Cg14161241, which was significantly associated with PC10, is the lone exception with previous associations with obesity and type 2 diabetes [42]. Like many of the other markers represented via PCs 1-6, the effect on DNA methylation measurable via a whole blood sample is likely limited with these markers. Due to the well-documented genetic contributions to asthma, we next wanted to compare our findings within the methylome to those previously discovered in the genome via large GWASs [10, 43]. Utilizing the findings of Demenais et al., we searched for significantly associated SNPs within 1 MB of the 221 unique CpGs that we identified. Of the significantly associated SNPs (p< 5 × 10-8, N= 892 SNPs), we found 17 (1.9%) to be within 1 MB of 2 of the 221 CpGs that we identified (cg02046836, cg23515090). Interestingly, neither of these two CpGs have been previously identified as an mQTL [27]. The proximity of these SNPs to the identified CpGs could be caused from multiple factors outside of known standard mQTL associations. It could be the case that these CpGs do have some genetic influence acting on their methylation status that has not yet been characterized. Additionally, it is certainly possible that the SNPs and CpGs being within close proximity to one another is simply a random occurrence, and their presence and downstream effects are independent of one another. To summarize, we found limited overlap of our EWAS signal with top loci from a previously performed GWAS of asthma and that these significant CpGs nearby these GWAS SNPs were not directly affected by mQTLs, which suggests that our EWAS largely captures independent epigenetic signal at loci that also harbor small amounts of genetic variants that influence genetic susceptibility to asthma. Thus far, we have shown significant associations with clinical markers of asthma from a singular, cross-sectional timepoint that corroborates other previous findings from similar studies. However, a potential increase in our understanding of asthma may lie in investigating the disease to some longitudinal capacity. Two previous studies in the Isle of Wight Birth Cohort (IOWBC) were conducted in 2022, which investigated the association of DNA methylation with asthma acquisition across adolescence and adulthood [44, 45]. In each of these studies, several CpGs, with some residing in immune regulatory genes, were identified to be associated with asthma acquisition [44, 45]. The findings from these two studies highlight promising insight into the potential longitudinal persistence of DNA methylation signatures associated with asthma, though additional studies are needed to investigate this persistence later in life (adulthood), in additional cohorts, and how they relate to specific measurements of asthma. After our initial findings, with the availability of an additional sample collected significantly later for many of the individuals, we were interested in investigating the longitudinal persistence of the DNA methylation signature that was identified to associate with asthma markers at the first timepoint. This analysis was based on the same PC data from the original 35 clinical markers measured at baseline and DNA methylation at follow-up from a sample collected ∼10 years later. The results from these additional 10 EWASs showed very little overlap with the findings from the initial assessment. In total, these 10 EWASs identified a total of 49 significantly associated CpGs (of which none were replicated across multiple PCs). No CpGs were replicated across the two timepoints, however, when looking within a 100 kb window of proximity, we found that 3 CpGs identified at timepoint 2 (found via PCs 2, 3, and 7) were within 100 kb of 4 CpGs identified at T1 (all found via PC7). The low direct overlap between the two timepoints could be due to the methylation signature associating with asthma diminishing over time, potentially as individuals’ asthma status and severity changes. A query of the EWAS Atlas, however, using these 49 CpGs identified at T2 showed that 3 and 1 CpGs have been previously associated with FENO and allergic sensitization, respectively (Figure 3). This highlights that there may be some longitudinal persistence of an asthmatic epigenetic signature. Further, we identified, when comparing the estimates of all 271 significant associations (222 + 49) across the two timepoints, a strong correlation of 0.82 adding to the notion that some long-term persistence of the asthma associated methylation signature may be occurring. This consistency of the EWAS estimates indicates that replication of specific CpGs could occur with an increased sample size. Additional studies investigating the long-term lung functionality following asthma diagnosis, even in cases of remission, could benefit from investigating DNA methylation longitudinally in a more nuanced, complete manner. ![Figure 3.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/07/23/2024.07.22.24310829/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/F3) Figure 3. EWAS atlas enrichment analysis of the 49 unique CpGs identified at TP2 across the 10 EWASs. The plot shows the number of previously identified Differentially Methylated CpGs (DMC) that are present within the CpGs identified in our analysis. The EWAS Atlas job can be found via this job ID: a3e03507f39d7af4c7a82d0b0725308e. Our study does have some limitations that should be considered. First, all individuals included in this study are of European descent and from the Netherlands. Asthma prevalence can differ greatly depending on the local environment of the individual and their ethnicity. Including a diverse population could improve the generality of the findings to other groups of individuals. Additionally, the results generated from the second time point contained a significantly reduced number of individuals due to sample availability, which could contribute to the differences reported. We also did not have clinical marker measurements available from the second timepoint, which could have shed some insight on the persistence of the asthma symptoms and severity at that time. Overall, increasing the number of individuals at both timepoints using multiple individuals from varying backgrounds would be the most optimal strategy and offer the most comprehensive assessment of asthma. ## Conclusion Asthma, along with many other chronic, immune-mediated diseases, is challenging to diagnose, classify, and treat. Here, we contribute a comprehensive assessment of the relationship between DNA methylation and a collection of clinical asthma markers. We identified 270 unique, significant CpGs to be associated with principal component scores based on clinical asthma markers. The large majority of these CpGs were associated with PC7, which represented eosinophil counts and blood immunoglobulin levels and 190 were previously associated with asthma, asthma markers, and other allergic illnesses. We also investigated the proximity of these 221 identified unique CpGs to previously identified SNPs that associate with asthma. We found 17 unique SNPs that were previously identified to be associated with asthma within 1 MB of two of the 221 CpGs reported here demonstrating limited proximal overlap with these known genetic associations. Over approximately 10 years we showed that the correlation of the estimates across the two timepoints from the 271 significant associations was strong at 0.82, pointing to a persistent methylation signature associated with asthma. Thus, these results highlight a robust, persistent epigenetic signal in whole blood associated with asthma and, more specifically, a selection of clinical asthma markers. ## Methods ### Participants The study participants and the enrollment process have been previously described [46–48]. Participants came from twin families (young adult twins and their parents) from the Netherlands [48]. To be eligible for the study, at least one member of the family had to report to have been diagnosed with asthma, indicated by self-report in the Netherlands Twin Register (NTR) survey 1 or 2, collected in the early 1990s (please see Supplementary Tables 9 and 10). During a visit to the Vrije Universiteit Medical Center (VUMC), blood samples and over 40 biological and clinical measures of asthma were collected from 425 persons [49]. A complete list of the clinical markers that were collected can be found in Table 1. A subset of 233 persons participated in the NTR Biobank project between 2004 and 2008 in which blood and buccal samples were collected [25]. DNA was extracted at the time of sample collection or immediately prior to the assessment of methylation [50]. All samples were simultaneously measured on the Illumina Infinium EPIC v1 methylation array. In total, 375 of the 425 individuals had a sample remaining for analysis, and 341 of these had good quality Illumina blood EPIC Array data from at least the first time point, and 232 individuals had blood Illumina EPIC Array data from two time points. The study was approved by the Central Ethics Committee on Research Involving Human Subjects of the VU University Medical Centre, Amsterdam, an Institutional Review Board certified by the U.S. Office of Human Research Protections (IRB number IRB00002991 under Federal-wide Assurance-FWA00017598; IRB/institute codes, NTR 03-180). ### Sample collection and DNA extraction The procedures of whole blood sample collection have been previously described [25, 46, 51]. The genomic material analyzed in this study have been extracted and assessed for DNA quantity, quality, purity, and individual identity. The DNA analyzed in this study was extracted from whole blood samples using the Zymo Quick DNA mini-prep Kit [50]. Genomic material was quantified via the Invitrogen Qubit Broad-Range Fluorescent Assay, and sample purity was assessed using standard absorbance metrics via a SpectraMax microplate reader. ### Genotyping The genotyping process has been previously described [47, 52]. Briefly, genotype data was generated via the Illumina Infinium Global Screening Array. All sample data quality was assessed vigorously, and the refined genotype data was used to confirm familial relationships and sample sex. In total, 20 individuals were omitted for analysis due to mismatched familial identities. ### DNA Methylation Assessment DNA bisulfite conversion was performed utilizing the Zymo EZ-96 DNA Methylation Kit [53]. DNA methylation was assessed using the Illumina Infinium EPIC v1 DNA Methylation Array on all samples at the Avera Genetics Laboratory [54]. The samples were fully randomized across arrays. ### DNA Methylation Data Quality Control Quality control has been described in detail previously [47]. In brief, DNA methylation data quality was performed using two bioinformatic tools. First, DNA methylation data was preliminarily assessed using the Illumina GenomeStudio 1.0 software. Second, methylation data quality control and normalization were conducted using a pipeline developed by the Biobank-based Integrative Omics Study (BIOS) Consortium in R (version 4.4.0). Sample quality was first assessed using the R package MethylAid (v1.38.0) to omit any outliers in sample performance (default thresholds) [55]. Array probe filtering and functional normalization were performed using the R package DNAmArray (v2.2.0) [56]. The R package omicsPrint (v.1.24.0) was utilized to identify sample genotypes based on methylation probes to verify sample relationships gathered from genotype data via the GSA [57]. The functions getSex from DNAmArray and the R package meffil (v1.3.8) were used to identify and confirm sample sex based on X chromosome methylation pattern [58]. The following probe filters were applied: Probes were set to missing (NA) in a sample if they had an intensity value of exactly zero, detection P-value > 0.01, or bead count < 3. DNAmArray will also remove any probes that show a success rate below 0.95 across all samples. In total, this filtering process reduced our number of individuals to 341 (from the original 375). Finally, previous studies performed by Zhou et al. in 2017 identified polymorphic and cross-reactive probes that are included on Illumina methylation array platforms [59]. Generally, they recommend an omitting of approximately 100,000 probes on the EPIC methylation array. This removal process was carried out using the probemasking() function via DNAmArray. Following these steps, a total of 759,263 methylation sites were included in this study out of the total 865,859 possible CpGs. Only autosomal methylation sites were considered for downstream analyses, which left a total of 742,442 CpGs included in the final analyses. Finally, we calculated cellular proportions for all samples using the IDOL whole blood reference library [60]. ### Covariates Several covariates were utilized in our epigenome-wide association studies. For each sample at each timepoint, covariate data was collected at the same time. We included age, sex, array row, sample plate (dummy-coded), smoking status, asthma/lung medication use, and cellular proportions. For smoking status, individuals were classified into three groups (codes as 0 = Never smoker, 1 = former smoker, 2 = current smoker). Cellular proportions (natural killer cells, monocytes, B cells, CD4+ T cells, CD8+T cells) for each of the whole blood samples were estimated via the methylation data from that sample. The usage of specific and general asthma medications (inhaled corticosteroids (0=no, 1=yes), betamimetica (0=no, 1=yes), general lung medication (0=no, 1=yes)) was recorded for each participant at the first time point, and all three medications were included as covariates for the T1 EWAS. The information gathered here was used at both time points. Due to limited variability in medication use at the second timepoint, only a single “general lung medication use” variable was used as a covariate for T2. ### Methylation Data Annotation Genomic annotations were gathered from the EPIC manifest file that is provided by Illumina (MethylationEPIC_v-1-0_B5.csv): locations of CpG islands, ENCODE DNase I Hypersensitive sites (DHSs), ENCODE transcription factor binding sites (TFBSs), open chromatin, FANTOM4 enhancers and FANTOM5 enhancers, etc. Genome build 37 coordinates were utilized for all the analyses. ### Asthma phenotyping Of the total list of available clinical markers, exclusions for markers were made if there was no variation amongst participants. In total, 35 clinical markers were selected to be included in the principal component analysis. If an individual themselves had more than 6 missing values they were excluded from the analysis as well, which resulted in removing 4 individuals leaving a total of 319 to include. 10 of the 35 markers contained missing values following these exclusion processes. For these missing values, the R package mice (v3.16.0) was utilized to impute the missing data. The default settings were utilized, 5 imputations per marker, predictive mean matching (pmm) for continuous traits, logistic regression (logreg) for categorical traits. Distribution plots for these markers prior to and after imputation can be found in Supplemental Figure 1. ### Principal component analysis Following imputation, the data was normalized using the scale function in R, which performs a z-score transformation for each marker. We then performed principal component analysis (PCA) including 10 principal components (PCs) on our 35 clinical markers that were included. PCA was performed using the robpca() r function via the R package rospca (v1.0.4), which is suited for non-normally distributed data [26, 61]. A scree plot of the eigen values for each of the 10 PCs can be found in Supplemental Figure 7. Outliers for each PC were identified via the R package ewaff (v0.0.2) using the ewaff.handle.outliers function with the following settings: method set to iqr, iqr range set to 3. Outliers were found in PCs 7-10. Principal component 7 contained four outliers, which were removed prior to further analyses (n = 315). Principal components 8-10 each contained several outliers. Due to the nature of the clinical marker data and the way the PCA captured the variance, these PCs largely captured variance from markers recording allergen responses, which resulted in individuals who had strong allergen responses having extreme PC scores. Due to the biological significance of these people, we decided to keep these individuals in the subsequent analyses by dichotomizing the group based on their outlier status (1 = outliers, 0 = not outlier), rather than excluding them. These new dichotomized values for PCs 8, 9, and 10 were then used for the downstream analyses. Frequencies for these PCs can be found in Supplemental Table 11. ### EWAS Ten epigenome-wide association studies (EWAS) were then performed via the PC scores for each individual for each of these 10 PCs using DNA methylation beta value as the outcome and the PC scores for each individual as the predictor. Due to the related nature of the samples utilized here, each EWAS was conducted via the r package gee (v4.13-25) using a generalized estimating equations (GEE) model that corrected for the correlation structure in families. The following settings were utilized: Gaussian link function, 100 iterations, and the “exchangeable” option to correct for the familial structure. Additionally, each EWAS included age, sex, methylation array row, bisulfite sample plate (dummy-coded), smoking status (codes as 0 = Never smoker, 1 = former smoker, 2 = current smoker), estimated cellular proportions (natural killer cells, monocytes, B cells, CD4+ T cells, CD8+T cells), and lung medication status via questionnaire (inhaled corticosteroids, betamimetica, general lung medication) as covariates. For each EWAS, a standard Bonferroni correction (α = 0.05 / # of CpGs (742,442)) was applied to account for multiple testing to identify significantly associated CpGs. Enrichment analysis was performed in the EWAS Atlas with the EWAS Toolkit, which compares against previously known associations from previous studies investigating DNA methylation. We also conducted an enrichment analysis via the online tool eFORGE, which is a database of known CpG associations to cell-type specific regulatory elements. This analysis was performed using the “Consolidated Roadmap Epigenomics – Chromatin – All 15-state marks” data set and the standard settings. Next, we cross-analyzed our significantly associated CpGs with significantly associated SNPs (p< 5 × 10-8, N= 892 SNPs) identified by Demenais et al in a large meta-analysis [10], utilizing leave-one-out GWAS summary statistics without NTR participants. As has been previously utilized in other studies, a 1 MB range around each CpG was utilized as a cutoff point to identify SNPs within close proximity [62]. Finally, we investigated the CpGs that were found to be within close proximity to a known significantly associated SNP for any previously identified mQTL association by querying the BBMRI mQTL database (based on a previous study by Bonder et. Al.) [27]. Of the 319 individuals included in timepoint 1, 182 had a follow-up blood sample with methylation data available for subsequent analyses at a later time point. For this second timepoint, an additional 10 EWASs were performed using the same PC scores generated from the timepoint 1 clinical marker data to assess longitudinal persistence of the DNA methylation signature associated with these asthma markers. The EWASs were performed with largely the same models as at timepoint 1 samples (stated above) but utilizing covariate data from time point 2. Due to the reduction of individuals and a lack of unique medication covariate combinations, asthma medication status was reduced to a Bayesian variable grouping individuals on the basis of using any one of the three listed medications. This single covariate capturing asthma medication status was then used for these timepoint 2 EWASs. An additional longitudinal comparison was made looking for CpGs in timepoint 1 that were within 100 KB (upstream and downstream) of a CpG identified in timepoint 2. Similar to timepoint 1, following this the timepoint 2 CpGs were grouped and a query of the EWAS Atlas was performed looking for previously identified associations. ## Supporting information Supplementary Figure 5. Enrichment results from the eFORGE database. [[supplements/310829_file03.pdf]](pending:yes) Supplementary Table 4. Independent attachment with complete summary statistics for each of the 10 EWASs. [[supplements/310829_file04.csv]](pending:yes) Supplementary Table 3. Independent attachment with a complete list of significant CpGs for all 10 timepoint 1 EWASs. [[supplements/310829_file05.csv]](pending:yes) ## Funding We acknowledge funding from the Amsterdam Public Health Institute (personalized medicine innovation grant); the Avera Institute, Sioux Falls (USA); the National Institutes of Health (NIH R01 HD042157-01A1, MH081802, and Grand Opportunity grants: 1RC2 MH089951 and 1RC2 MH089995); the Netherlands Organization for Scientific Research (NWO): Netherlands Twin Registry Repository: researching the interplay between genome and environment (NWO-Groot 480-15-001/674); Biobanking and Biomolecular Research Infrastructure (BBMRI–NL, NWO 184.033.111); the BBRMI-NL-financed BIOS Consortium (NWO 184.021.007), Geno-type/phenotype database for behavior genetic and genetic epidemiological studies (ZonMw Middelgroot 911-09-032); and large-scale infrastructures X-Omics (184.034.019). DIB acknowledges the Royal Netherlands Academy of Science Professor Award (PAH/6635). AJVA acknowledges the University of South Dakota Wesley H. Parke Research Award. ## Data Availability Statement All NTR data can be requested from bona fide researchers ([https://ntr-data-request.psy.vu.nl/](https://ntr-data-request.psy.vu.nl/)). Because of the consent given by the study participants, the data cannot be made publicly available. The pipeline for DNA methylation–array analysis developed by the Biobank-based Integrative Omics Study (BIOS) consortium is available at [https://molepi.github.io/DNAmArray_workflow/](https://molepi.github.io/DNAmArray_workflow/) ([https://doi.org/10.5281/zenodo.3355292](https://doi.org/10.5281/zenodo.3355292)). ## Conflicts of Interest The authors declare no conflict of interest. ## Supplementary Tables and Figures View this table: [Supplementary Table 1.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T4) Supplementary Table 1. Table showing PC eigen values and % of total variance captured. View this table: [Supplementary Table 2.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T5) Supplementary Table 2. EWAS results for time point 1 with inflation values. **Supplementary Table 3.** Independent attachment with a complete list of significant CpGs for all 10 timepoint 1 EWASs. **Supplementary Table 4.** Independent attachment with complete summary statistics for each of the 10 EWASs. View this table: [Supplementary Table 5.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T6) Supplementary Table 5. Summary table of the GWAS SNPs identified by Demenais et. al. to reside within 1 MB of a CpG identified at timepoint 1. **Supplementary Table 6.** Independent attachment with a complete list of significant CpGs for all 10 timepoint 2 EWASs. View this table: [Supplementary Table 7.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T7) Supplementary Table 7. EWAS results for time point 2 with inflation values. View this table: [Supplementary Table 8.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T8) Supplementary Table 8. The CpGs from timepoint 1 that reside within 100 kb of a CpG identified at timepoint 2. View this table: [Supplementary Table 9.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T9) Supplementary Table 9. Questionnaire responses for two surveys asking about asthma status prior to study enrollment for the initial 425 individuals. View this table: [Supplementary Table 10.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T10) Supplementary Table 10. Comparison of survey responses from individuals who responded to both surveys. View this table: [Supplementary Table 11.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/T11) Supplementary Table 11. Frequencies of outliers for principal components 8, 9, and 10. Outlier status was determined using thresholds of Q1 – (3 x IQR) and Q3 + (3 x IQR). ![Supplementary Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/07/23/2024.07.22.24310829/F4.medium.gif) [Supplementary Figure 1.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/F4) Supplementary Figure 1. Distribution plots of the imputed clinical markers of asthma prior to (blue) and after imputation (yellow). ![Supplementary Figure 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/07/23/2024.07.22.24310829/F5.medium.gif) [Supplementary Figure 2.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/F5) Supplementary Figure 2. Distribution plots of the individual principal component scores for each PC. ![Supplementary Figure 3.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/07/23/2024.07.22.24310829/F6.medium.gif) [Supplementary Figure 3.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/F6) Supplementary Figure 3. EWAS results calculated the 10 different sets of PC scores from the first timepoint. The red line indicates the significance threshold set via a Bonferroni correction. ![Supplementary Figure 4.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/07/23/2024.07.22.24310829/F7.medium.gif) [Supplementary Figure 4.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/F7) Supplementary Figure 4. EWAS atlas enrichment analysis of the 204 CpGs identified at TP1 from PC7. The EWAS Atlas job can be found via this job ID: 2a37ab12d2785eaa0d564e062cff5f96. ![Supplementary Figure 5.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/07/23/2024.07.22.24310829/F8.medium.gif) [Supplementary Figure 5.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/F8) Supplementary Figure 5. Enrichment results from the eFORGE database (separate PDF included for better visualization). ![Supplementary Figure 6.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/07/23/2024.07.22.24310829/F9.medium.gif) [Supplementary Figure 6.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/F9) Supplementary Figure 6. EWAS results calculated the 10 different sets of PC scores from the second timepoint. The red line indicates the significance threshold set via a Bonferroni correction. ![Supplementary Figure 7.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/07/23/2024.07.22.24310829/F10.medium.gif) [Supplementary Figure 7.](http://medrxiv.org/content/early/2024/07/23/2024.07.22.24310829/F10) Supplementary Figure 7. Scree plot showing the raw eigen values for each of the 10 principal components included. ## Acknowledgments We would like to acknowledge all the participants in the Netherlands Twin Register for their willingness to participate in this study. We would also like to acknowledge the BIOS Consortium. A list of the contributors to this consortium and other additional details can be found at [https://www.bbmri.nl/bios-consortium](https://www.bbmri.nl/bios-consortium). * Received July 22, 2024. * Revision received July 22, 2024. * Accepted July 23, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.Asthma. 2023 [cited 2024 02/26/2024]; Available from: [https://www.who.int/news-room/fact-sheets/detail/asthma#:~:text=Inflammation%20and%20narrowing%20of%20the,and%20caused%20455%20000%20deaths](https://www.who.int/news-room/fact-sheets/detail/asthma#:~:text=Inflammation%20and%20narrowing%20of%20the,and%20caused%20455%20000%20deaths). 2. 2.Global burden of 369 diseases and injuries in 204 countries and territories, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet, 2020. 396(10258): p. 1204–1222. 3. 3.Jones, T.L., D.M. Neville, and A.J. Chauhan, Diagnosis and treatment of severe asthma: a phenotype-based approach. Clin Med (Lond), 2018. 18(Suppl 2): p. s36–s40. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7861/clinmedicine.18-2-s36&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29700091&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 4. 4.Svenningsen, S. and P. Nair, Asthma Endotypes and an Overview of Targeted Therapy for Asthma. Frontiers in Medicine, 2017. 4. 5. 5.Gans, M.D. and T. Gavrilova, Understanding the immunology of asthma: Pathophysiology, biomarkers, and treatments for asthma endotypes. Paediatr Respir Rev, 2020. 36: p. 118–127. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 6. 6.Kuruvilla, M.E., F.E. Lee, and G.B. Lee, Understanding Asthma Phenotypes, Endotypes, and Mechanisms of Disease. Clin Rev Allergy Immunol, 2019. 56(2): p. 219–233. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 7. 7.Lötvall, J., et al., Asthma endotypes: a new approach to classification of disease entities within the asthma syndrome. J Allergy Clin Immunol, 2011. 127(2): p. 355–60. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jaci.2010.11.037&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21281866&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000286808000007&link_type=ISI) 8. 8.Kaur, R. and G. Chupp, Phenotypes and endotypes of adult asthma: Moving toward precision medicine. J Allergy Clin Immunol, 2019. 144(1): p. 1–12. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jaci.2019.05.031&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 9. 9.Ntontsi, P., et al., Genetics and Epigenetics in Asthma. Int J Mol Sci, 2021. 22(5). 10. 10.Demenais, F., et al., Multiancestry association study identifies new asthma risk loci that colocalize with immune-cell enhancer marks. Nat Genet, 2018. 50(1): p. 42–53. 11. 11.Ferreira, M.A.R., et al., Genetic Architectures of Childhood- and Adult-Onset Asthma Are Partly Distinct. Am J Hum Genet, 2019. 104(4): p. 665–684. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2019.02.022&link_type=DOI) 12. 12.Yang, I.V., et al., DNA methylation and childhood asthma in the inner city. J Allergy Clin Immunol, 2015. 136(1): p. 69–80. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jaci.2015.01.025&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25769910&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 13. 13.Xu, C.J., et al., DNA methylation in childhood asthma: an epigenome-wide meta-analysis. Lancet Respir Med, 2018. 6(5): p. 379–388. 14. 14.Sheikhpour, M., et al., A review of epigenetic changes in asthma: methylation and acetylation. Clinical Epigenetics, 2021. 13(1): p. 65. 15. 15.Smith, Z.D. and A. Meissner, DNA methylation: roles in mammalian development. Nat Rev Genet, 2013. 14(3): p. 204–20. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nrg3354&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23400093&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 16. 16.Moore, L.D., T. Le, and G. Fan, DNA Methylation and Its Basic Function. Neuropsychopharmacology, 2013. 38(1): p. 23–38. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/npp.2012.112&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22781841&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000312099500003&link_type=ISI) 17. 17.Jin, Z. and Y. Liu, DNA methylation in human diseases. Genes Dis, 2018. 5(1): p. 1–8. 18. 18.Torrone, D., et al., Reproducibility and intraindividual variation over days in buccal cell DNA methylation of two asthma genes, interferon γ (IFNγ) and inducible nitric oxide synthase (iNOS). Clin Epigenetics, 2012. 4(1): p. 3. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22414378&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 19. 19.Lovinsky-Desir, S., et al., DNA methylation of the allergy regulatory gene interferon gamma varies by age, sex, and tissue type in asthmatics. Clin Epigenetics, 2014. 6(1): p. 9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1868-7083-6-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24891923&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 20. 20.Cardenas, A., et al., The nasal methylome as a biomarker of asthma and airway inflammation in children. Nat Commun, 2019. 10(1): p. 3095. 21. 21.Yang, I.V., et al., The nasal methylome and childhood atopic asthma. J Allergy Clin Immunol, 2017. 139(5): p. 1478–1488. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jaci.2016.07.036&link_type=DOI) 22. 22.Forno, E., et al., DNA methylation in nasal epithelium, atopy, and atopic asthma in children: a genome-wide study. Lancet Respir Med, 2019. 7(4): p. 336–346. 23. 23.Jeong, A., et al., DNA Methylation in Inflammatory Pathways Modifies the Association between BMI and Adult-Onset Non-Atopic Asthma. Int J Environ Res Public Health, 2019. 16(4). 24. 24.Bae, D.J., et al., Epigenetic Changes in Asthma: Role of DNA CpG Methylation. Tuberc Respir Dis (Seoul), 2020. 83(1): p. 1–13. 25. 25.Willemsen, G., et al., The Netherlands Twin Register Biobank: A Resource for Genetic Epidemiological Studies. Twin Research and Human Genetics, 2010. 13(3): p. 231–245. 26. 26.Hubert, M. and S. Engelen, Robust PCA and classification in biosciences. Bioinformatics, 2004. 20(11): p. 1728–36. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/bth158&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14988110&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000223142000010&link_type=ISI) 27. 27.Bonder, M.J., et al., Disease variants alter transcription factor levels and methylation of their binding sites. Nat Genet, 2017. 49(1): p. 131–138. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3721&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27918535&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 28. 28.North, M.L. and A.K. Ellis, The role of epigenetics in the developmental origins of allergic disease. Ann Allergy Asthma Immunol, 2011. 106(5): p. 355–61; quiz 362. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.anai.2011.02.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21530865&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000290509100002&link_type=ISI) 29. 29.Yano, K., et al., GWAS with principal component analysis identifies a gene comprehensively controlling rice architecture. Proc Natl Acad Sci U S A, 2019. 116(42): p. 21262–21267. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMjoiMTE2LzQyLzIxMjYyIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDcvMjMvMjAyNC4wNy4yMi4yNDMxMDgyOS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 30. 30.Georas, S.N., et al., T-helper cell type-2 regulation in allergic disease. European Respiratory Journal, 2005. 26(6): p. 1119–1137. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiZXJqIjtzOjU6InJlc2lkIjtzOjk6IjI2LzYvMTExOSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzIzLzIwMjQuMDcuMjIuMjQzMTA4MjkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 31. 31.Bosnjak, B., et al., Treatment of allergic asthma: modulation of Th2 cells and their responses. Respir Res, 2011. 12(1): p. 114. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1465-9921-12-114&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21867534&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 32. 32.Barnes, P.J., Pathophysiology of allergic inflammation. Immunol Rev, 2011. 242(1): p. 31–50. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1600-065X.2011.01020.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21682737&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 33. 33.Du, S. and H. Zheng, Role of FoxO transcription factors in aging and age-related metabolic and neurodegenerative diseases. Cell & Bioscience, 2021. 11(1): p. 188. 34. 34.Farhan, M., et al., FOXO Signaling Pathways as Therapeutic Targets in Cancer. Int J Biol Sci, 2017. 13(7): p. 815–827. 35. 35.Malik, S. and A. Awasthi, Transcriptional Control of Th9 Cells: Role of Foxo1 in Interleukin-9 Induction. Front Immunol, 2018. 9: p. 995. 36. 36.Kim, M.E., D.H. Kim, and J.S. Lee, FoxO Transcription Factors: Applicability as a Novel Immune Cell Regulators and Therapeutic Targets in Oxidative Stress-Related Diseases. Int J Mol Sci, 2022. 23(19). 37. 37.Tang, Y., et al., Promoter DNA methylation analysis reveals a combined diagnosis of CpG-based biomarker for prostate cancer. Oncotarget, 2017. 8(35): p. 58199–58209. 38. 38.Shen, J., et al., Exploring genome-wide DNA methylation profiles altered in hepatocellular carcinoma using Infinium HumanMethylation 450 BeadChips. Epigenetics, 2013. 8(1): p. 34–43. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.4161/epi.23062&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23208076&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 39. 39.Sun, X.J., et al., An integrated analysis of genome-wide DNA methylation and gene expression data in hepatocellular carcinoma. FEBS Open Bio, 2018. 8(7): p. 1093–1103. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/2211-5463.12433&link_type=DOI) 40. 40.Zhu, L., et al., Genome-wide DNA methylation profiling of primary colorectal laterally spreading tumors identifies disease-specific epimutations on common pathways. Int J Cancer, 2018. 143(10): p. 2488–2498. 41. 41.van der Meer, L.T., J.H. Jansen, and B.A. van der Reijden, Gfi1 and Gfi1b: key regulators of hematopoiesis. Leukemia, 2010. 24(11): p. 1834–1843. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/leu.2010.195&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20861919&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000284057400002&link_type=ISI) 42. 42.Kirchner, H., et al., Altered DNA methylation of glycolytic and lipogenic genes in liver from obese and type 2 diabetic patients. Mol Metab, 2016. 5(3): p. 171–183. 43. 43.Moffatt, M.F., et al., A large-scale, consortium-based genomewide association study of asthma. N Engl J Med, 2010. 363(13): p. 1211–1221. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa0906312&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20860503&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000282050000005&link_type=ISI) 44. 44.Rathod, A., et al., DNA Methylation and Asthma Acquisition during Adolescence and Post-Adolescence, an Epigenome-Wide Longitudinal Study. J Pers Med, 2022. 12(2). 45. 45.Li, L., et al., Newborn DNA methylation and asthma acquisition across adolescence and early adulthood. Clin Exp Allergy, 2022. 52(5): p. 658–669. 46. 46.Wu, T., et al., Genetic and environmental influences on objective intermediate asthma phenotypes in Dutch twins. Eur Respir J, 2010. 36(2): p. 261–8. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiZXJqIjtzOjU6InJlc2lkIjtzOjg6IjM2LzIvMjYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDcvMjMvMjAyNC4wNy4yMi4yNDMxMDgyOS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 47. 47.Van Asselt, A.J., et al., Genome-Wide DNA Methylation Profiles in Whole-Blood and Buccal Samples-Cross-Sectional, Longitudinal, and across Platforms. Int J Mol Sci, 2023. 24(19). 48. 48.Los, H., P.E. Postmus, and D.I. Boomsma, Asthma genetics and intermediate phenotypes: a review from twin studies. Twin Res, 2001. 4(2): p. 81–93. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1375/1369052012191&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11665340&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 49. 49.Bechger, T.M., D.I. Boomsma, and H. Koning, A Limited Dependent Variable Model for Heritability Estimation with Non-Random Ascertained Samples. Behavior Genetics, 2002. 32(2): p. 145–151. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/A:1015257908396&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12036112&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000175458100008&link_type=ISI) 50. 50.Quick-DNA Miniprep Kit Instruction Manual, in Online, Z. Research, Editor. 2020, Zymo Research: [https://www.zymoresearch.com/collections/quick-dna-kits/products/quick-dna-miniprep](https://www.zymoresearch.com/collections/quick-dna-kits/products/quick-dna-miniprep). 51. 51.van Dongen, J., et al., Genetic and environmental influences interact with age and sex in shaping the human methylome. Nat Commun, 2016. 7: p. 11115. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ncomms11115&link_type=DOI) 52. 52.Van Asselt, A.J. and E.A. Ehli, Whole-Genome Genotyping Using DNA Microarrays for Population Genetics, in Estrogen Receptors: Methods and Protocols, K.M. Eyster, Editor. 2022, Springer US: New York, NY. p. 269–287. 53. 53.Research, Z., EZ-96 DNA Methylation™ Kit, in Online, Z. Research, Editor. 2021. 54. 54.Infinium HD Assay Methylation Protocol Guide. [Protocol] 2015 [cited 2020; 15019519 v01:[Available from: [https://www.illumina.com/products/by-type/microarray-kits/infinium-methylation-epic.html](https://www.illumina.com/products/by-type/microarray-kits/infinium-methylation-epic.html). 55. 55.van Iterson, M., et al., MethylAid: visual and interactive quality control of large Illumina 450k datasets. Bioinformatics, 2014. 30(23): p. 3435–7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btu566&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25147358&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 56. 56.Fortin, J.-P., et al., Functional normalization of 450k methylation array data improves replication in large cancer studies. Genome Biology, 2014. 15(11): p. 503. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13059-014-0503-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25599564&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 57. 57.van Iterson, M., et al., omicsPrint: detection of data linkage errors in multiple omics studies. Bioinformatics, 2018. 34(12): p. 2142–2143. 58. 58.Min, J.L., et al., Meffil: efficient normalization and analysis of very large DNA methylation datasets. Bioinformatics, 2018. 34(23): p. 3983–3989. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/bty476&link_type=DOI) 59. 59.Zhou, W., P.W. Laird, and H. Shen, Comprehensive characterization, annotation and innovative use of Infinium DNA methylation BeadChip probes. Nucleic Acids Res, 2017. 45(4): p. e22. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkw967&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 60. 60.Salas, L.A., et al., An optimized library for reference-based deconvolution of whole-blood biospecimens assayed using the Illumina HumanMethylationEPIC BeadArray. Genome Biol, 2018. 19(1): p. 64. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13059-018-1448-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F23%2F2024.07.22.24310829.atom) 61. 61.Hubert, M., P. Rousseeuw, and K. Branden, ROBPCA: A new approach to robust principal component analysis. Technometrics, 2005. 47: p. 64–79. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1198/004017004000000563&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000226647400007&link_type=ISI) 62. 62.Odintsova, V.V., et al., DNA methylation in peripheral tissues and left-handedness. Sci Rep, 2022. 12(1): p. 5606.