Genetic architecture of brain age and its casual relations with brain and mental disorders ========================================================================================== * Esten H. Leonardsen * Didac Vidal-Piñeiro * James M. Roe * Oleksandr Frei * Alexey A. Shadrin * Olena Iakunchykova * Ann-Marie G. de Lange * Tobias Kaufmann * Bernd Taschler * Stephen M. Smith * Ole A. Andreassen * Thomas Wolfers * Lars T. Westlye * Yunpeng Wang ## Abstract The difference between chronological age and the apparent age of the brain estimated from brain imaging data — the brain age gap (BAG) — is widely considered a general indicator of brain health. Converging evidence supports that BAG is sensitive to an array of genetic and non-genetic traits and diseases, yet few studies have examined the genetic architecture and its corresponding causal relationships with common brain disorders. Here, we estimate BAG using state-of-the-art neural networks trained on brain scans from 53,542 individuals (age range 3-95 years). A genome-wide association analysis across 28,104 individuals (40-84 years) from the UK Biobank revealed eight independent genomic regions significantly associated with BAG (p<5×10−8) implicating neurological, metabolic, and immunological pathways – among which seven are novel. No significant genetic correlations or causal relationships with BAG were found for Parkinson’s disease, major depressive disorder, or schizophrenia, but two-sample Mendelian randomization indicated a causal influence of AD (p=7.9×10−4) and bipolar disorder (p=1.35×10−2) on BAG. These results emphasize the polygenic architecture of brain age and provide insights into the causal relationship between selected neurological and neuropsychiatric disorders and BAG. ## Introduction Over the last decade, brain age has emerged as a promising measure of overall brain health1, 2. To estimate brain age, machine learning models are applied to brain imaging data to learn visual patterns characteristic of different ages3, 4. The difference between predicted brain age and chronological age is termed the brain age gap (BAG) and indicates deviation from a normative aging trajectory, a potential health indicator. Earlier studies have found a large variation in the predicted brain age of individuals with the same chronological age, and that these interindividual variations correlate with neurological and mental disorders5-7, such as dementia6, 8, schizophrenia (SCZ)9, 10, major depressive disorder (MDD)11, and also mortality7,12. In addition, biological, environmental and lifestyle factors associated with these disorders have been reported to correlate with BAG, for example infections13, 14, smoking5, physical activity15, and education level16. Genetic differences have been shown to explain a sizeable portion of interindividual variation in BAG. Twin-based heritability for BAG has been estimated to be as high as 0.6617, and single nucleotide polymorphism (SNP)-based heritability estimates are also relatively high — around 0.2 6, 18. Earlier gene-discovery efforts investigating genetic associations with BAG have found and examined two genomic loci in detail: one on chromosome 1 containing the potassium channel gene, *KCNK2*, and one in the chromosome 17 inversion region (17q21.31)18, 19; Genetic variants in these two regions together explain a negligible fraction of estimated SNP-heritability18. These results suggest that existing GWAS were potentially underpowered to fully characterize the genetic architecture, supported by studies using a conditional false discovery rate-based models yielding a larger set of associations6. Furthermore, Smith *et al*.20 found a rich set of associations when investigating different facets of a multimodal brain age, suggesting that the interplay between genetic variants is complex. Although BAG has been frequently associated with mental disorders, the underlying genetic basis for the observed association has seldom been investigated, possibly due to incomplete knowledge of the genetic architecture of the former. Furthermore, the causal relationships between BAG and brain disorders remain untapped. Mendelian randomization (MR) has become an attractive tool to interrogate cause-effect relationships between risk factors or disorders21. Two-sample MR models have been used to infer causal relations between hundreds of traits or diseases22. However, MR analyses targeting the causal relations between BAG and brain disorders and associated traits have been lacking23. In the present work, we improve the yield of genetic associations for BAG using three strategies: First, we estimate brain age using a state-of-the-art deep neural network architecture (SFCN-reg) trained on one of the largest samples assembled to date5. Then we perform a GWAS for BAG on out-of-sample predictions for a portion of the UK Biobank v3 data containing 28,104 unrelated individuals, about eight thousand more than earlier studies. Finally, we use two-sample MR to assess the genetic and causal relations between BAG and SCZ, bipolar disorder (BIP), Alzheimer’s disease (AD), MDD, and Parkinson’s disease (PD). ## Methods ### Sample All datasets used in the present study have been obtained from previously published studies which have been approved by their respective institutional review board, research ethic committee or other relevant ethic organizations. We used UK Biobank imaging data (UKB, accession number 27412) released in 2019 in combination with a pre-compiled dataset from various sources (**Supplementary Table S1**) for brain age model training and estimation. For the downstream genetic analyses, we started with the initial 40,330 UKB participants that had undergone at least one brain scan (using baseline scans where more were available). We excluded those with recorded brain injury or neurological or psychiatric conditions, those failing standard image quality checks5, those whose genetic data failed quality check by the UKB team, in addition to participants who withdrew consent. Among related participants (UKB-reported kinship coefficient >0), only one was included in the study. In total, 28,104 unique participants remained. ### Brain age estimation A minimal preprocessing protocol was applied to all raw T1-weighted brain MRI images before brain age estimation5: The auto-recon pipeline from FreeSurfer 5.324 was used to remove non-brain tissue. The resulting volumes were reoriented to the standard FSL25 orientation using fslreorient2std, and linearly registered to the 1 mm FSL (version 6.0) MNI152 template using FLIRT26, with 6 degrees of freedom. For efficiency, during model fitting, we cropped a central cube spanning the voxels 6:173, 2:214, and 0:160 in the sagittal, coronal, and axial dimensions, respectively. Before modelling, all voxel intensities were normalized by a constant factor to produce values in the range [0, 1]. The data from all sources (**Supplementary Table S1** and UKB) were split into five equally-sized and disjoint folds with comparable age-ranges and sex distributions. Four of these folds were used for fitting the brain age model, and out-of-sample estimates were computed for the remaining fold. This procedure was repeated five times, resulting in out-of-sample brain age estimates for all participants. Next, BAG was calculated by subtracting chronological age from estimated brain age. The subsequent analyses were performed on the out-of-sample estimates of the UKB data (**Supplementary Table S2**). ### Genome-wide association study Imputed genotypes for the 28,104 participants were obtained from UKB (Category 100314, for further details see27). We excluded SNPs based on missing rate (>0.02), the Hardy-Weinberg Equilibrium test (p<10−6) and minor allele frequency (MAF; < 0.01). In total, ≈ 8.6 million SNPs were analyzed. Since we have observed apparent differences in predicted brain age across folds (**Supplementary Fig. S1**), a GWAS was performed on each hold-out fold separately using PLINK 1.90 beta28. The additive genetic model was assumed, and chronological age, sex and the top ten principal components were included as covariates, accounting for population structure. Association results for each hold-out fold of UKB along with distributions of BAG are shown in **Supplementary Figure S1**. These association results were then meta-analyzed using the inverse variance weighted model implemented in PLINK to identify SNPs that are associated with BAG. **Supplementary Figure S2** shows the association QQ plot which indicated no noticeable genomic inflation. ### Associated regions and genes Association results were ‘clumped’ by the FUMA29 web-service using the linkage disequilibrium (LD) structure from the 1000 Genomes projects phase 3 EUR dataset (1KGp3), with parameters *–clump-p 5e-8 –clump-2 1e-6 –clump-r2 0*.*1*. The standard 250 kilo-bases (kb) were used as the inter-region distance threshold. Genes whose genomic coordinates located within the boundaries of each region were assigned to the corresponding region. SNPs with the smallest association p-values were taken as the lead SNPs for the corresponding regions. In addition, the gene that is closest to each lead SNP by genomic position was annotated using the Ensembl tool VEP30 (**Table 1**). View this table: [Table 1](http://medrxiv.org/content/early/2023/01/26/2023.01.09.23284310/T1) Table 1 Genomic loci associated with BAG. Associated regions were fine-mapped using the FINEMAP31 program. The LD structure from 1KGp3 was also used in this analysis. The default settings of FINEMAP were used, which compares causal models assuming one causal variant in each region to that assuming two, based on the estimated posterior probabilities (PP_1 versus PP_2). FINEMAP ranks all possible configurations in each model presented as 95% credible sets. The confidence of a variant belonging to a set was evaluated by posterior probabilities of inclusion (PPI). In the case of assuming one causal variant, each single variant was assigned a PPI. In the two causal variants cases, each pair of variants was assigned a PPI. ### Post-GWAS annotations Both FUMA and Garfield32 were used for annotating associated SNPs. First, SNPs were assigned to genic elements (*e*.*g*., exon, intron, 3’ and 5’ untranslated regions, intergenic regions, etc.), and the enrichment of this assignment was tested by hypergeometric test (FUMA) or logistic regression models (Garfield). Expression levels of annotated genes to the associated SNPs were inspected in each of the 54 tissue types from the GTEx v8 dataset33. Detailed biological functions for proteins coded by these genes were manually searched in the NCBI Entrez Gene database34 and the UniProtKB database35. ### Genetic correlations between BAG and disorders GWAS summary data for four disorders (SCZ36, BIP37, MDD38, and AD39) were obtained from the Psychiatric Genomics Consortium (PGC, [https://med.unc.edu/pgc/download-results](https://med.unc.edu/pgc/download-results)). For each GWAS, the association results for European ancestral samples excluding samples from 23andMe were used (SCZ: n case=67,390, n control=94,015; BIP: n case=41,917, n control=371,549; MDD: n case=59,851, n control=113,154; AD: n case=71,880, n control=383,378). The PD GWAS results were obtained from the fixed-effect meta-analysis performed by the International Parkinson Disease Genomics Consortium (IPDGC, n case=33674, n control=449056)40. Before post-GWAS analysis we processed the results from all GWAS using a standard protocol. Specifically, SNPs having a MAF <0.05, or imputation INFO <0.5, or ambiguous allelic coding (A/T, or C/G) were removed from subsequent analyses. The LD score model (ldsc)41 was applied to estimate SNP-heritability and genetic correlations between BAG and disorders. Only high-quality SNPs published in the HapMap3 dataset were used for estimation. The LD score derived from the 1KGp3 was used as input to ldsc. The Benjamini-Hochberg False Discovery Rate (FDR) procedure was used to correct for multiple testing across disorders (FDR-corrected p<0.05 was considered statistically significant). To visualize polygenic enrichment, conditional QQ plots42, 43 were made for BAG versus each disorder. In these plots, the QQ curves for the association statistics (-log10 p-values) for BAG were stratified by the corresponding association strength for the conditioned disorder. As the association strength to the conditioned disorder increases, a successive leftward deflation in these curves indicates polygenic enrichment. Similarly, conditional QQ for each disorder versus BAG shows polygenic enrichment in the reverse direction. ### Two-sample Mendelian randomization To study the cause-effect relations between BAG and the five disorders, two sets of MR analyses were performed. The first set, using standard models, included the inverse-variance weighted model (IVW)44, weighted median (wMed)45, Egger regression (Egger)46 and MR-PRESSO (PRESSO)47. For these analyses, only genome wide significant SNPs (p<5×10−8) to the exposure traits or disorders were used as potential instruments. The PLINK program and the LD structure of 1KGp3 dataset were used to select instruments with the following parameters, --*clump-kb 500 kb, --clump-p1 5×10**-8*, and *--clump-r2 0*.*01*. The TwoSampleMR package19 was used for data harmonization and causal inference for the IVW, wMed and Egger models. The same harmonized datasets were used as input to the MR-PRESSO software to assess outliers that may artificially affect MR estimates, *i*.*e*., SNPs that show horizontal pleiotropy to both BAG and disorders. Harmonized instrumental SNPs are shown in **Supplementary Tables S6**-**S15**. The second set of models included the robust adjusted profile score (RAPS)48 and the CAUSE models49. These models can make use of SNPs that show a suggestive level of association (p<10−3) with exposure to increase statistical power without incurring weak instrument bias in estimation. While both models control for horizontal pleiotropy, CAUSE directly tests for a shared (correlated horizontal pleiotropy) versus a causal model for each relation49. Instead of estimating causal effects, CAUSE provides z scores for such tests; negative z scores suggest the existence of a causal relation and positive suggest otherwise. The same instrument selection procedures used in the first set of models were used here, except that 10−3 was taken as the cut-off for selecting instruments, *i*.*e*. –*clump-p1 10**-3*. As each of the six MR models has different assumptions that are difficult to verify in real data, a majority vote ensemble scheme was used to make conclusions for the existence of cause-effect relations: specifically, only when four or more models indicated a cause-effect relation (FDR adjusted p<0.05) was such a relation considered causal. In addition to applying multiple MR models, GWAS results for height measured for European samples50 (n=253,288; [https://portals.broadinstitute.org/collaboration/giant/](https://portals.broadinstitute.org/collaboration/giant/)), for AD diagnosed in a Japanese sample51 (n case=3,962 and n control=4,047) and an African sample52 (n case=2,784 and n control=5,222) and for BIP diagnosed in a Japanese sample53(n case=2,964 and n control=61,887) were used to corroborate MR findings. As commonly done in genetic studies, height was used as a negative exposure control to test if population stratification could generate spurious causal effects 54. The non-European GWAS data were used to test if any observed causal effects generalize across ancestry groups, although with significantly smaller sample sizes. ## Results We obtained accurate brain age estimates; mean absolute errors (MAEs) in each of the five disjoint folds were consistently below 2.5 years (**Supplementary Table S2**). This was consistent when we split the dataset into different subsets based on covariates (MAE = 2.40 in females compared to 2.53 in males; 2.40 in the youngest half compared to 2.52 in the oldest), although we observed a slight age bias (**Supplementary Figure S9**). Based on the meta-analyzed GWAS results, we estimated a SNP heritability of 0.27 (standard error (SE)=0.036) for BAG (**Methods**). Our estimate is comparable to or higher than the two previously reported estimates (0.26, SE=0.0446; 0.19, SE=0.0218). We identified eight independent genomic loci significantly associated with BAG (Fig. 1 a,b and **Table 1**). Associations of lead SNPs in these regions to BAG are highly consistent in directions and effect sizes across the five folds (**Supplementary Table S3**). Among these loci, the one in the inversion region on chromosome 17 (lead SNP rs2106786), including the *MAPT* gene, has been previously reported18, 19, although indexed by a different SNP (rs2435204). This SNP was also highly significant in our analysis (p=5.4×10−21, beta=0.27 years, effective-allele=G). The locus containing the *RUNX2* gene (lead by rs2790102: p=8.92×10−9, beta=-0.15 years, effective-allele=A), which showed suggestive significance in Jonsson *et al*.18, was genome-wide significant in the present study. The *RUNX2* gene codes for a master transcription factor which plays a critical role in skeletal development55. Among the remaining six novel loci, the rs79107704-A allele showed the largest association with BAG; one copy of this allele was associated with an average increase in brain age of 0.63 years (**Table 1**). This SNP is located 3,405bp downstream of the Betaine-homocystein S-methyltransferase 2 gene (*BHMT2*, **Fig. 1b**), a gene whose product is involved in choline metabolism during development56. Other protein-coding genes that are closest to the lead SNPs include those involved in calcium signaling (*CAMK2N2* and *INPP5A*) and metabolism and transcription regulation (*GALC, KLF3* and *KLHL38*), both processes are implicated in biological ageing57. In **Supplementary Table S5** we present detailed annotations of biological functions to each gene. ![Figure 1](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/01/26/2023.01.09.23284310/F1.medium.gif) [Figure 1](http://medrxiv.org/content/early/2023/01/26/2023.01.09.23284310/F1) Figure 1 Genetic associations for brain age gap. **a**. The Manhattan plot of meta-analyzed association results for brain age gap (BAG). Chromosome numbers are shown on x axis, -log10 association p values on y axis and lead SNP rs-numbers in the plot. **b**. Region plots for each of the eight associated regions. Genes located in each region are shown below each figure. Linkage disequilibrium r-squared values are indicated by colors; and recombination frequences by curves. **c**. Expression levels of the annotated genes across tissues analyzed by the GTEx v8 study. Colors indicate average log2 transformed expression level in each tissue. We further annotated these identified SNPs to nearby genes and regulatory elements (**Methods**). Most of the associated SNPs are in non-coding regions such as intergenic, intron or untranslated regions (**Supplementary Figs. S3** and **S4**). Using the default parameters in FUMA29, 54 unique genes were found to be implied by these significant associations by genomic position. The expression levels of these genes in the 54 tissue types from the GTEx v8 project33 showed three remarkable patterns (**Fig. 1c**). The first set of genes expressed highly across almost all 54 tissues; the second set of genes showed low expression levels in most tissue types; and the last set, including eight genes, was highly expressed only in brain tissue types (**Fig. 1c**), for example *MAPT, GFAP*, and the Homeobox protein gene *NKX6-2*. These results suggest that BAG encodes coordinated physiological processes implicating both the brain and the peripheral systems. To nominate causal variants in each locus we performed statistical fine-mapping31 for regions around each lead SNP in **Table 1** (**Methods**). Except for the locus on chromosome 14 which was not resolvable, all loci clearly indicated that the 95% credible sets suggest a causal model with one causal SNP, instead of two, *i*.*e*., the posterior probability for the 1-SNP set were larger than those of the 2-SNP sets (**Supplementary Table S16**). Furthermore, four credible sets indicated that the lead SNPs were also the causal ones (posterior inclusion probability (PPI_1) > 0.05 and >PPI of the second most probable SNP(PPI_2)) (**Methods**; **Supplementary Table 16**) but identifying the causal SNP for the remaining were difficult. For example, the *MAPT* locus on chromosome 17 and the *RUNX2* locus on chromosome 6 showed two SNPs having almost equal and small PPIs (*i*.*e*., <=0.05), indicating that the true causal variants may be some untyped rare ones not investigated in this study. The clearest signal comes from the regions on chromosome 3 and 5, where the PPIs for the lead SNPs were much larger than that for the second most probable causal SNPs. We observed nominally significant genetic correlation between BAG and AD that did not survive FDR-correction (r=0.23, SE=0.1, p=0.02, FDR adjusted-p=0.13) and no significant associations with any other of the four disorders (**Fig. 2a**). SNP heritability estimates for the five disorders were all significant but varied greatly; SCZ showed the largest (0.34, SE=0.01) and AD showed the lowest (0.01, SE=0.005) estimates. Bidirectional conditional QQ plots (**Fig. 2b-d**; **Supplementary Figs. S6** and **S7**) showed that there was noticeable genetic enrichment for BIP conditional on BAG but not in the reverse direction. For AD and PD, both directions showed clear enrichment, surprisingly for PD which did not show significant genetic correlation with BAG (r=-0.07, p=0.42). ![Figure 2](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/01/26/2023.01.09.23284310/F2.medium.gif) [Figure 2](http://medrxiv.org/content/early/2023/01/26/2023.01.09.23284310/F2) Figure 2 Polygenic genetic overlap between brain age gap and disorders. **a**. Genetic correlation between brain age gap and disorders computed by ldsc. SNP heritability and its standard error are indicated. **b-d**. Conditional QQ plot between brain age gap and disorders in both directions. Colors are used to indicate different association strength to the conditioned traits, i.e., the ones indicated after the vertical bar in each figure. Dashed diagonal lines indicate expected null distributions. Abbreviations used: AD, Alzheimer’s disease; BIP, bipolar disorder; MDD, major depression disorder; PD, Parkinson’s disease; SCZ, schizophrenia. We then performed extensive MR analyses using six different models to examine the existence of cause-effect relations between BAG and the five disorders (**Methods**). **Figure 3a** shows that BAG was only causally associated with PD, *i*.*e*., four out of the six MR models showed a negative relation with varying effect sizes (all with adjusted p <0.05). One year increase in genetically predicted BAG was estimated to reduce the risk of PD by a log odds ratio from 1.4 (by Egger regression) to 0.02 (by MR-RAPS) (**Supplementary Table S15**). In the reverse direction (*i*.*e*., disorders as exposure), increased genetic risk for AD and BIP were causally associated with increased BAG (30 and 55 SNPs used as instruments, respectively); these estimated causal effects on BAG were relatively larger for AD than BIP (**Fig. 3b; Supplementary Table S16**). ![Figure 3](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/01/26/2023.01.09.23284310/F3.medium.gif) [Figure 3](http://medrxiv.org/content/early/2023/01/26/2023.01.09.23284310/F3) Figure 3 Causal inference between brain age gap and disorders. **a**. Causal effect of brain age gap (BAG) on risk of disorders; **b**. Causal effect of genetic risk of disorders on BAG. Colors indicate different models; triangle indicates significant effect after false discovery correction. Estimated standard errors for each effect are aslo shown. **c**. Scatter plots of SNP effects on AD (x axis) and BAG (y axis). **d**. Scatter plots of SNP effects on BIP (x axis) and BAG (y axis). **e**. Scatter plots of SNP effects BAG (x axis) and PD (y axis). **f**. Scatter plots of SNP effects on PD (x axis) and BAG (y axis). Causal effects estimated by the five models (except CAUSE) are shown by fitted lines; slopes of these lines indicate causal effect sizes. Exceptional SNPs are marked by boxes which include SNP rs-numbers and genome location in th hg19 coordinates. Abbreviation used: AD, Alzheimer’s disease; BIP, bipolar disorder; MDD, major depression disorder; PD, Parkinson’s disease; SCZ, schizophrenia. A close investigation into the scatter plots of instrumental SNPs showed that the causal effect of AD on BAG was primarily driven by a SNP (rs59007384) in the APOE region, which was not identified as a horizontal pleiotropic instrument by MR-PRESSO (outlier test p>0.05) (**Fig. 3c**); there were no extreme instruments identified for the BIP to BAG relation by MR-PRESSO (**Fig. 3d**) but Egger-regression indicated existence of horizontal pleiotropy (Egger intercept test: p=0.017). The causal effects of BAG on PD were primarily driven by two SNPs in the inversion region on chromosome 17, effective alleles of these SNPs were associated with higher BAG and lower risk of PD (**Fig. 3e**). SNPs in the same region also drove the negative causal relation (not significant) from PD to BAG (**Fig. 3f**). However, both SNPs were flagged as horizontal pleiotropic instruments by MR-PRESSO (p<0.05) and Egger-regression (Egger intercept test: p=0.03 and 0.008, respectively). Therefore, the observed negative relations between BAG and PD are less likely to be causal. We used the GWAS results for height of European samples and cross-ancestry MR analysis to corroborate the identified causal relations (**Methods**). We found no causal effect between BAG and height with any of the MR methods employed (all p > 0.05). Therefore, our observed AD and BIP to BAG relations are less likely to be driven by population stratification, *i*.*e*., both the exposure and outcome data originating from the same ancestry group. There was also no significant cross-ancestral causal effect detected using AD data from Japanese or African samples (IVW p=0.74, 0,85, respectively), and BIP data from the Japanese sample to BAG (beta=0.10, p=0.13). ## Discussion Combining the advantages of large samples and advanced models for brain age prediction, we confirmed that BAG is a heritable and polygenic trait, and estimated the genetic pleiotropy and causal genetic relations with major brain and mental disorders. We identified seven novel loci associated with BAG, in addition to confirming the previously reported *MAPT* loci18, 19. While MR indicated that increased genetic risk for AD or BIP may be causally associated with higher BAG, our results demonstrate that individual variability and previously reported case-control differences in BAG only to a marginal degree should be attributed to the genetic risk of the respective diseases. Functional annotation of the genes linked to the identified loci confirms that deviations in BAG are linked to complex processes encompassing multiple biological systems20. While earlier work observed this variety when investigating different multimodal aspects of imaging data linked to brain aging, our findings suggest it also exists when looking at a singular BAG computed from only T1-weighted structural imaging. Our coarse division of the implied 54 genes into three groups indicates that only eight genes are specifically expressed in brain tissues. The remaining genes were either expressed in abundance across all tissue types tested, including the brain, or expressed at very low levels across all tissues. Nonetheless, the proteins coded by these non-brain specific genes have been implicated in brain-related disorders or traits (**Supplementary Table S5**). For example, among the genes we found to be expressed across all tissue types (group 1), mutations in *AP2M1* have been linked to epilepsy, intellectual developmental disorder, and seizures58; among the genes expressed in low levels across tissues (group 2), *STH* has been associated with frontotemporal dementia and 17q21.31 duplication syndrome59, 60. In addition, although our analysis revealed no significant pathway enrichment, these 54 genes contribute to biological functions that include calcium signaling, protein metabolism, DNA damage repair and general innate immune defense. Thus, our analyses highlight the role of these diverse sets of processes affecting the brain throughout life. Prior work has shown higher BAG in patients with a multitude of disorders compared to healthy controls5, 6, 8, 9, 11, and has documented partly overlapping genetic associations between BAG and clinical conditions6. However, the causal effects have remained unclear. Our MR approach revealed that genetically predicted risks for AD or BIP were causally associated with increased BAG. However, these relations were only weakly supported by genetic correlation analysis. One possible explanation for this weaker support from genome wide signals (genetic correlation) in contrast to MR (significant associations only) might be due to heterogeneous genetic correlations across the genome, *i*.*e*. some genomic regions show positive correlations while others show negative correlations61, 62. In such a scenario, the net genetic correlations between the two traits are expected to be lower than regional correlations. The causal effect of genetically predicted risk to AD on BAG was small but consistent in directions across the six MR models, four of which were significant after multiple-testing correction. For BIP, although four models showed significant effects, the CAUSE model suggested an opposite direction of effect to the other five models. Thus, we advise careful interpretation of this result. Our attempts of testing across ancestral group causal relations led to largely null fundings for the AD to BAG relations. We believe these non-significant fundings are largely due to the lack of statistical power in the non-European GWAS51-53. The observed causal relations between genetically predicted risk of brain disorders and BAG are intriguing. One possible interpretation is that overt changes in the brain incurred by the disorders contribute to accelerated aging. Another possibility may be that lifestyle and health-related behaviors of patients with clinical conditions such as AD and BIP, *e*.*g*., medication63, may increase brain age. Yet another is that genetic variation drives early life changes in brain structure which contribute to both phenotypes later in life. Given the comparable sample sizes to the GWAS of AD and BIP and the widely observed clinical correlations, surprisingly, no genetic nor causal relations of SCZ and MDD with BAG were found. On the one hand this may suggest that previously reported case-control differences do not reflect causal relations, but rather a combination of indirect and confounding factors. For example, smoking and physical exercise have been associated both with MDD and SCZ64-67 and brain age5, 66. Alternatively, it has been shown that both BAG20 and psychiatric disorders are highly heterogeneous phenotypes68, 69, and thus further identification and characterization of the causal relations may require even larger, and carefully screened, samples. It is also worth noting that while the sample sizes for the disorders are large, the one underlying our BAG GWAS is relatively small. Thus, our null findings in the direction from BAG to disorders may be due to too weak instruments70. Our initial results showed weak evidence of a causal relation between BAG and PD, corroborating two recent studies which reported a weaker correlation between BAG and PD 71, 72. Striking patterns of enrichment between the two were shown in the conditional QQ plots and four out of six MR models indicated that genetically predicted BAG may have protective effect from PD. However, we found that these relations were completely caused by the *MAPT* gene region on chromosome 17: after removing chromosome 17 from our analyses, no enrichment was observed in either direction (**Supplementary Fig. S8**). In addition, instrumental SNPs in this region were detected by MR-PRESSO as horizontal pleiotropic SNPs, *i*.*e*., affecting BAG and PD through independent biological pathways. Reperforming MR analysis excluding these outlier SNPs confirmed null causal relations. Thus, we conclude that we found no evidence for causal relations between genetically predicted risk for PD and BAG. Our analytic procedures also highlight the importance of triangulation and converging evidence in causal inference analysis73. While the present study advances current knowledge regarding the genetic architecture of and causal contributions to BAG, the results should be interpreted with caution. Although we confirm previously reported genetic associations with BAG, e.g., the *MAPT* gene locus18, 19, our sample overlaps with previous ones—which were also based on UK Biobank data. We attempted replicating our findings in three independent but small samples (n ranges from 321 to 702; **Supplementary Analysis** and **Table S4**) but no clear replications were achieved. Therefore, independent large-scale samples are needed for replication. We used a simple voting schema across six different MR models to infer causal relations between genetically predicted BAG and brain disorders. Furthermore, as we only identified eight independent loci showing significant associations to BAG, other models74 that require large number of genome-wide significant instruments were considered not applicable. However, it should be noted our simple voting approach may not be the most efficient strategy for identifying causal effects. Formal development of ensemble methods, such as bagging75, may provide better grounds for precise interpretation. Furthermore, our BAG GWAS is still smaller than GWAS performed for the disorders, which may partly explain the lack of causal effects of BAG on brain disorders. Therefore, to increase our confidence in the identified relations, large-scale data for BAG, and replications in independent datasets are needed. Relatedly, our estimation of brain age was based on cross-sectional samples, which makes its interpretation non-trivial76, and studies built on longitudinal data could help disentangle its complexities. Finally, although we refer to our brain age estimation in general terms, it is important to reaffirm that it is based on T1-weighted imaging data. The brain is a complex and heterogeneous organ, and different imaging modalities are known to capture different aspects of the naturally occurring variation. Thus, studies relying on other modalities, either independently or in combination, could reveal a broader set of associations77. In conclusion, the present study increases the yield of genetic associations with brain age to eight genomic loci; implicated genes indicate involvement of calcium signaling, DNA damage repair, protein metabolism, and general innate immune defense. Our analysis did not provide evidence of a causal relationship between BAG and the included clinical conditions, and their interactions remain unclear. ## Data Availability All data used in the study are from openly accessible datasets. Requests for the individual dataset have to be placed with the individual principal investigators ## Conflict of Interests All authors declared no conflict of interests. ## Funding This study is supported by Norwegian Research Council (No. 223273, 298646, 300767, 302854), the UiO:Life Science Convergence environment, University of Oslo, Norway (4MENT), the South-Eastern Norway Regional Health Authority (2019101), KG Jebsen Stiftelsen, and the European Research Council under the European Union’s Horizon 2020 research and Innovation program (ERC StG, Grant 802998). ## Author contributions EL, TW, and YW designed this study; EL, DVP, OF, AAS, OI, AMG and YW performed data analysis; EL, DVP, JMR, OI, TK, LTW, OOA, TW, BT, SMS, and YW interpreted the results; EL, DVP, JMR, OI, YW prepared the first draft of the manuscripts; All authors contributed and approved the final draft. ## Acknowledgement This research has been conducted using the UK Biobank Resource under Application Number 27412. We thank the computational resources provided by UNINETT Sigma2-the National Infrastructure for High Performance Computing and Data Storage in Norway – with project no. (nn9769k/ns9769k), the PGC consortium and the International Parkinson Disease Genomics Consortium (IPDGC) for sharing of their GWAS results. ## Footnotes * Updated with acknowledgement to the UK Biobank * Received January 9, 2023. * Revision received January 26, 2023. * Accepted January 26, 2023. * © 2023, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.Franke K, Gaser C. Ten Years of BrainAGE as a Neuroimaging Biomarker of Brain Aging: What Insights Have We Gained? Frontiers in Neurology 2019; 10: 789. 2. 2.Cole JH, Franke K. Predicting Age Using Neuroimaging: Innovative Brain Ageing Biomarkers. Trends in neurosciences 2017; 40(12): 681–690. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.tins.2017.10.001&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 3. 3.Franke K, Luders E, May A, Wilke M, Gaser C. Brain maturation: Predicting individual BrainAGE in children and adolescents using structural MRI. NeuroImage 2012; 63(3): 1305–1312. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2012.08.001&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22902922&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000310379100032&link_type=ISI) 4. 4.Smith SM, Vidaurre D, Alfaro-Almagro F, Nichols TE, Miller KL. Estimation of brain age delta from brain imaging. NeuroImage 2019; 200: 528–539. 5. 5.Leonardsen EH, Peng H, Kaufmann T, Agartz I, Andreassen OA, Celius EG et al. Deep neural networks learn general and clinically relevant representations of the ageing brain. NeuroImage 2022; 256: 119210. 6. 6.Kaufmann T, van der Meer D, Doan NT, Schwarz E, Lund MJ, Agartz I et al. Common brain disorders are associated with heritable patterns of apparent aging of the brain. Nature Neuroscience 2019; 22(10): 1617–1623. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41593-019-0471-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31551603&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 7. 7.Cole JH, Ritchie SJ, Bastin ME, Valdés Hernández MC, Muñoz Maniega S, Royle N et al. Brain age predicts mortality. Molecular Psychiatry 2018; 23(5): 1385–1392. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 8. 8.Gaser C, Franke K, Klöppel S, Koutsouleris N, Sauer H, Alzheimer’s Disease Neuroimaging I. BrainAGE in Mild Cognitive Impaired Patients: Predicting the Conversion to Alzheimer’s Disease. PloS one 2013; 8(6): e67346. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0067346&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23826273&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 9. 9.Schnack HG, van Haren NEM, Nieuwenhuis M, Hulshoff Pol HE, Cahn W, Kahn RS. Accelerated Brain Aging in Schizophrenia: A Longitudinal Pattern Recognition Study. American Journal of Psychiatry 2016; 173(6): 607–616. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1176/appi.ajp.2015.15070922&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26917166&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 10. 10.Constantinides C, Han LK, Alloza C, Antonucci L, Arango C, Ayesa-Arriola R et al. Brain ageing in schizophrenia: evidence from 26 international cohorts via the ENIGMA Schizophrenia consortium. medRxiv 2022: 2022.2001.2010.21267840. 11. 11.Han LKM, Dinga R, Hahn T, Ching CRK, Eyler LT, Aftanas L et al. Brain aging in major depressive disorder: results from the ENIGMA major depressive disorder working group. Molecular Psychiatry 2021; 26(9): 5124–5139. 12. 12.Elliott ML, Belsky DW, Knodt AR, Ireland D, Melzer TR, Poulton R et al. Brain-age in midlife is associated with accelerated biological aging and cognitive decline in a longitudinal birth cohort. Molecular Psychiatry 2021; 26(8): 3829–3838. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41380-019-0626-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 13. 13.Kuhn T, Kaufmann T, Doan NT, Westlye LT, Jones J, Nunez RA et al. An augmented aging process in brain white matter in HIV. Human brain mapping 2018; 39(6): 2532–2540. 14. 14.Cole JH, Underwood J, Caan MWA, De Francesco D, van Zoest RA, Leech R et al. Increased brain-predicted aging in treated HIV disease. Neurology 2017; 88(14): 1349–1357. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1212/wnl.0000000000003790&link_type=DOI) 15. 15.Steffener J, Habeck C, O’Shea D, Razlighi Q, Bherer L, Stern Y. Differences between chronological and brain age are related to education and self-reported physical activity. Neurobiology of Aging 2016; 40: 138–144. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neurobiolaging.2016.01.014&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26973113&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 16. 16.Wrigglesworth J, Ward P, Harding IH, Nilaweera D, Wu Z, Woods RL et al. Factors associated with brain ageing - a systematic review. BMC neurology 2021; 21(1): 312. 17. 17.Cole JH, Poudel RPK, Tsagkrasoulis D, Caan MWA, Steves C, Spector TD et al. Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker. NeuroImage 2017; 163: 115–124. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2017.07.059&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 18. 18.Jonsson BA, Bjornsdottir G, Thorgeirsson TE, Ellingsen LM, Walters GB, Gudbjartsson DF et al. Brain age prediction using deep learning uncovers associated sequence variants. Nature Communications 2019; 10(1): 5409. 19. 19.Ning K, Zhao L, Matloff W, Sun F, Toga AW. Association of relative brain age with tobacco smoking, alcohol consumption, and genetic variants. Scientific Reports 2020; 10(1): 10. 20. 20.Smith SM, Elliott LT, Alfaro-Almagro F, McCarthy P, Nichols TE, Douaud G et al. Brain aging comprises many modes of structural and functional change with distinct genetic and biophysical associations. eLife 2020; 9: e52677. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7554/eLife.52677&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 21. 21.Smith GD, Ebrahim S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? International journal of epidemiology 2003; 32(1): 1–22. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyg070&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12689998&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000182341300001&link_type=ISI) 22. 22.Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife 2018; 7. 23. 23.Kolbeinsson A, Filippi S, Panagakis Y, Matthews PM, Elliott P, Dehghan A et al. Accelerated MRI-predicted brain ageing and its associations with cardiometabolic and brain disorders. Scientific Reports 2020; 10(1): 19940. 24. 24.Ségonne F, Dale AM, Busa E, Glessner M, Salat D, Hahn HK et al. A hybrid approach to the skull stripping problem in MRI. NeuroImage 2004; 22(3): 1060–1075. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2004.03.032&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15219578&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000222423200004&link_type=ISI) 25. 25.Jenkinson M, Beckmann CF, Behrens TEJ, Woolrich MW, Smith SM. FSL. NeuroImage 2012; 62(2): 782–790. 26. 26.Jenkinson M, Smith S. A global optimisation method for robust affine registration of brain images. Medical Image Analysis 2001; 5(2): 143–156. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1361-8415(01)00036-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11516708&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000169672400005&link_type=ISI) 27. 27.Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 2018; 562(7726): 203–209. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-018-0579-z&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30305743&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 28. 28.Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 2015; 4: 7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13742-015-0047-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25722852&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 29. 29.Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nature Communications 2017; 8(1): 1826. 30. 30.McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GR, Thormann A et al. The Ensembl Variant Effect Predictor. Genome Biology 2016; 17(1): 122. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13059-016-0974-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27268795&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 31. 31.Benner C, Spencer CC, Havulinna AS, Salomaa V, Ripatti S, Pirinen M. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics 2016; 32(10): 1493–1501. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btw018&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26773131&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 32. 32.Yap CX, Henders AK, Alvares GA, Wood DLA, Krause L, Tyson GW et al. Autismrelated dietary preferences mediate autism-gut microbiome associations. Cell 2021; 184(24): 5916-5931.e5917. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2021.10.015&link_type=DOI) 33. 33.Consortium G. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 2015; 348(6235): 648–660. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNDgvNjIzNS82NDgiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMy8wMS8yNi8yMDIzLjAxLjA5LjIzMjg0MzEwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 34. 34.Maglott D, Ostell J, Pruitt KD, Tatusova T. Entrez Gene: gene-centered information at NCBI. Nucleic Acids Research 2011; 39(suppl_1): D52–D57. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkq1237&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21115458&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000285831700010&link_type=ISI) 35. 35.1. Edwards D Boutet E, Lieberherr D, Tognolli M, Schneider M, Bairoch A. UniProtKB/Swiss-Prot. In: Edwards D (ed). Plant Bioinformatics: Methods and Protocols. Humana Press: Totowa, NJ, 2007, pp 89–112. 36. 36.Trubetskoy V, Pardiñas AF, Qi T, Panagiotaropoulou G, Awasthi S, Bigdeli TB et al. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature 2022;604(7906):502–508. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 37. 37.Mullins N, Forstner AJ, O’Connell KS, Coombes B, Coleman JRI, Qiao Z et al. Genomewide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nature Genetics 2021; 53(6): 817–829. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-021-00857-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34002096&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 38. 38.Wray NR, Ripke S, Mattheisen M, Trzaskowski M, Byrne EM, Abdellaoui A et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nature Genetics 2018; 50(5): 668–681. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0090-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29700475&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 39. 39.Jansen IE, Savage JE, Watanabe K, Bryois J, Williams DM, Steinberg S et al. Genomewide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nature Genetics 2019; 51(3): 404–413. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 40. 40.Nalls MA, Blauwendraat C, Vallerga CL, Heilbron K, Bandres-Ciga S, Chang D et al. Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: a meta-analysis of genome-wide association studies. The Lancet Neurology 2019; 18(12): 1091–1102. 41. 41.Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR et al. An atlas of genetic correlations across human diseases and traits. Nature Genetics 2015; 47(11): 1236–1241. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3406&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26414676&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 42. 42.Lo MT, Wang Y, Kauppi K, Sanyal N, Fan CC, Smeland OB et al. Modeling prior information of common genetic variants improves gene discovery for neuroticism. Human Molecular Genetics 2017; 26(22): 4530–4539. 43. 43.Chen CH, Wang Y, Lo MT, Schork A, Fan CC, Holland D et al. Leveraging genome characteristics to improve gene discovery for putamen subcortical brain structure. Scientific Reports 2017; 7(1): 15736. 44. 44.Relton CL, Davey Smith G. Two-step epigenetic Mendelian randomization: a strategy for establishing the causal role of epigenetic processes in pathways to disease. International journal of epidemiology 2012; 41(1): 161–176. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyr233&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22422451&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000302026800020&link_type=ISI) 45. 45.Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator. Genetic Epidemiology 2016; 40(4): 304–314. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/gepi.21965&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27061298&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 46. 46.Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. International journal of epidemiology 2015; 44(2): 512–525. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyv080&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26050253&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 47. 47.Verbanck M, Chen CY, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nature Genetics 2018; 50(5): 693–698. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0099-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29686387&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 48. 48.Zhao Q, Wang J, Hemani G, Bowden J, Small DS. Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score. Annals Statistics 2020; 48(3): 1742–1769. 49. 49.Morrison J, Knoblauch N, Marcus JH, Stephens M, He X. Mendelian randomization accounting for correlated and uncorrelated pleiotropic effects using genome-wide summary statistics. Nature Genetics 2020; 52(7): 740–747. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 50. 50.Wood AR, Esko T, Yang J, Vedantam S, Pers TH, Gustafsson S et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nature Genetics 2014; 46(11): 1173–1186. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3097&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25282103&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 51. 51.Shigemizu D, Mitsumori R, Akiyama S, Miyashita A, Morizono T, Higaki S et al. Ethnic and trans-ethnic genome-wide association studies identify new loci influencing Japanese Alzheimer’s disease risk. Translational psychiatry 2021; 11(1): 151. 52. 52.Kunkle BW, Schmidt M, Klein H-U, Naj AC, Hamilton-Nelson KL, Larson EB et al. Novel Alzheimer Disease Risk Loci and Pathways in African American Individuals Using the African Genome Resources Panel: A Meta-analysis. JAMA neurology 2021; 78(1): 102–113. 53. 53.Ikeda M, Takahashi A, Kamatani Y, Okahisa Y, Kunugi H, Mori N et al. A genome-wide association study identifies two novel susceptibility loci and trans population polygenicity associated with bipolar disorder. Molecular Psychiatry 2018; 23(3): 639–647. 54. 54.Sanderson E, Richardson TG, Hemani G, Davey Smith G. The use of negative control outcomes in Mendelian randomization to detect potential population stratification. International journal of epidemiology 2021; 50(4): 1350–1361. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyaa288&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33570130&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 55. 55.Lam K, Zhang DE. RUNX/CBF Transcription Factors☆. Reference Module in Biomedical Sciences. Elsevier2015. 56. 56.Kotb M, Geller AM. Methionine adenosyltransferase: Structure and function. Pharmacology & therapeutics 1993; 59(2): 125–143. 57. 57.López-Otín C, Blasco MA, Partridge L, Serrano M, Kroemer G. The Hallmarks of Aging. Cell 2013; 153(6): 1194–1217. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2013.05.039&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23746838&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000319979200007&link_type=ISI) 58. 58.Helbig I, Lopez-Hernandez T, Shor O, Galer P, Ganesan S, Pendziwiat M et al. A Recurrent Missense Variant in AP2M1 Impairs Clathrin-Mediated Endocytosis and Causes Developmental and Epileptic Encephalopathy. The American Journal of Human Genetics 2019; 104(6): 1060–1072. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2019.04.001&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 59. 59.Mc Cormack A, Taylor J, Te Weehi L, Love DR, George AM. A Case of 17q21.31 Microduplication and 7q31.33 Microdeletion, Associated with Developmental Delay, Microcephaly, and Mild Dysmorphic Features. Case Reports in Genetics 2014; 2014: 658570. 60. 60.Arbogast T, Iacono G, Chevalier C, Afinowi NO, Houbaert X, van Eede MC et al. Mouse models of 17q21.31 microdeletion and microduplication syndromes highlight the importance of Kansl1 for cognition. PLOS Genetics 2017; 13(7): e1006886. 61. 61.Frei O, Holland D, Smeland OB, Shadrin AA, Fan CC, Maeland S et al. Bivariate causal mixture model quantifies polygenic overlap between complex traits beyond genetic correlation. Nature Communications 2019; 10(1): 2417. 62. 62.van Rheenen W, Peyrot WJ, Schork AJ, Lee SH, Wray NR. Genetic correlations of polygenic disease traits: from theory to practice. Nature Reviews Genetics 2019; 20(10): 567–581. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31171865&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 63. 63.Van Gestel H, Franke K, Petite J, Slaney C, Garnham J, Helmick C et al. Brain age in bipolar disorders: Effects of lithium treatment. Australian & New Zealand Journal of Psychiatry 2019; 53(12): 1179–1188. 64. 64.Pasco JA, Williams LJ, Jacka FN, Ng F, Henry MJ, Nicholson GC et al. Tobacco smoking as a risk factor for major depressive disorder: population-based study. British Journal of Psychiatry 2008; 193(4): 322–326. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTA6ImJqcHJjcHN5Y2giO3M6NToicmVzaWQiO3M6OToiMTkzLzQvMzIyIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDEvMjYvMjAyMy4wMS4wOS4yMzI4NDMxMC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 65. 65.Winterer G. Why do patients with schizophrenia smoke? Current opinion in psychiatry 2010; 23(2). 66. 66.Sanders A-M, Richard G, Kolskår K, Ulrichsen KM, Kaufmann T, Alnæs D et al. Linking objective measures of physical activity and capability with brain structure in healthy community dwelling older adults. NeuroImage: Clinical 2021; 31: 102767. 67. 67.Brokmeier LL, Firth J, Vancampfort D, Smith L, Deenik J, Rosenbaum S et al. Does physical activity reduce the risk of psychosis? A systematic review and meta-analysis of prospective studies. Psychiatry research 2020; 284: 112675. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.psychres.2019.112675&link_type=DOI) 68. 68.Wolfers T, Doan NT, Kaufmann T, Alnaes D, Moberget T, Agartz I et al. Mapping the Heterogeneous Phenotype of Schizophrenia and Bipolar Disorder Using Normative Models. JAMA Psychiatry 2018; 75(11): 1146–1155. 69. 69.Wolfers T, Rokicki J, Alnæs D, Berthet P, Agartz I, Kia SM et al. Replicating extensive brain structural heterogeneity in individuals with schizophrenia and bipolar disorder. Human brain mapping 2021; 42(8): 2546–2555. 70. 70.Taschler B, Smith SM, Nichols TE. Causal inference on neuroimaging data with Mendelian randomisation. NeuroImage 2022; 258: 119385. 71. 71.Eickhoff CR, Hoffstaedter F, Caspers J, Reetz K, Mathys C, Dogan I et al. Advanced brain ageing in Parkinson’s disease is related to disease duration and individual impairment. Brain Communications 2021; 3(3): fcab191. 72. 72.Charissé D, Erus G, Pomponio R, Gorges M, Schmidt N, Schneider C et al. Brain age and Alzheimer’s-like atrophy are domain-specific predictors of cognitive impairment in Parkinson’s disease. Neurobiology of aging 2022; 109: 31–42. 73. 73.Lawlor DA, Tilling K, Davey Smith G. Triangulation in aetiological epidemiology. International journal of epidemiology 2016; 45(6): 1866–1886. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyw314&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28108528&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F01%2F26%2F2023.01.09.23284310.atom) 74. 74.Sanderson E, Glymour MM, Holmes MV, Kang H, Morrison J, Munafò MR et al. Mendelian randomization. Nature Reviews Methods Primers 2022; 2(1): 6. 75. 75.Büchlmann P, Yu B. Analyzing Bagging. The Annals of Statistics 2002; 30(4): 927–961. 76. 76.Vidal-Pineiro D, Wang Y, Krogsrud SK, Amlien IK, Baaré WFC, Bartres-Faz D et al. Individual variations in ‘brain age’ relate to early-life factors more than to longitudinal brain change. eLife 2021; 10: e69995. 77. 77.Rokicki J, Wolfers T, Nordhøy W, Tesli N, Quintana DS, Alnæs D et al. Multimodal imaging improves brain age prediction and reveals distinct abnormalities in patients with psychiatric and neurological disorders. Human brain mapping 2021; 42(6): 1714–1726. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/hbm.25323&link_type=DOI)