Multi-ancestry genome-wide meta-analysis in Parkinson’s disease ================================================================= * Jonggeol Jeffrey Kim * Dan Vitale * Diego Véliz Otani * Michelle Lian * Karl Heilbron * the 23andMe Research Team * Hirotaka Iwaki * Julie Lake * Caroline Warly Solsberg * Hampton Leonard * Mary B. Makarious * Eng-King Tan * Andrew B. Singleton * Sara Bandres-Ciga * Alastair J Noyce * Cornelis Blauwendraat * Mike A. Nalls * Jia Nee Foo * Ignacio Mata ## Abstract Although over 90 independent risk variants have been identified for Parkinson’s disease using genome-wide association studies, all studies have been performed in just one population at the time. Here we performed the first large-scale multi-ancestry meta-analysis of Parkinson’s disease with 49,049 cases, 18,785 proxy cases, and 2,458,063 controls including individuals of European, East Asian, Latin American, and African ancestry. In a single joint meta-analysis, we identified 78 independent genome-wide significant loci including 12 potentially novel loci (*MTF2, RP11-360P21*.*2, ADD1, SYBU, IRS2, USP8:RP11-562A8*.*5, PIGL, FASN, MYLK2, AJ006998*.*2, Y_RNA, PPP6R2*) and finemapped 6 putative causal variants at 6 known PD loci. By combining our results with publicly available eQTL data, we identified 23 genes near these novel loci whose expression is associated with PD risk. This work lays the groundwork for future efforts aimed at identifying PD loci in non-European populations. Keywords * Parkinson’s disease * meta-analysis * multi-ancestry * genetics * genomics * population genetics ## Introduction Parkinson’s disease (PD) is a neurodegenerative disease pathologically defined by Lewy body inclusions in the brain and the death of dopaminergic neurons in the midbrain. The identification of genetic risk factors is imperative to mitigating the global burden of PD, one of the fastest growing age-related neurodegenerative diseases. The largest PD Genome-Wide Association Study (GWAS) meta-analysis to date uncovered 90 independent genetic risk variants in individuals of European ancestry 1. Similarly, large-scale PD GWAS meta-analyses of East Asian2 and a single GWAS of Latin American3 individuals have each identified 2 risk loci that were not previously identified in Europeans. For PD, there are now large-scale efforts to sequence and analyze genomic data in under-represented populations with the goal of both identifying novel associated loci, fine mapping known loci, and addressing the inequality that exists in current precision medicine efforts4,5. Here we performed a large-scale multi-ancestry GWAS meta-analysis of PD by including individuals from four ancestral populations. This effort can serve as a guide for future genetic analyses to increase ancestral representation. ## Methods In brief, we used a single joint meta-analysis study design (Figure 1) due its efficiency over a two-stage GWAS design6. All datasets underwent quality control filters and were harmonized to hg19. We used datasets representing four different ancestry groups: European, East Asian, Latin American, and African. In addition to previously described results from European1, East Asian2, and Latin American3 cohorts, we also used FinnGen and additional datasets for East Asian, Latin American and African cohorts from 23andMe, Inc. All multi-ancestry meta-analyses were conducted using a union of a random-effects model and a multi-ancestry meta-regression model using PLINK 1.97 and MR-MEGA8. MR-MEGA has the greatest power to detect heterogeneous effects across the different cohorts, while the random-effects model has greater power to detect homogenous allelic effects8. In total, we included 49,049 PD cases, 18,618 proxy cases (first-degree relative with PD), and 2,458,063 neurologically-healthy controls. ![Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/08/06/2022.08.04.22278432/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2022/08/06/2022.08.04.22278432/F1) Figure 1. Study design of Multi-Ancestry Meta-Analysis ![Figure 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/08/06/2022.08.04.22278432/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2022/08/06/2022.08.04.22278432/F2) Figure 2. : Miami plot of random-effect and MR-MEGA meta-analyses. Top: random effect; Bottom: MR-MEGA. Dotted lines indicate the bonferroni adjusted significant threshold of P < 5 × 10−9. All -log10P values greater than 40 were truncated to 40 for visual clarity. SNPs significant in random-effect and MR-MEGA models are colored red and orange respectively. Novel loci are indicated by the nearest gene. Bonferroni-adjusted alpha for genome-wide significant variants was set to a more stringent 5 × 10−9 for all multi-ancestry meta-analyses to account for the larger number of haplotypes resulting from the ancestrally diverse datasets9. Fine-mapping was also performed using MR-MEGA, which uses ancestry heterogeneity to increase fine-mapping resolution. We ran our meta-analysis results through the Proportion of Population-specific and Shared Causal Variants (PESCA)10 software to estimate the population specific and shared burden on the variants found in the meta-analysis. Only East Asian and European populations were compared due to the smaller sample size of the other populations. We used the Functional Mapping and Annotation (FUMA) software11,12 to functionally annotate the random effect results and run Multi-marker Analysis of GenoMic Annotation (MAGMA)13 for gene-ontology, tissue-level, and single-cell expression data. We also searched our novel loci through GTEx v8 brain tissue eQTLs and multi-ancestry eQTL meta-analysis of the brain14 to correlate novel loci with gene expression data, then searched the significant-eQTL genes and genes near the loci with previously completed summary-based mendelian randomization (SMR)15 results to correlate said genes with PD risk. Detailed methods are available in the Online Methods and the Data and Code Availability sections. ## Results ### Multi-ancestry meta-analyses confirm 62 known loci and nominate 12 novel loci All together we found 12 novel PD risk loci and 66 hits in known risk loci from single ancestry GWAS (Supplementary Table S2). 9 of the novel loci found in the random-effect method showed homogeneous effects across the different ancestries. The 3 loci identified exclusively in MR-MEGA showed ancestrally heterogeneous effects. All three loci showed evidence for ancestral heterogeneity (PANC-HET < 0.05) but no significant residual heterogeneity (PRES-HET > 0.148), supporting the idea that the signals are due to population structural differences rather than other confounding factors. The *IRS2* locus (PANC-HET = 5.3 × 10−3) shows the biggest departure in allelic effects in the Finnish cohort (Supplementary Figure 4). The *MYLK2* locus has the main European cohort at odds with the Latin American and Finnish cohorts (PANC-HET = 0.035). While this is a novel single-trait GWAS locus, its lead SNP was previously discovered as a potential pleiotropic locus in a multi-trait conditional/conjunctional false discovery rate (FDR) study between schizophrenia and PD16. Lastly, the *AJ006998*.*2* locus had the most significant ancestral heterogeneity (PANC-HET = 4.74 × 10−5) and its effects were specific to European and African cohorts. We found 17 suggestive loci that failed to meet the significance threshold at random-effect but met the more common threshold of P < 5 × 10−8 in fixed-effect and P < 1 × 10−6 in random-effect (Supplementary Table S2). 14 of these regions qualified as novel loci. Two loci near *RP11-182I10*.*3* and *HS1BP3* were exclusively found in the 23andMe Latin American and African cohorts. The lead SNPs (rs578139575, rs73919910) for these loci are very rare in European populations but more common in Africans and Latin Americans (gnomAD v3.1.2 EUR: 0.0001616, 0.002307; AFR: 0.01637, 0.08837; AMR:0.004063, 0.01905). If confirmed, these loci would confer a large amount of PD protective effect (beta: -1.3, -0.54). These loci merit further studies in the African and Latino populations. ### Fine mapping finds 6 putative SNPs in 6 loci 6 putative causal variants were found in 6 loci in *TMEM163, TMEM175, SNCA, HIP1R, MAPT*, and *LSM7* (Table 4). All putative causal variants had posterior probabilities (PP) of causality > 0.99 and overlapped with known PD loci. Our results confirmed the previous results of the *TMEM175* M393T coding variant as the likely causal variant17. We reported only single-variant 99% credible sets as MR-MEGA assumes a single causal variant in a GWAS signal. 99% credible sets in 57 additional loci can be found in Supplementary Table S3. ### MAGMA Gene Ontology analysis finds enrichment in brain and pituitary tissues Out of 15,479 gene ontology sets in MsigDB c5, 111 gene sets were significantly enriched among the random effect hits, a significant increase from previous 11 gene sets in Nalls et al. 2019. Out of the 11, only 3 gene ontology terms were replicated in the multi-ancestry: vacuole (PFDR = 6.9 × 10−4), vacuolar part (PFDR = 0.009), and vacuolar membrane (PFDR = 0.01). At the tissue level the genes of interest were enriched in all brain cells as well as pituitary tissue (Supplementary Figure 9), consistent with the results from Nalls et al. 20191. When analyzing single-cell RNAseq data, there was no expression enrichment across 88 brain cell types in mouse brain data DropViz 18 (Supplementary Figure 13). When narrowed down to substantia nigra tissue, glutamate receptor (Grin2c) in neurons were enriched (Supplementary Figure 13). In human midbrain data 19 oculomotor and trochlear nucleus (OMTN), dopaminergic (DA1), neuroblast GABAnergic (NbGABA), and GABA were enriched (Supplementary Figure 13). Out of 15,479 gene ontology sets in MsigDB c5, 110 gene sets were significantly enriched among the random effect hits (Supplementary Table S6), a significant increase from previous 11 gene sets in Nalls et al. 2019. Out of the 11, only 3 gene ontology terms were replicated in the multi-ancestry: vacuole (PFDR = 6.9 × 10−4), vacuolar part (PFDR = 0.009), and vacuolar membrane (PFDR = 0.01). At the tissue level the genes of interest were enriched in all brain cells as well as pituitary tissue (Supplementary Figure 9), consistent with the results from Nalls et al. 20191. ### eQTL and SMR nominate 23 putative genes near novel loci When comparing the SNPs in novel loci with multi-ancestry brain eQTL results14, 28 genes showed high levels of association (Supplementary Figures 8). SMR found 23 genes near four novel loci associated with PD risk. Interestingly, *PPP6R2* and *CENPV* expression changes in *substantia nigra* were associated with PD risk. *PPP6R2* or protein phosphatase 6 regulatory subunit 2 is a regulatory protein for Protein Phosphatase 6 Catalytic Subunit (*PPP6C*), which is involved in the vesicle-mediated transport pathway. *CENPV* or Centromere Protein V is involved in centromere formation and cell division. Detailed results for eQTL and the SMR can be found in Supplementary Table S8, S9, and S10. ## Discussion This is the first large-scale GWAS meta-analysis of PD that incorporates multiple diverse ancestry populations. From the joint cohort analysis, we were able to confirm 65 independent known risk loci and identify 12 potentially novel risk loci, 9 with homogeneous effect and 3 with heterogeneous effect across the different cohorts. We further found 17 suggestive loci at the more traditional method of fixed-effect at 5 × 10−8 and fine-mapped 11 putative SNPs in 11 of the known loci by leveraging the diverse ancestry populations. We highlighted tissues and cell types involved with the loci, which were consistent with previous findings1. Finally we used SMR to nominate 23 putative genes near our novel loci. Novel loci contain genes previously implicated with PD. The *MTF2* and *PPP6R2* loci contain genes *TMED5* and *PPP6R2*, which intersect with golgi body20 and vesicular transport pathways21,22, both implicated in PD pathogenesis23–28. Because *substantia nigra* deterioration is a hallmark pathogenic feature of PD, *PPP6R2*, and *CENPV* expression in *substantia nigra* and their association with PD merit additional investigation. Within a known locus, a new independent signal was found in *RILPL1* (rs28659953), which interacts with LRRK2-phosphorylated Rab10 to block primary cilia generation29. All potentially novel PD loci identified in this analysis will require additional replication and functional validation to elucidate their role in PD pathogenesis. Previous findings in European populations found that the known variants explained 16–36% of the heritable risk1, and while we did not test for genetic liability of the novel loci, they have been added to the list of variants that explain potential heritable PD risk. We found that 26 of the 68 detected known PD loci have nominal ancestral heterogeneity (PANC-HET < 0.05) and 5 significant after bonferroni correction (PANC-HET < 0.05/4,512 significant SNPs) (Supplementary Table S2). This heterogeneity may be caused by significant differences in allele effect sizes and frequencies between the different ancestry populations and thus should be studied as potential loci with ancestrally divergent risk. 18 known loci from single-ancestry GWASes failed to replicate (Supplementary Table S4) which implies that the variants associated in these loci may only be relevant in certain populations. However, it is worth noting that there are large differences in statistical power for each included ancestry. Additional population-specific loci will likely reach significance when larger sample sizes are available for non-European datasets. Our fine mapping isolated several putative causal variants in previously discovered loci. *TMEM175*-rs34311866 has been previously identified as functionally relevant to PD risk17, which is consistent with our fine-mapping results. While the fine-mapping results provided by MR-MEGA are sufficient to identify putative causal variants for loci driven by one independent signal, multiple variants in a locus can contribute to complex traits. The additive and epistatic effects of multiple causal variants in a locus can be difficult to interpret when the effects associated with each independent signal are small. Although this is the largest multi-ancestry PD meta-analysis GWAS to date, the European population is still overrepresented in this analysis. Around 79% of effective PD cases are of European descent. Individuals of African descent were particularly underpowered at just 0.5% of the effective PD cases. The discoveries in our study warrant future efforts to expand the studies in more diverse populations. The Global Parkinson’s Genetics Program (GP2) is partnering with institutions that care for underrepresented populations to generate data for these underserved communities all over the world5, and we will continue the ongoing analysis as more participants are genotyped. Just as the first PD GWASs failed to identify significant signals 30,31, we are confident that future diverse-ancestry GWAS will produce impactful association results as sample sizes increase. Further efforts in multi-ancestry and non-European GWAS will identify loci that are more relevant to the global population and will continue to facilitate fine mapping efforts to identify the genetic variants that drive these associations. ## Supporting information Supplementary Tables and Figures [[supplements/278432_file05.zip]](pending:yes) ## Data Availability The analysis pipeline code is available on GP2 github: https://github.com/GP2code/GP2-WorkingGroups/tree/main/CD-DAWG-Complex-Data-Analysis/PROJECTS/GP2-Multiancestry-PD GWAS summary statistics for 23andMe datasets (post-Chang and data included in Chang et al. 2017 and Nalls et al. 2014) will be made available through 23andMe to qualified researchers under an agreement with 23andMe that protects the privacy of the 23andMe participants. Please visit research.23andme.com/collaborate/#publication for more information and to apply to access the data. An immediately accessible version of the summary statistics is available here https://drive.google.com/file/d/1TmDZNFgyQvsOZ0xu-aZmBpVCpeUUa0UX/ excluding Nalls et al. 2014, 23andMe post-Chang et al. 2017 and Web-Based Study of Parkinson's Disease (PDWBS) but including all analyzed SNPs. After applying with 23andMe, the full summary statistics including all analyzed SNPs and samples in this GWAS meta-analysis will be accessible to the approved researcher(s). [https://github.com/GP2code/GP2-WorkingGroups/tree/main/CD-DAWG-Complex-Data-Analysis/PROJECTS/GP2-Multiancestry-PD](https://github.com/GP2code/GP2-WorkingGroups/tree/main/CD-DAWG-Complex-Data-Analysis/PROJECTS/GP2-Multiancestry-PD) [https://drive.google.com/file/d/1TmDZNFgyQvsOZ0xu-aZmBpVCpeUUa0UX/](https://drive.google.com/file/d/1TmDZNFgyQvsOZ0xu-aZmBpVCpeUUa0UX/) ## Data and code availability The analysis pipeline code is available on GP2 github: [https://github.com/GP2code/GP2-WorkingGroups/tree/main/CD-DAWG-Complex-Data-Analysis/PROJECTS/GP2-Multiancestry-PD](https://github.com/GP2code/GP2-WorkingGroups/tree/main/CD-DAWG-Complex-Data-Analysis/PROJECTS/GP2-Multiancestry-PD) GWAS summary statistics for 23andMe datasets (post-Chang and data included in Chang et al. 2017 and Nalls et al. 2014) will be made available through 23andMe to qualified researchers under an agreement with 23andMe that protects the privacy of the 23andMe participants. Please visit research.23andme.com/collaborate/#publication for more information and to apply to access the data. An immediately accessible version of the summary statistics is available here [https://drive.google.com/file/d/1TmDZNFgyQvsOZ0xu-aZmBpVCpeUUa0UX/](https://drive.google.com/file/d/1TmDZNFgyQvsOZ0xu-aZmBpVCpeUUa0UX/) excluding Nalls et al. 2014, 23andMe post-Chang et al. 2017 and Web-Based Study of Parkinson’s Disease (PDWBS) but including all analyzed SNPs. After applying with 23andMe, the full summary statistics including all analyzed SNPs and samples in this GWAS meta-analysis will be accessible to the approved researcher(s). ## Potential Conflict of Interest K.H. and members of the 23andMe Research Team are employed by and hold stock or stock options in 23andMe, Inc. M.A.N.’s participation in this project was part of a competitive contract awarded to Data Tecnica International LLC by the National Institutes of Health to support open science research, he also currently serves on the scientific advisory board for Clover Therapeutics and is an advisor to Neuron23 Inc. A.J.N. reports consultancy and personal fees from AstraZeneca, AbbVie, Profile, Roche, Biogen, UCB, Bial, Charco Neurotech, uMedeor, Alchemab, and Britannia, outside the submitted work. ## Online Methods ### Study Design and Cohort Descriptions We used a single joint meta-analysis study design to maximize statistical power 6. We used datasets representing four different ancestry groups: European, East Asian, Latin American, and African. The meta-analysis included 49,049 PD cases, 49,049 PD proxy cases (participant with a parent with PD) and 2,458,063 neurologically normal controls (Table 1, Supplementary Table S1). GWAS results of European1, East Asian2, and Latin American3 populations were previously reported. African dataset as well as the additional Latin American and East Asian PD GWAS summary statistics were provided by 23andMe. The Finnish PD GWAS summary statistics was acquired from FinnGen Release 4 (G6\_PARKINSON_EXMORE). For the FinnGen data, we chose the endpoint “Parkinson’s Disease (more controls excluded)” (G6_PARKINSON_EXMORE), which excludes control participants with psychiatric diseases or neurological diseases. While some FinnGen GWAS results also include UK Biobank participants, our FinnGen data did not include any UK Biobank participants. View this table: [Table 1.](http://medrxiv.org/content/early/2022/08/06/2022.08.04.22278432/T1) Table 1. Cohort Description View this table: [Table 2.](http://medrxiv.org/content/early/2022/08/06/2022.08.04.22278432/T2) Table 2. Meta-analysis results of lead SNPs in the novel loci. RE – random effect, A1 – effect allele, A2 – alternate allele, ANC-HET – ancestry heterogeneity, RES-HET – residual heterogeneity View this table: [Table 3.](http://medrxiv.org/content/early/2022/08/06/2022.08.04.22278432/T3) Table 3. MR-MEGA fine mapping results for loci with a single SNP within the 99% credible set. Known PD genes are either known PD risk genes (*SNCA, TMEM175, MAPT*) or genes with the highest score in the nearest known PD locus by the PD GWAS Locus Browser52 ### 23andMe diverse ancestry data All self-reported PD cases and controls from 23andMe provided informed consent and participated in the research online, under a protocol approved by the external AAHRPP-accredited IRB, Ethical & Independent Review Services (E&I Review). Participants were included in the analysis on the basis of consent status as checked at the time data analyses were initiated. The name of the IRB at the time of the approval was Ethical & Independent Review Services. Ethical & Independent Review Services was recently acquired, and its new name as of July 2022 is Salus IRB ([https://www.versiticlinicaltrials.org/salusirb](https://www.versiticlinicaltrials.org/salusirb)). Samples were genotyped on one of five genotyping platforms: V1 and V2, which are variants of Illumina HumanHap550+ BeadChip; V3, Illumina OmniExpress+ BeadChip; V4, Illumina custom array that includes SNPs overlapping V2 and V3 chips; or V5, Illumina Infinium Global Screening Array. For inclusion, samples needed a minimal call rate of 98.5%. Genotyped samples were then phased using either Finch or Eagle232 and imputed using Minimac3 and a reference panel of 1000 Genomes Phase III33 and UK10K data34. For this replication study, samples were classified as African, East Asian, or Latino using a genotype-based pipeline35 consisting of a support vector machine and a hidden Markov model, followed by a logistic classifier to differentiate Latinos from African Americans. Unrelated individuals were included in the analysis, as determined via identity-by-descent (IBD). Variants were tested for association with PD status using logistic regression, adjusting for age, sex, the first five PCs, and genotyping platform. Reported p-values were from a likelihood ratio test. ### Multi-ancestry Meta-Analysis We performed a multi-ancestry meta-analysis of GWAS results using Meta-Regression of Multi-Ethnic Genetic Association (MR-MEGA)8 and PLINK 1.9. MR-MEGA performs a meta-regression by generating axes of genetic variation for each cohort, which are then used as covariates in the meta-analysis to account for differences in population structure. While MR-MEGA was able to generate 4 principal components as axes of genetic variation, 3 principal components visibly separated the super population ancestries and explained 98% of the population variance (Supplementary Figure 7). Therefore we used 3 principal components to minimize overfitting. MR-MEGA has reduced power to detect associations for variants with homogeneous effects across populations. It is therefore recommended to run MR-MEGA alongside another meta-analysis method. PLINK 1.9 was used to perform random-effect (RE) meta-analysis to detect homogenous allelic effects. Before the analysis, all datasets were harmonized to genome build hg19 using CrossMap36 and all variants were filtered by imputation score (r2 > 0.3) and minor allele frequency (MAF >= 0.001). Only autosomal variants were kept in the final results. In total 20,590,839 variants met the inclusion criteria. Only 5,662,641 SNPs were analyzed in the MR-MEGA analysis due to cohort-number requirements. Bonferroni-adjusted alpha was set to a more stringent 5 × 10−9 for all multi-ancestry meta-analyses to account for the larger number of haplotypes resulting from the ancestrally diverse datasets9. Genomic inflations were measured for all cohorts and the meta-analysis. Inflation for cohorts with large discrepancy between the case and control numbers were normalized to 1000 cases and 1000 controls. All inflation was nominal and below 1.02 (Supplementary Figure 3, Supplementary Table S1). No genomic control was applied prior to meta-analysis. We identified genomic risk loci within our meta-analysis results using Functional Mapping and Annotation (FUMA) v1.3.811,12. In brief, FUMA first identifies independent significant SNPs in the GWAS results by clumping all significant variants with the r2 threshold < 0.6, then a locus is defined by merging linkage disequilibrium (LD) blocks of all independent significant SNPs within 250kb of each other. Start and end of a locus is defined by identifying SNPs in LD with the independent significant SNPs (r2 ≥□0.6) and defining a region that encompasses all SNPs within the locus. The 1000 Genome reference panel with all ancestries were used to calculate the r2. To determine if any associated loci in the meta-analysis were not previously identified, all significant SNPs were compared to the 92 known PD risk variants found in the previous two major meta-analyses1,2. Two variants identified in the Latin American admixture population3 could not be replicated as the variants and their proxies were removed during quality control of the discovery cohort. When a genomic risk loci was scanned for significant variants in previous European and East Asian results. If the locus contained a significant hit in either results, the locus was annotated with the closest previously nominated risk SNP within 250kb. Any loci that did not have previously significant variants or failed the annotation step was considered a novel hit. ### Fine-Mapping Fine-mapping was performed using MR-MEGA8, which approximates a single-SNP Bayes’ factor in favor of association. This is reported as the natural log of Bayes’ factor (lnBF) per SNP in the MR-MEGA meta-analysis summary statistics. SNPs were selected at meta-GWAS significance level (P < 5 × 10−9). Posterior probabilities (PPs) of driving the association signal at each locus were calculated from the Bayes’ factor as follows: ![Formula][1] Where *⋀**j* is the Bayes’ factor of the *j*th SNP within a locus with *n* number of SNPs. Credible set of SNPs with PP (*π**j*) greater than 0.99 were accepted as putative causal variants. ### Putative variant burden estimation by population Proportion of population-specific and shared causal variants (PESCA)10 was used to estimate the population-specific and shared burden of the variants tested in the meta-analysis. In brief, genome-wide heritability was estimated for the European and East Asian GWAS summary statistics using LD Score regression (LDSC)37,38. Summary statistics of both populations were intersected with common variants with the 1000 Genome reference panels provided by PESCA, which have already been LD pruned (R2 > 0.95) and low frequency SNPs removed (minor allele frequency < 0.05). The intersected variants were further split according to independent LD regions from the European and East Asian populations. The genome-wide prior probabilities of population-specific and shared causal variants were calculated using default parameters or as otherwise recommended by PESCA, then the results were used to calculate the PP for each variant. When the lead SNP was unavailable in the results, proxy variants (r2 > 0.8) were used to approximate the PP for each variant. Other cohorts were not included due to sample size constraints for this method. ### Functional Annotation Functional annotation of the discovery results utilizing publicly available gene expression and ontology data was done using FUMA v1.3.811,12. The summary statistics were annotated by ANNOVAR39 through the FUMA platform. Our meta-analysis results were analyzed using Multi-marker Analysis of GenoMic Annotation (MAGMA)13 to compare gene expression data from gene sets and tissues in GTEx v840. FUMA tested 15,479 gene sets and gene ontology terms from MsigDB v741 as well as single-cell RNAseq expression data from mouse brain samples in DropViz18 and human ventral midbrain samples19. Test parameters were set to default. Results were adjusted for multiple tests using Benjamini–Hochberg False Discovery Rate (FDR) correction with the alpha of 0.05. Additional pathway analyses of genes mapped by FUMA SNP2GENE were performed through GENE2FUNC with default parameters. SNPs in the novel loci were searched in multi-ancestry brain eQTL meta-analysis results14. We used a P-value cutoff of 10−6 as previously described14. eQTL and GWAS comparison plots were generated using LocusCompare42. Multi-SNP Summary-Based Mendelian Randomization (SMR) was used to test if DNA methylation and/or RNA expression of genes near the novel loci were associated with PD risk15. The nearest genes from the lead SNPs, significant genes in the multi-ancestry meta-analysis brain eQTL results, and significant genes in GTEx v8 brain tissue were chosen for SMR. In total 44 genes near the novel loci were searched in a list of previously completed PD SMR results15,43–51. The association signals were adjusted using FDR correction with the alpha of 0.05 and all signals with PHEIDI > 0.05 were removed due to heterogeneity. ## Acknowledgements M.A.N.’s research was supported in part by the Intramural Research Program of the NIH, National Institute on Aging (NIA), National Institutes of Health, Department of Health and Human Services; project number ZO1 AG000535, as well as the National Institute of Neurological Disorders and Stroke (NINDS), and participation in this project was part of a competitive contract awarded to Data Tecnica International LLC by the National Institutes of Health to support open science research. I.F.M.’s research is supported by grants from the National Institute of Neurological Disorders and Stroke under (R01NS112499), the Parkinson’s Foundation (Stanley Fahn Junior Faculty Award and an International Research Grants Program award), the Michael J Fox Foundation, Aligning Science Across Parkinson’s Global Parkinson’s Genetic Project (ASAP-GP2) and the American Parkinson’s Disease Association. E.K.T. is supported by a grant from the National Medical Research Council Singapore, Open Fund Large Collaborative Grant (MOH-OFLCG-000207). J.N.F. is supported by the Singapore National Research Foundation Fellowship (NRF-NRFF2016-03) and the National Medical Research Council Singapore, Open Fund Individual Research Grant (MOH-000559). A.J.N. reports grants from Parkinson’s UK, Barts Charity, Cure Parkinson’s, NIHR, Innovate UK, Virginia Keiley benefaction, Alchemab, Aligning Science Across Parkinson’s Global Parkinson’s Genetics Program (ASAP-GP2) and Michael J Fox Foundation. This research has been conducted using the UK Biobank Resource under Application Number 33601. We want to acknowledge the participants and investigators of FinnGen study. We thank the research participants and employees of 23andMe. The following members of the 23andMe Research Team contributed to this study: Stella Aslibekyan, Adam Auton, Elizabeth Babalola, Robert K. Bell, Jessica Bielenberg, Katarzyna Bryc, Emily Bullis, Paul Cannon, Daniella Coker, Gabriel Cuellar Partida, Devika Dhamija, Sayantan Das, Sarah L. Elson, Nicholas Eriksson, Teresa Filshtein, Alison Fitch, Kipper Fletez-Brant, Pierre Fontanillas, Will Freyman, Julie M. Granka, Karl Heilbron, Alejandro Hernandez, Barry Hicks, David A. Hinds, Ethan M. Jewett, Yunxuan Jiang, Katelyn Kukar, Alan Kwong, Keng-Han Lin, Bianca A. Llamas, Maya Lowe, Jey C. McCreight, Matthew H. McIntyre, Steven J. Micheletti, Meghan E. Moreno, Priyanka Nandakumar, Dominique T. Nguyen, Elizabeth S. Noblin, Jared O’Connell, Aaron A. Petrakovitz, G. David Poznik, Alexandra Reynoso, Madeleine Schloetter, Morgan Schumacher, Anjali J. Shastri, Janie F. Shelton, Jingchunzi Shi, Suyash Shringarpure, Qiaojuan Jane Su, Susana A. Tat, Christophe Toukam Tchakouté, Vinh Tran, Joyce Y. Tung, Xin Wang, Wei Wang, Catherine H. Weldon, Peter Wilton, Corinna D. Wong This research was funded by Aligning Science Across Parkinson’s through the Michael J. Fox Foundation for Parkinson’s Research (MJFF). ## Footnotes * # denotes co-first authorship * & denotes co-corresponding authorship * Received August 4, 2022. * Revision received August 4, 2022. * Accepted August 6, 2022. * © 2022, Posted by Cold Spring Harbor Laboratory This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available for use under a CC0 license ## References 1. 1.Nalls MA, Blauwendraat C, Vallerga CL, et al. Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: a meta-analysis of genome-wide association studies. Lancet Neurol. 2019;18(12):1091–1102. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1474-4422(19)30320-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 2. 2.Foo JN, Chew EGY, Chung SJ, et al. Identification of Risk Loci for Parkinson Disease in Asians and Comparison of Risk Between Asians and Europeans: A Genome-Wide Association Study. JAMA Neurol. 2020;77(6):746–754. 3. 3.Loesch DP, Horimoto ARVR, Heilbron K, et al. Characterizing the Genetic Architecture of Parkinson’s Disease in Latinos. Ann Neurol. 2021;90(3):353–365. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/ana.26153&link_type=DOI) 4. 4.Rajan R, Divya KP, Kandadai RM, et al. Genetic Architecture of Parkinson’s Disease in the Indian Population: Harnessing Genetic Diversity to Address Critical Gaps in Parkinson’s Disease Research. Front Neurol. 2020;11. doi:10.3389/fneur.2020.00524 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fneur.2020.00524&link_type=DOI) 5. 5.Program TGPG, The Global Parkinson’s Genetics Program. GP2 : The Global Parkinson’s Genetics Program. Movement Disorders. 2021;36(4):842–851. doi:10.1002/mds.28494 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/mds.28494&link_type=DOI) 6. 6.Skol AD, Scott LJ, Abecasis GR, Boehnke M. Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies. Nat Genet. 2006;38(2):209–213. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng1706&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16415888&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000234953200016&link_type=ISI) 7. 7.Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13742-015-0047-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25722852&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 8. 8.Mägi R, Horikoshi M, Sofer T, et al. Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution. Hum Mol Genet. 2017;26(18):3639–3650. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddx280&link_type=DOI) 9. 9.Pulit SL, de With SAJ, de Bakker PIW. Resetting the bar: Statistical significance in whole-genome sequencing-based association studies of global populations. Genet Epidemiol. 2017;41(2):145–151. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 10. 10.Shi H, Burch KS, Johnson R, et al. Localizing Components of Shared Transethnic Genetic Architecture of Complex Traits from GWAS Summary Data. Am J Hum Genet. 2020;106(6):805–817. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2020.04.012&link_type=DOI) 11. 11.Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8(1):1826. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-017-01261-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 12. 12.Watanabe K, Umićević Mirkov M, de Leeuw CA, van den Heuvel MP, Posthuma D. Genetic mapping of cell type specificity for complex traits. Nat Commun. 2019;10(1):1–13. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-019-09078-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30602773&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 13. 13.de Leeuw CA, Mooij JM, Heskes T, Posthuma D. MAGMA: Generalized Gene-Set Analysis of GWAS Data. PLoS Comput Biol. 2015;11(4). doi:10.1371/journal.pcbi.1004219 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1004219&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25885710&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 14. 14.Zeng B, Bendl J, Kosoy R, Fullard JF, Hoffman GE, Roussos P. Multi-ancestry eQTL meta-analysis of human brain identifies candidate causal variants for brain-related traits. Nat Genet. Published online January 20, 2022. doi:10.1038/s41588-021-00987-9 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-021-00987-9&link_type=DOI) 15. 15.Wu Y, Zeng J, Zhang F, et al. Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits. Nat Commun. 2018;9(1):918. 16. 16.Smeland OB, Shadrin A, Bahrami S, et al. Genome-wide association analysis of Parkinson’s disease and schizophrenia reveals shared genetic architecture and identifies novel risk loci. Biol Psychiatry. 2021;89(3):227. 17. 17.Jinn S, Blauwendraat C, Toolan D, et al. Functionalization of the TMEM175 p.M393T variant as a risk factor for Parkinson disease. Hum Mol Genet. 2019;28(19):3244–3254. 18. 18.Saunders A, Macosko EZ, Wysoker A, et al. Molecular Diversity and Specializations among the Cells of the Adult Mouse Brain. Cell. 2018;174(4):1015–1030.e16. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2018.07.028&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30096299&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 19. 19.La Manno G, Gyllborg D, Codeluppi S, et al. Molecular Diversity of Midbrain Development in Mouse, Human, and Stem Cells. Cell. 2016;167(2). doi:10.1016/j.cell.2016.09.027 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2016.09.027&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27716510&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 20. 20.Koegler E, Bonnon C, Waldmeier L, Mitrovic S, Halbeisen R, Hauri HP. p28, a novel ERGIC/cis Golgi protein, required for Golgi ribbon formation. Traffic. 2010;11(1):70–89. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1600-0854.2009.01009.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19948005&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000274184800008&link_type=ISI) 21. 21.Gaudet P, Livstone MS, Lewis SE, Thomas PD. Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium. Brief Bioinform. 2011;12(5). doi:10.1093/bib/bbr042 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bib/bbr042&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21873635&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 22. 22.“ER to Golgi Anterograde Transport”. Reactome, release 73, [https://reactome.org/content/detail/R-HSA-199977](https://reactome.org/content/detail/R-HSA-199977): R-HSA-199977 (Access date: March 29, 2021) 23. 23.Bonet-Ponce L, Beilina A, Williamson CD, et al. LRRK2 mediates tubulation and vesicle sorting from lysosomes. Science Advances. 2020;6(46). doi:10.1126/sciadv.abb2454 [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo4OiJhZHZhbmNlcyI7czo1OiJyZXNpZCI7czoxMzoiNi80Ni9lYWJiMjQ1NCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA4LzA2LzIwMjIuMDguMDQuMjIyNzg0MzIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 24. 24.Bandres-Ciga S, Saez-Atienzar S, Bonet-Ponce L, et al. The endocytic membrane trafficking pathway plays a major role in the risk of Parkinson disease. Mov Disord. 2019;34(4):460. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/mds.27614&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30675927&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 25. 25.Beilina A, Rudenko IN, Kaganovich A, et al. Unbiased screen for interactors of leucine-rich repeat kinase 2 supports a common pathway for sporadic and familial Parkinson disease. Proc Natl Acad Sci U S A. 2014;111(7). doi:10.1073/pnas.1318306111 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMDoiMTExLzcvMjYyNiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA4LzA2LzIwMjIuMDguMDQuMjIyNzg0MzIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 26. 26.MacLeod DA, Rhinn H, Kuwahara T, et al. RAB7L1 interacts with LRRK2 to modify intraneuronal protein sorting and Parkinson’s disease risk. Neuron. 2013;77(3). doi:10.1016/j.neuron.2012.11.033 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2012.11.033&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23395371&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000317030800007&link_type=ISI) 27. 27.Fujita Y, Ohama E, Takatama M, Al-Sarraj S, Okamoto K. Fragmentation of Golgi apparatus of nigral neurons with alpha-synuclein-positive inclusions in patients with Parkinson’s disease. Acta Neuropathol. 2006;112(3):261–265. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00401-006-0114-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16855830&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 28. 28.Martínez-Menárguez JÁ, Tomás M, Martínez-Martínez N, Martínez-Alonso E. Golgi Fragmentation in Neurodegenerative Diseases: Is There a Common Cause? Cells. 2019;8(7):748. doi:10.3390/cells8070748 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/cells8070748&link_type=DOI) 29. 29.Sobu Y, Wawro PS, Dhekne HS, Yeshaw WM, Pfeffer SR. Pathogenic LRRK2 regulates ciliation probability upstream of tau tubulin kinase 2 via Rab10 and RILPL1 proteins. Proc Natl Acad Sci U S A. 2021;118(10). doi:10.1073/pnas.2005894118 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxODoiMTE4LzEwL2UyMDA1ODk0MTE4IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDgvMDYvMjAyMi4wOC4wNC4yMjI3ODQzMi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 30. 30.Fung HC, Scholz S, Matarin M, et al. Genome-wide genotyping in Parkinson’s disease and neurologically normal controls: first stage analysis and public release of data. Lancet Neurol. 2006;5(11):911–916. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1474-4422(06)70578-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17052657&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000241591600013&link_type=ISI) 31. 31.Maraganore DM, de Andrade M, Lesnick TG, et al. High-resolution whole-genome association study of Parkinson disease. Am J Hum Genet. 2005;77(5):685–693. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/496902&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16252231&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000232569600001&link_type=ISI) 32. 32.Loh PR, Palamara PF, Price AL. Fast and accurate long-range phasing in a UK Biobank cohort. doi:10.1101/028282 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czo4OiIwMjgyODJ2MiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA4LzA2LzIwMjIuMDguMDQuMjIyNzg0MzIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 33. 33.A global reference for human genetic variation. Nature. 2015;526(7571):68–74. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature15393&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26432245&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 34. 34.UK10K Consortium, Walter K, Min JL, et al. The UK10K project identifies rare variants in health and disease. Nature. 2015;526(7571):82–90. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature14962&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26367797&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 35. 35.Durand EY, Do CB, Mountain JL, Michael Macpherson J. Ancestry Composition: A Novel, Efficient Pipeline for Ancestry Deconvolution. bioRxiv. Published online October 18, 2014:010512. doi:10.1101/010512 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czo4OiIwMTA1MTJ2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA4LzA2LzIwMjIuMDguMDQuMjIyNzg0MzIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 36. 36.Zhao H, Sun Z, Wang J, Huang H, Kocher JP, Wang L. CrossMap: a versatile tool for coordinate conversion between genome assemblies. Bioinformatics. 2014;30(7):1006–1007. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btt730&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24351709&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000334078300017&link_type=ISI) 37. 37.Bulik-Sullivan BK, Loh PR, Finucane HK, et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47(3):291–295. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3211&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25642630&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 38. 38.Finucane HK, Bulik-Sullivan B, Gusev A, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet. 2015;47(11):1228–1235. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3404&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26414678&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 39. 39.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Research. 2010;38(16):e164–e164. doi:10.1093/nar/gkq603 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkq603&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20601685&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 40. 40.Aguet F, Barbeira AN, Bonazzola R, et al. The GTEx Consortium atlas of genetic regulatory effects across human tissues. bioRxiv. Published online October 3, 2019:787903. doi:10.1101/787903 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czo4OiI3ODc5MDN2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA4LzA2LzIwMjIuMDguMDQuMjIyNzg0MzIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 41. 41.Liberzon A, Subramanian A, Pinchback R, Thorvaldsdottir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011;27(12):1739–1740. doi:10.1093/bioinformatics/btr260 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btr260&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21546393&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000291261300036&link_type=ISI) 42. 42.Liu B, Gloudemans MJ, Rao AS, Ingelsson E, Montgomery SB. Abundant associations with gene expression complicate GWAS follow-up. Nat Genet. 2019;51(5):768–769. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-019-0404-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31043754&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 43. 43.de Klein N, Tsai EA, Vochteloo M, et al. Brain expression quantitative trait locus and network analysis reveals downstream effects and putative drivers for brain-related diseases. bioRxiv. Published online March 5, 2021:2021.03.01.433439. doi:10.1101/2021.03.01.433439 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMS4wMy4wMS40MzM0Mzl2MiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA4LzA2LzIwMjIuMDguMDQuMjIyNzg0MzIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 44. 44.Bryois J, Garrett ME, Song L, et al. Evaluation of chromatin accessibility in prefrontal cortex of individuals with schizophrenia. Nat Commun. 2018;9(1):3121. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-018-05379-y&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30087329&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 45. 45.Võsa U, Claringbould A, Westra HJ, et al. Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression. Nat Genet. 2021;53(9):1300–1310. 46. 46.McRae AF, Marioni RE, Shah S, et al. Identification of 55,000 Replicated DNA Methylation QTL. Sci Rep. 2018;8(1):17605. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 47. 47.Wingo AP, Liu Y, Gerasimov ES, et al. Integrating human brain proteomes with genome-wide association data implicates new proteins in Alzheimer’s disease pathogenesis. Nature Genetics. 2021;53(2):143–146. doi:10.1038/s41588-020-00773-z [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-020-00773-z&link_type=DOI) 48. 48.Elsworth B, Lyon M, Alexander T, et al. The MRC IEU OpenGWAS data infrastructure. doi:10.1101/2020.08.10.244293 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMC4wOC4xMC4yNDQyOTN2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA4LzA2LzIwMjIuMDguMDQuMjIyNzg0MzIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 49. 49.Hemani G, Zheng J, Elsworth B, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018;7. doi:10.7554/eLife.34408 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7554/eLife.34408&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29846171&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F08%2F06%2F2022.08.04.22278432.atom) 50. 50.Nightingale Health and UK Biobank announces major initiative to analyse half a million blood samples to facilitate global medical research. Accessed March 11, 2022. [https://www.ukbiobank.ac.uk/learn-more-about-uk-biobank/news/nightingale-health-and-uk-biobank-announces-major-initiative-to-analyse-half-a-million-blood-samples-to-facilitate-global-medical-research](https://www.ukbiobank.ac.uk/learn-more-about-uk-biobank/news/nightingale-health-and-uk-biobank-announces-major-initiative-to-analyse-half-a-million-blood-samples-to-facilitate-global-medical-research) 51. 51.Consortium TG, The GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369(6509):1318–1330. doi:10.1126/science.aaz1776 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEzOiIzNjkvNjUwOS8xMzE4IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDgvMDYvMjAyMi4wOC4wNC4yMjI3ODQzMi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 52. 52.Grenn FP, Kim JJ, Makarious MB, et al. The Parkinson’s Disease Genome-Wide Association Study Locus Browser. Mov Disord. 2020;35(11):2056–2067. [1]: /embed/graphic-6.gif