Abstract
Fine-mapping aims to prioritize causal variants underlying complex traits by accounting for the linkage disequilibrium of GWAS risk locus. The expanding resources of functional annotations serve as auxiliary evidence to improve the power of fine-mapping. However, existing fine-mapping methods tend to generate many false positive results when integrating a large number of annotations. In this study, we propose a unified method to integrate high-dimensional functional annotations with fine-mapping (Funmap). Funmap can effectively improve the power of fine-mapping by borrowing information from hundreds of functional annotations. Meanwhile, it relates the annotation to the causal probability with a random effects model that avoids the over-fitting issue, thereby producing a well-controlled false positive rate. Paired with a fast algorithm, Funmap enables scalable integration of a large number of annotations to facilitate prioritizing multiple causal SNPs. Our simulations demonstrate that Funmap is the only method that produces well-calibrated FDR under the setting of high-dimensional annotations while achieving better or comparable power gains as compared to existing methods. By integrating GWASs of 4 lipid traits with 187 functional annotations, Funmap consistently identified more variants that can be replicated in an independent cohort, achieving 15.5% − 26.2% improvement over the runner-up in terms of replication rate.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
M.C was supported by City University of Hong Kong Startup Grant (NO. 7200746) and Strategic Research Grant (NO. 21300423). Y.Z was supported by NSFC (Grant No. 2301383), Shenzhen Science and Technology Program (Grant No. RCBS20221008093336086), Hetao Shenzhen-Hong Kong Science and Technology Innovation Cooperation Zone Project (No. HZQSWS-KCCYB-2024016) and Longgang District Special Funds for Science and Technology Innovation (No. LGKCSDPT2023002). This research has been conducted using the UK Biobank Resource under Application Number 96744.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study used (or will use) ONLY openly available human data that were originally located at: https://nealelab.github.io/UKBB ldsc/index.html https://alkesgroup.broadinstitute.org/UKBBLD.ThescATAC-seqdata https://alkesgroup.broadinstitute.org/LDSCORE/baselineLDv2.1 annots https://csg.sph.umich.edu/willer/public/glgclipids2021
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
All data produced are available online at: https://nealelab.github.io/UKBB ldsc/index.html https://alkesgroup.broadinstitute.org/UKBB LD.ThescATAC-seqdata https://alkesgroup.broadinstitute.org/LDSCORE/baselineLD v2.1 annots https://csg.sph.umich.edu/willer/public/glgclipids2021