Integrative polygenic risk score improves the prediction accuracy of complex traits and diseases

Buu Truong; Leland E. Hull; Yunfeng Ruan; Qin Qin Huang; Whitney Hornsby; Hilary Martin; David A. van Heel; Ying Wang; Alicia R. Martin; S. Hong Lee; Pradeep Natarajan

doi:10.1101/2023.02.21.23286110

ABSTRACT

Polygenic risk scores (PRS) are an emerging tool to predict the clinical phenotypes and outcomes of individuals. Validation and transferability of existing PRS across independent datasets and diverse ancestries are limited, which hinders the practical utility and exacerbates health disparities. We propose PRSmix, a framework that evaluates and leverages the PRS corpus of a target trait to improve prediction accuracy, and PRSmix+, which incorporates genetically correlated traits to better capture the human genetic architecture. We applied PRSmix to 47 and 32 diseases/traits in European and South Asian ancestries, respectively. PRSmix demonstrated a mean prediction accuracy improvement of 1.23-fold (95% CI: [1.18; 1.29]; P-value < 2 × 10⁻¹⁶) and 1.19-fold (95% CI: [1.11; 1.27]; P-value = 3.94 × 10⁻⁶, and PRSmix+ improved the prediction accuracy by 1.71-fold (95% CI: [1.48; 1.94]; P-value = 9.98 × 10⁻¹⁰ and 1.41-fold (95% CI: [1.24; 1.58]; P-value = 2.51 × 10⁻⁶) in European and South Asian ancestries, respectively. Our method provides a comprehensive framework to benchmark and leverage the combined power of PRS for maximal performance in a desired target population.

INTRODUCTION

Thousands of polygenic risk scores (PRS) have been developed to predict an individual’s genetic propensity to diverse phenotypes¹. PRS are generated when risk alleles for distinct phenotypes are weighted by their effect size estimates and summed². Risk alleles included in PRS have traditionally been identified from genome-wide association studies (GWAS) results conducted on a training dataset, which are weighted and aggregated to derive a PRS to predict distinct phenotypes. The association between PRS and the phenotype of interest is subsequently evaluated in a test dataset that is non-overlapping with the training dataset³.

Most PRS have been developed in specific cohorts that may vary in terms of population demographics, admixture, environment, and SNP availability. Limited validation of many PRS outside of the training datasets and poor transferability of PRS to other populations may limit their clinical utility. However, pooling of data from individual PRS generated and validated in diverse cohorts has the potential to improve the predictive ability of PRS across diverse populations. The Polygenic Score Catalog (PGS Catalog) is a publicly available repository that archives SNP effect sizes for PRS estimation. The SNP effect sizes were developed from various methods (e.g. P+T⁴, LDpred^5,6, PRS-CS⁷, etc.) to obtain the highest prediction accuracy in the studied dataset. PRS metadata enables researchers to replicate PRS in independent cohorts and aggregate SNP effects to refine PRS and enhance the accuracy and generalizability in broader populations⁸. However, optimizing PRS performance requires methodological approaches to adjust GWAS estimate effect sizes that take into account correlated SNPs (i.e., linkage disequilibrium) and refine PRS for the target population^4,5,7,9–12. Furthermore, numerous scores are often present for single traits with varied validation metrics in non-overlapping cohorts. There is a lack of standardized approaches combining PRS from this growing corpus to enhance prediction accuracy and generalizability while minimizing bias, for a target cohort^8,11,13.

To address these issues, we sought to: 1) validate previously developed PRS in two geographically and ancestrally distinct cohorts, the All of Us Research Program (AoU) and the Genes & Health cohort, and 2) present and evaluate new methods for combining previously calculated PRS to maximize performance beyond all best performing published PRS. To better capture the genetic architecture of the outcome traits, we proposed PRSmix, a framework to combine PRS from the same trait with the outcome trait. Previous studies highlighted the effect of pleiotropic information on a trait’s genetic architecture^14,15. Therefore, we proposed PRSmix+ to additionally combine PRS from other genetically correlated traits to further improve the PRS for a given trait.

To assess the prediction improvement, we performed PRSmix and PRSmix+ for 47 traits in European ancestry and 32 traits in South Asian ancestry. We evaluated 1) the relative improvement of the proposed framework over the best-performing pre-existing PRS for each trait, 2) the efficient training sample sizes required to improve the PRS, 3) the predictive improvement in 6 groups including anthropometrics, blood counts, cancer, cardiometabolic, biochemistry and other conditions as the prediction accuracies varied in each group, and 4) the clinical utility and pleiotropic effect of the newly built PRS for coronary artery disease. Overall, we show that PRSmix and PRSmix+ significantly improved prediction accuracy. An R package for preprocessing and harmonizing the SNP effects from the PGS Catalog as well as assessing and combining the scores was developed to facilitate the combining of pre-existing PRS scores for both ancestry-specific and cross-ancestry contexts using the totality of published PRS. The development of this framework has the potential to improve precision health by improving the generalizability in the application of PRS¹⁶.

RESULTS

Overview of methods

A single PRS may only reflect genetic effects captured in the discovery dataset of a single study that may be only a part of the total genetic effects underlying the trait of interest. Therefore, we harmonized and combined multiple sets of PRS to establish a new set of scores, which gather information across studies and traits. Our approach leveraged multiple well-powered PRSs to improve prediction accuracy and is detailed in Fig. 1.

Figure 1. The framework of the trait-specific and cross-trait PRS integration.

In Phase 1, we obtained the SNP effects from the PGS Catalog and then harmonized the effect alleles as the alternative alleles in the independent cohorts. In each independent biobank (All of Us, Genes & Health), we estimated the PRS and split the data into training (80%) and testing (20%) datasets. In Phase 2, in the training dataset, we trained the Elastic Net model with high-power scores to estimate the mixing weights for the PRSs. The training phase could include PRSs from traits corresponding to outcomes (PRSmix) or all traits (PRSmix+). The training was adjusted for age, sex, and 10 principal components (PCs). In Phase 3, we adjusted the per-allele effect sizes from each single PRS by multiplying with the corresponding mixing weights obtained in the training phase. The final per-allele effect sizes are estimated as the weighted sum of the SNP effects across different single scores. In Phase 4, we evaluated the re-estimated per-allele effect sizes in the testing dataset.

Our combination frameworks leveraged the PGS Catalog¹⁷ as the resource of SNP effects to estimate single PRSs. To avoid overfitting, we used All of Us and Genes & Health cohorts (see Methods) due to non-overlapping samples from the original GWAS. We randomly divided the target cohort into a training set (80%) and a testing set (20%). We selected the most common traits from the PGS Catalog which have the highest number of PRS. For the stability of the linear combination, we curated binary traits with a prevalence > 2% in the target cohort. Continuous traits were assessed using partial R² which is estimated as the difference between the full model of PRS and covariates (age, sex, and 10 PCs) and the null model of only covariates. For binary traits, the prediction accuracy was converted to liability R² with disease prevalence approximated as the prevalence in the corresponding cohort.

To combine the scores, we employed Elastic Net¹⁸ to construct linear combinations of the PRS. We proposed two combination frameworks: 1) PRSmix combines the scores developed from the same outcome trait, and 2) PRSmix+ combines all the high-power scores across other traits. Trait-specific combinations, PRSmix, can leverage the PRSs developed from different studies and methods to more fully capture the genetic effects underlying the traits. It has also been shown that complex traits are determined by genes with pleiotropic effects¹⁵. Therefore, we additionally proposed a cross-trait combination, PRSmix+, to make use of pleiotropic effects and further improve prediction accuracy.

First, we evaluated the improvement for each method, defined as the fold-ratio of the method compared to the prediction accuracy of the best single PRS. For a fair comparison with the proposed framework, we selected the best single PRS from the training set and evaluated its performance in the testing set. First, we performed simulations to assess the improvement with various heritabilities and training sample sizes. We estimated the slope of improvement of prediction accuracy by increasing training sample sizes for various heritabilities.

Next, we applied the proposed frameworks in two distinct cohorts; (1) the All of Us program, in which 47 traits were tested in U.S. residents of European ancestry, and (2) the Genes & Health (G&H) cohort, in which 32 traits were tested in British South Asian ancestry (Supplementary Table 1). In each cohort, we compared the improvement of our proposed framework with the single best score from the PGS Catalog. We estimated the averaged fold-ratio as a measure of the improvement of prediction accuracy by our approach, compared to the best single score from PGS Catalog. We also classified the traits into 6 categories as anthropometrics, blood counts, cancer, cardiometabolic, biochemistry, and other conditions (Supplementary Table 2 and 3). Cancer traits were not considered in the younger Genes & Health cohort due to their low prevalence (<2%). We then present additional detailed analyses for coronary artery disease focused on clinical utility improvements relative to existing PRS.

Simulations were used to evaluate the combination frameworks

To compare the performance of PRSmix and PRSmix+ against the best single PRS and evaluate the sample sizes needed for training the mixing weights, we performed simulations with real genotypes of European ancestry in the UK Biobank given the large sample sizes available (Fig. 2). Briefly, we randomly split 7,000 individuals as a testing data set mimicking the testing size of 20% of real data. In the remaining dataset, we used 200,000 individuals for GWAS to estimate the SNP effect sizes for PRS calculations. Finally, with the rest of the data, we randomly selected different sample sizes as the training sample to evaluate the sample sizes needed to train the mixing weights. To assess the improvement of PRS performance, we computed the fold-ratio of prediction accuracy R² between PRSmix and PRSmix+ against the best-performing single simulated PRS.

Figure 2. Simulations to demonstrate the predictive improvement of PRSmix and PRSmix+.

The points and triangles represent the mean fold-ratio of R² between (a) PRSmix and (b) PRSmix+, respectively, versus the best single PRS. (c) The improvement per logarithm with base 10 of sample size for various heritabilities was represented as a slope of a linear regression of fold-ratio ∼ log10(N). In simulations, the correlation within simulated trait-specific PRSs was 0.8, and the correlation between trait-specific and correlated PRSs was 0.4 (see Methods). The whiskers demonstrate confidence intervals across 200 replications. The dashed red lines represent the reference for fold-ratio equal 1 for (a) and (b), and equal 0 for (c).

Our results showed that the trait-specific combination, PRSmix, showed no improvement with the training sample smaller than 500 for most of the traits. Our simulations illustrated that traits with low heritability required a larger sample size to achieve an improvement compared to traits with high heritability (Fig. 2a and 2b). PRSmix demonstrated a better performance compared to the best single PRS with training sample sizes from N_training = 200 samples for the high heritable trait (h² = 0.4) to N_training = 5000 samples for the low heritable trait (h²=0.05) (Fig. 2a and 2b). We observed that PRSmix demonstrated a saturation of improvement from N_training = 10,000. PRSmix+ demonstrated negligible further improvement when the training sample size was increased from 30,000 but maintained consistent improvement relative to PRSmix and the best single PRS. Moreover, we observed that traits with higher heritability or higher best prediction accuracy of a single PRS demonstrated a smaller improvement compared to traits with a smaller heritability (Fig. 2c).

Combining trait-specific PRS improves prediction accuracy (PRSmix)

To determine if a trait-specific combination, namely PRSmix, would improve the accuracy of PRS prediction, we used data from European ancestry participants in the All of Us research program who had undergone whole genome sequencing, and Genes & Health participants of South Asian ancestry. We randomly split the independent cohorts into training (80%) and testing sets (20%). The training set was used to train the weights of each PRS, referred as mixing weights, that indicate how much each PRS explain the phenotypic variance in the training set, and the PRS accuracies were evaluated in the testing set (Fig. 1). We curated 47 traits and 32 traits in the All of Us and Genes & Health cohorts, respectively. For binary traits, we removed traits with a prevalence of smaller than 2% (see Methods, Supplementary Table 1). Traits with the best-performance trait-specific single PRS which showed a lack of power were also removed. Overall, we observed a significant improvement compared to 1 using a two-tailed paired t-test with PRSmix. PRSmix significantly improves the prediction accuracy compared to the best PRS estimated from the PGS Catalog. PRSmix improved 1.22-fold (95% CI: [1.17; 1.27]; P-value < 2 × 10⁻¹⁶) and 1.19-fold (95% CI: [1.11; 1.27]; P-value = 1.92 × 10⁻⁶) compared to the best PRS from PGS Catalog for European ancestry and South Asian ancestry, respectively.

In European ancestry, we observed the greatest improvement of PRSmix against the best single PRS for rheumatoid arthritis of 1.93-fold. Furthermore, in South Asian ancestry, we observed that PRSmix of coronary artery disease had the best improvement of 2.32-fold compared to the best-performance single PRS. Details of the prediction accuracy are shown in Supplementary Fig. 1, 2 and Supplementary Table 2, 3. This was consistent with findings in simulations since traits with a lower single PRS performance demonstrated a better improvement with the combination strategy.

Cross-trait combination further improved PRS accuracy and highlighted the contribution of pleiotropic effects

We next assessed the contribution of pleiotropic effects from cross-trait PRSs to determine if these would further improve the combination framework (PRSmix+), by including high-power PRSs from within 2600 PRSs in the PGS Catalog. To evaluate the power of PRS and improve computational efficiency, we employed the theoretic power and variance of partial R² for continuous traits and liability R² for binary traits (see Methods). We observed that PRSmix+ further improved the prediction accuracy compared to the best PGS Catalog in European ancestry (Fig. 3a) and South Asian ancestry (Fig. 3b). We observed an improvement of 1.70-fold (95% CI: [1.47; 1.93]; P-value = 2.13 × 10⁻⁹ and 1.42-fold (95% CI: [1.25; 1.59]; P-value = 8.01 × 10⁻⁷) higher compared to the best PGS Catalog for European ancestry and South Asian ancestry, respectively. PRSmix+ significantly improved the prediction accuracy compared to PRSmix, in both European and South Asian ancestry with 1.42-fold (95% CI: [1.22; 1.62]; P-value = 2.32 × 10⁻⁵) and 1.19-fold (95% CI: [1.07; 1.32]; P-value = 0.001), respectively (Supplementary Fig. 3).

Figure 3. Comparison of PRSmix and PRSmix+ versus the best PGS Catalog in European and South Asian ancestries.

The relative improvement compared to the best single PRS was assessed in (a) the European ancestry in the All of US cohort and (b) South Asian ancestry in the Genes & Health cohort. PRSmix combines trait-specific PRSs and PRSmix+ combines additional PRSs from other traits. The best PGS Catalog score was selected by the best performance trait-specific score in the training sample and evaluated in the testing sample. The prediction accuracy (R²) was calculated as partial R² which is a difference of R² between the model with PRS and covariates including age, sex, and 10 PCs versus the base model with only covariates. Prediction accuracy for binary traits was assessed with liability-R² where disease prevalence was approximately estimated as a proportion of cases in the testing set. The whiskers reflect the maximum and minimum values within the 1.5 × interquartile range. The bars represent the ratio of prediction accuracy of PRSmix and PRSmix+ versus the best PRS from the PGS Catalog across 47 traits and 32 traits in All of Us and Genes and Heath cohorts, respectively, and the whiskers demonstrate 95% confidence intervals. P-values for significance difference of the fold-ratio from 1 using a two-tailed paired t-test. PRS: Polygenic risk scores.

Consistent with our simulation results, a smaller improvement was observed for traits with a higher baseline prediction accuracy from PGS Catalog (Supplementary Fig. 4), noting that the baseline prediction accuracy depends on the heritability and genetic architecture (i.e. polygenicity). In contrast, more improvement was observed for traits with lower heritability, thus lower prediction accuracy, when comparing the single best PRS (Fig. 1c).

Prediction accuracy and predictive improvement across various types of traits

We next compared PRSmix and PRSmix+ with the best PRS estimated from the PGS Catalog across 6 categories, including anthropometrics, blood counts, cancer, cardiometabolic, biochemistry, and other conditions (see Methods). PRSmix demonstrates a higher prediction accuracy across all types of traits in both European and South Asian ancestries (Fig. 4). We observed a similar trend in the predictive performance of PRSmix+ across different types of traits. In European, the smallest improvement was in anthropometric traits of 1.20-fold (95% CI: [1.11; 1.28]; P-value = 3.4 × 10⁻⁶) and “other conditions” (including depression, asthma, migraine, current smoker, hypothyroid, osteoporosis, glaucoma, rheumatoid arthritis, and gout) obtained the highest mean predictive improvement of 2.08-fold (95% CI: [1.25; 2.89]; P-value = 9.9 × 10⁻³) (Supplementary Table 4). In South Asian ancestry, the mean predictive improvement was highest but also with high variance in “other conditions” (including asthma, migraine, current smoker, and rheumatoid arthritis) type. Biochemistry demonstrated the smallest improvement of 1.23-fold (95% CI: [1.15; 1.31]; P-value = 5.8 × 10⁻⁹).

Figure 4. Prediction accuracy and improvement across various types of traits in the European and South Asian ancestry.

We classified the traits into 6 main categories for European ancestry in the All of Us cohort and 5 categories for South Asian ancestry in the Genes & Health cohort due to the low prevalence of cancer traits in Genes & Health. The prediction accuracies, (a) and (c), are estimated as partial R² and liability R² for continuous traits and binary traits, respectively. The relative improvements, (b) and (d), are estimated as the fold-ratio between the prediction accuracies of PRSmix and PRSmix+ against the best PGS Catalog. The order on the axis followed the decrease in the prediction accuracy of PRSmix+. The boxplots in (a) and (c) show the first to the third quartile of prediction accuracies for 47 traits and 32 traits in European and South Asian ancestries, respectively. The whiskers reflect the maximum and minimum values within the 1.5 × interquartile range for each group. The bars in (b) and (d) represent the mean prediction accuracy across the traits in that group and the whiskers demonstrate 95% confidence intervals. The red dashed line in (b) and (d) represents the ratio equal to 1 as a reference for comparison with the best PGS Catalog score. The asterisk (*) and (**) indicate P-value < 0.05 and P-value < 0.05 / number of traits in each type with a two-tailed paired t-test, respectively.

Clinical utility for coronary artery disease

To evaluate the utility of the proposed methods, we assessed the PRSmix and PRSmix+ for coronary artery disease (CAD), which is the leading cause of disability and premature death among adults^19–21. The single best CAD PRSs (PRS_CAD) s from the PGS Catalog in the training sample was from Koyama S. et al²². and Tamlander M. et al.²³ in European and South Asian ancestries, respectively (Supplementary Fig. 5). Liability R² in the testing sample with Koyama S et al. for European ancestry was 0.019 (95% CI: [0.013; 0.025]; P-value = 1.87 × 10⁻⁹) and with Tamlander M. et al. for South Asian ancestry was 0.006 (95% CI: [0.003; 0.009]; P-value = 2.39 × 10⁻⁴) (Fig. 5).

Figure 5. Comparison of prediction accuracies with PRSmix, PRSmix+ and CAD PRS from PGS Catalog.

PRSmix was computed as a linear combination of CAD PRS and PRSmix+ was computed as a linear combination of all significant PRS obtained from the PGS Catalog. The PRSs were evaluated by liability R² in the (a) European ancestry from the All of Us cohort and b) South Asian ancestry from the Genes & Health cohort. The bars indicate the mean prediction accuracy and the whiskers show 95% confidence intervals. CAD, coronary artery disease.

Subsequently, we assessed the clinical utility of the integrative model with PRS and established clinical risk factors, including age, sex, total cholesterol, HDL-C, systolic blood pressure, BMI, type 2 diabetes, current smoking status versus the traditional model with clinical risk factors. (Fig. 6 and Supplementary Table 5). In European ancestry, the CAD PRSmix+ integrative score improved the continuous net reclassification of 35% (95% CI: [22%; 48%]; P-value = 7.08 × 10⁻⁸) compared to PRSmix (30%; 95% CI: [18%; 42%]; P-value = 9.11 × 10⁻⁷) and the best PRS from the PGS Catalog (19%; 95% CI: [5%; 33%]; P-value = 0.007). In South Asian ancestry, the integrated score with PRSmix+ showed significant continuous net reclassification of 27% (95% CI: [16%; 38%]; P-value = 6.07 × 10⁻⁷) compared to PRSmix (15%; 95% CI: [9%; 20%]; P-value = 7.18 × 10⁻⁶) and the best PGS Catalog (7%; 95% CI: [1%; 13%]; P-value = 0.02). Our results also demonstrated an improvement in net reclassification for models without clinical risk factors (Supplementary Table 5).

Figure 6. Net reclassification improvement (NRI) for coronary artery disease with the addition of polygenic risk scores to the baseline model in European and South Asian ancestries.

The baseline model for risk prediction includes age, sex, total cholesterol, HDL-C, systolic blood pressure, BMI, type 2 diabetes, and current smoking status. We compared the integrative models with PGS Catalog, PRSmix, and PRSmix+ in addition to clinical risk factors versus the baseline model with only factors. The points indicate the mean estimate for continuous NRI and the whiskers indicate 95% confidence intervals estimated from 500 bootstraps. HDL-C: High-density lipoprotein; BMI: Body mass index. NRI: Net Reclassification Improvement.

We assessed the incremental area under the curve (AUC) between the full model of PRS and covariates and the null model with only covariates (Supplementary Table 6). PRSmix+ demonstrated an incremental AUC of 0.02 (95% CI: [0.018; 0.02]; P-value < 2.2×10⁻¹⁶) and 0.008 (95% CI: [0.007; 0.009]; P-value<2.2×10⁻¹⁶) in European and South Asian ancestries, respectively. PRSmix obtained an incremental AUC of 0.013 (95% CI: [0.013; 0.014]; P-value < 2.2×10⁻¹⁶) and 0.006 (95% CI: [0.005; 0.007]; P-value < 2.2×10⁻¹⁶) in European and South Asian ancestries, respectively. The best PGS Catalog had the smallest incremental AUC of 0.007 (95% CI: [0.007; 0.008]; P-value<2.2×10⁻¹⁶) and 0.003 (95% CI: [0.002; 0.003]; P-value < 2.2×10⁻¹⁶) in European and South Asian ancestries, respectively.

We also compared the risks for individuals in the top decile versus the remaining population (Supplementary Table 7). For European ancestry, an increased risk with OR per 1-SD of the best PGS Catalog, PRSmix and PRSmix+ were 1.39 (95% CI: [1.27-1.52]; P-value < 1.52 × 10⁻¹⁶), 1.52 (95% CI: [1.39-1.67]; P-value < 2.2×10⁻¹⁶) and 1.66 (95% CI = [1.51; 1.82]; P-value < 2.2×10⁻¹⁶), respectively. The top decile of PRSmix+ compared to the remaining population demonstrated an increased risk of OR = 2.54 (95% CI: [1.97; 3.25]; P-value = 3.91 × 10⁻¹³). The top decile for the best PGS Catalog versus the remainder was OR = 2.14 (95% CI: [1.66; 2.74]; P-value = 2.27 × 10⁻⁹). For South Asian ancestry, an increased risk with OR per 1-SD of the best PGS Catalog, PRSmix and PRSmix+ was 1.24 (95% CI: [1.13; 1.37]; P-value < 1.52×10⁻¹⁶), 1.39 (95% CI: [1.33; 1.46]; P-value < 2.2 × 10⁻¹⁶), 1.40 (95% CI: [1.27; 1.55]; P-value < 2.2×10⁻¹⁶) and 1.50 (95% CI = [1.36; 1.66]; P-value < 2.2×10⁻¹⁶), respectively. In South Asian ancestry, PRSmix+ demonstrated an OR of 2.34 (95% CI: [1.79; 3.05]; P-value = 4.22 × 10⁻¹⁰), and with the best PGS Catalog, OR was 1.73 (95% CI: [1.30; 2.28]; P-value = 1.31 × 10⁻⁴) for the top decile versus the remaining population.

Moreover, we observed that there is a plateau of improvement for PRSmix from the training size of 5000 in both European and South Asian ancestries (Supplementary Fig. 6), which aligned with our simulations (Fig. 2a and 2b). Our results demonstrated the generalization of our combination methods across diverse ancestries to improve prediction accuracy. With PRSmix+, our empirical result showed that there was a modest improvement with training sample sizes larger than 5,000.

Finally, we conducted phenome-wide association studies (PheWAS) in All of Us between PRS_CAD with 1815 phecodes to compare the pleiotropy of PRS and assess the relationship between CAD PRS and disease phenotypes given the inherent use of pleiotropy in development (Supplementary Table 8). As expected, PRSmix+ had a stronger association for ischemic heart disease relative to the single best PRS from the PGS Catalog. Despite extensive use of pleiotropy in performance, PRSmix+ associations with cardiometabolic risk factors were only mildly greater in risk increase (Supplementary Table 8). The PheWAS result for PRSmix+ aligned with the list of traits from the selected PRS (Supplementary Fig. 7, and Supplementary Table 8)

DISCUSSION

In this paper, we propose a trait-specific framework (PRSmix), and cross-trait framework (PRSmix+) to leverage the combined power of existing scores. We performed and evaluated our method using the All of Us and Genes & Health cohorts showcasing a framework to develop the most optimal PRS for a given trait in a target population leveraging all existing PRS. Across 47 traits in All of Us cohort and 32 traits in the Genes & Health cohort with either continuous traits or binary traits with prevalence > 2%, we demonstrated substantial improvement in average prediction R² by using a linear combination with Elastic Net. The empiric observations are concordant with simulations. To our knowledge, there has been a number of emerging studies to combine PRS, but there is a limited number of frameworks that comprehensively evaluate, harmonize, and leverage the combination of these scores^8,13,24. Our studies permit several conclusions for the development, implementation, and transferability of PRS.

First, externally derived and validated PRS are generally not the most optimal PRS for a given cohort. Consistent with other risk predictors, recalibration within the ultimate target population improves performance²⁵. By leveraging the PGS Catalog, our work carefully harmonizes the risk alleles to estimate PRS across all scores and provides newly estimated per-allele SNP effects (provided to the PGS Catalog) to assist the interpretability of the models.

Second, previous studies selected an arbitrary training sample size to estimate the mixing weights, which may lead to a poor power of the combination frameworks and inaccurate estimate of sampling variance¹⁰. We assessed the expected sample sizes to estimate the mixing weights via simulations and real data. Our results demonstrated that while low heritability traits benefit the most, they require a greater training sample size.

Third, we leveraged all PRS, including those not trained on the primary trait, to systematically optimize PRS for a target cohort. We showed that PRSmix improved the prediction by combining the scores matching the outcome trait. In addition, we showed that PRSmix+ was able to leverage the power of cross-traits, which highlighted the contribution of pleiotropic effects to enhance PRS performance. We leverage prior work demonstrating the effects of pleiotropy on complex traits^15,26,27. It is noted that our proposed framework is related to the metaPRS approach advanced by Abraham et al. for stroke, however, selected with prior knowledge⁸. Our framework utilizes all PRSs available in the PGS Catalog. Additional summary statistics could be added to further enhance the models. We let our model penalize the high-power PRS without the need for prior knowledge. We also observed that our method could identify more related risk factors to include compared to previous work conducted on stroke (Supplementary Fig. 8). Therefore, our method is more comprehensive in an unbiased way in terms of choosing the risk factors and traits to include with empirically improved performance.

Fourth, greater performance is observed even for non-European ancestry groups underrepresented in GWAS and PRS studies. We empirically demonstrate the value of training and incorporating pleiotropy with all available PRS to improve performance, including multiple metrics of clinical utility for CAD prediction in multiple ancestries. In South Asian ancestry, we observed that PRSmix and PRSmix+ demonstrated a significant improvement with the best improvement for CAD. Of note for CAD, the relative improvements in South Asian ancestry were higher than in European ancestry for PRSmix and equivalent for PRSmix+. Transferability of PRS has been shown to improve the clinical utility of PRS in non-European ancestry^16,28. Although the prediction accuracy for South Asian ancestry is still limited, our results highlighted the transferability of predictive improvement with PRSmix and PRSmix+ to South Asian ancestry. We anticipate that ongoing and future efforts to improve our understanding of the genetic architecture in non-European ancestries will further improve the transferability of PRS across ancestry.

Lastly, traits with low heritability or generally low-performing single PRS benefit the most from this approach, especially with PRSmix+, such as migraine in both European and South Asian ancestries. Additionally, our results showed that pleiotropic effects play an important role in understanding and improving prediction accuracies of complex traits. However, anthropometric traits, which are highly polygenic²⁹ and have good predictive performance using the best PGS Catalog, also showed improvement with the combination framework in both European and South Asian ancestries.

Given that PRSmix+ outperformed PRSmix, one might consider if there is a reason to use PRSmix instead of PRSmix+. We observed that in cases of highly heritable traits or high performance with a single PRS, there was only marginal improvement of PRSmix+ over PRSmix. In this scenario, PRSmix could provide similar predictive performance while being less time-consuming because trait-specific PRS inputs only are required. However, for traits with lower heritability PRSmix+ shows a marked improvement over PRSmix and would be preferred. Wang et al.³⁰ showed that the theoretical prediction accuracy of the target trait using the PRS from the correlated trait is a function of genetic correlation, heritability, number of genetic variants and sample size. Future directions could include defining the minimum parameters required for the performance of the PRSmix+ model to improve on single trait-specific PRS.

Our work has several limitations. First, the majority of scores from PGS Catalog were developed in European ancestry populations. Further non-European SNP effects will likely improve the single PRS power, which may in turn, also improve the prediction accuracy of our proposed methods. Second, the Elastic Net makes a strong assumption that the outcome trait depends on a linear association with the PRS and covariates. However, a recent study demonstrated there is no statistical significance difference between linear and non-linear combinations for neuropsychiatric disease¹³. Third, we did not validate the mixing weights in an independent cohort. We expect that in the future, there will be emerging large independent biobanks, but prior non-genetic work demonstrates the value of internal calibration for optimal risk prediction. Fourth, we estimated the mixing weights for each single SNP as a mixing weight of the PRS. Future studies could consider linkage disequilibrium between the SNPs and functional annotations of each SNP. Fifth, our frameworks were conducted on binary traits with a prevalence > 2%. Additional combination PRS models are emerging that seek to use preexisting genotypic data from genetically related, but low prevalence conditions, to improve the prediction accuracy of rare conditions¹³. Sixth, the baseline demographic characteristics (i.e., age, sex, social economic status) in the target cohort might limit the validation and transferability of PRS³¹. Although these factors were considered by using a subset of the target cohorts as training data, it is necessary to have PRS developed on similar baseline characteristics. Lastly, with the expanding of all biobanks, there might be no perfect distinction between the samples deriving PRS and the testing cohort, future studies may consider the potential intersection samples to train the linear combination.

In conclusion, our framework demonstrates that leveraging different PRS either trait-specific or cross-trait can substantially improve model stability and prediction accuracy beyond all existing PRS for a target population. Importantly, we provide software to achieve this goal in independent cohorts.

METHODS

Data

The All of Us Research Program

The All of Us Research Program is a longitudinal cohort continuously enrolling (starting May 2017) U.S. adults ages 18 years and older from across the United States, with an emphasis on promoting inclusion of diverse populations traditionally underrepresented in biomedical research, including gender and sexual minorities, racial and ethnic minorities, and participants with low levels of income and educational attainment.³² Participants in the program can opt-in to providing self-reported data, linking electronic health record data, and providing physical measurement and biospecimen data.³³ Details about the All of Us study goals and protocols, including survey instrument development,³⁴ participant recruitment, data collection, and data linkage and curation were previously described in detail.^33,35

Data can be accessed through the secure All of Us Researcher Workbench platform, which is a cloud-based analytic platform that was built on the Terra platform.³⁶ Researchers gain access to the platform after they complete a 3-step process including registration, completion of ethics training, and attesting to a data use agreement attestation.³⁷ All of Us uses a tiered approach based on what genomic data is accessible through the Controlled Tier, and includes both whole genome sequencing (WGS), genotyping array variant data in multiple formats, as well as variant annotations, access to computed ancestry, and quality reports.³⁸ This study includes data on the 98,600 participants with (WGS) data in the All of Us v5 Curated Data Repository release. Participant data in this data release was collected between May 6, 2018 and April 1, 2021. This project is registered in the All of Us program under the workspace name “Polygenic risk score across diverse ancestries and biobanks.”

The Genes & Health Biobank

Genes & Health is a community-based genetics study enrolling British South Asian, with an emphasis on British Bangladeshi (two-thirds) and British Pakistani (remaining) people, with a goal of recruiting at least 100,000 participants. Currently, over 52,000 participants have enrolled since 2015. All participants have consented for lifelong electronic health record access and genetic analysis. The study was approved by the London South East National Research Ethics Service Committee of the Health Research Authority. 97.4% of participants in Genes & Health are in the lowest two quintiles of the Index of Multiple Deprivation in the United Kingdom. The cohort is broadly representative of the background population with regard to age, but slightly over-sampled with females and those with medical problems since two-thirds of people were recruited in healthcare settings such as General Practitioner surgeries³⁹.

The Polygenic Score (PGS) Catalog

Polygenic risk scores were obtained from the Polygenic Score (PGS) Catalog¹⁷, which is a publicly accessible resource cataloging published PRS, including the metadata. The metadata provides information describing the computational algorithms used to generate the score, and performance metrics to evaluate a PRS¹⁷. At the time of this study, over 2,600 PRS were cataloged in the PGS Catalog (version July 18, 2022) designed to predict 538 distinct traits.

Clinical Outcomes

Clinical phenotypes were curated using a combination of electronic health record data, direct physical measurements, and/or self-reported personal medical history data, from the All of Us v5 Data Release as detailed in Supplementary Table 13. Individuals in the Genes and Health cohort were also curated with similar definitions based on electronic health record and ICD10 (Supplementary Table 14). Traits with the best performing single trait-specific PRS with power < 0.95 such as hemoglobin, sleep apnea, and depression were removed. Binary traits with a prevalence < 2% were removed.

A linear combination of scores

We proposed PRSmix to combine PRS of outcome traits and PRSmix+ to combine high-power PRS (defined in the following subsection) from all traits obtained from PGS Catalog. The linear combination was conducted by using an Elastic Net algorithm from the “glmnet” R package (version 4.1) to combine the estimated PRS. First, we randomly split the independent cohorts into 80% of training and 20% testing. The PRS in the training set was standardized with mean 0 and variance 1. Before conducting linear combination, we first evaluated the performance of each individual PRS by their power and P-value (see below). An Elastic Net algorithm was used with 5-fold cross-validation and default parameters to estimate the mixing weights of each PRS. The mixing weights were then divided by the corresponding original standard deviation of the PRS in the training set.

Where and σ_i is the mixing weight estimated from the Elastic Net and standard deviation of PRS_i in the training set, respectively. is the adjusted mixing weight for PRS_i. To derive the per-allele effect sizes from the combination framework, we multiplied the SNP effects with the corresponding adjusted mixing weights:

Where is the adjusted effect size of SNP_j and β_ij is the original effect sizes of SNP_j in PRS_i. We set β_ij = 0 if SNP_j is not in PRS_i. The adjusted effect sizes were then utilized to calculate the final PRS.

The mixing weights for PGS Catalog scores for PRSmix and PRSmix+ in European ancestry are provided in Supplementary Table 9 and Supplementary Table 10, respectively. For South Asian ancestry, the mixing weights for PRSmix and PRSmix+ in European ancestry are provided in Supplementary Table 11 and Supplementary Table 12, respectively.

Power and variance of PRS accuracy

We selected high-power PRS to conduct the combination by assessing the power and variance of prediction accuracy. The power of PRS can be estimated based on the power of the two-tailed test of association as follow^3,40: where ϕ is the Chi-squared distribution function, α is the significance level, and λ is the non-centrality parameter which can be estimated as where N, R² is the sample size and estimated prediction accuracy in the testing set, respectively. R² can be estimated as partial R² or liability R² for continuous traits and binary traits, respectively. Briefly, partial R² compared the difference in goodness-of-fit between a full model with PRS and covariates including age, sex, and first 10 PCs, and a null model with only covariates. Additionally, for binary traits, liability R² was estimated with the disease prevalence approximated as the prevalence in the samples. The theoretical variance and standard error of R² can be estimated as follow^41–43:

Therefore, we can analytically estimate the confidence interval of prediction accuracy for each of the score. We selected high-power scores defined as power > 0.95 with P-value > 0.05 or P-value > 1.9 × 10⁻⁵ (0.05/2600) for the combination with Elastic Net.

To compare the improvement, for instance between PRSmix and the best PGS Catalog, we estimate the mean fold-ratio of R² across different traits with its 95% confidence interval and evaluated the significance difference from 1 using a two-tailed paired t-test.

Simulations

We used UK Biobank European ancestry to conduct simulations for trait-specific and crosstrait combinations. Overall, we simulated 7 traits with heritability h² equal to 0.05, 0.1, 0.2, and 0.5. We randomly selected M=1000 causal SNPs among 1.1 million HapMap3 variants with INFO > 0.6, MAF > 0.01 and P-value Hardy-Weinberg equilibrium > 10⁻⁷. We removed individuals with PC1 and PC2 > 3 standard deviation from the mean. We randomly remove one in a pair of related individuals with closer than 2nd degree. The genetic components were simulated as PRSs where PRS1, PRS2, and PRS3 are considered trait-specific scores with genetic correlations are 0.8 and 0.4 for cross-trait scores. PRS4, PRS5 and PRS6 are simulated as pleiotropic effects on the outcome traits with genetic correlation equal to 0.4.

The SNP effects for PRSs are simulated by a multivariate normal distribution MVN(0, Σ) where Σ is the covariance matrix between PRSs. The main diagonal contains the heritability of the traits as h²/M and the covariance between PRSs are simulated as r_g * h²/M where r_g is the genetic correlation between PRSs (0.8 for trait-specific scores and 0.4 for cross-trait scores). The PRSs of the outcome are estimated by the weighted combination of PRS where the weights follow U(0,1). 7 phenotypes were simulated as y = g + e,e ∼ N (0,1 − h²) where g is PRS and e is the residuals.

We split the simulated cohort into 3 data sets for: 1) GWAS 2) training set: training the mixing weights with a linear combination and 3) testing set: testing the combined PRS. We incorporated PRS1, PRS2 and PRS3 to assess the trait-specific PRSmix framework. We combined all 6 single PRS to evaluate the cross-trait PRSmix+ framework. We compared the fold-ratio of the R² of the combined PRS to the R² of best single PRS to assess the improvement of the combination strategy. To evaluate the improvement across different heritabilities, we estimated the slope of improvement per log10(N) increase of training sample sizes on the fold-ratio of predictive improvement.

Sample and genotyping quality control

The AoU data version 5 contains more than 700 million variants from whole genome sequencing³³. We curated European ancestry by predicted genetic ancestry with a probability > 90% provided by AoU yielding 48,351 individuals in the AoU. For variant quality control beyond AoU central efforts, we further filtered SNPs to include MAF > 0.001 which retained 9,538,437 SNPs. We performed a similar quality control for imputed genotype data for South Asian ancestry in the Genes & Health cohort with additional criteria of INFO score > 0.6 and genotype missing rate < 5%. Individuals with a missing rate > 5% were removed. Eventually, 44,396 individuals and 8,935,207 SNPs remained in Genes & Health.

Assessment of clinical utility

We applied PRSmix and PRSmix+ for coronary artery disease as a clinical application. The phenotypic algorithm includes at least one ICD or CPT code below: ICD9 410x, 411x, 412x; ICD10 I22x, I23x, I24.1, I25.2 CPT 92920-92979 (PCI), 33533-33536, 33517-33523, 33510-33516 (CABG) or self-reported personal history of MI or CAD. CAD in Genes and Health cohort was defined with at least one ICD10 I22x, I23x, I24.1, I25 or operation codes K401, K402, K403, K404, K411, K451, K452, K453, K454, K455, K491, K492, K499, K502, K751, K752, K753, K754, K758, K759 or SNOMED codes 1755008, 22298006, 54329005, 57054005, 65547006, 70211005, 70422006, 73795002, 233838001, 304914007, 401303003, 401314000.

The category-free NRI was used to evaluate the clinical utility. NRI was calculated by adding the PRS to the baseline logistic model including age, sex, the first 10 principal components, and clinical risk factors. The clinical risk factors include total cholesterol, HDL-C, BMI, type 2 diabetes, and current smoking status or model includes only age, sex, and 10 principal components. NRI was calculated as the sum of NRI for cases and NRI for controls:

P (up|case) and P(down|case) estimate the proportion of cases that had higher or lower risk after classification with logistic regression, respectively. The confidence interval for NRI was estimated with 500 bootstraps. We also compared the risk increase between individuals in the top decile of PRS versus those remaining in the population. In addition to liability R² to compare the PRS performance, we also used the incremental area under the curve (AUC) to compare the PRS. The incremental AUC was estimated as the difference between the AUC of models with the integrative score versus the model with only clinical variables.

Phenome-wide association study

We obtained the list of 1815 phecodes from the PheWAS website (last accessed December 2022)⁴⁴. The phecodes were based on ICD-9 and ICD-10 to classify individuals. PheWAS was conducted on European ancestry only in AoU. For each phecodes as the outcome, we conducted an association analysis using logistic regression on PRS and adjusted for age, sex, and first 10 PCs. The significance threshold for PheWAS was estimated as 2.75 × 10⁻⁵ (0.05/1815) after Bonferroni correction.

Data Availability

The PGS Catalog is freely available at https://www.pgscatalog.org/. Our new scores are deposited in the PGS Catalog. The All of Us and Genes & Health individual-level data is a controlled access dataset and may be granted at https://www.researchallofus.org/ and https://www.genesandhealth.org/, respectively.

https://www.researchallofus.org/

https://www.genesandhealth.org/

Data availability

The weights from the PRSmix and PRSmix+ scores in this manuscript have been returned to the PGS Catalog. The R package to implement PRSmix and PRSmix+ in independent datasets is at https://github.com/buutrg/PRSmix.

Software/analyses

Analyses were performed on the AoU Researcher Workbench in Jupyter Notebook 14 using R version 4.0.0 programming language. Results are reported in compliance with the AoU Data and Statistics Dissemination Policy.

CONFLICT OF INTEREST

P.N. reports grants from Allelica, Amgen, Apple, Boston Scientific, Genentech, and Novartis, is a consultant to Allelica, Apple, AstraZeneca, Blackstone Life Sciences, Foresite Labs, HeartFlow, Novartis, Genentech, and GV, scientific advisory board membership to Esperion Therapeutics, Preciseli, and TenSixteen Bio, is a scientific co-founder of TenSixteen Bio, and spousal employment at Vertex Pharmaceuticals, all unrelated to the present work. Others declare no conflict of interest.

ACKNOWLEDGEMENT

We would like to thank Alkes L. Price for critical comments for this works. L.E.H. is supported by the National Human Genome Research Institute (K08HG012221). P.N. is supported by grants from NHGRI (U01HG011719), NHLBI (R01HL142711, R01HL127564, R01HL151152), and Massachusetts General Hospital (Paul & Phyllis Fireman Endowed Chair in Vascular Medicine). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

The All of Us Research Program is supported by the National Institutes of Health, Office of the Director: Regional Medical Centers: 1 OT2 OD026549; 1 OT2 OD026554; 1 OT2 OD026557; 1 OT2 OD026556; 1 OT2 OD026550; 1 OT2 OD 026552; 1 OT2 OD026553; 1 OT2 OD026548; 1 OT2 OD026551; 1 OT2 OD026555; IAA #: AOD 16037; Federally Qualified Health Centers: HHSN 263201600085U; Data and Research Center: 5 U2C OD023196; Biobank: 1 U24 OD023121; The Participant Center: U24 OD023176; Participant Technology Systems Center: 1 U24 OD023163; Communications and Engagement: 3 OT2 OD023205; 3 OT2 OD023206; and Community Partners: 1 OT2 OD025277; 3 OT2 OD025315; 1 OT2 OD025337; 1 OT2 OD025276. In addition, the All of Us Research Program would not be possible without the partnership of its participants.

Genes & Health is/has recently been core-funded by Wellcome (WT102627, WT210561), the Medical Research Council (UK) (M009017), Higher Education Funding Council for England Catalyst, Barts Charity (845/1796), Health Data Research UK (for London substantive site), and research delivery support from the NHS National Institute for Health Research Clinical Research Network (North Thames). Genes & Health is/has recently been funded by Alnylam Pharmaceuticals, Genomics PLC; and a Life Sciences Industry Consortium of Bristol-Myers Squibb Company, GlaxoSmithKline Research and Development Limited, Maze Therapeutics Inc, Merck Sharp & Dohme LLC, Novo Nordisk A/S, Pfizer Inc, Takeda Development Centre Americas Inc.

We thank Social Action for Health, Centre of The Cell, members of our Community Advisory Group, and staff who have recruited and collected data from volunteers. We thank the NIHR National Biosample Centre (UK Biocentre), the Social Genetic & Developmental Psychiatry Centre (King’s College London), Wellcome Sanger Institute, and Broad Institute for sample processing, genotyping, sequencing and variant annotation. We thank: Barts Health NHS Trust, NHS Clinical Commissioning Groups (City and Hackney, Waltham Forest, Tower Hamlets, Newham, Redbridge, Havering, Barking and Dagenham), East London NHS Foundation Trust, Bradford Teaching Hospitals NHS Foundation Trust, Public Health England (especially David Wyllie), Discovery Data Service/Endeavour Health Charitable Trust (especially David Stables), NHS Digital - for GDPR-compliant data sharing backed by individual written informed consent.

Most of all we thank all of the volunteers participating in the All of Us Research Program and Genes & Health.

REFERENCES

1.↵
Catalog, P. G. S. PGS Catalog - the Polygenic Score Catalog. http://www.pgscatalog.org/.
2.↵
Choi, S. W., Mak, T. S.-H. & O’Reilly, P. F. Tutorial: a guide to performing polygenic risk score analyses. Nat. Protoc. 15, 2759–2772 (2020).
OpenUrl PubMed
3.↵
Dudbridge, F. Power and predictive accuracy of polygenic risk scores. PLoS Genet. 9, e1003348 (2013).
OpenUrl CrossRef PubMed
4.↵
Choi, S. W. & O’Reilly, P. SA20 - PRSice 2: POLYGENIC RISK SCORE SOFTWARE (UPDATED) AND ITS APPLICATION TO CROSS-TRAIT ANALYSES. Eur. Neuropsychopharmacol. 29, S832 (2019).
OpenUrl
5.↵
Privé, F., Arbel, J. & Vilhjálmsson, B. J. LDpred2: better, faster, stronger. Bioinformatics (2020) doi:10.1093/bioinformatics/btaa1029.
OpenUrl CrossRef PubMed
6.↵
Vilhjálmsson, B. J. et al. Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores. Am. J. Hum. Genet. 97, 576–592 (2015).
OpenUrl CrossRef PubMed
7.↵
Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C. A. & Smoller, J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019).
OpenUrl
8.↵
Abraham, G. et al. Genomic risk score offers predictive performance comparable to clinical risk factors for ischaemic stroke. Nat. Commun. 10, 5819 (2019).
OpenUrl PubMed
9.↵
Chung, W. et al. Efficient cross-trait penalized regression increases prediction accuracy in large cohorts using secondary phenotypes. Nat. Commun. 10, 569 (2019).
OpenUrl CrossRef
10.↵
Weissbrod, O. et al. Leveraging fine-mapping and multipopulation training data to improve cross-population polygenic risk scores. Nat. Genet. 54, 450–458 (2022).
OpenUrl CrossRef
11.↵
Inouye, M. et al. Genomic Risk Prediction of Coronary Artery Disease in 480,000 Adults: Implications for Primary Prevention. J. Am. Coll. Cardiol. 72, 1883–1893 (2018).
OpenUrl FREE Full Text
12.↵
Ruan, Y. et al. Improving polygenic prediction in ancestrally diverse populations. Nat. Genet. 54, 573–580 (2022).
OpenUrl
13.↵
Albiñana, C. et al. Multi-PGS enhances polygenic prediction: weighting 937 polygenic scores. Preprint at https://doi.org/10.1101/2022.09.14.22279940.
14.↵
Watanabe, K. et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat. Genet. 51, 1339–1348 (2019).
OpenUrl CrossRef PubMed
15.↵
Li, C., Yang, C., Gelernter, J. & Zhao, H. Improving genetic risk prediction by leveraging pleiotropy. Hum. Genet. 133, 639–650 (2014).
OpenUrl CrossRef PubMed
16.↵
Martin, A. R. et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nature Genetics vol. 51 584–591 Preprint at https://doi.org/10.1038/s41588-019-0379-x (2019).
OpenUrl CrossRef PubMed
17.↵
Lambert, S. A. et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat. Genet. 53, 420–425 (2021).
OpenUrl
18.↵
Buch, G., Schulz, A., Schmidtmann, I., Strauch, K. & Wild, P. S. A systematic review and evaluation of statistical methods for group variable selection. Stat. Med. 42, 331– 352 (2023).
OpenUrl
19.↵
Klarin, D. & Natarajan, P. Clinical utility of polygenic risk scores for coronary artery disease. Nat. Rev. Cardiol. 19, 291–301 (2022).
OpenUrl CrossRef
20.
Heart Association Council on Epidemiology, A. Heart disease and stroke statistics— 2022 update: a report from the American Heart Association. Circulation (2022).
21.↵
Arnett, D. K. et al. 2019 ACC/AHA Guideline on the Primary Prevention of Cardiovascular Disease: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Circulation 140, e596– e646 (2019).
OpenUrl CrossRef PubMed
22.↵
Koyama, S. et al. Population-specific and trans-ancestry genome-wide analyses identify distinct and shared genetic risk loci for coronary artery disease. Nat. Genet. 52, 1169– 1177 (2020).
OpenUrl CrossRef PubMed
23.↵
Tamlander, M. et al. Integration of questionnaire-based risk factors improves polygenic risk scores for human coronary heart disease and type 2 diabetes. Commun Biol 5, 158 (2022).
OpenUrl
24.↵
Zhang, H. et al. Novel methods for multi-ancestry polygenic prediction and their evaluations in 5.1 million individuals of diverse ancestry. bioRxiv (2022) doi:10.1101/2022.03.24.485519.
OpenUrl Abstract/FREE Full Text
25.↵
Sud, M. et al. Population-Based Recalibration of the Framingham Risk Score and Pooled Cohort Equations. J. Am. Coll. Cardiol. 80, 1330–1342 (2022).
OpenUrl
26.↵
Carroll, R. J., Bastarache, L. & Denny, J. C. R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment. Bioinformatics 30, 2375– 2376 (2014).
OpenUrl CrossRef PubMed Web of Science
27.↵
Bastarache, L., Denny, J. C. & Roden, D. M. Phenome-Wide Association Studies. JAMA 327, 75–76 (2022).
OpenUrl
28.↵
Wang, M. et al. Validation of a Genome-Wide Polygenic Score for Coronary Artery Disease in South Asians. J. Am. Coll. Cardiol. 76, 703–714 (2020).
OpenUrl FREE Full Text
29.↵
Lloyd-Jones, L. R. et al. Improved polygenic prediction by Bayesian multiple regression on summary statistics. Nat. Commun. 10, 5086 (2019).
OpenUrl
30.↵
Wang, Y., Tsuo, K., Kanai, M., Neale, B. M. & Martin, A. R. Challenges and opportunities for developing more generalizable polygenic risk scores. Annu. Rev. Biomed. Data Sci. 5, 293–320 (2022).
OpenUrl
31.↵
Mostafavi, H. et al. Variable prediction accuracy of polygenic scores within an ancestry group. Elife 9, (2020).
32.↵
Mapes, B. M. et al. Diversity and inclusion for the All of Us research program: A scoping review. PLoS One 15, e0234962 (2020).
OpenUrl CrossRef PubMed
33.↵
The “All of Us” Research Program. N. Engl. J. Med. 381, 668–676 (2019).
OpenUrl CrossRef PubMed
34.↵
Cronin, R. M. et al. Development of the initial surveys for the All of Us Research Program. Epidemiology 30, 597–608 (2019).
OpenUrl CrossRef PubMed
35.↵
All of Us Research Program Protocol. All of Us Research Program | NIH https://allofus.nih.gov/about/all-us-research-program-protocol (2020).
36.↵
Pereira, F. Home. Terra.Bio https://terra.bio/ (2020).
37.↵
Researcher Workbench. https://www.researchallofus.org/workbench/.
38.↵
Data Methods – All of Us Research Hub. https://www.researchallofus.org/data-tools/methods.
39.↵
Finer, S. et al. Cohort Profile: East London Genes & Health (ELGH), a community-based population genomics and health study in British Bangladeshi and British Pakistani people. International Journal of Epidemiology vol. 49 20–21i Preprint at https://doi.org/10.1093/ije/dyz174 (2020).
OpenUrl PubMed
40.↵
Lee, S. H., Clark, S. & van der Werf, J. H. J. Estimation of genomic prediction accuracy from reference populations with varying degrees of relationship. PLoS One 12, e0189775 (2017).
OpenUrl CrossRef
41.↵
Wishart, J., Kondo, T. & Elderton, E. M. The mean and second moment coefficient of the multiple correlation coefficient, in samples from a normal population. Biometrika 22, 353 (1931).
OpenUrl CrossRef Web of Science
42.
Stuart, A., Ord, K. & Arnold, S. Kendall’s Advanced Theory of Statistics, Classical Inference and the Linear Model. (Wiley, 2010).
43.↵
Momin, M. M., Lee, S., Wray, N. R. & Lee, S. H. Significance tests for R2 of out-of-sample prediction using polygenic scores. Am. J. Hum. Genet. (2023) doi:10.1016/j.ajhg.2023.01.004.
OpenUrl CrossRef
44.↵
Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations. Bioinformatics 26, 1205–1210 (2010).
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted February 26, 2023.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Genetic and Genomic Medicine

Subject Areas

All Articles

Addiction Medicine (349)
Allergy and Immunology (668)
Allergy and Immunology (668)
Anesthesia (181)
Cardiovascular Medicine (2648)
Dentistry and Oral Medicine (316)
Dermatology (223)
Emergency Medicine (399)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
Epidemiology (12228)
Forensic Medicine (10)
Gastroenterology (759)
Genetic and Genomic Medicine (4103)
Geriatric Medicine (387)
Health Economics (680)
Health Informatics (2657)
Health Policy (1005)
Health Systems and Quality Improvement (985)
Hematology (363)
HIV/AIDS (851)
Infectious Diseases (except HIV/AIDS) (13695)
Intensive Care and Critical Care Medicine (797)
Medical Education (399)
Medical Ethics (109)
Nephrology (436)
Neurology (3882)
Nursing (209)
Nutrition (577)
Obstetrics and Gynecology (739)
Occupational and Environmental Health (695)
Oncology (2030)
Ophthalmology (585)
Orthopedics (240)
Otolaryngology (306)
Pain Medicine (250)
Palliative Medicine (75)
Pathology (473)
Pediatrics (1115)
Pharmacology and Therapeutics (466)
Primary Care Research (452)
Psychiatry and Clinical Psychology (3432)
Public and Global Health (6527)
Radiology and Imaging (1403)
Rehabilitation Medicine and Physical Therapy (814)
Respiratory Medicine (871)
Rheumatology (409)
Sexual and Reproductive Health (410)
Sports Medicine (342)
Surgery (448)
Toxicology (53)
Transplantation (185)
Urology (165)

[1] 1.↵
Catalog, P. G. S. PGS Catalog - the Polygenic Score Catalog. http://www.pgscatalog.org/.

[2] 2.↵
Choi, S. W., Mak, T. S.-H. & O’Reilly, P. F. Tutorial: a guide to performing polygenic risk score analyses. Nat. Protoc. 15, 2759–2772 (2020).
OpenUrl PubMed

[3] 3.↵
Dudbridge, F. Power and predictive accuracy of polygenic risk scores. PLoS Genet. 9, e1003348 (2013).
OpenUrl CrossRef PubMed

[4] 4.↵
Choi, S. W. & O’Reilly, P. SA20 - PRSice 2: POLYGENIC RISK SCORE SOFTWARE (UPDATED) AND ITS APPLICATION TO CROSS-TRAIT ANALYSES. Eur. Neuropsychopharmacol. 29, S832 (2019).
OpenUrl

[5] 5.↵
Privé, F., Arbel, J. & Vilhjálmsson, B. J. LDpred2: better, faster, stronger. Bioinformatics (2020) doi:10.1093/bioinformatics/btaa1029.
OpenUrl CrossRef PubMed

[6] 6.↵
Vilhjálmsson, B. J. et al. Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores. Am. J. Hum. Genet. 97, 576–592 (2015).
OpenUrl CrossRef PubMed

[7] 7.↵
Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C. A. & Smoller, J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019).
OpenUrl

[8] 8.↵
Abraham, G. et al. Genomic risk score offers predictive performance comparable to clinical risk factors for ischaemic stroke. Nat. Commun. 10, 5819 (2019).
OpenUrl PubMed

[9] 9.↵
Chung, W. et al. Efficient cross-trait penalized regression increases prediction accuracy in large cohorts using secondary phenotypes. Nat. Commun. 10, 569 (2019).
OpenUrl CrossRef

[10] 10.↵
Weissbrod, O. et al. Leveraging fine-mapping and multipopulation training data to improve cross-population polygenic risk scores. Nat. Genet. 54, 450–458 (2022).
OpenUrl CrossRef

[11] 11.↵
Inouye, M. et al. Genomic Risk Prediction of Coronary Artery Disease in 480,000 Adults: Implications for Primary Prevention. J. Am. Coll. Cardiol. 72, 1883–1893 (2018).
OpenUrl FREE Full Text

[12] 12.↵
Ruan, Y. et al. Improving polygenic prediction in ancestrally diverse populations. Nat. Genet. 54, 573–580 (2022).
OpenUrl

[13] 13.↵
Albiñana, C. et al. Multi-PGS enhances polygenic prediction: weighting 937 polygenic scores. Preprint at https://doi.org/10.1101/2022.09.14.22279940.

[14] 14.↵
Watanabe, K. et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat. Genet. 51, 1339–1348 (2019).
OpenUrl CrossRef PubMed

[15] 15.↵
Li, C., Yang, C., Gelernter, J. & Zhao, H. Improving genetic risk prediction by leveraging pleiotropy. Hum. Genet. 133, 639–650 (2014).
OpenUrl CrossRef PubMed

[16] 16.↵
Martin, A. R. et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nature Genetics vol. 51 584–591 Preprint at https://doi.org/10.1038/s41588-019-0379-x (2019).
OpenUrl CrossRef PubMed

[17] 17.↵
Lambert, S. A. et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat. Genet. 53, 420–425 (2021).
OpenUrl

[18] 18.↵
Buch, G., Schulz, A., Schmidtmann, I., Strauch, K. & Wild, P. S. A systematic review and evaluation of statistical methods for group variable selection. Stat. Med. 42, 331– 352 (2023).
OpenUrl

[19] 19.↵
Klarin, D. & Natarajan, P. Clinical utility of polygenic risk scores for coronary artery disease. Nat. Rev. Cardiol. 19, 291–301 (2022).
OpenUrl CrossRef

[20] 20.
Heart Association Council on Epidemiology, A. Heart disease and stroke statistics— 2022 update: a report from the American Heart Association. Circulation (2022).

[21] 21.↵
Arnett, D. K. et al. 2019 ACC/AHA Guideline on the Primary Prevention of Cardiovascular Disease: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Circulation 140, e596– e646 (2019).
OpenUrl CrossRef PubMed

[22] 22.↵
Koyama, S. et al. Population-specific and trans-ancestry genome-wide analyses identify distinct and shared genetic risk loci for coronary artery disease. Nat. Genet. 52, 1169– 1177 (2020).
OpenUrl CrossRef PubMed

[23] 23.↵
Tamlander, M. et al. Integration of questionnaire-based risk factors improves polygenic risk scores for human coronary heart disease and type 2 diabetes. Commun Biol 5, 158 (2022).
OpenUrl

[24] 24.↵
Zhang, H. et al. Novel methods for multi-ancestry polygenic prediction and their evaluations in 5.1 million individuals of diverse ancestry. bioRxiv (2022) doi:10.1101/2022.03.24.485519.
OpenUrl Abstract/FREE Full Text

[25] 25.↵
Sud, M. et al. Population-Based Recalibration of the Framingham Risk Score and Pooled Cohort Equations. J. Am. Coll. Cardiol. 80, 1330–1342 (2022).
OpenUrl

[26] 26.↵
Carroll, R. J., Bastarache, L. & Denny, J. C. R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment. Bioinformatics 30, 2375– 2376 (2014).
OpenUrl CrossRef PubMed Web of Science

[27] 27.↵
Bastarache, L., Denny, J. C. & Roden, D. M. Phenome-Wide Association Studies. JAMA 327, 75–76 (2022).
OpenUrl

[28] 28.↵
Wang, M. et al. Validation of a Genome-Wide Polygenic Score for Coronary Artery Disease in South Asians. J. Am. Coll. Cardiol. 76, 703–714 (2020).
OpenUrl FREE Full Text

[29] 29.↵
Lloyd-Jones, L. R. et al. Improved polygenic prediction by Bayesian multiple regression on summary statistics. Nat. Commun. 10, 5086 (2019).
OpenUrl

[30] 30.↵
Wang, Y., Tsuo, K., Kanai, M., Neale, B. M. & Martin, A. R. Challenges and opportunities for developing more generalizable polygenic risk scores. Annu. Rev. Biomed. Data Sci. 5, 293–320 (2022).
OpenUrl

[31] 31.↵
Mostafavi, H. et al. Variable prediction accuracy of polygenic scores within an ancestry group. Elife 9, (2020).

[32] 32.↵
Mapes, B. M. et al. Diversity and inclusion for the All of Us research program: A scoping review. PLoS One 15, e0234962 (2020).
OpenUrl CrossRef PubMed

[33] 33.↵
The “All of Us” Research Program. N. Engl. J. Med. 381, 668–676 (2019).
OpenUrl CrossRef PubMed

[34] 34.↵
Cronin, R. M. et al. Development of the initial surveys for the All of Us Research Program. Epidemiology 30, 597–608 (2019).
OpenUrl CrossRef PubMed

[35] 35.↵
All of Us Research Program Protocol. All of Us Research Program | NIH https://allofus.nih.gov/about/all-us-research-program-protocol (2020).

[36] 36.↵
Pereira, F. Home. Terra.Bio https://terra.bio/ (2020).

[37] 37.↵
Researcher Workbench. https://www.researchallofus.org/workbench/.

[38] 38.↵
Data Methods – All of Us Research Hub. https://www.researchallofus.org/data-tools/methods.

[39] 39.↵
Finer, S. et al. Cohort Profile: East London Genes & Health (ELGH), a community-based population genomics and health study in British Bangladeshi and British Pakistani people. International Journal of Epidemiology vol. 49 20–21i Preprint at https://doi.org/10.1093/ije/dyz174 (2020).
OpenUrl PubMed

[40] 40.↵
Lee, S. H., Clark, S. & van der Werf, J. H. J. Estimation of genomic prediction accuracy from reference populations with varying degrees of relationship. PLoS One 12, e0189775 (2017).
OpenUrl CrossRef

[41] 41.↵
Wishart, J., Kondo, T. & Elderton, E. M. The mean and second moment coefficient of the multiple correlation coefficient, in samples from a normal population. Biometrika 22, 353 (1931).
OpenUrl CrossRef Web of Science

[42] 42.
Stuart, A., Ord, K. & Arnold, S. Kendall’s Advanced Theory of Statistics, Classical Inference and the Linear Model. (Wiley, 2010).

[43] 43.↵
Momin, M. M., Lee, S., Wray, N. R. & Lee, S. H. Significance tests for R2 of out-of-sample prediction using polygenic scores. Am. J. Hum. Genet. (2023) doi:10.1016/j.ajhg.2023.01.004.
OpenUrl CrossRef

[44] 44.↵
Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations. Bioinformatics 26, 1205–1210 (2010).
OpenUrl CrossRef PubMed Web of Science