Inference of causal and pleiotropic effects with multiple weak genetic instruments: application to effect of adiponectin on type 2 diabetes =========================================================================================================================================== * Paul M McKeigue * Andrii Iakovliev * Athina Spiliopoulou * Buddhiprabha Erabadda * Helen M Colhoun ## Abstract Current methods for Mendelian randomization (MR) analyses are restricted to single SNP instruments, and cannot reliably infer causality with instruments that are mostly weak and pleiotropic. We describe methods to overcome these limitations: key innovations are construction of scalar instruments from multiple SNPs, use of a regularized horseshoe prior, and hypothesis tests based on the marginal likelihood of the causal effect parameter. To demonstrate the approach, we constructed genotypic instruments from unlinked *trans*-pQTLs detected in two large GWAS studies of plasma proteins, and tested the top 20 genes for which the aggregated effects of the instruments was associated with type 2 diabetes in the UK Biobank cohort. The only protein with clear evidence of a causal effect on type 2 diabetes was adiponectin, encoded by *ADIPOQ*: standardized log odds ratio -0.34 (95% CI -0.44 to -0.24) using UK Biobank instruments. These results have implications for the design and analysis of Mendelian randomization studies. Where the exposure under study is expression of a gene, restricting the instruments to *cis*-acting variants is likely to miss causal effects. Tests based on the marginal likelihood should supersede other methods of testing for causality in the presence of pleiotropy. ## Introduction Instrumental variable analysis with genetic instruments (“Mendelian randomization”) has been widely used to infer causal effects of exposures (broadly defined to include behavioural traits, biomarkers and gene expression levels) on diseases. The biggest methodological challenge is how to infer causality when some of the genetic instruments have direct (pleiotropic) effects on the outcome that are not mediated through the exposure under study. These pleiotropic effects are usually not directly observed, and their distribution over the instruments is unknown. Inference of causality in the presence of pleiotropic effects has relied on makeshift procedures for the construction of estimators such as weighted medians that downweight or exclude instruments that are outliers.1 The results depend on which estimator is used: recent guidelines suggest that “investigators should pick a sensible range of methods to assess the sensitivity of their findings.”2 The motivation for the work reported here was to infer causality from genetic instruments constructed from *trans*-QTLs that perturb expression (as levels of the transcript or encoded protein) of a gene. Mendelian randomization studies of gene expression are usually restricted to *cis*-QTLs, on the basis that *trans*-effects are too weak and pleiotropic to be used as genetic instruments.2 Weak instruments are excluded because weak estimators constructed from ratios of estimated coefficients have unstable sampling properties where the denominators of these ratios – the instrument-exposure coefficients – can take values close to zero. With recently-described Bayesian methods that use an unregularized horseshoe prior to marginalize over the distribution of pleiotropic effects to infer the causal effect,3,4 there is no need to exclude weak instruments. However few real-world application of these methods have been reported. For studies that use *trans*-QTLs as instruments, a key limitation of all existing methods is that they cannot aggregate the effects of multiple SNPs at each locus to construct unlinked instruments that can be modelled as independent. The usual procedure is to select a single SNP that is reliably associated with the exposure from each genomic region in which genetic associations with the exposure have beeen detected. This does not use all the available information about the effects of variants in the region on the exposure, and the results may depend upon which variants are selected. This paper describes methods that overcome these limitations, and demonstrates their application to infer causal effects of circulating proteins on type 2 diabetes in the UK Biobank cohort. ## Methods ### Statistical model For a Mendelian randomization study with *J* unlinked genetic instruments, we specify a model with three parameters: * *α* vector of coefficients of effects of the instruments on exposure *X*. * *β* vector of coefficients of direct (pleiotropic) effects of the instruments on outcome *Y* * *θ* causal effect of *X* on *Y* From summary statistics we have estimates ![Graphic][1] of the effects *α, γ* of the *J* instruments on the exposure and the outcome, with corresponding standard errors *s**α*, *s**γ*. We assume that these estimates have been adjusted for relevant covariates such as genetic background. As the instruments are unlinked, we can model the estimates as independent Gaussian variables conditional on the true values. ![Formula][2] The crude effect of each instrument on the outcome is the sum of the direct effect and the causal effect. ![Formula][3] With priors on *β, θ* we can compute the joint posterior distribution of these parameters given ![Graphic][4]. As the form of the distribution of pleiotropic effects over loci is unknown, any realistic statistical model has to specify a prior on these effects that encompassses a broad family of symmetric distributions ranging from a spike-and-slab mixture to a Gaussian. One such prior is the horseshoe,5 which has been applied to inference of causal effects from Mendelian randomization using individual-level data3 or summary-level data.4 A limitation of the original horseshoe is that the posterior distribution is difficult to sample from because the curvature varies. This problem is not obvious when Gibbs sampling is used,4 but is revealed by the divergence diagnostics that are implemented in the NUTS (No U-Turn Sampling) algorithm used in Stan and other probabilistic programming languages.6 Another limitation of the original horseshoe is that there is no way of specifying a realistic prior on the size of the nonzero effects in the “slab” component. When modelling the genetics of complex traits, we already know that the effects of common genetic variants on a complex trait are usually small. The regularized horseshoe prior, known as the “Finnish horseshoe”, overcomes these limitations as described below. We name our method “MR-Hevo”, using the Finnish word for a horse to distinguish it from the recently-described “MR-Horse” method which uses an unregularized horseshoe prior.4 Fig 1 shows the model as a directed acyclic graph in plate notation. The regularized horseshoe prior for the regression coefficients *β*1, …, *β**J* is ![Fig 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/04/09/2023.12.15.23300008/F1.medium.gif) [Fig 1.](http://medrxiv.org/content/early/2024/04/09/2023.12.15.23300008/F1) Fig 1. MR-Hevo model as a directed acyclic graph in plate notation. Stochastic nodes are shown as ellipses with continuous borders, deterministic nodes as ellipses with dashed borders. Observed nodes are shaded. ![Formula][5] Half-Cauchy priors are specified on the unregularized local scale parameters *λ**j* and the global scale parameter *τ*. ![Formula][6] The heavy tail of the half-Cauchy priors on *λ**j* allows some of the regression coefficients to escape the shrinkage imposed by small values of the global parameter *τ*. These nonzero coefficients are the slab component of the spike-and-slab distribution. A weakly informative prior is specified for the regularization parameter *η*: ![Formula][7] Even the largest direct effects will be regularized as a Gaussian with standard deviation *η*. The scaling factor *s*slab is specified based on prior information about the expected size of the largest direct effects. To constrain *η* so that the sampling algorithm does not diverge, we set ν = 2; this translates to a half-*t*2df prior on the scale of the largest direct effects. ### Shrinkage coefficients and effective number of nonzero effects Calculation of the shrinkage coefficients and the effective number of nonzero parameters does not affect the results of statistical modelling, but is helpful for interpretation of what the horseshoe prior is learning. The shrinkage coefficient *κ**j* for the *j*th regression coefficient *β**j*, with prior ![Graphic][8] and Gaussian likelihood with Fisher information ℐ*j* can be defined as the fraction by which the information in the prior shrinks the posterior mean of *β**j* from the maximum likelihood estimate: ![Formula][9] This definition of the shrinkage coefficient ignores the regularization parameter *η*, which imposes a lower bound on the shrinkage of *β**j*. The prior on each shrinkage coefficient has a horseshoe shape. The shrinkage coefficients *κ**j* can take values from 0 (no shrinkage) to 1 (complete shrinkage). ℐ*j* is asymptotically equivalent to the inverse variance of the maximum likelihood estimate of *β**j* at *θ* = 0. When using summary statistics, the approximate variance of the maximum likelihood estimate of the ratio *β**j* = *γ**j**/α**j* can be calculated by the delta method. The effective fraction *f* of nonzero coefficients is ![Formula][10] The prior expectation of *f* is ![Formula][11] The information ℐ*j* on the coefficient *β**j* depends on the sample sizes in the studies from which the summary statistics ![Graphic][12] and ![Graphic][13] were obtained. With more information, a smaller value of *τ* is required to impose the same prior expectation of *f*. Piironen and Vehtari recommend setting the scale of the prior on *τ* to be consistent with this prior expectation.6 For this analysis we have set the scale of the Cauchy prior on *τ* to so that the prior median *τ* is the value at which the prior expectation of *f* given ℐ1, …, ℐ*J* is 0.2, and we have set the scale of the Inverse-Gamma prior on *η*2 as 0.1, encoding our prior knowledge that effects of any single genomic region on type 2 diabetes are of modest size. We examined the sensitivity of the results to these prior settings as described below. ### Computational methods The NUTS algorithsm is implemented in several recently developed probabilistic programming languages, including Stan, PyMC, and NumPyro. MR-Hevo is implemented in NumPyro, which has a more robust implementation of the NUTS algorithm than Stan giving fewer divergent transitions. To improve posterior geometry, the Cauchy distributions are parameterized as mixture distributions (Gaussian with inverse gamma distribution of scale parameter). A divergent transitions rate of less than 1 in 1000 is considered acceptable. This criterion was met with the regularized horseshoe, but not with the original horseshoe for which the divergent transition rate was about 5%. To obtain the marginal likelihood as a function of the parameter of interest, we simply divide the posterior by the prior on that parameter. We calculated the likelihood of the causal effect parameter *θ* by fitting a kernel density to the posterior samples of *θ*, weighting each observation by the inverse of the prior. A quadratic function was fitted by least-squares to the logarithm of this likelihood function. Where the log-likelihood is asymptotically quadratic, a confidence interval for *θ* and a test of the null hypothesis *θ* = 0 can be obtained by standard methods. Where the log-likelihood is not quadratic, inference should be based on the posterior distribution rather than on sampling theory. ### Constructing scalar instruments from multiple SNPs #### Instrument-exposure coefficients From summary statistics we have for each clump of exposure-associated SNPs a vector ![Graphic][14] of univariate coefficient estimates for the effect of the SNPs on the exposure. Multivariable coefficient estimates ![Graphic][15] are calculated by premultiplying the univariate coefficients by the inverse of the correlation matrix **Σ*****G*** between the SNP genotypes. ![Formula][16] The correlation matrix **Σ*****G*** is obtained from a reference panel such as 1000 Genomes. A shrinkage penalty (equivalent to ridge regression) can be imposed by adding a penalty factor to the diagonal elements of **Σ*****G***. Where the correlation matrix is singular or ill-conditioned, a pseudo-inverse can be used to calculate the multivariable coefficients. For an individual with genotypes ***G*** at the exposure-associated SNPs, a locus-specific score *S* predicting the exposure is calculated as ![Graphic][17]. Because the score *S* is calculated from the genotypes and the genotype-exposure coefficients, we cannot use it as a genotypic instrument in a model of the relationship of the genotype-outcome coefficients to the genotype-exposure coefficients. We can factor the dot product ![Graphic][18] as the product of two scalars: the magnitude of the multivariable coefficient vector ![Graphic][19] and a pseudo-genotype ![Graphic][20], where ![Graphic][21] is the correlation between ***G*** and ![Graphic][22], geometrically equivalent to the cosine of the angles between these vectors. We can then substitute ![Graphic][23] for the scalar coefficient estimate ![Graphic][24] and ![Graphic][25] for the scalar instrument *Z* as a pseudo-genotype in the statistical model. The derived coefficient for the estimated effect of the instrument on the exposure is always positive; flipping the coding of the alleles would flip the sign of *ρ**G,α*. We can calculate the standard error of ![Graphic][26]as ![Graphic][27] where ℐ*α* is the Fisher information on the slope of the linear regression of *X* on *Z*, given by ![Formula][28] where *N**α* is the sample size of the study from which the estimated univariate coefficients ![Graphic][29] were obtained, Var (*Z*) is the variance of the scalar instrument *Z* estimated in a reference panel, and ![Graphic][30] is the variance of the exposure *X*. If an estimate of ![Graphic][31] is not given, it can be calculated from the allele frequency *a**j* of the *j*th SNP, the standard error *s**G*(*j*) of the *j*th univariate coefficient estimate ![Graphic][32], and the sample size *N**α* as ![Formula][33] . #### Instrument-outcome coefficients For this study, where we have individual-level data in UK Biobank for the scalar instrument *Z* and the outcome *Y*, we estimated the coefficients for the effect of the scalar instruments *Z* on the outcome *Y* and its standard error by fitting logistic regression models. A method for calculating the instrument-outcome coefficient ![Graphic][34] and its standard error from summary-level data for SNP-outcome coefficients is described below. For each instrument we standardize the pseudo-genotypes *Z* to unit variance and scale the coefficient estimates accordingly, so that the variance explained by the *j*th linear predictor ![Graphic][35], which can be interpreted as the “strength of the instrument”, is proportional to the square of the scalar coefficient ![Graphic][36]. The prior on the scale of the pleiotropic effects *β* of the instruments *Z* is thus independent of the strength of the instrument. Where only summary-level data are available for the genotype-outcome associations, an estimate ![Graphic][37] of the coefficient for the effect of the scalar instrument *Z* on the outcome and its standard error can be obtained using individual-level genotype data from a reference panel. First, the vector ![Graphic][38] of estimated multivariable coefficients is calculated by premultiplying the estimates ![Graphic][39] of the univariate coefficients by ![Graphic][40]. The coefficient ![Graphic][41] for the scalar instrument *Z* is then estimated by minimizing the sum of the squared differences between the predictor calculated from *Z* and the linear predictor calculated from the vector of genotypes ***G***, where the sum is taken over the genotypes of all *n* individuals in the reference panel. ![Formula][42] Equating to zero the derivative of this expression with respect to *γ*, substituting ![Graphic][43] for *Z**i* and factoring the dot product ![Graphic][44] we obtain ![Formula][45] The ratio in this expression can be recognized as a weighted average of the ratio ![Graphic][46], with weights ![Graphic][47]. The standard error of ![Graphic][48] is ![Graphic][49] where ℐ*γ* is the Fisher information on the slope of the regression of *Y* on *Z*, given (for a logistic regression model) by ![Formula][50] where *N**γ* and *p* are respectively the total sample size and the the proportion of cases in the dataset from which the coefficient estimates ![Graphic][51] were obtained. ### Example: effect of adiponectin on type 2 diabetes As the methods described above overcome some of the limitations of standard methods for design and analysis of Mendelian randomization studies, they may help to resolve contradictory results obtained in earlier studies. As an example, we examine the effect of adiponectin, encoded by the *ADIPOQ* gene, on the risk of type 2 diabetes. Low adiponectin levels are strongly associated with type 2 diabetes. Of four previous studies using Mendelian randomization to investigate whether low adiponectin levels cause type 2 diabetes, two reported no evidence of a causal effect7,8 and two reported evidence for a causal effect.9,10 We obtained coefficient estimates for the effects of SNPs on circulating levels of adiponectin from two studies: * The DeCODE study of 35559 Icelanders in which 4719 proteins on the SomaLogic v4 panel were measured in plasma.11 2207 aptamers on this platform that appeared to cross-react with complement factor H were excluded. The criteria for identifying these aptamers were: a *trans*-pQTL at the *CFH* locus and no *cis*-pQTL for the protein that the aptamer was designed to detect and association of the *trans*-score with age-related macular degeneration. * The UK Biobank proteomics study of 54306 individuals in which 2923 proteins on the Olink Explore panel were measured in plasma12 The calculated standard deviation of adiponectin levels was 1.14 in the DeCODE dataset and 0.83 in the UK Biobank dataset. In each genomic region where there was at least one SNP-exposure association with *p* < 10*−*6, all SNPs with *p* < 10*−*6 separated by no more than 1 Mb were included in the calculation of the multivariable coefficient vector. This yielded 26 unlinked genotypic instruments from the DeCODE study, calculated from 567 SNPs, and 44 instruments from the UK Biobank proteomics study, calculated from 945 SNPs. Each instrument was annotated with a list of nearby genes, and the genes on each list were matched to a list of 256 genes for which SNP associations with type 2 diabetes have been reported at *p* < 5× 10*−*8, extracted from GWAS Catalog. The target dataset with genotypes and type 2 diabetes status was the UK Biobank cohort. Ethical approval for the UK Biobank study was granted in 2011 by the North West Multi-centre Research Ethics Committee (11/NW/0382), and renewed every five years since then. Informed consent was obtained for all participants in UK Biobank. The work described herein was approved by the UK Biobank under application number 23652. Separate analyses were undertaken using the instruments constructed from these two proteomics studies. When using instruments obtained from the UK Biobank proteomics study, the 54306 participants who were in the proteomics study were excluded from the target dataset. Because the individuals in the proteomics studies were predominantly of European ancestry, the target dataset was restricted to individuals of European ancestry. Of 451447 unrelated individuals with nonmissing phenotype and genotype data, 37981 were classified as cases of type 2 diabetes and the all others were classified as noncases. For each instrument, the pseudo-genotypes were standardized to unit variance and a logistic regression model was fitted with type 2 diabetes as outcome variable. The other covariates were sex and the first five principal components of the SNP relationship matrix. The coefficients for the effect of each scalar instrument on type 2 diabetes were used to fit the model. *Cis*-pQTLs were excluded. For comparison with inference based on the marginal likelihood of the causal effect parameter, three widely-used estimators for the causal effect were calculated from the coefficient ratios: the inverse-variance weighted mean of the coefficient ratios which assumes no pleiotropy, the weighted median, and the penalized weighted median which downweights outliers.1 The coefficient ratio estimates and their standard errors were calculated by the delta method, based on second-order Taylor expansions for the moments of the distribution of the ratio of two independent normally-distributed variables. Standard errors for the weighted median and penalized weighted median estimators were calculated by a parametric bootstrap method. To generate the sampling distribution of the weighted median, we used 4000 draws from the posterior distribution of the regularized shrinkage parameters ![Graphic][52] and the instrument effects *α**j*. At each draw we sampled the direct effects *β**j* of *J* new genetic instruments, and sampled the coefficient ratio estimates ![Graphic][53] (calculated by the delta method) conditional on *β**j*, *α**j* and the standard errors s ![Graphic][54] of these estimates. The standard deviation of the weighted median of the coefficient ratio estimates obtained from this posterior predictive distribution was calculated and used to obtain *p*-values and confidence intervals. ## Results Table S1 shows summary statistics for the genetic instruments for adiponectin derived from each pQTL study. There is suprisingly little overlap between the 24 *trans*-pQTLs detected in the DeCODE study and the 43 *trans*-pQTLs detected in UK Biobank. Only eleven *trans*-pQTLs – gene-poor regions at 219.4 Mb on chromosome 1, 34.2 Mb on chr 6, 139.5 Mb on chromosome 6, and regions containing *ALAS1, ADRB1, SPON1, PDE3A, ABCB9, CDH13, AKR1B1P7*, and *MIR6813* – were detected in both studies at a threshold of *p* < 10*−*6. This may reflect the low power to detect weak *trans*-pQTLs even in these large datasets. In both studies there is a *cis*-QTL at the *ADIPOQ* transcription site. In the DeCODE study an extended *cis*-pQTL is detected 2 Mb upstream of the transcription site. Only two pQTLs – *SPON1* and *APOC1* – contained genes that were listed in GWAS Catalog as associated with type 2 diabetes. Figure 2 shows that 22 of 24 *trans*-pQTL instruments from DeCODE and 39 of 43 *trans*-pQTL instruments from UKBB are inversely associated with type 2 diabetes. The *cis*-pQTL instruments are not associated with type 2 diabetes. ![Fig 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/04/09/2023.12.15.23300008/F2.medium.gif) [Fig 2.](http://medrxiv.org/content/early/2024/04/09/2023.12.15.23300008/F2) Fig 2. Plot of coefficients of regression of type 2 diabetes on each instrument against coefficients of regression of adiponectin levels on each instrument. Size of each data point is inversely proportional to the standard error of the ratio estimate. The value of each estimator is shown as the slope of a line passing through the origin. *Cis*-pQTLs are excluded from these estimates. Table 2 shows the estimates of the causal effect parameter obtained by each method, as log odds ratios for unit change in adiponectin levels. The maximum likelihood estimates obtained by marginalizing over direct effects were -0.4 (95% CI -0.52 to -0.27) using DeCODE instruments and -0.34 (95% CI -0.44 to -0.24) using UK Biobank instruments. In comparison with the weighted median or penalized weighted median estimators, the confidence intervals obtained from the likelihood function were wider and the *p*-values were more conservative. View this table: [Table 1.](http://medrxiv.org/content/early/2024/04/09/2023.12.15.23300008/T1) Table 1. Posterior summaries of global parameters of MR-Hevo models of the effect of instruments for adiponectin levels on type 2 diabetes View this table: [Table 2.](http://medrxiv.org/content/early/2024/04/09/2023.12.15.23300008/T2) Table 2. Effect of adiponectin on type 2 diabetes: comparison of causal effect parameter estimates obtained with different methods Supplementary Figure S2 shows the posterior distribution and log-likelihood of the causal effect parameter using UK Biobank instruments. As expected with this large sample size, the posterior distribution is approximately Gaussian and the log-likelihood is well approximated by a quadratic function. Supplementary Table 1 shows posterior summaries of the model parameters. For each of the two sets of instruments, the prior median of the global shrinkage parameter *τ* was set to a value that translated to an expectation of 0.2 for the effective fraction *f* of instruments with nonzero pleiotropic effects. The posterior medians of *τ* and *f* were much lower than the prior medians. Consistent with this low estimate of the fraction of instruments that have nonzero pleiotropic effects, Figure 3 shows that only a few instruments have clearly escaped shrinkage: *of the DeCODE instruments these were APOC1 CDH13* and *ABCB9*, and of the UK Biobank instruments these were *CDH13, ABCB9* and *CCND2*. ![Fig 3:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/04/09/2023.12.15.23300008/F3.medium.gif) [Fig 3:](http://medrxiv.org/content/early/2024/04/09/2023.12.15.23300008/F3) Fig 3: Effects of trans-QTLs for adiponectin on type 2 diabetes: posterior medians and 80% credible intervals for direct effects and shrinkage coefficients Specifying different values for the scale of the prior on the global shrinkage parameter *τ* which regulates the number of nonzero pleiotropic effects, or for the scale of the prior on the regularization parameter *η* which regulates the size of these effects, did not appreciably alter the results. Fig S3 compares, for the top 20 genome-wide aggregated *trans*-effect scores associated with type 2 diabetes in the UK Biobank cohort, the *p*-values calculated from the weighted median and penalized weighted median estimators (using the procedure described above to calculate the standard deviation of these estimators), with *p*-values based on the marginal likelihood. *ADIPOQ* was the only gene for which *p*-values based on the marginal likelihood provided strong evidence of a causal effect. The p-values obtained from tests based on the sampling distribution of the weighted median were broadly concordant with those obtained from tests based on the marginal likelihood but were less extreme. This is to be expected, as the median is less efficient than the maximum likelihood estimate. ## Discussion Mendelian randomization analysis, combined with related approaches including transcriptom-wide association studies and co-localization analysis, has been widely used to infer causal effects of gene expression on outcomes. The most widely-used methods use summary statistics for the coefficients for the effect of each SNP on the outcome and on the exposure from non-overlapping samples.13 “Estimators” for the causal effect are constructed from the ratios of SNP-outcome to SNP-exposure coefficients. Bayesian methods for marginalizing over the unobserved pleiotropic effects to infer the causal effect parameter in a Mendelian randomization study have been described previously.3,4 The methods described in this paper extend this approach: (1) by allowing unlinked scalar genetic instruments to be constructed from all SNPs associated with the exposure without having to restrict to one SNP from each genomic region; (2) by using a regularized horseshoe prior which allows encoding of prior knowledge that genetic effects of a single genomic region on a complex trait are usually small; and (3) by constructing classical hypothesis tests based on the likelihood of the causal effect parameter. The difficulties associated with using ratios of coefficients to construct estimators are eliminated in a Bayesian framework, where nothing is “estimated”. Where the log-likelihood is asymptotically quadratic, statistical theory guarantees that the maximum-likelihood estimate has the sampling properties that are desirable for an estimator: consistency, minimum variance and unbiasedness. Where the log-likelihood is not approximately quadratic, it is preferable to base inference directly on the likelihood function rather than on the sampling properties of an estimator. Because Bayesian inference does not rely on the sampling properties of ratios, there is no need to exclude “weak instruments”. This makes it possible to integrate Mendelian randomization into genome-wide aggregated *trans*-effects analysis, which attempts to identify core genes on which the polygenic effects of many variants coalesce via *trans*-effects to influence complex traits.14 In this approach, the genetic instruments are clumps of SNPs with *trans*-effects on the expression of a gene as transcript or circulating protein. Using instruments constructed from *trans*-pQTLs in two different studies that used different assays for the protein, there is clear support for a causal effect of low adiponectin levels on type 2 diabetes. This is apparent even without a formal statistical analysis, as the signs of the effects of the instruments are almost all negative. Earlier studies that failed to detect a causal effect were restricted to *cis*-acting SNPs.7,8 Some of these *cis*-acting SNPs affect not only the measured levels but also the splicing of the gene product,15 so their effects on measured adiponectin levels do not necessarily correspond to effects on the functional protein. The most direct human evidence for a causal effect of low adiponectin on type 2 diabetes comes from reports of families in which type 2 diabetes segregated with rare loss-of-function variants in *ADIPOQ*.16 In the obese mouse model, overexpression of adiponectin in the liver protected against diabetes.17 Though adiponectin analogues have been studied experimentally, none has reached the clinical stage of drug development. Misleading results from *cis*-Mendelian randomization studies may have discouraged development of a possible drug target. Where, as in this example, the exposure under study is a gene transcript or a circulating protein, study designs that use only *cis*-acting variants as genetic instruments have been advocated.18 The rationale for thse “*cis*-MR” designs is that *cis*-acting variants are less likely to have pleiotropic effects on the outcome that are not mediated through the level of the circulating protein. This supposition is questionable: especially where *cis*-acting variants affect the splicing of the transcript, the relationship between measured levels and functional activity of the protein may be broken. More fundamentally, restriction to *cis*-QTLs focuses on the genes least relevant to disease risk. Most disease/trait-associated SNPs are not co-localized with *cis*-eQTLs detected in relevant cell types.19 Recent studies have shown that disease-relevant genes are enriched with redundant enhancer domains and depleted of *cis*-eQTLs of large effect, as would be expected if perturbation of these genes has large effects on fitness.20,21 Most of the SNP heritability of gene expression is attributable to *trans*-effects.22 Bayesian hypothesis testing is based on the Likelihood Principle: all information conveyed by observations that supports one model or one parameter value over another is contained in the ratio of the likelihoods of these models or parameter values. The likelihood-based approach to causal inference described in this paper is aligned with the not yet universally-accepted principle that causal inference is just a special case of statistical inference, requiring attention to assumptions about exchangeability between observations (of exposure and outcome) and predictions of the effect of perturbing the exposure.23 In this decision-theoretic framework there is no need to invoke counterfactuals24 or to require causal effects to be expressed as marginal rather than as conditional effects. In this example, the likelihood of a model with a causal effect parameter and only a few nonzero pleiotropic effects is higher than the likelihood of a model with no causal effect that requires more nonzero pleiotropic effect parameters to adjust to the data to obtain the same fit. Thus even though the causal effect parameter is not identifiable because for any setting of the causal effect parameter we can find a setting of pleiotropic effect parameters that has the same fit to the data, a formal hypothesis test can be obtained. With horseshoe priors, which allow the distribution of the pleiotropic effects to be learned from the data, this model comparison is performed “under the hood,”25 without the computational inconvenience of having to average over all possible partitions of the variables into two disjoint sets (spike-and-slab) as in the contamination mixture model26 or over different settings of the number of nonzero effect parameters.27 A corollary of inference based on model comparison is that reliable inference of causality in the presence of pleiotropy requires a relatively large number of unlinked genetic instruments to allow learning the distribution of pleiotropic effects. The statistical model assumes that the effects of the instruments on the exposure and their direct effects on the outcome are independent in magnitude and direction: this has been denoted the InSIDE (Instrument Strength Independent of Direct Effect) assumption28 though this term has also been used for the less stringent assumption that “the magnitude of the pleiotropic effects are independent of the SNP-exposure associations.”29 With the procedure described here for constructing scalar instruments, the estimates for the coefficients of the instrument-exposure effects can take only positive values (a negative effect would simply correspond to flipping of the allele coding), and the direct effects are assumed to arise from a distribution with zero mean. This assumption that the instrument-exposure effects and the direct effects are uncoupled in direction is termed “balanced pleiotropy”. It is possible in principle to infer a causal effect if the instrument-exposure effects and the direct effects are assumed to be uncoupled in magnitude even if they are coupled in direction28 but it is not obvious why this would be a biologically realistic assumption. If the statistical model is extended to allow the instrument-exposure effects and the direct effects to be coupled in magnitude and direction, it is not possible to infer causality without other information about pleiotropy because models with and without a causal effect can fit the data equally well with the same number of adjustable parameters. It may be more useful to formulate and test possible hypotheses about mechanisms that could give rise to coupling of instrument-exposure effects with direct effects on the outcome, as this may identify a shared aetiological pathway. For adiponectin and type 2 diabetes, for instance, coupling could arise if the pQTLs for adiponectin are QTLs for a trait such as obesity that causes low adiponectin levels and increased risk of type 2 diabetes. In this situation other circulating proteins such as leptin that are biomarkers for obesity would show a similar pattern of apparently causal pQTL effects on type 2 diabetes. More generally, pQTLs that have pleiotropic effects should be detectable on a heat map of correlations between genotypic scores for multiple proteins or transcripts. Guidelines for the design of Mendelian randomization studies have recently been updated.2 Based on the work described here, some suggestions for revisions to these guidelines are listed below. * For inference of causal effects where pleiotropic effects may be present, the optimal method is to compute the likelihood of the causal effect parameter by marginalizing over the distribution of pleiotropic effects. This is robust to the inclusion of weak instruments, and is computationally efficient with modern tools for Bayesian computation. Investigators should not “pick a range of methods”, but assess the sensitivity of their results to prior assumptions about the distribution of unobserved pleiotropic effects. There is no advantage to using tests based on the sampling distribution of estimators such as the weighted median, when the likelihood can be calculated as a function of the parameter of interest. * There is no need to restrict the selection of SNPs to one SNP from each genomic region: the methods described here allow scalar instruments to be constructed from multiple SNPs. This maximize the strength of the genetic instruments that can be obtained for each genomic region, and ensures that the results do not depend upon an arbitrary choice of which SNP to select from each region. * Study designs that rely on using variants from a single genomic region, for instance using only *cis*-acting variants to study the effects of a gene transcript or gene product on a disease or trait, are likely to give misleading results and should be deprecated. Because *cis*-acting variants are likely to affect the function of the gene through mechanisms (such as altered splicing) other than the measured levels of the transcript or gene product, it is preferable to report the effects of a *cis*-QTL separately. ## Data Availability UK Biobank data are available to approved researchers via managed access. A guide to accessing data is available at [https://biobank.ctsu.ox.ac.uk/crystal/exinfo.cgi?src=accessing\_data\_guide](https://biobank.ctsu.ox.ac.uk/crystal/exinfo.cgi?src=accessing_data_guide) [https://github.com/molepi-precmed/mrhevo](https://github.com/molepi-precmed/mrhevo). ## Declarations ### Data and code availability A Python script using NumPyro to fit the MR-Hevo model, together with the summary-level data used in this paper, is available at [https://github.com/molepi-precmed/mrhevo](https://github.com/molepi-precmed/mrhevo). ## Declaration of interests The authors declare no competing interests. ## Web resources GWAS Catalog: [https://www.ebi.ac.uk/gwas/](https://www.ebi.ac.uk/gwas/) ## Supplementary information ### Supplementary Figures ![Fig S1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/04/09/2023.12.15.23300008/F4.medium.gif) [Fig S1.](http://medrxiv.org/content/early/2024/04/09/2023.12.15.23300008/F4) Fig S1. Pairs plot of posterior samples using UK Biobank instruments for adiponectin levels ![Fig S2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/04/09/2023.12.15.23300008/F5.medium.gif) [Fig S2.](http://medrxiv.org/content/early/2024/04/09/2023.12.15.23300008/F5) Fig S2. Posterior density and log-likelihood of causal effect parameter *θ* for effect of adiponectin on type 2 diabetes, using UK Biobank instruments ![Fig S3.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/04/09/2023.12.15.23300008/F6.medium.gif) [Fig S3.](http://medrxiv.org/content/early/2024/04/09/2023.12.15.23300008/F6) Fig S3. Comparison of p-values from weighted median estimators with p-values from marginal likelihood, for top 20 associations of genome-wide aggregated *trans*-scores with type 2 diabetes in UK Biobank cohort ### Supplementary Tables View this table: [Table S1.](http://medrxiv.org/content/early/2024/04/09/2023.12.15.23300008/T3) Table S1. Summary statistics for regression of type 2 diabetes and adiponectin levels on genetic instruments ## Acknowledgements No specific funding was received for this work. This research has been conducted using the UK Biobank Resource under application number 23652. The development of the GENOSCORES platform was supported by a Springboard Award (SBF006/1109) to AS from the Academy of Medical Sciences, supported in turn by the Wellcome Trust, the UK Government Department of Business, Energy and Industrial Strategy, the British Heart Foundation, and Diabetes UK. AI was supported by the Medical Research Council Cross Disciplinary Fellowship (XDF) Programme (MC_FE_00035). HC is supported by an endowed chair from the AXA Research Foundation. ## Footnotes * Figures revised. Additional discussion of other models, sampling algorithms and estimation methods. * Received December 15, 2023. * Revision received April 8, 2024. * Accepted April 9, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NoDerivs 4.0 International), CC BY-ND 4.0, as described at [http://creativecommons.org/licenses/by-nd/4.0/](http://creativecommons.org/licenses/by-nd/4.0/) ## References 1. 1.Bowden, J., Davey Smith, G., Haycock, P.C., and Burgess, S. (2016). Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator. Genet Epidemiol 40, 304–314. doi:10.1002/gepi.21965. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/gepi.21965&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27061298&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F09%2F2023.12.15.23300008.atom) 2. 2.Burgess, S., Davey Smith, G., Davies, N.M., Dudbridge, F., Gill, D., Glymour, M.M., Hartwig, F.P., Kutalik, Z., Holmes, M.V., Minelli, C., et al. (2023). Guidelines for performing Mendelian randomization investigations: Update for summer 2023. Wellcome Open Res 4, 186. doi:10.12688/wellcomeopenres.15555.3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.12688/wellcomeopenres.15555.3&link_type=DOI) 3. 3.Berzuini, C., Guo, H., Burgess, S., and Bernardinelli, L. (2020). A Bayesian approach to Mendelian randomization with multiple pleiotropic variants. Biostatistics 21, 86–101. doi:10.1093/biostatistics/kxy027. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/biostatistics/kxy027&link_type=DOI) 4. 4.Grant, A.J., and Burgess, S. (2024). A Bayesian approach to Mendelian randomization using summary statistics in the univariable and multivariable settings with correlated pleiotropy. Am J Hum Genet 111, 165–180. doi:10.1016/j.ajhg.2023.12.002. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2023.12.002&link_type=DOI) 5. 5.Carvalho, C.M., Polson, N.G., and Scott, J.G. (2009). Handling Sparsity via the Horseshoe. In Proceedings of the Twelth International Conference on Artificial Intelligence and Statistics (PMLR), pp. 73–80 6. 6.Piironen, J., and Vehtari, A. (2017). Sparsity information and regularization in the horseshoe and other shrinkage priors. Electronic Journal of Statistics 11, 5018–5051. 7. 7.Yaghootkar, H., Lamina, C., Scott, R.A., Dastani, Z., Hivert, M.-F., Warren, L.L., Stancáková, A., Buxbaum, S.G., Lyytikäinen, L.-P., Henneman, P., et al. (2013). Mendelian randomization studies do not support a causal role for reduced circulating adiponectin levels in insulin resistance and type 2 diabetes. Diabetes 62, 3589–3598. doi:10.2337/db13-0128. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZGlhYmV0ZXMiO3M6NToicmVzaWQiO3M6MTA6IjYyLzEwLzM1ODkiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wNC8wOS8yMDIzLjEyLjE1LjIzMzAwMDA4LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 8. 8.Chen, Z., Bai, Y., Long, X., Luo, Q., Wen, Z., Li, Y., Huang, S., Yan, Y., and Mo, Z. (2020). Effects of Adiponectin on T2DM and Glucose Homeostasis: A Mendelian Randomization Study. Diabetes Metab Syndr Obes 13, 1771–1784. doi:10.2147/DMSO.S248352. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2147/DMSO.S248352&link_type=DOI) 9. 9.Nielsen, M.B., Çolak, Y., Benn, M., and Nordestgaard, B.G. (2021). Low Plasma Adiponectin in Risk of Type 2 Diabetes: Observational Analysis and One- and Two-Sample Mendelian Randomization Analyses in 756,219 Individuals. Diabetes 70, 2694–2705. doi:10.2337/db21-0131. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZGlhYmV0ZXMiO3M6NToicmVzaWQiO3M6MTA6IjcwLzExLzI2OTQiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wNC8wOS8yMDIzLjEyLjE1LjIzMzAwMDA4LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 10. 10.Zhang, M., Chen, X., Zhu, Y., Yin, L., Quan, Z., Ou, Y., and He, B. (2022). Causal associations of circulating adiponectin with cardiometabolic diseases and osteoporotic fracture. Sci Rep 12, 6689. doi:10.1038/s41598-022-10586-1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-022-10586-1&link_type=DOI) 11. 11.Ferkingstad, E., Sulem, P., Atlason, B.A., Sveinbjornsson, G., Magnusson, M.I., Styrmisdottir, E.L., Gunnarsdottir, K., Helgason, A., Oddsson, A., Halldorsson, B.V., et al. (2021). Large-scale integration of the plasma proteome with genetics and disease. Nat Genet 53, 1712–1721. doi:10.1038/s41588-021-00978-w. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-021-00978-w&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34857953&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F09%2F2023.12.15.23300008.atom) 12. 12.Sun, B.B., Chiou, J., Traylor, M., Benner, C., Hsu, Y.-H., Richardson, T.G., Surendran, P., Mahajan, A., Robins, C., Vasquez-Grinnell, S.G., et al. (2023). Plasma proteomic associations with genetics and health in the UK Biobank. Nature 622, 329–338. doi:10.1038/s41586-023-06592-6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-023-06592-6&link_type=DOI) 13. 13.Hemani, G., Zheng, J., Elsworth, B., Wade, K.H., Haberland, V., Baird, D., Laurin, C., Burgess, S., Bowden, J., Langdon, R., et al. (2018). The MR-Base platform supports systematic causal inference across the human phenome. eLife 7, e34408. doi:10.7554/eLife.34408. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7554/eLife.34408&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29846171&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F09%2F2023.12.15.23300008.atom) 14. 14.Iakovliev, A., McGurnaghan, S.J., Hayward, C., Colombo, M., Lipschutz, D., Spiliopoulou, A., Colhoun, H.M., and McKeigue, P.M. (2023). Genome-wide aggregated trans-effects on risk of type 1 diabetes: A test of the “omnigenic” sparse effector hypothesis of complex trait genetics. Am J Hum Genet 110, 913–926. doi:10.1016/j.ajhg.2023.04.003. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2023.04.003&link_type=DOI) 15. 15.Lee, B.P., Lloyd-Laney, H.O., Locke, J.M., McCulloch, L.J., Knight, B., Yaghootkar, H., Cory, G., Kos, K., Frayling, T.M., and Harries, L.W. (2016). Functional characterisation of ADIPOQ variants using individuals recruited by genotype. Mol Cell Endocrinol 428, 49–57. doi:10.1016/j.mce.2016.03.020. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.mce.2016.03.020&link_type=DOI) 16. 16.Cook, J.R., and Semple, R.K. (2010). Hypoadiponectinemia–cause or consequence of human “insulin resistance”? J Clin Endocrinol Metab 95, 1544–1554. doi:10.1210/jc.2009-2286. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1210/jc.2009-2286&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20164291&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F09%2F2023.12.15.23300008.atom) 17. 17.Yamauchi, T., Kamon, J., Waki, H., Imai, Y., Shimozawa, N., Hioki, K., Uchida, S., Ito, Y., Takakuwa, K., Matsui, J., et al. (2003). Globular adiponectin protected ob/ob mice from diabetes and ApoE-deficient mice from atherosclerosis. J Biol Chem 278, 2461–2468. doi:10.1074/jbc.M209033200. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamJjIjtzOjU6InJlc2lkIjtzOjEwOiIyNzgvNC8yNDYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDQvMDkvMjAyMy4xMi4xNS4yMzMwMDAwOC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 18. 18.Schmidt, A.F., Finan, C., Gordillo-Marañón, M., Asselbergs, F.W., Freitag, D.F., Patel, R.S., Tyl, B., Chopade, S., Faraway, R., Zwierzyna, M., et al. (2020). Genetic drug target validation using Mendelian randomisation. Nat Commun 11, 3255. doi:10.1038/s41467-020-16969-0. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-16969-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32591531&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F09%2F2023.12.15.23300008.atom) 19. 19.Chun, S., Casparino, A., Patsopoulos, N.A., Croteau-Chonka, D.C., Raby, B.A., De Jager, P.L., Sunyaev, S.R., and Cotsapas, C. (2017). Limited statistical evidence for shared genetic effects of eQTLs and autoimmune-disease-associated loci in three major immune-cell types. Nat Genet 49, 600–605. doi:10.1038/ng.3795. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3795&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F09%2F2023.12.15.23300008.atom) 20. 20.Wang, X., and Goldstein, D.B. (2020). Enhancer Domains Predict Gene Pathogenicity and Inform Gene Discovery in Complex Disease. Am J Hum Genet 106, 215–233. doi:10.1016/j.ajhg.2020.01.012. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2020.01.012&link_type=DOI) 21. 21.Mostafavi, H., Spence, J.P., Naqvi, S., and Pritchard, J.K. (2023). Systematic differences in discovery of genetic effects on gene expression and complex traits. Nat Genet 55, 1866–1875. doi:10.1038/s41588-023-01529-1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-023-01529-1&link_type=DOI) 22. 22.Ouwens, K.G., Jansen, R., Nivard, M.G., van Dongen, J., Frieser, M.J., Hottenga, J.-J., Arindrarto, W., Claringbould, A., van Iterson, M., Mei, H., et al. (2020). A characterization of cis- and trans-heritability of RNA-Seq-based gene expression. Eur J Hum Genet 28, PMID: 32591531 PMCID: PMC7320010. 3255. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-16969-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32591531&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F09%2F2023.12.15.23300008.atom) 23. 23.Rohde, D. (2022). Causal Inference, is just Inference: A beautifully simple idea that not everyone accepts. In I (Still) Can’t Believe It’s Not Better! Workshop at NeurIPS 2021 (PMLR), pp. 75–79. 24. 24.1. D. Corfield and 2. J. Williamson Dawid, A.P. (2001). Causal Inference without Counterfactuals. In Foundations of Bayesianism Applied Logic Series., D. Corfield and J. Williamson, eds. (Springer Netherlands), pp. 37–74. doi:10.1007/978-94-017-1586-7_3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-94-017-1586-7_3&link_type=DOI) 25. 25.Carvalho, C.M., Polson, N.G., and Scott, J.G. (2010). The horseshoe estimator for sparse signals. Biometrika 97, 465–480. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/biomet/asq017&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000280559700015&link_type=ISI) 26. 26.Burgess, S., Foley, C.N., Allara, E., Staley, J.R., and Howson, J.M.M. (2020). A robust and e?cient method for Mendelian randomization with hundreds of genetic variants. Nat Commun 11, 376. doi:10.1038/s41467-019-14156-4. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-019-14156-4&link_type=DOI) 27. 27.Xue, H., Shen, X., and Pan, W. (2021). Constrained maximum likelihood-based Mendelian randomization robust to both correlated and uncorrelated pleiotropic effects. Am J Hum Genet 108, 1251–1269. doi:10.1016/j.ajhg.2021.05.014. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2021.05.014&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34214446&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F09%2F2023.12.15.23300008.atom) 28. 28.Bowden, J., Davey Smith, G., and Burgess, S. (2015). Mendelian randomization with invalid instruments: Effect estimation and bias detection through Egger regression. Int J Epidemiol 44, 512–525. doi:10.1093/ije/dyv080. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyv080&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26050253&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F09%2F2023.12.15.23300008.atom) 29. 29.Bowden, J., Burgess, S., and Smith, G.D. (2017). Diffculties in Testing the Instrument Strength Independent of Direct Effect Assumption in Mendelian Randomization. JAMA Cardiology 2, 929–930. doi:10.1001/jamacardio.2017.1572. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jamacardio.2017.1572&link_type=DOI) [1]: /embed/inline-graphic-1.gif [2]: /embed/graphic-1.gif [3]: /embed/graphic-2.gif [4]: /embed/inline-graphic-2.gif [5]: /embed/graphic-4.gif [6]: /embed/graphic-5.gif [7]: /embed/graphic-6.gif [8]: /embed/inline-graphic-3.gif [9]: /embed/graphic-7.gif [10]: /embed/graphic-8.gif [11]: /embed/graphic-9.gif [12]: /embed/inline-graphic-4.gif [13]: /embed/inline-graphic-5.gif [14]: /embed/inline-graphic-6.gif [15]: /embed/inline-graphic-7.gif [16]: /embed/graphic-10.gif [17]: /embed/inline-graphic-8.gif [18]: /embed/inline-graphic-9.gif [19]: /embed/inline-graphic-10.gif [20]: /embed/inline-graphic-11.gif [21]: /embed/inline-graphic-12.gif [22]: /embed/inline-graphic-13.gif [23]: /embed/inline-graphic-14.gif [24]: /embed/inline-graphic-15.gif [25]: /embed/inline-graphic-16.gif [26]: /embed/inline-graphic-17.gif [27]: /embed/inline-graphic-18.gif [28]: /embed/graphic-11.gif [29]: /embed/inline-graphic-19.gif [30]: /embed/inline-graphic-20.gif [31]: /embed/inline-graphic-21.gif [32]: /embed/inline-graphic-22.gif [33]: /embed/graphic-12.gif [34]: /embed/inline-graphic-23.gif [35]: /embed/inline-graphic-24.gif [36]: /embed/inline-graphic-25.gif [37]: /embed/inline-graphic-26.gif [38]: /embed/inline-graphic-27.gif [39]: /embed/inline-graphic-28.gif [40]: /embed/inline-graphic-29.gif [41]: /embed/inline-graphic-30.gif [42]: /embed/graphic-13.gif [43]: /embed/inline-graphic-31.gif [44]: /embed/inline-graphic-32.gif [45]: /embed/graphic-14.gif [46]: /embed/inline-graphic-33.gif [47]: /embed/inline-graphic-34.gif [48]: /embed/inline-graphic-35.gif [49]: /embed/inline-graphic-36.gif [50]: /embed/graphic-15.gif [51]: /embed/inline-graphic-37.gif [52]: /embed/inline-graphic-38.gif [53]: /embed/inline-graphic-39.gif [54]: /embed/inline-graphic-40.gif