Revisiting Bias in Odds Ratios

Ivo M Foppa; Fredrick S Dahlgren

doi:10.1101/2021.02.28.21252604

Abstract

Ratio measures of effect, such as the odds ratio (OR), are consistent, but the presumption of their unbiasedness is founded on a false premise: The equality of the expected value of a ratio and the ratio of expected values. We show that the invalidity of this assumptions is an important source of empirical bias in ratio measures of effect, which is due to properties of the expectation of ratios of count random variables. We investigate ORs (unconfounded, no effect modification), proposing a correction that leads to “almost unbiased” estimates. We also explore ORs with covariates. We find substantial bias in OR estimates for smaller sample sizes, which can be corrected by the proposed method. Bias correction is more elusive for adjusted analyses. The notion of unbiasedness of OR for the effect of interest for smaller sample sizes is challenged.

Introduction

Ratio measures of effect are widely used in epidemiology. In particular for case-control studies of etiology or intervention effectiveness, the odds ratio (OR) is of great importance (Pearce, 1993). The true OR is defined as where p₁ and p₀ represent exposure prevalences in cases and controls, respectively. If exposure prevalence remains constant over the study period and subjects are enrolled by “incidence density sampling”, the OR represents the factor by which the “exposure” multiplies the incidence rate in the unexposed (Greenland and Thomas, 1982).

The consistent maximum likelihood (ML) estimator of ϕ (Gart, 1962) is where x₁ and x₀ are exposed and unexposed cases and y₁ and y₀ are exposed and unexposed controls, respectively. In the following discussion we use OR to refer to the ML estimator of the true OR ϕ.

The problem

Here, we investigate bias in the OR, where bias ∊ is

Assuming independence of x₁, x₀, y₁, y₀, 𝔼 (OR) can be written as but, as neither nor are defined because of zero denominators (Griffin, 1992), the whole expression (4) remains undefined. With the expectation undefined, bias (3) cannot be determined. If, in the context of an observational study, instances where there are either no unexposed cases (x₀) or exposed controls (y₁) will be discarded because no OR (2) can be computed. On average, the OR will therefore be better characterized by a situation where the variables in the denominator (x₀, y₁) are assigned truncated Poisson distributions; truncation here refers to restriction of the sample space of x to ℤ⁺. The truncated

Poisson distribution, denoted by Poi*(µ), has the following form (Griffin, 1992):

The expected value of a random variable distributed according to a truncated Poisson is given by

Letting and being distributed according to a truncated Poisson distributions parametrized by µ₀ and γ₁ respectively, a “truncated” OR arises:

The expectation of OR_t (7) is defined, but the expectations and do not have a closed-form expression. However, using Jensen’s inequality (Casella and Berger, 1990), we have

Therefore, indicating that the lower bound of 𝔼 (OR_t) is . However, as , it might happen that that expression neutralizes any biases.

“Almost unbiased” estimators of and

An “almost unbiased” estimator (Chapman, 1952) for ratios of Poisson parameters such as or , however, does exist. Chapman showed that , for w₀ = x₀ + 1 is “almost unbiased” for , as long as µ₀ is “not too small.” The same holds for , for z₁ = y₁ + 1 which is “almost unbiased” for .

The expectation of the ratio can be derived as follows:

The derivation of the expectation of the ratio follows (10). It is worth noting that the expectation of is therefore not simply , as one might naïvely expect with 𝔼(w₀) = µ₀ + 1, but a negatively biased expression that quickly converges to with increasing µ₀. The equivalent, of course, holds for the expectation of Using this,

The expected value of OR₊₁ is

Therefore, the expected value 𝔼(OR₊₁) is the lower bound of 𝔼(OR_t) (9). In contrast, Hauck et al. recommended to add 0.25 to each of the terms (x₁, x₀, y₁, y₀) of the ML estimator to calculate OR_+0.25 (Hauck, Anderson, and Leahy III, 1982).

A simulation study

Unconfounded odds ratio

We simulated 100,000 data sets from case-control studies. The number of unexposed cases were simulated to arise according to a Poisson distribution with parameter µ₀ ∈ (5, 10, 20, 50, 100, 1000) with a true incidence rate ratio ϕ = 2 and a control-case ratio (ratio of the expected number of controls to the expected number of cases) of 2, corresponding to expected sample sizes of 30, 60, 120, 300, 600 and 6000, respectively. ORs could not be computed for 834 and 6 datasets with µ₀ = 5 and µ₀ = 10, respectively, because of zero denominators. For µ₀ = 5, corresponding to an expected sample size of 30, the average OR was 3.15, while the corrected analysis, that could make use of all datasets OR₊₁ was minimally biased downward, with the MSE a little less than a quarter of the one of the uncorrected ORs (Table 1).

View this table:

Table 1:

Mean and mean square error (MSE) of OR, OR* (adding 1 to the number of unexposed cases and exposed controls) and OR *2 (adding 0.25 to the number of unexposed cases and exposed controls), respectively, as a function of µ₀. The true OR was ϕ = 2 and the control-to-case ratio was 2. For each value of µ₀ 100,000 simulations were run.

For µ₀ = 5, OR_+0.25 was even more strongly upwards biased than OR and for other values of µ₀ only marginally superior to OR, both in terms of bias and in terms of MSE. While both bias and MSE of OR and OR_+0.25 were always larger than for OR₊₁, the differences vanished with large µ₀ (Table 1). We did not consider OR_+0.25 any further.

The odds ratio adjusted by one confounder

To investigate the situation where the odds ratio is confounded by a binary covariate, which increases the risk of the outcome independently of the exposure of interest by 50% and which is moderately independently associated with the exposure of interest (confounder odds ratio of exposed vs. unexposed=1.2). We conducted logistic regression analyses, adjusting the analysis by the confounder. We analyzed the data in the native form and after applying one of three corrections:

Adding one to cases unexposed to the exposure of interest and adding one to exposed controls (correction #1);
Adding one to cases unexposed to either the exposure of interest or the confounder and adding one to controls exposed to either (correction #2);
Adding one to cases unexposed to both the exposure of interest and the confounder and adding one to controls exposed to neither (correction #3);

We investigated six levels of µ₀₀, which represents the mean number of cases unexposed to both the exposure of interest and the confounder (µ₀₀ ∈ (5, 10, 20, 50, 100, 1000)), corresponding to expected sample sizes of 92, 184, 368, 919, 1,838 and 18,375, respectively. The assumed control-to-case ratio was 2. For each setting we conducted 100,000 simulations and calculated mean and MSE for OR, and For µ₀₀ ∈ (5, 10) we also computed mean and MSE after excluding 72,382 and 5,762 datasets, respectively, for which the smallest stratum size was < 5. The uncorrected OR was substantially biased upwards for sample sizes under a thousand. was essentially unbiased (Table 2). In the restricted analysis for µ₀₀ = 5 the bias for OR was lower than in the unrestricted analysis, but it was the only case for which was substantially biased, downward about the same amount as OR was biased upwards. The other corrections always led to a downward bias and were clearly inferior to , with consistently lower MSEs.

View this table:

Table 2:

Mean and mean square error (MSE) of OR, OR*¹, OR*² and OR*³ (see text) for different values of µ₀₀ when one confounder is adjusted for (see text) by logistic regression analysis. The odds ratio is the exponentiated model coefficient corresponding to the exposure of interest. For µ₀₀ ∈ (5, 10), mean and MSE were also calculated after exclusion of datasets in which the smallest stratum size in cases or controls was < 5 were discarded. The true OR was ϕ = 2 and the control-to-case ratio was 2. For each value of µ₀₀ 100,000 simulations were run.

Discussion and conclusion

We examined bias in ratio measures of effect, in particular ORs. As the expected value of an OR is undefined, the bias is not defined either. This kind of problem for ratios of Poisson random variable is well known (Griffin, 1992)—an OR is a ratio of two such ratios. However, even though the bias is not defined for ORs, we can examine the empirical properties of ORs. In fact, ORs are consistent, but more than trivially “biased” (the quotes are owed to the fact that this is not bias in the strict sense), i.e. on average off the true value, even if sample sizes are “reasonable”. This phenomenon has been largely ignored in the epidemiologic literature. Even though these empirical biases are more pronounced for small sample sizes, they are unrelated to sample size problems of large-sample statistical methods. We have shown that empirical ratio measure biases can be improved by adding 1 to the denominators. In the absence of confounders and effect modifiers that adjustment (OR₊₁; see equation (11)) leads to an “almost unbiased” OR estimate. We also found that the correction proposed by Gart (Gart, 1962), adding 0.25 to each count used to calculate the OR, performs poorly.

For the situation where there is one additional covariate we were able to identify a data correction procedure that works well . Future research is needed to better characterize the problem for more complex multivariate situations.

In summary, we examined statistical properties of the expectation of ratios of count random variables as an important source of empirical bias in ORs. This challenges the notion that ORs are, under very general assumptions, “good” estimates for the effects of interest even if sample sizes are relatively small.

Data Availability

Not applicable

Acknowledgments

The authors do not have a conflict of interest.

References

↵
Casella, G. and R.L. Berger (1990). Statistical Inference. Duxbury advanced series. Brooks/Cole Publishing Company.
↵
Chapman, Douglas G (1952). “On tests and estimates for the ratio of Poisson means”. In: Annals of the Institute of Statistical Mathematics 4.1, pp. 45–49.
↵
Gart, John J (1962). “On the combination of relative risks”. In: Biometrics 18.4, pp. 601–610.
OpenUrl
↵
Greenland, Sander and Duncan C Thomas (1982). “On the need for the rare disease assumption in case-control studies”. In: American journal of epidemiology 116.3, pp. 547–553.
OpenUrl
↵
Griffin, Tralissa F (1992). “Distribution of the ratio of two poisson random variables”. MA thesis.
↵
Hauck, Walter W, Sharon Anderson, and Francis J Leahy III. (1982). “Finite-sample properties of some old and some new estimators of a common odds ratio from multiple 2 × 2 tables”. In: Journal of the American Statistical Association 77.377, pp. 145–152.
OpenUrl
↵
Pearce Neil (Dec. 1993). “What Does the Odds Ratio Estimate in a Case-Control Study?” In: International Journal of Epidemiology 22.6, pp. 1189–1192.
OpenUrl