Habitual Coffee Consumption Increases Risks for Metabolic Diseases: Genome-wide Association Studies and a Phenotype-wide Two Sample Mendelian Randomization Analysis

Jiuling Li; Tasnim Choudhury; Miaoran Zhang; Lanlan Chen; Jianping Wen; Wanqing Liu; Peng Chen

doi:10.1101/2021.03.08.21253114

Abstract

Background and aims Coffee is one of the most widely consumed beverages in the world and has received considerable concerns regarding its impact on human health. Mendelian randomization (MR) could be valuable to explore the potential health effects of coffee via instrumental variables. In this study, we aim to identify novel genetic loci associated with habitual coffee consumption using genome-wide meta-analysis (GWMA) and to evaluate the broad impact of coffee consumption on human health and disease risk via a large-scale, phenotype-wide, two sample Mendelian randomization (TSMR) analysis.

Methods We conducted a genome-wide association study (GWAS) among 283,926 coffee consumers of European ancestry in the UK Biobank (UKBB) to identify single nucleotide polymorphisms (SNPs) associated with the amount of coffee consumption (cups/day, GWAS 1), caffeine intake (GWAS 2) as well as the intake of non-caffeine substance in coffee (GWAS 3). The GWAS 1 results were further combined with the published results from the Coffee and Caffeine Genetics Consortium (CCGC) for a GWMA. TSMR were performed to evaluate the causal-relationship between coffee/caffeine/non-caffeine substance consumption and 1,101 diseases and health traits.

Results The GWMA identified 50 lead SNPs among 19 genomic regions for habitual coffee consumption. Nine out of the 19 loci were novel, including ADAMTSL4-AS1, CACNA2D2, LINC02123-ADCY2, UBD-SNORD32B, SEMA4D-GADD45G, LOC101929457-LINGO1, RAI1, HCN2,and BRWD1. The GWAS 2 and 3 identified 2 (SORCS2 and SLC39A8) and 5 (LINC02060-LINC00461, AGR3-AHR, PRR4-TAS2R14, CYP1A1-CYP1A2, and FTO) genomic regions, respectively. TSMR analysis indicated that coffee consumption increased the risk of high blood lipids, obesity, and diabetes. Meanwhile, intake of caffeine and non-caffeine coffee components decreased and increased some of the blood lipids levels, respectively.

Conclusions Our study provided evidence that habitual coffee consumption could increase the risk of metabolic perturbations. The bioactive components in coffee, other than caffeine, may be more harmful to human health. Our findings have significant implications for global public health given the increasing burden of metabolic diseases.

Introduction

Coffee is one of the most widely consumed beverages worldwide. The research community has been long debating whether coffee consumption is beneficial or harmful to health. Overall, observational studies favored the beneficial effect of coffee consumption in reducing the risk of metabolic syndrome, obesity, type 2 diabetes (T2D), cardiovascular disease, and several specific cancers^1-4. However, evidence from randomized controlled trials (RCT) have shed light on the detrimental effects of coffee consumption, such as elevated blood lipids, blood glucose level, and fasting insulin level^5-8.

The differences in the health outcomes of coffee consumption between the observational studies and RCT highlighted the necessity for further clarification. Mendelian randomization (MR) studies, which were considered to be advantageous to observational studies as they were similar to the RCT design but with comparable length of duration as those of the observational studies. They provided a potentially cost-effective strategy to examine the causal relationship between coffee consumption and health outcomes in human populations. A recent MR study found that coffee consumption increased the risk of osteoarthrosis and obesity⁹, while no significant effect on blood lipids or T2D was identified. Our recent MR analysis did not reveal a significant causal impact of coffee intake on nonalcoholic fatty liver disease (NAFLD)¹⁰. However, many of these MR studies on coffee consumption were limited in several aspects, with the limited power as a major issue, given the moderate effect of genetic alleles on coffee consumption habit. Therefore, the identification of more genetic susceptibility alleles underlying the coffee drinking habit among larger populations will increase the power for MR analysis. As such, the causal impact of coffee consumption on a broad range of health outcomes can be further elucidated.

In this study, we performed a genome-wide meta-analysis of coffee consumption among 375,388 individuals of European ancestry, which led to the identification of additional novel loci for habitual coffee consumption. Utilizing these expanded list of genetic risk alleles as instrument for coffee consumption, we further conducted two-sample MR (TSMR) analyses between coffee consumption and a large number of health outcomes previously studied among various GWAS. Furthermore, the GWASs of the consumption of caffeine and other coffee-containing, non-caffeine components were also conducted, respectively, and TSMR was also conducted. We observed a significantly detrimental causal impact of habitual coffee consumption and metabolic perturbations, which may be largely attributed to the non-caffeine components in coffee other than caffeine.

Methods

Study design and cohorts

The study was conducted using the UK Biobank data resources under application number 53536. As shown in Figure 1, we conducted a GWAS of coffee consumption (GWAS 1) in the UK Biobank cohort (UKBB) and a genome-wide meta-analysis by combining the published results from the Coffee and Caffeine Genetics Consortium (CCGC)¹¹. The UKBB cohort included over 500,000 adult individuals recruited from the UK population between 2006 and 2010. The extensive phenotypic and genotypic data were collected among all participants¹². The quality control on the genotype data followed the procedure recommended by the UKBB¹³. Our GWAS analysis was restricted to the coffee drinkers of European-ancestry. The participants who did not drink coffee were excluded. Finally, 283,926 participants were available for association testing. The genotypic data were further imputed based on the Haplotype Reference Consortium and the UK10K + 1000 genomes reference panel¹⁴. The CCGC study investigated two phenotypes, the quantitative coffee consumption (phenotype 1) and the comparison between high and low coffee consumers (phenotype 2). In the current study, phenotype 1, which included up to 91,462 coffee consumers of European ancestry, was used in the meta-analysis. This combination resulted in a total sample size of 375,388. Functional annotation of GWAS summary was performed using FUMA GWAS¹⁵. To investigate the causal effect of coffee consumption, we employed a two-sample MR (TSMR) analysis to screen a broad spectrum of phenotypes in MR-Base¹⁶. After the enrichment analysis, the significant causal effects were further examined using a one-sample MR (OSMR) analysis using the phenotype defined in UKBB. To distinguish the effects of caffeine and other non-caffeine substances consumption on human health, we carried out the caffeine GWAS (GWAS 2) and other substances GWAS (GWAS 3) using the individual-level data of UKBB. The two phenotypes were defined as below. TSMR was also conducted after FUMA annotation.

Figure 1.

A schematic diagram of the study design.

Phenotype definitions

Our GWAS 1 was focused on the amount of daily coffee consumption (cups/day). In the UKBB data, coffee consumption (cups/day) and coffee type were surveyed at baseline using the touchscreen questionnaire. The amount of coffee consumption was determined by the question ‘How many cups of coffee do you drink each day (include decaffeinated coffee)?’. If participants reported that they drink less than one cup of coffee per day, their cups per day were set as 0. Participants with very high coffee consumption (> 8 cups/day) were excluded. Notably, the first phenotype of the CCGC study used in the subsequent meta-analysis study was also the number of cups of predominantly regular-type coffee consumed per day among coffee consumers¹¹. Therefore, our new GWAS 1 is consistent with this previously conducted GWAS on coffee consumption. The coffee type was also coded and included as a covariate to be adjusted in the GWAS analysis. The coffee type was surveyed by the question “What type of coffee do you usually drink?” The options included decaffeinated coffee, instant coffee, ground coffee and others which were coded as 1, 2, 3, and 4, respectively.

GWAS 2 aimed to identify genetic variants associated with caffeine consumption, in which 283,926 consumers with intake of any type of coffee were included. The consumers of decaffeinated coffee were then coded as category 0, with the consumers of instant coffee, ground coffee, and other types of coffee coded as category 1. GWAS 3 aimed to identify genetic variants associated with other non-caffeine substances contained in coffee consumption, in which “non-coffee drinkers” were coded as 0, with the drinkers consuming decaffeinated coffee as category 1.

The genome-wide association study and meta-analysis

To identify genetic variants associated with the daily amount of coffee consumption (cups/day), caffeine consumption, and non-caffeine substance consumption, three GWAS were performed using mixed linear model adjusted for age, sex, body mass index, smoking status (never, previous, current), and the first 5 genetic principal components. Coffee type was adjusted for coffee consumption GWAS in addition. The genetic principal components were calculated from the linkage disequilibrium (LD) pruned (r² <0.1) array genotype data of the participants of European ancestry. The autosomal SNPs with minor allele frequency (MAF) > 0.01, imputation INFO score > 0.8, missing rate <0.05, and HWE-Pval >1×10⁻⁶ were used in the genome-wide association study and meta-analysis. The meta-analysis was performed by combining the GWAS 1 results with that of the CCGC phenotype 1 (cups/day) GWAS using a fixed-effects inverse-variance weighted model¹⁷.

Functional annotation of genome-wide association study and meta-analysis

We used the web-based tool FUMA GWAS to define genomic risk loci and obtained functional information of relevant SNPs in these loci¹⁵. First, lead SNPs were defined using a genome-wide significant P value (5×10⁻⁸) and LD r² <0.05. All SNPs with significant P value (5×10⁻⁸) in LD (r² ≥0.05) with one of the lead SNPs were candidate SNPs. Further, genomic risk loci were identified by merging LD blocks if they were less than 250kb apart.

Gene-mapping was based on two strategies. Firstly, positional mapping was performed by selecting exonic and splicing-site SNPs with CADD score ≥12.37¹⁸. Secondly, expression quantitative trait locus (eQTL) mapping was used to map SNPs to genes that show a significant eQTL association with these SNPs. The eQTL mapping was conducted using data generated in GTEx v8¹⁹, and only cis-eQTLs (SNPs within 1Mb of a gene of interest) were included. The Benjamini-Hochberg false discovery rate (FDR)²⁰ of 0.05 was used to define significant eQTL associations. Gene enrichment and tissue specificity expression analysis were conducted using FUMA¹⁵ and TSEA (http://genetics.wustl.edu/jdlab/tsea/), respectively. We used PhenoScanner to identify the pleiotropic effects of top lead SNPs^{21; 22}.

Mendelian randomization study

The TSMR was performed in an inverse variance weighted (IVW) approach using lead SNPs associated with exposure as instrumental variables (IVs). For coffee consumption as the exposure, we used all 50 lead SNPs reaching the genome-wide level significance as IVs. For both caffeine and non-caffeine coffee intake as exposures, given the small number of SNPs reaching the genome-wide significance level, we used SNPs with a p value <10⁻⁵ as IVs to reduce potential pleotropic effects. This results 38 and 83 IVs for caffeine and non-caffeine substance intake, respectively. For all candidate IVs, only when the p-value of the IVs in an outcome is at least greater than 0.001, can it be used to infer the causality between the exposure and outcomes. The outcomes were the phenotypes available in MR-Base (N=1,101). MR-Egger regression and Cochran’s Q test were used to detect the pleiotropic effect and the heterogeneity of the IVs. The causal effects estimated using the IVs with MR-egger intercept p value ≤0.05 or Cochran’s Q p value ≤0.05 were considered to be biased. The enrichment analysis of the significant causal effects (IVW p ≤0.05) in the categories defined by MR-Base was conducted using hypergeometric distribution test (https://systems.crump.ucla.edu/hypergeometric/index.php).

To reduce the false positive rate, the significant causal effects of coffee consumption (cups/day) on health outcomes were further validated using OSMR analysis using individual level UKBB data. In our OSMR study, the causal effect of coffee consumption on an outcome was estimated by the association between the coffee polygenic risk score (coffee-PRS) and the outcome. For each participant in the UKBB, the coffee PRS was calculated by adding together the allele dosages of the instrumental variables, weighted by their association effects with coffee consumption. The association with a dichotomous or continuous outcome was estimated using a logistic regression or linear regression model, respectively, without an adjustment. For random blood glucose, the linear model was adjusted for the self-report fasting time.

Statistical analysis

The linear mixed model was estimated using the Genome-wide Complex Trait Analysis (GCTA)²³. The genome-wide meta-analysis was performed using METAL¹⁷. The TSMR analysis was conducted using the TwoSampleMR package of R (version 0.4.25). The polygenic risk score (PRS) and the explanation of the coffee consumption by IVs (r²) were evaluated by PRSice-2²⁴. The F-statistic was calculated by the following formula to estimate the statistical power of lead SNPs: N was the sample size, and k was the number of IVs.

Linear regression and logistic regression were conducted using R software (version 4.0.2, https://www.r-project.org/).

Results

GWAS of coffee consumption (cups/day), caffeine consumption, and non-caffeine coffee consumption

Our GWAS 1 on coffee consumption involved 283,926 coffee consumers and 9,462,639 SNPs with MAF>0.01, imputation INFO score>0.8, and missing rate <0.05. The full details of the samples are provided in Supplementary Table 1. At this stage, we found 18 loci (Supplementary Table 2). After the meta-analysis by further combining the data of the CCGC GWAS, we were able to identify an additional significant locus (rs1571536, SEMA4D-GADD45G) (Figure 2A, Supplementary Table 3). Regional plots are available in the online resources (Supplementary Figure S1-2). Of the total of 19 identified loci, 6 loci including rs1260326 (GCKR), rs1481012 (ABCG2), rs4410790 (AGR3-AHR), rs799166 (MLXIPL-VPS37D), rs17685 (POR), and rs2472297 (CYP1A1-CYP1A2), were previously reported by CCGC¹¹. Four loci, rs2867110 (LOC105373352-TMEM18), rs476828 (PMAIP1-MC4R), rs56113850 (CYP2A6), and rs6512309 (PCMTD2), were identified by Zhong VW et al²⁵. The 9 newly identified loci were rs6655975 (ADAMTSL4-AS1), rs1467913 (CACNA2D2), rs12519880(LINC02123-ADCY2), rs1235162(UBD-SNORD32B), rs1571536(SEMA4D-GADD45G), rs2667773 (LOC101929457-LINGO1), rs11078398 (RAI1), rs113534512 (HCN2), and rs3945 (BRWD1). Among which, rs6655975 (ADAMTSL4-AS1), rs1571536 (SEMA4D-GADD45G), rs2667773 (LOC101929457-LINGO1), and rs3945 (BRWD1) were nominally significant in the CCGC GWAS study at a significant level of 0.05 with the same direction for the associations (Table 1). For the locus 9 (MLXIPL-VPS37D), rs7800944 was identified as an index SNP in CCGC. In our results, the lead SNP rs799166 is in LD (r² =0.36, among European Caucasian population) with rs7800944, and was predicted to be located in a SMAD2 binding site in JASPAR²⁶. For the loci 13 and 17, the lead SNPs (rs2667773 and rs56113850) were not present in the CCGC GWAS findings. However, their LD proxy SNPs (rs2667768 and rs1496402) (r² >0.6 with the aforementioned lead SNPs among Caucasian population) were observed as the corresponding lead SNPs, respectively. Therefore, of the 50 lead SNPs, 35 were nominally validated in the previous CCGC study (p<0.05), while 15 were newly observed as lead SNPs only in the current meta-analysis (Supplementary Table 4).

Figure 2.

The Manhattan plot displays the genome-wide associations between SNPs and coffee consumption (A), caffeine consumption (B), and non-caffeine substances consumption (C). The x-axis represents genomic position of variants. The y-axis shows the strength of the associations (–log₁₀ P). The dash line indicates the genome wide significance level of p=5e-8

View this table:

Table 1. The Top SNPs significantly associated with coffee consumption, caffeine consumption, and non-caffeine substances consumption, respectively.

The participants and SNPs used in the GWAS 2 were the same as the GWAS 1. At this stage, we found rs112764911 (SORCS2) and rs13107325 (SLC39A8) to be associated with caffeine consumption at the genome-wide level (Table 1, Figure 2B). Regional plots are available in the online resources (Supplementary Figure S3-4).

Our GWAS 3 of non-caffeine substances consumption involved 137,371 participants and 9,462,277 SNPs with MAF>0.01, imputation INFO score>0.8, and missing rate <0.05. We identified 5 lead SNPs associated with non-caffeine substances consumption, including rs2067919 (LINC02060), rs4410790 (AHR), rs1201669374 (PRR4), rs2472297 (CYP1A1), and rs11642015 (FTO) (Table 1, Figure 2C). Regional plots are available in the online resources (Supplementary Figure S5-6).

Functional interpretation and pleiotropic effect of genetic variants

We examined the potential causal variants within the identified SNPs (n=2,597) associated with coffee consumption, SNPs (n=14) associated with caffeine consumption, and SNPs (n=268) associated with other non-caffeine substances consumption, we found that the majority of these SNPs are located in intergenic and intronic areas (Supplementary Figure S7-9). Ninety-five SNPs (Supplementary Table 5) had likely deleterious impacts (CADD score >12.37) on gene functions ¹⁸. Six nonsynonymous among 95 SNPs located at gene exon region, including rs79217743 (LMAN1L), rs2231142 (ABCG2), rs35332062 (MLXIPL), rs6720 (MDH2), rs113534512 (HCN2), and rs1057868 (POR).

We also examined whether the identified SNPs are also eQTLs for nearby genes. The results were included in Supplementary Table 6. We found that 1941, 4 and 219 SNPs that are associated with the three phenotypes of GWAS 1, 2 and 3 are also significant eQTLs (FDR<0.05) for 180 genes in at least one tissue, respectively. For example, rs56113850-C associated with coffee consumption, which is also significantly associated with increased expression of both CYP1A6 and CYP1A7 (Supplementary Table 7). Novel risk alleles for coffee consumption, rs6655975-A, rs1571536-C, rs2667773-A, and rs3945-G, are associated with increased expression of ADAMTSL4-AS1, GADD45G, LINGO1, and BRWD1, respectively (Supplementary Table 7). Rs12898397-C, a missense variant with deleterious impact (CADD score=24.2) on ULK3, is associated with increased coffee consumption while a decreased expression of ULK3 in multiple tissues (Supplementary Table 7). Rs11642015-T, associated with non-caffeine substances consumption, is associated with increased expression of FTO.

The gene enrichment analysis showed that these 180 GWAS SNP-associated genes were mainly involved in small molecule metabolic process, xenobiotic metabolic process, oxygenase p450 pathway, and generation of precursor metabolites and energy (Figure 3). Enrichment for tissue-specific expression of these genes showed that they were significantly overrepresented in heart and liver (FDR <0.1) (Figure 4).

Figure 3.

Pathway enrichment of the 180 genes associated with GWAS identified SNPs. The enrichment analysis was performed using GENE2FUNC in FUMA. The top 10 significantly enrichments (adjusted P <0.05) were available in the plot.

Figure 4.

Expression enrichment analyses of the 180 genes associated with GWAS-identified SNPs. The tissue specific gene expression enrichment was analyzed using TSEA. Genes were significantly enriched in the liver and heart tissue (FDR <0.1).

Some of the lead SNPs alleles associated with the amount of coffee consumption were also associated with other traits in previously published GWA studies (Supplementary Tables 8). For instance, the alleles of rs2867110-G and rs476828-C were associated with increased BMI²⁷; rs1260326-C, rs3792253-C and rs799166-G were associated with lowered triglycerides²⁸; rs1467913-C,rs351237-G, rs12902040-T,rs9783698-G,and rs12914012-T were associated with lowered height^{29; 30}. In addition, several alleles associated with higher coffee consumption were associated with decreased impedance of body (e.g. rs11078398-G), lowered creatinine in the urine (e.g. rs2472297-T), and increased age at menarche (e.g. rs3945-G)³⁰. The lead SNP rs13107325-C associated with increased caffeine consumption was also associated with decreased body mass index²⁷. The lead SNPs rs2067919-C and rs11642015-T associated with increased non-caffeine substances consumption were also associated with decreased alcohol intake frequency and increased risk of diabetes, respectively^{30; 31}.

Causal relationship between coffee consumption and the health consequences: TSMR

We used the 50 lead SNPs provided by our meta-analysis (Supplementary Table 4) as the IVs in our TSMR analysis. TSMR analyses involved 1,101 phenotypes in MR-Base as outcomes. For each outcome, only when the p-value of the IVs in the outcome is at least greater than 0.001, can it be used to infer the causality between coffee consumption and outcomes.

As shown in table 2, briefly we found significant causal relationships between coffee consumption and increased serum total cholesterol (id=933, beta=0.133 SD, p=1.27×10⁻³, FDR=1.86×10⁻²), serum total triglycerides (id=934, beta=0.154 SD, p=3.75×10⁻⁴, FDR=9.53×10⁻³), total cholesterol in LDL (id=895, beta=0.191 SD, p=4.49×10⁻⁶, FDR=9.34×10⁻⁴), apolipoprotein B (id=843, beta=0.233 SD, p=4.38×10⁻⁸, FDR=4.54×10⁻⁵), but decreased total cholesterol in HDL (id=864, beta=−0.128 SD, p=8.76×10⁻⁴, FDR=1.45×10⁻²). Furthermore, coffee consumption increased the risk of “less severe obesity”, including overweight (id=93, beta=0.124 log odds, p=8.87×10⁻³, FDR=7.75×10⁻²) and obesity class 1 (id=90, beta=0.195 log odds, p=4.45×10⁻³, FDR=4.43×10⁻²), but was not associated with obesity class 2 or 3. At the same time, waist circumference (id=61, beta=0.06 cm, p=9.21×10⁻⁴, FDR=1.45×10⁻²), waist-to-hip ratio (id=74, beta=0.09 SD, p=2.1×10⁻⁴, FDR>0.05), and body mass index (id=785, beta=−0.08 kg/m^2, p=7.19×10⁻⁴, FDR=1.29×10⁻²) were also increased with coffee consumption. Besides, coffee consumption was shown to increase the risk of type 2 diabetes (id=25, beta=1.15 log odds, p=3.18×10⁻³, FDR=3.45×10⁻²). In addition, the area under the curve (AUC) of insulin levels (id=760, AUCIns, beta=−0.250 mU*min/L, p=2.48×10⁻², FDR>0.05), and corrected insulin response (id=761, CIR, beta=−0.206 SD, p= 3.21×10⁻², FDR>0.05) during an OGTT were all decreased by coffee consumption. Other traits observed in our TSMR analysis can be found in Supplementary Table 9. Notably, coffee consumption showed differential risk on two types of ovarian cancer, with an increased risk for endometrioid ovarian cancer (id=1125, beta=0.349 log odds, p=1.84×10⁻³, FDR=2.37×10⁻²) but a decreased risk of low grade and low malignant serous ovarian cancer (id=1229, beta=−0.331 log odds, p=1.05×10⁻², FDR>0.05).

View this table:

Table 2. The causal associations between coffee consumption and human health outcomes based on the TSMR analysis.

To reveal the diseases categories or traits which are mostly affected by coffee consumption, we conducted an enrichment analysis of the significant causal effects. The results showed that the health outcomes causally driven by the coffee consumption were significantly enriched in several MR-base categories, i.e. blood lipids, fatty acids or amino acids in plasma/serum, and anthropometric measurements (Figure 5).

Figure 5.

The enrichment analysis of the outcomes causally associated with the genetically driven coffee consumption among the phenotypic categories of various traits defined in MR-Base. For each category, the −log₁₀ of the enrichment p value was indicated on the left, while the enriched fold change was indicated on the right. The phenotype categories with FDR ≤0.05 are highlighted in dark red.

Validation of the associations between coffee consumption and health outcomes: OSMR

We used the IVs actually used in each outcome in the TSMR to construct the PRS score separately and inferred the relationship with the corresponding outcomes.

As shown in table 3, OSMR analyses were performed to further validate the significant findings in TSMR. In general, our results showed a consistent association between coffee consumption and similar metabolic traits as noted above. Briefly, the coffee-PRS was positively associated with cholesterol (p=1.65×10⁻⁴), LDL (p=7.76×10⁻⁶), apolipoprotein B (p=1.24×10⁻⁵), but negatively associated with HDL (6.56×10⁻³). Furthermore, coffee-PRS was positively associated with waist circumference (p=4.08×10⁻¹⁰), hip circumference (p=7.56×10⁻⁹), body mass index (p=3.18×10⁻¹⁵) and weight (p=2.66×10⁻⁸). Coffee-PRS was also positively associated with glycated haemoglobin (HbA1c, p= 3.28×10⁻⁶) and diabetes (p=2.00×10⁻⁴). Lastly, coffee-PRS also showed a trend of negative association with ovarian cancer (p=4.80×10⁻²).

View this table:

Table 3. The causal associations between coffee consumption and human health in UKBB based on OSMR analyses.

Causal relationship between the consumption of caffeine and non-caffeine components in coffee and metabolic traits

We set out to examine which components in coffee may lead to the potential detrimental effects of coffee consumption on metabolic perturbations. We conducted TSMR using 38 SNPs (Supplementary Table 10) from GWAS 2 and 83 SNPs (Supplementary Table 11) from GWAS 3 with p<10⁻⁵ as a liberal cut-off for selecting the IVs, to examine the effects of caffeine or other non-caffeine components in coffee on coffee-consumption associated metabolic traits identified in the aforementioned TSMR analysis. We found that caffeine exposure was negatively associated with concentration of chylomicrons and largest VLDL particles (id=958, beta= −1.128 SD, p=2.76×10⁻²), and concentration of medium VLDL particles (id=913, beta=−0.976 SD, p=3.18×10⁻²). While the intake of other components in coffee increased the total cholesterol in LDL (id=895, beta=0.320, p=3.50×10⁻²) (Supplementary Table 12).

Discussion

We performed the largest-to-date GWA and meta-analysis on coffee consumption. We further for the first time conducted GWA studies on caffeine intake and decaffeinated coffee intake. Our analyses identified novel loci associated with each of the three phenotypes, which provide new insights into the genetic basis underlying the coffee consumption behavior among human populations. Moreover, by leveraging these genetic findings, we performed large-scale MR analyses to assess the causal relationship between different coffee intake behavior and the health outcomes. Our study indicated that, unlike what have found in many observational studies^{1-4; 32}, coffee consumption may causally lead to increased risks for metabolic diseases, and the more coffee consumed, the worse the outcomes. This finding is consistent with several studies based on RCTs^5-9, having important implications for public health.

In our GWA studies, we confirmed all 10 known¹¹ and newly identified 9 genetic loci associated with coffee consumption among a large, combined European population. A recent GWAS also using UKBB samples identified fewer loci associated with coffee consumption²⁵. The discrepancy between this GWAS²⁵ and our coffee consumption GWAS 1 may be attributed to the sample selection and inclusion of different covariates. The former study included both coffee and non-coffee drinkers in UKBB European population²⁵, while our GWAS 1 was restricted to the coffee drinkers of European-ancestry.

Furthermore, statistical models of the former adjusted for age, sex, BMI and top 20 principal components²⁵, while our statistical models additionally adjusted for coffee type and smoking. These settings allow us to combine the CCGC GWAS study to perform the meta-analysis. In addition, we also identify 2 and 5 novel loci significantly associated with caffeine and non-caffeine coffee components. The enrichment analysis of the genes whose transcription is associated with these SNPs revealed pathways related to small molecule metabolic process, xenobiotic metabolic process, oxygenase p450 pathway, etc., highlighted that genetic variants altering the metabolism of caffeine and related active xenobiotic compounds in coffee are likely the major determinants for coffee consumption behaviors.

This is consistent with the previous identification¹¹. Meanwhile, these GWAS SNP-regulated genes are enriched in the liver and heart tissue, with also a significant enrichment related to the generation of precursor metabolites and energy (Figure 4), suggesting a potentially overlap between the coffee consumption and energy metabolism. In addition, our genome-wide meta-analysis of coffee consumption identified candidate SNPs that may have deleterious impacts (CADD score >12.37) on gene functions. In particular, rs12898397-C (CADD score =24.2), a missense variant with deleterious impact on ULK3 (Unc-51 like kinase 3), was associated with increased coffee consumption but decreased expression of ULK3. ULK3 is a serine/threonine protein kinase that acts as a regulator of sonic hedgehog (SHH) signaling and autophagy. ULK3 low expression may induce the dysregulation of autophagy, which participates in controlling the metabolic functions of liver via multiple ways³³.

We performed GWA studies on caffeine consumption and non-caffeine substances consumption, which have never been investigated at the genome-wide level. We found that rs112764911 (SORCS2) and rs13107325 (SLC39A8) were associated with caffeine consumption, and the latter may be also associated with coffee consumption (p=2.48×10⁻⁴). Interestingly both genes were identified among a number of GWA studies to be associated with multiple phenotypes especially neuropsychiatric diseases and traits, e.g. SNPs in SORCS2 are highly expressed in the brain tissue, and previously was associated with attention function in attention deficit hyperactive disorder³⁴, general risk tolerance and risk behavior³⁵, alcohol withdrawal symptom³⁶, depressive and manic episodes in bipolar disorder³⁷, etc.; while SNPs in SLC39A8 were also associated with schizophrenia³⁸, bipolar disorder³⁹, and intelligence⁴⁰, etc., indicating the impact of caffeine on central nerve system and the potential connection between caffeine intake and neuropsychiatric reactions. In addition, we identified 5 loci (LINC02060, AHR, PRR4, CYP1A1, and FTO) associated with drinking of decaffeinated coffee. Two of them (AHR and CYP1A1) are also associated with coffee consumption in GWAS. AHR is known to be activated by many xenobiotic compounds, e.g. polycyclic aromatic hydrocarbons (PAHs) in coffee⁴¹. AHR response elements reside in the bidirectional promoter region located at chromosome 15q24, which associated with transcriptional activation of both CYP1A1 and CYP1A2^42-44. While CYP1A1 plays an important role in metabolizing and the detoxification of PAHs, CYP1A2 directly metabolizes caffeine⁴⁵, which may explain the overlapping identification between the two GWA studies. In addition, the FTO gene is known to be associated with BMI and involved in energy metabolism, further indicated the close connection between coffee intake and energy intake/homeostasis. Our study warrants continued investigations for the detailed mechanism underlying how these genes determining the caffeine or decaffeinated coffee intake.

In order to investigate the potential impact of regular consumption of coffee, caffeine or non-caffeine coffee constituents on human health, we performed MR analyses and found that coffee consumption may causally lead to altered risks for multiple clinical outcomes which are enriched in metabolic perturbations, especially increased risks for dyslipidemia, obesity, and diabetes. Meanwhile, non-caffeine substances consumption increased the risks for high blood lipids, while caffeine consumption decreased the risks for high blood lipids. The impact of coffee on human health has been long speculated to be attributed to the various bioactive components contained in coffee, such as caffeine, chlorogenic acid, diterpenoids, PAHs, etc. In general, due to the bitter taste of coffee, coffee consumption is more likely to be associated with increased intake of sugary, thereby increase the risk of diabetes and obesity^{46; 47}. Meanwhile, after consumers switch from caffeinated coffee to decaffeinated coffee, LDL cholesterol and apolipoprotein B increase, suggesting that other coffee components other than caffeine may be responsible for the high blood lipids⁴⁸. Consistent with our findings, caffeine has been demonstrated to have a beneficial impact on lipid metabolism, which reduces intrahepatic lipid content and stimulates β-oxidation in hepatic cells and liver via regulating the autophagy-lysosomal pathway signaling⁴⁹. While, diterpenoids contained in coffee may be an important factor leading to the increase of blood lipid level, and its impact on increasing blood lipid level may be related to its impact on the activity of serum lipid transporters⁵⁰. A study tested the effect of cafestol, a diterpenoid in coffee, by giving to 10 healthy male volunteers for 28 days. Relative to baseline values, cafestol raised the activity of cholesterylester transfer protein by 18 +/−12% and of phospholipid transfer protein by 21 +/−14% (both P < 0.001), which may be associated with elevated serum VLDL and LDL cholesterol⁵⁰.

Furthermore, filtered coffee containing a relative lower amount of diterpenoids does not increase blood lipid levels^{51; 52}. In addition, an observational study showed that C-peptide, a marker of insulin secretion, decreased with every additional cup of decaffeinated coffee (0.063 ng/ml; P = 0.0003), which indicated the potential function of non-caffeine substances for diabetic risk⁵³. Taken together, our data suggest that the detrimental effects of coffee consumption on health outcomes may be due to the intake of non-caffeine components. While even though caffeine may exert some beneficial impact on lipids homeostasis, this impact may not be sufficient to compensate the deleterious effects of the non-caffeine components in the coffee. The detailed mechanism underlying the association between coffee consumption and metabolic perturbations remains to be further investigated.

It is noteworthy that our MR analyses also suggested a potential causal relationship between coffee consumption and ovarian cancer, with an opposing risk for different subtypes of the disease. While the detailed mechanism underlying this association remains unclear. Also, caffeine has also been demonstrated to decrease estrogen but increase progesterone, which has been weakly associated with ovarian cancer risk^{54; 55}. More studies are needed to further clarify this relationship.

Limitations

The current study was conducted in the participants of European ancestry. Whether our findings can be generalized to other ethnic groups remains to be validated in future studies. Moreover, although the study has identified 19 genomic loci (9 of which are novel loci) associated with coffee consumption, they only explain a small proportion of the phenotype variance, indicating that our MR results were driven by a limited proportion of genetic susceptibility. Our conclusion may not represent the full image of the health impact of coffee consumption. Meanwhile, the findings of GWAS 2 and 3 studies need further independent validations as well.

Conclusions

This study identified novel genetic loci associated with multiple coffee consumption behaviors and provided evidence that coffee consumption increases the risk for metabolic diseases. This could have significant implications for global public health given the increasing burden of metabolic diseases.

Data Availability

The study was conducted using the UK Biobank data resources under application number 53536.The coffee consumption GWAS for genome-wide meta-analysis was provided by the Coffee and Caffeine Genetics Consortium.A large number of phenotype previously studied among various GWAS were provided by MR-Base.

https://digitalhub.northwestern.edu/

https://www.mrbase.org

Author contributions

J.L, W.L and P.C conceived the study. P.C contributed to the acquisition of UKBB data. J.L, M.Z and L.C analyzed the data. J.L, W.L, J.W and P.C interpreted the results. J.L, T.C and W.L wrote the first draft of the manuscript. All authors revised the manuscript and approved the submission.

Acknowledgements

This work was supported by the “Changbai Mountain Scholar” Distinguished Professor Awarding Program of the Department of Education of Jilin Province, China. This work was supported in part by the Start-Up Fund (W.L) of the Wayne State University.

Footnotes

Figure 1 revised.

References

1.↵
Ding, M., Bhupathiraju, S.N., Satija, A., van Dam, R.M., and Hu, F.B. (2014). Long-term coffee consumption and risk of cardiovascular disease: a systematic review and a dose-response meta-analysis of prospective cohort studies. Circulation 129, 643–659.
OpenUrl Abstract/FREE Full Text
2.
Ding, M., Bhupathiraju, S.N., Chen, M., van Dam, R.M., and Hu, F.B. (2014). Caffeinated and decaffeinated coffee consumption and risk of type 2 diabetes: a systematic review and a dose-response meta-analysis. Diabetes care 37, 569–586.
OpenUrl Abstract/FREE Full Text
3.
Nordestgaard, A.T., Thomsen, M., and Nordestgaard, B.G. (2015). Coffee intake and risk of obesity, metabolic syndrome and type 2 diabetes: a Mendelian randomization study. International journal of epidemiology 44, 551–565.
OpenUrl CrossRef PubMed
4.↵
Poole, R., Kennedy, O.J., Roderick, P., Fallowfield, J.A., Hayes, P.C., and Parkes, J. (2017). Coffee consumption and health: umbrella review of meta-analyses of multiple health outcomes. BMJ 359, j5024.
OpenUrl Abstract/FREE Full Text
5.↵
Lane, J.D., Feinglos, M.N., and Surwit, R.S. (2008). Caffeine increases ambulatory glucose and postprandial responses in coffee drinkers with type 2 diabetes. Diabetes care 31, 221–222.
OpenUrl FREE Full Text
6.
Greenberg, J.A., Owen, D.R., and Geliebter, A. (2010). Decaffeinated coffee and glucose metabolism in young men. Diabetes care 33, 278–280.
OpenUrl Abstract/FREE Full Text
7.
Corrêa, T.A., Rogero, M.M., Mioto, B.M., Tarasoutchi, D., Tuda, V.L., César, L.A., and Torres, E.A. (2013). Paper-filtered coffee increases cholesterol and inflammation biomarkers independent of roasting degree: a clinical trial. Nutrition (Burbank, Los Angeles County, Calif) 29, 977–981.
OpenUrl CrossRef PubMed
8.↵
van Dam, R.M., Pasman, W.J., and Verhoef, P. (2004). Effects of coffee consumption on fasting blood glucose and insulin concentrations: randomized controlled trials in healthy volunteers. Diabetes care 27, 2990–2992.
OpenUrl FREE Full Text
9.↵
Nicolopoulos, K., Mulugeta, A., Zhou, A., and Hyppönen, E. (2020). Association between habitual coffee consumption and multiple disease outcomes: A Mendelian randomisation phenome-wide association study in the UK Biobank. Clin Nutr.
10.↵
Zhang, Y., Liu, Z., Choudhury, T., Cornelis, M.C., and Liu, W. (2020). Habitual coffee intake and risk for nonalcoholic fatty liver disease: a two-sample Mendelian randomization study. European journal of nutrition.
11.↵
Cornelis, M.C., Byrne, E.M., Esko, T., Nalls, M.A., Ganna, A., Paynter, N., Monda, K.L., Amin, N., Fischer, K., Renstrom, F., et al. (2015). Genome-wide meta-analysis identifies six novel loci associated with habitual coffee consumption. Molecular psychiatry 20, 647–656.
OpenUrl CrossRef PubMed
12.↵
Sudlow, C., Gallacher, J., Allen, N., Beral, V., Burton, P., Danesh, J., Downey, P., Elliott, P., Green, J., Landray, M., et al. (2015). UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med 12, e1001779.
OpenUrl CrossRef PubMed
13.↵
Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L.T., Sharp, K., Motyer, A., Vukcevic, D., Delaneau, O., O’Connell, J., et al. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209.
OpenUrl CrossRef PubMed
14.↵
McCarthy, S., Das, S., Kretzschmar, W., Delaneau, O., Wood, A.R., Teumer, A., Kang, H.M., Fuchsberger, C., Danecek, P., Sharp, K., et al. (2016). A reference panel of 64,976 haplotypes for genotype imputation. Nature genetics 48, 1279–1283.
OpenUrl CrossRef PubMed
15.↵
Watanabe, K., Taskesen, E., van Bochoven, A., and Posthuma, D. (2017). Functional mapping and annotation of genetic associations with FUMA. Nature communications 8, 1826.
OpenUrl
16.↵
Hemani, G., Zheng, J., Elsworth, B., Wade, K.H., Haberland, V., Baird, D., Laurin, C., Burgess, S., Bowden, J., Langdon, R., et al. (2018). The MR-Base platform supports systematic causal inference across the human phenome. eLife 7.
17.↵
Willer, C.J., Li, Y., and Abecasis, G.R. (2010). METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics (Oxford, England) 26, 2190–2191.
OpenUrl CrossRef PubMed Web of Science
18.↵
Kircher, M., Witten, D.M., Jain, P., O’Roak, B.J., Cooper, G.M., and Shendure, J. (2014). A general framework for estimating the relative pathogenicity of human genetic variants. Nature genetics 46, 310–315.
OpenUrl CrossRef PubMed
19.↵
(2015). Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science (New York, NY) 348, 648–660.
OpenUrl
20.↵
Benjamini, Y., and Hochberg, Y. (1995). Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. 57, 289–300.
OpenUrl
21.↵
Staley, J.R., Blackshaw, J., Kamat, M.A., Ellis, S., Surendran, P., Sun, B.B., Paul, D.S., Freitag, D., Burgess, S., Danesh, J., et al. (2016). PhenoScanner: a database of human genotype-phenotype associations. Bioinformatics (Oxford, England) 32, 3207–3209.
OpenUrl CrossRef PubMed
22.↵
Kamat, M.A., Blackshaw, J.A., Young, R., Surendran, P., Burgess, S., Danesh, J., Butterworth, A.S., and Staley, J.R. (2019). PhenoScanner V2: an expanded tool for searching human genotype-phenotype associations. Bioinformatics (Oxford, England) 35, 4851–4853.
OpenUrl PubMed
23.↵
Yang, J., Lee, S.H., Goddard, M.E., and Visscher, P.M. (2011). GCTA: a tool for genome-wide complex trait analysis. American journal of human genetics 88, 76–82.
OpenUrl CrossRef PubMed
24.↵
Choi, S.W., and O’Reilly, P.F. (2019). PRSice-2: Polygenic Risk Score software for biobank-scale data. GigaScience 8.
25.↵
Zhong, V.W., Kuang, A., Danning, R.D., Kraft, P., van Dam, R.M., Chasman, D.I., and Cornelis, M.C. (2019). A genome-wide association study of bitter and sweet beverage consumption. Human molecular genetics 28, 2449–2457.
OpenUrl
26.↵
In. (
27.↵
Locke, A.E., Kahali, B., Berndt, S.I., Justice, A.E., Pers, T.H., Day, F.R., Powell, C., Vedantam, S., Buchkovich, M.L., Yang, J., et al. (2015). Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206.
OpenUrl CrossRef PubMed
28.↵
Teslovich, T.M., Musunuru, K., Smith, A.V., Edmondson, A.C., Stylianou, I.M., Koseki, M., Pirruccello, J.P., Ripatti, S., Chasman, D.I., Willer, C.J., et al. (2010). Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466, 707–713.
OpenUrl CrossRef PubMed Web of Science
29.↵
Wood, A.R., Esko, T., Yang, J., Vedantam, S., Pers, T.H., Gustafsson, S., Chu, A.Y., Estrada, K., Luan, J., Kutalik, Z., et al. (2014). Defining the role of common variation in the genomic and biological architecture of adult human height. Nature genetics 46, 1173–1186.
OpenUrl CrossRef PubMed
30.↵
Neale, B.M. UK Biobank GWAS. In. (http://www.nealelab.is/uk-biobank.
31.↵
Gaulton, K.J., Ferreira, T., Lee, Y., Raimondo, A., Mägi, R., Reschen, M.E., Mahajan, A., Locke, A., Rayner, N.W., Robertson, N., et al. (2015). Genetic fine mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility loci. Nature genetics 47, 1415–1425.
OpenUrl CrossRef PubMed
32.↵
Kim, Y., Je, Y., and Giovannucci, E. (2019). Coffee consumption and all-cause and cause-specific mortality: a meta-analysis by potential modifiers. European journal of epidemiology 34, 731–752.
OpenUrl
33.↵
Madrigal-Matute, J., and Cuervo, A.M. (2016). Regulation of Liver Metabolism by Autophagy. Gastroenterology 150, 328–339.
OpenUrl
34.↵
Alemany, S., Ribases, M., Vilor-Tejedor, N., Bustamante, M., Sanchez-Mora, C., Bosch, R., Richarte, V., Cormand, B., Casas, M., Ramos-Quiroga, J.A., et al. (2015). New suggestive genetic loci and biological pathways for attention function in adult attention-deficit/hyperactivity disorder. Am J Med Genet B Neuropsychiatr Genet 168, 459–470.
OpenUrl
35.↵
Karlsson Linner, R., Biroli, P., Kong, E., Meddens, S.F.W., Wedow, R., Fontana, M.A., Lebreton, M., Tino, S.P., Abdellaoui, A., Hammerschlag, A.R., et al. (2019). Genome-wide association analyses of risk tolerance and risky behaviors in over 1 million individuals identify hundreds of loci and shared genetic influences. Nat Genet 51, 245–257.
OpenUrl
36.↵
Smith, A.H., Ovesen, P.L., Skeldal, S., Yeo, S., Jensen, K.P., Olsen, D., Diazgranados, N., Zhao, H., Farrer, L.A., Goldman, D., et al. (2018). Risk Locus Identification Ties Alcohol Withdrawal Symptoms to SORCS2. Alcohol Clin Exp Res 42, 2337–2348.
OpenUrl
37.↵
Fabbri, C., and Serretti, A. (2016). Genetics of long-term treatment outcome in bipolar disorder. Prog Neuropsychopharmacol Biol Psychiatry 65, 17–24.
OpenUrl
38.↵
Goes, F.S., McGrath, J., Avramopoulos, D., Wolyniec, P., Pirooznia, M., Ruczinski, I., Nestadt, G., Kenny, E.E., Vacic, V., Peters, I., et al. (2015). Genome-wide association study of schizophrenia in Ashkenazi Jews. American journal of medical genetics Part B, Neuropsychiatric genetics: the official publication of the International Society of Psychiatric Genetics 168, 649–659.
OpenUrl
39.↵
Wu, Y., Cao, H., Baranova, A., Huang, H., Li, S., Cai, L., Rao, S., Dai, M., Xie, M., Dou, Y., et al. (2020). Multi-trait analysis for genome-wide association study of five psychiatric disorders. Translational psychiatry 10, 209.
OpenUrl
40.↵
Savage, J.E., Jansen, P.R., Stringer, S., Watanabe, K., Bryois, J., de Leeuw, C.A., Nagel, M., Awasthi, S., Barr, P.B., Coleman, J.R.I., et al. (2018). Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence. Nature genetics 50, 912–919.
OpenUrl CrossRef PubMed
41.↵
Ishikawa, T., Takahashi, S., Morita, K., Okinaga, H., and Teramoto, T. (2014). Induction of AhR-mediated gene transcription by coffee. PloS one 9, e102152.
OpenUrl
42.↵
Jorge-Nebert, L.F., Jiang, Z., Chakraborty, R., Watson, J., Jin, L., McGarvey, S.T., Deka, R., and Nebert, D.W. (2010). Analysis of human CYP1A1 and CYP1A2 genes and their shared bidirectional promoter in eight world populations. 31, 27–40.
OpenUrl
43.
Jorge-Nebert, L.F., Jiang, Z., Chakraborty, R., Watson, J., Jin, L., McGarvey, S.T., Deka, R., and Nebert, D.W. (2010). Analysis of human CYP1A1 and CYP1A2 genes and their shared bidirectional promoter in eight world populations. Human mutation 31, 27–40.
OpenUrl PubMed
44.↵
Ueda, R., Iketaki, H., Nagata, K., Kimura, S., Gonzalez, F.J., Kusano, K., Yoshimura, T., and Yamazoe, Y. (2006). A common regulatory region functions bidirectionally in transcriptional activation of the human CYP1A1 and CYP1A2 genes. Molecular pharmacology 69, 1924–1930.
OpenUrl Abstract/FREE Full Text
45.↵
Kot, M., and Daniel, W.A. (2008). The relative contribution of human cytochrome P450 isoforms to the four caffeine oxidation pathways: an in vitro comparative study with cDNA-expressed P450s including CYP2C isoforms. Biochemical pharmacology 76, 543–551.
OpenUrl CrossRef PubMed
46.↵
Keast, R.S., Sayompark, D., Sacks, G., Swinburn, B.A., and Riddell, L.J. (2011). The influence of caffeine on energy content of sugar-sweetened beverages: ‘the caffeine-calorie effect’. European journal of clinical nutrition 65, 1338–1344.
OpenUrl CrossRef PubMed
47.↵
Keast, R.S., Swinburn, B.A., Sayompark, D., Whitelock, S., and Riddell, L.J. (2015). Caffeine increases sugar-sweetened beverage consumption in a free-living population: a randomised controlled trial. The British journal of nutrition 113, 366–371.
OpenUrl
48.↵
Superko, H.R., Bortz, W., Jr.., Williams, P.T., Albers, J.J., and Wood, P.D. (1991). Caffeinated and decaffeinated coffee effects on plasma lipoprotein cholesterol, apolipoproteins, and lipase activity: a controlled, randomized trial. The American journal of clinical nutrition 54, 599–605.
OpenUrl Abstract/FREE Full Text
49.↵
Sinha, R.A., Farah, B.L., Singh, B.K., Siddique, M.M., Li, Y., Wu, Y., Ilkayeva, O.R., Gooding, J., Ching, J., Zhou, J., et al. (2014). Caffeine stimulates hepatic lipid metabolism by the autophagy-lysosomal pathway in mice. Hepatology (Baltimore, Md) 59, 1366–1380.
OpenUrl CrossRef PubMed
50.↵
van Tol, A., Urgert, R., de Jong-Caesar, R., van Gent, T., Scheek, L.M., de Roos, B., and Katan, M.B. (1997). The cholesterol-raising diterpenes from coffee beans increase serum lipid transfer protein activity levels in humans. Atherosclerosis 132, 251–254.
OpenUrl PubMed
51.↵
Kurzrock, T., and Speer, K. (2007). Diterpenes and Diterpene Esters in Coffee. Food Reviews International 17, 433–450.
OpenUrl
52.↵
Jee, S.H., He, J., Appel, L.J., Whelton, P.K., Suh, I., and Klag, M.J. (2001). Coffee consumption and serum lipids: a meta-analysis of randomized controlled clinical trials. American journal of epidemiology 153, 353–362.
OpenUrl CrossRef PubMed Web of Science
53.↵
Wu, T., Willett, W.C., Hankinson, S.E., and Giovannucci, E. (2005). Caffeinated coffee, decaffeinated coffee, and caffeine in relation to plasma C-peptide levels, a marker of insulin secretion, in U.S. women. Diabetes care 28, 1390–1396.
OpenUrl Abstract/FREE Full Text
54.↵
Kotsopoulos, J., Eliassen, A.H., Missmer, S.A., Hankinson, S.E., and Tworoger, S.S. (2009). Relationship between caffeine intake and plasma sex hormone concentrations in premenopausal and postmenopausal women. Cancer 115, 2765–2774.
OpenUrl CrossRef PubMed Web of Science
55.↵
Beral, V., Gaitskell, K., Hermon, C., Moser, K., Reeves, G., and Peto, R. (2015). Menopausal hormone use and ovarian cancer risk: individual participant meta-analysis of 52 epidemiological studies. Lancet (London, England) 385, 1835–1842.
OpenUrl

View the discussion thread.

Posted March 09, 2021.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Genetic and Genomic Medicine

Subject Areas

All Articles

Addiction Medicine (349)
Allergy and Immunology (668)
Allergy and Immunology (668)
Anesthesia (181)
Cardiovascular Medicine (2648)
Dentistry and Oral Medicine (316)
Dermatology (223)
Emergency Medicine (399)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
Epidemiology (12228)
Forensic Medicine (10)
Gastroenterology (759)
Genetic and Genomic Medicine (4103)
Geriatric Medicine (387)
Health Economics (680)
Health Informatics (2657)
Health Policy (1005)
Health Systems and Quality Improvement (985)
Hematology (363)
HIV/AIDS (851)
Infectious Diseases (except HIV/AIDS) (13695)
Intensive Care and Critical Care Medicine (797)
Medical Education (399)
Medical Ethics (109)
Nephrology (436)
Neurology (3882)
Nursing (209)
Nutrition (577)
Obstetrics and Gynecology (739)
Occupational and Environmental Health (695)
Oncology (2030)
Ophthalmology (585)
Orthopedics (240)
Otolaryngology (306)
Pain Medicine (250)
Palliative Medicine (75)
Pathology (473)
Pediatrics (1115)
Pharmacology and Therapeutics (466)
Primary Care Research (452)
Psychiatry and Clinical Psychology (3432)
Public and Global Health (6527)
Radiology and Imaging (1403)
Rehabilitation Medicine and Physical Therapy (814)
Respiratory Medicine (871)
Rheumatology (409)
Sexual and Reproductive Health (410)
Sports Medicine (342)
Surgery (448)
Toxicology (53)
Transplantation (185)
Urology (165)

[1] 1.↵
Ding, M., Bhupathiraju, S.N., Satija, A., van Dam, R.M., and Hu, F.B. (2014). Long-term coffee consumption and risk of cardiovascular disease: a systematic review and a dose-response meta-analysis of prospective cohort studies. Circulation 129, 643–659.
OpenUrl Abstract/FREE Full Text

[2] 2.
Ding, M., Bhupathiraju, S.N., Chen, M., van Dam, R.M., and Hu, F.B. (2014). Caffeinated and decaffeinated coffee consumption and risk of type 2 diabetes: a systematic review and a dose-response meta-analysis. Diabetes care 37, 569–586.
OpenUrl Abstract/FREE Full Text

[3] 3.
Nordestgaard, A.T., Thomsen, M., and Nordestgaard, B.G. (2015). Coffee intake and risk of obesity, metabolic syndrome and type 2 diabetes: a Mendelian randomization study. International journal of epidemiology 44, 551–565.
OpenUrl CrossRef PubMed

[4] 4.↵
Poole, R., Kennedy, O.J., Roderick, P., Fallowfield, J.A., Hayes, P.C., and Parkes, J. (2017). Coffee consumption and health: umbrella review of meta-analyses of multiple health outcomes. BMJ 359, j5024.
OpenUrl Abstract/FREE Full Text

[5] 5.↵
Lane, J.D., Feinglos, M.N., and Surwit, R.S. (2008). Caffeine increases ambulatory glucose and postprandial responses in coffee drinkers with type 2 diabetes. Diabetes care 31, 221–222.
OpenUrl FREE Full Text

[6] 6.
Greenberg, J.A., Owen, D.R., and Geliebter, A. (2010). Decaffeinated coffee and glucose metabolism in young men. Diabetes care 33, 278–280.
OpenUrl Abstract/FREE Full Text

[7] 7.
Corrêa, T.A., Rogero, M.M., Mioto, B.M., Tarasoutchi, D., Tuda, V.L., César, L.A., and Torres, E.A. (2013). Paper-filtered coffee increases cholesterol and inflammation biomarkers independent of roasting degree: a clinical trial. Nutrition (Burbank, Los Angeles County, Calif) 29, 977–981.
OpenUrl CrossRef PubMed

[8] 8.↵
van Dam, R.M., Pasman, W.J., and Verhoef, P. (2004). Effects of coffee consumption on fasting blood glucose and insulin concentrations: randomized controlled trials in healthy volunteers. Diabetes care 27, 2990–2992.
OpenUrl FREE Full Text

[9] 9.↵
Nicolopoulos, K., Mulugeta, A., Zhou, A., and Hyppönen, E. (2020). Association between habitual coffee consumption and multiple disease outcomes: A Mendelian randomisation phenome-wide association study in the UK Biobank. Clin Nutr.

[10] 10.↵
Zhang, Y., Liu, Z., Choudhury, T., Cornelis, M.C., and Liu, W. (2020). Habitual coffee intake and risk for nonalcoholic fatty liver disease: a two-sample Mendelian randomization study. European journal of nutrition.

[11] 11.↵
Cornelis, M.C., Byrne, E.M., Esko, T., Nalls, M.A., Ganna, A., Paynter, N., Monda, K.L., Amin, N., Fischer, K., Renstrom, F., et al. (2015). Genome-wide meta-analysis identifies six novel loci associated with habitual coffee consumption. Molecular psychiatry 20, 647–656.
OpenUrl CrossRef PubMed

[12] 12.↵
Sudlow, C., Gallacher, J., Allen, N., Beral, V., Burton, P., Danesh, J., Downey, P., Elliott, P., Green, J., Landray, M., et al. (2015). UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med 12, e1001779.
OpenUrl CrossRef PubMed

[13] 13.↵
Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L.T., Sharp, K., Motyer, A., Vukcevic, D., Delaneau, O., O’Connell, J., et al. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209.
OpenUrl CrossRef PubMed

[14] 14.↵
McCarthy, S., Das, S., Kretzschmar, W., Delaneau, O., Wood, A.R., Teumer, A., Kang, H.M., Fuchsberger, C., Danecek, P., Sharp, K., et al. (2016). A reference panel of 64,976 haplotypes for genotype imputation. Nature genetics 48, 1279–1283.
OpenUrl CrossRef PubMed

[15] 15.↵
Watanabe, K., Taskesen, E., van Bochoven, A., and Posthuma, D. (2017). Functional mapping and annotation of genetic associations with FUMA. Nature communications 8, 1826.
OpenUrl

[16] 16.↵
Hemani, G., Zheng, J., Elsworth, B., Wade, K.H., Haberland, V., Baird, D., Laurin, C., Burgess, S., Bowden, J., Langdon, R., et al. (2018). The MR-Base platform supports systematic causal inference across the human phenome. eLife 7.

[17] 17.↵
Willer, C.J., Li, Y., and Abecasis, G.R. (2010). METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics (Oxford, England) 26, 2190–2191.
OpenUrl CrossRef PubMed Web of Science

[18] 18.↵
Kircher, M., Witten, D.M., Jain, P., O’Roak, B.J., Cooper, G.M., and Shendure, J. (2014). A general framework for estimating the relative pathogenicity of human genetic variants. Nature genetics 46, 310–315.
OpenUrl CrossRef PubMed

[19] 19.↵
(2015). Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science (New York, NY) 348, 648–660.
OpenUrl

[20] 20.↵
Benjamini, Y., and Hochberg, Y. (1995). Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. 57, 289–300.
OpenUrl

[21] 21.↵
Staley, J.R., Blackshaw, J., Kamat, M.A., Ellis, S., Surendran, P., Sun, B.B., Paul, D.S., Freitag, D., Burgess, S., Danesh, J., et al. (2016). PhenoScanner: a database of human genotype-phenotype associations. Bioinformatics (Oxford, England) 32, 3207–3209.
OpenUrl CrossRef PubMed

[22] 22.↵
Kamat, M.A., Blackshaw, J.A., Young, R., Surendran, P., Burgess, S., Danesh, J., Butterworth, A.S., and Staley, J.R. (2019). PhenoScanner V2: an expanded tool for searching human genotype-phenotype associations. Bioinformatics (Oxford, England) 35, 4851–4853.
OpenUrl PubMed

[23] 23.↵
Yang, J., Lee, S.H., Goddard, M.E., and Visscher, P.M. (2011). GCTA: a tool for genome-wide complex trait analysis. American journal of human genetics 88, 76–82.
OpenUrl CrossRef PubMed

[24] 24.↵
Choi, S.W., and O’Reilly, P.F. (2019). PRSice-2: Polygenic Risk Score software for biobank-scale data. GigaScience 8.

[25] 25.↵
Zhong, V.W., Kuang, A., Danning, R.D., Kraft, P., van Dam, R.M., Chasman, D.I., and Cornelis, M.C. (2019). A genome-wide association study of bitter and sweet beverage consumption. Human molecular genetics 28, 2449–2457.
OpenUrl

[26] 26.↵
In. (

[27] 27.↵
Locke, A.E., Kahali, B., Berndt, S.I., Justice, A.E., Pers, T.H., Day, F.R., Powell, C., Vedantam, S., Buchkovich, M.L., Yang, J., et al. (2015). Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206.
OpenUrl CrossRef PubMed

[28] 28.↵
Teslovich, T.M., Musunuru, K., Smith, A.V., Edmondson, A.C., Stylianou, I.M., Koseki, M., Pirruccello, J.P., Ripatti, S., Chasman, D.I., Willer, C.J., et al. (2010). Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466, 707–713.
OpenUrl CrossRef PubMed Web of Science

[29] 29.↵
Wood, A.R., Esko, T., Yang, J., Vedantam, S., Pers, T.H., Gustafsson, S., Chu, A.Y., Estrada, K., Luan, J., Kutalik, Z., et al. (2014). Defining the role of common variation in the genomic and biological architecture of adult human height. Nature genetics 46, 1173–1186.
OpenUrl CrossRef PubMed

[30] 30.↵
Neale, B.M. UK Biobank GWAS. In. (http://www.nealelab.is/uk-biobank.

[31] 31.↵
Gaulton, K.J., Ferreira, T., Lee, Y., Raimondo, A., Mägi, R., Reschen, M.E., Mahajan, A., Locke, A., Rayner, N.W., Robertson, N., et al. (2015). Genetic fine mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility loci. Nature genetics 47, 1415–1425.
OpenUrl CrossRef PubMed

[32] 32.↵
Kim, Y., Je, Y., and Giovannucci, E. (2019). Coffee consumption and all-cause and cause-specific mortality: a meta-analysis by potential modifiers. European journal of epidemiology 34, 731–752.
OpenUrl

[33] 33.↵
Madrigal-Matute, J., and Cuervo, A.M. (2016). Regulation of Liver Metabolism by Autophagy. Gastroenterology 150, 328–339.
OpenUrl

[34] 34.↵
Alemany, S., Ribases, M., Vilor-Tejedor, N., Bustamante, M., Sanchez-Mora, C., Bosch, R., Richarte, V., Cormand, B., Casas, M., Ramos-Quiroga, J.A., et al. (2015). New suggestive genetic loci and biological pathways for attention function in adult attention-deficit/hyperactivity disorder. Am J Med Genet B Neuropsychiatr Genet 168, 459–470.
OpenUrl

[35] 35.↵
Karlsson Linner, R., Biroli, P., Kong, E., Meddens, S.F.W., Wedow, R., Fontana, M.A., Lebreton, M., Tino, S.P., Abdellaoui, A., Hammerschlag, A.R., et al. (2019). Genome-wide association analyses of risk tolerance and risky behaviors in over 1 million individuals identify hundreds of loci and shared genetic influences. Nat Genet 51, 245–257.
OpenUrl

[36] 36.↵
Smith, A.H., Ovesen, P.L., Skeldal, S., Yeo, S., Jensen, K.P., Olsen, D., Diazgranados, N., Zhao, H., Farrer, L.A., Goldman, D., et al. (2018). Risk Locus Identification Ties Alcohol Withdrawal Symptoms to SORCS2. Alcohol Clin Exp Res 42, 2337–2348.
OpenUrl

[37] 37.↵
Fabbri, C., and Serretti, A. (2016). Genetics of long-term treatment outcome in bipolar disorder. Prog Neuropsychopharmacol Biol Psychiatry 65, 17–24.
OpenUrl

[38] 38.↵
Goes, F.S., McGrath, J., Avramopoulos, D., Wolyniec, P., Pirooznia, M., Ruczinski, I., Nestadt, G., Kenny, E.E., Vacic, V., Peters, I., et al. (2015). Genome-wide association study of schizophrenia in Ashkenazi Jews. American journal of medical genetics Part B, Neuropsychiatric genetics: the official publication of the International Society of Psychiatric Genetics 168, 649–659.
OpenUrl

[39] 39.↵
Wu, Y., Cao, H., Baranova, A., Huang, H., Li, S., Cai, L., Rao, S., Dai, M., Xie, M., Dou, Y., et al. (2020). Multi-trait analysis for genome-wide association study of five psychiatric disorders. Translational psychiatry 10, 209.
OpenUrl

[40] 40.↵
Savage, J.E., Jansen, P.R., Stringer, S., Watanabe, K., Bryois, J., de Leeuw, C.A., Nagel, M., Awasthi, S., Barr, P.B., Coleman, J.R.I., et al. (2018). Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence. Nature genetics 50, 912–919.
OpenUrl CrossRef PubMed

[41] 41.↵
Ishikawa, T., Takahashi, S., Morita, K., Okinaga, H., and Teramoto, T. (2014). Induction of AhR-mediated gene transcription by coffee. PloS one 9, e102152.
OpenUrl

[42] 42.↵
Jorge-Nebert, L.F., Jiang, Z., Chakraborty, R., Watson, J., Jin, L., McGarvey, S.T., Deka, R., and Nebert, D.W. (2010). Analysis of human CYP1A1 and CYP1A2 genes and their shared bidirectional promoter in eight world populations. 31, 27–40.
OpenUrl

[43] 43.
Jorge-Nebert, L.F., Jiang, Z., Chakraborty, R., Watson, J., Jin, L., McGarvey, S.T., Deka, R., and Nebert, D.W. (2010). Analysis of human CYP1A1 and CYP1A2 genes and their shared bidirectional promoter in eight world populations. Human mutation 31, 27–40.
OpenUrl PubMed

[44] 44.↵
Ueda, R., Iketaki, H., Nagata, K., Kimura, S., Gonzalez, F.J., Kusano, K., Yoshimura, T., and Yamazoe, Y. (2006). A common regulatory region functions bidirectionally in transcriptional activation of the human CYP1A1 and CYP1A2 genes. Molecular pharmacology 69, 1924–1930.
OpenUrl Abstract/FREE Full Text

[45] 45.↵
Kot, M., and Daniel, W.A. (2008). The relative contribution of human cytochrome P450 isoforms to the four caffeine oxidation pathways: an in vitro comparative study with cDNA-expressed P450s including CYP2C isoforms. Biochemical pharmacology 76, 543–551.
OpenUrl CrossRef PubMed

[46] 46.↵
Keast, R.S., Sayompark, D., Sacks, G., Swinburn, B.A., and Riddell, L.J. (2011). The influence of caffeine on energy content of sugar-sweetened beverages: ‘the caffeine-calorie effect’. European journal of clinical nutrition 65, 1338–1344.
OpenUrl CrossRef PubMed

[47] 47.↵
Keast, R.S., Swinburn, B.A., Sayompark, D., Whitelock, S., and Riddell, L.J. (2015). Caffeine increases sugar-sweetened beverage consumption in a free-living population: a randomised controlled trial. The British journal of nutrition 113, 366–371.
OpenUrl

[48] 48.↵
Superko, H.R., Bortz, W., Jr.., Williams, P.T., Albers, J.J., and Wood, P.D. (1991). Caffeinated and decaffeinated coffee effects on plasma lipoprotein cholesterol, apolipoproteins, and lipase activity: a controlled, randomized trial. The American journal of clinical nutrition 54, 599–605.
OpenUrl Abstract/FREE Full Text

[49] 49.↵
Sinha, R.A., Farah, B.L., Singh, B.K., Siddique, M.M., Li, Y., Wu, Y., Ilkayeva, O.R., Gooding, J., Ching, J., Zhou, J., et al. (2014). Caffeine stimulates hepatic lipid metabolism by the autophagy-lysosomal pathway in mice. Hepatology (Baltimore, Md) 59, 1366–1380.
OpenUrl CrossRef PubMed

[50] 50.↵
van Tol, A., Urgert, R., de Jong-Caesar, R., van Gent, T., Scheek, L.M., de Roos, B., and Katan, M.B. (1997). The cholesterol-raising diterpenes from coffee beans increase serum lipid transfer protein activity levels in humans. Atherosclerosis 132, 251–254.
OpenUrl PubMed

[51] 51.↵
Kurzrock, T., and Speer, K. (2007). Diterpenes and Diterpene Esters in Coffee. Food Reviews International 17, 433–450.
OpenUrl

[52] 52.↵
Jee, S.H., He, J., Appel, L.J., Whelton, P.K., Suh, I., and Klag, M.J. (2001). Coffee consumption and serum lipids: a meta-analysis of randomized controlled clinical trials. American journal of epidemiology 153, 353–362.
OpenUrl CrossRef PubMed Web of Science

[53] 53.↵
Wu, T., Willett, W.C., Hankinson, S.E., and Giovannucci, E. (2005). Caffeinated coffee, decaffeinated coffee, and caffeine in relation to plasma C-peptide levels, a marker of insulin secretion, in U.S. women. Diabetes care 28, 1390–1396.
OpenUrl Abstract/FREE Full Text

[54] 54.↵
Kotsopoulos, J., Eliassen, A.H., Missmer, S.A., Hankinson, S.E., and Tworoger, S.S. (2009). Relationship between caffeine intake and plasma sex hormone concentrations in premenopausal and postmenopausal women. Cancer 115, 2765–2774.
OpenUrl CrossRef PubMed Web of Science

[55] 55.↵
Beral, V., Gaitskell, K., Hermon, C., Moser, K., Reeves, G., and Peto, R. (2015). Menopausal hormone use and ovarian cancer risk: individual participant meta-analysis of 52 epidemiological studies. Lancet (London, England) 385, 1835–1842.
OpenUrl

Habitual Coffee Consumption Increases Risks for Metabolic Diseases: Genome-wide Association Studies and a Phenotype-wide Two Sample Mendelian Randomization Analysis

Abstract

Introduction

Methods

Study design and cohorts

Phenotype definitions

The genome-wide association study and meta-analysis

Functional annotation of genome-wide association study and meta-analysis

Mendelian randomization study

Statistical analysis

Results

GWAS of coffee consumption (cups/day), caffeine consumption, and non-caffeine coffee consumption

Functional interpretation and pleiotropic effect of genetic variants

Causal relationship between coffee consumption and the health consequences: TSMR

Validation of the associations between coffee consumption and health outcomes: OSMR

Causal relationship between the consumption of caffeine and non-caffeine components in coffee and metabolic traits

Discussion

Limitations

Conclusions

Data Availability

Author contributions

Acknowledgements

Footnotes

References

Citation Manager Formats

Subject Area