medRxiv

Genetic heterogeneity and subtypes of major depression

Thuy-Dung Nguyen, Arvid Harder, Ying Xiong, Kaarina Kowalec, Sara Hägg, Na Cai, Ralf Kuja-Halkola, Christina Dalman, Patrick F Sullivan, Yi Lu
doi: https://doi.org/10.1101/2021.03.05.21252911
1 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
2 Department of Global Public Health, Karolinska Institutet, Stockholm, Sweden
3 College of Pharmacy, University of Manitoba, Winnipeg, Canada
4 Helmholtz Pioneer Campus, Helmholtz Zentrum München, Neuherberg, Germany
5 Department of Genetics and Psychiatry, University of North Carolina, Chapel Hill, NC, USA

Affiliations by author: Thuy-Dung Nguyen (1, 2); Arvid Harder (1); Ying Xiong (1); Kaarina Kowalec (1, 3); Sara Hägg (1); Na Cai (4); Ralf Kuja-Halkola (1); Christina Dalman (2); Patrick F Sullivan (1, 5); Yi Lu (1, 2)

For correspondence: lu.yi{at}ki.se

ABSTRACT

Background Major depression (MD) is a heterogeneous disorder; however, the extent to which genetic factors distinguish MD patient subgroups (genetic heterogeneity) remains uncertain. This study sought evidence for genetic heterogeneity in MD.

Methods Using the UK Biobank cohort, the authors defined 16 MD subtypes within eight comparison groups (vegetative symptoms, symptom severity, comorbid anxiety disorder, age at onset, recurrence, suicidality, impairment and postpartum depression; N∼3,000-47,000). To compare the genetic architecture of these subtypes, subtype-specific genome-wide association studies were performed to estimate SNP-heritability and genetic correlations, both within subtype comparisons and with other related disorders or traits.

Results MD subtypes were divergent in their SNP-heritability and in their genetic correlations, both within subtype comparisons and with other related disorders/traits. Three subtype comparisons (age at onset, suicidality, and impairment) showed significant differences in SNP-heritability, while genetic correlations within subtype comparisons ranged from 0.55 to 0.86, suggesting that genetic profiles are only partially shared among MD subtypes. Furthermore, subtypes that are more clinically challenging (e.g., early-onset, recurrent, suicidal, more severely impaired) had stronger genetic correlations with other psychiatric disorders. MD with atypical features showed a positive genetic correlation (+0.40) with BMI, while a negative correlation (−0.09) was found in those with non-atypical symptoms. Novel genomic loci with subtype-specific effects were identified.

Conclusions These results provide the most comprehensive evidence to date for genetic heterogeneity within MD, and suggest that the phenotypic complexity of MD can be effectively reduced by studying the subtypes which share partially distinct etiologies.

INTRODUCTION

Major depression (MD) is a common psychiatric disorder that affects 15% of the population during their lifetime.(1) Individuals with MD vary considerably in symptoms, severity, course, treatment response, and neurobiology.(2) MD heterogeneity is a major research and clinical challenge.(3) Despite major efforts in epidemiological, clinical, and biological psychiatry, this decades-long challenge remains largely unresolved.(4-6) MD subtypes have been proposed within five major categories that focus on: symptoms (typical versus atypical, with or without concomitant anxiety, etc.), etiology (with or without trauma or postpartum exposure), time of onset/time course (early- versus late-onset, recurrent), sex, and treatment outcome (treatment responsive versus resistant).(6) Many of these subtypes, however, exhibit unclear distinctions in underlying biology, psychosocial factors, and treatment efficacy.(6) One key biological component is genetics: the extent to which genetic factors distinguish these MD subtypes (i.e., genetic heterogeneity) is largely unknown.

Given the relatively low heritability of MD (30-40%)(7, 8), identifying MD subtypes that are more heritable is of particular importance. Among the proposed subtypes, the sex difference in heritability is the most intensively studied, and current findings support that MD is more heritable in women than in men.(9) Family-based studies have suggested that early-onset MD, recurrent MD, and postpartum depression confer higher genetic liability, which was subsequently confirmed using polygenic risk scores (PRS) in recent MD genome-wide association studies (GWAS).(9-13) Comparisons of MD subtypes (early- versus late-onset, atypical versus non-atypical, with or without adversity) have yielded interesting findings (e.g., genetic overlap with metabolic traits was found only in MD with atypical features, not among those with non-atypical symptoms).(14) The studies to date that have used genetic approaches to index the heterogeneity of MD subtypes are encouraging (summarized in Table 1) but are overall impeded by a paucity of large cohorts with similar ascertainment, phenotyping, and genotyping.(5) As a result, a systematic comparison across the MD subtypes is lacking and the overall evidence for genetic heterogeneity within MD is inconclusive.

Table 1.

Summary of current literature on MD subtype heterogeneity

The goal of this study was to test genetic heterogeneity in clinically-informed MD subtypes. To accomplish this, we systematically evaluated 16 subtypes in the unique UK Biobank (UKB) cohort with large-scale genomic data and a wide array of phenotypic measures uniformly assessed. In particular, we compared genetic architectures among subtypes by quantifying differences in heritability (i.e., measuring the relative importance of genetic effects on phenotypic variance) and estimating genetic correlations (i.e., to determine if underlying genetic risk factors are identical) within subtype comparisons and with other traits.

METHODS and MATERIALS

To identify MD subtypes and compare their genetic architectures, we carefully selected phenotypes and large-scale genotype data from the UKB. The full protocol and scripts are available via GitHub.

Participant and phenotype definitions

UKB is a population-based cohort of over 500,000 adults (age 37-73) from across the United Kingdom.(15) UKB has phenotypic data from questionnaires, health records, biological sampling, and physical measurements. Information about mental health, including MD, was collected from several sources: touchscreen questionnaires, nurse interviews, hospital admission records, and a web-based mental health questionnaire (MHQ) follow-up. The UKB data profile is available elsewhere (15) and is briefly described in Supplementary S1.1.

MD case definition

Cases were identified using five MD definitions: (i) lifetime MD based on the Composite International Diagnostic Interview (CIDI) Short Form; (ii) ICD-coded MD based on linked hospital admission records; (iii) probable MD based on Smith et al.(16); (iv) self-reported MD as part of past and current medical conditions; and (v) MD cardinal symptoms of anhedonia and dysphoria (Supplementary table S2.1). These MD definitions have been used in previous studies.(17-19) Because some definitions were available only for parts of the UKB sample, to maximize sample size for MD subtypes, we included individuals who met criteria for at least one of the five MD definitions as cases. MD subtypes were all nested in the broad MD group but drew on different MD definitions (Supplementary table S2.2).

MD subtypes

According to major clinical features in MD, we defined 16 MD subtypes within eight comparison dimensions: (i) MD with versus without atypical features, based on vegetative symptoms of hypersomnia and weight gain; (ii) severe versus mild/moderate MD, based on symptom severity as defined in Smith et al.(16) or ICD codes; (iii) MD with or without comorbid anxiety disorder, either self-reported or based on ICD codes; (iv) early- versus late-onset MD, based on the age at which a ≥2-week episode of cardinal symptoms was first experienced; (v) recurrent versus single-episode MD, based on the number of episodes from self-report or ICD codes; (vi) MD with or without suicidal thoughts or self-harm, experienced either recently or during the worst episode; (vii) MD with mild, moderate, or severe impairment of normal roles; and (viii) postpartum depression (PPD), either self-reported or based on ICD codes (Table 2). The majority of these subtypes are included in the five major categories proposed in the previous meta-review, while the subtypes on suicidality and on impairment, which relate to general outcomes of MD, are extensions of the category focused on treatment outcomes (Supplementary S1.1, table S2.3).(6)

Table 2.

MD subtypes and sample sizes

Control group

We used a common control group without lifetime history of MD for all subtypes except those defined by comorbid anxiety disorder and PPD. From the entire UKB population, we excluded those with any indication of MD using the five MD case criteria described above and two additional exclusion criteria: help-seeking for MD and antidepressant use (medication list in Supplementary table S2.4). For the subtypes with or without comorbid anxiety disorder, we further excluded controls with ICD diagnoses of anxiety disorders. For PPD, we restricted controls to women who reported at least one live birth (Supplementary table S2.1).

Exclusionary criteria for cases and controls

We excluded any case or control who met lifetime criteria for schizophrenia, schizoaffective disorder, or bipolar disorder I (including unipolar mania) (Supplementary table S2.1). Thus, anyone who had an ICD diagnosis of schizophrenia/psychosis, bipolar disorder, or mania, or who reported any use of antipsychotics or lithium for psychiatric symptoms (Supplementary table S2.4), was excluded from analyses. Application of these criteria removed 2,385 MD cases and 231 controls (Supplementary figure S3.1).

Genotyping, quality control, imputation

Genotype data were available for 488,363 UKB participants, after a stringent quality control procedure and imputation using combined reference panels of Haplotype Reference Consortium (HRC) and UK10K merged with 1000 Genomes phase 3.(15) 459,590 individuals remained after the exclusion of subjects with low-quality genotype data, unmatched ID with phenotype data, consent withdrawal, and non-European ancestry outliers (Supplementary figure S3.1).

Statistical analysis

Genome-wide association studies (GWAS)

We generated GWAS summary statistics for MD subtypes to estimate SNP-heritability and genetic correlations for computational efficiency. In the UKB, about 30% of the participants were found to be related to at least one other person in the cohort up to the 3rd degree.(15) Cryptic relatedness within sample could bias results in GWAS, while restricting to the unrelated individuals would cause a major loss of statistical power. We therefore performed the mixed linear model-based GWAS analysis (fastGWA) to retain related individuals in the UKB.(20) We first constructed a sparse genetic relationship matrix (GRM) for all included individuals of European ancestries, and then conducted case-control GWAS for each subtype using fastGWA module in GCTA, adjusting for sex, age, and the first 10 PCs(20) (Supplementary S2.2).

For subtype-specific GWAS with genome-wide significant SNPs (p ≤ 5×10^−8), we identified independent genomic loci using the SNP2GENE module in FUMA(21), then compared our loci with the latest published MD GWAS results, which consisted of samples from the Psychiatric Genomics Consortium (PGC), UKB, and 23andMe.(19)

SNP-Heritability

We estimated SNP-heritability (h2SNP) on the observed scale for each MD subtype using linkage disequilibrium score regression (LDSC).(22) LDSC estimates h2SNP by regressing GWAS summary statistics on LD scores estimated from a reference population (1000 Genomes European samples). To allow comparison between subtypes, we converted the observed h2SNP to the liability scale and, as previously suggested(23), corrected for oversampling and extreme phenotyping using the sample prevalence and the proportions of the population that are cases and controls. Except for PPD, the subtype-specific population prevalence was calculated as the MD lifetime prevalence (15%) scaled by the literature-based proportion of the subtype in MD, and we used 85% as the proportion of the population that are non-MD controls (details in Supplementary table S2.5). We also show the impact of the population case prevalence estimates on h2SNP (Figure 1b). When comparing heritability estimates within subtype comparisons, we cannot directly test the statistical significance of the difference in estimates because of sample overlap; therefore, we considered estimates to be significantly different when their confidence intervals did not overlap.
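The prevalence scaling and scale conversion described above can be sketched as follows. This is a minimal illustration using the widely used observed-to-liability transformation for case-control traits, not the extended correction for extreme phenotyping cited as (23), which additionally takes separate population proportions for cases and controls. The function names and the example sample prevalence and observed-scale h2SNP are hypothetical; only the 15% lifetime prevalence and the 55% comorbid-anxiety proportion come from the text.

```python
from statistics import NormalDist


def subtype_prevalence(md_prevalence: float, subtype_proportion: float) -> float:
    """Population prevalence of a subtype = MD prevalence x share of the subtype in MD."""
    return md_prevalence * subtype_proportion


def liability_scale_h2(h2_obs: float, pop_prev: float, sample_prev: float) -> float:
    """Standard conversion of observed-scale h2 to the liability scale.

    h2_liab = h2_obs * K^2 (1-K)^2 / (P (1-P) z^2)
    K: population prevalence, P: proportion of cases in the sample,
    z: standard normal density at the liability threshold for K.
    """
    if not (0.0 < pop_prev < 1.0 and 0.0 < sample_prev < 1.0):
        raise ValueError("prevalences must lie strictly between 0 and 1")
    std_normal = NormalDist()
    threshold = std_normal.inv_cdf(1.0 - pop_prev)
    z = std_normal.pdf(threshold)
    numerator = (pop_prev * (1.0 - pop_prev)) ** 2
    denominator = sample_prev * (1.0 - sample_prev) * z ** 2
    return h2_obs * numerator / denominator


# 15% lifetime MD prevalence, 55% of cases with comorbid anxiety (values from the text).
k_anxiety = subtype_prevalence(0.15, 0.55)

# Hypothetical inputs for illustration only: observed h2 of 0.05, 10% cases in the sample.
print(round(k_anxiety, 4), liability_scale_h2(0.05, k_anxiety, 0.10))
```

Because the conversion depends strongly on the assumed population prevalence, varying `pop_prev` over a plausible range is a simple way to see how sensitive a liability-scale estimate is to that assumption.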

Genetic correlation

Genetic correlations (rg) were estimated using High-Definition Likelihood (HDL) method which yields more precise estimates of genetic correlations than LDSC (Supplementary S2.2).(24) We estimated rg within subtype comparisons using the LD reference computed from 335,265 Genomic British individuals in the UKB.

To examine whether the subtypes differ in their genetic overlap with other psychiatric disorders and traits, we also estimated genetic correlations between these MD subtypes and 11 traits (six psychiatric disorders, neuroticism, self-reported well-being, body mass index, and two cognitive traits) and compared results within subtype comparisons. These disorders and traits were chosen given the strong evidence for their genetic correlations with MD, or in some cases, for their causal effects on MD.(13, 19) We have used the summary statistics from the latest published GWAS for the calculations of rg using HDL.(19, 25-34)

Sensitivity analyses

To examine whether our broad MD definition that included less strictly defined cases may bias results, we further restricted the analyses to the CIDI-based definition—previously suggested as the closest to the gold standard for diagnosing MD in the UKB(17, 35, 36)—and performed similar analyses for all subtypes except impairment (Supplementary S2.2).

RESULTS

Of 459,590 individuals included in this study (54% female, mean age at recruitment 57 years, SD 8.00), 126,506 (27.53%) met at least one of the five definitions for MD (i.e., the broad MD phenotype). After applying exclusion criteria, we retained 124,121 cases and 250,229 controls. Compared with controls, MD cases included more females (64% vs 47%), had a higher Townsend deprivation index (mean -1.33, SD 3.02 vs -1.63, SD 2.90), and included more lifetime smokers (57% vs 52%), but did not differ in mean BMI (mean 27.3, SD 4.6 vs 27.3, SD 5.0).

The estimates of SNP-heritability varied across the five MD case definitions, and for the broad MD phenotype it was 7.38% (95% CI= 6.75-8.01%) (Supplementary figure S3.2).

Differences in genetic architecture reflect subtype heterogeneity

Overall, SNP-heritability estimates tended to be higher in MD subtypes with more severe manifestation (e.g., MD with atypical features, comorbid anxiety disorder, PPD, severe impairment and severe symptoms subtypes) (Figure 1a). All of the subtype comparisons had higher h2SNP estimates for the more severe manifestation, and three (age at onset, suicidality, and impairment) showed significant differences in h2SNP estimates (Figure 1a-b). All examined genetic correlations within comparisons were significantly less than one and the estimates ranged between 0.55-0.86 (Figure 1c).

Figure 1. SNP-heritability and pair-wise genetic correlation for MD subtypes.

(a) SNP-heritability of MD subtypes on the liability scale. The bars show point estimates and the error bars show 95% CIs. The same color coding is used for subtypes in the same comparison group. The horizontal line shows SNP-heritability for the broad MD phenotype (h2SNP=0.074). The sample and population prevalences used for the liability-scale conversion are available in Supplementary table S2.5. (b) SNP-heritability of MD subtypes on the liability scale for a range of population case prevalences. Each panel shows one comparison group. Shaded areas show 95% CIs for SNP-heritability on the liability scale. Population control prevalence is fixed for each subtype as in Supplementary table S2.5. (c) Pair-wise genetic correlation between subtypes within comparison groups. Error bars show 95% CIs. Co-anxiety: MD with comorbid anxiety; Non-co. anxiety: MD without comorbid anxiety. Colors indicate the same comparison group as in (a).

The h2SNP estimate for MD with atypical features was the highest among all subtypes and was almost twice as high as the estimate for non-atypical MD, although the 95% confidence intervals overlapped (13.35%, CI=7.57-19.13% and 7.48%, CI=6.60-8.36%). The genetic correlation between the subtypes with and without atypical features was the lowest among all comparisons (rg=0.55, CI=0.44-0.66) (Figure 1c). The two subtypes did not differ significantly in their genetic correlations with PGC MD (Figure 2); instead, major differences were found in their correlations with anorexia nervosa and ADHD. Consistent with previous findings (14, 37, 38), MD with atypical features showed a strong positive rg with BMI (0.40, CI=0.34-0.46), while non-atypical MD showed a small negative rg (rg=-0.09, CI=-0.13 to -0.06). Furthermore, positive correlations with cognitive traits were observed in non-atypical MD (rg=0.36 and 0.35 with educational attainment and intelligence, respectively) but not in MD with atypical features (corresponding rg=0.04 and 0.07).

Figure 2. Genetic correlations (rg) between MD subtypes with other psychiatric disorders and related traits.

Each panel shows rg with other traits for each subtype comparison. rg with other traits for each subtype are in different colors. Error bars show 95% CI. Vertical dash lines in each panel at rg=0. Horizontal dash line separates psychiatric and other traits. Co-anxiety: MD with comorbid anxiety; Non-co. anxiety: MD without comorbid anxiety.

The MD subtype with severe symptoms had a slightly higher h2SNP estimate than the one with mild/moderate symptoms, although the two estimates were not significantly different. The rg within this comparison was significantly lower than 1 (0.80, CI=0.68-0.92). However, the two subtypes did not differ in their correlations with other traits, except for a stronger rg with schizophrenia in the subtype with severe symptoms (Figures 1-2).

Assuming the proportions of MD cases with and without comorbid anxiety disorder at 55% and 45%, respectively(39), the former subtype was more heritable than the latter (h2SNP=11.35%, CI=10.12-12.58%, for MD with comorbid anxiety disorder, compared with 9.43%, CI=7.92-10.95%, for MD without anxiety disorder). The rg within this comparison was 0.80 (CI=0.73-0.88) (Figure 1). Furthermore, the subtype with comorbid anxiety disorder showed higher genetic correlations with MD, schizophrenia and neuroticism, as well as lower correlations with cognitive traits, when compared with the subtype without anxiety disorder (Figure 2).

The SNP-heritability of early-onset MD was three times higher than that of the late-onset subtype (9.97%, CI=8.89-11.04% compared with 3.25%, CI=2.46-4.04%). The rg within this comparison was 0.76 (CI=0.68-0.85) (Figure 1). The two subtypes differed significantly in their rg with other traits: early-onset MD had higher genetic correlations with PGC MD, schizophrenia, anorexia nervosa, and autism spectrum disorder than late-onset MD (Figure 2).

Recurrent and single-episode MD had similar h2SNP estimates (7.94% and 7.47%). However, their rg was significantly lower than one (0.83, CI=0.74-0.92) (Figure 1). Compared with single-episode cases, recurrent MD had stronger positive correlations with schizophrenia, bipolar disorder, anorexia nervosa, while lower genetic correlation with BMI (Figure 2).

The MD subtype with suicidal thoughts was more heritable than the subtype without (8.14%, CI=7.20-9.07% and 6.25%, CI=5.46-7.03%). The rg within this comparison was 0.79 (CI=0.73-0.84). The two subtypes in this comparison differed significantly in their genetic correlations with the majority of the other traits considered. Compared with the subtype without suicidal thoughts, the suicidal subtype showed substantially higher positive rg with PGC MD, schizophrenia, and neuroticism, and a stronger negative rg with well-being, while its rg with cognitive traits was much weaker (Figures 1-2).

For subtypes based on impairment, the h2SNP estimates increased with the degree of impairment, roughly in a dose-response relationship, i.e., mild impairment had the lowest h2SNP (3.72%, CI=3.09-4.34%), followed by moderate (5.62%, CI=4.81-6.44%) and severe impairment (10.42%, CI=9.08-11.76%). This dose-response relationship was also reflected in the pair-wise genetic correlation estimates, with the rg comparing mild and severe impairment (0.65, CI=0.58-0.72) markedly lower than the other two correlations (Figure 1). We observed a clear trend, that is, the more severe impairment in the subtype, the stronger genetic correlation it had with other psychiatric disorders and neuroticism (positive rg), and with self-reported well-being (negative rg), while less severe impairment was more strongly associated with cognitive traits (positive rg) and with BMI (negative rg) (Figure 2).
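The decision rule from the Methods (estimates count as significantly different when their 95% confidence intervals do not overlap) can be applied directly to the intervals reported above. The sketch below is illustrative only; the function and variable names are hypothetical, and it covers only the comparisons for which both intervals are stated in the text (symptom severity and recurrence are omitted because their CIs are not given).

```python
def intervals_overlap(ci_a: tuple[float, float], ci_b: tuple[float, float]) -> bool:
    """Return True when two closed intervals (lower, upper) share at least one point."""
    (lo_a, hi_a), (lo_b, hi_b) = ci_a, ci_b
    return lo_a <= hi_b and lo_b <= hi_a


# 95% CIs for liability-scale h2SNP (%), copied from the Results text.
H2_CI = {
    "atypical vs non-atypical": ((7.57, 19.13), (6.60, 8.36)),
    "comorbid anxiety vs none": ((10.12, 12.58), (7.92, 10.95)),
    "early vs late onset": ((8.89, 11.04), (2.46, 4.04)),
    "suicidal vs non-suicidal": ((7.20, 9.07), (5.46, 7.03)),
    "severe vs mild impairment": ((9.08, 11.76), (3.09, 4.34)),
}

# Comparisons whose intervals do not overlap are treated as significantly different.
significant = [name for name, (a, b) in H2_CI.items() if not intervals_overlap(a, b)]
print(significant)
```

Requiring non-overlapping intervals is a conservative criterion: two estimates can differ significantly under a formal test even when their intervals overlap slightly, so this rule may understate the number of heterogeneous comparisons.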

The h2SNP of PPD was estimated at 11.31% (CI=6.61-16.0%) which was higher compared with h2SNP of broad MD phenotype. PPD showed significant positive rg with other psychiatric disorders, with the strongest rg observed in PGC MD as expected (0.61, CI=0.53-0.69), and with neuroticism (rg=0.34) and cognitive traits (rg=0.35 and 0.41 with educational attainment and intelligence), and a negative rg with well-being (rg=-0.39) (Figure 2).

The broad MD definition was used above to allow sufficient statistical power in analyzing each subtype. We further assessed the impact of the MD definition by performing a sensitivity analysis based on more strictly defined MD cases. The SNP-heritability of the CIDI-based definition was in line with previous reports (h2SNP=15.7%, CI=13.4-18.1%, Supplementary figure S3.2)(17, 35). When the analyses were restricted to the CIDI-based cases, the results were highly similar, except for the comparisons of symptom severity and recurrence, where the CIs of the rg estimates now included one because of the markedly reduced sample sizes in these subtypes (Supplementary table S2.7).

Stratified GWAS reveal novel subtype-specific loci

Across all 16 subtype-specific GWAS, we identified 47 genome-wide significant loci (45 non-overlapping) associated with nine subtypes. Fewer than half (22 loci) were significant in our largest GWAS of broad MD. Compared with the latest published MD GWAS (19), we found 14 loci that have not been reported for MD: 3 for early-onset, 3 for recurrent, 3 for suicidal MD, 2 for non-suicidal, 1 for non-atypical symptoms, 1 for moderate impairment, and 1 for PPD (Table 3; full results on the 45 loci in Supplementary table S2.6). The majority (64%) of these 14 loci showed no statistically significant association with the other subtype in the comparison (P>0.05; Supplementary S2.6), suggesting subtype-specific effects. The chromosome 2 locus for recurrent MD, with the leading SNP rs6431690, was significant even after the stringent Bonferroni correction (P < 3.125×10^−9).
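The arithmetic behind the thresholds and counts in this paragraph can be checked with a short sketch. It assumes the Bonferroni threshold is the conventional genome-wide level divided by the 16 subtype GWAS, which reproduces the reported 3.125×10^−9; the count of nine subtype-specific loci is inferred from the reported 64% and is not stated in the text. Variable names are hypothetical.

```python
GENOME_WIDE_ALPHA = 5e-8  # conventional genome-wide significance level
N_SUBTYPE_GWAS = 16       # one GWAS per MD subtype

# Bonferroni correction across the 16 subtype-specific GWAS.
bonferroni_threshold = GENOME_WIDE_ALPHA / N_SUBTYPE_GWAS

# Previously unreported loci per subtype, as listed in the text.
novel_loci = {
    "early-onset": 3,
    "recurrent": 3,
    "suicidal": 3,
    "non-suicidal": 2,
    "non-atypical": 1,
    "moderate impairment": 1,
    "PPD": 1,
}
total_novel = sum(novel_loci.values())

# Inferred, not reported: number of novel loci implied by "the majority (64%)".
implied_subtype_specific = round(0.64 * total_novel)

# Share of all 47 significant loci that were previously unreported.
novel_share = total_novel / 47

print(bonferroni_threshold, total_novel, implied_subtype_specific, round(novel_share, 2))
```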

Table 3.

14 genome-wide significant loci from MD subtype-specific GWAS that were undetected in Howard et al. 2019.

DISCUSSION

In this comprehensive report using the large-scale UKB data, we compared the genetic architectures of 16 MD subtypes and demonstrated that these subtypes were divergent in their SNP-heritability and genetic correlations both within subtype comparisons and with other related disorders/traits. Our results provide convincing evidence for genetic heterogeneity within MD, as indexed by its clinical subtypes. These findings suggest that the complexity in the phenotype of MD can be effectively reduced by studying the subtypes which share partially distinct etiologies. In particular, we note the following key findings:

First, clinically-informed subtypes are, in general, genetically more homogeneous than MD of all types considered together. Accurately identifying more homogeneous forms is the first step toward reducing heterogeneity in MD. The majority of the subtypes showed higher estimates of SNP-heritability than MD of all forms. Our results corroborated previous findings from family-based studies that early-onset MD, recurrent MD, and PPD represent more heritable MD subtypes.(10, 12) We further extended the list to include MD with atypical features, MD with or without comorbid anxiety disorder, and MD with severe impairment. In contrast, subtypes with lower heritability than all-form MD are those with mild/moderate clinical manifestation or with late onset.

Second, we demonstrated subtype heterogeneity in both SNP-heritability and genetic correlations. All subtype comparisons showed non-identical genetic sharing (i.e., rg between subtypes differed significantly from unity) and some had heritability differences (i.e., h2SNP differed significantly between subtypes). Interestingly, the subtype comparisons on vegetative symptoms, age at onset, and impairment showed the strongest evidence for genetic heterogeneity, suggesting that these clinical features characterize major etiological differences within MD.

However, the observed genetic correlations across subtype comparisons were moderate to high, 0.55-0.86, revealing substantial genetic overlaps between subtypes. The level of genetic correlation can be translated into the proportion of genetic variance in one trait attributable to that of another (rg2).(17) Thus, it would suggest about 30-70% of genetic variances are shared within subtype comparisons. In line with previous estimates of genetic correlations between male versus female MD(9) and across MD symptoms(40), our findings confirm that the genetic profiles of MD subtypes are only partially distinct.
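The translation from rg to shared genetic variance (rg squared) is simple enough to verify by hand, and the sketch below does so for the reported range. It also checks that the upper confidence limit of each within-comparison rg stated in the Results falls below one. Names are hypothetical; the values are copied from the text, and for impairment only the mild-versus-severe pair is included because the other two pairs are not reported numerically.

```python
def shared_genetic_variance(rg: float) -> float:
    """Proportion of genetic variance in one trait attributable to the other (rg squared)."""
    return rg ** 2


# Within-comparison genetic correlations as (estimate, lower 95% CI, upper 95% CI).
WITHIN_COMPARISON_RG = {
    "atypical vs non-atypical": (0.55, 0.44, 0.66),
    "severe vs mild/moderate symptoms": (0.80, 0.68, 0.92),
    "with vs without comorbid anxiety": (0.80, 0.73, 0.88),
    "early vs late onset": (0.76, 0.68, 0.85),
    "recurrent vs single episode": (0.83, 0.74, 0.92),
    "suicidal vs non-suicidal": (0.79, 0.73, 0.84),
    "mild vs severe impairment": (0.65, 0.58, 0.72),
}

# Shared variance at the two ends of the reported rg range (0.55 to 0.86).
shared_range = (shared_genetic_variance(0.55), shared_genetic_variance(0.86))

# An rg is significantly below one when its upper confidence limit is below one.
all_below_one = all(upper < 1.0 for _, _, upper in WITHIN_COMPARISON_RG.values())

for name, (rg, _, _) in WITHIN_COMPARISON_RG.items():
    print(f"{name}: rg={rg:.2f}, shared variance={shared_genetic_variance(rg):.0%}")
```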

The estimates of genetic correlations between subtypes need to be benchmarked against genetic correlations between different psychiatric disorders (e.g., schizophrenia and bipolar disorder, two clinically distinct psychiatric disorders, had a rg of ∼0.70 (28)), between different datasets but with same phenotype (e.g., mean rg∼0.76 across the seven cohorts at PGC MD(13)), and between different populations (e.g., rg∼1 between schizophrenia samples of East Asian and European ancestries(41)). Genetic correlations can be found lower than one due to differences in phenotype definitions, populations, or technical factors(42). In this study, we minimized these potential differences by using the single large sample from UKB. We also restricted the estimation of genetic correlations to within subtype comparisons instead of pair-wise comparisons across all subtypes, to limit the impact of phenotypic differences between subtypes (e.g., we found mean rg across all subtypes was indeed lower than that within comparison groups). Our genetic correlation estimates are thus reliable for quantifying overall genetic sharing between MD subtypes.

Third, the MD subtypes preserve the overall pattern of genetic sharing found between MD (of all forms) and other psychiatric disorders, but differ in their relationships with other traits. MD has been shown to be positively correlated with many psychiatric disorders (e.g., rg∼0.3 with schizophrenia and bipolar disorder) and with BMI (rg=0.09), and negatively correlated with educational attainment (rg=-0.13).(13, 19) A similar level of genetic correlation was found between MD subtypes and other psychiatric disorders; notably, we found stronger correlations in the MD subtypes that are more clinically challenging, especially the early-onset, recurrent, suicidal, and more severely impaired subtypes. In their relationships with other traits, MD subtypes showed some differences from all MD. The positive correlation between MD and BMI was detected only in MD with atypical features, but with a markedly higher estimate (rg=0.40). This result concurs with previous findings, which mainly used PRS or other samples.(14, 37, 38, 43) In contrast with the negative value found in all MD, we found positive correlations with educational attainment in many MD subtypes. However, this finding might be specific to the UKB cohort, as previous research has shown that participation in the mental health survey and other optional components is genetically correlated with higher education and intelligence.(44)

Taken together, our findings provide an improved understanding of heritable MD subtypes and of the overall genetic sharing between subtypes. These results have strong implications for gene mapping strategies in MD. Current efforts predominantly aim to maximize sample size. The alternative strategy, reducing phenotypic heterogeneity by studying more homogeneous phenotypes, has not been fully evaluated, potentially because of theoretical and methodological challenges.(45) This strategy relies on the premise that “clinical heterogeneity in MD emerges from an aggregation of different underlying liabilities expressed through partially distinct biological pathways” (45), which, to the best of our knowledge, has not been proven. Limited by a lack of large-scale datasets with deep phenotyping, prior studies were only able to focus on a few key subtypes.(5, 45) Our comprehensive report, by contrast, demonstrates genetic heterogeneity in MD and thus forms a strong theoretical basis for this strategy. We further illustrated the potential of this strategy by performing stratified GWAS on each subtype. This identified 47 independent genomic loci, about a third of which were undetected in the latest MD GWAS, which had about 5- to 10-fold more cases than our subtype-specific analyses. These results warrant replication in large biobanks with consistent genotyping and phenotyping.

Here we used the UKB data, which provide a unique opportunity to evaluate multiple subtypes with sufficient statistical power. We note, however, the following limitations in interpreting the results. First, we were unable to study all MD subtypes, especially the treatment-related subtypes, as more refined clinical and treatment data would be required. Second, we acknowledge that the quality of phenotypic definitions varied across the subtypes studied, with those relying on self-reported and retrospective recall of symptoms more compromised than the others. Finally, “healthy volunteer bias” is known for UKB (46) and is likely to have contributed to part of our results.

Etiological heterogeneity hinders treatment efficacy. Our finding of ubiquitous subtype heterogeneity within MD underscores the potential of drug development and treatment optimization for patient subgroups to achieve precision psychiatry.

URLs

Full protocol and scripts are available via GitHub: https://github.com/Thuy-Dung-Nguyen/MD-subtypes;

UK Biobank Showcase User Guide (2017): http://biobank.ctsu.ox.ac.uk/crystal/crystal/exinfo/ShowcaseUserGuide.pdf;

UK Biobank Mental health web-based questionnaire (2017): http://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/mental_health_online.pdf;

GCTA-fastGWA: https://cnsgenomics.com/software/gcta/#fastGWA;

FUMA: https://fuma.ctglab.nl;

LDSC: https://github.com/bulik/ldsc;

HDL: https://github.com/zhenin/HDL;

Howard et al. 2019 MD GWAS summary results: https://datashare.is.ed.ac.uk/handle/10283/3203.

Data Availability

The data used for this manuscript are not publicly available. Please contact the corresponding author with questions regarding the data. The code used to generate the results is available at the GitHub link below.

https://github.com/Thuy-Dung-Nguyen/MD-subtypes

Disclosures

PFS reports the following potentially competing financial interests. Current: Lundbeck (advisory committee, grant recipient), RBNC Therapeutics (advisory committee, stock ownership). CMB reports: Shire (grant recipient, Scientific Advisory Board member); Idorsia (consultant); Pearson (author, royalty recipient).

Acknowledgement

This research has been conducted using the UK Biobank Resource under Application Number 22224. This study was funded by the US NIMH grant (R01 MH123724) and the European Union’s Horizon 2020 research and innovation program under grant agreement No 847776. PFS was supported by the Swedish Research Council (Vetenskapsrådet, award D0886501), the Horizon 2020 Program of the European Union (COSYN, RIA grant agreement n° 610307), and US NIMH (U01 MH109528 and R01 MH077139). YL is in part supported by a 2018 NARSAD Young Investigator Grant from the Brain & Behaviour Research Foundation and US NIMH (R01 MH123724).

The computations were enabled by resources provided by the Swedish National Infrastructure for Computing (SNIC) at UPPMAX server partially funded by the Swedish Research Council through grant agreement no. 2018-05973.

REFERENCES

  1. World Health Organization. Depression and other common mental disorders: global health estimates. Geneva: World Health Organization; 2017. Contract No.: WHO/MSD/MER/2017.2.
  2. Fried EI, Nesse RM. Depression is not a consistent syndrome: An investigation of unique symptom patterns in the STAR*D study. J Affect Disord. 2015;172:96–102.
  3. Flint J, Kendler KS. The genetics of major depression. Neuron. 2014;81(3):484–503.
  4. Beijers L, Wardenaar KJ, van Loo HM, Schoevers RA. Data-driven biological subtypes of depression: systematic review of biological approaches to depression subtyping. Molecular Psychiatry. 2019;24(6):888–900.
  5. Cai N, Choi KW, Fried EI. Reviewing the genetics of heterogeneity in depression: operationalizations, manifestations and etiologies. Human Molecular Genetics. 2020;29(R1):R10–R8.
  6. Harald B, Gordon P. Meta-review of depressive subtyping models. J Affect Disord. 2012;139(2):126–40.
  7. Polderman TJC, Benyamin B, de Leeuw CA, Sullivan PF, van Bochoven A, Visscher PM, et al. Meta-analysis of the heritability of human traits based on fifty years of twin studies. Nature Genetics. 2015;47(7):702–9.
  8. Sullivan PF, Neale MC, Kendler KS. Genetic epidemiology of major depression: review and meta-analysis. Am J Psychiatry. 2000;157(10):1552–62.
  9. Kendler KS, Ohlsson H, Lichtenstein P, Sundquist J, Sundquist K. The Genetic Epidemiology of Treated Major Depression in Sweden. American Journal of Psychiatry. 2018;175(11):1137–44.
  10. Fernandez-Pujals AM, Adams MJ, Thomson P, McKechanie AG, Blackwood DHR, Smith BH, et al. Epidemiology and Heritability of Major Depressive Disorder, Stratified by Age of Onset, Sex, and Illness Course in Generation Scotland: Scottish Family Health Study (GS:SFHS). PloS one. 2015;10(11):e0142197.
  11. Power RA, Tansey KE, Buttenschøn HN, Cohen-Woods S, Bigdeli T, Hall LS, et al. Genome-wide Association for Major Depression Through Age at Onset Stratification: Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium. Biological Psychiatry. 2017;81(4):325–35.
  12. Viktorin A, Meltzer-Brody S, Kuja-Halkola R, Sullivan PF, Landén M, Lichtenstein P, et al. Heritability of Perinatal Depression and Genetic Overlap With Nonperinatal Depression. Am J Psychiatry. 2016;173(2):158–65.
  13. Wray NR, Ripke S, Mattheisen M, Trzaskowski M, Byrne EM, Abdellaoui A, et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nature Genetics. 2018;50(5):668–81.
  14. Milaneschi Y, Lamers F, Peyrot WJ, Abdellaoui A, Willemsen G, Hottenga JJ, et al. Polygenic dissection of major depression clinical heterogeneity. Mol Psychiatry. 2016;21(4):516–22.
  15. Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562(7726):203–9.
  16. Smith DJ, Nicholl BI, Cullen B, Martin D, Ul-Haq Z, Evans J, et al. Prevalence and Characteristics of Probable Major Depression and Bipolar Disorder within UK Biobank: Cross-Sectional Study of 172,751 Participants. PLOS ONE. 2013;8(11):e75362.
  17. Cai N, Revez JA, Adams MJ, Andlauer TFM, Breen G, Byrne EM, et al. Minimal phenotyping yields genome-wide association signals of low specificity for major depression. Nature Genetics. 2020;52(4):437–47.
  18. Hall LS, Adams MJ, Arnau-Soler A, Clarke TK, Howard DM, Zeng Y, et al. Genome-wide meta-analyses of stratified depression in Generation Scotland and UK Biobank. Transl Psychiatry. 2018;8(1):9.
  19. Howard DM, Adams MJ, Clarke T-K, Hafferty JD, Gibson J, Shirali M, et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nature Neuroscience. 2019;22(3):343–52.
  20. Jiang L, Zheng Z, Qi T, Kemper KE, Wray NR, Visscher PM, et al. A resource-efficient tool for mixed model association analysis of large-scale data. Nature Genetics. 2019;51(12):1749–55.
  21. Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nature Communications. 2017;8(1):1826.
  22. Bulik-Sullivan BK, Loh P-R, Finucane HK, Ripke S, Yang J, Patterson N, et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature Genetics. 2015;47(3):291–5.
  23. Yap CX, Sidorenko J, Marioni RE, Yengo L, Wray NR, Visscher PM. Misestimation of heritability and prediction accuracy of male-pattern baldness. Nat Commun. 2018;9(1):2537.
  24. Ning Z, Pawitan Y, Shen X. High-definition likelihood inference of genetic correlations across human complex traits. Nature Genetics. 2020;52(8):859–64.
  25. Demontis D, Walters RK, Martin J, Mattheisen M, Als TD, Agerbo E, et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nat Genet. 2019;51(1):63–75.
  26. Grove J, Ripke S, Als TD, Mattheisen M, Walters RK, Won H, et al. Identification of common genetic risk variants for autism spectrum disorder. Nat Genet. 2019;51(3):431–44.
  27. Watson HJ, Yilmaz Z, Thornton LM, Hübel C, Coleman JRI, Gaspar HA, et al. Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa. Nat Genet. 2019;51(8):1207–14.
  28. Stahl EA, Breen G, Forstner AJ, McQuillin A, Ripke S, Trubetskoy V, et al. Genome-wide association study identifies 30 loci associated with bipolar disorder. Nat Genet. 2019;51(5):793–803.
  29. Lee JJ, Wedow R, Okbay A, Kong E, Maghzian O, Zacher M, et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat Genet. 2018;50(8):1112–21.
  30. Savage JE, Jansen PR, Stringer S, Watanabe K, Bryois J, de Leeuw CA, et al. Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence. Nat Genet. 2018;50(7):912–9.
  31. Ripke S, Walters JTR, Donovan MC. Mapping genomic loci prioritises genes and implicates synaptic biology in schizophrenia. medRxiv. 2020:2020.09.12.20192922.
  32. Pulit SL, Stoneman C, Morris AP, Wood AR, Glastonbury CA, Tyrrell J, et al. Meta-analysis of genome-wide association studies for body fat distribution in 694 649 individuals of European ancestry. Hum Mol Genet. 2019;28(1):166–74.
  33. Okbay A, Baselmans BM, De Neve JE, Turley P, Nivard MG, Fontana MA, et al. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nat Genet. 2016;48(6):624–33.
  34. Nagel M, Jansen PR, Stringer S, Watanabe K, de Leeuw CA, Bryois J, et al. Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways. Nat Genet. 2018;50(7):920–7.
  35. Glanville KP, Coleman JRI, Howard DM, Pain O, Hanscombe KB, Jermy B, et al. Multiple measures of depression to enhance validity of major depressive disorder in the UK Biobank. BJPsych Open. 2021;7(2):e44.
  36. Glanville KP, Coleman JRI, Howard DM, Pain O, Hanscombe KB, Jermy B, et al. Multiple measures of depression to enhance validity of Major Depressive Disorder in the UK Biobank. medRxiv. 2020:2020.09.18.20196451.
  37. Badini I, Coleman JRI, Hagenaars SP, Hotopf M, Breen G, Lewis CM, et al. Depression with atypical neurovegetative symptoms shares genetic predisposition with immuno-metabolic traits and alcohol consumption. Psychol Med. 2020:1–11.
  38. Milaneschi Y, Lamers F, Penninx BWJH. Dissecting Depression Biological and Clinical Heterogeneity—The Importance of Symptom Assessment Resolution. JAMA Psychiatry. 2021.
  39. Kaufman J, Charney D. Comorbidity of mood and anxiety disorders. Depress Anxiety. 2000;12 Suppl 1:69–76.
  40. Thorp JG, Marees AT, Ong JS, An J, MacGregor S, Derks EM. Genetic heterogeneity in self-reported depressive symptoms identified through genetic analyses of the PHQ-9. Psychol Med. 2019:1–12.
  41. Lam M, Chen CY, Li Z, Martin AR, Bryois J, Ma X, et al. Comparative genetic architectures of schizophrenia in East Asian and European populations. Nat Genet. 2019;51(12):1670–8.
  42. Baselmans BML, Yengo L, van Rheenen W, Wray NR. Risk in Relatives, Heritability, SNP-Based Heritability, and Genetic Correlations in Psychiatric Disorders: A Review. Biological Psychiatry. 2020.
  43. Milaneschi Y, Lamers F, Peyrot WJ, Baune BT, Breen G, Dehghan A, et al. Genetic Association of Major Depression With Atypical Features and Obesity-Related Immunometabolic Dysregulations. JAMA Psychiatry. 2017;74(12):1214–25.
  44. Tyrrell J, Zheng J, Beaumont R, Hinton K, Richardson TG, Wood AR, et al. Genetic predictors of participation in optional components of UK Biobank. bioRxiv. 2020:2020.02.10.941328.
  45. Schwabe I, Milaneschi Y, Gerring Z, Sullivan PF, Schulte E, Suppli NP, et al. Unraveling the genetic architecture of major depressive disorder: merits and pitfalls of the approaches used in genome-wide association studies. Psychol Med. 2019;49(16):2646–56.
  46. Fry A, Littlejohns TJ, Sudlow C, Doherty N, Adamska L, Sprosen T, et al. Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population. American Journal of Epidemiology. 2017;186(9):1026–34.
  47. Cai N, Bigdeli TB, Kretzschmar W, Li Y, Liang J, Song L, et al. Sparse whole-genome sequencing identifies two loci for major depressive disorder. Nature. 2015;523(7562):588–91.
  48. Coleman JRI, Peyrot WJ, Purves KL, Davis KAS, Rayner C, Choi SW, et al. Genome-wide gene-environment analyses of major depressive disorder and reported lifetime traumatic experiences in UK Biobank. Molecular psychiatry. 2020;25(7):1430–46.
  49. Peterson RE, Cai N, Dahl AW, Bigdeli TB, Edwards AC, Webb BT, et al. Molecular Genetic Analysis Subdivided by Adversity Exposure Suggests Etiologic Heterogeneity in Major Depression. The American journal of psychiatry. 2018;175(6):545–54.
Posted March 08, 2021.
Subject Area

  • Psychiatry and Clinical Psychology