Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Increasing Sample Diversity in Psychiatric Genetics – Introducing a new Cohort of Patients with Schizophrenia and Controls from Vietnam – Results from a Pilot Study

VT Nguyen, View ORCID ProfileA Braun, View ORCID ProfileJ Kraft, View ORCID ProfileTMT Ta, View ORCID ProfileGM Panagiotaropoulou, VP Nguyen, TH Nguyen, View ORCID ProfileV Trubetskoy, CT Le, TTH Le, XT Pham, View ORCID ProfileI Heuser-Collier, NH Lam, View ORCID ProfileK Böge, IM Hahne, M Bajbouj, View ORCID ProfileMM Zierhut, View ORCID ProfileE Hahn, View ORCID ProfileS Ripke
doi: https://doi.org/10.1101/2021.04.21.21255615
VT Nguyen
aDepartment of Psychiatry, Hanoi Medical University, Hà Nội, Việt Nam
bNational Institute of Mental Health, Bach Mai Hospital, Hà Nội, Việt Nam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
A Braun
cCharité – Universitätsmedizin Berlin, Department of Psychiatry and Psychotherapy, Campus Charité Mitte, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for A Braun
J Kraft
cCharité – Universitätsmedizin Berlin, Department of Psychiatry and Psychotherapy, Campus Charité Mitte, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for J Kraft
TMT Ta
eBerlin Institute of Health at Charité – Universitätsmedizin Berlin, BIH Academy, Clinician Scientist Program
gCharité – Universitätsmedizin Berlin, Department of Psychiatry, Campus Benjamin Franklin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for TMT Ta
GM Panagiotaropoulou
cCharité – Universitätsmedizin Berlin, Department of Psychiatry and Psychotherapy, Campus Charité Mitte, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for GM Panagiotaropoulou
VP Nguyen
aDepartment of Psychiatry, Hanoi Medical University, Hà Nội, Việt Nam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
TH Nguyen
aDepartment of Psychiatry, Hanoi Medical University, Hà Nội, Việt Nam
gCharité – Universitätsmedizin Berlin, Department of Psychiatry, Campus Benjamin Franklin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
V Trubetskoy
cCharité – Universitätsmedizin Berlin, Department of Psychiatry and Psychotherapy, Campus Charité Mitte, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for V Trubetskoy
CT Le
aDepartment of Psychiatry, Hanoi Medical University, Hà Nội, Việt Nam
bNational Institute of Mental Health, Bach Mai Hospital, Hà Nội, Việt Nam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
TTH Le
aDepartment of Psychiatry, Hanoi Medical University, Hà Nội, Việt Nam
bNational Institute of Mental Health, Bach Mai Hospital, Hà Nội, Việt Nam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
XT Pham
aDepartment of Psychiatry, Hanoi Medical University, Hà Nội, Việt Nam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
I Heuser-Collier
gCharité – Universitätsmedizin Berlin, Department of Psychiatry, Campus Benjamin Franklin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for I Heuser-Collier
NH Lam
fHanoi Mental Hospital, Long Biên, Hà Nội, Việt Nam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
K Böge
gCharité – Universitätsmedizin Berlin, Department of Psychiatry, Campus Benjamin Franklin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for K Böge
IM Hahne
gCharité – Universitätsmedizin Berlin, Department of Psychiatry, Campus Benjamin Franklin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
M Bajbouj
gCharité – Universitätsmedizin Berlin, Department of Psychiatry, Campus Benjamin Franklin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
MM Zierhut
eBerlin Institute of Health at Charité – Universitätsmedizin Berlin, BIH Academy, Clinician Scientist Program
gCharité – Universitätsmedizin Berlin, Department of Psychiatry, Campus Benjamin Franklin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for MM Zierhut
E Hahn
gCharité – Universitätsmedizin Berlin, Department of Psychiatry, Campus Benjamin Franklin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for E Hahn
S Ripke
cCharité – Universitätsmedizin Berlin, Department of Psychiatry and Psychotherapy, Campus Charité Mitte, Berlin, Germany
dStanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for S Ripke
  • For correspondence: sripke{at}broadinstitute.org
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Objectives Genome-Wide Association Studies (GWAS) of Schizophrenia (SCZ) have provided new biological insights; however, most cohorts are of European ancestry. As a result, derived polygenic risk scores (PRS) show decreased predictive power when applied to populations of different ancestries. We aimed to assess the feasibility of a large-scale data collection in Hanoi, Vietnam, contribute to international efforts to diversify ancestry in SCZ genetic research and examine the transferability of SCZ-PRS to individuals of Vietnamese Kinh ancestry.

Methods In a pilot study, 368 individuals (including 190 SCZ cases) were recruited at the Hanoi Medical University’s associated psychiatric hospitals and outpatient facilities. Data collection included sociodemographic data, baseline clinical data, clinical interviews assessing symptom severity and genome-wide SNP genotyping. SCZ-PRS were generated using different training data sets: i) European, ii) East-Asian and iii) trans-ancestry GWAS summary statistics from the latest SCZ GWAS meta-analysis.

Results SCZ-PRS significantly predicted case status in Vietnamese individuals using mixed-ancestry (R2 liability=4.9%, p=6.83*10−8), East-Asian (R2 liability=4.5%, p=2.73*10−7) and European (R2 liability=3.8%, p = 1.79*10−6) discovery samples.

Discussion Our results corroborate previous findings of reduced PRS predictive power across populations, highlighting the importance of ancestral diversity in GWA studies.

1 Introduction

Schizophrenia (SCZ) is a severe and often chronic psychiatric disorder that consistently affects between 0,5-1% of the worldwide population (McGrath et al., 2008). It is a clinically heterogeneous disorder characterised by perceptual distortions such as delusions and hallucinations, disorganised thinking, negative symptoms including reduced affective resonance, avolition, lack of motivation, marked cognitive deficits, as well as mood disturbances (Galderisi et al., 2018). With a high disease burden (GBD 2016 Disease and Injury Incidence and Prevalence Collaborators, 2017), loss of social functioning, and reduced life-expectancy (Walker et al., 2015), SCZ has a substantial impact on public health-related costs (Cloutier et al., 2016). To this date, SCZ remains challenging to treat, as only about 13.5% of patients meet the criteria for full recovery (Charlson et al., 2018), and advances in treatment options have been hampered — at least in part — by our incomplete knowledge of SCZ etiology and pathophysiological mechanisms.

As with all complex traits and diseases, schizophrenia exhibits a diffuse pattern of underlying genetic variation in which many common genetic variants contribute to disease risk. In recent years large-scale genome-wide association studies (GWAS) have provided robust evidence for many susceptibility loci within the genome (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014); the most recent SCZ GWAS meta-analysis identified 276 distinct genomic regions (Schizophrenia Working Group of the Psychiatric Genomics Consortium et al., 2020). While genetic studies can provide insights into the biological underpinnings of SCZ, they also offer potential biomarkers that could inform treatment strategies, patient management, prognosis, and possibly early intervention. A growing number of studies have directed their attention towards the utility of polygenic risk scores (PRS) in clinical psychiatry. PRS are typically constructed using genome-wide summary statistics and represent an individual’s propensity towards certain traits or disorders attributable to common genetic variation (International Schizophrenia Consortium et al., 2009). To establish the clinical relevance of genetic findings, it becomes increasingly important to invest in large, well-phenotyped SCZ cohorts; especially as patients affected by severe psychotic disorders are often underrepresented in population-based research initiatives (J. Martin et al., 2016)

Although increases in sample size boost statistical power, and consequently the discovery of novel risk-conferring variants (Levinson et al., 2014) and predictive capacity of PRS, cohorts of non-European ancestries — especially those from low- and middle-income countries — are still sparse in GWA studies. Multiple works have demonstrated that the prediction accuracy of PRS depends on the ancestral composition of discovery cohorts and their genetic divergence from the target sample (Bigdeli et al., 2020; Lam et al., 2019). As a result, polygenic scores derived from Eurocentric GWAS suffer from an attenuated predictive power when transferred to non-European populations (Duncan et al., 2019; A. R. Martin et al., 2019). Consequently, PRS are less informative for non-European populations; hence, patients from these populations will benefit less from polygenic scores and their potential applications. Although there is emerging evidence for a shared genetic architecture of schizophrenia across diverse populations (Lam et al., 2019), the incomplete transferability of GWAS findings is driven by three fundamental factors: 1) differences in allele frequencies across populations, as GWAS are powered only for common variants of small or modest effect-size; 2) differences in linkage disequilibrium patterns that affect how well causal variants are tagged by genotype arrays and imputation reference panels; 3) population genetic factors such as genetic drift, polygenic selection, and demographic history (Duncan et al., 2019; A. R. Martin et al., 2017, 2019). To improve GWAS findings’ generalizability, encouraging ancestral diversity in study samples has taken high priority in the field of psychiatric genetics (Mills & Rahal, 2020).

Despite a surge of East-Asian samples in the latest schizophrenia GWAS (Schizophrenia Working Group of the Psychiatric Genomics Consortium et al., 2020), a representative Vietnamese cohort is still lacking. To close this gap, we conducted a case-control pilot study in Hanoi, Vietnam, in which sociodemographic-, genetic-, and clinical data from a total of 378 participants were collected. Here we provide a first report on data collection’s feasibility while providing insights into clinical data and current treatment of patients with schizophrenia recruited in Hanoi, Vietnam. Moreover, we aim to facilitate genetic discoveries by contributing to global research endeavors to diversify genetic samples and clinical data in schizophrenia research. Ultimately, this pilot study sets the foundation for the recruitment of a sizeable Vietnamese well characterized cohort that includes state-of-the-art phenotyping procedures in the field of psychiatric genetics.

2 Methods

2.1 Local Recruitment and Data Collection

For this pilot study, 190 patients with schizophrenia and 188 control subjects, all of Vietnamese Kinh ancestry, were recruited between August and December 2017. All participants were at least 18 years old and provided full written informed consent upon study enrollment. Before data collection, study procedures were reviewed and approved by the institutional ethical review board of the Hanoi Medical University. All in-patients were recruited at teaching hospitals of the Department of Psychiatry at Hanoi Medical University.

Additionally, outpatients were recruited from associated community medical centers around the Hanoi metropolitan area. In outpatient facilities, data collection was supervised by senior psychiatrists of the staff at the Department of Psychiatry, Hanoi Medical University. Diagnosis of schizophrenia according to ICD-10 criteria (F20.X) was ascertained by clinicians with at least three years of psychiatric experience and confirmed by a senior attending psychiatrist. Current symptom severity was assessed with the Vietnamese Translation of the Positive and Negative Syndrome Scale (Kay et al., 1987), which encompasses the dimensions of positive, negative, and general psychopathology. Each patient was interviewed and rated independently by two clinicians who received prior training in conducting the semi-structured interview. If no consensus was reached, ratings were discussed with a senior psychiatrist. Medical and demographic parameters, including medication, duration of illness, and age at onset, were recorded during routine examinations and/or extracted from the patients’ medical chart. Control subjects were recruited from the staff at the Hanoi Medical University across different occupational groups. To obtain genetic information, saliva samples were collected from each subject using OraGene-510 DNA-Self-Collection Kits (Genotek, Ottawa, Ontario, Canada) according to the manufacturer’s instructions.

2.2 Genetic Data

2.2.1 Genome-wide Genotyping

Genetic material was extracted from saliva following established standard protocols. Genome-wide genotyping of DNA samples was performed on Infinitum Global Screening array-23 (GSA) MD BeadChip (Illumina, San Diego, CA) at ERASMUS Medical Center, Human Genotyping Facility (HuGe-F), Rotterdam, Netherlands. SNP-level data was analyzed using a central pipeline ‘RICOPILI’ (Lam et al., 2020) on the LISA computer cluster.

2.2.2 Quality Control

According to the Psychiatric Genomics Consortium (PGC) standards, quality control was performed on the newly recruited cohort. The default quality control parameters for retaining subjects and SNPs were: SNP missingness < 0.05 (before sample removal); subject missingness < 0.02; autosomal heterozygosity deviation (| Fhet | < 0.2); SNP missingness < 0.02 (after sample removal); difference in SNP missingness between cases and controls < 0.02; and SNP Hardy-Weinberg equilibrium (P > 10−6 in controls or P > 10−10 in cases). Seven individuals who did not meet QC criteria were excluded resulting in a data set of 370 individuals (184 cases and 186 controls) and 367,920 SNPs. Post-QC Manhattan and QQ plots did not show unexpected genome-wide significant association or general inflation.

2.2.3 Genotype Imputation

Genotype imputation was performed using the pre-phasing/imputation stepwise approach implemented in EAGLE/MINIMAC3 (with a variable chunk size of 132 genomic chunks and default parameters). The imputation reference set consisted of 54,330 phased haplotypes with 36,678,882 variants from the publicly available HRC reference release 1.1 (McCarthy et al., 2016).

2.2.4 Population Stratification and Relatedness Testing

After imputation, we identified SNPs with very high imputation quality (INFO >0.8) and low missingness (<1%) for further quality control. After linkage disequilibrium pruning (r2 > 0.02) and frequency filtering (MAF > 0.05), there were 116,203 autosomal SNPs in the data set. This SNP set was used for robust relatedness testing and population structure analysis using PLINK (Purcell et al., 2007). Thirteen pairs of subjects with identity-by-descent PI-HAT value of > 0.2 were identified. One member of each pair was removed at random after preferentially retaining cases over controls. Principal component estimation was done with the same collection of autosomal SNPs, and fifteen ancestry outliers were removed from the data set (see supplementary figures S2-S3). Supplementary figure S3 (A) shows that Vietnamese individuals cluster within other Asian populations from the 1000 Genomes reference population (Auton et al., 2015). Figure S3 (B) illustrates that the Vietnamese Kinh population is distinctly stratified from China and Japan when plotted only against Asian populations. We tested the first 20 principal components for phenotype association (using logistic regression with study indicator variables included as covariates). Seven principal components, namely PC1, PC2, PC3, PC4, PC5, PC8, and PC14, were significantly associated (P<0.05) with SCZ status, and these were added to the GWAS model as covariates in order to control for population stratification. The final analysis consisted of 343 individuals (174 cases and 169 controls) and 8,332,179 SNPs. We tested all SNPs for association under an additive logistic regression model using PLINK and the derived principal components as covariates.

2.2.5 Polygenic Risk Scoring

We obtained summary statistics from the latest PGC schizophrenia GWAS (Schizophrenia Working Group of the Psychiatric Genomics Consortium et al., 2020) to calculate PRS for individuals in our target data set. Polygenic scoring was performed using (a) East-Asian (EAS) specific meta-analysis (14,004 cases and 16,757 controls), (b) European (EUR) specific meta-analysis (53,386 cases and 77,258 controls) and (c) trans-ancestry (MIX) meta-analysis (67,390 cases and 94,015 controls) as training statistics. Genome-wide summary statistics were LD pruned and “clumped” by discarding variants within 500 kb of, and in r2 ≥ 0.1 with another (more significant) marker, resulting in 61 805 (EAS), 90,713, (EUR) 87,671 variants (MIX), respectively. Clumping for the East-Asian ancestry was conducted with the LD information of the East-Asian subset of the HRC reference panel. The EUR and MIX training data were clumped using LD information from the entire HRC reference panel. This set of LD-independent SNPs was used to construct a PRS at a range of P-value thresholds (5×10−8, 1×10−6, 1×10−4, 0.001, 0.01, 0.05, 0.1, 0.2, 0.5, 1.0). For a given sample, we take each LD-independent SNP and multiply the marginal SNP effect from the logistic regression by the number of risk alleles. The resulting products were summed over all variants for each individual so that a single score reflecting the overall polygenic burden for schizophrenia is obtained.

2.2.6 Consistency of Directions of Effects

In an additional final step, we assessed the robustness of genetic associations in the Vietnamese pilot sample by performing a replication analysis using the PGC trans-ancestry discovery sample as a discovery data set. We used a binomial sign test to establish that SNP associations replicated en masse in an independent PGC sample at various p-value thresholds by showing that the number of concordant effect directions for index SNPs was significantly greater than the null hypothesis of randomly oriented effects. The Vietnam summary statistics were then combined with those from the PGC primary GWAS dataset using an inverse variance-weighted fixed effects model and overlapping LD-clumps were merged into distinct genomic regions.

2.3 Polygenic Association Testing

Individual-level PRS at each p-value threshold (Pd) were tested for association with SCZ status (Schizophrenia vs. No Schizophrenia) in logistic regression models. To adjust for fine-grained, population-stratification principal components (PCs) 1, 2, 3, 4, 5, 8, and 14 (selected for being significantly associated with SCZ status) were included as covariates in each model. Nagelkerke’s R2, as a measure of “variance explained” for logistic regression, was determined by comparing the full model (with PRS and PCs) against the null model containing only PCs. We report Nagelkerke’s R2 on the liability scale (Lee et al., 2012), which corrects for a higher case-control-ratio in the target dataset relative to the general population’s prevalence estimated 0.7% (McGrath et al., 2008). Using LD-clumped genome-wide summary statistics without any p-value filter, individuals were grouped into five quintiles based on their individual polygenic burden (SCZ-PRS). Odds ratios for schizophrenia diagnosis were obtained by comparing individuals with low polygenic risk (bottom 20%, or bottom quintile) to individuals in strata with increased polygenic loadings for SCZ (second to the fifth quintile). Each step was repeated for three different training data sets: East-Asian GWAS (EAS), European GWAS (EUR), and mixed-ancestry GWAS (PGC3 SCZ-GWAS).

3 Results

3.1 Sample Characteristics

A total of 378 participants were included in the study, of which 190 are cases diagnosed with schizophrenia and 188 are control subjects. Sociodemographic parameters of all case and control subjects are shown in Table 1. All clinical parameters of the case sample are presented in Table 2.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1.

Sociodemographic characteristics of the health control and patient sample

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2.

Clinical parameters

3.2 Polygenic Risk Profiling

Next, we assessed how much variance in schizophrenia risk could be attributed to underlying genetic variation. After quality control (see Supplementary material), 343 individuals remained for polygenic risk score analysis. Schizophrenia polygenic risk scores (SCZ-PRS) were generated with GWAS summary statistics from East-Asian (EAS), European (EUR), and mixed-ancestry populations as training sets. In line with previous findings, individual-level SCZ-PRS were significantly associated with schizophrenia case status (Fig 1, Supplementary Table S1). However, the predictive accuracy of SCZ-PRS among Vietnamese participants varied with the ancestral composition of the training data sets. The strongest effect sizes were observed when PRS were derived from trans-ancestry summary statistics, which explained ∼4.9 % of the variance in schizophrenia liability (p = 6.83 x 10−8 at Pd < 1). Likewise, PRS constructed on East-Asian GWAS summary statistics (PGC SCZ wave 3) accounted for ∼4.5 % of the variability in schizophrenia risk (p = 2.73 x 10−7 at Pd < 1). PRS trained on European ancestry GWAS results also predicted SCZ status in the Vietnamese target sample with an explained variance of only ∼3.9 % (p = 1.79 x 10−6 at Pd < 0.01). As expected, best overall predictions were observed at more inclusive p-value thresholds that capture a larger proportion of the polygenic signal by including more independent common variants below the genome-wide significance level.

Fig 1.
  • Download figure
  • Open in new tab
Fig 1.

Associations of SCZ-PRS with schizophrenia in a cohort of 169 controls and 174 cases of Vietnamese Kinh ancestry. A) The variance explained (R2 liability) by SCZ-PRS at different p-value thresholds (Pd; x-axis) is shown on the y-axis. P-values are displayed on top of each bar: 1* < 0.05, 2* < 0.01, 3* < 0.005, 4* <0.001, 5* < 1×10−4. SCZ-PRS were constructed on different GWAS discovery samples: East-Asian GWAS (left panel), European-ancestry GWAS (center panel), and trans-ancestry SCZ3-GWAS (right panel). B) Distribution of z-transformed polygenic risk scores among cases and controls at Pd < 1. C) odds ratios for schizophrenia per SCZ-PRS (Pd < 1) quintile relative to the first quintile. Error bars represent 95% Confidence Intervals.

A similar pattern can be observed when inspecting odds ratios for the association between SCZ-PRS and schizophrenia diagnosis (Fig 1c, Supplementary Table S3). Overall, individuals who carry a higher polygenic burden are at higher odds for having a schizophrenia diagnosis, ORs range from 1.97 (95% CI, 0.96 - 4.12) at the second PRS-quintile up to 5.74 (95% CI, 2.70 - 12.69) at the fifth PRS-quintile, using the first quintile as the reference group and mixed-ancestry training populations for polygenic scoring. When considering scores obtained from east-Asian ancestry GWAS, odds ratios for SCZ are slightly lower, ranging from 1.50 (95% CI, 0.72 - 3.14) at the second PRS-quintile up to 4.47 (95% CI, 2.13 – 9.66) at the top PRS-quintile. By comparison, observed ORs are markedly lower for EUR-derived SCZ-PRS; the OR at the fifth PRS quintile is 3.39 (95% CI, 1.63 – 7.26) relative to the first PRS quintile.

Overall, reported predictive values of PRS and odds ratios at the top PRS-quintile in our Vietnamese sample are consistent with those of other East-Asian cohorts reported within the latest SCZ GWAS meta-analysis (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2020).

3.3 Replication Analysis

Complementing PRS association testing, we explored if SNP associations at varying p-value thresholds (Pd) in the mixed-ancestry SCZ3-GWAS replicate in the present Vietnamese-ancestry sample (replication data set). We observed that the fraction of SNPs with directions of effect congruent to the replication data set was consistently greater than 50%. Specifically, of the 182 index SNPs that attained genome-wide significance in the SCZ3-GWAS meta-analysis and that were also available in the Vietnamese cohort, 110 SNPs (60%) showed the same direction of effect, which was significantly greater than expected by chance (binomial p-value = 0.003). Results of the replication analysis and binomial sign test across different p-value thresholds are reported in the supplementary material (Supplementary Table S3). When meta-analyzed with SCZ3-GWAS, 249 loci reach genome-wide significance, in contrast to 244 loci identified by SCZ3-GWAS alone, indicating that by adding this or a potentially larger Vietnamese cohort could help to discover novel SCZ-associated loci in future GWAS meta-analysis.

4 Discussion

With this pilot study, the feasibility of a large-scale data collection of patients diagnosed with schizophrenia in Vietnam has been shown for the first time. We were able to implement a successful recruitment scheme at the Department of Psychiatry of the Hanoi Medical University, with the National Institute of Mental Health staff at Bach Mai Hospital and the Hanoi Psychiatric Hospital participating in data collection. This study established an efficient pipeline for data generation and subsequent analyses that will be upscaled with future collaborative endeavors. With these central foundations set in place, we aim to continue and intensify our data collection efforts and establish more in-depth phenotyping going forward.

There are a few practical and clinical considerations concerning the recruitment of patients with schizophrenia in Vietnam. First, patients were primarily recruited through in-patient facilities. For genetic research, that may prove to be an advantage since the clinical population in Vietnam comprises mostly cases with severe illness courses of schizophrenia, long duration of illness, multiple prior hospitalizations, and a high symptom load, especially for negative and residual symptoms (Maramis et al., 2011). This pattern can be considered representative of inpatients often treated for long durations for chronic symptoms of schizophrenia in Vietnam, since individuals with a less severe course of illness typically won’t seek or receive hospital treatment due to stigma or a lack of illness recognition for less severe illness courses (Martensen et al., 2020; Nguyen et al., 2019; Ta et al., 2016, 2018). This pattern is reflected by the fact that even today, more than 50% of all patients treated in psychiatric hospitals in Vietnam have a primary psychiatric diagnosis of schizophrenia (Maramis et al., 2011; Martensen et al., 2018, 2020). In our pilot sample, most of the patients were hospitalised several times due to schizophrenia. Nearly all patients, including the outpatients, still exhibited chronic, residual, or negative symptoms when assessed by the PANSS at the time of recruitment. To increase clinical heterogeneity among patients, we will integrate more outpatient facilities in the Hanoi metropolitan area as additional study sites in future studies.

In the PRS and replication analysis, we corroborated some key findings of previous publications. First, we observed that PRS derived from trans-ancestry GWAS perform better than EAS-derived PRS, and both predict schizophrenia more accurately than EUR-derived PRS. Similarly, leave-one-out PRS analysis in the latest SCZ GWAS meta-analysis demonstrated that PRS in Asian cohorts explained consistently more variance in schizophrenia risk when mixed-ancestry summary statistics were used for polygenic scoring compared to Asian training populations (Schizophrenia Working Group of the Psychiatric Genomics Consortium et al., 2020). Consistent with this finding, Lam et al. (2019) demonstrated that EUR-derived PRS accounted for less variance in schizophrenia liability than EAS-derived PRS. Second, we demonstrated that risk-conferring SNPs in the present Vietnamese samples largely showed same-direction effects compared to SCZ PGC3 summary statistics. The inclusion of our sample, especially after scaling-up the sample size, will probably uncover novel susceptibility loci, especially since an earlier meta-analysis of East-Asian (Lam et al., 2019) and Latino and admixed African samples (Bigdeli et al., 2020) strongly support this notion. Due to practical limitations, we focused on Kinh ancestry individuals, representing 87% of the Vietnamese population. Given the genetic diversity among Vietnamese populations (Macholdt et al., 2020), our genotyping initiative might be extended to other ethnic groups living in Vietnam and bordering geographical areas in South-East-Asia. Thus, the addition of currently understudied populations such as Vietnamese ethnicities will likely help refine our understanding of the genetic architecture of schizophrenia.

Ancestral diversity among study participants in discovery GWAS has a substantial impact on prediction accuracy of polygenic scores and their applicability across populations. Further inclusion of East- and South-East-Asian cohorts will likely improve the polygenic risk prediction in Vietnamese individuals and vice versa, more than by a comparable addition of European ancestry samples. This is increasingly important as PRS are beginning to find their way into clinical practice outside of psychiatry. Beyond disease risk prediction, the clinical utility of PRS may range from genotype-informed screening methods and stratification to tailored intervention strategies (Torkamani et al., 2018). To translate genome-wide data into clinically actionable information, in-depth clinical and valid phenotypic information must be collected along with genetic data. While these pilot data are too small to perform meaningful secondary analyses, e.g., on symptom profiles, such cohorts will play a pivotal role in making potential PRS-based applications available for a wider population.

5 Conclusion and Outlook

This pilot research project’s main objective was to establish key research and clinical infrastructures for a larger cross-sectional Vietnamese cohort of patients with schizophrenia and healthy controls. In the next phase, we will extend our data collection efforts by introducing additional study sites in Vietnam, both in northern and southern provinces, as well as a translated and validated comprehensive phenotyping battery. This includes trained assessment, that is harmonised with our current research initiative in Berlin, Germany. This assessment is comprised of an in-depth neuropsychological evaluation and a selection of questionnaires that assess environmental risk factors, a broad range of health-related data, as well as outcome measures such as quality of life and social functioning. With this prospective cohort, we intend to build a valuable resource for trans-ancestry GWAS and secondary analyses on cross-cultural gene-environment interactions to elucidate potential etiological trajectories associated with an increased liability to Schizophrenia. In doing so, we aim to contribute to a worldwide endeavor to enhance well-powered and ancestrally diverse GWAS samples and explore trans-ancestry and culture specific gene-environment interactions in psychiatry.

Data Availability

Individual-level genotypes are available from the corresponding author upon reasonable request.

Disclosure of interest

None of the authors involved in this manuscript report conflicts of interest.

Acknowledgements

We thank the Schizophrenia Working Group of the Psychiatric Genomics Consortium for granting permission to use European, East-Asian and trans-ancestry GWAS summary statistics (wave 3) for this study. The Stanley Center for Psychiatric Research financially supported the collection of this sample. The statistical analysis was carried out on the Dutch LISA computer cluster powered by SURFsara. We further thank the participants and additional staff of Hanoi Psychiatric Hospital, Hanoi Medical University, and the National Institute of Mental Health, Vietnam, for taking part in this study.

References

  1. ↵
    Auton, A., Abecasis, G. R., Altshuler, D. M., Durbin, R. M., Abecasis, G. R., Bentley, D. R., Chakravarti, A., Clark, A. G., Donnelly, P., Eichler, E. E., Flicek, P., Gabriel, S. B., Gibbs, R. A., Green, E. D., Hurles, M. E., Knoppers, B. M., Korbel, J. O., Lander, E. S., Lee, C., … National Eye Institute, N. (2015). A global reference for human genetic variation. Nature, 526(7571), 68–74. https://doi.org/10.1038/nature15393
    OpenUrlCrossRefPubMed
  2. ↵
    Bigdeli, T. B., Genovese, G., Georgakopoulos, P., Meyers, J. L., Peterson, R. E., Iyegbe, C. O., Medeiros, H., Valderrama, J., Achtyes, E. D., Kotov, R., Stahl, E. A., Abbott, C., Azevedo, M. H., Belliveau, R. A., Bevilacqua, E., Bromet, E. J., Byerley, W., Carvalho, C. B., Chapman, S. B., … Genomic Psychiatry Cohort (GPC) Consortium. (2020). Contributions of common genetic variants to risk of schizophrenia among individuals of African and Latino ancestry. Molecular Psychiatry, 25(10), 2455–2467. https://doi.org/10.1038/s41380-019-0517-y
    OpenUrl
  3. ↵
    Charlson, F. J., Ferrari, A. J., Santomauro, D. F., Diminic, S., Stockings, E., Scott, J. G., McGrath, J. J., & Whiteford, H. A. (2018). Global Epidemiology and Burden of Schizophrenia: Findings From the Global Burden of Disease Study 2016. Schizophrenia Bulletin, 44(6), 1195–1203. https://doi.org/10.1093/schbul/sby058
    OpenUrlCrossRefPubMed
  4. ↵
    Cloutier, M., Aigbogun, M. S., Guerin, A., Nitulescu, R., Ramanakumar, A. V., Kamat, S. A., DeLucia, M., Duffy, R., Legacy, S. N., Henderson, C., Francois, C., & Wu, E. (2016). The Economic Burden of Schizophrenia in the United States in 2013. The Journal of Clinical Psychiatry, 77(6), 764–771. PubMed. https://doi.org/10.4088/jcp.15m10278
    OpenUrlCrossRef
  5. ↵
    Duncan, L., Shen, H., Gelaye, B., Meijsen, J., Ressler, K., Feldman, M., Peterson, R., & Domingue, B. (2019). Analysis of polygenic risk score usage and performance in diverse human populations. Nature Communications, 10(1), 3328. https://doi.org/10.1038/s41467-019-11112-0
    OpenUrl
  6. ↵
    GBD 2016 Disease and Injury Incidence and Prevalence Collaborators. (2017). Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990-2016: A systematic analysis for the
  7. ↵
    Global Burden of Disease Study 2016. Lancet (London, England), 390(10100), 1211–1259. PubMed. https://doi.org/10.1016/S0140-6736(17)32154-2
    OpenUrl
  8. ↵
    International Schizophrenia Consortium, Purcell, S. M., Wray, N. R., Stone, J. L., Visscher P.M., O’Donovan, M. C., Sullivan, P. F., & Sklar, P. (2009). Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature, 460(7256), 748–752. PubMed. https://doi.org/10.1038/nature08185
    OpenUrlCrossRefPubMedWeb of Science
  9. ↵
    Kay, S. R., Fiszbein, A., & Opler, L. A. (1987). The Positive and Negative Syndrome Scale (PANSS) for Schizophrenia. Schizophrenia Bulletin, 13(2), 261–276. https://doi.org/10.1093/schbul/13.2.261
    OpenUrlCrossRefPubMedWeb of Science
  10. ↵
    Lam, M., Awasthi, S., Watson, H. J., Goldstein, J., Panagiotaropoulou, G., Trubetskoy, V., Karlsson, R., Frei, O., Fan, C.-C., De Witte, W., Mota, N. R., Mullins, N., Brügger, K., Lee, S. H., Wray, N. R., Skarabis, N., Huang, H., Neale, B., Daly, M. J., … Ripke, S. (2020). RICOPILI: Rapid Imputation for COnsortias PIpeLIne. Bioinformatics, 36(3), 930–933. https://doi.org/10.1093/bioinformatics/btz633
    OpenUrlPubMed
  11. ↵
    Lam, M., Chen, C.-Y., Li, Z., Martin, A. R., Bryois, J., Ma, X., Gaspar, H., Ikeda, M., Benyamin, B., Brown, B. C., Liu, R., Zhou, W., Guan, L., Kamatani, Y., Kim, S.-W., Kubo, M., Kusumawardhani, A. A. A. A., Liu, C.-M., Ma, H., … Huang, H. (2019). Comparative genetic architectures of schizophrenia in East Asian and European populations. Nature Genetics, 51(12), 1670–1678. PubMed. https://doi.org/10.1038/s41588-019-0512-x
    OpenUrl
  12. ↵
    Lee, S. H., Goddard, M. E., Wray, N. R., & Visscher, P. M. (2012). A Better Coefficient of Determination for Genetic Profile Analysis. Genetic Epidemiology, 36(3), 214–224. https://doi.org/10.1002/gepi.21614
    OpenUrlCrossRefPubMed
  13. ↵
    Levinson, D. F., Mostafavi, S., Milaneschi, Y., Rivera, M., Ripke, S., Wray, N. R., & Sullivan, P. F. (2014). Genetic studies of major depressive disorder: Why are there no genome-wide association study findings and what can we do about it? Biological Psychiatry, 76(7), 510–512. PubMed. https://doi.org/10.1016/j.biopsych.2014.07.029
    OpenUrlCrossRefPubMed
  14. ↵
    Macholdt, E., Arias, L., Duong, N. T., Ton, N. D., Van Phong, N., Schröder, R., Pakendorf, B., Van Hai, N., & Stoneking, M. (2020). The paternal and maternal genetic history of Vietnamese populations. European Journal of Human Genetics, 28(5), 636–645. https://doi.org/10.1038/s41431-019-0557-4
    OpenUrl
  15. ↵
    Maramis, A., Van Tuan, N., & Minas, H. (2011). Mental health in southeast Asia. The Lancet, 377(9767), 700–702. https://doi.org/10.1016/S0140-6736(10)62181-2
    OpenUrl
  16. ↵
    Martensen, L. K., Hahn, E., Cao, T. D., Schomerus, G., Nguyen, M. H., Böge, K., Nguyen, T. D., Mungee, A., Dettling, M., Angermeyer, M. C., & Ta, T. M. T. (2018). Impact of perceived course of illness on the desire for social distance towards people with symptoms of schizophrenia in Hanoi, Vietnam. Psychiatry Research, 268, 206–210. https://doi.org/10.1016/j.psychres.2018.05.046
    OpenUrl
  17. ↵
    Martensen, L. K., Hahn, E., Duc, C. T., Schomerus, G., Böge, K., Dettling, M., Angermeyer, M. C., Nguyene, V. T., & Ta, T. M. T. (2020). Impact and differences of illness course perception on the desire for social distance towards people with symptoms of depression or schizophrenia in Hanoi, Vietnam. Asian Journal of Psychiatry, 50, 101973. https://doi.org/10.1016/j.ajp.2020.101973
    OpenUrl
  18. ↵
    Martin, A. R., Gignoux, C. R., Walters, R. K., Wojcik, G. L., Neale, B. M., Gravel, S., Daly, M. J., Bustamante, C. D., & Kenny, E. E. (2017). Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. American Journal of Human Genetics, 100(4), 635–649. PubMed. https://doi.org/10.1016/j.ajhg.2017.03.004
    OpenUrlCrossRefPubMed
  19. ↵
    Martin, A. R., Kanai, M., Kamatani, Y., Okada, Y., Neale, B. M., & Daly, M. J. (2019). Clinical use of current polygenic risk scores may exacerbate health disparities. Nature Genetics, 51(4), 584–591. https://doi.org/10.1038/s41588-019-0379-x
    OpenUrlCrossRefPubMed
  20. ↵
    Martin, J., Tilling, K., Hubbard, L., Stergiakouli, E., Thapar, A., Davey Smith, G., O’Donovan, M. C., & Zammit, S. (2016). Association of Genetic Risk for Schizophrenia With Nonparticipation Over Time in a Population-Based Cohort Study. American Journal of Epidemiology, 183(12), 1149–1158. PubMed. https://doi.org/10.1093/aje/kww009
    OpenUrlCrossRefPubMed
  21. ↵
    McCarthy, S., Das, S., Kretzschmar, W., Delaneau, O., Wood, A. R., Teumer, A., Kang, H. M., Fuchsberger, C., Danecek, P., Sharp, K., Luo, Y., Sidore, C., Kwong, A., Timpson, N., Koskinen, S., Vrieze, S., Scott, L. J., Zhang, H., Mahajan, A., … Haplotype Reference Consortium. (2016). A reference panel of 64,976 haplotypes for genotype imputation. Nature Genetics, 48(10), 1279–1283. PubMed. https://doi.org/10.1038/ng.3643
    OpenUrlCrossRefPubMed
  22. ↵
    McGrath, J., Saha, S., Chant, D., & Welham, J. (2008). Schizophrenia: A Concise Overview of Incidence, Prevalence, and Mortality. Epidemiologic Reviews, 30(1), 67–76. https://doi.org/10.1093/epirev/mxn001
    OpenUrlCrossRefPubMedWeb of Science
  23. ↵
    Mills, M. C., & Rahal, C. (2020). The GWAS Diversity Monitor tracks diversity by disease in real time. Nature Genetics, 52(3), 242–243. https://doi.org/10.1038/s41588-020-0580-y
    OpenUrl
  24. ↵
    Nguyen, T., Tran, T., Tran, H., Tran, T., & Fisher, J. (2019). The burden of clinically significant symptoms of common and severe mental disorders among adults in Vietnam: A population-based cross-sectional survey. BMC Public Health, 19(1), 1173. https://doi.org/10.1186/s12889-019-7513-7
    OpenUrl
  25. ↵
    Schizophrenia Working Group of the Psychiatric Genomics Consortium. (2014). Biological insights from 108 schizophrenia-associated genetic loci. Nature, 511(7510), 421– 427. https://doi.org/10.1038/nature13595
    OpenUrl
  26. ↵
    Schizophrenia Working Group of the Psychiatric Genomics Consortium, Ripke, S., Walters, J. T., & O’Donovan, M. C. (2020). Mapping genomic loci prioritises genes and implicates synaptic biology in schizophrenia. MedRxiv, 2020.09.12.20192922. https://doi.org/10.1101/2020.09.12.20192922
  27. Ta, T. M. T., Böge, K., Cao, T. D., Schomerus, G., Nguyen, T. D., Dettling, M., Mungee, A., Martensen, L. K., Diefenbacher, A., Angermeyer, M. C., & Hahn, E. (2018). Public attitudes towards psychiatrists in the metropolitan area of Hanoi, Vietnam. Asian Journal of Psychiatry, 32, 44–49. PubMed. https://doi.org/10.1016/j.ajp.2017.11.031
    OpenUrl
  28. ↵
    Ta, T. M. T., Zieger, A., Schomerus, G., Cao, T. D., Dettling, M., Do, X. T., Mungee, A., Diefenbacher, A., Angermeyer, M. C., & Hahn, E. (2016). Influence of urbanity on perception of mental illness stigma: A population based study in urban and rural Hanoi, Vietnam. International Journal of Social Psychiatry, 62(8), 685–695. https://doi.org/10.1177/0020764016670430
    OpenUrlCrossRefPubMed
  29. ↵
    Torkamani, A., Wineinger, N. E., & Topol, E. J. (2018). The personal and clinical utility of polygenic risk scores. Nature Reviews Genetics, 19(9), 581–590. https://doi.org/10.1038/s41576-018-0018-x
    OpenUrlCrossRefPubMed
  30. ↵
    Walker, E. R., McGee, R. E., & Druss, B. G. (2015). Mortality in Mental Disorders and Global Disease Burden Implications: A Systematic Review and Meta-analysis. JAMA Psychiatry, 72(4), 334–341. https://doi.org/10.1001/jamapsychiatry.2014.2502
    OpenUrl
Back to top
PreviousNext
Posted April 27, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Increasing Sample Diversity in Psychiatric Genetics – Introducing a new Cohort of Patients with Schizophrenia and Controls from Vietnam – Results from a Pilot Study
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Increasing Sample Diversity in Psychiatric Genetics – Introducing a new Cohort of Patients with Schizophrenia and Controls from Vietnam – Results from a Pilot Study
VT Nguyen, A Braun, J Kraft, TMT Ta, GM Panagiotaropoulou, VP Nguyen, TH Nguyen, V Trubetskoy, CT Le, TTH Le, XT Pham, I Heuser-Collier, NH Lam, K Böge, IM Hahne, M Bajbouj, MM Zierhut, E Hahn, S Ripke
medRxiv 2021.04.21.21255615; doi: https://doi.org/10.1101/2021.04.21.21255615
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Increasing Sample Diversity in Psychiatric Genetics – Introducing a new Cohort of Patients with Schizophrenia and Controls from Vietnam – Results from a Pilot Study
VT Nguyen, A Braun, J Kraft, TMT Ta, GM Panagiotaropoulou, VP Nguyen, TH Nguyen, V Trubetskoy, CT Le, TTH Le, XT Pham, I Heuser-Collier, NH Lam, K Böge, IM Hahne, M Bajbouj, MM Zierhut, E Hahn, S Ripke
medRxiv 2021.04.21.21255615; doi: https://doi.org/10.1101/2021.04.21.21255615

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)