The relationship of maternal gestational mass spectrometry-derived metabolites with offspring congenital heart disease: results from multivariable and Mendelian randomization analyses ======================================================================================================================================================================================= * Kurt Taylor * Nancy McBride * Jian Zhao * Sam Oddie * Rafaq Azad * John Wright * Ole A. Andreassen * Isobel D Stewart * Claudia Langenberg * Maria Magnus * Maria Carolina Borges * Massimo Caputo * Deborah A Lawlor ## Abstract **Background** It is plausible that maternal pregnancy metabolism influences risk of offspring congenital heart disease (CHD). We sought to explore this through a systematic approach using different methods and data. **Methods** We undertook multivariable logistic regression of the odds of CHD for 923 Mass Spectrometry (MS)-derived metabolites in a sub-sample of a UK birth cohort (Born in Bradford (BiB); N = 2,605, 46 CHD cases). We considered metabolites reaching a p-value threshold <0.05 to be suggestively associated with CHD. We sought validation of our findings, by repeating the multivariable regression analysis within the BiB cohort for any metabolite that was measured by nuclear magnetic resonance (NMR) or clinical chemistry (N = 7,296, 87 CHD cases), and by using genetic risk scores (GRS: weighted genetic risk scores of single nucleotide polymorphisms (SNPs) that were associated with each metabolite) in Mendelian randomization (MR) analyses. MR analyses were performed in BiB and two additional European birth cohorts (N = 38,662, 319 CHD cases). **Results** In the main multivariable analyses, we identified 44 metabolites suggestively associated with CHD, including those from the following super pathways: amino acids, lipids, co-factors and vitamins, xenobiotics, nucleotides, energy, and several unknown molecules. Of these 44, isoleucine and leucine were available in the larger BiB cohort (NMR), and for these the results were validated. MR analyses were possible for 27/44 metabolites and for 11 there was consistency with multivariable regression results. **Conclusions** In summary, we have used complimentary data sources and statistical techniques to construct layers of evidence. We found that amino acid metabolism during pregnancy, several lipids (more specifically androgenic steroids), and levels of succinylcarnitine could be important contributing factors for CHD. Key words * Metabolites * congenital heart disease * Born in Bradford * ALSPAC * MoBa ## Introduction Congenital heart diseases (CHDs) are the most common congenital anomaly affecting approximately 6-8 per 1000 live births and 10% of stillbirths. They are the leading cause of death from congenital anomalies 1. Approximately 20% of CHD cases can be attributed to known chromosomal anomalies, gene disorders or teratogens 2. The causes of the remaining cases are unknown. Identifying causes of CHDs is important for improving aetiological understanding and developing potential targets for intervention. Metabolomics technologies have enabled the quantification of a large number of metabolites in a biological sample. Metabolites are small-molecule intermediates and products of metabolism. The metabolome, the complete set of metabolites in biological tissues/fluids, is influenced by both genotype and environment, and dynamically responds to environmental influences. Analyses of maternal metabolomic profiles could identify causal mechanisms leading to CHDs 3. Because the metabolome reflects interactions of genomic, environmental (e.g., air pollution), behavioural (e.g., smoking) and pathophysiological states (e.g., body composition), examining associations of it with CHDs could help elucidate modifiable upstream risk factors and/or potential molecular targets for intervention to prevent CHDs. Studies have explored maternal molecular markers and found that offspring of women with a compromised vitamin D status (defined as 25-hydroxyvitamin D < 50 nmol/l in comparison to adequate defined as > 75 nmol/l) 4 and lipid profile 5,6 have an increased risk of CHDs. Other work has shown that poor glucose control and diabetes during pregnancy can increase CHD risk 7–9. However, these studies focus on single or few biomarkers. Exploring the wider metabolome could provide opportunities to improve our understanding of the molecular mechanisms that underpin CHDs 3. Previous work has explored metabolomics in maternal serum as a predictor of offspring CHDs and uncovered potentially relevant biological pathways 10. The study found more than 100 metabolites that differed between CHD cases and non-cases concluding that abnormal lipid metabolism was an important feature of CHD pregnancies. Other research has explored potential biomarkers of maternal urine metabolomics with offspring CHDs (N = 70 CHD cases and 70 controls) 11. Their results indicated that short chain fatty acids and aromatic amino acid metabolism may be relevant to CHDs. Replication of these results are warranted. A recent retrospective study in a Chinese population performed metabolomic analyses using maternal amniotic fluid and found that two metabolites (uric acid and proline) were elevated in CHD affected pregnancies 12. In summary, there have been studies uncovering potentially important biological pathways associated with offspring CHDs. However, pregnancy metabolomic studies are still relatively novel with scope for future research to provide new insights and seek replication of previous findings. The aim of this study was to explore associations of the maternal metabolome quantified by an untargeted mass spectrometry (MS) platform and the odds of CHD in the offspring. To address this aim we searched for relevant studies within The LifeCycle Project-EU Child Cohort Network 13 to identify any study with detailed untargeted maternal gestational metabolomic data and offspring CHD information. We identified only one cohort with relevant data in a subgroup: the Born in Bradford (BiB) cohort (N = 2,605 participants; 46 CHD cases) 14,15. Recognising that these novel data were potentially underpowered, we sought internal validation of metabolites suggestively associated with CHD, by repeating the multivariable regression analysis within the BiB cohort for any metabolite that was measured by nuclear magnetic resonance (NMR) or clinical chemistry in larger numbers (N = 7,296, 87 CHD cases). We subsequently searched the MR-PREG consortia studies 16,17 for cohorts with maternal genome-wide data and offspring CHD information that could be used for Mendelian randomization (MR) analyses of associations of genetic instruments for maternal metabolites. We performed pooled MR analyses across three MR-PREG cohorts meeting our criteria (N = 38,663, 319 CHD cases) for any metabolites that were: (i) suggestively associated with CHD in BiB (P<0.05) and (ii) had summary data in the most recent metabolomic genome-wide association study (GWAS). ## Methods ### Study design and participants A schematic overview of the study design is illustrated in **Figure 1**. We excluded children of multiple births because they differ from single births for congenital anomaly outcomes 18,19. For multivariable metabolomic analyses, we used data from the BiB cohort as this was the only cohort that had measures of a substantial number of metabolites reflecting a wide range of metabolic paths assessed during pregnancy and CHD outcomes 15. We also explored internal validation of any findings with a p-value < 0.05 within the BiB study where equivalent (or near equivalent) measures to any on the MS platform markers are available from other sources. BiB is a population-based prospective birth cohort, including 12,453 women across 13,776 pregnancies who were recruited at their oral glucose tolerance test (OGTT) at approximately 26–28 weeks’ gestation 14. Eligible women had an expected delivery between March 2007 and December 2010. The use of a multivariable p-value threshold of <0.05 to take associations forward into further validation analyses is appropriate as an initial screen, for a relatively rare outcome, to avoid missing potential causal effects. ![Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/03/31/2022.02.04.22270425/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2022/03/31/2022.02.04.22270425/F1) Figure 1. An overview of the study design. BiB has pregnancy mass spectrometry derived metabolomics in two separate datasets. Dataset 1 was completed in December 2017 and included 1,000 maternal pregnancy samples. Dataset 2 was completed in December 2018 and consisted of 2,000 maternal pregnancy samples within a case cohort design. The selection of participants into the two MS metabolomic datasets are shown in flowcharts in Figure S1. Abbreviations: CHD, congenital heart disease; BiB, Born in Bradford; NMR, Nuclear Magnetic Resonance; MR, Mendelian Randomization; GWAS, genome-wise association study; ALSPAC, Avon Longitudinal Study of Parents and Children; MoBa, Norwegian Mother, Father and Child Cohort. To be included in MR analyses, studies and participants had to have genome-wide data in mothers and CHD data in the offspring. Three cohorts contributed to MR analyses: BiB, the Avon Longitudinal Study of Parents and Children (ALSPAC) and the Norwegian Mother, Father and Child Cohort Study (MoBa). ALSPAC is a UK prospective birth cohort study which was devised to investigate the environmental and genetic factors of health and development 20–22. Pregnant women resident in Avon, UK with expected dates of delivery 1st April 1991 to 31st December 1992 were invited to take part in the study. The initial number of pregnancies enrolled is 14,541 (for these at least one questionnaire has been returned or a “Children in Focus” clinic had been attended by 19/07/99). Of these initial pregnancies, there was a total of 14,676 fetuses, resulting in 14,062 live births and 13,988 children who were alive at 1 year of age. MoBa is a population-based pregnancy cohort study conducted by the Norwegian Institute of Public Health 23,24. Participants were recruited from all over Norway from 1999-2008. The women consented to participation in 41% of the pregnancies. The cohort includes approximately 114,500 children, 95,200 mothers and 75,200 fathers. The current study is based on 12 of the quality-assured data files released for research in 2019. The establishment of MoBa and initial data collection was based on a license from the Norwegian Data Protection Agency and approval from The Regional Committees for Medical and Health Research Ethics. The MoBa cohort is currently regulated by the Norwegian Health Registry Act. The current study was approved by The Regional Committees for Medical and Health Research Ethics (2018/1256). ### Sample collection and metabolomic profiling in BiB Of the 13,776 pregnancies in the BiB cohort, 11,480 had a fasting blood sample taken during the OGTT (n = 10,574 [92%] between 26–28 weeks’ gestation, with the remaining women being within 11–39 weeks’ gestation). Samples were taken by trained phlebotomists working in the antenatal clinic of the Bradford Royal Infirmary and sent immediately to the hospital laboratory. The metabolomics data in the BiB cohort has previously been described in detail 15. In brief, metabolomics analysis was performed on ethylenediamine tetraacetic acid (EDTA) plasma samples around 26-28 weeks’ gestation. The untargeted MS metabolomics analysis of over 1,000 metabolites was performed at Metabolon, Inc. (Durham, North Carolina, USA). Quality control of the metabolite data was conducted by Metabolon. The classes of metabolites include amino acids, carbohydrates, cofactors and vitamins, energy, lipids, nucleotides, partially characterised molecules, peptides, and xenobiotics. These super-pathways, as defined by Metabolon, are also further subdivided into ∼80 sub-pathways. Metabolite concentrations were quantified using area under the curve of primary MS ions and were expressed as the multiple of the median (MoM) value for all batches processed on the given day. The MoM more closely reflects the biological variation rather than technical variation between samples or analysis platform 25. Due to the timing of funding acquisition, samples were sent to Metabolon in two separate batches. Dataset 1 was completed in December 2017 and included 1,000 maternal pregnancy samples. Dataset 2 was completed in December 2018 and consisted of 2,000 maternal pregnancy samples within a case cohort design. Over-sampled cases were removed to obtain a representative sample. The selection of participants into the two MS metabolomic datasets are shown in flowcharts in Figure S1 and have been described in detail previously 15. ### Confounders In multivariable regression analyses in BiB, we adjusted for the following maternal characteristics based on their known or plausible influence on maternal metabolites and on CHD: age, ethnicity, parity, residential neighbourhood Index of Multiple Deprivation (IMD), body mass index (BMI), smoking, and alcohol consumption. Details of the methods for how confounders were assessed are provided in the Supplementary File 1 (Text S1). ### Congenital heart disease outcomes In BiB, cases were identified from either the Yorkshire and Humber congenital anomaly register database, which will tend to pick up most cases that were diagnosed antenatally and in the early postnatal period of life, or through linkage to primary care (up until aged 5), which will have picked up any additional cases, in particular those that might have been less severe and not identified antenatally/in early life 26. All BiB cases were confirmed postnatally and were assigned ICD-10 codes. We used ICD-10 codes to assign CHD cases according to the European surveillance of congenital anomalies (EUROCAT) guidelines. In the ALSPAC cohort, cases were obtained from a range of data sources, including health record linkage and questionnaire data up until age 25 following European EUROCAT guidelines 27. In MoBa, information on whether a child had a CHD or not (yes/no) was obtained through linkage to the Medical Birth Registry of Norway (MBRN). All maternity units in Norway must notify births to the MBRN, and information on malformations are reported to the registry up to 12 months postpartum 28. Further details on defining CHDs including ICD codes are shown in Text S2 and Table S1 (Supplementary File 1). ### Genetic data The rationale for performing MR analyses was to explore replication using a different method with two additional independent studies and to explore causation. Metabolites are affected by multiple disease processes as well as numerous environmental exposures; therefore, understanding the metabolic pathways implicated in CHD is nontrivial. MR can help discriminate causal from non-causal metabolites because genetic variants are less likely to be confounded by the socioeconomic and environmental factors that might bias causal estimates in conventional multivariable regression 29, but may be biased by a path from the metabolomic genetic score to CHD, for example via horizontal pleiotropy or fetal genotype 30. Consistent results from both increase confidence that the result is causal. #### Genotyping in each cohort ALSPAC mothers were genotyped using Illumina human660K quad single nucleotide polymorphism (SNP) chip, and ALSPAC children were genotyped using Illumina HumanHap550 quad genome-wide SNP genotyping platform. Genotype data for both ALSPAC mothers and children were imputed against the Haplotype Reference Consortium v1.1 reference panel, after performing the QC procedure (minor allele frequency (MAF) ≥ 1%, a call rate ≥ 95%, in Hardy-Weinberg equilibrium (HWE), correct sex assignment, no evidence of cryptic relatedness, and of European descent). The samples of the BiB cohort (mothers and offspring) were processed on three different type of Illumina chips: HumanCoreExome12v1.0, HumanCoreExome12v1.1 and HumanCoreExome24v1.0. Genotype data were imputed against UK10K + 1000 Genomes reference panel, after a similar QC procedure (a call rate ≥ 99.5%, correct sex assignment, no evidence of cryptic relatedness, correct ethnicity assignment). In MoBa, blood samples were obtained from both parents during pregnancy and from mothers and children (umbilical cord) at birth 31. Genotyping has had to rely on several projects - each contributing with resources to genotype subsets of MoBa over the last decade. The data used in the present study was derived from a cohort of genotypes samples from four MoBa batches. The MoBa genetics QC procedure involved MAF ≥ 1%, a call rate ≥ 95%, in HWE, correct sex assignment, and no evidence of cryptic relatedness. Further details of the genotyping methods for each cohort are provided in the Supplementary File 1 (Text S3) including flow charts showing selection of participants (Figure S2). #### GWAS data and SNP selection We aimed to construct weighted GRSs for metabolites that had a p-value <0.05 (referred to throughout as “suggestively associated” with CHDs) in the multivariable regression analyses using BiB data. To do this, we cross-referenced our suggestive associations with large relevant GWAS. We used summary data from two GWAS. In the first, the authors explored the genetic effects of 174 metabolites (compared with the 923 included in our study) 32. To ensure independent associations, SNPs used from the first GWAS were selected at p < 5 × 10−8 and were clumped to ensure independence at linkage disequilibrium (LD) r2 = 0.001 and a distance of 10,000 kb using the TwoSampleMR package 33. In the second (unpublished), the authors performed a GWAS of metabolon metabolite levels using samples from the EPIC-Norfolk 34 and INTERVAL studies 35. 14,296 participants were included in a discovery set (5,841 from EPIC-Norfolk; 8,455 from INTERVAL) and 5,698 from EPIC-Norfolk in a validation set. The authors performed exact conditional analyses to identify independent associations. A total of 913 metabolites were taken forward for their GWAS analysis. #### Genetic risk score generation GRSs were calculated using SNPs previously associated in largescale GWAS with metabolites (described above) by adding up the number of metabolite increasing alleles among the selected SNPs after weighting each SNP by its effect on the corresponding metabolite: ![Formula][1] where w is the weight (i.e., the beta-coefficient of association of the SNP with the exposure from the published GWAS) and SNP is the genotype dosage of exposure-increasing alleles at that locus (i.e., 0, 1, or 2 exposure-raising alleles). After matching metabolites suggestively associated with CHDs at P<0.05 from multivable regression analyses and removing indels, selected SNPs were extracted from the imputed genotype data in dosage format using QCTOOL (v2.0) and VCF tools (v 0.1.12b) in ALSPAC and BiB, respectively. PLINK (v1.9) was then used to construct the GRS for each exposure coded so that an increased score associated with increased levels of metabolite. In MoBa, we constructed the GRSs from the QC’d data in PLINK format. If a SNP was missing, a proxy SNP was used where available based on r2 > 0.8 using the European reference panel in the LDLink R package 36. ### Statistical analysis Analyses were performed in R version 4.0.2 (R Foundation for Statistical Computing, Vienna, Austria). An analysis plan was written and uploaded to the Open Science Framework before analyses commenced, where any subsequent changes to analyses were documented along with the rationale 37. We used scaled imputed data (in which missing data have been imputed and the multiple of median values transformed to standard deviation (SD)-scores) which was log transformed. Any metabolite (in either dataset) where there was too little variation for meaningful analyses (defined as < 440 unique values) was excluded 38. Transformed metabolite values were converted to standard deviation SD units. There were 1,100 and 1,150 quantified metabolites included in dataset 1 and 2, respectively, with 923 of these present in both datasets. #### Multivariable regression (metabolomic) analyses We used logistic regression to estimate odds ratios (ORs) and 95% confidence intervals (CIs) of any CHD per SD higher metabolite, with and without adjustment for confounders. As we are interested in potential causal effects, we present confounder adjusted results throughout. Analyses were done separately in the two BiB datasets and results pooled using fixed-effects meta-analyses. Given that CHD is rare and binary, we accepted an uncorrected P<0.05 (from meta-analyses) for the metabolite being suggestively associated with CHD in the offspring (but requiring further validation). We took these metabolites forward to MR analyses. The MS-platform used in BiB includes measures of xenobiotics which are synthetic chemicals that are not synthesised by humans. Their presence in the circulation usually reflects endogenous exposures, such as medications and supplements. Given that these metabolites would not be present in all participants (and therefore have high missingness), many were removed (86/154 (56%)) from the dataset given the metabolite inclusion criteria of 440 unique observations mentioned above. Therefore, we performed an exploratory additional analysis using xenobiotics (N = 154) as binary variables (1 = yes; metabolite is detected in the sample, 0 = no; metabolite is not detected in the sample). We present adjusted ORs of these binary variables (any presence vs none) with CHDs. We sought to internally validate any of the metabolites suggestively associated with CHDs that were also measured in BiB in larger numbers using different methods. After matching suggestive associations, we used data from the NMR platform (N = 2 metabolites) and did not use any data from the clinical chemistry measurements. More information on the BiB NMR data including methods, QC and participant information has been described in detail previously 15. #### Mendelian randomization analyses We undertook MR in each of the 3 cohorts, including all BiB, ALSPAC, and MoBa participants with maternal genetic data and offspring CHD data. Logistic regression was used to estimate the OR of CHD per SD change in GRS, with adjustment for the first 10 genetic principal components (PCs) with additional adjustment for genetic chip, genetic batch, and imputation batch in MoBa. The key assumptions for MR are: (i) relevance assumption - the genetic instruments are robustly associated with the exposure and relevant to the population being studied (i.e. here pregnant women). We tested the association of the GRS of each metabolite with metabolite levels during pregnancy in BiB dataset 2. (ii) Independence assumption - The IV outcome association is not confounded. Such confounding could occur as a result of population stratification. To minimise this, we adjusted GRS-CHDs associations for the first 10 genetic PCs. We also repeated the MR analyses without the inclusion of BiB, given that BiB has a unique ethnic structure of South Asians and White Europeans. (iii) Exclusion restriction criteria - The genetic variant is not related to the outcome other than via its association with the exposure. We assessed pleiotropy by estimating the variance explained in all metabolites by each of the GRSs by undertaking the linear regression of every metabolite measured in BiB on each GRS. If the variance explained in other metabolites was similar or greater than to that explained in the candidate risk metabolite, this would suggest that there is low metabolite-specificity for the GRS and potential horizontal pleiotropic bias via the other metabolite(s). Importantly, however, this approach of testing GRS specificity does not distinguish between vertical pleiotropy (e.g. the GRS influences the candidate metabolite which is the precursor of another metabolite that affects CHD) and horizontal pleiotropy (e.g. the GRS influences two metabolites that affect CHD independently). We also check consistency of MR results when additionally adjusting for fetal genotype 30. We performed MR analyses separately in BiB, ALSPAC and MoBa and report pooled results from random-effect meta-analyses for all three cohorts and fixed-effect meta-analyses for MR analyses excluding BiB (i.e., ALSPAC and MoBa). ## Results ### Main BiB multivariable regression analyses **Table 1** shows the distributions of characteristics for the women in both BiB datasets. In total, there were 2,605 mother-offspring pairs with 46 CHD cases included in the BiB multivariable regression metabolomic analyses. N.B. for consistency and clarity, we refer to metabolites here by their super-pathways (as defined by Metabolon). A metabolite might have a different super-pathway and chemical group. For example, N-Acetylcarnosine is a metabolite that is part of the amino acid super-pathway, but it is not an amino acid itself. Where available, we include Human Metabolome Database (HMDB) IDs with all numerical results which can assist the reader in finding further information on the structure and function of a metabolite. The super-pathways that included the largest proportions of the 923 metabolites were lipids (38%), unknown (22%), amino acids (18%) and Xenobiotics (8%), with other super-paths having ≤ 3% of the total (**Table 2)**. View this table: [Table 1.](http://medrxiv.org/content/early/2022/03/31/2022.02.04.22270425/T1) Table 1. Participant characteristics for the Born in Bradford metabolomic analyses. View this table: [Table 2.](http://medrxiv.org/content/early/2022/03/31/2022.02.04.22270425/T2) Table 2. Showing the breakdown of metabolites in our dataset (N = 923) into the 10 super-pathways as defined by Metabolon. Of the 923 metabolites quantified in both BiB datasets, 44 (4.8%) were associated with any CHD, at P < 0.05, in confounder adjusted pooled analyses (**Figure 2**). We observed suggestive effects (i.e., confounder adjusted associations reaching the p-value threshold <0.05) with several amino acids, lipids and co-factors and vitamins. There were also suggestive effects for two xenobiotics, one nucleotide, one energy metabolite and some partially characterised and unknown metabolites (**Figure 2**). None of the 22 peptide or 19 carbohydrate-related metabolites associated with CHD at this p-value threshold. Of the 18 lipid-related metabolites associated with CHD, 13 were positively associated (i.e., increased odds) (e.g., Glycolithocholate Sulfate: adjusted odds ratio (aOR) per SD increase in metabolite: 1.73 95% CI (1.21, 2.48)) and 5 were negatively associated (decreased odds) (e.g. Phosphocholine: aOR 0.65 (0.47, 0.90)). All but one (N−Acetylcarnosine) of the 10 amino acid-related metabolites were negatively associated with CHDs (e.g. isoleucine: aOR: 0.67 (0.49, 0.92)). 3 of the 4 co-factors and vitamins were negatively associated, whereas 1 (biliverdin) was positively associated (aOR 1.41 (1.07, 1.86)). The one nucleotide was negatively associated (inosine 5’−Monophosphate (Imp): aOR 0.59 (0.36, 0.99)) and the one energy related metabolite positively associated (succinylcarnitine (C4): aOR 1.42 (1.02, 1.97)). Benzoate and Saccharin were the two xenobiotics associated with CHDs in main analyses both showing positive associations. Results for associations of all metabolites (irrespective of p-value) in unadjusted and confounder adjusted analyses from the pooled datasets, and each dataset separately are provided in Supplementary File 2 (Tables S5-S7), including HMDB IDs where applicable. ![Figure 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/03/31/2022.02.04.22270425/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2022/03/31/2022.02.04.22270425/F2) Figure 2. Pooled confounder adjusted associations of maternal pregnancy metabolites with offspring congenital heart disease in the Born in Bradford cohort (N = 2,391 & N CHD cases = 42) The associations show confounder adjusted odds ratios of CHD per standard deviation change in log-transformed metabolite levels for the 44 (out of 923) metabolites that associated with CHD at p-value <0.05 separated by super pathways as defined by Metabolon. Metabolites were measured at ∼26-28 weeks’ gestation. Heterogeneity statistics and separate associations for datasets 1 and 2 are reported in Supplementary Tables S5-S7. Associations were adjusted for maternal age, ethnicity, parity, Index of Multiple Deprivation, body mass index, smoking and alcohol intake. Abbreviations: PCMs, partially characterised molecules; OR, odds ratio; CHD, congenital heart disease; SD, standard deviation. In the analysis treating xenobiotics as binary variables, after removal of metabolites with no exposed cases, there were 6 xenobiotic metabolites suggestively associated with offspring CHDs (Supplementary File 1: Table S2). 2 out of the 6 showed positive associations: saccharin, which was also associated in main analyses (adjusted odds ratio (aOR) for the presence of metabolite vs not: 2.16 95% CI (1.02, 5.13)) – an artificial sweetener) and salicyluric glucuronide (aOR: 2.27 (1.16, 4.29)) – a metabolite involved in aspirin metabolism). The remaining 4 showing negative associations are all part of the food component/plant metabolite sub pathway (Table S2). ### Internal validation using NMR or clinical chemistry measures of suggestive associations from main multivariable regression analyses It was possible to explore 2 of the 44 metabolites suggestively associated with CHDs in the larger BiB sample. In comparable confounder adjusted analyses, NMR measured amino acids isoleucine and leucine were available on 7,296 mothers, with 87 having an offspring with CHD. Results for these two amino acids were highly consistent between the two samples/assay methods (aOR per SD increase in MS isoleucine 0.67 (0.49, 0.92) vs 0.65 (0.50, 0.84) for NMR isoleucine and aOR per SD increase in MS leucine 0.69 (0.51, 0.94) vs 0.67 (0.53, 0.85) for NMR leucine). ### Validating findings with Mendelian randomization The distributions of offspring and maternal characteristics for MR analyses in BiB, ALSPAC and MoBa are displayed in Table S3 (Supplementary File 1). It was possible to explore MR replication for 27 of the 44 metabolites that associated with CHD in multivariable analyses (the other 17 were either not available in the GWAS or had genetic variants not available in the cohorts, Figure S3). All but 3 of the GRSs (24/27 (89%)) were associated with the corresponding metabolite during pregnancy in BiB (with R2 values ranging from 0.3% to 34%, for the remaining 3 the associations were wide with confidence intervals that included the null (Table S4; Supplementary File 1). Of the 27 GRSs, 3 were specific for the metabolite they were instrumenting (i.e. had the strongest association with it and little evidence of associations with other metabolites; N-acetylcarnosine, phosphocholine and succinylcarnitine). 18 GRSs were associated with the metabolite they were instrumenting and several others that were correlated with that metabolite (e.g. the biliverdin GRS was associated with it and also similarly with other hepatic-related metabolites). 6 GRSs were more strongly associated with other (uncorrelated) metabolites than the one they were instrumenting (scatter plots for all 27 GRSs are shown in Figure S4; Supplementary File 1). The 6 non-specific GRS were for indolelactate, glycolithocholate sulfate, isoleucine, leucine, myo-inositol and taurolithocholate 3-sulfate (MR results for these should be treated with caution and are denoted in **Figure 3B** by white-filled points). ![Figure 3.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/03/31/2022.02.04.22270425/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2022/03/31/2022.02.04.22270425/F3) Figure 3. Showing results comparing the main confounder adjusted associations of maternal metabolites with offspring CHDs (Panel A: N = 2,391 & N CHD cases = 42in the Born in Bradford cohort) to the Mendelian randomization analyses of maternal genetic risk scores and offspring CHDs (Panel B: N = 38,662 & N CHD cases = 319 across 3 cohorts). N.B. results from each analysis are presented on different scales; we are not attempting to quantify estimates in the MR analyses, the aim is to compare the direction of effect. The confounder adjusted associations are as above in Figure 2. The MR analyses are adjusted for the top 10 genetic principal components and genetic batches in MoBa. In Panel B, the metabolite genetic risk scores filled with white appeared to be non-specific for the metabolite we were trying to instrument (i.e. the risk score relates to several other metabolites more strongly than the specific named metabolite).The metabolites filled in black were either metabolite-specific or specific to the metabolite and other correlated metabolites (see scatter plots in Figure S4). The results were pooled using random effects meta-analyses; individual study results and P-values for heterogeneity are shown in Supplementary Table S8. Abbreviations: BiB, Born in Bradford; CHD, congenital heart disease; GRS, genetic risk score; MR, Mendelian randomization; OR, odds ratio; CI, confidence interval. MR analyses replicated and provided causal evidence for a potential protective effect of higher levels of the amino acids leucine, indolelactate and isoleucine on CHD, but for the other amino acids MR results were either very close to the null or in the opposite direction (**Figure 3**). Seven of the lipid-related metabolites that were positively associated in multivariable regression were also replicated in MR analyses (6 of which were highly correlated androgenic steroid metabolites), as was the energy related metabolite succinylcarnitine (**Figure 3**). For the 11 metabolites where we consider the MR GRS analyses providing some evidence of replication and a potential causal effect, 7 of the GRSs were specific for the metabolite alone and/or also for its correlates. Individual study results and P-values for heterogeneity are included in Supplementary File 2 (Table S8). MR results were largely unchanged when excluding BiB from analyses (Table S8) and when adjusting for offspring genotype (Table S9; Supplementary File 2). ## Discussion Maternal metabolism is important for healthy fetal growth and development. To our knowledge no previous study has examined the association of detailed maternal metabolites with risk of CHD within a causal framework. In this novel study we found 44 metabolites (of 923) suggestively associated with CHD. These included metabolites related to amino acids, lipids, co-factors and vitamins, unknown molecules, xenobiotics, nucleotides and energy. In separate xenobiotics analyses, there was some evidence that metabolites related to aspirin and saccharine may increase odds of CHD, whereas metabolites related to plant food components may reduce odds. Two of the amino acids, where it was possible to explore replication, replicated in BiB in larger numbers using an alternative metabolomics platform. In MR analyses, there was directional consistency for 11/27 metabolites that could be explored with this method. We found that maternal amino acid metabolism during pregnancy, several lipids (more specifically androgenic steroids), and levels of succinylcarnitine could be important contributing factors to offspring CHD risk. 9 out of the 10 amino acids suggestively associated with CHDs were negatively associated suggesting that deficiencies in certain amino acids during pregnancy could contribute to offspring CHDs. Previous research found that amino acid concentrations measured in amniotic fluid were lower in patients with CHDs 39, a similar pattern to what we find here. We were able to replicate our findings for isoleucine and leucine in larger numbers in BiB which improves the confidence in the findings. The MR analyses also provided evidence to support the direction of association for these metabolites. However, the GRSs for isoleucine and leucine were non-specific and so these results should be treated with caution. 18 of the 44 maternal metabolites suggestively associated with CHD were part of the lipid super pathway, which is the most common super pathway measured by the Metabolon platform. Previous work reported that an abnormal lipid profile (defined as elevated cholesterol and apolipoprotein B) 5, abnormal lipid metabolism (defined as a disturbance in phosphatidyl-choline and various sphingolipids and choline metabolism) 13 and high maternal blood lipids 6 are a feature of CHD pregnancies. We were able to take forward 15 (out of 18) of the lipid metabolites and replicated the direction of effect for 7. All except 1 of these 7 replicated metabolites were androgenic steroids and so were highly correlated. Steroids are important for numerous functions during gestation, particularly for normal placental function 40. Here we present evidence of a potential causal effect (associated in metabolomic analyses with consistent direction of effect in MR analyses) of positive associations between maternal gestational androgenic steroid metabolites and offspring CHDs. Levels of bilirubin and biliverdin were positively associated with CHDs. MR analyses were only possible for biliverdin but were inconclusive with wide confidence intervals. Thus, these two compounds, which are involved in heme catabolism, should be further investigated for a possible role in CHD development. Levels of succinylcarnitine were also positively associated with CHDs and we found good replications in MR analyses with consistent directions of effect and a GRS that appeared highly specific for succinylcarnitine. Succinylcarnitine is an acylcarnitine which are a group of metabolites responsible for beta oxidation of fatty acids and mitochondrial function 41. It is well documented that fatty acids play an important role in embryonic and fetal development 42,43. We included analyses of partially characterised and unknown metabolites in our results as with increasing evidence from genomic studies, previously unknown metabolites are having their function identified. With future studies identifying the function of some of these unknown/partially characterized metabolites, our results could shed light on the aetiology of CHDs. A key strength of this study is the unique data that we have in BiB to support novel analyses of associations of a wide range of maternal metabolic paths with offspring CHD risk. We were not able to identify any other study with such data. However, we realised, even before analyses, that we would have limited statistical power with just 46 CHD cases. This motivated us to think about ways of trying to replicate any findings in larger samples either through finding measures of the same metabolites available from other assays in larger samples or using GRSs as instruments for the metabolites. In the initial multivariable regression analyses we adjusted for potential confounders. We defined suggestive associations based on a p-value threshold < 0.05, i.e., not taking account of multiple testing. When we apply a Bonferroni corrected threshold (P < 0.0001) none of the associations pass this (Supplementary Table S7; Supplementary File 2). Given the novel nature of this study and use of the initial multivariable regression in BiB to select associations for further follow-up (replication and MR), we felt this was appropriate. As with any ‘screening’ for further analyses we wanted to ensure that we would not miss potential causal effects. We recognise that selecting results based on a p-value threshold is problematic as some associations with higher p-values might have associations of a magnitude that could be clinically important, but there would also be potential for several false positives. Also, we limited MR analyses only to those metabolites that associated with p < 0.05 rather than undertaking these analyses on all of the 923 metabolites. Our reason for this was that having searched for all studies with maternal genome wide data and offspring CHD outcomes we identified only three cohorts and recognised that for MR analyses pooled results from these might also have limited power. The limited power in both multivariable and MR analyses also meant that we could only examine associations with any CHD and not subtypes. MR analyses are sensitive to their assumptions that the GRS is statistically strongly associated with the metabolite in pregnancy. We examined associations of these with pregnancy metabolite levels and are careful in our interpretation of results in relation to this. Methods that are available for exploring potential bias due to horizontal pleiotropy in two-sample MR were not possible here. We know that many of the 923 metabolites will be biologically related to each other and with our sample size it would be difficult to robustly distinguish effects of correlated metabolites. We explored this by examining the strength of association (proportion of variation explained) of each of the 27 GRSs with all other metabolites available in BiB dataset 2. Stronger or similar associations with other metabolites would suggest that the GRS is not a specific instrument for the metabolite that we are using it for. In this case this could be because of known biological relations. For example, we know biologically that many of the lipid metabolites are related to each other, and we saw this with similar proportions of variation explained by the GRS of the androgenic steroid lipid metabolites with other androgenic steroid lipid metabolites. As such we would interpret results for these metabolites as supporting an effect of maternal androgenic steroid metabolites on CHD, but we cannot be specific about which ones are driving this. Similar or stronger variation of a GRS for other metabolites could be related to vertical pleiotropy, i.e., the metabolite for which the GRS is instrumenting strongly influences other metabolites that are related to CHD with the other metabolites partly mediating the effect of the focused metabolite. This would not bias the result. However, this could also occur with horizontal pleiotropy where the GRS, independently of the metabolite of interest, influences other metabolites that are risk factors for CHD. With our current data we are not able to distinguish between these two. A further limitation of this study is that maternal plasma/serum metabolomics data were derived at a single timepoint around 26-28 weeks’ gestation. Fetal cardiac development starts early in pregnancy and much of the development occurs in the first trimester 44. Here, we are assuming that metabolite levels around 26-28 weeks’ gestation are good proxies for levels in early pregnancy - when the offspring heart is forming. Previous work has shown that between person differences throughout pregnancy remain largely consistent (i.e., those with a high level of a metabolite in early pregnancy tend to have a similarly high level of a metabolite in later pregnancy) 45. Similarly, and worth mentioning, the effects obtained from MR studies are often interpreted as the lifetime effect of the exposure (metabolites) in question 46. In summary, we have used metabolomics data obtained during pregnancy to explore how the maternal metabolome may contribute to offspring CHDs. We found evidence that amino acid metabolism during pregnancy, several lipids (more specifically androgenic steroids), and levels of succinylcarnitine could be important contributing factors. Our analysis pipeline, which involved seeking replication of metabolite associations by harnessing large-scale GWAS data, provides scope to improve the reliability of findings and should prove to be more useful as these datasets continue to grow. Metabolomics could prove to be an important tool for identifying biological pathways that may lead to identification of prevention targets to decrease the disease burden of CHDs. To do this, future research will require international collaboration of more and larger studies with detailed metabolomics data in pregnancy, ideally with some of these having repeat measures across pregnancy and offspring CHD data. ## Supporting information Supplementary File 1 [[supplements/270425_file02.pdf]](pending:yes) Supplementary File 2 [[supplements/270425_file03.xlsx]](pending:yes) ## Data Availability Before you contact BiB study, please make sure you have read the Guidance for Collaborators: [https://borninbradford.nhs.uk/research/guidance-for-collaborators/](https://borninbradford.nhs.uk/research/guidance-for-collaborators/)). The ALSPAC data management plan ([http://www.bristol.ac.uk/alspac/researchers/data-access/documents/alspac-data-management-plan.pdf](http://www.bristol.ac.uk/alspac/researchers/data-access/documents/alspac-data-management-plan.pdf)) describes in detail the policy on data sharing, which is through a system of managed open access. Scientists are encouraged to make use of the BiB study data, which are available through a system of managed open access. Please note that the study website contains details of all the data that is available through a fully searchable data dictionary and variable search tool" and reference the following webpage: [http://www.bristol.ac.uk/alspac/researchers/our-data/](http://www.bristol.ac.uk/alspac/researchers/our-data/). MoBa data are used by researchers and research groups at both the Norwegian Institute of Public Health and other research institutions nationally and internationally. The research must adhere to the aims of MoBa and the participants' given consent. All use of data and biological material from MoBa is subject to Norwegian legislation. More information can be found on the study website ([https://www.fhi.no/en/studies/moba/for-forskere-artikler/research-and-data-access/](https://www.fhi.no/en/studies/moba/for-forskere-artikler/research-and-data-access/)). ## Ethical approval and consent to participate Ethical approval for ALSPAC was obtained from the ALSPAC Law and Ethics committee and local research ethics committees (NHS Haydock REC: 10/H1010/70). Informed consent for the use of data collected via questionnaires and clinics was obtained from participants following the recommendations of the ALSPAC Ethics and Law Committee at the time. At age 18, study children were sent ‘fair processing’ materials describing ALSPAC’s intended use of their health and administrative records and were given clear means to consent or object via a written form. Data were not extracted for participants who objected, or who were not sent fair processing materials. For BiB, Ethics approval has been obtained for the main platform study and all of the individual sub studies from the Bradford Research Ethics Committee. Written consent was obtained from all participants. The establishment of MoBa and initial data collection was based on a license from the Norwegian Data Protection Agency and approval from The Regional Committees for Medical and Health Research Ethics. The MoBa cohort is now based on regulations related to the Norwegian Health Registry Act. ## Availability of data and materials The ALSPAC data management plan ([http://www.bristol.ac.uk/alspac/researchers/data-access/documents/alspac-data-management-plan.pdf](http://www.bristol.ac.uk/alspac/researchers/data-access/documents/alspac-data-management-plan.pdf)) describes in detail the policy on data sharing, which is through a system of managed open access. Scientists are encouraged to make use of the BiB study data, which are available through a system of managed open access. Please note that the study website contains details of all the data that is available through a fully searchable data dictionary and variable search tool” and reference the following webpage: [http://www.bristol.ac.uk/alspac/researchers/our-data/](http://www.bristol.ac.uk/alspac/researchers/our-data/). Before you contact BiB study, please make sure you have read the Guidance for Collaborators: [https://borninbradford.nhs.uk/research/guidance-for-collaborators/](https://borninbradford.nhs.uk/research/guidance-for-collaborators/)). MoBa data are used by researchers and research groups at both the Norwegian Institute of Public Health and other research institutions nationally and internationally. The research must adhere to the aims of MoBa and the participants’ given consent. All use of data and biological material from MoBa is subject to Norwegian legislation. More information can be found on the study website ([https://www.fhi.no/en/studies/moba/for-forskere-artikler/research-and-data-access/](https://www.fhi.no/en/studies/moba/for-forskere-artikler/research-and-data-access/)). ## Sources of funding This study was funded by the European Research Council under the European Union’s Seventh Framework Programme (FP/2007-2013) / (grant agreement No 669545), US National Institute of Health (R01 DK10324) British Heart Foundation (CS/16/4/32482 and AA/18/7/34219) K. Taylor is supported by a British Heart Foundation Doctoral Training Program (FS/17/60/33474). K. Taylor, Dr McBride, Dr Borges, and Prof Lawlor work in a unit that is supported by the University of Bristol and UK Medical Research Council (MC_UU_00011/6). Prof Lawlor’s contribution is also supported by a British Heart Foundation Chair in Cardiovascular Science and Clinical Epidemiology (CH/F/20/90003) and a NIHR Senior Investigator (NF-0616-10102). Prof Caputo is supported by the British Heart Foundation Chair in Congenital Heart Disease (CH/1/32804). Dr. Magnus has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 947684). This research was also supported by the Research Council of Norway through its Centres of Excellence funding scheme (Project No. 262700). M.C. Borges is supported by a Vice Chancellor’s fellowship from the University of Bristol. BiB study has received core support from Wellcome Trust (223601/Z/21/Z and WT101597MA), a joint grant from the UK Medical Research Council and UK Economic and Social Science Research Council (MR/N024397/1), the British Heart Foundation (CS/16/4/32482), and the NIHR Applied Research Collaboration Yorkshire and Humber (NIHR200166) and Clinical Research Network. Core funding for ALSPAC is provided by the UK Medical Research Council and Wellcome (217065/Z/19/) and the University of Bristol. Many grants have supported different data collections, including for some of the data used in this publication, and a comprehensive list of grant funding is available on the ALSPAC website ([http://www.bristol.ac.uk/alspac/external/documents/grant-acknowledgements.pdf](http://www.bristol.ac.uk/alspac/external/documents/grant-acknowledgements.pdf)). GWAS data was generated by Sample Logistics and Genotyping Facilities at Wellcome Sanger Institute and LabCorp (Laboratory Corporation of America) using support from 23andMe. The views expressed in this publication are those of the author(s) and not necessarily those of the National Institute for Health Research or the Department of Health and Social Care. The Norwegian Mother, Father and Child Cohort Study is supported by the Norwegian Ministry of Health and Care Services and the Ministry of Education and Research. The EPIC-Norfolk study ([https://doi.org/10.22025/2019.10.105.00004](https://doi.org/10.22025/2019.10.105.00004)) has received funding from the Medical Research Council (MR/N003284/1 MC-UU\_12015/1 and MC_UU_00006/1) and Cancer Research UK (C864/A14136). The genetics work in the EPIC-Norfolk study was funded by the Medical Research Council (MC_PC_13048). Metabolite measurements in the EPIC-Norfolk study were supported by the MRC Cambridge Initiative in Metabolic Science (MR/L00002/1) and the Innovative Medicines Initiative Joint Undertaking under EMIF grant agreement no. 115372. ## Disclosures DAL has received support from Medtronic Ltd. and Roche Diagnostics for biomarker research unrelated to those presented in this paper. OAA is a consultant to HealthLytix. ## Acknowledgements BiB (Born in Bradford) study is only possible because of the enthusiasm and commitment of the children and parents. We are grateful to all the participants, practitioners, and researchers who have made BiB study happen. We are extremely grateful to all the families who took part in The Aovn Longitudinal Study of Parents and Children (ALSPAC) study, the midwives for their help in recruiting them, and the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists and nurses. The Norwegian Mother, Father and Child Cohort Study is supported by the Norwegian Ministry of Health and Care Services and the Ministry of Education and Research. We are grateful to all the participating families in Norway who take part in this on-going cohort study. We thank the Norwegian Institute of Public Health (NIPH) for generating high-quality genomic data. This research is part of the HARVEST collaboration, supported by the Research Council of Norway (#229624). We also thank the NORMENT Centre for providing genotype data, funded by the Research Council of Norway (#223273), South East Norway Health Authorities and Stiftelsen Kristian Gerhard Jebsen. We further thank the Center for Diabetes Research, the University of Bergen for providing genotype data and performing quality control and imputation of the data funded by the ERC AdG project SELECTionPREDISPOSED, Stiftelsen Kristian Gerhard Jebsen, Trond Mohn Foundation, the Research Council of Norway, the Novo Nordisk Foundation, the University of Bergen, and the Western Norway Health Authorities. We are grateful to all the participants who have been part of the EPIC-Norfolk metabolomics project and to the many members of the study teams at the University of Cambridge who have enabled this research. ## Footnotes * Minor edits of manuscript after comments from co-authors and prior to journal submission. * Received February 4, 2022. * Revision received March 31, 2022. * Accepted March 31, 2022. * © 2022, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## References 1. 1.Wang H, Naghavi M, Allen C, Barber RM, Bhutta ZA, Carter A, et al. Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980–2015: a systematic analysis for the Global Burden of Disease Study 2015. The Lancet [Internet]. 2016 Oct [cited 2020 Feb 20];388(10053):1459–544. Available from: [https://linkinghub.elsevier.com/retrieve/pii/S0140673616310121](https://linkinghub.elsevier.com/retrieve/pii/S0140673616310121) 2. 2.Blue GM, Kirk EP, Sholler GF, Harvey RP, Winlaw DS. Congenital heart disease: current knowledge about causes and inheritance. Medical Journal of Australia [Internet]. 2012 Aug [cited 2020 Feb 5];197(3):155–9. Available from: [https://onlinelibrary.wiley.com/doi/abs/10.5694/mja12.10811](https://onlinelibrary.wiley.com/doi/abs/10.5694/mja12.10811) 3. 3.Monni G, Atzori L, Corda V, Dessolis F, Iuculano A, Hurt KJ, et al. Metabolomics in Prenatal Medicine: A Review. Front Med [Internet]. 2021 Jun 25 [cited 2021 Jul 26];8:645118. Available from: [https://www.frontiersin.org/articles/10.3389/fmed.2021.645118/full](https://www.frontiersin.org/articles/10.3389/fmed.2021.645118/full) 4. 4.Koster MPH, van Duijn L, Krul-Poel YHM, Laven JS, Helbing WA, Simsek S, et al. A compromised maternal vitamin D status is associated with congenital heart defects in offspring. Early Hum Dev. 2018;117:50–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.earlhumdev.2017.12.011&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) 5. 5.Smedts HPM, van Uitert EM, Valkenburg O, Laven JSE, Eijkemans MJC, Lindemans J, et al. A derangement of the maternal lipid profile is associated with an elevated risk of congenital heart disease in the offspring. Nutrition, Metabolism and Cardiovascular Diseases [Internet]. 2012 Jun [cited 2020 Mar 27];22(6):477–85. Available from: [https://linkinghub.elsevier.com/retrieve/pii/S0939475310001900](https://linkinghub.elsevier.com/retrieve/pii/S0939475310001900) 6. 6.Cao L, Du Y, Zhang M, Wang F, Zhao J-Y, Ren Y-Y, et al. High maternal blood lipid levels during early pregnancy are associated with increased risk of congenital heart disease in offspring. Acta Obstet Gynecol Scand. 2021 Oct;100(10):1806–13. 7. 7.Priest JR, Yang W, Reaven G, Knowles JW, Shaw GM. Maternal Midpregnancy Glucose Levels and Risk of Congenital Heart Disease in Offspring. JAMA Pediatr. 2015 Dec;169(12):1112–6. 8. 8.Øyen N, Diaz LJ, Leirgul E, Boyd HA, Priest J, Mathiesen ER, et al. Prepregnancy Diabetes and Offspring Risk of Congenital Heart Disease: A Nationwide Cohort Study. Circulation. 2016 Jun 7;133(23):2243–53. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjExOiIxMzMvMjMvMjI0MyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzAzLzMxLzIwMjIuMDIuMDQuMjIyNzA0MjUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 9. 9.Simeone RM, Devine OJ, Marcinkevage JA, Gilboa SM, Razzaghi H, Bardenheier BH, et al. Diabetes and congenital heart defects: a systematic review, meta-analysis, and modeling project. Am J Prev Med. 2015 Feb;48(2):195–204. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.amepre.2014.09.002&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25326416&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) 10. 10.Bahado-Singh RO, Ertl R, Mandal R, Bjorndahl TC, Syngelaki A, Han B, et al. Metabolomic prediction of fetal congenital heart defect in the first trimester. Vol. 70, Obstetrical and Gynecological Survey. 2015. p. 9–11. 11. 11.Xie D, Luo Y, Xiong X, Lou M, Liu Z, Wang A, et al. Study on the Potential Biomarkers of Maternal Urine Metabolomics for Fetus with Congenital Heart Diseases Based on Modified Gas Chromatograph-Mass Spectrometer. Biomed Res Int. 2019;2019:1905416. 12. 12.Li Y, Sun Y, Yang L, Huang M, Zhang X, Wang X, et al. Analysis of Biomarkers for Congenital Heart Disease Based on Maternal Amniotic Fluid Metabolomics. Front Cardiovasc Med. 2021;8:671191. 13. 13.Jaddoe VWV, Felix JF, Andersen A-MN, Charles M-A, Chatzi L, Corpeleijn E, et al. The LifeCycle Project-EU Child Cohort Network: a federated analysis infrastructure and harmonized data of more than 250,000 children and parents. Eur J Epidemiol [Internet]. 2020 Jul 23 [cited 2020 Jul 27];709–24. Available from: [https://doi.org/10.1007/s10654-020-00662-z](https://doi.org/10.1007/s10654-020-00662-z) 14. 14.Wright J, Small N, Raynor P, Tuffnell D, Bhopal R, Cameron N, et al. Cohort Profile: The Born in Bradford multi-ethnic family cohort study. International Journal of Epidemiology [Internet]. 2013 Aug 1 [cited 2020 Mar 23];42(4):978–91. Available from: <[https://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dys112](https://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dys112)> 15. 15.Taylor K, McBride N, J Goulding N, Burrows K, Mason D, Pembrey L, et al. Metabolomics datasets in the Born in Bradford cohort. Wellcome Open Res [Internet]. 2021 Sep 10 [cited 2021 Oct 5];5:264. Available from: [https://wellcomeopenresearch.org/articles/5-264/v2](https://wellcomeopenresearch.org/articles/5-264/v2) 16. 16.Yang Q, Borges MC, Sanderson E, Magnus MC, Kilpi F, Collings PJ, et al. Associations of insomnia on pregnancy and perinatal outcomes: Findings from Mendelian randomization and conventional observational studies in up to 356,069 women. medRxiv [Internet]. 2021 Jan 1;2021.10.07.21264689. Available from: [http://medrxiv.org/content/early/2021/10/10/2021.10.07.21264689.abstract](http://medrxiv.org/content/early/2021/10/10/2021.10.07.21264689.abstract) 17. 17.Brand JS, Gaillard R, West J, McEachan RRC, Wright J, Voerman E, et al. Associations of maternal quitting, reducing, and continuing smoking during pregnancy with longitudinal fetal growth: Findings from Mendelian randomization and parental negative control studies. PLoS Med [Internet]. 2019 Nov 13 [cited 2020 Jul 1];16(11):e1002972. Available from: [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6853297/](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6853297/) 18. 18.Glinianaia S V., Rankin J, Wright C. Congenital anomalies in twins: A register-based study. Human Reproduction. 2008; 19. 19.Best KE, Rankin J. Increased risk of congenital heart disease in twins in the North of England between 1998 and 2010. Heart (British Cardiac Society). 2015;101(22):1807–12. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiaGVhcnRqbmwiO3M6NToicmVzaWQiO3M6MTE6IjEwMS8yMi8xODA3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDMvMzEvMjAyMi4wMi4wNC4yMjI3MDQyNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 20. 20.Boyd A, Golding J, Macleod J, Lawlor DA, Fraser A, Henderson J, et al. Cohort Profile: The ‘Children of the 90s’—the index offspring of the Avon Longitudinal Study of Parents and Children. International Journal of Epidemiology [Internet]. 2013 Feb [cited 2020 Mar 3];42(1):111–27. Available from: [https://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dys064](https://academic.oup.com/ije/article-lookup/doi/10.1093/ije/dys064) 21. 21.Fraser A, Macdonald-Wallis C, Tilling K, Boyd A, Golding J, Davey Smith G, et al. Cohort Profile: the Avon Longitudinal Study of Parents and Children: ALSPAC mothers cohort. Int J Epidemiol. 2013 Feb;42(1):97–110. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dys066&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22507742&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000316699300011&link_type=ISI) 22. 22.Northstone K, Lewcock M, Groom A, Boyd A, Macleod J, Timpson N, et al. The Avon Longitudinal Study of Parents and Children (ALSPAC): an update on the enrolled sample of index children in 2019. Wellcome Open Res [Internet]. 2019 Mar 14 [cited 2020 Mar 10];4:51. Available from: [https://wellcomeopenresearch.org/articles/4-51/v1](https://wellcomeopenresearch.org/articles/4-51/v1) 23. 23.Magnus P, Birke C, Vejrup K, Haugan A, Alsaker E, Daltveit AK, et al. Cohort Profile Update: The Norwegian Mother and Child Cohort Study (MoBa). Int J Epidemiol. 2016;45(2):382–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyw029&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27063603&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) 24. 24.Magnus P, Irgens LM, Haug K, Nystad W, Skjaerven R, Stoltenberg C, et al. Cohort profile: the Norwegian Mother and Child Cohort Study (MoBa). Int J Epidemiol. 2006 Oct;35(5):1146–50. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyl170&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16926217&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000241429200008&link_type=ISI) 25. 25.Sovio U, Goulding N, McBride N, Cook E, Gaccioli F, Charnock-Jones DS, et al. A maternal serum metabolite ratio predicts fetal growth restriction at term. Nature Medicine [Internet]. 2020 Mar [cited 2020 May 19];26(3):348–53. Available from: [https://www.nature.com/articles/s41591-020-0804-9](https://www.nature.com/articles/s41591-020-0804-9) 26. 26.Bishop C, Small N, Mason D, Corry P, Wright J, Parslow RC, et al. Improving case ascertainment of congenital anomalies: findings from a prospective birth cohort with detailed primary care record linkage. bmjpo [Internet]. 2017 Nov [cited 2020 Mar 4];1(1):e000171. Available from: [http://bmjpaedsopen.bmj.com/lookup/doi/10.1136/bmjpo-2017-000171](http://bmjpaedsopen.bmj.com/lookup/doi/10.1136/bmjpo-2017-000171) 27. 27.Taylor K, Thomas R, Mumme M, Golding J, Boyd A, Northstone K, et al. Ascertaining and classifying cases of congenital anomalies in the ALSPAC birth cohort. Wellcome Open Res [Internet]. 2020 Oct 6 [cited 2020 Oct 13];5:231. Available from: [https://wellcomeopenresearch.org/articles/5-231/v1](https://wellcomeopenresearch.org/articles/5-231/v1) 28. 28.Leirgul E, Fomina T, Brodwall K, Greve G, Holmstrøm H, Vollset SE, et al. Birth prevalence of congenital heart defects in Norway 1994-2009—A nationwide study. American Heart Journal [Internet]. 2014 Dec [cited 2022 Jan 18];168(6):956–64. Available from: [https://linkinghub.elsevier.com/retrieve/pii/S0002870314004980](https://linkinghub.elsevier.com/retrieve/pii/S0002870314004980) 29. 29.1. Cardon L Smith GD, Lawlor DA, Harbord R, Timpson N, Day I, Ebrahim S. Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology. Cardon L, editor. PLoS Med [Internet]. 2007 Dec 11 [cited 2020 Dec 10];4(12):e352. Available from: [https://dx.plos.org/10.1371/journal.pmed.0040352](https://dx.plos.org/10.1371/journal.pmed.0040352) 30. 30.Lawlor DA, Richmond R, Warrington N, McMahon G, Smith GD, Bowden J, et al. Using Mendelian randomization to determine causal effects of maternal pregnancy (intrauterine) exposures on offspring outcomes: Sources of bias and methods for assessing them. Wellcome Open Res [Internet]. 2017 Feb 14 [cited 2020 Nov 11];2:11. Available from: [https://wellcomeopenresearch.org/articles/2-11/v1](https://wellcomeopenresearch.org/articles/2-11/v1) 31. 31.Rønningen KS, Paltiel L, Meltzer HM, Nordhagen R, Lie KK, Hovengen R, et al. The biobank of the Norwegian Mother and Child Cohort Study: a resource for the next 100 years. Eur J Epidemiol. 2006;21(8):619–25. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10654-006-9041-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17031521&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000241168300006&link_type=ISI) 32. 32.Lotta LA, Pietzner M, Stewart ID, Wittemans LBL, Li C, Bonelli R, et al. A cross-platform approach identifies genetic regulators of human metabolism and health. Nat Genet [Internet]. 2021 Jan [cited 2021 Feb 16];53(1):54–64. Available from: [http://www.nature.com/articles/s41588-020-00751-5](http://www.nature.com/articles/s41588-020-00751-5) 33. 33.Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018 May 30;7:e34408. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7554/eLife.34408&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29846171&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) 34. 34.Day N, Oakes S, Luben R, Khaw KT, Bingham S, Welch A, et al. EPIC-Norfolk: study design and characteristics of the cohort. European Prospective Investigation of Cancer. Br J Cancer. 1999 Jul;80 Suppl 1:95–103. 35. 35.Di Angelantonio E, Thompson SG, Kaptoge S, Moore C, Walker M, Armitage J, et al. Efficiency and safety of varying the frequency of whole blood donation (INTERVAL): a randomised trial of 45 000 donors. Lancet. 2017 Nov 25;390(10110):2360–71. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(17)31928-1&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) 36. 36.Myers TA, Chanock SJ, Machiela MJ. LDlinkR: An R Package for Rapidly Calculating Linkage Disequilibrium Statistics in Diverse Populations. Front Genet. 2020;11:157. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) 37. 37.Taylor, Kurt. Using an untargeted metabolomics platform to explore associations between maternal metabolites and congenital heart disease in the offspring. 2020 [cited 2021 Feb 16]; Available from: [https://osf.io/zf4k6/](https://osf.io/zf4k6/) 38. 38.McBride N, Yousefi P, Sovio U, Taylor K, Vafai Y, Yang T, et al. Do mass-spectrometry-derived metabolomics improve prediction of pregnancy-related disorders? Findings from a UK birth cohort with independent validation. medRxiv [Internet]. 2021 Jan 1;2021.05.04.21256218. Available from: [http://medrxiv.org/content/early/2021/05/04/2021.05.04.21256218.abstract](http://medrxiv.org/content/early/2021/05/04/2021.05.04.21256218.abstract) 39. 39.Lai G, Li X, Zhang B, Wang L, He R, Zhao Y. Decreased Amino Acid Concentrations are Involved in Congenital Heart Disease. Ann Nutr Metab. 2019;74(3):257–63. 40. 40.Parsons AM, Bouma GJ. A Potential Role and Contribution of Androgens in Placental Development and Pregnancy. Life [Internet]. 2021 Jul 1 [cited 2022 Jan 6];11(7):644. Available from: [https://www.mdpi.com/2075-1729/11/7/644](https://www.mdpi.com/2075-1729/11/7/644) 41. 41.1. He B Mai M, Tönjes A, Kovacs P, Stumvoll M, Fiedler GM, Leichtle AB. Serum Levels of Acylcarnitines Are Altered in Prediabetic Conditions. He B, editor. PLoS ONE [Internet]. 2013 Dec 16 [cited 2022 Jan 6];8(12):e82459. Available from: [https://dx.plos.org/10.1371/journal.pone.0082459](https://dx.plos.org/10.1371/journal.pone.0082459) 42. 42.Innis SM. Fatty acids and early human development. Early Hum Dev. 2007 Dec;83(12):761–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.earlhumdev.2007.09.004&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17920214&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000251521100005&link_type=ISI) 43. 43.McKeegan PJ, Sturmey RG. The role of fatty acids in oocyte and early embryo development. Reprod Fertil Dev. 2011;24(1):59–67. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22394718&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F31%2F2022.02.04.22270425.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000297647800008&link_type=ISI) 44. 44.Gittenberger-de Groot AC, Bartelings MM, Deruiter MC, Poelmann RE. Basics of Cardiac Development for the Understanding of Congenital Heart Malformations. Pediatr Res [Internet]. 2005 Feb [cited 2020 Sep 4];57(2):169–76. Available from: [http://www.nature.com/doifinder/10.1203/01.PDR.0000148710.69159.61](http://www.nature.com/doifinder/10.1203/01.PDR.0000148710.69159.61) 45. 45.Mills HL, White SL, Pasupathy D, Briley AL, Santos Ferreira DL, Seed PT, et al. The effect of a lifestyle intervention in obese pregnant women on gestational metabolic profiles: findings from the UK Pregnancies Better Eating and Activity Trial (UPBEAT) randomised controlled trial. BMC Med [Internet]. 2019 Dec [cited 2020 Mar 26];17(1):15. Available from: [https://bmcmedicine.biomedcentral.com/articles/10.1186/s12916-018-1248-7](https://bmcmedicine.biomedcentral.com/articles/10.1186/s12916-018-1248-7) 46. 46.Sanderson E, Richardson TG, Morris TT, Tilling K, Davey Smith G. Estimation of causal effects of a time-varying exposure at multiple time points through Multivariable Mendelian randomization. medRxiv [Internet]. 2022 Jan 1;2022.01.04.22268740. Available from: [http://medrxiv.org/content/early/2022/01/05/2022.01.04.22268740.abstract](http://medrxiv.org/content/early/2022/01/05/2022.01.04.22268740.abstract) [1]: /embed/graphic-2.gif