Abstract
Background Breast cancer risk assessment typically relies on consideration of mammography, family history, reproductive history, in addition to assessment of major mutations. However, quantifying the impact of environmental factors, such as lifestyle, can be challenging. DNA methylation (DNAm) presents a promising opportunity for additional information, as it captures effects from both genetics and environment. Previous studies have identified associations and predicted the risk of breast cancer using DNAm in blood, however, these studies did not distinguish between genetic and environmental influences.
Results Pre-diagnosis blood samples were obtained from 32 monozygotic (MZ)and 76 dizygotic (DZ) female twin pairs discordant for breast cancer. DNAm was profiled using Illumina 450k and EPIC platform. Samples were collected at the mean age of 56 years (standard deviation (SD) =9.90), while the mean age at diagnosis was 66.76 years (SD=6.64). The mean age for censoring controls was 75.20 years (SD=9.45). To identify individual DNAm sites (Cytosine-phosphate-Guanine, CpG) and differentially methylated regions (DMRs) associated with breast cancer risk, survival analysis using paired Cox proportional hazard modeling was performed. Due to the paired modeling, shared genetic and environmental effects, age at entry, and age at sampling were controlled for.
We identified 212 CpGs (p<6.4*10^-8) and 15 DMRs associated with breast cancer risk across all pairs, with three DMRs overlapping with the individual CpGs. All but one of the 212 CpGs had lower DNAm in cases, suggesting a prevailing trend of hypomethylation of blood DNA prior to breast cancer diagnosis. Altogether 197/212 significant CpGs were also differentially methylated within the 32 MZ twin pairs discordant for breast cancer, suggesting that DNAm at these CpGs is likely independent of genetic effects. Prior research suggests that estrogen regulates at least five of the top CpGs identified. Hence, methylation at these sites may reflect individual differences in estrogen exposure.
Conclusion In conclusion, most of the identified CpGs associated with future breast cancer diagnosis appear to be independent of genetic effects. This suggests that DNAm could potentially serve as a valuable biomarker for environmental risk factors for breast cancer and may offer potential benefits as a complementary tool to current risk assessment procedures.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This research was funded by the European Unions Horizon 2020 Research and Innovation Programme, Marie Sklodowska-Curie (JK, grant number 859860). This project has received further funding from the Academy of Finland (MO, grant numbers 297908, 328685 & JK, grant 336823) and the Sigrid Juselius Foundation (MO and JK).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Informed consent was obtained before the beginning of the studies in 1975, and upon every contact with the study subjects. When clinical investigations were undertaken with sampling of biological material, written informed consent was obtained. Ethics approval of these procedures was provided in multiple studies, the last one on the transfer of biological samples to the THL Biobank by the Hospital District of Helsinki and Uusimaa ethics board in 2018. The two Danish twin studies (LSADT and MADT) were approved by the Regional Committees on Health Research Ethics for Southern Denmark (S-VF-19980072), and written informed consents were obtained from all participants.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
The Finnish Twin Cohort datasets utilized in the current study are stored in the Biobank of the Finnish Institute for Health and Welfare, Helsinki, Finland. The data is publicly available for use by qualified researchers through a standardized application procedure.
List of abbreviations
- BC
- breast cancer
- CI
- confidence interval
- CpG
- Cytosine-phosphate-Guanine
- DNAm
- DNA methylation
- DZ
- dizygotic
- FDR
- false discovery rate
- HR
- Hazard Ratio
- LSADT
- Longitudinal Study of Aging in Danish Twins Study
- MADT
- Middle Age Danish Twins Study
- MZ
- monozygotic
- OR
- Odds Ratio
- SD
- standard deviation