PT - JOURNAL ARTICLE AU - Argyri, Katerina D. AU - Gallos, Ioannis K. AU - Amditis, Angelos AU - Dionysiou, Dimitra D. TI - Exposomics and Cardiovascular Diseases: A Scoping Review of Machine Learning Approaches AID - 10.1101/2024.07.19.24310695 DP - 2024 Jan 01 TA - medRxiv PG - 2024.07.19.24310695 4099 - http://medrxiv.org/content/early/2024/07/30/2024.07.19.24310695.short 4100 - http://medrxiv.org/content/early/2024/07/30/2024.07.19.24310695.full AB - Cardiovascular disease has been established as the world’s number one killer, causing over 20 million deaths per year. This fact, along with the growing awareness of the impact of exposomic risk factors on cardiovascular diseases, has led the scientific community to leverage machine learning strategies as a complementary approach to traditional statistical epidemiological studies that are challenged by the highly heterogeneous and dynamic nature of exposomics data. The principal objective served by this work is to identify key pertinent literature and provide an overview of the breadth of research in the field of machine learning applications on exposomics data with a focus on cardiovascular diseases. Secondarily, we aimed at identifying common limitations and meaningful directives to be addressed in the future. Overall, this work shows that, despite the fact that machine learning on exposomics data is under-researched compared to its application on other members of the -omics family, it is increasingly adopted to investigate different aspects of cardiovascular diseases.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study did not receive any fundingAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesI confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesThis is a scoping review paper.AdaBoostAdaptive BoostingAENET-IAdaptive Elastic-Net with main effects and pairwise interactionsAIArtificial IntelligenceANNArtificial Neural NetworkAPSAverage Precision ScoreAUC-PRArea Under the Precision Recall CurveAUC-ROCArea Under the Receiver Operating Characteristic CurveAUCArea Under the CurveBAGBagging (regressor or classifier based on context)BARTBayesian additive regression treeBKMRBayesian Kernel Machine RegressionBMIBody Mass IndexCARTClassification And Regression TreeCatBoostCategorical BoostingCNNConvolutional Neural NetworkCVDCardio-Vascular DiseaseGBGradient BoostingDLDeep LearningDTDecision TreeELSTMEnhanced Long Short-Term Memory ModelENElastic NetERSEnvironmental Risk ScoreExWASExposome-Wide Association StudyFDRFalse Discovery RateFNRFalse Negative RateFPRFalse Positive RateGGTGamma-Glutamyl TransferaseGSVGoogle Street ViewIDIIntegrated Discrimination ImprovementIFIsolation ForestKNNk-nearest neighborsKOBTKnockoff Boosted TreesLASSOLeast Absolute Shrinkage and Selection OperatorLDLLow-Density LipoproteinsLGBMLight Gradient Boosting MachineLMEMLinear Mixed Effects ModelLOO-CVLeave-One-Out Cross-ValidationLRLogistic RegressionLSTMLong Short-Term Memory ModelMAEMean Absolute ErrorMAPEMean Absolute Percentage ErrorMCCMatthew’s Correlation CoefficientMIMyocardial InfarctionMLMachine LearningMLPMulti-Layer PerceptronMSEMean-Squared ErrorMSPEMean-Squared Prediction ErrorNBNaïve BayesNPVNegative Predictive ValueNRICategorical Net Reclassification ImprovementPCAPrincipal Component AnalysisPRESSRFRandom ForestRMSERoot Mean Squared ErrorSHAPSHapley Additive exPlanationsSVCSupport Vector ClassificationSVMSupport Vector MachinesXGBoostExtreme Gradient Boosting