Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Variable-Selection ANOVA Simultaneous Component Analysis (VASCA)

View ORCID ProfileJosé Camacho, Raffaele Vitale, David Morales-Jimenez, Carolina Gómez-Llorente
doi: https://doi.org/10.1101/2022.06.13.22276334
José Camacho
1Signal Theory, Networking and Communications Department, University of Granada, Granada, 18014, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for José Camacho
  • For correspondence: josecamacho{at}ugr.es
Raffaele Vitale
2Univ. Lille, CNRS, LASIRE (UMR 8516), Laboratoire Avancé de Spectroscopie pour les Interactions, la Réactivité et l’Environnement, F-59000, Lille, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Morales-Jimenez
1Signal Theory, Networking and Communications Department, University of Granada, Granada, 18014, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Carolina Gómez-Llorente
3Department of Biochemistry and Molecular Biology II, School of Pharmacy, Institute of Nutrition and Food Technology “José Mataix”, Biomedical Research Center, University of Granada, Granada, 18160, Spain
4ibs.GRANADA, Instituto de Investigación Sanitaria, Granada, 18012, Spain. CIBEROBN (Physiopathology of Obesity and Nutrition CB12/03/30038), Instituto de Salud Carlos III, 28029, Madrid, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Motivation ANOVA Simultaneous Component Analysis (ASCA) is a popular method for the analysis of multivariate data yielded by designed experiments. Meaningful associations between factors/interactions of the experimental design and measured variables in the data set are typically identified via significance testing, with permutation tests being the standard go-to choice. However, in settings with large numbers of variables, the “holistic” testing approach of ASCA (all variables considered) often overlooks statistically significant effects encoded by only a few variables.

Results We propose Variable-selection ASCA (VASCA), a method that generalizes ASCA through variable selection, augmenting its statistical power without inflating the Type-I error risk. The method is evaluated with simulations and with a real data set from a multi-omic clinical experiment. We show that VASCA is more powerful than both ASCA and the widely-adopted False Discovery Rate (FDR) controlling procedure; the latter is used as a benchmark for variable selection based on multiple significance testing. We further illustrate the usefulness of VASCA for exploratory data analysis in comparison to the popular Partial Least Squares Discriminant Analysis (PLS-DA) method and its sparse counterpart (sPLS-DA).

Availability The code for VASCA is available in the MEDA Toolbox at https://github.com/josecamachop/MEDA-Toolbox

Contact josecamacho{at}ugr.es

Supplementary information Supplementary data are available at Bioinformatics online.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work is partly supported by the Agencia Andaluza del Conocimiento, Regional Government of Andalucia, in Spain, and ERDF (European Regional Development Fund) funds through project B-TIC136-UGR20. The work of D. Morales-Jimenez is supported in part by the State Research Agency (AEI) of Spain and the European Social Fund under grant RYC2020-030536-I and by AEI under grant PID2020- 118139RB-I00

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The study protocol was approved by the local Ethics Committee of Granada (Reference 8/15) and was conducted according to the standards given in the Declaration of Helsinki (Edinburg 2000 revised), the Good Clinical Practice of the European Union (document 111/3976/88 July 1990) and legal in-forced Spanish regulations, which regulated the clinical investigation in human beings.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

No real data is explicitly generated for this paper. The code for the simulation study is available upon reasonable request to the authors. The code for VASCA is available in the MEDA Toolbox at https://github.com/josecamachop/MEDA-Toolbox

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.
Back to top
PreviousNext
Posted June 16, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Variable-Selection ANOVA Simultaneous Component Analysis (VASCA)
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Variable-Selection ANOVA Simultaneous Component Analysis (VASCA)
José Camacho, Raffaele Vitale, David Morales-Jimenez, Carolina Gómez-Llorente
medRxiv 2022.06.13.22276334; doi: https://doi.org/10.1101/2022.06.13.22276334
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Variable-Selection ANOVA Simultaneous Component Analysis (VASCA)
José Camacho, Raffaele Vitale, David Morales-Jimenez, Carolina Gómez-Llorente
medRxiv 2022.06.13.22276334; doi: https://doi.org/10.1101/2022.06.13.22276334

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)