medRxiv

Unsupervised machine-learning identifies clinically distinct subtypes of ALS that reflect different genetic architectures and biological mechanisms

Thomas P Spargo, Heather Marriott, Guy P Hunt, Oliver Pain, Renata Kabiljo, Harry Bowles, William Sproviero, Alexandra C Gillett, Isabella Fogh, Project MinE ALS Sequencing Consortium, Peter M. Andersen, Nazli A. Başak, Pamela J. Shaw, Philippe Corcia, Philippe Couratier, Mamede de Carvalho, Vivian Drory, Jonathan D. Glass, Marc Gotkine, Orla Hardiman, John E. Landers, Russell McLaughlin, Jesús S. Mora Pardina, Karen E. Morrison, Susana Pinto, Monica Povedano, Christopher E. Shaw, Vincenzo Silani, Nicola Ticozzi, Philip Van Damme, Leonard H. van den Berg, Patrick Vourc’h, Markus Weber, Jan H. Veldink, Richard J.B. Dobson, Ahmad Al Khleifat, Nicholas Cummins, Daniel Stahl, Ammar Al-Chalabi, Alfredo Iacoangeli
doi: https://doi.org/10.1101/2023.06.12.23291304
Thomas P Spargo
1Maurice Wohl Clinical Neuroscience Institute, King’s College London, Department of Basic and Clinical Neuroscience, London, UK
2Department of Biostatistics and Health Informatics, King’s College London, London, UK
3NIHR Maudsley Biomedical Research Centre (BRC) at South London and Maudsley NHS Foundation Trust and King’s College London, London, UK
Heather Marriott
1Maurice Wohl Clinical Neuroscience Institute, King’s College London, Department of Basic and Clinical Neuroscience, London, UK
2Department of Biostatistics and Health Informatics, King’s College London, London, UK
Guy P Hunt
2Department of Biostatistics and Health Informatics, King’s College London, London, UK
4Perron Institute for Neurological and Translational Science, Nedlands, WA 6009, Australia
5Centre for Molecular Medicine and Innovative Therapeutics, Murdoch University, Murdoch, WA 6150, Australia
Oliver Pain
1Maurice Wohl Clinical Neuroscience Institute, King’s College London, Department of Basic and Clinical Neuroscience, London, UK
Renata Kabiljo
1Maurice Wohl Clinical Neuroscience Institute, King’s College London, Department of Basic and Clinical Neuroscience, London, UK
2Department of Biostatistics and Health Informatics, King’s College London, London, UK
Harry Bowles
1Maurice Wohl Clinical Neuroscience Institute, King’s College London, Department of Basic and Clinical Neuroscience, London, UK
2Department of Biostatistics and Health Informatics, King’s College London, London, UK
William Sproviero
6Department of Psychiatry, University of Oxford, Oxford, UK
Alexandra C Gillett
7Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Isabella Fogh
2Department of Biostatistics and Health Informatics, King’s College London, London, UK
Peter M. Andersen
8Department of Clinical Science, Umeå University, Umeå SE-901 85, Sweden
MD
Nazli A. Başak
9Koc University, School of Medicine, Translational Medicine Research Center, NDAL, Istanbul, 34010, Turkey
Pamela J. Shaw
10Sheffield Institute for Translational Neuroscience (SITraN), University of Sheffield, Sheffield S10 2HQ, UK
Philippe Corcia
11UMR 1253, Université de Tours, Inserm, Tours 37044, France
12Centre de référence sur la SLA, CHU de Tours, Tours 37044, France
Philippe Couratier
13Centre de référence sur la SLA, CHRU de Limoges, Limoges, France
14UMR 1094, Université de Limoges, Inserm, Limoges 87025, France
Mamede de Carvalho
15Instituto de Fisiologia, Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Lisbon 1649-028, Portugal
Vivian Drory
16Department of Neurology, Tel-Aviv Sourasky Medical Centre, Tel-Aviv 64239, Israel
17Sackler Faculty of Medicine, Tel-Aviv University, Tel-Aviv 6997801, Israel
Jonathan D. Glass
18Department of Neurology, Emory University School of Medicine, Atlanta, Georgia, GA 30322, USA
Marc Gotkine
19Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem 91904, Israel
20Agnes Ginges Center for Human Neurogenetics, Department of Neurology, Hadassah Medical Center, Jerusalem 91120, Israel
Orla Hardiman
21Academic Unit of Neurology, Trinity Biomedical Sciences Institute, Trinity College Dublin, Dublin D02 PN40, Ireland
John E. Landers
22Department of Neurology, UMass Chan Medical School, Worcester, MA 01655, USA
Russell McLaughlin
23Complex Trait Genomics Laboratory, Smurfit Institute of Genetics, Trinity College Dublin, Dublin D02 PN40, Ireland
Jesús S. Mora Pardina
24ALS Unit, Hospital San Rafael, Madrid, Spain
Karen E. Morrison
25School of Medicine, Dentistry and Biomedical Sciences, Queen’s University Belfast, Belfast BT9 7BL, UK
Susana Pinto
15Instituto de Fisiologia, Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Lisbon 1649-028, Portugal
MD
Monica Povedano
26Functional Unit of Amyotrophic Lateral Sclerosis (UFELA), Service of Neurology, Bellvitge University Hospital, L’Hospitalet de Llobregat, Barcelona 08907, Spain
Christopher E. Shaw
1Maurice Wohl Clinical Neuroscience Institute, King’s College London, Department of Basic and Clinical Neuroscience, London, UK
Vincenzo Silani
27Department of Neurology Stroke Unit and Laboratory of Neuroscience, Istituto Auxologico Italiano, IRCCS, Milan 20149, Italy
28Department of Pathophysiology and Transplantation, “Dino Ferrari” Center, Università degli Studi di Milano, Milan 20122, Italy
Nicola Ticozzi
27Department of Neurology Stroke Unit and Laboratory of Neuroscience, Istituto Auxologico Italiano, IRCCS, Milan 20149, Italy
28Department of Pathophysiology and Transplantation, “Dino Ferrari” Center, Università degli Studi di Milano, Milan 20122, Italy
Philip Van Damme
29Department of Neurology, University Hospitals Leuven and Department of Neuroscience, KU Leuven, Leuven 3000, Belgium
30VIB, Center for Brain and Disease Research, Leuven 3000, Belgium
Leonard H. van den Berg
31Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht 3584 CX, Netherlands
Patrick Vourc’h
11UMR 1253, Université de Tours, Inserm, Tours 37044, France
32Service de Biochimie et Biologie moléculaire, CHU de Tours, Tours 37044, France
Markus Weber
33Neuromuscular Diseases Unit/ALS Clinic, Kantonsspital St. Gallen, 9007 St. Gallen, Switzerland
Jan H. Veldink
31Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht 3584 CX, Netherlands
Richard J.B. Dobson
2Department of Biostatistics and Health Informatics, King’s College London, London, UK
3NIHR Maudsley Biomedical Research Centre (BRC) at South London and Maudsley NHS Foundation Trust and King’s College London, London, UK
34Institute of Health Informatics, University College London, London, UK
Ahmad Al Khleifat
1Maurice Wohl Clinical Neuroscience Institute, King’s College London, Department of Basic and Clinical Neuroscience, London, UK
Nicholas Cummins
2Department of Biostatistics and Health Informatics, King’s College London, London, UK
Daniel Stahl
2Department of Biostatistics and Health Informatics, King’s College London, London, UK
Ammar Al-Chalabi
1Maurice Wohl Clinical Neuroscience Institute, King’s College London, Department of Basic and Clinical Neuroscience, London, UK
35King’s College Hospital, Bessemer Road, London, SE5 9RS, UK
Alfredo Iacoangeli
1Maurice Wohl Clinical Neuroscience Institute, King’s College London, Department of Basic and Clinical Neuroscience, London, UK
2Department of Biostatistics and Health Informatics, King’s College London, London, UK
3NIHR Maudsley Biomedical Research Centre (BRC) at South London and Maudsley NHS Foundation Trust and King’s College London, London, UK
  • For correspondence: alfredo.iacoangeli{at}kcl.ac.uk

Abstract

Background Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease characterised by a highly variable clinical presentation and multifaceted genetic and biological bases that translate into great patient heterogeneity. Identifying subgroups of patients that are homogeneous in both clinical presentation and biological causes could favour the development of effective treatments, healthcare, and clinical trials. We aimed to identify and characterise homogeneous clinical subgroups of ALS and to examine whether they represent underlying biological trends.

Methods Latent class clustering analysis, an unsupervised machine-learning method, was used to identify homogeneous subpopulations in 6,523 people with ALS from Project MinE, using widely collected ALS-related clinical variables. The clusters were validated using 7,829 independent patients from STRENGTH. We tested whether the identified subgroups were associated with biological trends in genetic variation across genes previously linked to ALS, in polygenic risk scores of ALS and related neuropsychiatric traits, and in gene expression data from post-mortem motor cortex samples.
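
As a rough illustration of the general idea behind latent class clustering, and not the authors' actual pipeline, the sketch below fits a latent class model to binary indicators with a plain expectation-maximisation (EM) loop. Everything in it is an assumption made for illustration: the function names `fit_latent_classes` and `simulate`, the synthetic two-class data, the four binary indicators, and the smoothing constants. The study itself used real clinical variables from Project MinE and identified five subgroups; none of that is reproduced here.

```python
import math
import random


def fit_latent_classes(data, n_classes, n_iter=200, seed=0):
    """EM for a latent class model over binary (0/1) indicators.

    Returns (weights, probs, resp): class weights, the per-class
    probability that each indicator equals 1, and per-row class
    responsibilities (posterior membership probabilities).
    """
    rng = random.Random(seed)
    n_rows, n_vars = len(data), len(data[0])
    weights = [1.0 / n_classes] * n_classes
    # Random start away from 0 and 1 so the classes are not identical.
    probs = [[rng.uniform(0.25, 0.75) for _ in range(n_vars)]
             for _ in range(n_classes)]
    resp = []
    for _ in range(n_iter):
        # E-step: posterior probability of each class for each row.
        resp = []
        for row in data:
            log_joint = []
            for k in range(n_classes):
                lp = math.log(weights[k])
                for x, p in zip(row, probs[k]):
                    lp += math.log(p if x else 1.0 - p)
                log_joint.append(lp)
            top = max(log_joint)  # subtract the max for numerical stability
            unnorm = [math.exp(v - top) for v in log_joint]
            total = sum(unnorm)
            resp.append([u / total for u in unnorm])
        # M-step: re-estimate weights and indicator probabilities.
        for k in range(n_classes):
            mass = sum(r[k] for r in resp)
            # Small smoothing terms (an assumption) keep estimates off 0 and 1.
            weights[k] = (mass + 1e-3) / (n_rows + n_classes * 1e-3)
            for j in range(n_vars):
                hits = sum(r[k] * row[j] for r, row in zip(resp, data))
                probs[k][j] = (hits + 0.5) / (mass + 1.0)
    return weights, probs, resp


def simulate(n_rows, seed=1):
    """Draw synthetic rows from two hypothetical classes with opposite profiles."""
    rng = random.Random(seed)
    profiles = [[0.9, 0.9, 0.1, 0.1], [0.1, 0.1, 0.9, 0.9]]
    rows, labels = [], []
    for _ in range(n_rows):
        z = rng.randrange(2)
        labels.append(z)
        rows.append([1 if rng.random() < p else 0 for p in profiles[z]])
    return rows, labels


rows, labels = simulate(400)
weights, probs, resp = fit_latent_classes(rows, n_classes=2)
# Assign each row to its most probable class.
assigned = [max(range(2), key=lambda k: r[k]) for r in resp]
agree = sum(a == z for a, z in zip(assigned, labels)) / len(labels)
accuracy = max(agree, 1.0 - agree)  # class labels are arbitrary up to swapping
print(round(accuracy, 2))
```

In practice the number of classes is not fixed in advance as it is here; it is usually chosen by comparing fitted models with different class counts using a criterion such as the Bayesian information criterion.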

Results We identified five ALS subgroups based on patterns in clinical data that generalised across international datasets. Distinct genetic trends were observed for rare variants in the SOD1 and C9orf72 genes, and across genes implicated in biological processes relevant to ALS. Polygenic risk scores of ALS, schizophrenia and Parkinson’s disease were also higher in distinct clusters relative to controls. Gene expression analysis identified different altered biological processes across clusters, reflecting the genetic differences. We developed a machine learning classifier based on our model to assign subgroup membership using clinical data available at first visit, and made it available on a public webserver at http://latentclusterals.er.kcl.ac.uk.

Conclusion ALS subgroups characterised by highly distinct clinical presentations were discovered and validated in two large independent international datasets. Such groups were also characterised by different underlying genetic architectures and biology. Our results showed that data-driven patient stratification into more clinically and biologically homogeneous subtypes of ALS is possible and could help develop more effective and targeted approaches to the biomedical and clinical study of ALS.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This study did not receive any funding.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • $ co-senior authors.

Data Availability

All data produced in the present study are available upon reasonable request to the authors.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Posted June 13, 2023.

Subject Area

  • Neurology