Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Integration of Machine Learning to Identify Diagnostic Genes in Leukocytes for Acute Myocardial Infarction Patients

View ORCID ProfileLin Zhang, Yue Liu, Kaiyue Wang, Xiangqin Ou, Jiashun Zhou, Houliang Zhang, Min Huang, Zhenfang Du, Sheng Qiang
doi: https://doi.org/10.1101/2023.09.07.23295181
Lin Zhang
1State Key Laboratory of Component-based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Jinghai, Tianjin 301617, P. R. China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lin Zhang
Yue Liu
2Department of Nephropathy, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang, Jiangsu 215600, P. R. China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kaiyue Wang
1State Key Laboratory of Component-based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Jinghai, Tianjin 301617, P. R. China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiangqin Ou
3The First Affiliated Hospital of Guizhou University of Traditional Chinese Medicine, Guiyang, Guizhou 550025, P. R. China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jiashun Zhou
4Tianjin Jinghai District Hospital, 14 Shengli Road, Jinghai, Tianjin 301699, P. R. China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Houliang Zhang
4Tianjin Jinghai District Hospital, 14 Shengli Road, Jinghai, Tianjin 301699, P. R. China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Min Huang
2Department of Nephropathy, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang, Jiangsu 215600, P. R. China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zhenfang Du
2Department of Nephropathy, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang, Jiangsu 215600, P. R. China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: qiangsheng660{at}163.com zyydzf{at}163.com
Sheng Qiang
2Department of Nephropathy, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang, Jiangsu 215600, P. R. China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: qiangsheng660{at}163.com zyydzf{at}163.com
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background Acute myocardial infarction (AMI) has two clinical characteristics: high missed diagnosis and dysfunction of leukocytes. Transcriptional RNA on leukocytes is closely related to the course evolution of AMI patients. We hypothesized that transcriptional RNA in leukocytes might provide potential diagnostic value for AMI. Integration machine learning (IML) was first used to explore AMI discrimination genes. The following clinical study was performed to validate the results.

Methods A total of four AMI microarrays (derived from the Gene Expression Omnibus) were included in this study (220 sample size), and the controls were identified as patients with stable coronary artery disease (SCAD). At a ratio of 5:2, GSE59867 was included in the training set, while GSE60993, GSE62646, and GSE48060 were included in the testing set. IML was explicitly proposed in this research, which is composed of six machine learning algorithms, including support vector machine (SVM), neural network (NN), random forest (RF), gradient boosting machine (GBM), decision trees (DT), and least absolute shrinkage and selection operator (LASSO). IML had two functions in this research: filtered optimized variables and predicted the categorized value. Furthermore, 40 individuals were recruited, and the results were verified.

Results Thirty-nine differentially expressed genes (DEGs) were identified between controls and AMI individuals from the training sets. Among the thirty-nine DEGs, IML was used to process the predicted classification model and identify potential candidate genes with overall normalized weights >1. Finally, Two genes (AQP9 and SOCS3) show their diagnosis value with the area under the curve (AUC) > 0.9 in both the training and testing sets. The clinical study verified the significance of AQP9 and SOCS3. Notably, more stenotic coronary arteries or severe Killip classification indicated higher levels of these two genes, especially SOCS3. These two genes correlated with two immune cell types, monocytes and neutrophils.

Conclusion AQP9 and SOCS3 in leukocytes may be conducive to identifying AMI patients with SCAD patients. AQP9 and SOCS3 are closely associated with monocytes and neutrophils, which might contribute to advancing AMI diagnosis and shed light on novel genetic markers. Multiple clinical characteristics, multicenter, and large-sample relevant trials are still needed to confirm its clinical value.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

The research was funded by Suzhou Science & Technology Development Plan (SYSD2019222). Zhangjiagang science and technology plan project (ZKS2135), Youth science and technology project of Zhangjiagang Municipal Health Commission (ZJGQNKJ202211).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Ethics Review Committee of Jinghai District Hospital approved the study.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Availability of data and material

The datasets presented in this study can be found online. The names of the repositories and GEO numbers can be found below: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE59867;https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE60993;https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE62646;https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE48060.

  • Abbreviation

    AUC
    Area under the Curve
    AMI
    Acute Myocardial Infarction
    IML
    Integration Machine Learning
    DEGs
    Differently Expressed Genes
    KEGG-GSEA
    Kyoto Encyclopedia of Genes and Genomes-Gene Set Enrichment Analysis
    GO
    Gene Ontology
    DO
    Disease Ontology
    MF
    Molecular Function
    BP
    Biological Process
    CC
    Cellular Components
    SVM
    Support Vector Machine
    ML
    Machine Learning
    LASSO
    Least Absolute Shrinkage and Selection Operator
    RF
    Random Forest
    GBM
    Gradient Boosting Machine
    DT
    Decision Trees
    NN
    Neural Network.
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
    Back to top
    PreviousNext
    Posted September 08, 2023.
    Download PDF

    Supplementary Material

    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Integration of Machine Learning to Identify Diagnostic Genes in Leukocytes for Acute Myocardial Infarction Patients
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Integration of Machine Learning to Identify Diagnostic Genes in Leukocytes for Acute Myocardial Infarction Patients
    Lin Zhang, Yue Liu, Kaiyue Wang, Xiangqin Ou, Jiashun Zhou, Houliang Zhang, Min Huang, Zhenfang Du, Sheng Qiang
    medRxiv 2023.09.07.23295181; doi: https://doi.org/10.1101/2023.09.07.23295181
    Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    Integration of Machine Learning to Identify Diagnostic Genes in Leukocytes for Acute Myocardial Infarction Patients
    Lin Zhang, Yue Liu, Kaiyue Wang, Xiangqin Ou, Jiashun Zhou, Houliang Zhang, Min Huang, Zhenfang Du, Sheng Qiang
    medRxiv 2023.09.07.23295181; doi: https://doi.org/10.1101/2023.09.07.23295181

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Cardiovascular Medicine
    Subject Areas
    All Articles
    • Addiction Medicine (349)
    • Allergy and Immunology (668)
    • Allergy and Immunology (668)
    • Anesthesia (181)
    • Cardiovascular Medicine (2648)
    • Dentistry and Oral Medicine (316)
    • Dermatology (223)
    • Emergency Medicine (399)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
    • Epidemiology (12228)
    • Forensic Medicine (10)
    • Gastroenterology (759)
    • Genetic and Genomic Medicine (4103)
    • Geriatric Medicine (387)
    • Health Economics (680)
    • Health Informatics (2657)
    • Health Policy (1005)
    • Health Systems and Quality Improvement (985)
    • Hematology (363)
    • HIV/AIDS (851)
    • Infectious Diseases (except HIV/AIDS) (13695)
    • Intensive Care and Critical Care Medicine (797)
    • Medical Education (399)
    • Medical Ethics (109)
    • Nephrology (436)
    • Neurology (3882)
    • Nursing (209)
    • Nutrition (577)
    • Obstetrics and Gynecology (739)
    • Occupational and Environmental Health (695)
    • Oncology (2030)
    • Ophthalmology (585)
    • Orthopedics (240)
    • Otolaryngology (306)
    • Pain Medicine (250)
    • Palliative Medicine (75)
    • Pathology (473)
    • Pediatrics (1115)
    • Pharmacology and Therapeutics (466)
    • Primary Care Research (452)
    • Psychiatry and Clinical Psychology (3432)
    • Public and Global Health (6527)
    • Radiology and Imaging (1403)
    • Rehabilitation Medicine and Physical Therapy (814)
    • Respiratory Medicine (871)
    • Rheumatology (409)
    • Sexual and Reproductive Health (410)
    • Sports Medicine (342)
    • Surgery (448)
    • Toxicology (53)
    • Transplantation (185)
    • Urology (165)