medRxiv

Interpretable machine learning model for data driven classification of Oral Health Related Quality of Life in Patients with Type 2 Diabetes Mellitus

Roomani Srivastava, R Murali, Meena Jain, Kshitij Jadhav
doi: https://doi.org/10.1101/2024.05.03.24306811
Roomani Srivastava
1 Indian Institute of Technology Bombay, Mumbai, India
R Murali
2 Krishnadevaraya College of Dental Sciences and Hospital, Bangalore, India
Meena Jain
3 Manav Rachna Dental College and Hospital
Kshitij Jadhav
1 Indian Institute of Technology Bombay, Mumbai, India
For correspondence: kshitij.jadhav{at}iitb.ac.in

Abstract

Type 2 Diabetes Mellitus (T2DM) is a debilitating condition with a number of complications, including those of the oral cavity, which can further deteriorate a patient’s general and oral health related quality of life (OHRQoL). Machine Learning (ML) can help estimate an individual’s propensity to develop poor OHRQoL, given a set of variables, and at the same time identify the most important features contributing to this outcome. Previously, inferential statistical methods have attempted to explain this, albeit with limited success. The aim of this cross-sectional study is to determine the impact on OHRQoL in T2DM patients, to identify the features most likely to be associated with this outcome, and to compare ML and deep learning (DL) analytical methods with inferential statistics. Twelve hundred T2DM patients were assessed using OHRQoL and demographic data questionnaires and the WHO Oral Health Assessment Form. K-means clustering was performed to label individuals as having or not having an impact on OHRQoL. Class imbalance was addressed by undersampling the majority class using informed subset selection. Further, using the collected data as input features, we developed ML algorithms (Naive Bayes (NB), Random Forest (RF), Logistic Regression (LR), Kernel Support Vector Machine (SVM) and Artificial Neural Network (ANN)) to classify individuals with or without poor OHRQoL, and utilized SHapley Additive exPlanations (SHAP) analysis for feature importance. The best performing model was SVM (AUC=0.983; Sensitivity=1) for classifying patients as having poor OHRQoL. SHAP values were highest for age, prosthetic need, tobacco use and years since onset of diabetes. Features closely related to diabetes, that is, periodontal pockets and loss of attachment, were not identified as relevant by inferential statistics, but were deemed important features associated with poor OHRQoL by SHAP analysis.
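The labelling and balancing steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' code. It assumes that each patient has a single total OHRQoL score and that a higher score means a greater impact; the scores are synthetic, and the function names are hypothetical. The abstract reports undersampling by informed subset selection, whose details are not given here, so plain random undersampling is used in its place.

```python
import random


def kmeans_two_clusters(scores, max_iter=100):
    """Two-cluster k-means on scalar scores.

    Centroids start at the minimum and maximum score, so label 1 is
    always the higher-scoring cluster (assumed here to mean 'impact').
    """
    centroids = [float(min(scores)), float(max(scores))]
    labels = None
    for _ in range(max_iter):
        # Assignment step: attach each score to its nearest centroid.
        new_labels = [
            0 if abs(s - centroids[0]) <= abs(s - centroids[1]) else 1
            for s in scores
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for k in (0, 1):
            members = [s for s, lab in zip(scores, labels) if lab == k]
            if members:
                centroids[k] = sum(members) / len(members)
    return labels, centroids


def undersample_majority(labels, seed=0):
    """Return sorted indices of a class-balanced subset.

    Random undersampling: a stand-in for the informed subset
    selection named in the abstract.
    """
    rng = random.Random(seed)
    by_class = {0: [], 1: []}
    for i, lab in enumerate(labels):
        by_class[lab].append(i)
    minority, majority = sorted(by_class, key=lambda k: len(by_class[k]))
    kept = by_class[minority] + rng.sample(by_class[majority], len(by_class[minority]))
    return sorted(kept)


# Synthetic scores (illustrative only): 90 low-impact and 30 high-impact patients.
rng = random.Random(42)
scores = [rng.randint(0, 10) for _ in range(90)] + [rng.randint(25, 40) for _ in range(30)]

labels, centroids = kmeans_two_clusters(scores)
kept = undersample_majority(labels)
balanced_labels = [labels[i] for i in kept]

print("cluster sizes before:", labels.count(0), labels.count(1))
print("cluster sizes after: ", balanced_labels.count(0), balanced_labels.count(1))
```

The balanced subset would then be passed, together with the clinical and demographic features, to the classifiers named in the abstract. That step is omitted because it depends on feature definitions not given on this page.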

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This study did not receive any funding.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

IRB of Krishnadevaraya College of Dental Sciences gave ethical approval for this work

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • 22D1628{at}iitb.ac.in, kshitij.jadhav{at}iitb.ac.in, https://www.iitb.ac.in/

  • iyemurali{at}gmail.com, https://www.kcdsh.org/

  • profmeenajain{at}gmail.com, https://manavrachna.edu.in/mrdc

  • Based on certain reviewer comments, justifications and certain limitations have been mentioned in the discussion.

Data Availability

All data produced in the present study are available upon reasonable request to the authors, subject to the relevant permissions from the parent institution where the study was conducted.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted August 12, 2024.
Interpretable machine learning model for data driven classification of Oral Health Related Quality of Life in Patients with Type 2 Diabetes Mellitus
Roomani Srivastava, R Murali, Meena Jain, Kshitij Jadhav
medRxiv 2024.05.03.24306811; doi: https://doi.org/10.1101/2024.05.03.24306811
Subject Area

  • Health Informatics