medRxiv

Predicting pulmonary function from the analysis of voice: a machine learning approach

Md. Zahangir Alam, Albino Simonetti, Rafaelle Billantino, Nick Tayler, Chris Grainge, Pandula Siribaddana, S. A. Reza Nouraei, James Batchelor, M. Sohel Rahman, Eliane V. Mancuzo, John W Holloway, Judith A. Holloway, Faisal I Rezwan
doi: https://doi.org/10.1101/2021.05.11.21256997
Md. Zahangir Alam
1Human Development and Health, Faculty of Medicine, University of Southampton, Southampton, UK
2Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
Albino Simonetti
3Department of Information and Electrical Engineering and Applied Mathematics / DIEM, University of Salerno, Italy
Rafaelle Billantino
3Department of Information and Electrical Engineering and Applied Mathematics / DIEM, University of Salerno, Italy
Nick Tayler
4Peter Doherty Institute, The University of Melbourne, Melbourne VIC Australia
Chris Grainge
5Hunter Medical Research Institute, The University of Newcastle, NSW, Australia
6Department of Respiratory Medicine, John Hunter Hospital, NSW, Australia
Pandula Siribaddana
7Postgraduate Institute of Medicine, University of Colombo, Sri Lanka
S. A. Reza Nouraei
8Clinical Informatics Research Unit, University of Southampton, Southampton, UK
9Robert White Centre for Airway Voice and Swallowing, Poole Hospital, Poole, UK
James Batchelor
8Clinical Informatics Research Unit, University of Southampton, Southampton, UK
M. Sohel Rahman
2Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
Eliane V. Mancuzo
10Medical School, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
John W Holloway
1Human Development and Health, Faculty of Medicine, University of Southampton, Southampton, UK
11Clinical and Experimental Sciences, Faculty of Medicine, University of Southampton, Southampton, UK
Judith A. Holloway
11Clinical and Experimental Sciences, Faculty of Medicine, University of Southampton, Southampton, UK
12MSc Allergy, Faculty of Medicine, University of Southampton, Southampton, UK
Faisal I Rezwan
1Human Development and Health, Faculty of Medicine, University of Southampton, Southampton, UK
13School of Water, Energy and Environment, Cranfield University, Cranfield, UK
  • For correspondence: F.I.Rezwan{at}cranfield.ac.uk

Abstract

Self-monitoring can play a vital role in asthma control by enabling proper and timely treatment. Existing methods (such as peak flow meters and smart spirometers) require special equipment and are not always used by patients. Voice recordings, used as surrogate measures of lung function, could be used to assess asthma; this approach has good potential for self-monitoring and could be integrated into telehealth platforms. This study aims to apply a machine learning approach to predict lung function from the recorded voice of asthma patients.

A threshold-based mechanism was designed to separate speech and breathing in the recordings (323 recordings from 26 participants). Features extracted from these segments were combined with biological attributes and lung function (percentage predicted forced expiratory volume in 1 second, FEV1%). Three types of predictive model were developed: (a) regression models to predict lung function, (b) multi-class classification models to predict severity, and (c) binary classification models to predict abnormality. Random Forest (RF), Support Vector Machine (SVM), and Linear Regression (LR) algorithms were used to develop these models. Training and test samples were separated (70%:30%, using balanced partitioning). Features were normalised, and 10-fold cross-validation was used to measure the models' performance on the training samples. The models were then run on the test samples to measure final performance.
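The evaluation protocol described above (a 70%:30% split, feature normalisation, and 10-fold cross-validation on the training samples) can be sketched roughly as follows. This is a minimal illustration in plain Python and not the authors' implementation: the function names are hypothetical, "balanced partitioning" is assumed here to mean a class-stratified split, and min-max scaling is assumed as the normalisation method, since the abstract does not specify either.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_fraction=0.3, seed=0):
    """Return (train_idx, test_idx) with roughly equal class proportions.

    Assumption: "balanced partitioning" is read as stratification by label.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    train, test = [], []
    for idx in by_class.values():
        rng.shuffle(idx)
        n_test = round(len(idx) * test_fraction)
        test.extend(idx[:n_test])
        train.extend(idx[n_test:])
    return sorted(train), sorted(test)


def fit_min_max(rows):
    """Learn per-feature (min, max) from the training rows only."""
    return [(min(col), max(col)) for col in zip(*rows)]


def apply_min_max(rows, params):
    """Scale features using training-set parameters (constant features map to 0)."""
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0 for v, (lo, hi) in zip(row, params)]
        for row in rows
    ]


def k_fold_indices(n, k=10, seed=0):
    """Yield (train_idx, val_idx) pairs for k-fold cross-validation."""
    order = list(range(n))
    random.Random(seed).shuffle(order)
    folds = [order[i::k] for i in range(k)]
    for i in range(k):
        val = folds[i]
        trn = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield trn, val
```

In this sketch the scaling parameters are learned from the training rows and then reused for the test rows, so that no information from the test samples enters the normalisation step.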

The RF-based regression model outperformed the other regression models, with the lowest root mean square error (10.86) and a mean absolute score of 11.47. In predicting the severity of lung function, the SVM-based model performed best, with 73.20% accuracy. Among the binary classification models for predicting abnormality of lung function, the RF-based model performed best (accuracy = 0.85, F1-score = 0.84, and area under the receiver operating characteristic curve = 0.88).
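For readers unfamiliar with the metrics reported above, the following sketch shows their standard definitions in plain Python. It is an illustration only: the function names are hypothetical, the inputs in any usage are made up, and nothing here reproduces the study's data or code.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error for regression outputs (e.g. predicted FEV1%)."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def mae(y_true, y_pred):
    """Mean absolute error for regression outputs."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def accuracy(y_true, y_pred):
    """Fraction of labels predicted correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true, scores, positive=1):
    """Area under the ROC curve via pairwise ranking (ties count as half)."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

The pairwise form of `roc_auc` is quadratic in the number of samples, which is acceptable for a dataset of a few hundred recordings but would normally be replaced by a sort-based computation for larger data.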

The proposed machine learning approach can predict lung function (in terms of FEV1%) from recorded voice files better than other published approaches. The models can be extended to predict both the severity and the abnormality of lung function with reasonable accuracy. This technique could be used to develop future telehealth solutions, including smartphone-based applications, which have the potential to aid decision making and self-monitoring in asthma.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

The study was funded by the Asthma, Allergy and Inflammation Research Charity, Southampton, UK.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

This study was approved by the Southampton and South West Hampshire local ethics committee, UK (LREC number 12/EE/0545) and by the Medicines and Healthcare products Regulatory Agency (MHRA), UK (MHRA number 11709/0246/001-0001).

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Access to data and source code can be made available upon request.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Posted May 13, 2021.
Predicting pulmonary function from the analysis of voice: a machine learning approach
Md. Zahangir Alam, Albino Simonetti, Rafaelle Billantino, Nick Tayler, Chris Grainge, Pandula Siribaddana, S. A. Reza Nouraei, James Batchelor, M. Sohel Rahman, Eliane V. Mancuzo, John W Holloway, Judith A. Holloway, Faisal I Rezwan
medRxiv 2021.05.11.21256997; doi: https://doi.org/10.1101/2021.05.11.21256997
Subject Area

  • Health Informatics