Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Mitigating Machine Learning Bias Between High Income and Low-Middle Income Countries for Enhanced Model Fairness and Generalizability

View ORCID ProfileJenny Yang, View ORCID ProfileLei Clifton, Nguyen Thanh Dung, Nguyen Thanh Phong, Lam Minh Yen, Doan Bui Xuan Thy, Andrew A. S. Soltan, Louise Thwaites, David A. Clifton
doi: https://doi.org/10.1101/2024.02.01.24302010
Jenny Yang
1Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jenny Yang
  • For correspondence: jenny.yang{at}eng.ox.ac.uk
Lei Clifton
2Nuffield Department of Population Health, University of Oxford
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lei Clifton
Nguyen Thanh Dung
3Hospital for Tropical Diseases, Ho Chi Minh, Vietnam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nguyen Thanh Phong
3Hospital for Tropical Diseases, Ho Chi Minh, Vietnam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lam Minh Yen
4Oxford University Clinical Research Unit, Ho Chi Minh, Vietnam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Doan Bui Xuan Thy
4Oxford University Clinical Research Unit, Ho Chi Minh, Vietnam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andrew A. S. Soltan
1Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford
2Nuffield Department of Population Health, University of Oxford
5Oxford Cancer & Haematology Centre, Oxford University Hospitals NHS Foundation Trust
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Louise Thwaites
4Oxford University Clinical Research Unit, Ho Chi Minh, Vietnam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David A. Clifton
1Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford
6Oxford-Suzhou Centre for Advanced Research (OSCAR), Suzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Collaborative efforts in artificial intelligence (AI) are increasingly common between high-income countries (HICs) and low-to middle-income countries (LMICs). Given the resource limitations often encountered by LMICs, collaboration becomes crucial for pooling resources, expertise, and knowledge. Despite the apparent advantages, ensuring the fairness and equity of these collaborative models is essential, especially considering the distinct differences between LMIC and HIC hospitals. In this study, we show that collaborative AI approaches can lead to divergent performance outcomes across HIC and LMIC settings, particularly in the presence of data imbalances. Through a real-world COVID-19 screening case study, we demonstrate that implementing algorithmic-level bias mitigation methods significantly improves outcome fairness between HIC and LMIC sites while maintaining high diagnostic sensitivity. We compare our results against previous benchmarks, utilizing datasets from four independent United Kingdom Hospitals and one Vietnamese hospital, representing HIC and LMIC settings, respectively.

Competing Interest Statement

DAC reports personal fees from Oxford University Innovation, personal fees from BioBeats, personal fees from Sensyne Health, outside the submitted work.

Funding Statement

This work was supported by the Wellcome Trust/University of Oxford Medical & Life Sciences Translational Fund (Award: 0009350), and the Oxford National Institute for Health and Care Research (NIHR) Biomedical Research Centre (BRC). This work was also supported by the Wellcome Trust (Awards: WT 214906/Z/18/Z and WT217650/Z/19/Z). JY is a Marie Sklodowska-Curie Fellow, under the European Union Horizon 2020 research and innovation programme (Grant agreement: 955681, MOIRA). AAS is an NIHR Academic Clinical Fellow (Award: ACF-2020-13-015). DAC was supported by a Royal Academy of Engineering Research Chair, an NIHR Research Professorship, the InnoHK Hong Kong Centre for Cerebro-cardiovascular Health Engineering (COCHE), and the Pandemic Sciences Institute at the University of Oxford. The funders of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the manuscript. The views expressed in this publication are those of the authors and not necessarily those of the funders.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

United Kingdom National Health Service (NHS) approval via the national oversight/regulatory body, the Health Research Authority (HRA), has been granted for use of routinely collected clinical data to develop and validate artificial intelligence models to detect Covid-19 (CURIAL; NHS HRA IRAS ID: 281832). The ethics committee at the Hospital for Tropical Diseases (HTD) approved use of the HTD dataset for COVID-19 diagnosis.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • ↵† These authors jointly supervised this work.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted February 03, 2024.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Mitigating Machine Learning Bias Between High Income and Low-Middle Income Countries for Enhanced Model Fairness and Generalizability
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Mitigating Machine Learning Bias Between High Income and Low-Middle Income Countries for Enhanced Model Fairness and Generalizability
Jenny Yang, Lei Clifton, Nguyen Thanh Dung, Nguyen Thanh Phong, Lam Minh Yen, Doan Bui Xuan Thy, Andrew A. S. Soltan, Louise Thwaites, David A. Clifton
medRxiv 2024.02.01.24302010; doi: https://doi.org/10.1101/2024.02.01.24302010
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Mitigating Machine Learning Bias Between High Income and Low-Middle Income Countries for Enhanced Model Fairness and Generalizability
Jenny Yang, Lei Clifton, Nguyen Thanh Dung, Nguyen Thanh Phong, Lam Minh Yen, Doan Bui Xuan Thy, Andrew A. S. Soltan, Louise Thwaites, David A. Clifton
medRxiv 2024.02.01.24302010; doi: https://doi.org/10.1101/2024.02.01.24302010

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)