Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Classifying Refugee Status Using Common Features in EMR

Malia Morrison, Vanessa Nobles, Crista E. Johnson-Agbakwu, Celeste Bailey, View ORCID ProfileLi Liu
doi: https://doi.org/10.1101/2021.08.17.21262048
Malia Morrison
1College of Health Solutions, Arizona State University, Phoenix, AZ, 85004, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vanessa Nobles
1College of Health Solutions, Arizona State University, Phoenix, AZ, 85004, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Crista E. Johnson-Agbakwu
2Department of Obstetrics, Gynecology and Women’s Health, Valleywise Health, Phoenix, AZ, 85008, USA
3Creighton University School of Medicine -Phoenix Campus, Phoenix, AZ, 85008, USA
4District Medical Group, Mesa, AZ, 85201, USA
5Southwest Interdisciplinary Research Center, Watts College of Public Service and Community Solutions, Arizona State University, Tempe, AZ
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Celeste Bailey
2Department of Obstetrics, Gynecology and Women’s Health, Valleywise Health, Phoenix, AZ, 85008, USA
3Creighton University School of Medicine -Phoenix Campus, Phoenix, AZ, 85008, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Li Liu
1College of Health Solutions, Arizona State University, Phoenix, AZ, 85004, USA
6Biodesign Institute, Arizona State University, Tempe, AZ, 85281, USA
7Department of Neurology, Mayo Clinic, Scottsdale, AZ 85259, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Li Liu
  • For correspondence: liliu{at}asu.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

Objective Automated and accurate identification of refugees in healthcare databases is a critical first step to investigate healthcare needs of this vulnerable population and improve health disparities. This study developed a machine-learning method, named refugee identification system (RIS) that uses features commonly collected in healthcare databases to classify refugees and non-refugees.

Materials and Methods We compiled a curated data set consisting of 103 refugees and 930 non-refugees in Arizona. For each person in the curated data set, we collected age, primary language, and noise-masked home address. We supplemented de-identified individual-level data with state-level refugee resettlement statistics and world language statistics, then performed feature engineering to convert primary language and masked address into quantitative features. Finally, we built a random forest model to classify refugee status.

Results Evaluated on holdout testing data, RIS achieved a high classification accuracy of 0.97, specificity of 0.99, sensitivity of 0.85, positive predictive value of 0.88, and negative predictive value of 0.98. The receiver operating characteristic curve had an area under the curve value of 0.98. The source code is available at GitHub (https://github.com/liliulab/ris).

Discussion and Conclusion RIS is an automated, accurate, and scalable method to predict refugee status. It uses only de-identified information to protect patient privacy. The computational framework is adaptable to address similar challenges in other States. Its application enables large-scale investigation of refugee healthcare needs and improvement of health disparities.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

No external funding.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

This study is proved by the IRB at Valleywise Health.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • We added noises to the latitude and longitude coordinates of residential addresses. With these noise-masked features, we retrained the classification model.

Data Availability

Data are available upon request.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted August 22, 2022.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Classifying Refugee Status Using Common Features in EMR
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Classifying Refugee Status Using Common Features in EMR
Malia Morrison, Vanessa Nobles, Crista E. Johnson-Agbakwu, Celeste Bailey, Li Liu
medRxiv 2021.08.17.21262048; doi: https://doi.org/10.1101/2021.08.17.21262048
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Classifying Refugee Status Using Common Features in EMR
Malia Morrison, Vanessa Nobles, Crista E. Johnson-Agbakwu, Celeste Bailey, Li Liu
medRxiv 2021.08.17.21262048; doi: https://doi.org/10.1101/2021.08.17.21262048

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)