Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Performance of a Computational Phenotyping Algorithm for Sarcoidosis Using Diagnostic Codes in Electronic Medical Records: A Pilot Study from Two Veterans Affairs Medical Centers

View ORCID ProfileMohamed I Seedahmed, View ORCID ProfileIzabella Mogilnicka, View ORCID ProfileSiyang Zeng, View ORCID ProfileGang Luo, View ORCID ProfileCharles McCulloch, View ORCID ProfileLaura Koth, View ORCID ProfileMehrdad Arjomandi
doi: https://doi.org/10.1101/2021.02.02.21250980
Mohamed I Seedahmed
1San Francisco Veterans Affairs Medical Center, San Francisco, California, USA
2Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, Department of Medicine, University of California San Francisco, California, USA
MD, MPH
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mohamed I Seedahmed
  • For correspondence: mohamed.seedahmed{at}ucsf.edu
Izabella Mogilnicka
1San Francisco Veterans Affairs Medical Center, San Francisco, California, USA
5Department of Experimental Physiology and Pathophysiology, Laboratory of the Centre for Preclinical Research, Medical University of Warsaw, Warsaw, Poland
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Izabella Mogilnicka
Siyang Zeng
1San Francisco Veterans Affairs Medical Center, San Francisco, California, USA
4Department of Biomedical Informatics and Medical Education, School of Medicine, University of Washington, Seattle, Washington, USA
MS
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Siyang Zeng
Gang Luo
4Department of Biomedical Informatics and Medical Education, School of Medicine, University of Washington, Seattle, Washington, USA
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Gang Luo
Charles McCulloch
3Department of Epidemiology & Biostatistics, University of California San Francisco, California, USA
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Charles McCulloch
Laura Koth
2Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, Department of Medicine, University of California San Francisco, California, USA
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Laura Koth
Mehrdad Arjomandi
1San Francisco Veterans Affairs Medical Center, San Francisco, California, USA
2Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, Department of Medicine, University of California San Francisco, California, USA
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mehrdad Arjomandi
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

Background The accuracy of identifying sarcoidosis cases in electronic medical records (EMR) using diagnostic codes is unknown.

Methods To estimate the statistical performance of using diagnostic codes, ICD-9 and ICD-10 diagnostic codes in identifying sarcoidosis cases in EMR, we searched the San Francisco and Palo Alto Veterans Affairs (VA) medical centers EMR and randomly selected 200 patients coded as sarcoidosis. To further improve diagnostic accuracy, we developed an “index of suspicion” algorithm to identify probable sarcoidosis cases based on clinical and radiographic features. We then determined the positive predictive value (PPV) of diagnosing sarcoidosis by two computational methods using ICD only and ICD plus the “index of suspicion” against the gold standard developed through manual chart review based on the American Thoracic Society (ATS) practice guideline. Finally, we determined healthcare providers’ adherence to the guidelines using a new scoring system.

Results The PPV of identifying sarcoidosis cases in VA EMR using ICD codes only was 71% (95%CI=64.7%-77.3%). The inclusion of our construct of “index of suspicion” along with the ICD codes significantly increased the PPV to 90% (95%CI=85.2%-94.6%). The care of sarcoidosis patients was more likely to be classified as “Fully” or “Substantially” adherent with the ATS practice guideline if their managing provider was a specialist (45% of primary care providers vs. 74% of specialists; P=0.008).

Conclusions Although ICD codes can be used as reasonable classifiers to identify sarcoidosis cases within EMR, using computational algorithms to extract clinical and radiographic information (“index of suspicion”) from unstructured data could significantly improve case identification accuracy.

Highlights

  • Identifying sarcoidosis cases using diagnostic codes in EMR has low accuracy.

  • “Unstructured data” contain information useful in identifying cases of sarcoidosis.

  • Computational algorithms could improve the accuracy and efficiency of case identification in EMR.

  • We introduce a new scoring system for assessing healthcare providers’ compliance with the American Thoracic Society (ATS) practice guideline.

  • Compliance scoring could help automatically assess sarcoidosis patients’ care delivery.

Competing Interest Statement

The authors have declared no competing interest.

Clinical Trial

n/a

Funding Statement

This work was supported by funds from the Department of Veterans Affairs Fellowship Award to MIS; the Flight Attendants Medical Research Institute (FAMRI) (CIA190001 to MA); Department of Veterans Affairs Clinical Sciences Research and Development (CSRD) (CXV-00125 to MA); the Tobacco-related Disease Research Program of the University of California (T29IR0715 to MA).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The University of California San Francisco Institutional Review Board and the Veterans Health Administration Research and Development Committee approved this study. [IRB Protocol #15-16660].

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • ↵γ MA and LK: These authors share senior authorship.

  • Email Addresses | ORCID iDs: MIS: mohamed.seedahmed{at}ucsf.edu

    IM: izabella.mogilnicka{at}gmail.com

    SZ: siyang.zeng{at}ucsf.edu

    GL: luogang{at}uw.edu

    CM: charles.mcculloch{at}ucsf.edu

    LK: laura.koth{at}ucsf.edu

    MA: mehrdad.arjomandi{at}ucsf.edu

  • Authors’ Contributions: All authors read and approved the final manuscript.

    Conceived and designed the study research: MIS, LK, MA

    Developed study protocol: MIS, LK, MA

    Worked on the methods: MIS, IM, SZ, CM, LK, MA

    Analyzed and Interpreted data: MIS, IM, SZ, CM, LK, MA

    Prepared and/or edit the manuscript: MIS, IM, GL, CM, LK, MA

  • I found a grammatical mistake in the abstract on the method section which I fixed. Old: "Methods-To estimate the statistical performance of using diagnostic codes, ICD-9 and ICD-10, to identify sarcoidosis cases in EMR, we searched the …" New: "Methods-To estimate the statistical performance of using ICD-9 and ICD-10 diagnostic codes in identifying sarcoidosis cases in EMR, we searched the …"

Data Availability

Due to the sensitive nature of health data analyzed in the current study, data will remain confidential and are not publicly available.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted February 05, 2021.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Performance of a Computational Phenotyping Algorithm for Sarcoidosis Using Diagnostic Codes in Electronic Medical Records: A Pilot Study from Two Veterans Affairs Medical Centers
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Performance of a Computational Phenotyping Algorithm for Sarcoidosis Using Diagnostic Codes in Electronic Medical Records: A Pilot Study from Two Veterans Affairs Medical Centers
Mohamed I Seedahmed, Izabella Mogilnicka, Siyang Zeng, Gang Luo, Charles McCulloch, Laura Koth, Mehrdad Arjomandi
medRxiv 2021.02.02.21250980; doi: https://doi.org/10.1101/2021.02.02.21250980
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Performance of a Computational Phenotyping Algorithm for Sarcoidosis Using Diagnostic Codes in Electronic Medical Records: A Pilot Study from Two Veterans Affairs Medical Centers
Mohamed I Seedahmed, Izabella Mogilnicka, Siyang Zeng, Gang Luo, Charles McCulloch, Laura Koth, Mehrdad Arjomandi
medRxiv 2021.02.02.21250980; doi: https://doi.org/10.1101/2021.02.02.21250980

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Respiratory Medicine
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)