Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Establishing and characterising large COVID-19 cohorts after mapping the Information System for Research in Primary Care in Catalonia to the OMOP Common Data Model

View ORCID ProfileEdward Burn, Sergio Fernández-Bertolín, Erica A Voss, Clair Blacketer, Maria Aragón, Martina Recalde, Elena Roel, Andrea Pistillo, Berta Raventós, Carlen Reyes, Sebastiaan van Sandijk, Lars Halvorsen, Peter R Rijnbeek, Talita Duarte-Salles
doi: https://doi.org/10.1101/2021.11.23.21266734
Edward Burn
1Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Barcelona, Spain
2Centre for Statistics in Medicine, University of Oxford
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Edward Burn
Sergio Fernández-Bertolín
1Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Barcelona, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Erica A Voss
3Janssen Pharmaceutical Research and Development, Titusville, NJ, USA
4Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands
5OHDSI collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Clair Blacketer
3Janssen Pharmaceutical Research and Development, Titusville, NJ, USA
4Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands
5OHDSI collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Maria Aragón
1Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Barcelona, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Martina Recalde
1Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Barcelona, Spain
6Universitat Autònoma de Barcelona, Bellaterra, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Elena Roel
1Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Barcelona, Spain
6Universitat Autònoma de Barcelona, Bellaterra, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andrea Pistillo
1Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Barcelona, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Berta Raventós
1Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Barcelona, Spain
6Universitat Autònoma de Barcelona, Bellaterra, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Carlen Reyes
1Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Barcelona, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sebastiaan van Sandijk
7Odysseus Data Services s.r.o., Prague, Czech Republic
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lars Halvorsen
8edenceHealth NV, Kontich, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Peter R Rijnbeek
4Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands
5OHDSI collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Talita Duarte-Salles
1Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Barcelona, Spain
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: tduarte{at}idiapjgol.org
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Background Few datasets have been established that capture the full breadth of COVID-19 patient interactions with a health system. Our first objective was to create a COVID-19 dataset that linked primary care data to COVID-19 testing, hospitalisation, and mortality data at a patient level. Our second objective was to provide a descriptive analysis of COVID-19 outcomes among the general population and describe the characteristics of the affected individuals.

Methods We mapped patient-level data from Catalonia, Spain, to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). More than 3,000 data quality checks were performed to assess the readiness of the database for research. Subsequently, to summarise the COVID-19 population captured, we established a general population cohort as of the 1st March 2020 and identified outpatient COVID-19 diagnoses or positive test results for SARS-CoV-2, hospitalisations with COVID-19, and COVID-19 deaths during follow-up, which went up until 30th June 2021.

Findings Mapping data to the OMOP CDM was performed and high data quality was observed. The mapped database was used to identify a total of 5,870,274 individuals, who were included in the general population cohort as of 1st March 2020. Over follow up, 604,472 had either an outpatient COVID-19 diagnosis or positive test result, 58,991 had a hospitalisation with COVID-19, 5,642 had an ICU admission with COVID-19, and 11,233 had a COVID-19 death. People who were hospitalised or died were more commonly older, male, and with more comorbidities. Those admitted to ICU with COVID-19 were generally younger and more often male than those hospitalised in general and those who died.

Interpretation We have established a comprehensive dataset that captures COVID-19 diagnoses, test results, hospitalisations, and deaths in Catalonia, Spain. Extensive data checks have shown the data to be fit for use. From this dataset, a general population cohort of 5.9 million individuals was identified and their COVID-19 outcomes over time were described.

Funding Generalitat de Catalunya and European Health Data and Evidence Network (EHDEN).

Competing Interest Statement

All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf. EAV and CB are employees of Janssen Research and Development LLC and shareholders of Johnson & Johnson (J&J) stock.

Funding Statement

This project is funded by the Health Department from the Generalitat de Catalunya with a grant for research projects on SARS-CoV-2 and COVID-19 disease organized by the Direccio General de Recerca i Innovacio en Salut. This project has received support from the European Health Data and Evidence Network (EHDEN) project. EHDEN received funding from the Innovative Medicines Initiative 2 Joint Undertaking (JU) under grant agreement No 806968. The JU receives support from the European Union's Horizon 2020 research and innovation programme and EFPIA. The funders had no role in study design, data collection, and analysis, decision to publish, or preparation of the manuscript.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

This study was approved by the Clinical Research Ethics Committee of the IDIAPJGol (project code: 20/070-PCV).

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • ↵* joint first-authors

Data Availability

In accordance with current European and national law, the data used in this study is only available for the researchers participating in this study. Thus, we are not allowed to distribute or make publicly available the data to other parties. However, researchers from public institutions can request data from SIDIAP if they comply with certain requirements. Further information is available online (https://www.sidiap.org/index.php/menu-solicitudesen/application-proccedure) or by contacting SIDIAP (sidiap{at}idiapjgol.org).

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted November 24, 2021.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Establishing and characterising large COVID-19 cohorts after mapping the Information System for Research in Primary Care in Catalonia to the OMOP Common Data Model
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Establishing and characterising large COVID-19 cohorts after mapping the Information System for Research in Primary Care in Catalonia to the OMOP Common Data Model
Edward Burn, Sergio Fernández-Bertolín, Erica A Voss, Clair Blacketer, Maria Aragón, Martina Recalde, Elena Roel, Andrea Pistillo, Berta Raventós, Carlen Reyes, Sebastiaan van Sandijk, Lars Halvorsen, Peter R Rijnbeek, Talita Duarte-Salles
medRxiv 2021.11.23.21266734; doi: https://doi.org/10.1101/2021.11.23.21266734
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Establishing and characterising large COVID-19 cohorts after mapping the Information System for Research in Primary Care in Catalonia to the OMOP Common Data Model
Edward Burn, Sergio Fernández-Bertolín, Erica A Voss, Clair Blacketer, Maria Aragón, Martina Recalde, Elena Roel, Andrea Pistillo, Berta Raventós, Carlen Reyes, Sebastiaan van Sandijk, Lars Halvorsen, Peter R Rijnbeek, Talita Duarte-Salles
medRxiv 2021.11.23.21266734; doi: https://doi.org/10.1101/2021.11.23.21266734

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)