PT - JOURNAL ARTICLE AU - Molina-Mora, Jose Arturo AU - González, Alejandra AU - Jiménez-Morgan, Sergio AU - Cordero-Laurent, Estela AU - Brenes, Hebleen AU - Soto-Garita, Claudio AU - Sequeira-Soto, Jorge AU - Duarte-Martínez, Francisco TI - Clinical profiles at the time of diagnosis of COVID-19 in Costa Rica during the pre-vaccination period using a machine learning approach AID - 10.1101/2021.06.18.21259157 DP - 2021 Jan 01 TA - medRxiv PG - 2021.06.18.21259157 4099 - http://medrxiv.org/content/early/2021/06/23/2021.06.18.21259157.short 4100 - http://medrxiv.org/content/early/2021/06/23/2021.06.18.21259157.full AB - Background The clinical manifestations of COVID-19 disease, caused by the SARS-CoV-2 virus, define a large spectrum of symptoms that are mainly dependent on the human host conditions. In Costa Rica, almost 319 000 cases have been reported during the first third of 2021, contrasting to the 590 000 fully vaccinated people. In the pre-vaccination period (the year 2020), this country accumulated 169 321 cases and 2185 deaths.Methods To describe the clinical presentations at the time of diagnosis of COVID-19 in Costa Rica during the pre-vaccination period, we implemented a symptom-based clustering using machine learning to identify clusters or clinical profiles among 18 974 records of positive cases. Profiles were compared based on symptoms, risk factors, viral load, and genomic features of the SARS-CoV-2 sequence.Results A total of seven COVID-19 clinical profiles were identified, which were characterized by a specific composition of symptoms. In the comparison between clusters, a lower viral load was found for the asymptomatic group, while the risk factors and the SARS-CoV-2 genomic features were distributed among all the clusters. No other distribution patterns were found for age, sex, vital status, and hospitalization.Conclusion During the pre-vaccination time in Costa Rica, the clinical manifestations at the time of diagnosis of COVID-19 were described in seven profiles. The host co-morbidities and the SARS-CoV-2 genotypes are not specific of a particular profile, rather they are present in all the groups, including asymptomatic cases. In further analyses, these results will be compared against the profiles of cases during the vaccination period.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was funded by INCIENSA and Vicerrectoria de Investigacion, Universidad de Costa Rica, with the Project C0196 Protocolo bioinformatico y de inteligencia artificial para el apoyo de la vigilancia epidemiologica basada en laboratorio del virus SARS-CoV-2 mediante la identificacion de patrones genomicos y clinico-demograficos en Costa Rica (2020-2022).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This study was approved by INCIENSA (INCIENSA-DG-of-2020-174) and the scientific committee of CIET-UCR (242-2020). Data were collected for epidemiological surveillance according to the Costa Rican regulation Law 8270 (May 17th, 2002), in which no additional consent was required for retrospective studies of archived and anonymized samples.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesProcessed data is found in the Supplementary material.