ABSTRACT
We outline the development of the Health Data Nexus, a data platform that combines data storage and access management with a cloud-based computational environment. We describe the importance of this secure platform in an evolving public-sector research landscape that uses large quantities of data, particularly clinical data acquired from health systems, as well as the importance of providing meaningful benefits for three targeted user groups: data providers, researchers, and educators. We then describe the governance practices, technical standards, and data security and privacy protections needed to build this platform, along with example use cases highlighting the platform’s strengths in facilitating dataset acquisition, enabling novel research, and hosting educational courses, workshops, and datathons. Finally, we discuss the key principles that informed the platform’s development, highlighting the importance of flexible uses, collaborative development, and open-source science.
Competing Interest Statement
The author MM holds non-controlling shares in Signal1 AI. The authors RC, KS, DH, and KR are all employed by Upside Labs.
Funding Statement
The creation of T-CAIREM and the Health Data Nexus has been supported by the Temerty Foundation through a transformational gift.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
(january.adams{at}utoronto.ca), (rafal{at}upsidelab.io), (karol.szuster{at}upsidelab.io), (dj{at}upsidelab.io), (zoryana.salo{at}utoronto.ca), (rutvik.solanki{at}utoronto.ca), (muhammad.mamdani{at}unityhealth.to), (alistair{at}glowyr.ca), (kasia{at}upsidelab.io), (tpollard{at}mit.edu), (david.rotenberg{at}camh.ca)
List of Abbreviations
- A2B: Academia-to-Business
- AI: Artificial Intelligence
- BSD: Berkeley Software Distribution
- CT: Computed Tomography
- FHIR: Fast Healthcare Interoperability Resources
- GCP: Google Cloud Platform
- GIM: General Internal Medicine
- GPU: Graphics Processing Unit
- HDN: Health Data Nexus
- IRB: Institutional Review Board
- ML: Machine Learning
- PHIPA: Personal Health Information Protection Act
- PIA: Privacy Impact Assessment
- REB: Research Ethics Board
- T-CAIREM: Temerty Centre for Artificial Intelligence Research and Education in Medicine
- TCPS: Tri-Council Policy Statement
- TCPS-CORE: Tri-Council Policy Statement - Course on Research Ethics
- TRA: Threat Risk Assessment
- VADA: Visual and Automated Disease Analytics