Oncoexporter: Conversion of NCI CRDC Data to the GA4GH Phenopacket Schema
If you’re attending the EACR 2024, make sure you hear Dr. Daoud Meerzaman’s poster presentation on using Oncoexporter to convert Cancer Research Data Commons (CRDC) data into the Global Alliance for Genomics and Health (GA4GH) Phenopackets schema.
Dr. Meerzaman will illustrate the application of the Phenopacket standard to CRDC data and demonstrate a subsequent application to a simple clinical analysis.
Providing tools to convert CRDC data into the Phenopacket schema will enable further integration and analysis of a broad array of cancer data. This integration could unlock important insights into cancer development, progression, and treatment hidden in the massive reservoirs of heterogeneous cancer data sets worldwide.
The GA4GH Phenopacket schema is a common framework for capturing phenotypic and genotypic data in a way that works well with downstream machine learning applications.
The CRDC is a cloud-based infrastructure that provides public and controlled access to multiple large-scale cancer data sets. You can use NCI Cloud Resources to analyze these data sets without downloading.
Dr. Meerzaman is the Computational Genomics and Bioinformatics Branch chief at NCI’s Center for Biomedical Informatics and Information Technology. Dr. Meerzaman and his team provide bioinformatics analysis support for life sciences and clinical and translational research for intramural scientists, including NCI’s Division of Cancer Epidemiology and Genetics and Center for Cancer Research.
Upcoming Events
-
The 2025 AACI Catchment Area Data Excellence (CADEx) ConferenceJanuary 29, 2025 - January 31, 2025Innovation and AI in OncologyJanuary 29, 2025NCI Symposium on Translational Technologies for Global HealthMarch 19, 2025 - March 20, 2025