NCI Cancer Research Data Commons

The vision for the Cancer Research Data Commons (CRDC) is a virtual, expandable infrastructure that provides secure access to many different data types across scientific domains, allowing users to analyze, share, and store results, leveraging the storage and elastic compute, or ability to easily scale resources, of the cloud. The ability to combine diverse data types and perform cross-domain analysis of large datasets can lead to new discoveries in cancer prevention, treatment and diagnosis, and supports the goals of precision medicine and the Cancer Moonshot℠.

The NCI Cancer Research Data Commons (CRDC) provides biomedical researchers, tool developers, and data scientists with access to data from NCI programs through the Genomic Data Commons, NCI Cloud Resources, and Proteomics Data Commons. The CRDC allows users to analyze, share, and store results, and is growing to include a wider range of data, including proteomics, imaging, and canine.

The CRDC provides access to data from NCI programs such as The Cancer Genome Atlas (TCGA) and its pediatric counterpart, Therapeutically Applicable Research to Generate Effective Treatments (TARGET), and The Clinical Proteomics Tumor Analysis Consortium (CPTAC), through:  

The CRDC is growing to include a wider range of data. The fundamental principles of the CRDC include:

The Data Commons Framework describes the core principles and components on which the CRDC is being built.