CANcer Distributed Learning Environment (CANDLE)
CANcer Distributed Learning Environment (CANDLE) is an open source, collaboratively developed software platform that provides deep learning methodologies for accelerating cancer research.
CANDLE supports the Joint Design of Advanced Computing Solutions for Cancer (JDACS4C) program, a cross-agency research collaboration between the National Cancer Institute (NCI) and the Department of Energy (DOE), as well as other related areas of cancer research.
Driven by scientific challenges in cancer research defined by the JDACS4C pilot efforts, CANDLE capabilities enable advances in exascale computing through support from DOE’s Exascale Computing Project. DOE’s Argonne National Lab spearheads the project, with participation from across the JDACS4C national laboratories: Oak Ridge National Lab, Lawrence Livermore National Lab, Los Alamos National Lab, and NCI’s Frederick National Lab for Cancer Research. To further develop CANDLE, scientists across the JDACS4C collaboration work as a team, combining expertise in cancer research and in the computational, data, and physical sciences.
Features of CANDLE
The CANDLE platform gives cancer researchers access to key capabilities shaped by recent advances in deep learning:
- Hyperparameter optimization efficiently identifies the most effective model implementations.
- Scalable data parallelism speeds up machine learning when very large datasets are required.
As a scalable system, the CANDLE platform can be installed and run on systems ranging from a laptop to the largest supercomputers available for scientific research.
CANDLE leverages the best emerging open source innovations from the deep learning community. This makes it possible to extend and deliver a scalable deep learning platform that cancer researchers can access and adapt to their own challenges.
- The ECP CANDLE GitHub Organization hosts the benchmark codes, documentation, tutorials, and database schema.
- The ECP CANDLE FTP Site hosts all the public datasets for the benchmarks from the three pilots.
The CANDLE benchmarks deliver working examples of large-scale deep learning applied to different cancer research challenges. The benchmark models were developed to provide a performance baseline for measuring progress and to implement deep learning architectures relevant to both the scientific challenges in cancer research and the opportunities in exascale computing.
Through the deep learning capabilities of CANDLE, demonstrated in the benchmarks, cancer researchers can produce faster results in several areas:
- Identification of key molecular interactions, based on molecular dynamics simulations of proteins, specifically RAS
- Predictions of tumor response to drug treatments, based on molecular features of tumor cells and drug descriptors
- Better characterization of cancer patient trajectories and outcomes using a growing compendium of clinical information
Connecting the Cancer Community
CANDLE helps bring the cancer research community together through hands-on workshops, where researchers can share insights, and through benchmark examples that demonstrate how deep learning and CANDLE can be used broadly to accelerate cancer research.
The Future of CANDLE
The CANDLE project has already delivered software for each of the JDACS4C pilots and other cancer research projects.
Work is underway on new releases of CANDLE, including:
- Model optimization joined with parallel computing to increase capabilities for even larger amounts of data, and
- Development of new areas in which deep learning can accelerate cancer research