Cancer Data Science Pulse

Using Cancer Data Science to Advance Your Research

Data science can help advance your work in cancer research and medicine. No matter where you are in your data science journey, we have articles and resources for you to explore.

Data science captures various techniques and tools that researchers, clinicians, and data scientists use to drive the field of cancer research forward. Maybe machine learning can help you predict patient responses and treatment plans, or data visualization methods will help you more effectively share your findings. Perhaps you’re curious about what a cancer researcher needs to know about working with a data scientist. Get answers to these questions, and more, from the following articles!

Data, Clinical Trials, and You

Performing a CIViC Duty—A Community-Driven Resource for Interpreting Data on Cancer Variants

If you’re planning to set up a clinical trial, you can use the information in this blog to help you determine your baselines and parameters. See what’s been done with different therapeutics and the outcomes before you set up your trial! You’ll also learn about CIViC, an open-source platform that connects researchers to the latest published findings on a wide range of cancer variant interpretations. 

Making Sense of Data Sharing (and Management)

Your Guide to the 2023 NIH Data Management and Sharing Policy

Whether you’re new to data science or have questions about properly managing and sharing the data you work with, this blog can help. Effectively sharing data with others in the cancer community greatly impacts the field’s advancement; everyone benefits from following this policy!

Funding for Data Science Projects

Discover the New Index of NCI Studies Web Application to Get NCI Research Outputs 

If you’re looking for examples of how NCI funding could help support data science in your work, check out this article on the Index of NCI Studies. You’ll be able to browse through the outputs from NCI-supported grants and get a better sense of NCI funding and data for cancer research.

NCI-Funded Partnership Uses AI to Improve Diversity in Cancer Clinical Trials

Are you passionate about improving diversity in clinical trials? Data science plays a vital role in ensuring clinical trials are representative. This article will give you a look at how NCI-funded researchers partner with experts to improve diversity in cancer research through artificial intelligence.

You’ve Got Data! Now What?

Visualizing Data Using Circular Heatmaps and Biplots—Pro-Tips from NCI Researchers

Are you comfortable with the basics of data science but need help using your data most effectively? As the name suggests, data visualization lets you create a visual component to your data, which can be helpful in gaining insights from your data and sharing your data with others. In this blog, you’ll learn more about two types of data visualization graphics that you may want to use in your research.

Collaboration is Key

Federated Learning – A Solution for Democratizing Data for Cancer Research?

Does the scope of data science seem overwhelming? There are vast amounts of data from many sources, challenges with data diversity and bias, and accessibility issues to consider. This blog walks you through a novel approach to using data science for cancer research. Give this a read if you’re curious about the impact of collaboration across the field of cancer data science.

Now that you’ve had a chance to explore data science topics and think you might be interested in learning more about the tools available to you; where do you go? Check out the sampling below of some of the types of tools and resources you can access!


Resource/Tool Name

What is it?

What does it do?

How can you use it?

ATOM Modeling PipeLine


The ATOM Modeling PipeLine (AMPL) is an open source, modular, extensive software pipeline for building and sharing models.

Use AMPL to generate machine learning models for your work in advancing silico drug discovery.



CANcer Distributed Learning Environment (CANDLE) is an open source software platform that improves prediction accuracy of machine learning algorithms.

Explore CANDLE to find resources that help with hyperparameter optimization for more accurate machine learning in your cancer research.



Informatics Technology for Cancer Research (ITCR) is an NCI-program supporting informatics technology across cancer research.

Visit this website to discover the wide range of tools in the ITCR portfolio. You can use these tools for your work in omics, imaging, network biology, clinical research, and data standards.

Cancer Drug Response Prediction

Data Set

This provides dataframes (e.g., gene expression and drug response data, etc.) and supporting metadata used in the Pilot 1 project.

Check out this resource if you’re looking into systematically modeling tumor drug response with deep learning models more suited for large-scale data. 

Are there additional topics you’d like to see covered to help you understand what data science can do for you and your research? Is there something you wish an informatics tool could make simpler for you? Leave a comment below and we’ll follow up with more information!

You can also subscribe for a weekly email highlighting new resources, events, and opportunities.

Older Post
Job Interview Questions for Cancer Data Scientists and How to Answer Them
Newer Post
NCI’s National Cancer Plan and Data’s Critical Role

Leave a Reply

Vote below about this page’s helpfulness.

Your email address will not be published.

Curing cancer may result in a demographic catastrophe
Thank you for your interest in the topic. NCI's mission is to lead, conduct, and support cancer research across the nation to advance scientific knowledge and help all people live longer, healthier lives. At CBIIT, we're committed to using data science to advance that goal!
The visuals you've included in this post really help to illustrate your points. They add a lot of value and make the content more engaging.
Thank you for the feedback; we’re glad to hear you found the visuals engaging! We’ll keep that in mind with our future pieces as well. We hope you’ll come back to read more!