Cancer Data Science Pulse

A Quick Start Guide to Cancer Data Science for Clinical Oncology

We’re excited to share new updates and reading material to this blog to help you in your research! Scroll through to see our latest additions.

Whether you are in the data science field, interested in developing computational solutions for clinical oncology, or a clinical researcher, we’ve curated a list of data sets, tools, and learning resources to showcase how these disciplines can and are working together to empower cancer research. Cancer data science can drive clinical oncology forward. For example, researchers develop computer models to serve as a cancer patient’s digital twin to capture real-time dynamics to create predictive models and, one day, guide treatment decisions. Also, national sequencing efforts could equip clinicians with the knowledge of how a patient’s genome impacts their response to medications.

Explore Clinical and Biological Data Online

With so much data available, we’ve pulled together a list of links to data sets for common cancer types across some of our NCI Cancer Research Data Commons (CRDC) resources. Collectively, these resources offer access to more than one million files of experimental and clinical data from landmark NCI studies and other grantee projects. Some of the clinical attributes include:

  • demographics,
  • diagnoses,
  • treatments, and
  • environmental exposure.

You can explore high-level trends in these data sets directly from the online portals or analyze the data with the library of robust computational tools provided through NCI’s Cloud Resources.

Having trouble understanding what a particular term means? Our semantics resources and services provide definitions of common data elements and standard data ontologies like the Clinical Data Interchange Standards Consortium (CDISC).
Cancer Site

Genomic Data Commons*

(Genomic and clinical data)

Proteomic Data Commons

(Proteomic and clinical data)

Imaging Data Commons

(Medical imaging, digital pathology, and clinical annotations)

Lung5,331 Cases437 Cases4,728 Cases
Breast4,054 Cases313 Cases15,808 Cases
Colorectal2,937 Cases195 Cases1,929 Cases
Kidney2,421 Cases261 Cases1,378 Cases
Pancreas1,205 Cases282 Cases558 Cases

*Note: To find the links for each cancer site in the Genomic Data Commons, visit the exploration page to build a cohort.

Looking for NCI-funded clinical or translational data? NCI’s CRDC has launched its new Clinical and Translational Data Commons! To learn more about the available data and features, read the short announcement.

Find Bioinformatics Tools for Predictive Oncology

In addition to what the CRDC offers, there are many analytical tools and pipelines to help you mine and extract meaningful insights from clinical data. We’ve added a few selections from our partners focused on supporting predictive and precision oncology analysis, but you can find more tools through the “Resources for Researchers” search engine.

  • Accelerating Therapeutics for Opportunities in Medicine (ATOM) Consortium: This public-private consortium has developed the ATOM Modeling PipeLine (AMPL), an open source, free-to-use software for building and sharing models that advance in silico drug discovery.
  • NCI-Department of Energy (DOE) Collaboration: NCI and DOE developed predictive artificial intelligence (AI) and machine learning (ML) models of drug responses in pre-clinical models of cancer to improve and expedite the selection and development of new targeted therapies. Its tools are available online through their capabilities catalog.
  • Informatics Technology for Cancer Research (ITCR): This trans-NCI program supports investigator-initiated and research-driven informatics tool development. The ITCR program offers several resources for clinical research, including tools for integrating and analyzing electronic medical records, databases cataloging clinically actionable information for personalized cancer therapy, and training courses to advance your skills.
  • Childhood Cancer Data Initiative (CCDI) Molecular Targets Platform (MTP): This knowledge base allows you to browse and identify associations among molecular targets, diseases, and drugs. Watch a recording on how to navigate, identify, and prioritize data in the CCDI MTP.

Learn About Cancer Data Science in Precision Oncology

If you’re new to the world of cancer data science and its application to clinical research, check out these blogs, news articles, and event recordings that cover some of the basics and examples of innovative work made possible through the intersection of these disciplines.

For regular updates on NCI’s cancer data science efforts, training events, and blog, subscribe to our weekly RSS.

Training

What makes a good AI model?

Read five tips on how to use AI in your research or clinical practice.

Cancer Data Science Lifecycle

Apply our six-stage cancer data science process to your research. You’ll learn about everything from data cleaning (including our blogs that outline the importance of semantics and common data elements) to predictive modeling.

101 Course

Take our free, easy-to-follow courses on critical skills for each of the data science life cycle. You’ll get beginner-level introductions to topics like cloud computing and popular data science technologies.

Training Guide Library

Explore instructional guides and resources that elaborate on each of the cancer data science lifecycle stages.

Data Science in Clinical Care

How can you apply data science to your clinical profession? Here are some examples:

Cancer Detection

Automated AI Model Aids in Early Detection of Pancreatic Cancer

Use AI to find undetectable cancers on scans of a normal-looking pancreas long before clinical symptoms are visible.

Seeking a Better Biopsy? NCI-Funded Researchers Are Using Machine Learning to Identify Exosome Biomarkers

Read how NCI-funded researchers used ML to characterize a cancer biomarker based on exosomes. Their biomarker worked well using non-invasive sources, such as blood and urine, allowing the researchers to catch cancer early, even in tumors of undetermined origins.

Cancer Diagnosis

Biomedical Data Fusion Lab
Watch the recording and learn about leveraging data at different scales for personalized diagnosis, prognosis, and therapy in oncology and neuroscience.

NCI-Funded Researchers Develop a New Model for Interpreting Pathology
Classify cancer and predict its progress through a new NCI-funded approach. This approach helps annotate and analyze whole-slide images to contribute to your research. You can get the pre-trained model and see demonstrations, too.

Cancer Treatment

NCI Uses AI to Take the Guesswork Out of Assessing Prostate Cancer Images

See how our AI model offers another tool to help predict if cancer will spread or re-occur, giving us vital information for monitoring disease and tailoring treatment to each.

Affordable, Interpretable, and Equitable AI for Precision Oncology

Get advice by watching a video on utilizing AI to forecast patient response to treatment and track their progress in a clinical setting.

Machine Learning in Cancer Care Delivery: Moving from Model Validation to Clinical Workflow

Watch a video to learn how to transition from creating and validating ML tools and incorporate them into patient care.

Theranostics and AI—The Next Advance in Cancer Precision Medicine

Learn how AI and data are helping researchers “see” cancer in a new way, resulting in a more precise way of targeting cancer treatment. By using molecular imaging, we can identify areas of tumors most likely to respond to treatment and select the most effective therapies in those instances.

Are we missing resources you would want to see? We may already have it. Leave a comment and we’ll follow-up with additional information.
NCI CBIIT Staff
Older Post
The Good, the Bad, and the Unexplained: Five Tips for Evaluating an AI Product

Leave a Reply

Vote below about this page’s helpfulness.

Your email address will not be published.

Thank you for letting me know that computer models are seen to guide treatment decisions one day. My friend wants her cancer treatment to be effective. I think it's best for her to seek guidance from an oncology specialist.
https://ctradonc.com/
We’re glad you found the information helpful. Data science contributes greatly to the field of cancer research, and if you’d like to learn more, please check out our Cancer Data Science Pulse blog where we frequently explore this topic further: https://datascience.cancer.gov/news-events/blog
I read your blog. I found it very informative. I am a big fan of your blogs. I feel the blog aligns perfectly with our services. We are providing data science courses with real-work experience which is ideal for those who wish to have a career transition or start a fresh career path in data science along with a 100% job assurance commitment visit our website <a href=https://skillslash.com/data-science-course-in-pune> Data Science Course in pune </a>. These courses are wonderful for professionals.
Thank you for your interest in our blogs, we’re glad you’re a fan. Data scientists are integral to the future of cancer research and precision medicine!
"Cancer Data Science is playing an increasingly vital role in the field of Clinical Oncology, providing new insights and opportunities for precision medicine. With the use of advanced analytical methods and large amounts of data, we can now better understand the underlying biology of cancer, identify novel therapeutic targets and predict patient outcomes, ultimately leading to improved patient care and outcomes. We are excited about the potential for Cancer Data Science to revolutionize the way we diagnose and treat cancer."
https://www.technobridge.in/clinical-research-course.html



We’re excited about how cancer data science is advancing the ways we treat and diagnose cancer too! We look forward to continuing to provide our readers with helpful resources and current news about the field of data science for cancer research. Be sure to check out our other blogs too!
Thanks for sharing the details of this website
We're glad it’s been helpful for you!