Training Guide Library

Training Guide Library

In a hurry and need quick instructions on the cancer data science lifecycle stages? Browse this list of guides and resources.

Data Generation and Collection

Identify and gather the data you need to address a problem.

  • Beginner
  • Advanced
    • [Training Video Recording] WebMeV | Discover how to use this intuitive, web-based, bioinformatics analysis toolkit designed for non-bioinformaticians. This walkthrough includes steps on how to upload data files, run a single-cell analysis (using the tools available within the toolkit), and how to navigate/create public data sets available within MebMeV.

Data Cleaning

Fix discrepancies and handle missing values in your data.

  • Beginner

Data Exploration and Analysis

Study your data, then form a hypothesis.

  • Beginner
    • [Article] Exploring and Analyzing Data: The Basics | Get the fundamentals on what it is, why it matters, and how you can do it effectively.
    • [Training Video Recording] WebMeV | Receive a demonstration on how this web-based software for genomic data analysis can upload data and perform various analyses such as normalization, clustering, and principal component analysis.
    • [Training Video Recording] XNAT | Learn about this open source imaging informatics software platform that enables data ingestion, curation, annotation, quality control, and computational workflows using Docker containers.
    • [Training Video Recording] User-Friendly Analysis of Spatial Transcriptomics with spatialGE | Learn more about this user-friendly web application that integrates the spatial R package. This package, enhanced with additional spatial transcriptomics (ST) analysis methods (such as SpaGCN, STdeconvolve, and InSituType), makes it more valuable for the cancer research community.
  • Advanced
    • [Blog] An Introduction to Cloud Computing | Get tips on how to manage platform costs, access, and training. See also how NCI connects researchers to the cloud with the Cancer Research Data Commons.
    • [Training Video Recording] An Introduction to Gene Set Enrichment Analysis (GSEA) and the Molecular Signatures Database (MSigDB) | Learn how the GSEA analysis tool operates, how to use MSigDB to compare your data against well annotated gene sets, and how to run GSEA with MSigDB.
    • [Training Video Recording] TCIA Jupyter Learning Lab | Explore a variety of use cases for identifying The Cancer Imaging Archive (TCIA) data sets, and learn how to download them using Jupyter Notebooks. You’ll also learn how to utilize TCIA for data exploration and downloading data.
    • [Training Video Recording] TumorDecon | Discover how digital cytometry methods and their applications assist in tumor research.

Predictive Modeling

Use computational tools like machine learning models to make predictions with your data.

Data Visualization

Communicate your data findings using interactive images, plots, and charts.

Data Sharing

Accelerate discovery by making your data available to others.


Submit Feedback

Help us help you! If you believe content is missing or needs modifying, please let us know. Leave us a comment below, or send an email to NCI CBIIT.

Updated:
Vote below about this page’s helpfulness.