Cancer Data Science Pulse

Visualizing RNA-seq Data—Pro-Tips From an NCI Bioinformatics Engineer

Data Sharing

Informatics Tools

Training

October 5, 2022

In this new blog series, we're posting samples and tips on visualizing cancer data. To kick us off, Dr. Alida Palmisano, a bioinformatics engineer in the Biometric Research Program in NCI’s Computational and Systems Biology Branch, Division of Cancer Treatment and Diagnosis, shares her ideas for visualizing complex single-cell RNA-sequencing (scRNA-seq) data.

Our hope is that these images will spark your imagination when showcasing your research data.

Data from Dong R, et al. Single-Cell Characterization of Malignant Phenotypes and Developmental Trajectories of Adrenal Neuroblastoma. Cancer Cell. 9;38(5):716-733, 2020, PMID: 32946775.

What type of graphic is it?

It’s a composition of various plots for single cell transcriptome sequencing data. scRNA-seq measures the RNA molecules within each cell of a given sample to provide a snapshot of the cells’ transcriptome (i.e., the genes that are being transcribed when the cells are collected).

Pro-Tip: High-dimensional data like scRNA-seq are challenging to show. Using visualization strategies that work together (like a puzzle) can help piece together helpful biological insights.

Why is the graphic important?

With scRNA-seq, we can use a single experiment to capture a moment in time in highly heterogenous tissues. We can see the genes that are being transcribed (i.e., the transcriptome), composing many dynamic biological processes. The high dimensionality of the data (e.g., large numbers of genes and cells, complex biological processes) is a challenge that requires a variety of visualization strategies, which have to work together (like a puzzle) to reveal helpful biological insights.

Pro-Tip: Use each visualization strategy to generate a hypothesis. Make sure that each “puzzle piece” fits together to fully address your research question.

How did you create it?

I generated the figures with R, using a popular package for single cell data analysis called Seurat. Additional visualization like bar plots were generated using ggplot.

What should I consider when visualizing this kind of data?

Remember that the underlying data are extremely high dimensional. Using a single visualization approach will give you a limited view of the information, which can be extremely biased! Select a visualization strategy that addresses your hypothesis and make sure that each “piece of the puzzle” fits together in a way that leads to useful information. Also, remember that all the tools have many tunable parameters that may greatly impact the way the figures look and ultimately the hypothesis you derive from them.

Pro-Tip: My favorite visualization type is bar charts. Charts can embed a lot of complexity without affecting their overall intuitive interpretation.

What’s your favorite and why?

In general, my favorite visualization type is bar charts because they can embed a lot of complexity without affecting their overall intuitive interpretation. You can use colors, patterns, order, orientation, and much more to convey both simple and complex messages. However, bar charts, as with any other visualization type, have their limitations. Always remember to visualize the same data in several different ways to see which combination of techniques tells the story you want to share.

Alida Palmisano, Ph.D.

Bioinformatics Engineer, Biometric Research Program, Computational and Systems Biology Branch, Division of Cancer Treatment and Diagnosis, NCI

Monika Davare on October 07, 2022 at 11:28 a.m.

Enjoyed reading this content. Helpful.

CBIIT Staff on October 18, 2022 at 03:10 p.m.

We’re glad you found the blog helpful! We have several additional blogs on informatics tools and data, if you’d like to explore more https://datascience.cancer.gov/news-events/blog?blog_category_id=36

Cancer Data Science Pulse

Visualizing RNA-seq Data—Pro-Tips From an NCI Bioinformatics Engineer

What type of graphic is it?

Why is the graphic important?

How did you create it?

What should I consider when visualizing this kind of data?

What’s your favorite and why?

Leave a Reply

Monika Davare on October 07, 2022 at 11:28 a.m.

CBIIT Staff on October 18, 2022 at 03:10 p.m.

SUBSCRIBE TO UPDATES

Categories

Archive

Cancer Data Science Pulse

Visualizing RNA-seq Data—Pro-Tips From an NCI Bioinformatics Engineer

What type of graphic is it?

Why is the graphic important?

How did you create it?

What should I consider when visualizing this kind of data?

What’s your favorite and why?

Leave a Reply

Monika Davare on October 07, 2022 at 11:28 a.m.

CBIIT Staff on October 18, 2022 at 03:10 p.m.

SUBSCRIBE TO UPDATES

Categories

Archive

Follow Us on LinkedIn