Marshaling Public Data for Lean and Powerful Splicing Studies

Presentation/Conference

January 16, 2020 11:00 a.m. - 12:00 p.m. ET

NCI's Center for Cancer Research (CCR), through the Bioinformatics Training and Education Program (BTEP) Distinguished Speaker Series, will present "Marshaling Public Data for Lean and Powerful Splicing Studies."

The Sequence Read Archive (SRA) now contains over a million accessions. Such archives are potential gold mines for researchers, but they are not organized for everyday use by scientists. The situation resembles the early days of the World Wide Web before search engines made the web easy to use. Ben Langmead will describe his team's work* on making large public RNA sequencing datasets easy to use. He will explain their multi-layered design, with one layer for scalable and uniform analysis (Rail-RNA), another for forming easy-to-use summarized (recount2), and a third for indexing the summaries and making them queryable (Snaptron). Altogether, the system allows scientists to pose scientific questions over vast gene expression and splicing summaries. He will describe collaborations where these tools were applied to:

evaluate hypotheses about prevalence or specificity of splicing patterns;
characterize completeness of the gene annotations we use to understand splicing patterns;
reveal patterns in public data that ultimately changed the study design and allowed more targeted hypotheses to be tested with less new data generation.

*This presentation describes joint work with Chris Wilks, Abhinav Nellore, Jonathan Ling, Seth Blackshaw, Luigi Marchionni, Jeff Leek, Kasper Hansen, Andrew Jaffe, and others.

Webex is open to the public; NIH employees are encouraged to register.

Ben Langmead

Ben Langmead is a computational biologist and associate professor in the Computational Biology and Medicine Group at Johns Hopkins University.

SUBSCRIBE TO UPDATES

Upcoming Events

The Human Tumor Atlas Network (HTAN): Exploring Tumor Evolution in Time and Space
January 14, 2025

NCI Office of Data Sharing Webinar: Navigating Data Access and Best Practices
January 15, 2025

The 2025 AACI Catchment Area Data Excellence (CADEx) Conference
January 29, 2025 - January 31, 2025

Innovation and AI in Oncology
January 29, 2025

NCI Symposium on Translational Technologies for Global Health
March 19, 2025 - March 20, 2025

See all upcoming events

Marshaling Public Data for Lean and Powerful Splicing Studies

SUBSCRIBE TO UPDATES

Upcoming Events

CATEGORIES

Past Events

Marshaling Public Data for Lean and Powerful Splicing Studies

SUBSCRIBE TO UPDATES

Upcoming Events

CATEGORIES

Past Events

Follow Us on LinkedIn