Advancing Computational Productivity Through Automation
This talk will describe the challenges in the area of scientific workflows, including how they are used to advance science in a number of domains, and how state-of-the-art software systems, such as Pegasus, meet the application and computing infrastructure challenges. Pegasus enables scientists to describe the workflows in an abstract, resource-independent way. That description includes the definition of the workflow steps and the data they take in and generate, but does not include low-level cyber-infrastructure information. Given the abstract workflow description and the information about the execution environment (composed of potentially distributed data sources and systems), a planner can map the computational tasks onto the available resources and plan the movement of data across distributed resources. The planning process also opens up opportunities for performance optimization and fault-tolerance. The talk will describe example applications, including LIGO, the gravitational-wave physics experiment that recently confirmed the existence of gravitational waves. The talk will touch upon the issues the applications face, and how Pegasus can help them execute in a number of different environments: campus clusters, distributed resources, and clouds.
Ewa Deelman is a Research Associate Professor at the University of Southern California (USC) Computer Science Department and a Research Director, at the USC Information Sciences Institute (ISI). Dr. Deelman's research interests include the design and exploration of collaborative, distributed scientific environments, with particular emphasis on automation of scientific workflow and management of computing resources, as well as the management of scientific data. Her work involves close collaboration with researchers from a wide spectrum of disciplines. At ISI she leads the Science Automation Technologies group that is responsible for the development of the Pegasus Workflow Management software. In 2007, Dr. Deelman edited Workflows in e-Science: Scientific Workflows for Grids, which was published by Springer. She is also the founder of the annual Workshop on Workflows in Support of Large-Scale Science, which is held in conjunction with the Super Computing conference. In 1997, Dr. Deelman received her Ph.D. in Computer Science from the Rensselaer Polytechnic Institute.
- Cancer Research Data Commons and Other NCI Infrastructures in Support of Data ScienceSeptember 19, 2021AttentiveChrome: Deep Learning for Predicting Gene Expression from Histone ModificationsSeptember 22, 2021“Le Grand et Le Petit”: Splicing Factors SF3B1 and SUGP1 and Their Cancer Mutations Leading to Aberrant Acceptor UsageSeptember 22, 2021The Future of Clinical Trial Data Sharing.... The Art of The PossibleSeptember 23, 2021Genomic Data Commons Single Cell RNA-Seq SupportSeptember 27, 2021Virtual Workshop on Next-Generation Sequencing and Radiomics: Resource Requirements for Acceleration of Clinical Applications Including AISeptember 29, 2021 - September 30, 2021