escience2019 has ended
Wednesday, September 25 • 1:00pm - 1:30pm
Custom Execution Environments with Containers in Pegasus-Enabled Scientific Workflows

Sign up or log in to save this to your schedule and see who's attending!

Karan Vahi (University of Southern California), Mats Rynge (University of Southern California), George Papadimitriou (University of Southern California), Duncan Brown (Syracuse University), Rajiv Mayani (University of Southern California), Rafael Ferreira da Silva (University of Southern California), Ewa Deelman (University of Southern California), Anirban Mandal (University of North Carolina), Eric Lyons (University of Massachusetts at Amherst), and Michael Zink (University of Massachusetts at Amherst)

Science reproducibility is a cornerstone feature in scientific workflows. In most cases, this has been implemented as a way to exactly reproduce the computational steps taken to reach the final results. While these steps are often completely described, including the input parameters, datasets, and codes, the environment in which these steps are executed is only described at a higher level with endpoints and operating system name and versions. Though this may be sufficient for reproducibility in the short term, systems evolve and are replaced over time, breaking the underlying workflow reproducibility. A natural solution to this problem is containers, as they are well defined, have a lifetime independent of the underlying system, and can be user-controlled so that they can provide custom environments if needed. This paper highlights some unique challenges that may arise when using containers in distributed scientific workflows. Further, this paper explores how the Pegasus Workflow Management System implements container support to address such challenges.


Karan Vahi

University of Southern California

Wednesday September 25, 2019 1:00pm - 1:30pm
Boardroom West