Award Abstract # 2103508
Collaborative Research: Elements: Simulation-driven Evaluation of Cyberinfrastructure Systems

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: UNIVERSITY OF SOUTHERN CALIFORNIA
Initial Amendment Date: June 30, 2021
Latest Amendment Date: October 13, 2022
Award Number: 2103508
Award Instrument: Standard Grant
Program Manager: Marlon Pierce
mpierce@nsf.gov
 (703)292-7743
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: August 1, 2021
End Date: July 31, 2025 (Estimated)
Total Intended Award Amount: $315,027.00
Total Awarded Amount to Date: $315,027.00
Funds Obligated to Date: FY 2021 = $315,027.00
History of Investigator:
  • Ewa Deelman (Principal Investigator)
    deelman@isi.edu
  • Loic Pottier (Former Principal Investigator)
  • Rafael Ferreira da Silva (Former Principal Investigator)
Recipient Sponsored Research Office: University of Southern California
3720 S FLOWER ST FL 3
LOS ANGELES
CA  US  90033
(213)740-7762
Sponsor Congressional District: 34
Primary Place of Performance: University of Southern California
4676 Admiralty Way, Ste 1001
Marina del Rey
CA  US  90292-6611
Primary Place of Performance
Congressional District:
36
Unique Entity Identifier (UEI): G88KLJR3KYT5
Parent UEI:
NSF Program(s): Software Institutes
Primary Program Source: 01002122DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 9102, 8004, 7923, 077Z
Program Element Code(s): 800400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Most scientific breakthroughs and discoveries are now preconditioned on performing complex processing of vast amounts of data as conveniently, reliably, and efficiently as possible. This requires high-end interconnected compute and storage resources, as well as software systems to automate the processing on these resources. An enormous amount of effort has been invested in producing such "cyberinfrastructure" software systems. And yet, developing and evolving these systems so that they are as efficient as possible, while anticipating future cyberinfrastructure opportunities and needs, is an open challenge. This project transforms the way in which these systems are evaluated, so that their capabilities can be developed and evolved judiciously. The traditional evaluation approach is to observe executions of these systems on real-world hardware resources. Although seemingly natural, this approach suffers from many shortcomings. Instead, this project focuses on simulating these executions. Simulation has tremendous, and untapped, potential for transforming the development cycle of cyberinfrastructure systems. Specifically, this project produces software elements that can be easily integrated into existing and future systems to afford them with simulation capabilities. These capabilities make it possible for developers to put their systems through the wringer and observe their behaviors for arbitrary operating conditions, including ones that go beyond current hardware platforms and scientific applications. Simply put, these capabilities will make it possible to establish a solid experimental science approach for the development of cyberinfrastructure systems that support current and future scientific endeavors that are critical to the development of our society.

The cyberinfrastructure has been the object of intensive research and development, resulting in a rich set of interoperable software systems that are used to support science. A key challenge is the development of systems that can execute application workloads efficiently, while anticipating future cyberinfrastructure opportunities and needs. This project aims to transform the way in which these systems are evaluated, so that their capabilities can be evolved based on a sound, quantitative experimental science approach. The traditional evaluation approach is to use full-fledged software stacks to execute application workloads on actual cyberinfrastructure deployments. Unfortunately, this approach suffers from several shortcomings: real-world experiments are time- and labor-intensive, and they are limited to currently available hardware and software configurations. An alternative to real-world experiments that does not suffer from these shortcomings is simulation, i.e., the implementation and use of a software artifact that models the functional and performance behaviors of software and hardware stacks of interest. This project uses simulation to transform the way in which cyberinfrastructure systems are evaluated as part of their long-term development cycles. This is achieved via software elements for enhancing production cyberinfrastructure systems with simulation capabilities so as to enable quantitative evaluation of these systems for arbitrary execution scenarios. Creating these scenarios requires little labor, and executions can be simulated accurately and orders of magnitude faster than their real-world counterparts. Furthermore, simulations are perfectly reproducible and observable. While this approach is general, its effectiveness will be demonstrated by applying it to a number of production systems, namely, workflow management systems. This project capitalizes on the years of development invested in the SimGrid and WRENCH simulation frameworks.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Coleman, Tainã and Casanova, Henri and Pottier, Loïc and Kaushik, Manav and Deelman, Ewa and Ferreira da Silva, Rafael "WfCommons: A framework for enabling scientific workflow research and development" Future Generation Computer Systems , v.128 , 2022 https://doi.org/10.1016/j.future.2021.09.043 Citation Details
Casanova, Henri and Deelman, Ewa and Gesing, Sandra and Hildreth, Michael and Hudson, Stephen and Koch, William and Larson, Jeffrey and McDowell, Mary Ann and Meyers, Natalie and Navarro, John-Luke and Papadimitriou, George and Tanaka, Ryan and Taylor, Ia "Emerging Frameworks for Advancing Scientific Workflows Research, Development, and Education" 2021 IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS) , 2021 https://doi.org/10.1109/WORKS54523.2021.00015 Citation Details
Casanova, H. and Wong Y. C. and Pottier, L. and Ferreira da Silva, R. "On the Feasibility of Simulation-driven Portfolio Scheduling for Cyberinfrastructure Runtime Systems" Job Scheduling Strategies for Parallel Processing , 2022 Citation Details

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page