Award Abstract # 2103489
Collaborative Research: Elements: Simulation-driven Evaluation of Cyberinfrastructure Systems

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: UNIVERSITY OF HAWAII
Initial Amendment Date: June 30, 2021
Latest Amendment Date: June 30, 2021
Award Number: 2103489
Award Instrument: Standard Grant
Program Manager: Marlon Pierce
mpierce@nsf.gov
 (703)292-7743
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: August 1, 2021
End Date: July 31, 2025 (Estimated)
Total Intended Award Amount: $284,973.00
Total Awarded Amount to Date: $284,973.00
Funds Obligated to Date: FY 2021 = $284,973.00
History of Investigator:
  • Henri Casanova (Principal Investigator)
    henric@hawaii.edu
Recipient Sponsored Research Office: University of Hawaii
2425 CAMPUS RD SINCLAIR RM 1
HONOLULU
HI  US  96822-2247
(808)956-7800
Sponsor Congressional District: 01
Primary Place of Performance: University of Hawaii
1680 East-West Road, POST 317
Honolulu
HI  US  96822-2327
Primary Place of Performance
Congressional District:
01
Unique Entity Identifier (UEI): NSCKLFSSABF2
Parent UEI:
NSF Program(s): Software Institutes
Primary Program Source: 01002122DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 9150, 8004, 7923, 077Z
Program Element Code(s): 800400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Most scientific breakthroughs and discoveries are now preconditioned on performing complex processing of vast amounts of data as conveniently, reliably, and efficiently as possible. This requires high-end interconnected compute and storage resources, as well as software systems to automate the processing on these resources. An enormous amount of effort has been invested in producing such "cyberinfrastructure" software systems. And yet, developing and evolving these systems so that they are as efficient as possible, while anticipating future cyberinfrastructure opportunities and needs, is an open challenge. This project transforms the way in which these systems are evaluated, so that their capabilities can be developed and evolved judiciously. The traditional evaluation approach is to observe executions of these systems on real-world hardware resources. Although seemingly natural, this approach suffers from many shortcomings. Instead, this project focuses on simulating these executions. Simulation has tremendous, and untapped, potential for transforming the development cycle of cyberinfrastructure systems. Specifically, this project produces software elements that can be easily integrated into existing and future systems to afford them with simulation capabilities. These capabilities make it possible for developers to put their systems through the wringer and observe their behaviors for arbitrary operating conditions, including ones that go beyond current hardware platforms and scientific applications. Simply put, these capabilities will make it possible to establish a solid experimental science approach for the development of cyberinfrastructure systems that support current and future scientific endeavors that are critical to the development of our society.

The cyberinfrastructure has been the object of intensive research and development, resulting in a rich set of interoperable software systems that are used to support science. A key challenge is the development of systems that can execute application workloads efficiently, while anticipating future cyberinfrastructure opportunities and needs. This project aims to transform the way in which these systems are evaluated, so that their capabilities can be evolved based on a sound, quantitative experimental science approach. The traditional evaluation approach is to use full-fledged software stacks to execute application workloads on actual cyberinfrastructure deployments. Unfortunately, this approach suffers from several shortcomings: real-world experiments are time- and labor-intensive, and they are limited to currently available hardware and software configurations. An alternative to real-world experiments that does not suffer from these shortcomings is simulation, i.e., the implementation and use of a software artifact that models the functional and performance behaviors of software and hardware stacks of interest. This project uses simulation to transform the way in which cyberinfrastructure systems are evaluated as part of their long-term development cycles. This is achieved via software elements for enhancing production cyberinfrastructure systems with simulation capabilities so as to enable quantitative evaluation of these systems for arbitrary execution scenarios. Creating these scenarios requires little labor, and executions can be simulated accurately and orders of magnitude faster than their real-world counterparts. Furthermore, simulations are perfectly reproducible and observable. While this approach is general, its effectiveness will be demonstrated by applying it to a number of production systems, namely, workflow management systems. This project capitalizes on the years of development invested in the SimGrid and WRENCH simulation frameworks.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 12)
Monniot, Julien and Tessier, François and Casanova, Henri and Antoniu, Gabriel "Simulation of Large-Scale HPC Storage Systems: Challenges and Methodologies" , 2024 https://doi.org/10.1109/HiPC62374.2024.00031 Citation Details
Casanova, H. and K. Berney, K. and Chastel, S. and R. Ferreira da Silva, Rafael "WfCommons: Data Collection and Runtime Experiments using Multiple Workflow Systems" 1st IEEE International Workshop on Workflows in Distributed Environments (WiDE) , 2023 Citation Details
Casanova. H. and Wong Y. C. and Pottier, L. and Ferreira da Silva, R. "On the Feasibility of Simulation-driven Portfolio Scheduling for Cyberinfrastructure Runtime Systems" 25th Workshop on Job Scheduling Strategies for Parallel Processing , 2022 Citation Details
Casanova, Henri and Berney, Kyle and Chastel, Serge and Da Silva, Rafael Ferreira "WfCommons: Data Collection and Runtime Experiments using Multiple Workflow Systems" 1st IEEE International Workshop on Workflows in Distributed Environments , 2023 https://doi.org/10.1109/COMPSAC57700.2023.00290 Citation Details
Casanova, Henri and Deelman, Ewa and Gesing, Sandra and Hildreth, Michael and Hudson, Stephen and Koch, William and Larson, Jeffrey and McDowell, Mary Ann and Meyers, Natalie and Navarro, John-Luke and Papadimitriou, George and Tanaka, Ryan and Taylor, Ia "Emerging Frameworks for Advancing Scientific Workflows Research, Development, and Education" 2021 IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS) , 2021 https://doi.org/10.1109/WORKS54523.2021.00015 Citation Details
Casanova, Henri and Giersch, Arnaud and Legrand, Arnaud and Quinson, Martin and Suter, Frédéric "Lowering entry barriers to developing custom simulators of distributed applications and platforms with SimGrid" Parallel Computing , v.123 , 2025 https://doi.org/10.1016/j.parco.2025.103125 Citation Details
Coleman, Tainã and Casanova, Henri and Ferreira da Silva, Rafael "Automated generation of scientific workflow generators with WfChef" Future Generation Computer Systems , v.147 , 2023 https://doi.org/10.1016/j.future.2023.04.031 Citation Details
Coleman, Taina and Casanova, Henri and Maheshwari, Ketan and Pottier, Loic and Wilkinson, Sean R. and Wozniak, Justin and Suter, Frederic and Shankar, Mallikarjun and Da Silva, Rafael Ferreira "WfBench: Automated Generation of Scientific Workflow Benchmarks" 13th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer System (PBMS) , 2022 https://doi.org/10.1109/PMBS56514.2022.00014 Citation Details
Coleman, Tainã and Casanova, Henri and Pottier, Loïc and Kaushik, Manav and Deelman, Ewa and Ferreira da Silva, Rafael "WfCommons: A framework for enabling scientific workflow research and development" Future Generation Computer Systems , v.128 , 2022 https://doi.org/10.1016/j.future.2021.09.043 Citation Details
Horzela, Maximilian and Casanova, Henri and Giffels, Manuel and Gottmann, Artur and Hofsaess, Robin and Quast, Günter and Rossi_Tisbeni, Simone and Streit, Achim and Suter, Frédéric "Modeling Distributed Computing Infrastructures for HEP Applications" EPJ Web of Conferences , v.295 , 2024 https://doi.org/10.1051/epjconf/202429504032 Citation Details
McDonald, Jesse and Dobbs, John and Wong, Yick Ching and da_Silva, Rafael Ferreira and Casanova, Henri "An exploration of online-simulation-driven portfolio scheduling in workflow management systems" Future Generation Computer Systems , 2024 https://doi.org/10.1016/j.future.2024.07.005 Citation Details
(Showing: 1 - 10 of 12)

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page