
NSF Org: |
RISE Integrative and Collaborative Education and Research (ICER) |
Recipient: |
|
Initial Amendment Date: | August 26, 2019 |
Latest Amendment Date: | August 26, 2019 |
Award Number: | 1928288 |
Award Instrument: | Standard Grant |
Program Manager: |
Eva Zanzerkia
RISE Integrative and Collaborative Education and Research (ICER) GEO Directorate for Geosciences |
Start Date: | September 1, 2019 |
End Date: | August 31, 2024 (Estimated) |
Total Intended Award Amount: | $331,932.00 |
Total Awarded Amount to Date: | $331,932.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
1 E JACKSON BLVD CHICAGO IL US 60604-2287 (312)362-7388 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
243 S. Wabash Ave. Chicago IL US 60604-2301 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | EarthCube |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.050 |
ABSTRACT
For science to reliably support new discoveries, its results must be reproducible. This has proven to be a challenge in many fields including, most notably, fields that rely on computational studies as a means for supporting new discoveries. Reproducibility in these studies is particularly difficult because they require open sharing of data and models and careful control by the original researcher. This is to ensure that products can be run on later generations of hardware and software and produce consistent results. This project will develop software that helps support computational reproducibility and makes it easier and more efficient for geoscientists to preserve, share, repeat and replicate scientific computations. The Broader Impacts of this project include a collaboration between computer scientists, hydrologists and the Consortium of Universities for the Advancement of Hydrologic Science Inc. (CUAHSI) for the hydrology research community. With over 3500 users, and holding over 8000 model and data resources, this collaboration will bring improved tools and best practices to a broad and diverse community of geoscientists. Beyond hydrology, the methods and tools developed as part of this project have the potential to be extended to the solid Earth and space science geoscience domains. They also have the potential to inform the reproducibility evaluation process as currently undertaken by journals and publishers. The projct will also conduct workshops to train researchers and be used in the classroom at Utah Sate Universtiy, DePaul University and the University of Virginia.
Emphasis on the importance of research reproducibility is steadily rising, however many studies still continue to not be reproducible. Reproducibility in computational studies is particularly difficult because of the challenges involved in completely documenting the data, models and procedures used together with the underlying hardware and software dependencies. The reproducibility workbench software (ReproBench) developed in this project will address reproducibility questions by establishing a container-based reproducible workflow that will make it easy and efficient for geoscientists to verify scientific results. Automation and documentation are two key methods for improving verification and, in general, the conduct of reproducible science. This project will build-from past investments: (I) automated containerization methods, through the Sciunit project, and (II) well-documented, community-adopted interfaces, through HydroShare, and bring these investments together to establish a novel, robust, and reproducible workflow. By applying this workflow to water-related science use cases, this project will demonstrate how to preserve, share, repeat, and replicate scientific results. The interfaces can become an exemplar for other community cyberinfrastructure that, akin to Hydrology, aims to share data and models at a large scale. In establishing this workflow, the ReproBench project team combines expertise in cyberinfrastructure, domain science, and reproducible computational data science. By leveraging Sciunit, ReproBench brings formal methods for the conduct of reproducible computational science into the geosciences.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
This project improved the conduct of reproducible science by evaluating, developing and advancing container technology within domain-specific cyberinfrastructure. Through this funding we were able to answer questions such as: (i) Can published results be repeated and replicated in different environments? (ii) If yes, how similar are they to previous results? (iii) Can we further verify the results by extending them and allowing scientists to make meaningful changes that are automatically and systematically tracked?
We answered these questions within the context of virtual environments and containers, which are increasingly used as FAIR methods for the conduct of reproducible science. The project achieved the following goals: (i) It developed a novel container technology for notebook-based scientific workflows, (ii) It showed how application virtualization scales in comparison to moden commercial containerization technology, and finally (iii) it applied the developed container technology to a rich set of Hydrology use cases available via the community HydroShare portal, both showing the innumerable studies that can be reproduced and also showing the limitation that comparing statistical results is challenging.
The impact of this project was achieved in technical communities of computing and data management and Hydrology and broader geoscience. The outcomes were in the form of publications (8), presentations (5), and keynote talks (2). It lead to a successful collabration which show-cased value in conducting cross-disciplinary science. The project contributed to the professional development of 10 MS, 2 PhD students and 1 post doc during its span.
Last Modified: 11/20/2024
Modified by: Tanu Malik
Please report errors in award information by writing to: awardsearch@nsf.gov.