Award Abstract # 1664061
Collaborative Research: SI2-SSI: Cyberinfrastructure for Advancing Hydrologic Knowledge through Collaborative Integration of Data Science, Modeling and Analysis

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: UTAH STATE UNIVERSITY
Initial Amendment Date: September 14, 2017
Latest Amendment Date: August 23, 2018
Award Number: 1664061
Award Instrument: Standard Grant
Program Manager: Seung-Jong Park
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2017
End Date: September 30, 2022 (Estimated)
Total Intended Award Amount: $2,760,000.00
Total Awarded Amount to Date: $2,809,998.00
Funds Obligated to Date: FY 2017 = $2,760,000.00
FY 2018 = $49,998.00
History of Investigator:
  • David Tarboton (Principal Investigator)
    david.tarboton@usu.edu
  • Alva Couch (Co-Principal Investigator)
  • Daniel Ames (Co-Principal Investigator)
  • Jeffery Horsburgh (Co-Principal Investigator)
  • Martyn Clark (Co-Principal Investigator)
Recipient Sponsored Research Office: Utah State University
1000 OLD MAIN HL
LOGAN
UT  US  84322-1000
(435)797-1226
Sponsor Congressional District: 01
Primary Place of Performance: Utah State University
8200 Old Main Hill
Logan
UT  US  84322-8200
Primary Place of Performance
Congressional District:
01
Unique Entity Identifier (UEI): SPE2YDWHDYU4
Parent UEI:
NSF Program(s): Hydrologic Sciences,
Special Initiatives,
EAR-Earth Sciences Research,
Software Institutes,
EarthCube
Primary Program Source: 01001718DB NSF RESEARCH & RELATED ACTIVIT
01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 026Z, 8004, 7433, 7556, 8009
Program Element Code(s): 157900, 164200, 689800, 800400, 807400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Researchers across the country and around the world expend tremendous resources to gather and analyze vast stores of hydrologic data and populate a myriad of models to better understand hydrologic phenomena and find solutions to vexing water problems. Each of those researchers has limited money, time, computational capacity, data storage, and ability to put that data to productive use. What if they could combine their efforts to make collaboration easier? What if those collected data sets and processed model outputs could be used collaboratively to help advance hydrologic understanding beyond their original purpose? HydroShare is a system to advance hydrologic science by enabling the scientific community to more easily and freely share products resulting from their research, not just the scientific publication summarizing a study, but also the data and models used to create the scientific publication. HydroShare supports the sharing and publication of hydrologic data and models. This capability is necessary for community model development, execution, and evaluation and to improve reproducibility and community trust in scientific findings through transparency. As a platform for collaboration and running models on advanced computational infrastructure, HydroShare enhances the capability for data intensive research in hydrology and other aligned sciences. HydroShare is designed to help researchers easily meet the sharing requirements of data management plans while at the same time providing value added functionality that makes metadata capture more effective and helps researchers improve their work productivity. This project will extend the capabilities of the HydroShare cyberinfrastructure to enhance support for scientific methods, advance the social capabilities of HydroShare to enable improved collaborative research, integrate with 3rd party consumer data storage systems to provide more flexible and sustainable data storage. and establish an application testing environment to empower researchers to develop their own computer programs to act on and work with data in HydroShare. Empowering HydroShare users with the ability to rapidly develop web application programs opens the door to unforeseen, innovative combinations of data and models. WRF-Hydro, the framework for the NOAA National Water Model, will be used as a use case for collaboration on model development. Since WRF-Hydro is used by NOAA as part of the National Water Model (NWM), this collaboration opens possibilities for transfer of research to operations. Collectively, this functionality will provide a computing framework for transforming the practice of broad science communities to leverage advances in data science and computation and accelerate discovery.

HydroShare is a system for sharing hydrologic data and models aimed at giving hydrologists the cyberinfrastructure needed to manage data, innovate and collaborate in research to solve water problems. It addresses the challenges of sharing data and hydrologic models to support collaboration and reproducible hydrologic science through the publication of hydrologic data and models. With HydroShare users can: (1) share data and models with colleagues; (2) manage who has access to shared content; (3) share, access, visualize and manipulate a broad set of hydrologic data types and models; (4) use the web services interface to program automated and client access; (5) publish data and models to meet the requirements of research project data management plans; (6) discover and access data and models published by others; and (7) use web apps to visualize, analyze, and run models on data. This project will extend the capabilities of HydroShare to: (1) enhance support for scientific methods enabling systematic data and model analysis and hypothesis testing; (2) advance the social capabilities of HydroShare to enable improved collaborative research; (3) integrate with 3rd party consumer data storage systems to provide more flexible and sustainable data storage; and (4) establish an application testing environment to empower researchers to develop their own computer programs to act on and work with data in HydroShare. Under development since 2012 and first released in 2014, HydroShare supports the sharing and publication of hydrologic data and models. This capability is necessary for community model development, execution, and evaluation. As a platform for collaboration and cloud based computation on network servers remote from the user, HydroShare enhances the capability for data intensive research in hydrology and other aligned sciences. HydroShare is innovative from a computer science and CI perspective in the way computation and data sharing are framed as a network computing platform that integrates data storage, organization, discovery, and programmable actions through web applications (web apps). Support for these three key elements of computation allows researchers to easily employ services beyond the desktop to make data storage and manipulation more reliable and scalable, while improving ability to collaborate and reproduce results. The generation of new understanding, through integration of information from multiple sources and reuse and collaborative enrichment of research data and models, will be enhanced. Structured and systematic model process intercomparisons and alternative hypothesis testing will be enabled, bringing, through user friendly CI, the latest thinking in advancing hydrologic modeling to a broad community of earth science researchers, thereby transforming research practices and the knowledge generated from this research. Interoperability with consumer cloud storage will greatly ease entry of content into HydroShare and support its sustainability. This meshing of the rigorous metadata model of HydroShare with consumer file sharing will enhance reproducibility as well as provide an innovative mechanism for sharing and collaboration. Empowering HydroShare users with the ability to rapidly develop web apps opens the door to unforeseen, innovative combinations of data and models. WRF- Hydro will be used as a use case for collaboration on model development. WRF-Hydro provides a reach-based high resolution representation of hydrologic processes, and offers the potential to bring together scientists working at scales from research catchments on the order of 1 to 100s of square kilometers as well as those working at regional to continental scales and cut across disciplines from environmental engineering to aquatic ecologists. Since WRF-Hydro is used by NOAA as part of the National Water Model (NWM), this collaboration opens possibilities for transfer of research to operations. This project will adapt current best practices in CI for interoperability and extensibility to serve this multidisciplinary community of scientists. HydroShare has already had a broader impact, with documented rapid growth in use and uptake by other projects including in EarthCube. It will become sustainable community CI through operation as part of the Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) Water Data Center (WDC) facility. The use of WRF- Hydro/NWM, as a driving use case, will advance CI for community based model improvement. Through the Summer Young Innovators Program at the National Water Center (NWC), supported by the National Weather Service (NWS) and operated by CUAHSI, a pathway already exists to translate research findings to the operational needs of federal agencies participating in the NWC. HydroShare already touches a broad and diverse community, with user base including Native American tribes, hydrologic science students, and faculty researchers across the U.S. This proposal builds on the success of HydroShare to extend its capabilities and broaden model hypothesis testing, collaborative data sharing, and open app development across earth science research and education.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 21)
Wilkins-Diehr, Nancy and Miller, Mark and Brookes, Emre H and Arora, Ritu and Chourasia, Amit and Calyam, Prasad and Jennewein, Douglas M and Nandigam, Viswanath and LaMar, M Drew and Cleveland, Sean B and Newman, Greg and Wang, Shaowen and Zaslavsky, Ily "Measuring Success: How Science Gateways Define Impact" Gateways 2019 , 2019 Citation Details
Choi, Young-Don and Maghami, Iman and Goodall, Jonathan L and Band, Lawrence and Nassar, Ayman and Lin, Laurence and Saby, Linnea and Li, Zhiyu and Wang, Shaowen and Calloway, Chris and Yi, Hong and Seul, Martin and Ames, Daniel P and Tarboton, David G "Toward reproducible and interoperable environmental modeling: Integration of HydroShare with server-side methods for exposing large-extent spatial datasets to models" Environmental Modelling & Software , v.183 , 2025 https://doi.org/10.1016/j.envsoft.2024.106239 Citation Details
Essawy, Bakinam T. and Goodall, Jonathan L. and Zell, Wesley and Voce, Daniel and Morsy, Mohamed M. and Sadler, Jeffrey and Yuan, Zhihao and Malik, Tanu "Integrating scientific cyberinfrastructures to improve reproducibility in computational hydrology: Example for HydroShare and GeoTrust" Environmental Modelling & Software , v.105 , 2018 https://doi.org/10.1016/j.envsoft.2018.03.025 Citation Details
Gan, Tian and Tarboton, David G. and Dash, Pabitra and Gichamo, Tseganeh Z. and Horsburgh, Jeffery S. "Integrating hydrologic modeling web services with online data sharing to prepare, store, and execute hydrologic models" Environmental Modelling & Software , v.130 , 2020 10.1016/j.envsoft.2020.104731 Citation Details
Gan, Tian and Tarboton, David G. and Horsburgh, Jeffery S. and Dash, Pabitra and Idaszak, Ray and Yi, Hong "Collaborative sharing of multidimensional space-time data in a next generation hydrologic information system" Environmental Modelling & Software , v.129 , 2020 10.1016/j.envsoft.2020.104706 Citation Details
Hart, Julia and Bandaragoda, Christina and Ramirez-Toro, Graciela "Sharing Data Helps Puerto Ricans Rebound After Hurricane Maria" Eos , v.100 , 2019 10.1029/2019EO121651 Citation Details
Horsburgh, Jeffery S. and Hooper, Richard P. and Bales, Jerad and Hedstrom, Margaret and Imker, Heidi J. and Lehnert, Kerstin A. and Shanley, Lea A. and Stall, Shelley "Assessing the state of research data publication in hydrology: A perspective from the Consortium of Universities for the Advancement of Hydrologic Science, Incorporated" WIREs Water , v.7 , 2020 https://doi.org/10.1002/wat2.1422 Citation Details
Li, Ting and Stanislawski, Lawrence V. and Brockmeyer, Tyler and Wang, Shaowen and Shavers, Ethan "OpenCLC: An open-source software tool for similarity assessment of linear hydrographic features" SoftwareX , v.11 , 2020 10.1016/j.softx.2020.100401 Citation Details
Padmanabhan, Anand and Ziao, Ximo and Vandewalle, Rebecca C. and Baig, Furqan and Michel, Alexander and Li, Zhiyu and Wang, Shaowen "CyberGIS-compute for enabling computationally intensive geospatial research" Proceedings of the 3rd ACM SIGSPATIAL International Workshop on APIs and Libraries for Geospatial Data Science (SpatialAPI21) , 2021 https://doi.org/10.1145/3486189.3490017 Citation Details
Phuong, Jimmy and Bandaragoda, Christina and Istanbulluoglu, Erkan and Beveridge, Claire and Strauch, Ronda and Setiawan, Landung and Mooney, Sean D. "Automated retrieval, preprocessing, and visualization of gridded hydrometeorology data products for spatial-temporal exploratory analysis and intercomparison" Environmental Modelling & Software , v.116 , 2019 10.1016/j.envsoft.2019.01.007 Citation Details
Rosenberg, David E. and Filion, Yves and Teasley, Rebecca and Sandoval-Solis, Samuel and Hecht, Jory S. and van Zyl, Jakobus E. and McMahon, George F. and Horsburgh, Jeffery S. and Kasprzyk, Joseph R. and Tarboton, David G. "The Next Frontier: Making Research More Reproducible" Journal of Water Resources Planning and Management , v.146 , 2020 10.1061/(ASCE)WR.1943-5452.0001215 Citation Details
(Showing: 1 - 10 of 21)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

This project has developed and advanced HydroShare, cyberinfrastructure for enabling hydrologic knowledge discovery through collaboration and data sharing. HydroShare is a web-based repository and hydrologic information system for users to share, collaborate around, and publish data, models, workflows, and applications associated with water related research. HydroShare is operated by the Consortium of Universities for the Advancement of Hydrologic Science Inc. (CUAHSI), an NSF supported non-profit organization with membership that includes 100+ U.S. universities and several other institutions both within the U.S. and internationally. As CUAHSI's primary data management platform, HydroShare serves to support open science practice following Findable, Accessible, Interoperable and Reusable (FAIR) principles for the hydrology and water resources research community. There are at present over 13000 resources in HydroShare, over 1000 of which have been permanently published with citable digital object identifiers, many linked to published papers supporting transparency and reproducibility of the research reported in these papers, and reuse of the associated data, adding value to research investments both in the research and CUAHSI and the development of HydroShare. In HydroShare a Resource is a single item of digital content that may comprise multiple data files and/or computer programs described with metadata structured to make it reusable and, where applicable, machine readable. The HydroShare repository also links with connected computational systems providing immediate value to users through the ability to reduce the needs for software installation and configuration and to document workflows, enhancing reproducibility.

 

HydroShare development has been motivated by and founded on the notion that solving big problems in water research depends on collaboration and data sharing.  It requires teamwork and integration of information from multiple sources. This is enabled by open data and easy to use cyberinfrastructure.  The CUAHSI HydroShare platform and data repository, and linked computing resources were developed to address these needs.

 

The development of HydroShare has resulted in a number of important outcomes.  First and foremost is the HydroShare repository website itself that serves as a portal for accessing, sharing and publishing data.  Underpinning this is the resource data model that establishes data, models and workflows as first class shareable or social objects that can be stored in HydroShare and used in collaboration.  The access control model that allows resources to be private, while still in preparation, then discoverable, privately shared for review purposes, public and permanently published with a citable digital object identifier is tuned to the needs and wishes of the hydrology community.  Community and group functionality in the repository serve the needs of specific stakeholder communities, such as the Critical Zone Network that has adopted HydroShare as one of its data management solutions.

 

Beyond resource storage the repository application programming interface supports machine readability of resource content and metadata.  This enables important connectivity and interoperability with third party web-based platforms that, among other things, can support distributed and high-performance computing.  Exploiting this functionality, the HydroShare project has established two, somewhat general purpose, JupyterHub computing platforms: (1) CUAHSI JupyterHub and (2) CyberGIS-Jupyter for water.  HydroShare web app connectors allow a resource in HydroShare to be opened with these linked computing platforms.  This results in the resource content being copied into the computing platform where analysis can be conducted, for example reusing the data, or reproducing and extending the results from a workflow encoded in a Jupyter Notebook.  Reusability and reproducibility are important capabilities that HydroShare has advanced.  CUAHSI JupyterHub is targeted at relatively small computing applications in support of education.  It is deployed in the Google Cloud.  CyberGIS-Jupyter for water is deployed on the NSF supported ACCESS high-performance computing resources for more computationally intensive analyses and modeling.

 

HydroShare has been widely adopted in the hydrology community and has been used in multiple separate projects advancing various aspects of hydrology community cyberinfrastructure, some of the more prominent being:

  • Data Science Analytics for Water: A project to facilitate easier access to hydrologic data via two Python client packages (NSF 1931297, Horsburgh PI).

  • Critical Zone Collaborative Network Coordinating Hub (NSF 2012893, Bales PI)

  • NSF Harnessing the Data Revolution Institute for Geospatial Understanding through an Integrative Discovery Environment (I-GUIDE) (NSF 2118329, Wang PI)

  • NOAA Cooperative Institute for Research to Operations in Hydrology.

 

 


Last Modified: 01/26/2023
Modified by: David G Tarboton

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page