
NSF Org: |
EAR Division Of Earth Sciences |
Recipient: |
|
Initial Amendment Date: | July 26, 2017 |
Latest Amendment Date: | April 3, 2023 |
Award Number: | 1740694 |
Award Instrument: | Standard Grant |
Program Manager: |
Eva Zanzerkia
EAR Division Of Earth Sciences GEO Directorate for Geosciences |
Start Date: | September 1, 2017 |
End Date: | September 30, 2023 (Estimated) |
Total Intended Award Amount: | $1,449,662.00 |
Total Awarded Amount to Date: | $1,449,662.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
21 N PARK ST STE 6301 MADISON WI US 53715-1218 (608)262-3822 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
1215 West Dayton Street Madison WI US 53706-1600 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | EarthCube |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.050 |
ABSTRACT
This grant supports a partnership between geochronologists who have built and run laboratory facilities that are designed to measure the ages of rocks using radioisotopic and astronomical methods, geoscientists who are building synthetic databases that depend critically on accurate and precise ages of rocks in order to test hypotheses in the Earth and life sciences, and computer scientists who are building infrastructure components that are now being used broadly in education and research. The aim is to address a 'grand challenge' in the Earth sciences: to develop a fully integrated four-dimensional digital Earth so that we may fully understand dynamic Earth system evolution through time. To meet this goal the team is developing: (1) a robust cyberinfrastructure to manage and expose data produced by multiple distributed geochronology laboratory facilities around the USA, (2) a digital mechanism to enable scientists of all types to readily discover and use geochronologic data, while at the same time keeping age estimates closely connected to geochronology lab expertise and underlying, laboratory-specific data, (3) protocols and workflows that pass geochronological data and metadata from labs to synthetic geological and paleobiological databases and data repositories, and (4) software that can harness this new geochronological infrastructure and leverage it in order to generate age models for broad swaths of rocks and thereby enable the correlation of Earth system records across a range of nested spatial and temporal scales. Institutions formally collaborating in these efforts include: University of Wisconsin-Madison, University of Arizona, Boise State University, New Mexico Institute of Mining and Technology, and University of Minnesota.
Geochronological data are central to our understanding of Earth's past and future. This collaboration between geo- and computer scientists is: (1) creating cyberinfrastructure that better leverages existing and new laboratory-generated geochronologic data, and (2) integrating this infrastructure with with synthetic databases including: Paleo Biology Data Base (https://paleobiodb.org), Neotoma Data Base (http://www.neotomadb.org), Macrostrat (https://macrostrat.org), as well as the Integrated Earth Data Alliance (IEDA) Geochron (http://www.geochron.org/) data repository using a standard that can be widely applied by others. The team's approach is unique in that it involves the parallel efforts of both the producers and consumers of geochronologic data as well as technical staff who have a working knowledge of geologic and biologic databases. The team is also well positioned to broadly serve geochronology because it engages three different geochemical/radioisotopic systems that address different geological problems and time scales. One of the overarching science goals of EarthCube is to characterize the key processes, interactions and feedbacks operating at and across different temporal and spatial scales and biological, chemical, mechanical, and physical domains. These modest, but concrete, steps allow the development of templates for distributed, laboratory cyberinfrastructure and geochronologically grounded models that can be adapted and used across Earth science communities. The aim is to share best practices and move Earth scientists towards an open, frictionless transfer of data and knowledge. The impacts of this project extend beyond the participating laboratories and collaborations and include: (1) three young geoscientists (1 PhD student and 2 Postdoctoral scholars) are gaining the cyberinformatics experience to become next generation faculty/research leaders, (2) a set of standards for distributed laboratory server operations is being created and implemented, forming the foundation for establishing a global network of lab-derived geochronological data, and (3) the GeochronAPI deployed across this network is providing a mechanism by which to integrate and synthesize geochronological data into independently developed applications, including geodiscovery-oriented Flyover Country (http://fc.umn.edu). Through workshops the team is engaging a wide spectrum of geochronologists to converge on an open standard for the GeochronAPI system, and their work with the synthetic databases highlighted above engages a large community spanning bio- and geoscience. All of the project's software, both lab-centric and external facing, is being made accessible in public GitHub repositories to encourage open, creative development.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
This project created a partnership between geochronologists and geoinformatics researchers geared toward synthesizing and communicating geochronologic measurements made in state-of-the-art laboratories to community-level data archival facilities and the general public. Based on input from the geochronology community, we designed and released an integrated software solution for "lab information management" that could be used across fields of geochronology and geochemistry, while being geared towards a lab facility that maintains long-term stewardship of their data. Thus, this project aimed to close a gap that existed between the producers of geochronological and other geochemical measurements and the community of researchers and educators in both public and private settings that depend on those measurements for their work. This was done in a way that also helped to improve the internal data management systems in geochronological/geochemical laboratory facilities, providing across-the-board benefits throughout the lifecycle of a geochemical measurement.
The resulting Sparrow software program and documentation, freely available as an open source project and distributed via GitHub and a dedicated website (https://sparrow-data.org/), provides labs with a flexible and powerful relational database, authenticated web portal, and file-tracking system for the management of their data archive. The system also has a standardized "application programming interface" (API) for distributing data and representing results in other data systems. This laboratory "inward and outward facing" data management system provides benefits to laboratory facilities as well as to end-users who need to incorporate high-quality geochemical data into their own workflows, all while preserving complete traceability of data provenance and metadata related to various sample preparation and data reduction steps.
We built and deployed several lab data pipelines using this new system, engaging the efforts of lab professional staff, postdocs, graduate and undergraduate students to track, standardize, and augment data records to build unified data archives. Several labs outside of the program also collaborated to build and deploy prototype instances of the Sparrow software stack that we developed. Multiple instances are now in use in labs at this moment, with the University of Colorado TRaIL lab, an outside collaborator, among others having incorporated the system into its daily operations. Interest in incorporating our software stack into other laboratory facilities remains high, and it is our hope that distributed open-source software development work continues to improve the system past the term of this grant.
Further, during the course of our work, we trained several geoscience undergraduates, graduate students, and postdocs in data management and software development, some of whom went on to related jobs in private industry; the postdoc played an instrumental role in the operational management of the project beginning in year 1. Through this effort, the postdoctoral scholar gained significant personnel and project management experience and ultimately went on to become a Research Scientist and eventually a Principal Investigator of subsequent related cyberinfrastructure and geoscience data management and distribution projects.
Last Modified: 01/26/2024
Modified by: Shanan E Peters
Please report errors in award information by writing to: awardsearch@nsf.gov.