Award Abstract # 1948386
Collaborative Research: Neotoma Paleoecology Database, a Multi-Proxy, International, Community-Curated Data Resource for Global Change Research

NSF Org: EAR
Division Of Earth Sciences
Recipient: LEHIGH UNIVERSITY
Initial Amendment Date: April 22, 2020
Latest Amendment Date: August 12, 2021
Award Number: 1948386
Award Instrument: Continuing Grant
Program Manager: Raleigh Martin
ramartin@nsf.gov
 (703)292-7199
EAR
 Division Of Earth Sciences
GEO
 Directorate for Geosciences
Start Date: June 1, 2020
End Date: October 31, 2024 (Estimated)
Total Intended Award Amount: $47,702.00
Total Awarded Amount to Date: $47,702.00
Funds Obligated to Date: FY 2020 = $33,806.00
FY 2021 = $13,896.00
History of Investigator:
  • Robert Booth (Principal Investigator)
    robert.booth@lehigh.edu
Recipient Sponsored Research Office: Lehigh University
526 BRODHEAD AVE
BETHLEHEM
PA  US  18015-3008
(610)758-3021
Sponsor Congressional District: 07
Primary Place of Performance: Earth & Environmental Science Dept
1 West Packer Avenue
Bethlehem
PA  US  18015-3001
Primary Place of Performance
Congressional District:
07
Unique Entity Identifier (UEI): E13MDBKHLDB5
Parent UEI:
NSF Program(s): GEOINFORMATICS
Primary Program Source: 01002223DB NSF RESEARCH & RELATED ACTIVIT
01002122DB NSF RESEARCH & RELATED ACTIVIT

01002021DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s):
Program Element Code(s): 725500
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.050

ABSTRACT

The recent geological record can offer useful insight into how species adapt (or fail to adapt) to large and rapid environmental change. Understanding past global changes necessitates the assembly and analysis of many individual records, collected by many scientists, from many parts of the world. This project will continue the support and development of the Neotoma Paleoecology Database, a community data resource in which experts contribute their data, curate these data to meet community standards, and freely access and use the data in research and teaching. Examples of research that uses the Neotoma resource include understanding the causes and consequences of past extinction, assessing the sensitivity of species to large environmental changes that accompanied past ice ages, and measuring how quickly species adapt to past rapid environmental changes. Neotoma?s open data access will also facilitate new forms of education and outreach, by giving students and teachers direct access to research-quality data and empowering them to run their own analyses of past climate and biodiversity dynamics. This will help train the next generation of the scientific workforce, with crossover skills in both geological and data sciences.

Neotoma has become a research resource in paleoecology, paleoclimatology, macroecology, biogeography, and environmental archaeology, with rapid growth in data volume, kinds of data supported, and active third-party and externally-supported research projects. Gathering thousands of site-level records into high-quality, global-scale, data networks requires substantial effort. Datasets must be carefully checked, taxon names harmonized, age-depth models updated, and uncertainties quantified. Neotoma will continue to address these challenges through a model of centralized infrastructure and distributed data governance. Priorities for this project are: 1) establishing international coverage for data types and research communities that now have critical masses of trained data stewards and data volumes in North America (pollen, diatoms, ostracods, testate amoebae, vertebrates), and 2) establishing critical masses of data and data stewards for recently added data types (specimen-level stable isotopes, organic biomarkers). Cyberinfrastructure improvements will include: 1) a web-based interface for bulk-uploading metadata and data from many sites, 2) better support for specimen-level isotopic measurements, 3) better visualization and analytical support for micropaleontological species-environment calibration datasets, and 4) a rebuild of Neotoma?s home webpage. Sustainability priorities include: 1) support for the recently launched non-profit EarthLife Consortium Foundation, and 2) a workshop in which leaders from Neotoma and other community data resources will receive training in building strategic sustainability plans, learn tools to manage finances and plan for the future, develop strategies to diversify their funding base, and enhance communication skills. These updates and improvements to Neotoma will enhance the existing data resource for current users, and will help early-career researchers, for whom data availability and data-science training can open new career pathways.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Williams, John W. and Spanbauer, Trisha L. and Heintzman, Peter D. and Blois, Jessica and Capo, Eric and Goring, Simon J. and Monchamp, Marie-Eve and Parducci, Laura and Von Eggers, Jordan M. and Alsos, Inger Greve and Bowler, Chris and Coolen, Marco J.L. "Strengthening global-change science by integrating aeDNA with paleoecoinformatics" Trends in Ecology & Evolution , 2023 https://doi.org/10.1016/j.tree.2023.04.016 Citation Details
Uhen, Mark D. and Buckland, Philip I. and Goring, Simon J. and Jenkins, Julian P. and Williams, John W. "The EarthLife Consortium API: an extensible, open-source service for accessing fossil data and taxonomies from multiple community paleodata resources" Frontiers of Biogeography , v.0 , 2021 https://doi.org/10.21425/F5FBG50711 Citation Details
Bradshaw, Richard H. and Styles, Bonnie and Giesecke, Thomas and Flantua, Suzette G. and Bittmann, Felix and Williams, John W. "Eric C. Grimm 19512020" Vegetation History and Archaeobotany , v.30 , 2021 https://doi.org/10.1007/s00334-021-00828-z Citation Details
George, Adrian K. and Roth, Robert E. and Widell, Sydney and Williams, John W. "Range Mapper: An Adaptable Process for Making and Using Interactive, Animated Web Maps of Late-Quaternary Open Paleoecological Data" Open Quaternary , v.9 , 2023 https://doi.org/10.5334/oq.114 Citation Details
Huang, Huai-Hsuan M. and Yasuhara, Moriaki and Horne, David J. and Perrier, Vincent and Smith, Alison J. and Brandão, Simone N. "Ostracods in databases: State of the art, mobilization and future applications" Marine Micropaleontology , v.174 , 2022 https://doi.org/10.1016/j.marmicro.2022.102094 Citation Details
Mottl, Ondej and Flantua, Suzette G. A. and Bhatta, Kuber P. and Felde, Vivian A. and Giesecke, Thomas and Goring, Simon and Grimm, Eric C. and Haberle, Simon and Hooghiemstra, Henry and Ivory, Sarah and Kune, Petr and Wolters, Steffen and Seddon, Alist "Global acceleration in rates of vegetation change over the past 18,000 years" Science , v.372 , 2021 https://doi.org/10.1126/science.abg1685 Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The recent fossil record provides some of our best available evidence about how species and ecosystems have adapted to large climate changes and the intensifying impact of human societies.  Collecting these data is laborious and expensive, yet these datasets are highly reusable, for many individual sites can be gathered together and reanalyzed to study biodiversity dynamics at local to global scales.  The Neotoma Paleoecology Database supports global-change research by providing an open, ethical, sustainable, high-quality, and community-curated repository for multiple kinds of paleoecological data. With sustained support from NSF, Neotoma has become one of the top international data resources for paleoecological data, widely used by global change scientists, biogeographers, paleoecologists, and other geoscientists as a trusted resource for their research, teaching, and outreach. Project outcomes are grouped into three areas: 1) Data growth and stewardship, 2) Enhancement of software services, and 3) Training and empowering the next generation of scientists.

1) Data Growth:  Over the last four years, data volumes increased by 30 to 45%, increasing from 7.3 to 10.5 million observations and from 39,329 to 49,745 datasets.  These datasets have an estimated replacement cost of over $1.4 billion dollars, and so are effectively irreplaceable.  Neotoma achieves high data quality via expert curation, working closely with community experts who serve as data contributors and data stewards. The most sustained growth has been in the data types with expert stewards directly supported by this award:  vertebrates, pollen, diatoms, ostracodes, and testate amoebae.  New data types directly supported by this award include isotopic data (an indicator of past vertebrate diet and behavior) and lead-210 data (an estimate of sediment age).  As Neotoma’s reputation has grown, other scientific groups have asked to join Neotoma as constituent databases and add their data.  New data types now being supported or in the process of being added include ancient environmental DNA, marine micropaleontological data, paleofire data, and speleothem (cave) records. Using one data system to support many different kinds of paleoenvironmental data helps streamline costs and enhance data sustainability while offering scientists a high-quality data resource that houses a wide variety of paleodata types

2) Enhanced Services:  A) Because the work involved in data upload and curation is the major rate-limiting step, a new system, called DataBUS, has been built to support large-volume uploads of data to Neotoma, with multiple automated validation checks.  DataBUS is now being tested with research groups.  The Neotoma website was fully redesigned and relaunched, with highly positive feedback from users.  New versions were released of the Application Programming Interfaces (2.0) and R package (neotoma2), which are used by scientists and developers building their own scientific workflows to access and use Neotoma data.  A usability and usefulness survey of Neotoma Explorer, an interactive map-based interface for finding and exploring Neotoma data, has both showcased the popularity of Explorer and identified areas where future improvements can be made.  Neotoma sustainability has been enhanced by moving the entire software stack to the Amazon Web Services Cloud, with support from the NSF Cloudbank program.  

3) Empowering Scientists:  The popularity of Neotoma is demonstrated by its high level of scientific usage.  Over 1,600 scientific papers have cited Neotoma or its constituent databases, with a combined H-factor of 139.  Neotoma data are routinely incorporated into scientific projects studying past ecological and environmental change at local to global scales.  Over the last four years, Neotoma stewards and developers have led dozens of workshops to train other scientists in how to use Neotoma services and join Neotoma as data contributors and stewards.  The enhanced data access provided by Neotoma is essential for early career scientists, for whom data access can be a critical barrier to scientific progress, and who need advanced skills in data science for successful careers in academia, government, and the private sector. 

 


Last Modified: 02/24/2025
Modified by: Robert K Booth

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page