
NSF Org: |
IIS Division of Information & Intelligent Systems |
Recipient: |
|
Initial Amendment Date: | July 27, 2016 |
Latest Amendment Date: | September 11, 2017 |
Award Number: | 1563950 |
Award Instrument: | Continuing Grant |
Program Manager: |
Sylvia Spengler
sspengle@nsf.gov (703)292-7347 IIS Division of Information & Intelligent Systems CSE Directorate for Computer and Information Science and Engineering |
Start Date: | September 1, 2016 |
End Date: | August 31, 2020 (Estimated) |
Total Intended Award Amount: | $724,000.00 |
Total Awarded Amount to Date: | $724,000.00 |
Funds Obligated to Date: |
FY 2017 = $490,644.00 |
History of Investigator: |
|
Recipient Sponsored Research Office: |
2221 UNIVERSITY AVE SE STE 100 MINNEAPOLIS MN US 55414-3074 (612)624-5599 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
200 Union Street SE Minneapolis MN US 55455-0159 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | Info Integration & Informatics |
Primary Program Source: |
01001718DB NSF RESEARCH & RELATED ACTIVIT |
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
While the past decade has seen considerable advances in predicting missing entries in data matrices, existing approaches have demonstrated sobering performance and several limitations in an important scientific problem: characterizing plant traits, such as plant height, seed mass, leaf area, and leaf nitrogen, over space and time. Detailed global maps of plant traits will enable accurate quantification of terrestrial ecosystem functions, such as agricultural and forest productivity, and regulation of atmospheric CO2 levels. This project uses the largest and most comprehensive plant trait database on the planet (TRY, www.try-db.org) to develop a detailed characterization of plant functional traits and trait diversity at relatively fine scales across most of the terrestrial land surface. In doing so, the project produces the first detailed uncertainty quantified maps of plant traits across all of earth's major land ecosystems as well as their future projections. The project trains a new generation of interdisciplinary scientists who can cross the traditional boundaries between computer science, spatial statistics, and the Earth sciences.
The research in the project makes substantial advances on Bayesian probabilistic models for matrix gap filling or matrix completion, as well as spatiotemporal gap filling with emphasis on continuous fields. In particular, the project develops probabilistic matrix completion models which can incorporate domain specific hierarchies, such as plant taxonomic or phylogenetic trees, as well as spatial variations across different environmental regimes. The project also develops methods for gap filling in continuous fields based on spatiotemporal process models along with highly scalable inference methods based on dynamic nearest-neighbor Gaussian processes. The models and methods are expected to have impact beyond the scope of quantifying ecosystem functions.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
While there has been considerable advances in models for gap filling, i.e., predicting missing entries in a data matrix or continuous field, existing models have been found to perform poorly in an important scientific problem: characterizing plant traits, such as specific leaf area (SLA), leaf nitrogen concentration (Leaf N), and leaf phosphorus concentration (Leaf P), towards accurate quantification of terrestrial ecosystem functions, such as agricultural and forest productivity, and regulation of atmospheric CO2 levels. Plant traits are key to understanding, predicting, and ameliorating the response of biodiversity and terrestrial ecosystems to anthropogenic disturbance and expected environmental changes, e.g., land use changes, increasing level of atmospheric CO2 concentration, or climate change. A detailed uncertainty quantified global map of plant traits, e.g., SLA, Leaf N, Leaf P, etc., over space and time, projecting into the future, will help considerably advance Earth system models (ESMs, coupled models of the atmosphere, ocean, sea ice, and terrestrial land surface), which form the basis for future climate projections. The focus of the project was on developing probabilistic models for matrix gap filling (MGF) for data matrices and spatiotemporal gap filling (STGP) for spatial fields, along with applications of such models to gap filling plant trait databases and generating global maps of key plant traits which can be used as input to ESMs.
The project accomplished all of the planned tasks, leading to advances in probabilistic modeling of data matrices and spatial fields of variables, and their applications to terrestrial ecosystem modeling. For MGF, our project developed a new model based on Bayesian hierarchical probabilistic matrix factorization (BHPMF) which uses a domain based hierarchy to generate uncertainty quantified gap-filling of data matrices. The model was used to gap-fill the TRY database, the largest database on plant traits. We have also developed spatiotemporal process models for STGF along with highly scalable inference based on dynamic nearest-neighbor Gaussian processes. We have also developed gradient boosted normalizing flows for flexible probabilistic modeling in high dimensions based on deep generative models. In addition, for efficient computations for approximate inference in complex probabilistic models, we have developed regularized variational inference, conjugate-computation variational inference, and related algorithms which are computationally efficient, scalable, and stable.
Our advances in probabilistic models for spatial gap filling have been used to generate one of the first uncertainty quantified global maps of key plant traits, such as leaf N and leaf P. Further, such uncertainty quantified trait maps have been used as input to ESMs yielding improvements in carbon flux modeling compared to the state-of-the-art approach in land surface modeling based on deterministic plant functional types. These simulations have increased the variation in plant traits by orders of magnitude compared to the current paradigm. Our results on improved carbon flux modeling may have a transformative impact on how land surface modeling is done going forward.
Last Modified: 12/31/2020
Modified by: Arindam Banerjee
Please report errors in award information by writing to: awardsearch@nsf.gov.