Award Abstract # 1563950
III: Medium: Collaborative Research: Bayesian Modeling and Inference for Quantifying Terrestrial Ecosystem Functions

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: REGENTS OF THE UNIVERSITY OF MINNESOTA
Initial Amendment Date: July 27, 2016
Latest Amendment Date: September 11, 2017
Award Number: 1563950
Award Instrument: Continuing Grant
Program Manager: Sylvia Spengler
sspengle@nsf.gov
 (703)292-7347
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: September 1, 2016
End Date: August 31, 2020 (Estimated)
Total Intended Award Amount: $724,000.00
Total Awarded Amount to Date: $724,000.00
Funds Obligated to Date: FY 2016 = $233,356.00
FY 2017 = $490,644.00
History of Investigator:
  • Arindam Banerjee (Principal Investigator)
    arindamb@illinois.edu
  • Peter Reich (Co-Principal Investigator)
Recipient Sponsored Research Office: University of Minnesota-Twin Cities
2221 UNIVERSITY AVE SE STE 100
MINNEAPOLIS
MN  US  55414-3074
(612)624-5599
Sponsor Congressional District: 05
Primary Place of Performance: University of Minnesota-Twin Cities
200 Union Street SE
Minneapolis
MN  US  55455-0159
Primary Place of Performance
Congressional District:
05
Unique Entity Identifier (UEI): KABJZBBJ4B54
Parent UEI:
NSF Program(s): Info Integration & Informatics
Primary Program Source: 01001617DB NSF RESEARCH & RELATED ACTIVIT
01001718DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7364, 7924
Program Element Code(s): 736400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

While the past decade has seen considerable advances in predicting missing entries in data matrices, existing approaches have demonstrated sobering performance and several limitations in an important scientific problem: characterizing plant traits, such as plant height, seed mass, leaf area, and leaf nitrogen, over space and time. Detailed global maps of plant traits will enable accurate quantification of terrestrial ecosystem functions, such as agricultural and forest productivity, and regulation of atmospheric CO2 levels. This project uses the largest and most comprehensive plant trait database on the planet (TRY, www.try-db.org) to develop a detailed characterization of plant functional traits and trait diversity at relatively fine scales across most of the terrestrial land surface. In doing so, the project produces the first detailed uncertainty quantified maps of plant traits across all of earth's major land ecosystems as well as their future projections. The project trains a new generation of interdisciplinary scientists who can cross the traditional boundaries between computer science, spatial statistics, and the Earth sciences.

The research in the project makes substantial advances on Bayesian probabilistic models for matrix gap filling or matrix completion, as well as spatiotemporal gap filling with emphasis on continuous fields. In particular, the project develops probabilistic matrix completion models which can incorporate domain specific hierarchies, such as plant taxonomic or phylogenetic trees, as well as spatial variations across different environmental regimes. The project also develops methods for gap filling in continuous fields based on spatiotemporal process models along with highly scalable inference methods based on dynamic nearest-neighbor Gaussian processes. The models and methods are expected to have impact beyond the scope of quantifying ecosystem functions.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A. Moreno-Martínez et al. "A methodology to derive global maps of leaf traits using remote sensing and climate data" Remote Sensing of Environment , 2018
Farideh Fazayeli, Arindam Banerjee, Jens Kattge, Franziska Schrodt, PeterReich "Uncertainty Quantified Matrix Completion using Bayesian Hierarchical MatrixFactorization, Package 'BHMPF'" Repository CRAN , 2017
Habacuc Flores-Moreno, Farideh Fazayeli, Arindam Banerjee, Abhirup Datta,Jens Kattge, Ethan E. Butler, Owen K. Atkin, Kirk Wythers, Ming Chen, MadhurAnand, Michael Bahn, Sabina Burrascano, Chaeho Byun, J. Hans C. Cornelissen,Joseph Craine, Andres Gonza "Robustness of trait connections between multiple plant organs acrossenvironmental gradients and growth forms" Global Ecology and Biogeography (GEB) , 2019
H. Bruelheide et al.. "Global trait?environment relationships of plant communities" Nature Ecology & Evolution , 2018
J. Golmohammadi and I. Ebert-Uphoff and S. He and Y. Deng and A. Banerjee "High-Dimensional Dependency Structure Learning for Physical Processes" International Conference on Data Mining (ICDM) , 2017
K. Christakopoulou and A. Banerjee "Adversarial Attacks on Oblivious Recommenders" ACM Recommender Systems Conference (RecSys) , 2019
Madani N, JS Kimball, AP Ballantyne, DLR Affleck, Peter M Bodegom, Peter B Reich, Jens Kattge, Anna Sala, Mona Nazeri, Matthew O Jones, Maosheng Zhao, Steven W Running "Future global productivity will be affected by plant trait response to climate" Scientific Reports 8 , 2018
R. Giaquinto and A. Banerjee "DAPPER: Scaling Dynamic Author Persona Topic Model to Billion Word Corpora" International Conference on Data Mining (ICDM) , 2018
R. Giaquinto and A. Banerjee "Topic Modeling on Health Journals with Regularized Variational Inference" AAAI Conference on Artificial Intelligence (AAAI) , 2018
Robert Giaquinto and Arindam Banerjee "Gradient Boosted Normalizing Flows" Advances in Neural Information Processing Systems (NeurIPS) , 2020

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

While there has been considerable advances in models for gap filling, i.e., predicting missing entries in a data matrix or continuous field, existing models have been found to perform poorly in an important scientific problem: characterizing plant traits, such as specific leaf area (SLA), leaf nitrogen concentration (Leaf N), and leaf phosphorus concentration (Leaf P), towards accurate quantification of terrestrial ecosystem functions, such as agricultural and forest productivity, and regulation of atmospheric CO2 levels. Plant traits are key to understanding, predicting, and ameliorating the response of biodiversity and terrestrial ecosystems to anthropogenic disturbance and expected environmental changes, e.g., land use changes, increasing level of atmospheric CO2 concentration, or climate change. A detailed uncertainty quantified global map of plant traits, e.g., SLA, Leaf N, Leaf P, etc., over space and time, projecting into the future, will help considerably advance Earth system models (ESMs, coupled models of the atmosphere, ocean, sea ice, and terrestrial land surface), which form the basis for future climate projections. The focus of the project was on developing probabilistic models for matrix gap filling (MGF) for data matrices and spatiotemporal gap filling (STGP) for spatial fields, along with applications of such models to gap filling plant trait databases and generating global maps of key plant traits which can be used as input to ESMs.

The project accomplished all of the planned tasks, leading to advances in probabilistic modeling of data matrices and spatial fields of variables, and their applications to terrestrial ecosystem modeling. For MGF, our project developed a new model based on Bayesian hierarchical probabilistic matrix factorization (BHPMF) which uses a domain based hierarchy to generate uncertainty quantified gap-filling of data matrices. The model was used to gap-fill the TRY database, the largest database on plant traits. We have also developed spatiotemporal process models for STGF along with highly scalable inference based on dynamic nearest-neighbor Gaussian processes. We have also developed gradient boosted normalizing flows for flexible probabilistic modeling in high dimensions based on deep generative models. In addition, for efficient computations for approximate inference in complex probabilistic models, we have developed regularized variational inference, conjugate-computation variational inference, and related algorithms which are computationally efficient, scalable, and stable.

Our advances in probabilistic models for spatial gap filling have been used to generate one of the first uncertainty quantified global maps of key plant traits, such as leaf N and leaf P. Further, such uncertainty quantified trait maps have been used as input to ESMs yielding improvements in carbon flux modeling compared to the state-of-the-art approach in land surface modeling based on deterministic plant functional types. These simulations have increased the variation in plant traits by orders of magnitude compared to the current paradigm. Our results on improved carbon flux modeling may have a transformative impact on how land surface modeling is done going forward.


Last Modified: 12/31/2020
Modified by: Arindam Banerjee

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page