NSF Award Search: Award # 1550547

Award Abstract # 1550547

Collaborative Research: SI2-SSI: Integrating Data with Complex Predictive Models under Uncertainty: An Extensible Software Framework for Large-Scale Bayesian Inversion

NSF Org:	OAC Office of Advanced Cyberinfrastructure (OAC)
Recipient:	UNIVERSITY OF CALIFORNIA, MERCED
Initial Amendment Date:	August 10, 2016
Latest Amendment Date:	August 10, 2016
Award Number:	1550547
Award Instrument:	Standard Grant
Program Manager:	Seung-Jong Park OAC Office of Advanced Cyberinfrastructure (OAC) CSE Directorate for Computer and Information Science and Engineering
Start Date:	September 1, 2016
End Date:	August 31, 2021 (Estimated)
Total Intended Award Amount:	$475,000.00
Total Awarded Amount to Date:	$475,000.00
Funds Obligated to Date:	FY 2016 = $475,000.00
History of Investigator:	Noemi Petra (Principal Investigator) npetra@ucmerced.edu
Recipient Sponsored Research Office:	University of California - Merced 5200 N LAKE RD MERCED CA US 95343-5001 (209)201-2039
Sponsor Congressional District:	13
Primary Place of Performance:	University of California - Merced 5200 N. Lake Road Merced CA US 95343-5705
Primary Place of Performance Congressional District:	13
Unique Entity Identifier (UEI):	FFM7VPAG8P92
Parent UEI:
NSF Program(s):	PROBABILITY, APPLIED MATHEMATICS, COMPUTATIONAL MATHEMATICS, Software Institutes, CDS&E-MSS
Primary Program Source:	01001617DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s):	8009, 7433, 8004, 8251, 9263, 4444
Program Element Code(s):	126300, 126600, 127100, 800400, 806900
Award Agency Code:	4900
Fund Agency Code:	4900
Assistance Listing Number(s):	47.070

ABSTRACT

Scientists often use mathematical models to predict the behavior of natural and engineered systems. These models are therefore fundamental to scientific and engineering progress and hence relevant to NSF's science mission. Most models of realistic physical systems use complex formulae (such as, partial differential equations) involving many variables. When using such a model for predicting the future behavior of a system, a scientist has to provide initial values for all the variables. This can be difficult because input values may not be directly measureable. Thus, scientists often must use "inverse" computations to calculate the initial input values of the variables of a system model based on external observations of the real world. In other words, scientists seek to infer inputs to a computer model of a physical process from real observational data of the outputs. There are many examples of inverse computations, ranging from computing the important dimensions of an organ from its CAT scan, reconstructing the source of a sound by measuring its volume and frequency at various places, calculating the density of the Earth from measurements of its gravity field, or calculating the initial condition of the atmosphere (temperature, pressure, etc.) from satellite and weather station observations over a time interval. Inverse problems are ubiquitous across all of science and engineering (and beyond). Many solutions exist for inverse problems, i.e. solutions that fit the data to the observations. However, there are variations in the solutions identified. That is, the solutions of an inverse problem are subject to uncertainty. Bayesian inferencing provides a systematic mathematical framework for characterizing this uncertainty. However, the Bayesian solution of inverse problems for large-scale complex models require enormous computational power. Only recently have algorithms begun to emerge that are computationally tractable. However, these algorithms have remained out of the reach of the mainstream of scientists who solve inverse problems, due to their complexity and the need for deeper information from the forward model. This project aims to develop, distribute, and support open-source software that encodes state-of-the-art algorithms for the solution of large-scale complex Bayesian inverse problems and is robust, scalable, flexible, modular, widely accessible, and easy to use.

The project builds heavily on two complementary open-source software libraries the team has been developing: MUQ at MIT, and hIPPYlib at UT-Austin/UC-Merced. MUQ provides a spectrum of powerful Bayesian inversion models and algorithms, but expects forward models to come equipped with gradients/Hessians to permit large-scale solution. hIPPYlib implements powerful large-scale gradient/Hessian-based inverse solvers in an environment that can automatically generate needed derivatives, but it lacks full Bayesian capabilities. By integrating these two complementary libraries, the project will result in a robust, scalable, and efficient software framework that realizes the benefits of each to tackle complex large-scale Bayesian inverse problems across a broad spectrum of scientific and engineering disciplines. The resulting software, that will be distributed under an open-source license, will provide an environment for rapid development of inverse models equipped with gradient/Hessian information; benchmark problems for evaluation and comparison of algorithms; and tutorial problems for training and testing purposes.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 13)

Show All

Babaniyi, Olalekan and Nicholson, Ruanui and Villa, Umberto and Petra, Noémi "Inferring the basal sliding coefficient field for the Stokes ice sheet model under rheological uncertainty" The Cryosphere , v.15 , 2021 https://doi.org/10.5194/tc-15-1731-2021 Citation Details

Constantinescu, Emil M. and Petra, Noémi and Bessac, Julie and Petra, Cosmin G. "Statistical Treatment of Inverse Problems Constrained by Differential Equations-Based Models with Stochastic Terms" SIAM/ASA Journal on Uncertainty Quantification , v.8 , 2020 10.1137/18M122073X Citation Details

Emil Constantinescu, Cosmin Petra, Julie Bessac, Noemi Petra "Statistical Treatment of Inverse Problems Constrained by Differential Equations-based Models with Stochastic Terms" SIAM / ASA Journal on Uncertainty Quantification , v.8 , 2020 , p.170

Fatehiboroujeni, Soheil and Petra, Noemi and Goyal, Sachin "Linearized Bayesian inference for Youngs modulus parameter field in an elastic model of slender structures" Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences , v.476 , 2020 10.1098/rspa.2019.0476 Citation Details

Ghattas, Omar and Isaac, Toby and Petra, Noemi and Stadler, Georg "Scalable Algorithms for Bayesian Inference of Large-Scale Models from Large-Scale Data" International Conference on Vector and Parallel Processing, High Performance Computing for Computational Science - VECPAR 2016, Lecture Notes in Computer Science, Springer , v.10150 , 2017 https://doi.org/10.1007/978-3-319-61982-8_1 Citation Details

Ghattas, Omar and Marzouk, Youssef and Parno, Matt and Petra, Noemi and Stadler, Georg and Villa, Umberto "Students Tackle Bayesian Inverse Problems in the Colorado Rockies" SIAM news , 2019 Citation Details

Kim, KT and Villa, U and Parno, M and Petra, N and Marzouk, Y and Ghattas, O "hIPPYlib-MUQ: A Bayesian Inference Software Framework for Integration of Data with Complex Predictive Models under Uncertainty" 33nd International Conference on Parallel Computational Fluid Dynamics , 2021 Citation Details

Nicholson, Ruanui and Petra, Noémi and Kaipio, Jari P "Estimation of the Robin coefficient field in a Poisson problem with uncertain conductivity field" Inverse Problems , v.34 , 2018 10.1088/1361-6420/aad91e Citation Details

Olalekan Babaniyi, Ruanui Nicholson, Umberto Villa, Noemi Petra "Inferring the basal sliding coefficient field for the Stokes ice sheet model under rheological uncertainty" The Cryosphere , v.15 , 2021 , p.1731 10.5194/tc-15-1731-2021

Ruanui Nicholson, Noémi Petra and Jari P. Kaipio "Estimation of the Robin coefficient field in a Poisson problem with uncertain conductivity field" Inverse Problems , v.34 , 2018 10.1088%2F1361-6420%2Faad91e

Umberto Villa, Noemi Petra, Omar Ghattas "hIPPYlib: An Extensible Software Framework for Large-Scale Inverse Problems Governed by PDEs: Part I: Deterministic Inversion and Linearized Bayesian Inference" ACM Transactions on Mathematical Software (TOMS) , v.47 , 2021 10.1145/3428447

(Showing: 1 - 10 of 13)

Show All

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The overarching goal of this collaborative project between UT Austin, MIT, and UC Merced was the development and dissemination of a high-performance, open-source software framework incorporating a suite of advanced algorithms for the solution of Bayesian inverse problems. Bayesian inference provides a systematic framework for integration of data with mathematical models to quantify the uncertainty in the solution of the inverse problem. However, solution of Bayesian inverse problems governed by complex forward models described by partial differential equations (PDEs) remains prohibitive with black-box Markov chain Monte Carlo (MCMC) methods. Under this grant, we extended and interfaced two complementary open source software packages, hIPPYlib and MUQ. hIPPYlib solves PDE-constrained inverse problems using adjoint-based first and second derivatives, but it lacks full Bayesian capabilities. MUQ provides a spectrum of powerful Bayesian inversion models and algorithms that accelerate MCMC sampling by exploiting the geometry and intrinsic low-dimensionality of parameter space via derivative information and low rank approximation. However, MUQ expects forward models to come equipped with gradients and Hessians to permit large-scale solution. By combining these two complementary libraries, we created hIPPYlib-MUQ, a robust, scalable, and efficient software framework that realizes the benefits of each and allows researchers to tackle complex large-scale Bayesian inverse problems across a broad spectrum of scientific and engineering disciplines.

Extensive research efforts have been devoted to overcome the prohibitiveness of Bayesian inverse problems governed by large-scale complex PDE-based models. With rapid progress in high-performance computing, and advances in scalable PDE solvers, repeated evaluations of forward PDE models for different input parameters are becoming more tractable. Furthermore, structure-exploiting MCMC methods have effectively facilitated the exploration of complex posterior distributions. Finally, dimension reduction methods have proved to significantly reduce the computational cost of MCMC simulations. However, applying and combining these advanced techniques can be extremely challenging. Our goal with this project were twofold: (i) to make these advanced Bayesian inversion algorithms accessible to domain scientists; (ii) to provide an environment that expedites the development of new algorithms, and that helps developers benchmark their work. In particular, the major scientific and broader impact outcomes of our project include:

1. Software public releases: hIPPYlib (https://hippylib.github.io/download/) and hIPPYlib-MUQ (https://github.com/hippylib/hippylib2muq)

2. Docker images containing pre-installed softwares and tutorial examples: hIPPYlib: https://hub.docker.com/r/hippylib/toms and hIPPYlib-MUQ: https://hub.docker.com/r/ktkimyu/hippylib2muq

3. Documentations: hIPPYlib (https://hippylib.github.io/tutorial_v2.3.0/) and hIPPYlib-MUQ (https://hippylib2muq.readthedocs.io/en/latest/)

4. Two ACM TOMS papers describing the hIPPYlib (published, https://arxiv.org/abs/1909.03948) and hIPPYlib-MUQ (submitted, https://arxiv.org/abs/2112.00713) software frameworks.

5. Development of teaching materials and tutorials: the capabilities of the hIPPYlib and hIPPYlib-MUQ software frameworks have been demonstrated and shared in the form of Jupyter notebooks (interactive tutorials that mix instruction and theory with editable and runnable code).

6. hIPPYlib-MUQ as research tool: hIPPYlib and hIPPYlib-MUQ are being applied to a broad spectrum of challenging Bayesian inverse problems, including inference of ice sheet basal friction from InSAR-based surface velocities, statistical treatment of inverse problems constrained by stochastic model, accounting for model errors in inverse problems, etc. As such, several articles (published, submitted, or in preparation) and PhD thesis resulted from hIPPYlib and hIPPYlib-MUQ's algorithms and their applications to a spectrum of challenging inverse problems. A comprehensive list of applications and list of publications can be found at https://hippylib.github.io/research/.

7. hIPPYlib-MUQ as teaching tool: hIPPYlib was used as the computational foundation for one graduate-level inverse problems course at UC Merced (hIPPYlib has been used in inverse problems courses at a number of other universities as well.) In addition, hIPPYlib and MUQ were heavily employed at various workshops and summer schools including ICERM (Summer 2015), SAMSI (Summer 2016), the 2018 SIAM Gene Golub Summer School (co-organized by the PIs of this grant), where graduate students or early career researchers have been trained in the field of computational inverse problems. More recently, the PI introduced hIPPYlib-MUQ at the "Women in Inverse Problems" workshop orginized by the Banff International Research Station for Mathematical Innovation and Discovery (BIRS).

8. Mentorship and training at UC Merced: this grant offered support and training opportunities in Bayesian inversion theory, algorithms, and software for three postdoctoral researchers, several graduate students (via the PI's research group and her graduate-level inverse problems class), and three undergraduate students.

Last Modified: 01/04/2022
Modified by: Noemi Petra

Please report errors in award information by writing to: awardsearch@nsf.gov.

Success

Error