
NSF Org: |
OAC Office of Advanced Cyberinfrastructure (OAC) |
Recipient: |
|
Initial Amendment Date: | August 10, 2016 |
Latest Amendment Date: | August 10, 2016 |
Award Number: | 1550547 |
Award Instrument: | Standard Grant |
Program Manager: |
Seung-Jong Park
OAC Office of Advanced Cyberinfrastructure (OAC) CSE Directorate for Computer and Information Science and Engineering |
Start Date: | September 1, 2016 |
End Date: | August 31, 2021 (Estimated) |
Total Intended Award Amount: | $475,000.00 |
Total Awarded Amount to Date: | $475,000.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
5200 N LAKE RD MERCED CA US 95343-5001 (209)201-2039 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
5200 N. Lake Road Merced CA US 95343-5705 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): |
PROBABILITY, APPLIED MATHEMATICS, COMPUTATIONAL MATHEMATICS, Software Institutes, CDS&E-MSS |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
Scientists often use mathematical models to predict the behavior of natural and engineered systems. These models are therefore fundamental to scientific and engineering progress and hence relevant to NSF's science mission. Most models of realistic physical systems use complex formulae (such as, partial differential equations) involving many variables. When using such a model for predicting the future behavior of a system, a scientist has to provide initial values for all the variables. This can be difficult because input values may not be directly measureable. Thus, scientists often must use "inverse" computations to calculate the initial input values of the variables of a system model based on external observations of the real world. In other words, scientists seek to infer inputs to a computer model of a physical process from real observational data of the outputs. There are many examples of inverse computations, ranging from computing the important dimensions of an organ from its CAT scan, reconstructing the source of a sound by measuring its volume and frequency at various places, calculating the density of the Earth from measurements of its gravity field, or calculating the initial condition of the atmosphere (temperature, pressure, etc.) from satellite and weather station observations over a time interval. Inverse problems are ubiquitous across all of science and engineering (and beyond). Many solutions exist for inverse problems, i.e. solutions that fit the data to the observations. However, there are variations in the solutions identified. That is, the solutions of an inverse problem are subject to uncertainty. Bayesian inferencing provides a systematic mathematical framework for characterizing this uncertainty. However, the Bayesian solution of inverse problems for large-scale complex models require enormous computational power. Only recently have algorithms begun to emerge that are computationally tractable. However, these algorithms have remained out of the reach of the mainstream of scientists who solve inverse problems, due to their complexity and the need for deeper information from the forward model. This project aims to develop, distribute, and support open-source software that encodes state-of-the-art algorithms for the solution of large-scale complex Bayesian inverse problems and is robust, scalable, flexible, modular, widely accessible, and easy to use.
The project builds heavily on two complementary open-source software libraries the team has been developing: MUQ at MIT, and hIPPYlib at UT-Austin/UC-Merced. MUQ provides a spectrum of powerful Bayesian inversion models and algorithms, but expects forward models to come equipped with gradients/Hessians to permit large-scale solution. hIPPYlib implements powerful large-scale gradient/Hessian-based inverse solvers in an environment that can automatically generate needed derivatives, but it lacks full Bayesian capabilities. By integrating these two complementary libraries, the project will result in a robust, scalable, and efficient software framework that realizes the benefits of each to tackle complex large-scale Bayesian inverse problems across a broad spectrum of scientific and engineering disciplines. The resulting software, that will be distributed under an open-source license, will provide an environment for rapid development of inverse models equipped with gradient/Hessian information; benchmark problems for evaluation and comparison of algorithms; and tutorial problems for training and testing purposes.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
The overarching goal of this collaborative project between UT Austin, MIT, and UC Merced was the development and dissemination of a high-performance, open-source software framework incorporating a suite of advanced algorithms for the solution of Bayesian inverse problems. Bayesian inference provides a systematic framework for integration of data with mathematical models to quantify the uncertainty in the solution of the inverse problem. However, solution of Bayesian inverse problems governed by complex forward models described by partial differential equations (PDEs) remains prohibitive with black-box Markov chain Monte Carlo (MCMC) methods. Under this grant, we extended and interfaced two complementary open source software packages, hIPPYlib and MUQ. hIPPYlib solves PDE-constrained inverse problems using adjoint-based first and second derivatives, but it lacks full Bayesian capabilities. MUQ provides a spectrum of powerful Bayesian inversion models and algorithms that accelerate MCMC sampling by exploiting the geometry and intrinsic low-dimensionality of parameter space via derivative information and low rank approximation. However, MUQ expects forward models to come equipped with gradients and Hessians to permit large-scale solution. By combining these two complementary libraries, we created hIPPYlib-MUQ, a robust, scalable, and efficient software framework that realizes the benefits of each and allows researchers to tackle complex large-scale Bayesian inverse problems across a broad spectrum of scientific and engineering disciplines.
Extensive research efforts have been devoted to overcome the prohibitiveness of Bayesian inverse problems governed by large-scale complex PDE-based models. With rapid progress in high-performance computing, and advances in scalable PDE solvers, repeated evaluations of forward PDE models for different input parameters are becoming more tractable. Furthermore, structure-exploiting MCMC methods have effectively facilitated the exploration of complex posterior distributions. Finally, dimension reduction methods have proved to significantly reduce the computational cost of MCMC simulations. However, applying and combining these advanced techniques can be extremely challenging. Our goal with this project were twofold: (i) to make these advanced Bayesian inversion algorithms accessible to domain scientists; (ii) to provide an environment that expedites the development of new algorithms, and that helps developers benchmark their work. In particular, the major scientific and broader impact outcomes of our project include:
1. Software public releases: hIPPYlib (https://hippylib.github.io/download/) and hIPPYlib-MUQ (https://github.com/hippylib/hippylib2muq)
2. Docker images containing pre-installed softwares and tutorial examples: hIPPYlib: https://hub.docker.com/r/hippylib/toms and hIPPYlib-MUQ: https://hub.docker.com/r/ktkimyu/hippylib2muq
3. Documentations: hIPPYlib (https://hippylib.github.io/tutorial_v2.3.0/) and hIPPYlib-MUQ (https://hippylib2muq.readthedocs.io/en/latest/)
4. Two ACM TOMS papers describing the hIPPYlib (published, https://arxiv.org/abs/1909.03948) and hIPPYlib-MUQ (submitted, https://arxiv.org/abs/2112.00713) software frameworks.
5. Development of teaching materials and tutorials: the capabilities of the hIPPYlib and hIPPYlib-MUQ software frameworks have been demonstrated and shared in the form of Jupyter notebooks (interactive tutorials that mix instruction and theory with editable and runnable code).
6. hIPPYlib-MUQ as research tool: hIPPYlib and hIPPYlib-MUQ are being applied to a broad spectrum of challenging Bayesian inverse problems, including inference of ice sheet basal friction from InSAR-based surface velocities, statistical treatment of inverse problems constrained by stochastic model, accounting for model errors in inverse problems, etc. As such, several articles (published, submitted, or in preparation) and PhD thesis resulted from hIPPYlib and hIPPYlib-MUQ's algorithms and their applications to a spectrum of challenging inverse problems. A comprehensive list of applications and list of publications can be found at https://hippylib.github.io/research/.
7. hIPPYlib-MUQ as teaching tool: hIPPYlib was used as the computational foundation for one graduate-level inverse problems course at UC Merced (hIPPYlib has been used in inverse problems courses at a number of other universities as well.) In addition, hIPPYlib and MUQ were heavily employed at various workshops and summer schools including ICERM (Summer 2015), SAMSI (Summer 2016), the 2018 SIAM Gene Golub Summer School (co-organized by the PIs of this grant), where graduate students or early career researchers have been trained in the field of computational inverse problems. More recently, the PI introduced hIPPYlib-MUQ at the "Women in Inverse Problems" workshop orginized by the Banff International Research Station for Mathematical Innovation and Discovery (BIRS).
8. Mentorship and training at UC Merced: this grant offered support and training opportunities in Bayesian inversion theory, algorithms, and software for three postdoctoral researchers, several graduate students (via the PI's research group and her graduate-level inverse problems class), and three undergraduate students.
Last Modified: 01/04/2022
Modified by: Noemi Petra
Please report errors in award information by writing to: awardsearch@nsf.gov.