
NSF Org: |
OAC Office of Advanced Cyberinfrastructure (OAC) |
Recipient: |
|
Initial Amendment Date: | August 24, 2018 |
Latest Amendment Date: | November 23, 2020 |
Award Number: | 1835443 |
Award Instrument: | Standard Grant |
Program Manager: |
Rob Beverly
OAC Office of Advanced Cyberinfrastructure (OAC) CSE Directorate for Computer and Information Science and Engineering |
Start Date: | January 1, 2019 |
End Date: | December 31, 2023 (Estimated) |
Total Intended Award Amount: | $3,498,560.00 |
Total Awarded Amount to Date: | $3,498,560.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
77 MASSACHUSETTS AVE CAMBRIDGE MA US 02139-4301 (617)253-1000 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
77 Massachusetts Cambridge MA US 02139-4307 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | Software Institutes |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
Recent revolutions in data availability have radically altered activities across many fields within science, industry, and government. For instance, contemporary simulations medication properties can require the computational power of entire data centers, and recent efforts in astronomy will soon generate the largest image datasets in history. In such extreme environments, the only viable path forward for scientific discovery hinges on the development and exploitation of next-generation computational cyberinfrastructure of supercomputers and software. The development of this new computational infrastructure demands significant engineering resources, so it is paramount to maximize the infrastructure's potential for high impact and wide adoption across as many technical domains as possible. Unfortunately, despite this necessity, existing development processes often produce software that is limited to specific hardware, or requires additional expertise to use properly, or is overly specialized to a specific problem domain. Such "single-use" software tools are limited in scope, leading to underutilization by the wider scientific community. In contrast, this project seeks to develop methods and software for computer-based scientific analysis that are sufficiently powerful, flexible and accessible to (i) enable domain experts to achieve significant advancements within their domains, and (ii) enable innovative use of advanced computational techniques in unexpected scientific, technological and industrial applications. This project will apply these tools to a wide variety of specific scientific challenges faced by various research teams in astronomy, medicine, and energy management. These teams plan on using the proposed work to map out new star systems, develop new life-saving medications, and design new power systems that will deliver more energy to a greater number of homes and businesses at a lower cost than existing systems. Finally, this project will seek to leave a legacy of sustained societal benefit by educating students and practitioners in the broader scientific and engineering communities via exposure to state-of-the-art computational techniques.
Through close collaboration with research teams in statistical astronomy, pharmacometrics, power systems optimization, and high-performance computing, this project will deliver cyberinfrastructure that will effectively and effortlessly enable the next generation of computer-based scientific analysis and discovery. To ensure the practical applicability of the developed cyberinfrastructure, the project will focus on three target scientific applications: (i) economically viable decarbonization of electrical power networks, (ii) real-time analysis of extreme-scale astronomical image data, and (iii) pharmacometric modeling and simulation for drug analysis and discovery. While tackling these specific problems will constitute an initial stress test of the proposed cyberinfrastructure, it is the ultimate goal of the project that the developed tools be sufficiently performant, accessible, composable, flexible and adaptable to be applied to the widest possible range of problem domains. To achieve this vision, the project will build and improve various software tools for computational optimization, machine learning, parallel computing, and model-based simulation. Particular attention will be paid to the proposed cyberinfrastructure's composability with new and existing tools for scientific analysis and discovery. The pursuit of these goals will require the design and implementation of new programming language abstractions to allow close integration of high-level language features with low-level compiler optimizations. Furthermore, maximally exploiting proposed cyberinfrastructure will require research into new methods that combine state-of-the-art techniques from optimization, machine learning, and high-performance computing.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
The goal of this project was to develop methods and software for computer-based scientific analysis that are sufficiently powerful, flexible and accessible to (i) enable domain experts to achieve significant advancements within their domains, and (ii) enable innovative use of advanced computational techniques in unexpected scientific, technological and industrial applications. The project achieved this vision by 1) building and improving various tools for computational optimization, machine learning, parallel computing, and model-based simulation, and 2) using these tools to produce concrete advancements in scientific challenges associated to economically viable decarbonization of electrical power networks and pharmacometric modeling and simulation for drug analysis and discovery.
With regards to computational optimization, we made several contributions around modeling languages for mathematical optimization and interior point algorithms for convex optimization. In particular, by exploiting the structure of a sub-class of convex optimization known as conic programming problems we developed a solver that can make significant performance improvements in various problems related to nonparametric distribution estimation, optimal control, experimental design and the capacity of a classical-quantum channel. This solver is accessible through the JuMP modeling language and supports advanced modeling abstractions through user-defined cones and abstract numeric types (including complex numbers and arbitrary precision floating point numbers).
With regards to machine learning, parallel computing, and model-based simulation we introduced integrated high-performance symbolic-numeric computational frameworks for scientific computing and scientific machine learning. In particular, this infrastructure includes numerical methods for ordinary and stochastic differential equations that (i) can take advantage of both general-purpose distributed parallel infrastructure and problem-specific hardware (e.g. GPUs), and (ii) are compatible with machine learning techniques through advanced automatic differentiation techniques.
We also tested the effectiveness of the developed methods and infrastructure by applying them to two specific problems. First, we used the tools for computational optimization to develop a new capacity expansion planning framework for power systems. This framework can build an optimal investment strategy/pathway subject to uncertain policies such as CO2 limits and/or renewable energy mandates over several years into the future while accounting for detailed operation at an hourly resolution. Second, we used the tools for scientific machine learning as a foundation for the model-based analyses to accelerate vaccine clinical trials.
All the developed methodology is available through an open-source software ecosystem that has been independently and widely used for research and education. In particular, we supported various activities to nurture the community around this ecosystem to ensure its sustained use and development beyond the lifetime of this award.
Last Modified: 05/08/2024
Modified by: Juan Pablo Vielma
Please report errors in award information by writing to: awardsearch@nsf.gov.