Award Abstract # 1835443
Framework: Software: Next-Generation Cyberinfrastructure for Large-Scale Computer-Based Scientific Analysis and Discovery

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: MASSACHUSETTS INSTITUTE OF TECHNOLOGY
Initial Amendment Date: August 24, 2018
Latest Amendment Date: November 23, 2020
Award Number: 1835443
Award Instrument: Standard Grant
Program Manager: Rob Beverly
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: January 1, 2019
End Date: December 31, 2023 (Estimated)
Total Intended Award Amount: $3,498,560.00
Total Awarded Amount to Date: $3,498,560.00
Funds Obligated to Date: FY 2018 = $3,498,560.00
History of Investigator:
  • Alan Edelman (Principal Investigator)
    EDELMAN@MATH.MIT.EDU
  • Juan Pablo Vielma (Co-Principal Investigator)
Recipient Sponsored Research Office: Massachusetts Institute of Technology
77 MASSACHUSETTS AVE
CAMBRIDGE
MA  US  02139-4301
(617)253-1000
Sponsor Congressional District: 07
Primary Place of Performance: Massachusetts Institute of Technology
77 Massachusetts
Cambridge
MA  US  02139-4307
Primary Place of Performance
Congressional District:
07
Unique Entity Identifier (UEI): E2NYLCDML6V1
Parent UEI: E2NYLCDML6V1
NSF Program(s): Software Institutes
Primary Program Source: 01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 026Z, 077Z, 7925, 8004
Program Element Code(s): 800400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Recent revolutions in data availability have radically altered activities across many fields within science, industry, and government. For instance, contemporary simulations medication properties can require the computational power of entire data centers, and recent efforts in astronomy will soon generate the largest image datasets in history. In such extreme environments, the only viable path forward for scientific discovery hinges on the development and exploitation of next-generation computational cyberinfrastructure of supercomputers and software. The development of this new computational infrastructure demands significant engineering resources, so it is paramount to maximize the infrastructure's potential for high impact and wide adoption across as many technical domains as possible. Unfortunately, despite this necessity, existing development processes often produce software that is limited to specific hardware, or requires additional expertise to use properly, or is overly specialized to a specific problem domain. Such "single-use" software tools are limited in scope, leading to underutilization by the wider scientific community. In contrast, this project seeks to develop methods and software for computer-based scientific analysis that are sufficiently powerful, flexible and accessible to (i) enable domain experts to achieve significant advancements within their domains, and (ii) enable innovative use of advanced computational techniques in unexpected scientific, technological and industrial applications. This project will apply these tools to a wide variety of specific scientific challenges faced by various research teams in astronomy, medicine, and energy management. These teams plan on using the proposed work to map out new star systems, develop new life-saving medications, and design new power systems that will deliver more energy to a greater number of homes and businesses at a lower cost than existing systems. Finally, this project will seek to leave a legacy of sustained societal benefit by educating students and practitioners in the broader scientific and engineering communities via exposure to state-of-the-art computational techniques.

Through close collaboration with research teams in statistical astronomy, pharmacometrics, power systems optimization, and high-performance computing, this project will deliver cyberinfrastructure that will effectively and effortlessly enable the next generation of computer-based scientific analysis and discovery. To ensure the practical applicability of the developed cyberinfrastructure, the project will focus on three target scientific applications: (i) economically viable decarbonization of electrical power networks, (ii) real-time analysis of extreme-scale astronomical image data, and (iii) pharmacometric modeling and simulation for drug analysis and discovery. While tackling these specific problems will constitute an initial stress test of the proposed cyberinfrastructure, it is the ultimate goal of the project that the developed tools be sufficiently performant, accessible, composable, flexible and adaptable to be applied to the widest possible range of problem domains. To achieve this vision, the project will build and improve various software tools for computational optimization, machine learning, parallel computing, and model-based simulation. Particular attention will be paid to the proposed cyberinfrastructure's composability with new and existing tools for scientific analysis and discovery. The pursuit of these goals will require the design and implementation of new programming language abstractions to allow close integration of high-level language features with low-level compiler optimizations. Furthermore, maximally exploiting proposed cyberinfrastructure will require research into new methods that combine state-of-the-art techniques from optimization, machine learning, and high-performance computing.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 26)
Anantharaman, Ranjan and Ma, Yingbo and Gowda, Shashi and Laughman, Chris and Shah, Viral and Edelman, Alan and Rackauckas, Chris "Accelerating Simulation of Stiff Nonlinear Systems using Continuous-Time Echo State Networks" Proceedings of the AAAI 2021 Spring Symposium on Combining Artificial Intelligence and Machine Learning with Physical Sciences , 2021 Citation Details
Arnold, Julian and Schäfer, Frank "Replacing Neural Networks by Optimal Analytical Predictors for the Detection of Phase Transitions" Physical Review X , v.12 , 2022 https://doi.org/10.1103/PhysRevX.12.031044 Citation Details
Arya, Gaurav and "Automatic Differentiation of Programs with Discrete Randomness" Advances in Neural Information Processing Systems , v.35 , 2022 Citation Details
Besard, Tim and Churavy, Valentin and Edelman, Alan and Sutter, Bjorn De "Rapid software prototyping for heterogeneous and distributed platforms" Advances in Engineering Software , v.132 , 2019 https://doi.org/10.1016/j.advengsoft.2019.02.002 Citation Details
Coey, Chris and Kapelevich, Lea and Vielma, Juan Pablo "Conic Optimization with Spectral Functions on Euclidean Jordan Algebras" Mathematics of Operations Research , 2022 https://doi.org/10.1287/moor.2022.1324 Citation Details
Coey, Chris and Kapelevich, Lea and Vielma, Juan Pablo "Performance enhancements for a generic conic interior point algorithm" Mathematical Programming Computation , 2022 https://doi.org/10.1007/s12532-022-00226-0 Citation Details
Coey, Chris and Kapelevich, Lea and Vielma, Juan Pablo "Solving Natural Conic Formulations with Hypatia.jl" INFORMS Journal on Computing , v.34 , 2022 https://doi.org/10.1287/ijoc.2022.1202 Citation Details
Cunis, Torbjørn and Legat, Benoît "Sequential sum-of-squares programming for analysis of nonlinear systems " , 2023 https://doi.org/10.23919/ACC55779.2023.10156153 Citation Details
Dixit, Vaibhav Kumar and Rackauckas, Christopher "GlobalSensitivity.jl: Performant and Parallel GlobalSensitivity Analysis with Julia" Journal of Open Source Software , v.7 , 2022 https://doi.org/10.21105/joss.04561 Citation Details
Edelman, Alan and Jeong, Sungwoo "On the Cartan decomposition for classical random matrix ensembles" Journal of Mathematical Physics , v.63 , 2022 https://doi.org/10.1063/5.0087010 Citation Details
Giordano, Mose and Klower, Milan and Churavy, Valentin "Productivity meets Performance: Julia on A64FX" 2022 IEEE International Conference on Cluster Computing (CLUSTER) , 2022 https://doi.org/10.1109/CLUSTER51413.2022.00072 Citation Details
(Showing: 1 - 10 of 26)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The goal of this project was to develop methods and software for computer-based scientific analysis that are sufficiently powerful, flexible and accessible to (i) enable domain experts to achieve significant advancements within their domains, and (ii) enable innovative use of advanced computational techniques in unexpected scientific, technological and industrial applications. The project achieved this vision by 1) building and improving various tools for computational optimization, machine learning, parallel computing, and model-based simulation, and 2) using these tools to produce concrete advancements in scientific challenges associated to economically viable decarbonization of electrical power networks and pharmacometric modeling and simulation for drug analysis and discovery. 

 

With regards to computational optimization, we made several contributions around modeling languages for mathematical optimization and interior point algorithms for convex optimization. In particular, by exploiting the structure of a sub-class of convex optimization known as conic programming problems we developed a solver that can make significant performance improvements in various problems related to nonparametric distribution estimation, optimal control, experimental design and the capacity of a classical-quantum channel. This solver is accessible through the JuMP modeling language and supports advanced modeling abstractions through user-defined cones and abstract numeric types (including complex numbers and arbitrary precision floating point numbers). 

 

With regards to machine learning, parallel computing, and model-based simulation we introduced integrated high-performance symbolic-numeric computational frameworks for scientific computing and scientific machine learning. In particular, this infrastructure includes numerical methods for ordinary and stochastic differential equations that (i) can take advantage of both general-purpose distributed parallel infrastructure and problem-specific hardware (e.g. GPUs), and (ii) are compatible with machine learning techniques through advanced automatic differentiation techniques. 

 

We also tested the effectiveness of the developed methods and infrastructure by applying them to two specific problems. First, we used the tools for computational optimization to develop a new capacity expansion planning framework for power systems. This framework can build an optimal investment strategy/pathway subject to uncertain policies such as CO2 limits and/or renewable energy mandates over several years into the future while accounting for detailed operation at an hourly resolution. Second, we used the tools for scientific machine learning as a foundation for the model-based analyses to accelerate vaccine clinical trials.

 

All the developed methodology is available through an open-source software ecosystem that has been independently and widely used for research and education. In particular, we supported various activities to nurture the community around this ecosystem to ensure its sustained use and development beyond the lifetime of this award.


 

 


Last Modified: 05/08/2024
Modified by: Juan Pablo Vielma

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page