
NSF Org: |
OAC Office of Advanced Cyberinfrastructure (OAC) |
Recipient: |
|
Initial Amendment Date: | September 8, 2018 |
Latest Amendment Date: | September 8, 2018 |
Award Number: | 1835874 |
Award Instrument: | Standard Grant |
Program Manager: |
Bogdan Mihaila
bmihaila@nsf.gov (703)292-8235 OAC Office of Advanced Cyberinfrastructure (OAC) CSE Directorate for Computer and Information Science and Engineering |
Start Date: | October 1, 2018 |
End Date: | September 30, 2022 (Estimated) |
Total Intended Award Amount: | $1,089,470.00 |
Total Awarded Amount to Date: | $1,089,470.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
110 21ST AVE S NASHVILLE TN US 37203-2416 (615)322-2631 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
Department of Chem & Biomol Eng. Nashville TN US 37212-2010 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): |
OFFICE OF MULTIDISCIPLINARY AC, DMR SHORT TERM SUPPORT, Software Institutes |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
As molecular-based computer simulations of both naturally occurring and man-made (synthetic) materials become increasingly used to predict their properties, the reproducibility of these simulations becomes an increasingly important issue. These simulations are complex, require large amounts of computer time, and are usually performed manually - i.e., put together one at a time, from all the components that go into such a simulation, including the models for how molecules interact with each other (known as forcefields). In addition, there has been much interest in being able to perform such computational simulations on large sets of different but related systems in order to screen for desirable properties, leading to the discovery of new materials and their incorporation into applications twice as rapidly and at half the cost of existing, primarily experimental, methods. This ambition is the basis for the national Materials Genome Initiative (MGI), making reproducibility even more important. In this project, nine research groups from eight universities are combining their expertise to create a software environment, called the Molecular Simulation Design Framework (MoSDeF) that will enable the automation of molecular-based computer simulations of soft materials (such as fluids, polymers, and biological systems) and will enable MGI-style screening of such systems. MoSDeF is open source and the use of MoSDeF will enable reproducibility in molecular-based computer simulations, because all simulation steps, all input data, and all codes used will be publicly accessible to anyone to reproduce a published simulation. MoSDeF will contribute to reproducibility through standardization and maintaining the provenance of forcefields, one of the most common sources of irreproducibility in molecular-based simulations.
Reproducibility in scientific research has become a prominent issue. Computational scientists, along with the rest of the scientific community, are grappling with the central question: How can a study be performed and published in such a way that it can be replicated by others? Answering this question is essential to the scientific enterprise and increasingly urgent, as reproducibility issues faced in small-scale studies will only be compounded as researchers look to harness the ever expanding computational power to perform large-scale Materials Genome Initiative (MGI) inspired screening studies, thus growing the number of simulations by orders of magnitude. Addressing the issues of reproducibility in soft matter simulation is particularly challenging, given the complexity of the simulation inputs and workflows, and the all-to-common usage of closed-source software. In this proposal, nine leading research groups (from Vanderbilt, U Michigan, Notre Dame U, U Delaware, Boise State U, U Houston, Wayne State U, and U Minnesota), representing a broad range of expertise, and an equally broad range of science applications, simulation codes, algorithms and analysis tools, along with computer scientists from Vanderbilt's Institute for Software Integrated Systems (ISIS), are committing to invest their expertise and capabilities to transform the mindset of molecular simulationists to perform and publish their simulations in such a way as to be Transparent, Reproducible, Usable by others, and Extensible (TRUE). Most of the investigators are recent or current holders of grants from the software program (i.e., S2I2, SSI or SSE grants); thus, the project builds upon, and brings synergy to, an existing large investment in molecular simulation software by NSF. To drive the community towards performing simulation that are TRUE, new software tools to facilitate best practices will be developed. Specifically, this will be achieved by expanding the capabilities of the open-source molecular simulation design framework (MoSDeF), which was initiated at Vanderbilt with support from two NSF grants. MoSDeF is a modular, scriptable Python framework that includes modules for programmatic system construction, encoding and applying force field usage rules, and workflow management, allowing the exact procedures used to setup and perform a simulation to be capture, version-controlled, and preserved. Continued development of the existing MoSDeF modules will be performed to support a wider range of force fields, molecular models, and open-source simulation engines. The creation of a plugin architecture for community extension, and the development of new modules for force field optimization, free energy calculations, and screening, will further allow MoSDeF can achieve these goals.
This project is supported by the Office of Advanced Cyberinfrastructure in the Directorate for Computer & Information Science & Engineering and the Division of Materials Research and the Division of Chemistry in the Directorate of Mathematical and Physical Sciences.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
This collaborative project involving 8 universities (Vanderbilt, Michigan, Notre Dame, Minnesota, Delaware, Boise State, Wayne State, and Houston) focused on dramatically expanding the capabilities of the Molecular Simulation Design Framework (MoSDeF, web site mosdef.org), an open-source set of Python libraries that are designed to address issues of automation/efficiency, accuracy, and reproducibility in molecular simulations. Molecular simulation of particle-based systems generally refers to trajectory-based molecular dynamics (MD, in which the motions of the atoms/molecules are propagated by solving Newton’s equation of motion, or a variant thereof, to obtain a trajectory of positions and momenta over time), Monte Carlo approaches (MC, in which a sequence of configurations for the system is generated by a stochastic Markov process, leading to the appropriate probability distribution), or MD/MC hybrids. Based solely on knowledge of the chemical composition and state conditions of a system (temperature, pressure or volume, number(s) of molecules or chemical potential(s)), molecular simulations allow one to predict thermodynamic and other properties, some of which are not accessible to experimental measurements, by averaging over trajectories, thus providing molecular-level insight
into its collective behavior.
The project has resulted in 23 refereed journal publications and more than 70 conference and seminar presentations. More than 20 open- source codes and databases have been developed from scratch (the MoSDeF GMSO, hybrid MC/MD simulations, MoSDeF-GOMC integration, MoSDeF-Cassandra integration, MoSDeF CP2K integration, forcefield derivation,, Boise State’s repositories for managing, teaching, deploying, and analyzing simulations of polymers for structural and photovoltaic applications) or extended (mBuild, Foyer, TraPPE web page, HOOMD-blue, freud, signac, and adaptation of multi-state iterative Boltzmann inversion coarse-graining methodology) This project contributed to the training and professional development of 20 graduate students, 7 post-doctoral researchers and 11 undergraduate researchers. The project has been utilized in numerous undergraduate and graduate courses, and been the subject of multiple hand-on workshops, with the result that many hundreds of people have been introduced to molecular simulation via MoSDeF.
Last Modified: 01/27/2023
Modified by: Peter T Cummings
Please report errors in award information by writing to: awardsearch@nsf.gov.