Award Abstract # 1835874
Collaborative Research: NSCI Framework: Software for Building a Community-Based Molecular Modeling Capability Around the Molecular Simulation Design Framework (MoSDeF)

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: VANDERBILT UNIVERSITY
Initial Amendment Date: September 8, 2018
Latest Amendment Date: September 8, 2018
Award Number: 1835874
Award Instrument: Standard Grant
Program Manager: Bogdan Mihaila
bmihaila@nsf.gov
 (703)292-8235
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2018
End Date: September 30, 2022 (Estimated)
Total Intended Award Amount: $1,089,470.00
Total Awarded Amount to Date: $1,089,470.00
Funds Obligated to Date: FY 2018 = $1,089,470.00
History of Investigator:
  • Peter Cummings (Principal Investigator)
    peter.cummings@vanderbilt.edu
  • Akos Ledeczi (Co-Principal Investigator)
  • Clare McCabe (Co-Principal Investigator)
  • Christopher Iacovella (Co-Principal Investigator)
Recipient Sponsored Research Office: Vanderbilt University
110 21ST AVE S
NASHVILLE
TN  US  37203-2416
(615)322-2631
Sponsor Congressional District: 05
Primary Place of Performance: Vanderbilt University
Department of Chem & Biomol Eng.
Nashville
TN  US  37212-2010
Primary Place of Performance
Congressional District:
05
Unique Entity Identifier (UEI): GTNBNWXJ12D5
Parent UEI:
NSF Program(s): OFFICE OF MULTIDISCIPLINARY AC,
DMR SHORT TERM SUPPORT,
Software Institutes
Primary Program Source: 01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 026Z, 054Z, 077Z, 7237, 7569, 7925, 8004, 8009, 9216, 9263
Program Element Code(s): 125300, 171200, 800400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

As molecular-based computer simulations of both naturally occurring and man-made (synthetic) materials become increasingly used to predict their properties, the reproducibility of these simulations becomes an increasingly important issue. These simulations are complex, require large amounts of computer time, and are usually performed manually - i.e., put together one at a time, from all the components that go into such a simulation, including the models for how molecules interact with each other (known as forcefields). In addition, there has been much interest in being able to perform such computational simulations on large sets of different but related systems in order to screen for desirable properties, leading to the discovery of new materials and their incorporation into applications twice as rapidly and at half the cost of existing, primarily experimental, methods. This ambition is the basis for the national Materials Genome Initiative (MGI), making reproducibility even more important. In this project, nine research groups from eight universities are combining their expertise to create a software environment, called the Molecular Simulation Design Framework (MoSDeF) that will enable the automation of molecular-based computer simulations of soft materials (such as fluids, polymers, and biological systems) and will enable MGI-style screening of such systems. MoSDeF is open source and the use of MoSDeF will enable reproducibility in molecular-based computer simulations, because all simulation steps, all input data, and all codes used will be publicly accessible to anyone to reproduce a published simulation. MoSDeF will contribute to reproducibility through standardization and maintaining the provenance of forcefields, one of the most common sources of irreproducibility in molecular-based simulations.

Reproducibility in scientific research has become a prominent issue. Computational scientists, along with the rest of the scientific community, are grappling with the central question: How can a study be performed and published in such a way that it can be replicated by others? Answering this question is essential to the scientific enterprise and increasingly urgent, as reproducibility issues faced in small-scale studies will only be compounded as researchers look to harness the ever expanding computational power to perform large-scale Materials Genome Initiative (MGI) inspired screening studies, thus growing the number of simulations by orders of magnitude. Addressing the issues of reproducibility in soft matter simulation is particularly challenging, given the complexity of the simulation inputs and workflows, and the all-to-common usage of closed-source software. In this proposal, nine leading research groups (from Vanderbilt, U Michigan, Notre Dame U, U Delaware, Boise State U, U Houston, Wayne State U, and U Minnesota), representing a broad range of expertise, and an equally broad range of science applications, simulation codes, algorithms and analysis tools, along with computer scientists from Vanderbilt's Institute for Software Integrated Systems (ISIS), are committing to invest their expertise and capabilities to transform the mindset of molecular simulationists to perform and publish their simulations in such a way as to be Transparent, Reproducible, Usable by others, and Extensible (TRUE). Most of the investigators are recent or current holders of grants from the software program (i.e., S2I2, SSI or SSE grants); thus, the project builds upon, and brings synergy to, an existing large investment in molecular simulation software by NSF. To drive the community towards performing simulation that are TRUE, new software tools to facilitate best practices will be developed. Specifically, this will be achieved by expanding the capabilities of the open-source molecular simulation design framework (MoSDeF), which was initiated at Vanderbilt with support from two NSF grants. MoSDeF is a modular, scriptable Python framework that includes modules for programmatic system construction, encoding and applying force field usage rules, and workflow management, allowing the exact procedures used to setup and perform a simulation to be capture, version-controlled, and preserved. Continued development of the existing MoSDeF modules will be performed to support a wider range of force fields, molecular models, and open-source simulation engines. The creation of a plugin architecture for community extension, and the development of new modules for force field optimization, free energy calculations, and screening, will further allow MoSDeF can achieve these goals.

This project is supported by the Office of Advanced Cyberinfrastructure in the Directorate for Computer & Information Science & Engineering and the Division of Materials Research and the Division of Chemistry in the Directorate of Mathematical and Physical Sciences.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Black, Jana E. and Summers, Andrew Z. and Iacovella, Christopher R. and Cummings, Peter T. and McCabe, Clare "Investigation of the Impact of Cross-Polymerization on the Structural and Frictional Properties of Alkylsilane Monolayers Using Molecular Simulation" Nanomaterials , v.9 , 2019 10.3390/nano9040639 Citation Details
Cummings, Peter T and Gilmer, Justin B "Open-source molecular modeling software in chemical engineering" Current Opinion in Chemical Engineering , v.23 , 2019 10.1016/j.coche.2019.03.008 Citation Details
Cummings, Peter T. and MCabe, Clare and Iacovella, Christopher R. and Ledeczi, Akos and Jankowski, Eric and Jayaraman, Arthi and Palmer, Jeremy C. and Maginn, Edward J. and Glotzer, Sharon C. and Anderson, Joshua A. and Ilja Siepmann, J. and Potoff, Jeffr "Opensource molecular modeling software in chemical engineering focusing on the Molecular Simulation Design Framework" AIChE Journal , v.67 , 2021 https://doi.org/10.1002/aic.17206 Citation Details
DeFever, Ryan S. and Matsumoto, Ray A. and Dowling, Alexander W. and Cummings, Peter T. and Maginn, Edward J. "MoSDeF Cassandra: A complete Python interface for the Cassandra Monte Carlo software" Journal of Computational Chemistry , v.42 , 2021 https://doi.org/10.1002/jcc.26544 Citation Details
Klein, Christoph and Summers, Andrew Z. and Thompson, Matthew W. and Gilmer, Justin B. and McCabe, Clare and Cummings, Peter T. and Sallai, Janos and Iacovella, Christopher R. "Formalizing atom-typing and the dissemination of force fields with foyer" Computational Materials Science , v.167 , 2019 10.1016/j.commatsci.2019.05.026 Citation Details
Moore, Timothy C. and Yang, Alexander H. and Ogungbesan, Olu and Hartkamp, Remco and Iacovella, Christopher R. and Zhang, Qi and McCabe, Clare "Influence of Single-Stranded DNA Coatings on the Interaction between Graphene Nanoflakes and Lipid Bilayers" The Journal of Physical Chemistry B , v.123 , 2019 10.1021/acs.jpcb.9b04042 Citation Details
Summers, Andrew Z. and Gilmer, Justin B. and Iacovella, Christopher R. and Cummings, Peter T. and MCabe, Clare "MoSDeF, a Python Framework Enabling Large-Scale Computational Screening of Soft Matter: Application to Chemistry-Property Relationships in Lubricating Monolayer Films" Journal of Chemical Theory and Computation , v.16 , 2020 10.1021/acs.jctc.9b01183 Citation Details
Thompson, Matthew W. and Gilmer, Justin B. and Matsumoto, Ray A. and Quach, Co D. and Shamaprasad, Parashara and Yang, Alexander H. and Iacovella, Christopher R. and McCabe, Clare and Cummings, Peter T. "Towards molecular simulations that are transparent, reproducible, usable by others, and extensible (TRUE)" Molecular Physics , v.118 , 2020 10.1080/00268976.2020.1742938 Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

This collaborative project involving 8 universities (Vanderbilt, Michigan, Notre Dame, Minnesota, Delaware, Boise State, Wayne State, and Houston) focused on dramatically expanding the capabilities of the Molecular Simulation Design Framework (MoSDeF, web site mosdef.org), an open-source set of Python libraries that are designed to address issues of automation/efficiency, accuracy, and reproducibility in molecular simulations. Molecular simulation of particle-based systems generally refers to trajectory-based molecular dynamics (MD, in which the motions of the atoms/molecules are propagated by solving Newton’s equation of motion, or a variant thereof, to obtain a trajectory of positions and momenta over time), Monte Carlo approaches (MC, in which a sequence of configurations for the system is generated by a stochastic Markov process, leading to the appropriate probability distribution), or MD/MC hybrids. Based solely on knowledge of the chemical composition and state conditions of a system (temperature, pressure or volume, number(s) of molecules or chemical potential(s)), molecular simulations allow one to predict thermodynamic and other properties, some of which are not accessible to experimental measurements, by averaging over trajectories, thus providing molecular-level insight
into its collective behavior.  

The project has resulted in 23 refereed journal publications and more than 70 conference and seminar presentations. More than 20 open- source codes and databases have been developed from scratch (the MoSDeF GMSO, hybrid MC/MD simulations,  MoSDeF-GOMC integration,  MoSDeF-Cassandra integration, MoSDeF CP2K integration, forcefield derivation,Boise State’s repositories for managing,  teaching,  deploying,  and analyzing  simulations of polymers for structural and photovoltaic applications) or extended (mBuildFoyer,  TraPPE web page,  HOOMD-bluefreudsignac and adaptation of multi-state iterative Boltzmann inversion coarse-graining methodology) This project contributed to the training and professional development of 20 graduate students, 7 post-doctoral researchers and 11 undergraduate researchers. The project has been utilized in numerous undergraduate and graduate courses, and been the subject of multiple hand-on workshops, with the result that many hundreds of people have been introduced to molecular simulation via MoSDeF.

 


Last Modified: 01/27/2023
Modified by: Peter T Cummings

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page