Award Abstract # 1741057
BIGDATA: IA: Collaborative Research: In Situ Data Analytics for Next Generation Molecular Dynamics Workflows

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: UNIVERSITY OF DELAWARE
Initial Amendment Date: August 30, 2017
Latest Amendment Date: February 13, 2018
Award Number: 1741057
Award Instrument: Standard Grant
Program Manager: Almadena Chtchelkanova
achtchel@nsf.gov
 (703)292-7498
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2017
End Date: August 31, 2018 (Estimated)
Total Intended Award Amount: $979,987.00
Total Awarded Amount to Date: $979,987.00
Funds Obligated to Date: FY 2017 = $0.00
History of Investigator:
  • Michela Taufer (Principal Investigator)
    taufer@utk.edu
  • Trilce Estrada (Co-Principal Investigator)
Recipient Sponsored Research Office: University of Delaware
550 S COLLEGE AVE
NEWARK
DE  US  19713-1324
(302)831-2136
Sponsor Congressional District: 00
Primary Place of Performance: University of Delaware
210 Hullihen Hall
Newark
DE  US  19716-2553
Primary Place of Performance
Congressional District:
00
Unique Entity Identifier (UEI): T72NHKM259N3
Parent UEI:
NSF Program(s): Big Data Science &Engineering
Primary Program Source: 01001718DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7433, 7942, 8083, 9150
Program Element Code(s): 808300
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Molecular dynamics simulations studying the classical time evolution of a molecular system at atomic resolution are widely recognized in the fields of chemistry, material sciences, molecular biology and drug design; these simulations are one of the most common simulations on supercomputers. Next-generation supercomputers will have dramatically higher performance than do current systems, generating more data that needs to be analyzed (i.e., in terms of number and length of molecular dynamics trajectories). The coordination of data generation and analysis cannot rely on manual, centralized approaches as it does now. This interdisciplinary project integrates research from various areas across programs such as computer science, structural molecular biosciences, and high performance computing to transform the centralized nature of the molecular dynamics analysis into a distributed approach that is predominantly performed in situ. Specifically, this effort combines machine learning and data analytics approaches, workflow management methods, and high performance computing techniques to analyze molecular dynamics data as it is generated, save to disk only what is really needed for future analysis, and annotate molecular dynamics trajectories to drive the next steps in increasingly complex simulations' workflows.

The investigators tackle the data challenge of data analysis of molecular dynamics simulations on the next-generation supercomputers by (1) creating new in situ methods to trace molecular events such as conformational changes, phase transitions, or binding events in molecular dynamics simulations at runtime by locally reducing knowledge on high-dimensional molecular organization into a set of relevant structural molecular properties; (2) designing new data representations and extend unsupervised machine learning techniques to accurately and efficiently build an explicit global organization of structural and temporal molecular properties; (3) integrating simulation and analytics into complex workflows for runtime detection of changes in structural and temporal molecular properties; and (4) developing new curriculum material, online courses, and online training material targeting data analytics. The project's harnessed knowledge of molecular structures' transformations at runtime can be used to steer simulations to more promising areas of the simulation space, identify the data that should be written to congested parallel file systems, and index generated data for retrieval and post-simulation analysis. Supported by this knowledge, molecular dynamics workflows such as replica exchange simulations, Markov state models, and the string method with swarms of trajectories can be executed ?from the outside? (i.e., without reengineering the molecular dynamics code).

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 15)
An, Seoyoung and Channing, Georgia and Schuman, Catherine and Taufer, Michela "VINARCH: A Visual Analytics Interactive Tool for Neural Network Archaeology" Proceedings of the IEEE Cluster Conference (CLUSTER) , 2023 https://doi.org/10.1109/CLUSTERWorkshops61457.2023.00020 Citation Details
Bram, Yaron and Duan, Xiaohua and Nilsson-Payant, Benjamin E. and Chandar, Vasuretha and Wu, Hao and Shore, Derek and Fajardo, Alvaro and Sinha, Saloni and Hassan, Nora and Weinstein, Harel and TenOever, Benjamin R. and Chen, Shuibing and Schwartz, Robert "Dual-Reporter System for Real-Time Monitoring of SARS-CoV-2 Main Protease Activity in Live Cells Enables Identification of an Allosteric Inhibition Path" ACS Bio & Med Chem Au , v.2 , 2022 https://doi.org/10.1021/acsbiomedchemau.2c00034 Citation Details
Caino-Lores, Silvina and Cuendet, Michel and Marquez, Jack and Kots, Ekaterina and Estrada, Trilce and Deelman, Ewa and Weinstein, Harel and Taufer, Michela "Runtime Steering of Molecular Dynamics Simulations Through In Situ Analysis and Annotation of Collective Variables" , 2023 https://doi.org/10.1145/3592979.3593420 Citation Details
Channing, Georgia and Patel, Ria and Olaya, Paula and Rorabaugh, Ariel and Miyashita, Osamu and Caino-Lores, Silvina and Schuman, Catherine and Tama, Florence and Taufer, Michela "Composable Workflow for Accelerating Neural Architecture Search Using In Situ Analytics for Protein Classification" 52nd International Conference on Parallel Processing (ICPP) , 2023 https://doi.org/10.1145/3605573.3605636 Citation Details
Choi A, Kots ED "Analysis of the molecular determinants for furin cleavage of the spike protein S1/S2 site in defined strains of the prototype coronavirus murine hepatitis virus (MHV)." Virus Research , 2023 Citation Details
Do, Tu Mai and Pottier, Loïc and Ferreira da Silva, Rafael and CaínoLores, Silvina and Taufer, Michela and Deelman, Ewa "Performance assessment of ensembles of in situ workflows under resource constraints" Concurrency and Computation: Practice and Experience , 2022 https://doi.org/10.1002/cpe.7111 Citation Details
Do, Tu M. and Pottier, L. and Thomas, S. and Ferreira da Silva, R. and Cuendet, M. A. and Weinstein, H. and Estrada, T. and Taufer, M. and Deelman, E. "A Novel Metric to Evaluate In Situ Workflows" Lecture notes in computer science , v.12137 , 2020 https://doi.org/10.1007/978-3-030-50371-0_40 Citation Details
Khelashvili, George and Kots, Ekaterina and Cheng, Xiaolu and Levine, Michael V. and Weinstein, Harel "The allosteric mechanism leading to an open-groove lipid conductive state of the TMEM16F scramblase" Communications Biology , v.5 , 2022 https://doi.org/10.1038/s42003-022-03930-8 Citation Details
Kots, Ekaterina and Weinstein, Harel "Revealing the allosteric mechanism of pH-dependence in the proton-activated chloride channel" Biophysical Journal , v.122 , 2023 https://doi.org/10.1016/j.bpj.2022.11.475 Citation Details
Kots, Ekaterina D. and Shore, Derek M. and Weinstein, Harel "Adaptive Sampling using a Geometric Brownian Motion Model to Predict MD Trajectory Mobility on a Free Energy Surface" Biophysical Journal , v.120 , 2021 https://doi.org/10.1016/j.bpj.2020.11.690 Citation Details
Luettgau, Jakob and Martinez, Heberth and Tarcea, Glenn and Scorzelli, Giorgio and Pascucci, Valerio and Taufer, Michela "Studying Latency and Throughput Constraints for Geo-Distributed Data in the National Science Data Fabric" HPDC '23: Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing , 2023 https://doi.org/10.1145/3588195.3595948 Citation Details
(Showing: 1 - 10 of 15)

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page