Skip to feedback

Award Abstract # 2226663
MFB: Novel Graph Neural Networks to Understand, Predict, and Design Allosteric Transcription Factors

NSF Org: CHE
Division Of Chemistry
Recipient: GEORGIA TECH RESEARCH CORP
Initial Amendment Date: September 8, 2022
Latest Amendment Date: September 8, 2022
Award Number: 2226663
Award Instrument: Standard Grant
Program Manager: Tingyu Li
tli@nsf.gov
 (703)292-4949
CHE
 Division Of Chemistry
MPS
 Directorate for Mathematical and Physical Sciences
Start Date: September 1, 2022
End Date: August 31, 2025 (Estimated)
Total Intended Award Amount: $1,485,925.00
Total Awarded Amount to Date: $1,485,925.00
Funds Obligated to Date: FY 2022 = $1,485,925.00
History of Investigator:
  • Corey Wilson (Principal Investigator)
    corey.wilson@chbe.gatech.edu
  • Matthew Realff (Co-Principal Investigator)
  • Yao Xie (Co-Principal Investigator)
Recipient Sponsored Research Office: Georgia Tech Research Corporation
926 DALNEY ST NW
ATLANTA
GA  US  30318-6395
(404)894-4819
Sponsor Congressional District: 05
Primary Place of Performance: Georgia Institute of Technology
225 North Avenue
Atlanta
GA  US  30332-0002
Primary Place of Performance
Congressional District:
05
Unique Entity Identifier (UEI): EMW9FC8J3HN4
Parent UEI: EMW9FC8J3HN4
NSF Program(s): Special Initiatives,
CHEMISTRY PROJECTS
Primary Program Source: 01002223DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 075Z, 1757, 8038
Program Element Code(s): 164200, 199100
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.041, 47.049, 47.070

ABSTRACT

In this Molecular Foundations for Biotechnology (MFB) project, Professors Corey J. Wilson, Matthew J. Realff, and Yao Xie at the Georgia Institute of Technology are leveraging both novel experimental and machine learning strategies to understand, predict, and design allosteric communication in a family of proteins called transcription factors that regulate gene expression in living systems. Protein allostery is an important protein function which enables communication between different parts of a functional protein that are widely separated. Our lack of understanding of the mechanism of allostery prevents scientist and engineers from designing this critically important function. By combining the tools of molecular biology and artificial intelligence, this project aims to decipher structure/activity patterns for naturally occurring and engineered transcription factors at the molecular level, specifically at the level of individual amino acids. Understanding the rules that govern allosteric communication would, in principle, enable investigators to design new transcription factors for a variety of high-impact applications such as manipulating the composition of bacteria in the gut. This project involves a blend of biochemistry, biophysics, engineering, and machine learning approaches that will facilitate student engagement across traditional disciplinary boundaries. In addition to diverse student involvement, the broader impacts of this project will include the development of innovative pedagogical modules in the areas of machine learning and biological engineering. This project will contribute to the development of a diverse and engaged STEM (science, technology, engineering and mathematics) workforce, building a firm foundation for a lifetime of contributions to research, education, and their integration.

Protein allostery is a vitally important protein function that has proven to be a vexing problem to understand at the molecular level. The goal of this project is to decipher the underlying molecular mechanisms by which the allosteric signal traverses the scaffold across several naturally occurring and engineer transcription factors with alternate allosteric controls from the broader LacI/GalR family of protein homologues. In general, allosteric communication involves networks of non-neighboring amino acid positions; therefore, traditional pairwise computational approaches (e.g., molecular mechanics simulations, and related computer-aided protein design strategies) are of limited use in understanding and designing allosteric networks a priori. Accordingly, this project seeks to develop novel machine learning approaches and complementary experimental strategies to accelerate scientific progress and transform the nature of studying and designing allosteric communication. This project has the potential to lead to a paradigm shift with regard to the origins and construction (design) of allosteric networks in a single fold. Moreover, the machine learning approaches developed in this project can in principle be applied to other complex network problems beyond the designated protein systems ? e.g., distillation column sequences, communication systems, and power grid systems.

This project is jointly supported by the Division of Chemistry, the Division of Chemical, Bioengineering, Environmental and Transport Systems, and the Division of Information and Intelligent Systems.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Hersey, Ashley N. and Kay, Valerie E. and Lee, Sumin and Realff, Matthew J. and Wilson, Corey J. "Engineering allosteric transcription factors guided by the LacI topology" Cell Systems , v.14 , 2023 https://doi.org/10.1016/j.cels.2023.04.008 Citation Details
Huang, Brian D. and Kim, Dowan and Yu, Yongjoon and Wilson, Corey J. "Engineering intelligent chassis cells via recombinase-based MEMORY circuits" Nature Communications , v.15 , 2024 https://doi.org/10.1038/s41467-024-46755-1 Citation Details
Milner, Prasaad T. and Zhang, Ziqiao and Herde, Zachary D. and Vedire, Namratha R. and Zhang, Fumin and Realff, Matthew J. and Wilson, Corey J. "Performance Prediction of Fundamental Transcriptional Programs" ACS Synthetic Biology , v.12 , 2023 https://doi.org/10.1021/acssynbio.2c00593 Citation Details
Selvakumar, Raja and Kumar, Ishita and Onajobi, Glory J and Yu, Yongjoon and Wilson, Corey J "Engineering living therapeutics and diagnostics: A new frontier in human health" Current Opinion in Systems Biology , v.37 , 2024 https://doi.org/10.1016/j.coisb.2023.100484 Citation Details

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page