
NSF Org: |
CCF Division of Computing and Communication Foundations |
Recipient: |
|
Initial Amendment Date: | January 24, 2020 |
Latest Amendment Date: | June 24, 2024 |
Award Number: | 1943008 |
Award Instrument: | Continuing Grant |
Program Manager: |
Stephanie Gage
sgage@nsf.gov (703)292-4748 CCF Division of Computing and Communication Foundations CSE Directorate for Computer and Information Science and Engineering |
Start Date: | July 1, 2020 |
End Date: | June 30, 2025 (Estimated) |
Total Intended Award Amount: | $500,000.00 |
Total Awarded Amount to Date: | $500,000.00 |
Funds Obligated to Date: |
FY 2021 = $97,369.00 FY 2022 = $100,030.00 FY 2023 = $102,792.00 FY 2024 = $105,661.00 |
History of Investigator: |
|
Recipient Sponsored Research Office: |
3124 TAMU COLLEGE STATION TX US 77843-3124 (979)862-6777 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
188 Bizzell Street College Station TX US 77843-3128 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | FET-Fndtns of Emerging Tech |
Primary Program Source: |
01002122DB NSF RESEARCH & RELATED ACTIVIT 01002223DB NSF RESEARCH & RELATED ACTIVIT 01002324DB NSF RESEARCH & RELATED ACTIVIT 01002425DB NSF RESEARCH & RELATED ACTIVIT |
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
Proteins are essential parts of biological systems that often function through interactions. Toward understanding and engineering biological systems, data are rapidly accumulating on what proteins and what protein-protein interactions (PPIs) are present in such systems, but a major barrier remains as knowledge is limited on how proteins interact in 3-dimensional (3D) space. This project is designed to help fill the knowledge gap by developing computational methods that predict mechanism-revealing 3D structures formed by PPIs. While developing such methods, a data-focused yet physics-rationalized approach will be pursued, which is expected to advance the state of the knowledge across natural science and artificial intelligence. The outcome of the project will facilitate deciphering and engineering genome-wide PPIs for wide applications such as novel therapeutics, clean energy, and smart materials. The project is also designed with educational activities to promote the awareness, participation, training, and communication of data-driven science discovery for students, educators, domain scientists, and general public. The highly interdisciplinary research and education activities will be integrated to foster a diverse globally-competitive workforce, including historically underrepresented groups, to be ready for the era of big data.
The research goal of this project is to advance the state of the art for structural PPI prediction and re-think and tackle the problem as explaining how pairs of proteins, represented in various data forms such as texts, graphs, or images, interact under governing physics. In pursuit of the goal, the research objectives of the project involve three levels of PPI structural prediction of increasing resolutions and challenges: residue-level contact maps, residue-level distance distributions, and atom-level 3D structures. Initiated by these objectives, novel machine learning algorithms will be developed and contribute to foundational algorithm research, including the effective integration and learning from heterogeneous data as well as the flexible representation and incorporation of domain knowledge. Such advance in foundational algorithm research will expand the applicability of PPI structural prediction to genome-scale and learn physical principles underlying diverse PPIs rather than ?memorizing? patterns in similar PPIs. Moreover, such methodological advance is expected to impact broad application fields beyond PPI structural prediction. The proposed research is integrated with an educational plan by feeding research results and trained personnel to multi-scale education and outreach activities, involving educated students in research, and engaging general public in citizen science. New curricular and co-curricular activities will be developed to enhance the accessibility to interdisciplinary data-science training for a diverse student body and domain scientists. Also, multi-level outreach activities in collaboration with existing programs will be used to foster the awareness of and interest in interdisciplinary data science among diverse middle- and high-school students as well as the general public.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
Please report errors in award information by writing to: awardsearch@nsf.gov.