Award Abstract # 1146300
COLLABORATIVE: ABI Development: Methodology for Pattern Creation, Imprint Validation, and Discovery from the Annotated Biological Web

NSF Org: DBI
Division of Biological Infrastructure
Recipient: ST. BONAVENTURE UNIVERSITY
Initial Amendment Date: April 9, 2012
Latest Amendment Date: June 29, 2016
Award Number: 1146300
Award Instrument: Continuing Grant
Program Manager: Jen Weller
DBI
 Division of Biological Infrastructure
BIO
 Directorate for Biological Sciences
Start Date: September 1, 2012
End Date: August 31, 2017 (Estimated)
Total Intended Award Amount: $150,701.00
Total Awarded Amount to Date: $180,805.00
Funds Obligated to Date: FY 2012 = $32,835.00
FY 2013 = $65,843.00

FY 2014 = $52,023.00

FY 2016 = $30,104.00
History of Investigator:
  • Xiao-Ning Zhang (Principal Investigator)
    xzhang@sbu.edu
Recipient Sponsored Research Office: Saint Bonaventure University
3261 WEST STATE RD
SAINT BONAVENTURE
NY  US  14778-9800
(716)375-2435
Sponsor Congressional District: 23
Primary Place of Performance: Saint Bonaventure University
NY  US  14778-2500
Primary Place of Performance
Congressional District:
23
Unique Entity Identifier (UEI): JLNZDL8P2DP7
Parent UEI:
NSF Program(s): ADVANCES IN BIO INFORMATICS
Primary Program Source: 01001213DB NSF RESEARCH & RELATED ACTIVIT
01001314DB NSF RESEARCH & RELATED ACTIVIT

01001415DB NSF RESEARCH & RELATED ACTIVIT

01001617DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 1165, 7556, 9178
Program Element Code(s): 116500
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.074

ABSTRACT

Collaborative grants have been awarded to the University of Maryland, the University of Iowa and St. Bonaventure University to develop a methodology that exploits the wealth of annotation knowledge, notably Gene Ontology (GO) and Plant Ontology (PO) annotations of Arabidopsis genes. Motivated by the availability of rich and as yet insufficiently tapped collections of gene annotations, the project aims to facilitate the discovery of hidden knowledge that could be the basis of further scientific research. The methodology will extract patterns of interest from annotation graphs (pattern discovery). Literature-based methods will extract sentences that validate the biological meaning underlying these patterns (pattern validation). To demonstrate the methodology, the PattArAn tool (Patterns in Arabidopsis Annotations) will be customized for Arabidopsis. PattArAn will provide the user with a graphical presentation of patterns of Arabidopsis genes and associated GO and PO CV terms. Graph data mining techniques and efficient algorithmic solutions to identify dense subgraphs (DSG) and to perform graph summarization (GS) will be developed. Algorithms to mine the literature for relevant sentences for an extracted pattern (referred to as the imprint) will be developed. PattArAn will enable iterative exploration and will incorporate allied steps such as consulting gene function prediction. The project will involve collaboration with biologists for building and refining annotation graphs, and validating patterns to ensure relevance to their research.

The project makes broad contributions to the Arabidopsis thaliana community. PattArAn may assist Arabidopsis curators to manage GO-PO annotations and complement existing tools such as Textpresso and AraNet. It can also be used to bootstrap an annotation database for other plant species given that their genome sequence information is available. The project offers significant research and educational experiences for graduate students (University of Maryland and Iowa) and undergraduate students (St. Bonaventure University). Team members will continue to mentor women and students from under-represented communities, participate in outreach activities, lead a Journal Club, etc. The outcomes from this research project will be disseminated via biology and bioinformatics venues. More information may be obtained at the project website: https://wiki.umiacs.umd.edu/clip/pattaran/.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Alexandra Paz, Xiao-Ning Zhang "The GMO Industry: A Neglected Earthly Frontier." Journal of Hunger and Environmental Nutrition , 2016
Chen, Samuel L. and Rooney, Timothy J. and Hu, Anna R. and Beard, Hunter S. and Garrett, Wesley M. and Mangalath, Leann M. and Powers, Jordan J. and Cooper, Bret and Zhang, Xiao-Ning "Quantitative Proteomics Reveals a Role for SERINE/ARGININE-Rich 45 in Regulating RNA Metabolism and Modulating Transcriptional Suppression via the ASAP Complex in Arabidopsis thaliana" Frontiers in Plant Science , v.10 , 2019 10.3389/fpls.2019.01116 Citation Details
Kevin Cilano, Zachary Mazanek, Mahmuda Khan, Sarah Metcalfe, Xiao-Ning Zhang "A New Mutation, hap1-2, Reveals a CTerminal Domain Function in AtMago Proteinand Its Biological Effects in MaleGametophyte Development in Arabidopsisthaliana" PLOS ONE , 2016 10.1371/journal.pone.0148200
Padmini Srinivasan, Xiao-Ning Zhang, Roxane Bouten, Caren Chang "Ferret: a sentence-based literature scanning system" BMC Bioinformatics , 2015 DOI 10.1186/s12859-015-0630-0
Xiao-Ning Zhang, Cecilia Mo, Wesley Garrett, Bret Cooper "Phosphothreonine 218 is required for the function of SR45.1 in regulating flower petal development in Arabidopsis" Plant Signaling & Behavior , v.9 , 2014 , p.e29134 http://dx.doi.org/10.4161/psb.29134

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

This project aims to develop tools that exploits the wealth of human-curated knowledge in the literature and in the annotation evidence. The tools would benefit biologists who have a range of information discover needs from quickly understanding the functional behavior of a large number of genes to developing a deeper understanding of the relationship of genes within a biological pathway.

 

In reaching this goal, the collaboration team at St Bonaventure University, University of Iowa and University of Maryland worked together to develop a sentence-retrieval tool, Ferret, and a visualization tool, semEP, to create meaningful clusters of genes and their GO annotations. Both tools were field tested with research data from independent biologists and optimized to make it user-friendly. The SBU PI also used Ferret to examine GO enrichment results for an RNA-seq experiment. It confirmed GO enrichment results on an unexpected defense role of a splicing regulator, serine-arginine rich 45 (SR45), in Arabidopsis, which has never been studied before.

 

To integrate the above project into the biology curriculum, the SBU PI Designed and implemented a bioinformatics lab exercise to teach students vocabularies and visualizations of human-curated knowledge from the literature and in the annotation evidence. She also designed and implemented a bioinformatics research program that involved for 11 high school sophomores and juniors to work with 4 undergraduate students and 4 faculty at SBU for 2 weeks. With a bioinformatics undergraduate major, the SBU PI designed and implemented a bioinformatics workshop curriculum with a focus on breast cancer for K-12 math and science teachers.

Research results were presented at professional meetings annually. Both the HS student research and teacher workshop were disseminated via WNY STEM HUB, facebook, and LinkedIn. The teacher workshop photos are disseminated as a vimeo slideshow: https://vimeo.com/228492495.

 


Last Modified: 09/10/2017
Modified by: Xiao-Ning Zhang

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page