Award Abstract # 1546858
Research-PGR Orphan Genes: An Untapped Genetic Reservoir of Novel Traits Driving Evolutionary Adaptation and Crop Improvement

NSF Org: IOS
Division Of Integrative Organismal Systems
Recipient: IOWA STATE UNIVERSITY OF SCIENCE AND TECHNOLOGY
Initial Amendment Date: August 9, 2016
Latest Amendment Date: September 10, 2018
Award Number: 1546858
Award Instrument: Continuing Grant
Program Manager: Gerald Schoenknecht
gschoenk@nsf.gov
 (703)292-5076
IOS
 Division Of Integrative Organismal Systems
BIO
 Directorate for Biological Sciences
Start Date: August 1, 2016
End Date: July 31, 2022 (Estimated)
Total Intended Award Amount: $2,175,310.00
Total Awarded Amount to Date: $2,175,310.00
Funds Obligated to Date: FY 2016 = $1,066,167.00
FY 2017 = $546,816.00

FY 2018 = $562,327.00
History of Investigator:
  • Eve Wurtele (Principal Investigator)
    mash@iastate.edu
  • Kevin Bassler (Co-Principal Investigator)
Recipient Sponsored Research Office: Iowa State University
1350 BEARDSHEAR HALL
AMES
IA  US  50011-2103
(515)294-5225
Sponsor Congressional District: 04
Primary Place of Performance: Iowa State University
IA  US  50011-3223
Primary Place of Performance
Congressional District:
Unique Entity Identifier (UEI): DQDBM7FGJPC5
Parent UEI: DQDBM7FGJPC5
NSF Program(s): Plant Genome Research Project,
Cross-BIO Activities
Primary Program Source: 01001617DB NSF RESEARCH & RELATED ACTIVIT
01001718DB NSF RESEARCH & RELATED ACTIVIT

01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7577, 9109, 9150, 9178, 9179, BIOT
Program Element Code(s): 132900, 727500
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.074

ABSTRACT

Genes confer the primary traits that are passed on from generation to generation in plants and animals. Where, when and how new genes arise are far-reaching biological questions with practical implications. If scientists could identify emergent genes that confer new traits, the breeding potential for crops could be greatly expanded. During the quest for understanding where new genes come from, it is now known that genes can arise anew from regions of the genome where there were none previously. These so-called "orphan genes" may be key ways that species can evolve and adapt to challenging environments through the expression of new traits. To uncover orphan genes, this project taps into the sequenced genomes of maize, a major crop of worldwide importance, and Brassica, another model crop. Both species have major genomic resources available for orphan gene discovery. The research will identify orphan genes and evaluate traits possibly conferred by the genes in maize lines adapted to particular conditions. Candidate orphan genes that influence agronomically important traits will be selected and functionally analyzed. Computational tools will be developed to mine the sequence datasets and the resulting data will be integrated into community databases. At all stages, students will be trained in computational and genomic science. To reach high school students and early undergraduate students, new computer games modules are being developed and tested with the intention of increasing an understanding of the function and potential of orphan genes. A long-term outcome is that researchers and society will be able to design new solutions to improving crops through harnessing orphan genes.

The premise that new genes can arise from non-genic DNA sequences is borne out from massive DNA and RNA sequencing data. This concept sharply contrasts with the long-accepted view that novel gene functions primarily arise from a slow process of accumulated mutations and rearrangements of already-established genes. A hypothesis is that a major role of orphan genes is to regulate the defense and metabolic responses that enable evolutionary adaptation to new environments. This research will identify orphan genes of major agronomic species, focusing first on maize and Brassica. These results will inform a systematic analysis of orphan genes at the level of subspecies, thus categorizing orphan genes in the context of the adaptation and selection that has occurred as the result of human intervention for improved agronomic traits. Based on the resultant data, specific orphan genes will be selected for experimental functional analysis. Data will be integrated into community databases, and code will be available to the public. New computer game modules will be targeted to high school and early undergraduate students. The goal is to develop data and computational tools that facilitate predictive understanding of the function of orphan genes in driving evolutionary adaptation, to harness these resources for improving crops, and to disseminate the information to researchers and students. These capabilities will empower researchers to explore the significance of recently-emerged orphan genes, and transform fundamental knowledge into innovative solutions that improve crop traits.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 28)
Arendsee, Zebulun and Li, Jing and Singh, Urminder and Seetharam, Arun and Dorman, Karin and Wurtele, Eve Syrkin "phylostratr: A framework for phylostratigraphy" bioRxiv , 2018 , p.360164
Arendsee, Zebulun and Li, Jing and Singh, Urminder and Seetharam, Arun and Dorman, Karin and Wurtele, Eve Syrkin "phylostratr: A framework for phylostratigraphy" BioRxiv , 2019 , p.360164 https://doi.org/10.1101/360164
Arendsee Z, Jennifer Chang, Wurtele ES "rmonad: Pipelines you can compute on" R journal , 2021
Bhandary, Priyanka and Seetharam, Arun S and Arendsee, Zebulun and Hur, Manhoi and Wurtele, Eve Syrkin "Raising orphans from a metadata morass: a researcher's guide to re-use of public?omics data" Plant Science , 2017
Bhandary, Priyanka and Seetharam, Arun S and Arendsee, Zebulun W and Hur, Manhoi and Wurtele, Eve Syrkin "Raising orphans from a metadata morass: A researcher's guide to re-use of public?omics data" Plant science , v.267 , 2018 , p.32--47
Francisco J Enguita, Ana Lúcia Leitão, J Tyson McDonald, Viktorija Zaksas, Saswati Das, Diego Galeano, Deanne Taylor, Eve Syrkin Wurtele, Amanda Saravia-Butler, Stephen B Baylin, Robert Meller, D Marshall Porterfield, Douglas C Wallace, Jonathan C Schisle "The interplay between lncRNAs, RNA-binding proteins and viral genome during SARS-CoV-2 infection reveals strong connections with regulatory events involved in RNA metabolism" Theranostics , v.12 , 2022 , p.3946 10.7150/thno.73268
Hur, Manhoi and Ware, Rebecca L and Park, Junkoo and McKenna, Amy M and Rodgers, Ryan P and Nikolau, Basil J and Wurtele, Eve S and Marshall, Alan G "Statistically significant differences in composition of petroleum crude oils revealed by volcano plots generated from ultrahigh resolution fourier transform ion cyclotron resonance mass spectra" Energy \& fuels , v.32 , 2018 , p.1206--121
Hur, Manhoi and Ware, Rebecca L and Park, Junkoo and McKenna, Amy M and Rodgers, Ryan P and Nikolau, Basil J and Wurtele, Eve S and Marshall, Alan G "Statistically Significant Differences in Composition of Petroleum Crude Oils Revealed by Volcano Plots Generated from Ultrahigh Resolution Fourier Transform Ion Cyclotron Resonance Mass Spectra" Energy \& Fuels , v.32 , 2018 , p.1206--121
International Arabidopsis Informatics Consortium and Doherty, Colleen and Friesner, Joanna and Gregory, Brian and Loraine, Ann and Megraw, Molly and Provart, Nicholas and Slotkin, R Keith and Town, Chris and Assmann, Sarah M and others "Arabidopsis bioinformatics resources: The current state, challenges, and priorities for the future" Plant Direct , v.3 , 2019 , p.e00109 https://doi.org/10.1002/pld3.109
Jiahao Guo, Pramesh Singh, and Kevin E. Bassler "Reduced network extremal ensemble learning (RenEEL) scheme for community detection in complex networks" Scientific Reports , 2019
Jing Li, Urminder Singh, Priyanka Bhandary, Jacqueline Campbell, Zebulun Arendsee, Arun S. Seetharam, Eve Syrkin Wurtele "Foster thy young: Enhanced prediction of orphan genes in assembled genomes" NAR , v.50 , 2021 , p.e37-e37 https://doi.org/10.1093/nar/gkab1238
(Showing: 1 - 10 of 28)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Every organism forms completely new genes from within its genome as it evolves. Many are lost, but those that are useful to the organism are selected for and modified over time. These genes, called "orphan genes", provide novel functions to enable survival in new environments.  For example, some are a key to fighting a lethal pathogen, others help an organism withstand drought stress,  still other enocde toxins that enable an organism to capture its prey. However, even in well studied species like maize, mice and humans, many orphan genes have not been identified. 

In this research we have developed a new software, pyripipe, to identify orphan genes from genomes of any species.   Its code is open source, which means it can be built upon by other computational biologists. The pyripipe software is publicly-available and free.  Thus, the first major outcome of our research is computational tools that researchers can use to identify functionally-important oprhan genes in any organism.

The second major outcome of our research is that we elucidate thousands of novel orphan genes in maize, Brassica, and humans. We have done this by applying our new software to identify and characterize orphan genes from these species.  For this, we have leveraged the tens of thousands of DNA and RNA sequences deposited into public databases. Because orphans can be considered as the fodder for new proteins, these genes can be studied to reveal mechanisms of the origin of protein structural domains. Furthermore, these genes hold the promise of new traits for plant breeding and, in humans, for mitigating disease. 


Last Modified: 01/24/2023
Modified by: Eve S Wurtele

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page