
NSF Org: |
MCB Division of Molecular and Cellular Biosciences |
Recipient: |
|
Initial Amendment Date: | March 21, 2011 |
Latest Amendment Date: | May 12, 2013 |
Award Number: | 1050161 |
Award Instrument: | Continuing Grant |
Program Manager: |
Manju Hingorani
mhingora@nsf.gov (703)292-7323 MCB Division of Molecular and Cellular Biosciences BIO Directorate for Biological Sciences |
Start Date: | April 1, 2011 |
End Date: | March 31, 2016 (Estimated) |
Total Intended Award Amount: | $1,320,000.00 |
Total Awarded Amount to Date: | $1,320,000.00 |
Funds Obligated to Date: |
FY 2013 = $460,000.00 |
History of Investigator: |
|
Recipient Sponsored Research Office: |
107 S INDIANA AVE BLOOMINGTON IN US 47405-7000 (317)278-3473 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
107 S INDIANA AVE BLOOMINGTON IN US 47405-7000 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | Genetic Mechanisms |
Primary Program Source: |
01001314DB NSF RESEARCH & RELATED ACTIVIT |
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.074 |
ABSTRACT
Intellectual Merit. With the ability to completely characterize the genomes of closely related species and individuals within species, it is now possible to elucidate the mechanisms by which evolution proceeds at the molecular level, via both the promotion of adaptations within species and the establishment of new species. This project involves a comparative survey of the genome sequences of the complete set of cryptic species of the Paramecium aurelia assemblage of ciliated protozoans. Just prior to the radiation of this complex, the common ancestor experienced a complete doubling of the nuclear genome, and preliminary evidence suggests that silencing of alternative redundant gene copies in sister lineages has led to map changes that may operate as effective reproductive isolating barriers. The relatively young age of the complex, combined with its large number of constituent species and relatively simple genomic architecture, provides a powerful and unprecedented resource for understanding the roles that gene duplication plays in the generation of biodiversity. By establishing the complete history of all ancestral gene copies over a finely dissected phylogeny, the patterns of preservation vs. demise of various functional classes of duplicate genes will be evaluated. The analyses will also reveal the temporal patterns of gene loss that eventually lead to the acquisition of new equilibrium genomic states in the descendant taxa, as well as clarify the extent to which gene resurrections occur. With the inclusion of information on gene expression, several key hypotheses on the evolution of duplicate genes will be tested. Lending an exceptional level of power to the analyses is the availability of information on the rate and complete molecular spectrum of mutations for two aurelia species. This provides a formal basis for deciphering the forces of evolution operating on duplicate genes by providing a null model for the fates of genes in the absence of selection (e.g., positive selection for preservation or active promotion of gene loss by mutational degradation). As the first study of this sort in a natural assemblage of unicellular eukaryotes, this project has the potential to greatly expand our understanding the mechanisms of genome evolution, providing a complement to the much richer set of observations on multicellular species.
Broader Impacts. The data generated by this study will serve as a critical and permanent resource for the Paramecium genetics community, while also providing the first detailed data on the dynamics of duplicate genes on an evolutionarily interpretable time scale. The data will be organized into the existing ParameciumDB web-based data system, allowing users to readily query the entire species assemblage for the status and evolutionary history of the full set of paralogous genes back to the ancestor of the P. aurelia complex. In addition, the database will be integrated into a community-level effort at incorporating ciliates in classroom research. Finally, the project will support the training of a graduate student and a postdoctoral fellow, with a goal of establishing them as leaders in the re-emerging field of Paramecium evolutionary genetics.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
The primary goal of this project was to complete a comparative survey of the genome sequences of the set of morphologically cryptic species of the Paramecium aurelia assemblage, a group of ciliated protozoans with ancient roots. As the first study of this sort in a natural assemblage of unicellular eukaryotes, this project has greatly expanded our understanding of the mechanisms of genome evolution, providing a complement to the much richer availability of observations on multicellular species. Notably, the two-genome duplication scenario for the P. aurelia complex is identical in form to (and potentially more ancient than) that postulated for the origin of morphological diversification of the vertebrates. Yet, the ciliate assemblage exhibits near absolute stasis at the morphological level, despite underlying changes at the molecular level. Thus, the results of this project are of considerable relevance to ongoing speculation regarding the connection between genome duplication and morphological innovation.
Intellectual merit. By establishing the complete history of all ancestral gene copies over a finely dissected phylogeny, it has been possible to evaluate the patterns of preservation vs. demise of various functional classes of duplicate genes. The analyses also reveal the temporal patterns of gene loss that eventually lead to the acquisition of new equilibrium genomic states in the descendant taxa. This shows, in turn, how reciprocal losses of sister genes in related taxa has an effect similar to gene relocation, thereby resulting in the passive evolution of reproductive isolation among sister taxa. Our results reveal a remarkable simplicity of Paramecium genome architecture, most notably introns never exceeding 30 base pairs in length, and intergenic regions typically shorter than 100 base pairs. Not only do these very large cells have some of the most streamlined genomes in all of eukaryotes, but their chromosomal gene order has remained nearly constant for over 500 million years.
Using information on gene expression, tests of several key hypotheses on the evolution of duplicate genes have revealed that ancestral gene-expression level is the primary determinant of the fate of a duplicate gene. This has led to the development of a model for evolution of expression level after a gene duplication, related to earlier ideas on qualitative subfunctionalization. Under this model, duplicated genes initially share identical biochemical functions and are retained because of dosage constraints. However, mutations in regulatory regions and trans-acting factors gradually allow divergence of expression level between the two copies. The expression level of each individual duplicated gene can evolve neutrally so long as the joint expression of the pair remains roughly constant. However, once a high level of imbalance is reached, the copy with the lowest expression level contributes a small enough fraction to total expression that selection no longer opposes its loss. This model provides an explanation as to why the most highly expressed duplicate genes (which are furthest from the absorbing barrier of low expression) exhibit the highest longevities.
Owing to the extreme simplicity of the intergenic regions associated with Paramecium genes, it has been possible to identify candidate transcription-factor binding sites (TFBSs, both repressors and activators) by bioinformatics methods. These results show that the distances of such elements from their client genes are highly correlated with gene expression, enabling us to indirectly validate repressor and enhancer effects even prior to functional analysis.
Broader impact. A central goal o...
Please report errors in award information by writing to: awardsearch@nsf.gov.