Award Abstract # 2016160
EAGER: Development of a tool to rapidly and cost-effectively sequence the exome of any organism

NSF Org: DEB
Division Of Environmental Biology
Recipient: UNIVERSITY OF RHODE ISLAND
Initial Amendment Date: March 9, 2020
Latest Amendment Date: March 7, 2025
Award Number: 2016160
Award Instrument: Standard Grant
Program Manager: Leslie J. Rissler
lrissler@nsf.gov
 (703)292-4628
DEB
 Division Of Environmental Biology
BIO
 Directorate for Biological Sciences
Start Date: May 1, 2020
End Date: October 31, 2025 (Estimated)
Total Intended Award Amount: $298,978.00
Total Awarded Amount to Date: $380,130.00
Funds Obligated to Date: FY 2020 = $298,978.00
FY 2023 = $81,152.00
History of Investigator:
  • Jonathan Puritz (Principal Investigator)
    jpuritz@uri.edu
Recipient Sponsored Research Office: University of Rhode Island
75 LOWER COLLEGE RD RM 103
KINGSTON
RI  US  02881-1974
(401)874-2635
Sponsor Congressional District: 02
Primary Place of Performance: University of Rhode Island
RI  US  02881-2015
Primary Place of Performance
Congressional District:
02
Unique Entity Identifier (UEI): CJDNG9D14MW7
Parent UEI: NSA8T7PLC9K3
NSF Program(s): Evolutionary Processes
Primary Program Source: 01002324DB NSF RESEARCH & RELATED ACTIVIT
01002021DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 097Z, 102Z, 7916, 9150
Program Element Code(s): 112700
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.074

ABSTRACT

Understanding the interaction between genotype, phenotype, and the environment is one of the greatest challenges in biology. Researchers face a two-fold challenge in experimental design: 1) sampling enough individuals to accurately characterize populations and 2) sequencing the most informative part of the genome, the base pairs that cause phenotypic change. Even with major technological advances in DNA sequencing, it is still too expensive, for most organisms, to sequence the entire genome of multiple individuals. Using probes to target specific locations in the genome has the advantage of focusing sequencing on the functional areas of the genome. However, current methods require preexisting genomic resources for probe design and substantial financial resources for probe synthesis, limiting use to already well-studied organisms and long-term projects. This risky project will develop a method for capturing specific areas of the genome for sequencing with no need for existing genomic resources, removing the time and cost of probe development. This new method will enable the rapid and cost-effective sequencing of the portions of the genome involved in adaptation and will provide unprecedented capacity to detect selection in captive and wild populations. If successful, this new method will enable the assessment of rapid adaptation to short-term ecological disasters and long-term climate change, directly assisting with successful mitigation, conservation, and restoration efforts.

The over-arching goal of the proposed research is to develop Expressed Exome Capture Sequencing (EecSeq) into a cost and time efficient method of exome capture for any organism. To do this, the project takes a three phased approach, (1) focusing on laboratory protocol optimization and improvement, (2) validation with whole genome sequencing (WGS), traditional exome capture, and RNAseq, and (3) the development of an open source, reproducible bioinformatics pipeline, including de novo assembly. Phase one will optimize three key elements in the EecSeq protocol: probe and insert length, probe and capture pool diversity, and the hybridization process. Optimizing all three parameters will maximize the number of sequenced exomic basepairs and on-target reads, increasing the number of individuals that can be sequenced simultaneously while greatly reducing costs. The second phase of the project will use two independent approaches to validate EecSeq genotypes, including a comparison of EecSeq to traditional exome capture, a set of reference individuals with WGS data. The final phase will leverage the chromosome-level assembly of the eastern oyster genome along with results from both previous phases to develop two complementing de novo assembly methods for EecSeq: one utilizing captured genomic reads and a hybrid method that will utilize sequences from the cDNA probes (when sequenced) and captured genomic reads. All experiments, data analysis, and presentation will take place in a completely open and reproducible science pipeline, which should lead to an efficient step-by-step laboratory protocol and a de novo bioinformatic pipeline for EecSeq that incorporates locus assembly and annotation for any organism.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page