Award Abstract # 1732253
Research PGR: Structural variant landscapes in tomato genomes and their role in natural variation, domestication and crop improvement

NSF Org: IOS
Division Of Integrative Organismal Systems
Recipient: COLD SPRING HARBOR LABORATORY
Initial Amendment Date: May 31, 2017
Latest Amendment Date: July 20, 2020
Award Number: 1732253
Award Instrument: Continuing Grant
Program Manager: Diane Jofuku Okamuro
dokamuro@nsf.gov
 (703)292-4508
IOS
 Division Of Integrative Organismal Systems
BIO
 Directorate for Biological Sciences
Start Date: July 1, 2017
End Date: December 31, 2022 (Estimated)
Total Intended Award Amount: $4,608,789.00
Total Awarded Amount to Date: $4,703,387.00
Funds Obligated to Date: FY 2017 = $2,499,279.00
FY 2018 = $1,042,102.00

FY 2019 = $94,598.00

FY 2020 = $1,067,408.00
History of Investigator:
  • Zachary Lippman (Principal Investigator)
    lippman@cshl.edu
  • Esther Van der Knaap (Co-Principal Investigator)
  • Joyce Van Eck (Co-Principal Investigator)
  • Michael Schatz (Co-Principal Investigator)
Recipient Sponsored Research Office: Cold Spring Harbor Laboratory
1 BUNGTOWN RD
COLD SPG HBR
NY  US  11724-2202
(516)367-8307
Sponsor Congressional District: 03
Primary Place of Performance: Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor
NY  US  11724-2209
Primary Place of Performance
Congressional District:
03
Unique Entity Identifier (UEI): GV31TMFLPY88
Parent UEI:
NSF Program(s): Plant Genome Research Project
Primary Program Source: 01001718DB NSF RESEARCH & RELATED ACTIVIT
01001819DB NSF RESEARCH & RELATED ACTIVIT

01001920DB NSF RESEARCH & RELATED ACTIVIT

01002021DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 1329, 7577, 9109, 9178, BIOT
Program Element Code(s): 132900
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.074

ABSTRACT

Genome DNA sequences for many crops have been determined in the last two decades, providing the blueprints to discover genes that underlie key agricultural traits. However, a great challenge is identifying the differences in DNA between related varieties of the same crop, which are responsible for the subtle trait variation that plant breeders exploit to improve productivity. A major contributor to this trait variation is 'genome structural variation' where pieces of DNA are deleted, inserted, or rearranged resulting in changes in gene expression. This project will focus on how structural variation contributed to domestication and breeding of tomatoes. A related goal is to expand and develop new molecular tools to create structural variation for crop improvement. This project will improve US agriculture by providing new knowledge and tools to efficiently and predictably enhance crop productivity. A major part of the project will also include training of young scientists in fundamental principles of plant genome research that can be applied to agriculture. This knowledge will also be shared through outreach programs in inner city New York schools that do not have access to research opportunities. Project personnel will develop hands-on teaching activities that will highlight the importance of plant genomics and new genome editing technologies to improve crops and meet the agricultural needs of the 21st century.


Limited knowledge on the extent and diversity of structural variation in plant genomes is hindering the ability to link genes to important crop phenotypes. This project will unite new long-read sequencing technologies, computational biology, developmental and quantitative genetics, and genome editing to elucidate and manipulate structural variation (SV) at a scale never before achieved for a major crop. Tomato provides a powerful system due to its relatively small and high quality reference genome and availability of resequenced genomes. By applying SV-detection algorithms to existing short-read Illumina sequencing data from hundreds of accessions, more than 40 genomes will be selected, capturing the majority of predicted SV diversity, to establish new reference genomes using the latest long-read sequencing technology (PacBio and 10X Genomics). From these data, a compendium of validated SVs will be generated and integrated with ongoing genome-wide association studies. Significant gene-associated SVs, including those affecting gene activity measured by genome-wide transcript profiling, will be characterized using CRISPR/Cas9 gene editing and quantitative phenotypic analyses, focusing on reproductive traits that drive crop productivity. In parallel, CRISPR/Cas9 gene editing will be used to generate a collection of SV mutations in known yield and fruit quality genes in two related wild Solanaceae with agricultural potential, with the goal of achieving major steps towards domestication and for comparative developmental genetics studies. This project will greatly expand our knowledge of genomic diversity in tomato, and provide a road map for dissecting SVs in other crops, where such knowledge can be exploited to improve productivity.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 15)
Aganezov, Sergey and Yan, Stephanie M. and Soto, Daniela C. and Kirsche, Melanie and Zarate, Samantha and Avdeyev, Pavel and Taylor, Dylan J. and Shafin, Kishwar and Shumate, Alaina and Xiao, Chunlin and Wagner, Justin and McDaniel, Jennifer and Olson, Na "A complete reference genome improves analysis of human genetic variation" Science , v.376 , 2022 https://doi.org/10.1126/science.abl3533 Citation Details
Alonge, M and Lebeigle, L and Kirsche, M and Aganezov, S and Wang, X and Lippman, ZB and Schatz, MC and Soyk, S "Automated assembly scaffolding elevates a new tomato system for high-throughput genome editing" bioRxiv , 2021 https://doi.org/10.1101/2021.11.18.469135 Citation Details
Alonge, Michael and Lebeigle, Ludivine and Kirsche, Melanie and Jenike, Katie and Ou, Shujun and Aganezov, Sergey and Wang, Xingang and Lippman, Zachary B. and Schatz, Michael C. and Soyk, Sebastian "Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing" Genome Biology , v.23 , 2022 https://doi.org/10.1186/s13059-022-02823-7 Citation Details
Altemose, Nicolas and Logsdon, Glennis A. and Bzikadze, Andrey V. and Sidhwani, Pragya and Langley, Sasha A. and Caldas, Gina V. and Hoyt, Savannah J. and Uralsky, Lev and Ryabov, Fedor D. and Shew, Colin J. and Sauria, Michael E. and Borchers, Matthew an "Complete genomic and epigenetic maps of human centromeres" Science , v.376 , 2022 https://doi.org/10.1126/science.abl4178 Citation Details
Das, Arun and Schatz, Michael C. "Sketching and sampling approaches for fast and accurate long read classification" BMC Bioinformatics , v.23 , 2022 https://doi.org/10.1186/s12859-022-05014-0 Citation Details
Frick, Elizabeth M. and Sapkota, Manoj and Pereira, Lara and Wang, Yanbing and Hermanns, Anna and Giovannoni, James J. and van der Knaap, Esther and Tieman, Denise M. and Klee, Harry J. "A family of methyl esterases converts methyl salicylate to salicylic acid in ripening tomato fruit" Plant Physiology , 2022 https://doi.org/10.1093/plphys/kiac509 Citation Details
He, Jia and Alonge, Michael and Ramakrishnan, Srividya and Benoit, Matthias and Soyk, Sebastian and Reem, Nathan T and Hendelman, Anat and Van Eck, Joyce and Schatz, Michael C and Lippman, Zachary B "Establishing Physalis as a Solanaceae model system enables genetic reevaluation of the inflated calyx syndrome" The Plant Cell , v.35 , 2022 https://doi.org/10.1093/plcell/koac305 Citation Details
Hoyt, Savannah J. and Storer, Jessica M. and Hartley, Gabrielle A. and Grady, Patrick G. and Gershman, Ariel and de Lima, Leonardo G. and Limouse, Charles and Halabian, Reza and Wojenski, Luke and Rodriguez, Matias and Altemose, Nicolas and Rhie, Arang an "From telomere to telomere: The transcriptional and epigenetic state of human repeat elements" Science , v.376 , 2022 https://doi.org/10.1126/science.abk3112 Citation Details
Kirsche, Melanie and Prabhu, Gautam and Sherman, Rachel and Ni, Bohan and Battle, Alexis and Aganezov, Sergey and Schatz, Michael C. "Jasmine and Iris: population-scale structural variant comparison and analysis" Nature Methods , 2023 https://doi.org/10.1038/s41592-022-01753-3 Citation Details
Kwon, Choon-Tak and Tang, Lingli and Wang, Xingang and Gentile, Iacopo and Hendelman, Anat and Robitaille, Gina and Van Eck, Joyce and Xu, Cao and Lippman, Zachary B. "Dynamic evolution of small signalling peptide compensation in plant stem cell control" Nature Plants , v.8 , 2022 https://doi.org/10.1038/s41477-022-01118-w Citation Details
Manoj Sapkota1, Lara Pereira1 "Structural variation underlies functional diversity at methyl salicylate loci in tomato" PLOS genetics , 2023 Citation Details
(Showing: 1 - 10 of 15)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Crop improvement relies on the ongoing selection of natural DNA sequence variations that influence gene functions and the traits they control, such as crop yield. Over the past two decades, scientists have identified key genes and mutations crucial for enhancing economically significant traits in crop plants, including those involved in the earliest stages of crop domestication from their wild counterparts. Recent advances in rapid, high-throughput DNA sequencing technologies have provided genome blueprints for many crop plants. However, a significant portion of the sequence variation that is responsible for crop improvement has remained undiscovered. In particular, more complex DNA sequence variations, such as insertions or deletions of DNA segments within or near genes, are often missed by first generation high-throughput sequencing technologies that only capture short DNA snippets across genomes.

This project addressed this issue by using new technologies that sequence genomes in large, overlapping segments, allowing for the identification and analysis of difficult-to-detect structural DNA mutations. Focusing on tomatoes and their closest wild ancestors as a model, we sequenced 100 varieties and wild species to identify and characterize these structural variants, and investigate how they: i) shape plant genomes and ii) contribute to crop domestication and improvement. By creating and utilizing new computational tools for long-read DNA sequencing, we discovered over 200,000 new mutations in tomatoes, many of which impacted gene function and trait variation. As examples, we found previously undetected mutations responsible for differences in tomato fruit flavor, size and yield. In some instances, we showed that mutations in multiple genes were combined during breeding programs, resulting in complex genetic relationships that ultimately altered productivity traits in quantitative ways.

In a second goal of the project, we explored whether discoveries from major crops, like tomato, could be applied to improve "indigenous crops," which have the potential to become industrialized crops, but have received less attention in both fundamental and applied research. We established blueprints for groundcherries, a close relative of tomatoes that produce edible sweet berries, and used CRISPR genome editing to modify the functions of multiple genes, resulting in increased fruit size and number.

The project trained numerous scientists and also promoted public education in plant biology. Our graduate students and postdocs secured positions in academia and industry, while younger trainees in college pursued further education in biology and education. We also created an outreach program with a Queens, NY elementary school focused on plant engineering curriculum. Finally, we developed a community-science outreach project based on our groundcherry research involving hundreds of participants in dozens of states, which enhanced public understanding and acceptance of the role of leveraging plant genomics and genome editing to improve crops now and in the future.

 


Last Modified: 04/17/2023
Modified by: Zachary B Lippman

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page