
NSF Org: |
IIS Division of Information & Intelligent Systems |
Recipient: |
|
Initial Amendment Date: | June 11, 2020 |
Latest Amendment Date: | April 24, 2024 |
Award Number: | 1947257 |
Award Instrument: | Standard Grant |
Program Manager: |
Sylvia Spengler
sspengle@nsf.gov (703)292-7347 IIS Division of Information & Intelligent Systems CSE Directorate for Computer and Information Science and Engineering |
Start Date: | September 1, 2020 |
End Date: | August 31, 2024 (Estimated) |
Total Intended Award Amount: | $174,232.00 |
Total Awarded Amount to Date: | $235,432.00 |
Funds Obligated to Date: |
FY 2021 = $12,600.00 FY 2022 = $12,600.00 FY 2023 = $24,000.00 FY 2024 = $12,000.00 |
History of Investigator: |
|
Recipient Sponsored Research Office: |
1025 N BROADWAY MILWAUKEE WI US 53202-3109 (414)277-7300 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
1025 N. Broadway Milwaukee WI US 53202-3109 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | Info Integration & Informatics |
Primary Program Source: |
01002324DB NSF RESEARCH & RELATED ACTIVIT 01002425DB NSF RESEARCH & RELATED ACTIVIT 01002021DB NSF RESEARCH & RELATED ACTIVIT 01002122DB NSF RESEARCH & RELATED ACTIVIT |
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
The last decade has seen the cost of DNA sequencing plummet. Consequently, the potential to sequence the genome of every living organism is within our grasp. Genome assemblies often require significant "polishing," which is often a manual and labor-intensive process. New methods are needed to directly analyze fragmented or unassembled genomic data. Analysis of these genomes include the identification of physical rearrangements such as inversions. Large inversions have significant impacts on the biology of organisms and their evolution. Existing computational methods for identifying inversions have been primarily tested on and developed for well-studied, "reference" genomes. This project seeks to develop new inversion detection and association testing methods suitable for the large and growing number of fragmented and/or unassembled genomes that are becoming available. Undergraduate research assistants will be funded as active collaborators on the project.
So-called "k-mer" methods have become popular in the last decade for the analysis of unassembled genomics or metagenomics data. This project seeks to utilize k-mers, unsupervised learning, and association testing to identify inversions in fragmented or poorly assembled population genomics data. Since millions of association tests will be run per data set, the methods will be accelerated using GPUs. The resulting method and software will be developed in conjunction with undergraduate research assistants and released under an open-source license.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
Last Modified: 02/11/2025
Modified by: Ronald James Nowling
Please report errors in award information by writing to: awardsearch@nsf.gov.