Award Abstract # 2043905
CAREER: Evaluation of machine learning algorithms for understanding and predicting adaptation to multivariate environments with a Model Validation Program (MVP)

NSF Org: OCE
Division Of Ocean Sciences
Recipient: NORTHEASTERN UNIVERSITY
Initial Amendment Date: March 5, 2021
Latest Amendment Date: July 10, 2024
Award Number: 2043905
Award Instrument: Continuing Grant
Program Manager: Jayne Gardiner
jgardine@nsf.gov
 (703)292-4828
OCE
 Division Of Ocean Sciences
GEO
 Directorate for Geosciences
Start Date: July 1, 2021
End Date: June 30, 2026 (Estimated)
Total Intended Award Amount: $1,459,937.00
Total Awarded Amount to Date: $1,459,937.00
Funds Obligated to Date: FY 2021 = $876,998.00
FY 2022 = $147,579.00

FY 2023 = $350,924.00

FY 2024 = $84,436.00
History of Investigator:
  • Kathleen Lotterhos (Principal Investigator)
    k.lotterhos@northeastern.edu
Recipient Sponsored Research Office: Northeastern University
360 HUNTINGTON AVE
BOSTON
MA  US  02115-5005
(617)373-5600
Sponsor Congressional District: 07
Primary Place of Performance: Northeastern University
430 Nahant Road
Nahant
MA  US  01908-1638
Primary Place of Performance
Congressional District:
06
Unique Entity Identifier (UEI): HLTMVS2JZBS6
Parent UEI:
NSF Program(s): Evolutionary Processes,
BIOLOGICAL OCEANOGRAPHY,
EDUCATION/HUMAN RESOURCES,OCE
Primary Program Source: 01002324DB NSF RESEARCH & RELATED ACTIVIT
01002425DB NSF RESEARCH & RELATED ACTIVIT

01002223DB NSF RESEARCH & RELATED ACTIVIT

01002122DB NSF RESEARCH & RELATED ACTIVIT

01002122DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 9117, 1174, 1045, 006Z, 4444, 7308, 102Z, 8214, 1097
Program Element Code(s): 112700, 165000, 169000
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.050, 47.074

ABSTRACT

Environmental change can be rapid and involve multiple aspects of the environment changing at the same time, such as warming and increased disease pressure. Rapid environmental change threatens the productivity of aquaculture and crops on which humans depend. Predicting organisms' vulnerabilities to rapid and multifactor environmental change, however, is a major scientific challenge. A hurdle to addressing this challenge arises from the complex and non-intuitive ways that organisms adapt, through changes at the level of the DNA sequence, to many environmental stresses at the same time. Thus, there is a need for new approaches to understand and predict adaptation in multivariate environments. To address this need, this project integrates research and education with a Model Validation Program (MVP). The research is developing and evaluating Machine Learning Algorithms (MLAs) for understanding and predicting adaptation of organisms to multivariate environments from their DNA sequences. To evaluate MLAs, this research combines both data simulation and an empirical test in the field with the Eastern Oyster, which provide important ecosystem services and support a multi-million dollar industry. For oysters, this research is studying how temperature, disease pressure, and salinity interact with evolutionary history to determine fitness in the field. This research advances efforts toward addressing the major scientific challenge of predicting adaptation in complex environments by integrating concepts across the frontiers of marine, evolutionary, and statistical sciences in a new way. Machine learning and model validation are not traditionally taught in the marine and environmental sciences, but are becoming increasingly relevant to these fields. As part of a broader education program, this research is developing MVP Learning Modules for high school students and undergraduates, which help students build the foundational knowledge they need to critically evaluate and apply models. Modules are being disseminated to hundreds of students in the greater Boston area and are being made available online for widespread use. The MVP mentoring program is training graduate students, undergraduates, and high school students in marine evolutionary ecology, statistical genomics, and machine learning. This research addresses a pressing societal need to more informatively match genotypes to environments for restoration, farming, and assisted gene flow efforts. Results are being disseminated to stakeholders in the oyster industry.

The goal of this research is to evaluate if MLAs, which can model non-linearities, can be used to understand and predict adaptation to multivariate environments under a wide range of scenarios. In Objective 1, the Principal Investigator (PI) is creating simulated datasets with different aspects of realism, and using them to evaluate and refine the MLAs. This novel set of simulations is studying genome evolution under high gene flow in complex, multivariate environments. In Objective 2, the PI is building on their expertise with the Eastern oyster to evaluate the MLAs in a field setting. The PI is first developing a comprehensive seascape genomic dataset and using it to train MLAs to predict an individual's multivariate environment based on a single nucleotide polymorphism genotype. Then, the PI is testing if the MLA prediction can predict the fitness of different genotypes from across the species range when raised in common garden field conditions. In Objective 3, the PI is integrating research and education by using the data obtained from Objs. 1 and 2 to develop a series of original "MVP Learning Modules" with interactive web apps for persons at different levels of understanding, using the relatable example of an oyster restoration project. This research lays the foundation for future studies by producing datasets that could become classical examples for developing and benchmarking innovative modeling approaches.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Lotterhos, Katie E. "The paradox of adaptive trait clines with nonclinal patterns in the underlying genes" Proceedings of the National Academy of Sciences , v.120 , 2023 https://doi.org/10.1073/pnas.2220313120 Citation Details
Puritz, Jonathan B. and Guo, Ximing and Hare, Matthew and He, Yan and Hillier, LaDeana W. and Jin, Shubo and Liu, Ming and Lotterhos, Katie E. and Minx, Pat and Modak, Tejashree and Proestou, Dina and Rice, Edward S. and Tomlinson, Chad and Warren, Wesley "A second unveiling: Haplotig masking of the eastern oyster genome improves populationlevel inference" Molecular Ecology Resources , 2023 https://doi.org/10.1111/1755-0998.13801 Citation Details
Guo, Ximing and Puritz, Jonathan B. and Wang, Zhenwei and Proestou, Dina and Allen, Standish and Small, Jessica and Verbyla, Klara and Zhao, Honggang and Haggard, Jaime and Chriss, Noah and Zeng, Dan and Lundgren, Kathryn and Allam, Bassem and Bushek, Dav "Development and Evaluation of High-Density SNP Arrays for the Eastern Oyster Crassostrea virginica" Marine Biotechnology , v.25 , 2023 https://doi.org/10.1007/s10126-022-10191-3 Citation Details
Láruson, Áki Jarl and Fitzpatrick, Matthew C. and Keller, Stephen R. and Haller, Benjamin C. and Lotterhos, Katie E. "Seeing the forest for the trees: Assessing genetic offset predictions from gradient forest" Evolutionary Applications , v.15 , 2022 https://doi.org/10.1111/eva.13354 Citation Details
Lind, Brandon_M and Lotterhos, Katie_E "The accuracy of predicting maladaptation to new environments with genomic data" Molecular Ecology Resources , v.25 , 2024 https://doi.org/10.1111/1755-0998.14008 Citation Details
Lotterhos, Katie E. "Interpretation issues with genomic vulnerability arise from conceptual issues in local adaptation and maladaptation" Evolution Letters , 2024 https://doi.org/10.1093/evlett/qrae004 Citation Details
Lotterhos, Katie E. and Fitzpatrick, Matthew C. and Blackmon, Heath "Simulation Tests of Methods in Evolution, Ecology, and Systematics: Pitfalls, Progress, and Principles" Annual Review of Ecology, Evolution, and Systematics , v.53 , 2022 https://doi.org/10.1146/annurev-ecolsys-102320-093722 Citation Details
Lotterhos, Katie E "Principles in experimental design for evaluating genomic forecasts" Methods in Ecology and Evolution , v.15 , 2024 https://doi.org/10.1111/2041-210X.14379 Citation Details

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page