
NSF Org: |
DBI Division of Biological Infrastructure |
Recipient: |
|
Initial Amendment Date: | August 5, 2020 |
Latest Amendment Date: | September 20, 2023 |
Award Number: | 2021909 |
Award Instrument: | Standard Grant |
Program Manager: |
Matthew Herron
mherron@nsf.gov (703)292-5361 DBI Division of Biological Infrastructure BIO Directorate for Biological Sciences |
Start Date: | January 1, 2021 |
End Date: | July 31, 2024 (Estimated) |
Total Intended Award Amount: | $166,189.00 |
Total Awarded Amount to Date: | $166,189.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
MAIN CAMPUS WASHINGTON DC US 20057 (202)625-0100 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
37th and O Streets NW Washington DC US 20007-2145 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | Cross-BIO Activities |
Primary Program Source: |
|
Program Reference Code(s): | |
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.074 |
ABSTRACT
This Design activity will result in a proposal to create a Biology Integration Institute that will synthesize recent advances in wildlife virology and pursue new insights about the ecology and evolution of the global virome. The pandemic emergence of SARS-CoV-2 is only the latest development in an accelerating trend of dangerous viruses emerging from wildlife. Global travel, urbanization, and increasing human-wildlife contact have all made it easier for these viruses to emerge. In the future, climate change and land use change will reassemble the global virome even further, forcing mammals to cross continents, meet in new ecosystems, and exchange viruses thousands of times more, potentially unleashing even more threats to global health. At least 10,000 of these mammal viruses might have the potential to infect humans, but most of the global virome is still undescribed: only about 1% of mammal viruses have been discovered, and a much smaller fraction in other vertebrates. With so little data, it is difficult to predict which viruses will pose a future threat, or where, when, and how they could emerge. Predicting the next pandemic threat will require new data spanning biological scales, from single genes up to deep evolutionary time, and new statistical methods from the cutting edge of computer science and mathematics. In addition, the project will host summer residencies for trainees and develope new coursework that combines biology with hands-on computer science labs.
The project assembles a group of virologists, computer scientists, statisticians, and ecologists to explore cutting edge scientific questions about methodology, inference, and impact. The project has three aims: (1) synthesizing existing data about host-virus associations for all vertebrate clades; (2) developing novel approaches to predict host-virus interaction networks, using novel data streams like viral strain diversity characterized from genomes, or receptor data from immunological studies; and (3) developing frameworks for actionable science that will put viral ecology to use for global health science and security. These aims will be accomplished through collaborative workshops. In doing so it will establish a foundation for a full Implementation proposal to develop an Emerging Virus Institute.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
This project established the Viral Emergence Research Initiative (Verena; viralemergence.org), which leverages open data, artificial intelligence, and viral ecology to make better predictive models for pandemic risk assessment and prevention. The Design project supported the development of a successful Implementation track proposal, and Verena was awarded an NSF Biology Integration Institute grant in 2022 (DBI 2213854; PI: Carlson; 2022-2027).
We developed an open data ecosystem by synthesizing and standardizing existing data on host-virus interactions in nature and in the laboratory. Our flagship project, The Global Virome in One Network (VIRION), integrates several existing data sources into the most comprehensive record of the vertebrate virome. We also developed minimum reporting standards for vector competence experiments and wildlife disease surveillance, in order to improve data findability, interoperability, and reusability; used systematic reviews to benchmark research effort related to bat coronavirus surveillance, arbovirus vector competence experiments, and species distribution modeling of tick and mosquito disease vectors; developed two software packages that provide long-term access to a unique and otherwise unfindable database of insect pathogens, and support data harmonization using NCBI taxonomic resources; and launched the Pathogen Harmonized Observatory (PHAROS: pharos.viralemergence.org), a public online platform to support real-time sharing of wildlife disease surveillance data.
We used these data to develop new statistical and machine learning models of host-pathogen interactions, with a focus on zoonotic and vector-borne viruses. We found that some groups of mammals appear to harbor more viruses that pose a risk to humans, but often, this reflects sampling bias: for example, animals that live in cities are better studied, while endangered species are often harder to sample. These biases have less impact on risk assessment at the level of individual viruses, and we found that our models could reliably identify animal viruses that can infect humans, especially when they incorporated data on both viral genomes and animal hosts. We also simulated how climate and land use change are restructuring the mammal-virus network, and found that new opportunities for cross-species transmission often coincide with human-settled areas. Because they can travel long distances in a single lifetime, we showed that bats have a unique ability to bring viruses into new ecosystems, and – where they come into contact with “stepping stone” hosts like primates and livestock, especially in places like southeast Asia – they may create new pathways for zoonotic emergence.
Motivated by the Covid-19 pandemic, we started developing models of the ecology and emergence of bat coronaviruses. We estimated that there are hundreds of species of bats with undiscovered betacoronaviruses (the group that includes SARS-CoV, SARS-CoV-2, and MERS-CoV); these species are found worldwide, but particularly concentrated in sub-Saharan Africa and Indonesia. We also used theoretical models to show that global hotspots of bat coronavirus diversity and emergence in east and southeast Asia and the Middle East can be explained by bat evolutionary history. Both sets of models predict that undiscovered viruses could be circulating in bats across the Indian subcontinent, where limited sampling has been done – suggesting a potential global blindspot in pandemic risk assessment. In a meta-analysis of studies from the last 20 years, we found that across nearly 90,000 bat samples, the strongest predictors of coronavirus detection have been repeated sampling of the same field site and intestinal tissue sampling – a clue about the tissue tropism of these viruses that mirrors findings about persistent infections with SARS-CoV-2 in humans.
In the first two years of the project, the Covid-19 pandemic prevented in-person project meetings or conference travel. Instead, we held a virtual conference on the use of artificial intelligence for pathogen risk assessment, with attendees representing several major global outbreak surveillance projects and non-governmental organizations in global health. We also repurposed our unused travel budget to support student research assistants and an expanded investigator team. Our project trained six students in ecological synthesis research (3 undergraduate, 2 Ph.D. students, 1 medical student), and supported the development of three new courses (2 graduate-level, 1 undergraduate-level) at Georgetown University and Louisiana State University. Trainees and investigators presented research virtually at several ecology and infectious disease related conferences. We also organized a “CodeFest” outreach event at Louisiana State University with 15 undergraduate attendees that used our data ecosystem to spark excitement about coding in R and collaborative problem solving.
We communicated our findings to policymakers and the public through social media, press, and Congressional testimony. Our work was covered in over 50 news articles in the first two years, including profiles of the project in Nature, Scientific American, and The New York Times. According to CarbonBrief, our study on climate change and cross-species viral transmission broke the all-time record for press coverage and social media attention for any study on climate change (based on Altmetric score); the study was also cited in the 2023 Economic Report of the President.
Last Modified: 12/17/2024
Modified by: Colin J Carlson
Please report errors in award information by writing to: awardsearch@nsf.gov.