Award Abstract # 1353346
EAGER: Immunization in Influence and Virus Propagation on Large Networks

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: VIRGINIA POLYTECHNIC INSTITUTE & STATE UNIVERSITY
Initial Amendment Date: September 5, 2013
Latest Amendment Date: September 5, 2013
Award Number: 1353346
Award Instrument: Standard Grant
Program Manager: Jun Huan
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: September 15, 2013
End Date: August 31, 2015 (Estimated)
Total Intended Award Amount: $88,821.00
Total Awarded Amount to Date: $88,821.00
Funds Obligated to Date: FY 2013 = $88,821.00
History of Investigator:
  • B Aditya Prakash (Principal Investigator)
    badityap@cc.gatech.edu
Recipient Sponsored Research Office: Virginia Polytechnic Institute and State University
300 TURNER ST NW
BLACKSBURG
VA  US  24060-3359
(540)231-5281
Sponsor Congressional District: 09
Primary Place of Performance: Virginia Polytechnic Institute and State University
VA  US  24061-0001
Primary Place of Performance
Congressional District:
09
Unique Entity Identifier (UEI): QDE5UHE5XD16
Parent UEI: X6KEFGLHSJX7
NSF Program(s): Info Integration & Informatics
Primary Program Source: 01001314DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7364, 7916
Program Element Code(s): 736400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Given a graph, like a social/computer network or the blogosphere, in which an infection (or meme or virus) has been spreading for some time, how does one select the k best nodes for immunization/quarantining immediately? This team was the first to show that the propagation (specifically, the so-called "epidemic threshold") depends on a single number, the first eigenvalue of the adjacency matrix of the network, for any graph and almost any propagation model in the literature. This team also gave linear-time provably near-optimal algorithms for static pre-emptive node/edge removal, by minimizing the eigenvalue on arbitrary graphs. They were also the first to give a a linear-time algorithm to automatically detect the number and identity of possible culprits under perfect information, carefully using the Minimum Description Length principle, again on arbitrary graphs.

The major thrust of this proposal is: Given a graph, a virus model (SIR, SIS etc.), a set of already infected nodes, and a fixed
budget of k nodes/edges to immunize or quarantine, can one quickly find an optimal or near-optimal solution to best contain the virus?

Technical Merit: This is the first to study the short-term immunization problem on arbitrary graphs. The problem has received limited attention in past literature: the few current results (except the PI's past work, see related work) all are on specific graphs like random graphs, and not arbitrary graphs. The focus of this work is on scalable techniques (linear or sub-quadratic on nodes/edges) which can be applied to large graphs.

Impact: The work has numerous immediate applications in public health and epidemiology, e.g., designing dynamic "what to do next" policies etc. Leveraging state-of-the-art simulators from the Virginia Bio-Informatics Institute, this work helps in realistic simulations, as well as in making more informed choices and policy decisions for future. The work also has high broader impact, as propagation-style processes on networks appear in many other settings like viral marketing, cyber security, social media like Twitter and blogs etc.

Education: The PI will incorporate research findings in graduate level classes, give tutorials at conferences, and aim to engage undergraduate students from underrepresented groups into this exciting area of research through programs like NSF REU and MAOP/VTURCS (Minority Academic Opportunities Program and VT Undergraduate Research in CS) at VT.

For further information, please see the project web page:
URL: http://www.cs.vt.edu/~badityap/NSF-PROJECTS/EAGER-13/

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Sudip Saha, Abhijin Adiga, B. Aditya Prakash and Anil Vullikanti "Approximation Algorithms for Reducing the Spectral Radius to control Epidemic Spread" SIAM Data Mining Conference (SDM) , 2015
Yao Zhang and B. Aditya Prakash "Data-Aware Vaccine Allocation over Large Networks" ACM Transactions on Knowledge Discovery and Data Mining , 2015
Yao Zhang and B. Aditya Prakash "Scalable Vaccine Distribution in Large Graphs given Uncertain Data" ACM Conference on Information and Knowledge Management (CIKM) , 2014

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

This proposal aimed to answer the following question: Given an infected snapshot of the network, what are the best nodes to immunize (vaccinate) in the network? Viruses or contagions like the flu spread on population contact networks. Similarly, online contagions like memes spread on social networks. There exist many stochastic models inspired by epidemiology which model these propagation processes. In this proposal, given a graph, like a social/computer network or the blogosphere, in which an infection (or meme or virus) has been spreading for some time, and a virus propagation model (like SIR, SIS etc.), we want to select the k best nodes for immunization/quarantining immediately. We call this problem 'short-term immunization'. We proposed to develop new scalable algorithms for short­ term immunization on arbitrary graphs, under full information. We also proposed to test our techniques on basic as well more realistic simulations and datasets. The broader goal of this line of research is to investigate more such 'data-ware' approaches to immunization given both perfect or noisy data.

We were the first to study the short-term immunization problem on arbitrary graphs, as opposed to the vast majority of analytical work which focusses on 'pre-emptive' strategies. Further we were also the first to examine multiple natural uncertainty models of infection information and design robust immunization policies. We proposed DAVA, a novel scalable (sub-quadratic in size of the underlying network) algorithm to distribute vaccines over large-scale networks under a perfect data-aware environment, which outperforms well-known competitors, e.g, 'Netshield', 'Pagerank' etc., by up to 10 times in magnitude. We further provided a variant which does not reconstruct intermediate data structures from scratch upon each allocation. We also proposed two novel algorithms, SAMPLE-CAS and EXPECT-MAX, to allocate vaccines in large networks considering multiple natural uncertain environments, which outperform several state-of-the-art algorithms, getting substantial gains in both number of nodes saved and running time. We also proposed rigorous approximation algorithms for the pre-emptive immunization problem. We leveraged multiple techniques like dominator trees (from software flow theory) which were not commonly used in this area. We also tested all our algorithms on realistic city-wide epidemiological datasets. In short, this grant has resulted in four full research papers at leading venues (both conferences and journals) including SDM 2014, CIKM 2014, SDM 2015 and ACM TKDD 2015. 

This grant supported PhD. students, including a female graduate student. Material from this research appeared in graduate classes, invited talks at national labs, and companies, and in lectures at NSF REU summer schools for underrepresented undergraduate groups at VT by the PI. Further this research has enabled numerous follow-on work from natural realistic generalizations, to newer techniques for old problems, to the submission of larger proposals to study related problems using real diagnostic data, in collaboration with epidemiologists and computational biologists.

The immediate impact of the work is high due to applications in public health and epidemiology such as designing dynamic ‘what to do next’ policies. As we leverage state- of-the-art simulators, this also helps in making more informed choices and policy decisions for future. Further the broader impact is also high as propagation-style processes on networks appear in many other settings like viral marketing, cyber security (malware), social media like Twitter and blogs etc.

More information can be found at: http://people.cs.vt.edu/~badityap/NSF-PROJECTS/EAGER-13/


Last Modified: 12/07/2015
Modified by: B. Aditya Prakash

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page