Award Abstract # 1650041
EAGER: Universal Sketches for Network Monitoring

NSF Org: CCF
Division of Computing and Communication Foundations
Recipient: THE JOHNS HOPKINS UNIVERSITY
Initial Amendment Date: August 24, 2016
Latest Amendment Date: August 24, 2016
Award Number: 1650041
Award Instrument: Standard Grant
Program Manager: Tracy Kimbrel
CCF
 Division of Computing and Communication Foundations
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: September 1, 2016
End Date: August 31, 2018 (Estimated)
Total Intended Award Amount: $99,999.00
Total Awarded Amount to Date: $99,999.00
Funds Obligated to Date: FY 2016 = $99,999.00
History of Investigator:
  • Vladimir Braverman (Principal Investigator)
    vb21@rice.edu
Recipient Sponsored Research Office: Johns Hopkins University
3400 N CHARLES ST
BALTIMORE
MD  US  21218-2608
(443)997-1898
Sponsor Congressional District: 07
Primary Place of Performance: Johns Hopkins University
3400 N Charles ST
Baltimore
MD  US  21218-2608
Primary Place of Performance
Congressional District:
07
Unique Entity Identifier (UEI): FTMTDMBR29C7
Parent UEI: GS4PNKTRNKL3
NSF Program(s): Algorithmic Foundations
Primary Program Source: 01001617DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7916, 7926
Program Element Code(s): 779600
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Network management is multi-faceted and encompasses a range of tasks including traffic engineering, attack and anomaly detection, and forensic analysis. Each such management task requires accurate and timely statistics on different application-level metrics of interest, such as the flow size distribution, "heavy hitters" (frequently occurring data values), and entropy measures, as well as the detection of changes or unusual patterns. Streaming algorithms have generated many important practical advances since the creation of the field in the mid 1990s. For a given data set, these techniques typically use one pass over the data, and use a small amount of memory to compute a desired statistic. A class of randomized algorithms known as "sketches" has contributed to many practical solutions for databases, networks, and other domains which entail processing large amounts of data, such as astronomy. In preliminary work, the PI with collaborators introduced a flexible technique, UnivMon (short for Universal Monitoring), for a wide range of monitoring tasks by leveraging recent theoretical advances in streaming algorithms. The main task of this project is to significantly improve the theoretical foundations of UnivMon. The UnivMon tool should be a particularly useful artifact for exposing undergraduate students to streaming and network monitoring concepts. The PI also believes that the topic of this project presents a compelling opportunity for developing graduate students with expertise in the theory of network monitoring. In particular, the PI plans to leverage his existing graduate-level course offerings as vehicles to integrate findings from this research project.

While the body of work in data streaming and sketching has made significant contributions to network monitoring, each network metric of interest requires special purpose algorithms. An ideal monitoring framework would offer generality by delaying the binding to specific applications of interest but at the same time providing the required fidelity for estimating these metrics. Achieving generality and high fidelity simultaneously has been an elusive goal both in theory and in practice. Universal streaming develops a single universal sketch which is provably accurate for estimating a large class of functions. In essence, the generality of universal streaming enables one to delay binding the data plane to specific monitoring tasks, while still providing accuracy that is comparable to (if not better than) running custom sketches using similar compute resources. To accomplish the system improvements needed for practical deployment, the project will attempt to advance the state of the theory by solving the following open problems: (1) extending the PI's preliminary techniques to handle data expiration, thus allowing the maintenance of statistics over a window of recent data; and (2) constructing algorithms for computing universal sketches with memory usage within a constant factor of optimal, refining previous results that are within a polylogarithmic factor of optimal.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 12)
Anand, I and Liu, Z and Jin, X and Venkataraman, S and Braverman, V and Stoic, I "ASAP: Fast, Approximate Graph Pattern Mining at Scale" Proceedings of the USENIX conference , 2018 Citation Details
B?asiok, Jaros?aw and Braverman, Vladimir and Chestnut, Stephen R. and Krauthgamer, Robert and Yang, Lin F. "Streaming symmetric norms via measure concentration" Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing , 2017 10.1145/3055399.3055424 Citation Details
Blum, Avrim and Braverman, Vladimir and Kumar, Ananya and Lang, Harry and Yang, Lin F. "Approximate Convex Hull of Data Streams" The 45th International Colloquium on Automata, Languages, and Programming (ICALP 2018) , 2018 Citation Details
Braverman, V and Frahling, G and Lang, H and Sohler, C and Yang, L "Clustering High Dimensional Dynamic Data Streams" International Conference on Machine Learning, PMLR , 2018 Citation Details
Braverman, Vladimir and Chestnut, Stephen and Krauthgamer, Robert and Li, Yi and Woodruff, David and Yang, Lin "Matrix Norms in Data Streams: Faster, Multi-Pass and Row-Order" Proceedings of the 35 th International Conference on Machine Learning (ICML 18) , 2018 Citation Details
Braverman, Vladimir and Chestnut, Stephen R. and Ivkin, Nikita and Nelson, Jelani and Wang, Zhengyu and Woodruff, David P. "BPTree: An ? 2 Heavy Hitters Algorithm Using Constant Memory" Proceedings of the 36th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems , 2017 10.1145/3034786.3034798 Citation Details
Braverman, Vladimir and Grigorescu, Elena and Lang, Harry and Woodruff, David and Samson, Zhou "Nearly Optimal Distinct Elements and Heavy Hitters on Sliding Windows" Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2018). , 2018 Citation Details
Braverman, Vladimir and Lang, Harry and Levin, Keith "Accurate Low-Space Approximation of Metric k-Median for Insertion-Only Streams" Conference on Algorithms and Discrete Applied Mathematics (CALDAM 2017) , 2017 Citation Details
Braverman, Vladimir and Woodruff, David and Yang, Lin "Revisiting Frequency Moment Estimation in Random Order Streams" 45th International Colloquium on Automata, Languages, and Programming (ICALP 2018). , 2018 Citation Details
Ivkin, Nikita and Liu, Zaoxing and Yang, Lin and Suresh, Srinivas and Lemson, Gerard and Neyrinck, Mark and Szalay, Alexander and Braverman, Vladimir and Budavari, Tamas "Scalable streaming tools for analyzing N-body simulations: Finding halos and investigating excursion sets in one pass" Astronomy and computing , 2018 Citation Details
Iyer, A and Liu, Z and Jin, X and Venkataraman, S and Braverman, V and Stoica, I "Towards Fast and Scalable Graph Pattern Mining" 10th {USENIX} Workshop on Hot Topics in Cloud Computing (HotCloud 18) , 2018 Citation Details
(Showing: 1 - 10 of 12)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Network management is multi-faceted and encompasses a range of tasks including traffic engineering, attack and anomaly detection, and forensic analysis.While the body of work in data streaming and sketching has made significant contributions to networkmonitoring, each network metric of interest requires special purpose algorithms.In our preliminary work, we introduced a flexible technique, UnivMon, for a wide range of monitoringtasks by leveraging recent theoretical advances in universal streaming. Universal streaming developsa single universal sketch which is provably accurate for estimating a large class of functions.
This research improves the theoretical foundations of UnivMon. Specifically, we proposed two important imporvements.NitroSketch that is a significant improvement to UnivMon. The Nitro sketch is a general and efficient software sketching framework that enables line-rate packetprocessing for a broad spectrum of sketching algorithms. NitroSketch has provable worst-case guarantees, without needing any distributional assumptions about the traffic. We do this by systematically identifying the fundamental performanceWe also introduce a UnivMon on Interval queries that is  a first integral solution that is (i) sketch based and thus spaceefficient, (ii) supports specifying the time frame of interest aspart of its queries, and (iii) enables multiple measurement tasksinside the same data structure. Also, this award partially supported research that brings significant theoretical and practical improvements in important problems in the streaming and sketching setting such as clustering, randomized numerical linear algebra and beyond.This research resulted in publications in top computer science conferences such as STOC, ICALP and OSDI.


Last Modified: 11/03/2018
Modified by: Vladimir M Braverman

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page