Award Abstract # 1162112
NeTS: Medium: Collaborative Research: Optimizing Network Support for Cloud Services: From Short-Term Measurements to Long-Term Planning

NSF Org: CNS
Division Of Computer and Network Systems
Recipient: THE TRUSTEES OF PRINCETON UNIVERSITY
Initial Amendment Date: April 18, 2012
Latest Amendment Date: September 5, 2014
Award Number: 1162112
Award Instrument: Continuing Grant
Program Manager: Darleen Fisher
CNS
 Division Of Computer and Network Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: May 1, 2012
End Date: April 30, 2017 (Estimated)
Total Intended Award Amount: $600,000.00
Total Awarded Amount to Date: $600,000.00
Funds Obligated to Date: FY 2012 = $300,000.00
FY 2013 = $150,000.00

FY 2014 = $150,000.00
History of Investigator:
  • Jennifer Rexford (Principal Investigator)
Recipient Sponsored Research Office: Princeton University
1 NASSAU HALL
PRINCETON
NJ  US  08544-2001
(609)258-3090
Sponsor Congressional District: 12
Primary Place of Performance: Princeton University
4 New South
Princeton
NJ  US  08544-2020
Primary Place of Performance
Congressional District:
12
Unique Entity Identifier (UEI): NJ1YPQXQG7U5
Parent UEI:
NSF Program(s): Special Projects - CNS,
Networking Technology and Syst
Primary Program Source: 01001213DB NSF RESEARCH & RELATED ACTIVIT
01001314DB NSF RESEARCH & RELATED ACTIVIT

01001415DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7363, 7924, 9102
Program Element Code(s): 171400, 736300
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Online Service Providers (OSPs) host a wide range of application services, including email, Web search, video streaming, and multiplayer games, on servers in data centers all over the world. Each service has specific performance requirements. For example, Web search and multiplayer games need low latency, whereas video streaming and bulk file transfers need high throughput. Clients access these services from a wide variety of geographic locations over access networks with wildly different performance. Offering good performance to these diverse clients at a reasonable cost is the life blood of any OSP.

OSPs affect client performance by controlling content routing (selecting which data center should serve a client request) and network routing (selecting interdomain paths to clients, or paths within the OSP's own backbone), and by longer-term planning of future data centers and relationships with upstream ISPs. Unfortunately, OSPs have relatively poor visibility into end-to-end performance and do not adapt both content and network routing to maximize performance; in addition, OSP operators lack good models for deciding where to place the next server or data center, or which ISPs to select as neighbors.

To address the wide-area networking needs of online services, this project is designing, implementing, deploying, and evaluating practical techniques that allow OSPs to perform content and network routing (and make longer-term placement decisions), based on timely and accurate information about end-to-end performance and transit costs. The project is developing techniques to help OSP operators measure, control, and plan the wide-area connectivity between distributed services and their clients, and between the servers themselves. The project tasks include: (1) designing performance-measurement techniques and conduct measurement-driven studies of OSP traffic management; (2) designing, modeling, and prototyping protocols for joint optimization of content and network routing, and traffic management within an OSP backbone; and (3) driving long-term planning of server placement and ISP peer selection based on models of transit costs.

To evaluate our algorithms together, and "in the wild", the project will use experimental platforms for network monitoring (BISmark, M-Lab, and, where available, measurement servers in ISP backbone networks), content and network routing (DONAR and Transit Portal), cloud computing (VICCI), and programmable networking (OpenFlow).

Broader Impact:
The PIs are working with industry to evaluate and deploy the solutions on operational networks. They will also continue their close collaboration on graduate networking curriculum development to include the research topics and experimental platforms in this project. As part of the project outreach, the PIs are organizing "summer camps" (drawing on their earlier experiences with summer camps for the VINI and BISmark projects) to bring under-represented students to their institutions for summer internships. The PIs will also work with under-represented regions and institutions to deploy their infrastructure and engage faculty and students in research projects using the platforms.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Amitabha Ghosh, Sangtae Ha, Edward Crabbe, and Jennifer Rexford "Scalable mutli-class traffic management in data-center backbone networks" IEEE Journal on Selected Areas in Communications , v.31 , 2013 , p.2673 10.1109/JSAC.2013.131208
A. Phokeer, D. Johnson, M. Densmore, N. Feamster "A First Look at Mobile Internet Use in Township Communities in South Africa" ACM Symposium on Computing for Development , 2016
Mojgan Ghasemi, Partha Kanuparthy, Ahmed Mansy, Theophilus Benson, and Jennifer Rexford "Performance characterization of a commercial video streaming service" Internet Measurement Conference , 2016
Mojgan Ghasemi, Theophilus Benson, and Jennifer Rexford "Dapper: Data plane performance diagnosis of TCP" ACM Symposium on SDN Research , 2017
M. Shahbaz, N. Feamster "The Case for an Intermediate Representation for Programmable Data Planes" ACM SIGCOMM Symposium on SDN Research , 2016
Naga Katta, Aditi Ghag, Mukesh Hira, Isaac Keslassy, and Jennifer Rexford "CLOVE: How I learned to stop worrying about the core and love the edge" ACM SIGCOMM HotNets Workshop , 2016
Naga Katta, Mukesh Hira, Changhoon Kim, Anirudh Sivaraman, and Jennifer Rexford "HULA: Scalable load balancing using programmable data-planes" Symposium on SDN Research , 2016
S. Grover, R. Ensafi, N. Feamster "A Case Study of Traffic Demand Response to Broadband Service-Plan Upgrades" Passive and Active Measurement Conference , 2016
Xin Jin, Nathan Farrington, and Jennifer Rexford "Your data center switch is trying too hard" Symposium on SDN Research , 2016

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The vast majority of popular Internet applications, from electronic mail to Web search, and from video streaming to multi-player games, run on servers managed by Online Service Providers (OSPs) like Google, Microsoft, or Akamai.  In this project, we: (1) design and evaluate techniques for OSPs to offer good performance to users at reasonable cost, by optimizing how they create, store, and deliver content; (2) study how both access link speeds and the utilization/congestion of the interconnects between ISPs and OSPs ultimately affect application performance. Our research ranges from measuring and characterizing existing services to designing and evaluating new measurement and optimization techniques, often by collaborating with operational OSPs or the Internet service providers that help them deliver traffic to users.  

 The main outcomes of the project are:

 Measurement-based diagnosis of OSP performance: Our measurement studies have explored the current state of utilization at Internet interconnection points and showed that, contrary to conventional public wisdom, these interconnection points typically have ample spare capacity to support applications such as streaming video. We have uncovered the root cause of performance problems for video streaming (jointly with Yahoo) causes of congestion at Internet eXchange Points. We have also studied how wireless network performance in the home can also prove to be a limiting factor for fixed and mobile broadband Internet access; these results have formed the core of reports published by the Federal Communications Commission and others. These studies have also allowed researchers and practitioners to identify and fix underlying performance problems, and have informed the broader policy debate on the challenges of how the Internet supports the delivery of high-bandwidth video content.

 Optimizing the performance of OSPs: We have designed and evaluated new techniques for OSPs to optimize network paths and sending rates within private backbone networks (jointly with Google) and data-center networks (jointly with Barefoot Networks and VMware), the network topology and routing in wide-area networks (jointly with Sodero Networks and AT&T), allocation of disk space in edge caching servers (jointly with Akamai), and server selection and routing in delivering data across the Internet to end users. Our PECAN work showed that traffic engineering offers minimal benefit beyond optimal cache placement for improving the performance of Web applications. Our work studying the relationship of OSP application performance and access network throughput showed that certain applications, such as Web page loads, suffer diminishing returns as access link performance increases; the optimizations that we implemented in home routers that prefetch the DNS records of commonly requested web pages and maintain active TCP connections to popular web pages can reduce page load time for popular sites in the home while incurring only minimal data usage overhead.

 Network mechanisms to improve performance: We have designed new mechanisms for network devices, such as home routers or data-center switches, to help improve performance. For example, we created new measurement techniques that can collect performance data for individual users efficiently on high-speed switches, a new programmable software switch to enable innovation in data-center networks, and a flexible network assembly language for programming future high-speed switches.

Together, these new techniques give OSPs a much better way to measure, optimize, and design their services -- and the underlying network -- to offer good performance at reasonable cost.  In addition, the project led to the training of numerous undergraduate and graduate students in networking research, as well as the creation of open-source software for the broader community.


 

 


Last Modified: 06/16/2017
Modified by: Jennifer Rexford

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page