NSF Award Search: Award # 2008971

Award Abstract # 2008971

NSF-BSF: CNS Core: Small: Machine Learning for Real-Time Network Rate Control

NSF Org:	CNS Division Of Computer and Network Systems
Recipient:	UNIVERSITY OF ILLINOIS
Initial Amendment Date:	August 10, 2020
Latest Amendment Date:	August 10, 2020
Award Number:	2008971
Award Instrument:	Standard Grant
Program Manager:	Deepankar Medhi dmedhi@nsf.gov (703)292-2935 CNS Division Of Computer and Network Systems CSE Directorate for Computer and Information Science and Engineering
Start Date:	October 1, 2020
End Date:	September 30, 2023 (Estimated)
Total Intended Award Amount:	$500,000.00
Total Awarded Amount to Date:	$500,000.00
Funds Obligated to Date:	FY 2020 = $500,000.00
History of Investigator:	Philip Godfrey (Principal Investigator) pbg@illinois.edu
Recipient Sponsored Research Office:	University of Illinois at Urbana-Champaign 506 S WRIGHT ST URBANA IL US 61801-3620 (217)333-2187
Sponsor Congressional District:	13
Primary Place of Performance:	University of Illinois at Urbana-Champaign 506 S Wright Street Urbana IL US 61801-3620
Primary Place of Performance Congressional District:	13
Unique Entity Identifier (UEI):	Y8CWNJRCNN91
Parent UEI:	V2PHZ2CSCH63
NSF Program(s):	Networking Technology and Syst
Primary Program Source:	01002021DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s):	7923
Program Element Code(s):	736300
Award Agency Code:	4900
Fund Agency Code:	4900
Assistance Listing Number(s):	47.070

ABSTRACT

Any time an application uses the Internet, behind the scenes, rate control algorithms decide how quickly to transmit data. These algorithms have a critical task; sending too slowly causes delay or reduced video quality, and sending too quickly causes congestion that impacts both the sender and other users. Rate control is a persistent challenge in networking, as it has to deal with a wide range of dynamic environments while making millisecond-level decisions with limited information. This project is developing new approaches to rate control based on an area of machine learning known as reinforcement learning, leading to potential improvements in performance and functionality.

At a high level, the project seeks to develop a fundamental understanding of the use of reinforcement learning for rate control, and apply that understanding to the areas of adaptive bitrate (ABR) video as commonly used in modern web-based video, and to transport layer congestion control, such as the Transmission Control Protocol (TCP). The work will begin with algorithmic foundations by exploring what level of complexity of learning algorithm (ranging from bandit algorithms to deep neural networks) is necessary to achieve high performance. Next, the project will broaden the semantics of inputs and outputs of rate control, including a scavenger rate control protocol and an improved multipath TCP. Finally, the project will use novel automated methods to improve the robustness of rate control protocols in unexpected environments.

The results of this project can offer significant performance improvement for deployed protocols, which is of increasing need as modern and emerging applications have ever more demanding network requirements. For example, low latency communication and high quality real-time video are valuable for interactive conferencing, augmented and virtual reality, Internet of Things, edge computing, and more. The project also plans to provide research opportunities for underrepresented groups.

The results of this project, including papers and open-source code, will be available at http://pccproject.net and at the code repository, https://github.com/PCCproject

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Gilad, Tomer and Rozen-Schiff, Neta and Godfrey, P. Brighten and Raiciu, Costin and Schapira, Michael "MPCC: online learning multipath transport" ACM CoNEXT , 2020 https://doi.org/10.1145/3386367.3433030 Citation Details

Meng, Tong and Cai, Christopher and Godfrey, Brighten and Schapira, Michael "Toward greater scavenger congestion control deployment: implementations and interfaces" ACM/IRTF Applied Networking Research Workshop (ANRW) , 2021 https://doi.org/10.1145/3472305.3472323 Citation Details

Meng, Tong and Schiff, Neta Rozen and Godfrey, P. Brighten and Schapira, Michael "PCC Proteus: Scavenger Transport And Beyond" ACM SIGCOMM 2020 , 2020 https://doi.org/10.1145/3387514.3405891 Citation Details

William Sentosa, Balakrishnan Chandrasekaran "DChannel: Accelerating Mobile Applications With Parallel High-bandwidth and Low-latency Channels" 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI) , 2023 Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Rate control algorithms are central to the Internet, at both the transport and application layers. At the transport layer, congestion control (CC) has a crucial impact on user experience for Internet services such as video streaming and voice-over-IP, as well as emerging Internet services such as augmented and virtual reality, Internet of Things, edge computing, and more. At the application layer, adaptive bitrate (ABR) video streaming algorithms optimize video quality in real time. Rate control has emerged as one of the most important persistent challenges in networking. Rate control algorithms have to deal with networks that vary greatly in sizes, link capacities, latency, level of competition between connections, and more factors, all of which can change rapidly. Despite receiving only a small trickle of information about the network (one data point per packet), rate control algorithms must make millisecond-level decisions. Even recently-deployed protocols like BBR, deployed by Google and Microsoft respectively, have significant performance shortcomings.

This project seeks to develop a fundamental understanding of the use of reinforcement learning for rate control, and is applying that knowledge to improve protocol performance and functionality in ABR video and congestion control. To this end, the project produced four key technical results. First, the project developed PCC Proteus, a "scavenger" congestion controller, which utilizes available bandwidth for non-time-sensitive workloads (such as software upgrades and backups) without disturbing time-sensitive workloads. PCC Proteus leveraged an architecture based on online learning, and outperformed past scavenger designs (including LEDBAT). Second, the project extended that learning-based congestion control framework to support multiple paths, significantly outperforming the widely-used Multipath TCP (MPTCP) protocol. Third, the project developed a multipath packet steering system called DChannel that utilizes two channels -- one high bandwidth, one low latency -- simultaneously. Such channels have been standardized in 5G cellular networks, where the project's experiments show that even though the low-latency channel offers just 1% of the bandwidth of the high bandwidth channel, DChannel's use of both channels can improve web page load time and responsiveness of common mobile apps by 16-40%. Finally, the project developed a system to use machine learning as an adversary, to perform robustness testing of ABR video and congestion control protocols. This helps protocol designers to discover environments where their protocols perform poorly, and to fix the discovered problems.

These results can have broad impact. Congestion control protocols are used in nearly every networked application. Multiple paths or communication channels are becoming an important scenario, given the prevalence of mobile devices with multiple network interfaces and emerging 5G slicing. The project's results can broaden the applicability of these emerging technologies. Finally, the results on adversarial testing are important to improve the dependability of these core infrastructure protocols. The project trained multiple graduate and undergraduate student researchers. Results were disseminated via publications, presentations, and open-source releases of code and data.

Last Modified: 03/16/2024
Modified by: Philip B Godfrey

Please report errors in award information by writing to: awardsearch@nsf.gov.

Success

Error