Award Abstract # 1763929
CSR: Collaborative Research: Mobile Elastic Edge Clouds for Scalable, Low-Latency Services

NSF Org: CNS
Division Of Computer and Network Systems
Recipient: REGENTS OF THE UNIVERSITY OF CALIFORNIA AT RIVERSIDE
Initial Amendment Date: August 4, 2018
Latest Amendment Date: July 9, 2021
Award Number: 1763929
Award Instrument: Standard Grant
Program Manager: Karen Karavanic
CNS
 Division Of Computer and Network Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2018
End Date: September 30, 2022 (Estimated)
Total Intended Award Amount: $112,614.00
Total Awarded Amount to Date: $120,614.00
Funds Obligated to Date: FY 2018 = $112,614.00
FY 2019 = $8,000.00
History of Investigator:
  • Kadangode Ramakrishnan (Principal Investigator)
    kk@cs.ucr.edu
Recipient Sponsored Research Office: University of California-Riverside
200 UNIVERSTY OFC BUILDING
RIVERSIDE
CA  US  92521-0001
(951)827-5535
Sponsor Congressional District: 39
Primary Place of Performance: University of California-Riverside
CA  US  92521-0001
Primary Place of Performance
Congressional District:
39
Unique Entity Identifier (UEI): MR5QC5FCAVH5
Parent UEI:
NSF Program(s): Special Projects - CNS,
CSR-Computer Systems Research
Primary Program Source: 01001920DB NSF RESEARCH & RELATED ACTIVIT
01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 9251, 7354, 7924
Program Element Code(s): 171400, 735400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Smart wearables, the Internet of Things, and new application types, such as augmented reality promise to revolutionize how people interact with technology in their daily lives. While embedded and smart devices have growing capabilities, they still rely on a backend cloud infrastructure to provide additional storage and computational capacity. However, these new application types have characteristics such as strict performance requirements and frequent mobility that are ill suited for today's centralized clouds. This project will develop new system architectures that will increase the scalability, elasticity, and mobility of "edge" applications that connect to mobile users.

Towards this end, the project will explore the communication and system architectures needed to effectively support edge cloud services. The project will leverage advances in network function virtualization to provide high performance networking, and will explore the communication and Operating System primitives needed to support scalable middleboxes and application endpoints. Using this platform as a base, the project will design models that capture the new challenges inherent in mobile edge cloud workloads. These models will be used to guide elastic scaling algorithms.

We are increasingly reliant on mobile computing devices to guide our cars, help us keep in touch with others, gather data of our surroundings, and more. The mobile elastic edge cloud platform being developed in this project will help improve the scalability, agility, and efficiency of edge clouds, allowing them to support new types of performance critical applications. The researchers will engage a broad range of students from the undergraduate to Ph.D. levels in the educational and research activities of this grant.

There will be a project website (http://faculty.cs.gwu.edu/timwood/projects/me2c) that includes all of the artifacts produced throughout the project as well as links to key related technologies and papers. The web repository will include all of the source code developed during the course of the project, documentation with guidance to adopters on using the software, and links to all the papers published and technical reports that are released publicly. The project web page will be maintained for a period of five years after the end of the project.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 27)
Dhakal, Aditya and Kulkarni, Sameer G and Ramakrishnan, K. K. "ECML: Improving Efficiency of Machine Learning in Edge Clouds" 2020 IEEE 9th International Conference on Cloud Networking (CloudNet) , 2020 https://doi.org/10.1109/CloudNet51028.2020.9335804 Citation Details
Qi, Shixiong and Kulkarni, Sameer G. and Ramakrishnan, K. K. "Assessing Container Network Interface Plugins: Functionality, Performance, and Scalability" IEEE Transactions on Network and Service Management , v.18 , 2021 https://doi.org/10.1109/TNSM.2020.3047545 Citation Details
Qi, Shixiong and Monis, Leslie and Zeng, Ziteng and Wang, Ian-chin and Ramakrishnan, K. K. "SPRIGHT: extracting the server from serverless computing! high-performance eBPF-based event-driven, shared-memory processing" SIGCOMM '22: Proceedings of the ACM SIGCOMM 2022 Conference , 2022 https://doi.org/10.1145/3544216.3544259 Citation Details
Qi, Shixiong and Zeng, Ziteng and Monis, Leslie and Ramakrishnan, K. K. "MiddleNet: A Unified, High-Performance NFV and Middlebox Framework With eBPF and DPDK" IEEE Transactions on Network and Service Management , v.20 , 2023 https://doi.org/10.1109/TNSM.2023.3256891 Citation Details
Tran, Vincent and Tourrilhes, Jean and Ramakrishnan, K. K. and Sharma, Puneet "Accurate Available Bandwidth Measurement with Packet Batching Mitigation for High Speed Networks" 2021 IEEE International Symposium on Local and Metropolitan Area Networks (LANMAN) , 2021 https://doi.org/10.1109/LANMAN52105.2021.9478812 Citation Details
Wang, Ian-Chin and Qi, Shixiong and Liri, Elizabeth and Ramakrishnan, K. K. "Towards a Proactive Lightweight Serverless Edge Cloud for Internet-of-Things Applications" 2021 IEEE International Conference on Networking, Architecture and Storage (NAS) , 2021 https://doi.org/10.1109/NAS51552.2021.9605384 Citation Details
Wang, I. and Liri, E. and Ramakrishnan, K. K. "Supporting IoT Applications with Serverless Edge Clouds" 2020 IEEE 9th International Conference on Cloud Networking (CloudNet) , 2020 https://doi.org/10.1109/CloudNet51028.2020.9335805 Citation Details
Zeng, Ziteng and Monis, Leslie and Qi, Shixiong and Ramakrishnan, K. K. "MiddleNet: A High-Performance, Lightweight, Unified NFV and Middlebox Framework" 2022 IEEE 8th International Conference on Network Softwarization (NetSoft) , 2022 https://doi.org/10.1109/NetSoft54395.2022.9844083 Citation Details
Zhou, Liang and Bhuyan, Laxmi and Ramakrishnan, K. K. "Goldilocks: Adaptive Resource Provisioning in Containerized Data Centers" 39th IEEE International Conference on Distributed Computing Systems (ICDCS 2019) , 2019 Citation Details
Zhou, Liang and Bhuyan, Laxmi N. and Ramakrishnan, K. K. "Cottage: Coordinated Time Budget Assignment for Latency, Quality and Power Optimization in Web Search" IEEE International Symposium on High-Performance Computer Architecture (HPCA) , 2022 https://doi.org/10.1109/HPCA53966.2022.00017 Citation Details
Zhou, Liang and Bhuyan, Laxmi N. and Ramakrishnan, K. K. "DREAM: DistRibuted Energy-Aware traffic Management for Data Center Networks" Proceedings of the Tenth ACM International Conference on Future Energy Systems , 2019 10.1145/3307772.3328291 Citation Details
(Showing: 1 - 10 of 27)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

New applications that promise to revolutionize how people interact with technology in their daily lives have unique characteristics such as strict latency requirements and frequent mobility that are ill suited for today's centralized clouds. We designed a new Mobile Elastic Edge Cloud (ME2C) architecture to provide a programmable, converged architecture that can be customized for the diverse needs of emerging applications.

One of the primary contributions is to provide support on an edge platform for machine learning applications which require the computational support provided by GPU accelerators at the edge. Since many real-time applications require low-latency, high throughput machine learning (ML) functions resident in the network and in the cloud to perform learning and inference, having sufficient computing capability at the edge can be a significant enabler. We developed a better multiplexing on GPUs on edge cloud platforms to support running ML applications, especially Deep Neural Networks on high-end GPUs, such as the nVIDIA V100 GPUs. We created GSLICE, a framework to have controlled spatial and temporal sharing of the GPU by multiple DNNs. We have a true virtualization of the GPU that can be used my many of the applications using the edge cloud. We also used this to improve the tuning of DNNs, using Slice-Tune, which utilizes GSLICE as a base framework.

The second area we have worked on has been to provide function as a service (serverless computing) on resource-constrained edge platforms. We have worked on Container Network Interface (CNI) plug-in evaluations, developing a better framework for autoscaling and function placement as well as load balancing in serverless edge computing, and also looking at using the extended Berkeley Packet Filter (eBPF) as a way to develop an improved side car for serverless computing in KNative. We extended the Knative serverless framework to improve its performance, scalability, and fairness for an edge cloud computing serverless platform that we call "Mu", primarily focused on the control plane. We also looked at supporting IoT workloads on edge clouds using Knative serverless frameworks and how to improve the support for it. We also looked at how CNIs can be improved to provide fast initialization of virtualization for an agile edge cloud computing environment. Finally, we developed an eBPF-based shared-memory framework called SPRIGHT that substantially enhances the data plane for serverless computing.

A third area was to develop improved energy management in data centers, in particular leveraging the affinity of functions that communicate to place and migrate them so that related functions that communicate frequently are co-located. Our work in this area involved improving power management in Data Centers, especially smaller data centers. There were several aspects we worked on on. We combined Dynamic Voltage and Frequency Scaling (DVFS) techniques to take advantage of slack in latency of search queries and also Machine Learning techniques to predict the per-query service time and the error in the prediction to determine the right time to boost the CPU frequency.  We also worked on improving our framework for improved consolidation of data centers while carefully factoring in the cost of migration of containerized applications from one server to another in a data center using graph-partitioning to improve placement and migration of containers.

We have published several papers in all these areas. Two students got their PhDs (one advised by me, one advised by another colleague although I worked a lot with the student) and a 3rd PhD student is close to graduation at the time the project ended. I also supported a post-doctoral fellow who went on to a faculty position. The project also supported 3 undergraduate student researchers as part of a REU supplement. One undergraduate student (from an under-represented minority) worked on understanding and evaluating cloud orchestration environments and container networking interfaces (CNI). A second undergraduate student worked on understanding how to use edge clouds for automatic speech recognition, especially in the context of emergency management. A third undergraduate student examined using edge clouds, and shared memory for performing aggregation for Federated Learning. Multiple MS students also worked on projects related to this grant.

The research on this project was done in close collaboration with our partner, Prof. Tim Wood of George Washington University. The project has led to several publications in international conferences and also journal publications. I believe that our ideas on GPU multiplexing will significantly improve how GPUs are used in edge clouds. Similarly, the control plane and data plane development will improve KNative and other Kubernetes based cloud environments to have a better autoscaling algorithm, placement/scheduling and an improved shared-memory based data plane. The use of eBPF to improve the sidecar functionality of serverless frameworks will also have long term impact.

Overall, this has been a very successful series of efforts supported by the NSF grant, and we are very grateful to NSF for this support.

 


Last Modified: 06/29/2023
Modified by: Kadangode K Ramakrishnan

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page