Award Abstract # 2211888
CNS Core: Medium:Model-driven Resource Management for Avoiding Performance Pitfalls in Edge Computing

NSF Org: CNS
Division Of Computer and Network Systems
Recipient: UNIVERSITY OF MASSACHUSETTS
Initial Amendment Date: August 22, 2022
Latest Amendment Date: September 9, 2024
Award Number: 2211888
Award Instrument: Continuing Grant
Program Manager: Marilyn McClure
mmcclure@nsf.gov
 (703)292-5197
CNS
 Division Of Computer and Network Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2022
End Date: September 30, 2026 (Estimated)
Total Intended Award Amount: $1,199,134.00
Total Awarded Amount to Date: $882,486.00
Funds Obligated to Date: FY 2022 = $581,353.00
FY 2024 = $301,133.00
History of Investigator:
  • Prashant Shenoy (Principal Investigator)
    shenoy@cs.umass.edu
  • David Irwin (Co-Principal Investigator)
Recipient Sponsored Research Office: University of Massachusetts Amherst
101 COMMONWEALTH AVE
AMHERST
MA  US  01003-9252
(413)545-0698
Sponsor Congressional District: 02
Primary Place of Performance: University of Massachusetts Amherst
Research Administration Building
Hadley
MA  US  01035-9450
Primary Place of Performance
Congressional District:
02
Unique Entity Identifier (UEI): VGJHK59NMPK9
Parent UEI: VGJHK59NMPK9
NSF Program(s): CSR-Computer Systems Research
Primary Program Source: 01002223DB NSF RESEARCH & RELATED ACTIVIT
01002425DB NSF RESEARCH & RELATED ACTIVIT

01002526DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7924
Program Element Code(s): 735400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

A variety of new distributed applications are emerging that have strict low-latency requirements, including real-time machine learning-based inference, Internet-of-Things services, mobile augmented reality, and cloud gaming. To support these applications, cloud providers are building out large-scale distributed edge infrastructures that can provide computing and storage resources in close proximity to end users. Yet, despite the significant network latency advantage of edge servers, edge computing remains vulnerable to numerous performance pitfalls that can lead to reduced performance. This counter-intuitive behavior primarily occurs when edge resource constraints or workload bursts cause high queuing delays and response times that significantly increase latency. To address this problem, this project will develop rigorous analytical models of edge and cloud performance to gain a fundamental understanding of when and why edge performance problems occur in practice. The project will then apply these models to design novel, but practical, resource management policies that can enable edge computing to provide low latency for a wide range of real-world applications. These policies include i) edge elasticity and bursting that adaptively scale edge resources within and across edge and cloud sites under workload spikes; ii) performance isolation for edge accelerators that flexibly multiplexes accelerators across applications to increase their utilization; and iii) dynamic resource provisioning and allocation for serverless edge computing that determines the resources needed by serverless containers to satisfy tail latency requirements. Collectively, these models and policies will enable edge computing to fulfill its potential to support new classes of low-latency applications.

The project has the potential for significant practical impact by enabling commercial cloud providers to offer low-latency edge cloud services, which is important in supporting numerous emerging applications with strict low-latency requirements. The project will conduct outreach by incorporating relevant research topics within summer programs for local middle and high school students. The project will also inject elements of edge computing and cloud computing into current graduate and advanced undergraduate classes at the PIs' institution. The project will emphasize recruiting a diverse group of undergraduate and graduate students through participation in REU programs and institutional diversity efforts. Finally, the software artifacts and datasets from the project will be made available to the research community as open source via the UMass Trace Repository.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 13)
Liang, Qianlin and Hanafy, Walid A. and Ali-Eldin, Ahmed and Shenoy, Prashant "Model-driven Cluster Resource Management for AI Workloads in Edge Clouds" ACM Transactions on Autonomous and Adaptive Systems , v.18 , 2023 https://doi.org/10.1145/3582080 Citation Details
Liang, Qianlin and Hanafy, Walid A. and Bashir, Noman and Ali-Eldin, Ahmed and Irwin, David and Shenoy, Prashant "Dlen: Enabling Flexible and Adaptive Model-serving for Multi-tenant Edge AI" ACM/IEEE Conference on Internet of Things Design and Implementation , 2023 https://doi.org/10.1145/3576842.3582375 Citation Details
Liang, Qianlin and Hanafy, Walid A and Bashir, Noman and Irwin, David and Shenoy, Prashant "Energy Time Fairness: Balancing Fair Allocation of Energy and Time for GPU Workloads" , 2023 https://doi.org/10.1145/3583740.3628435 Citation Details
Maji, Diptyaroop and Sitaraman, Ramesh K. and Shenoy, Prashant "DACF: day-ahead carbon intensity forecasting of power grids using machine learning" Proceedings of the Thirteenth ACM International Conference on Future Energy Systems (e-Energy 22) , 2022 https://doi.org/10.1145/3538637.3538849 Citation Details
Savasci, M and Souza, A and Wu, L and Irwin, D and Ali-Eldin, A and Shenoy, P "SLO-Power: SLO and Power-aware Elastic Scaling for Web Services" , 2024 Citation Details
Souza, Abel and Jasoria, Shruti and Chakrabarty, Basundhara and Bridgwater, Alexander and Lundberg, Axel and Skogh, Filip and Ali-Eldin, Ahmed and Irwin, David and Shenoy, Prashant "CASPER: Carbon-Aware Scheduling and Provisioning for Distributed Web Services" , 2023 https://doi.org/10.1145/3634769.3634812 Citation Details
Wang, Bin and Irwin, David and Shenoy, Prashant and Towsley, Don "INVAR: Inversion Aware Resource Provisioning and Workload Scheduling for Edge Computing" , 2024 https://doi.org/10.1109/INFOCOM52122.2024.10621417 Citation Details
Atrey, Akanksha and Sinha, Ritwik and Mitra, Saayan and Shenoy, Prashant "SODA: Protecting Proprietary Information in On-Device Machine Learning Models" , 2023 https://doi.org/10.1145/3583740.3626617 Citation Details
Bashir, Noman and Chandio, Yasra and Irwin, David and Anwar, Fatima M. and Gummeson, Jeremy and Shenoy, Prashant "Jointly Managing Electrical and Thermal Energy in Solar- and Battery-powered Computer Systems" 14th ACM International Conference on Future Energy Systems , 2023 https://doi.org/10.1145/3575813.3595191 Citation Details
Hanafy, Walid A. and Liang, Qianlin and Bashir, Noman and Irwin, David and Shenoy, Prashant "CarbonScaler: Leveraging Cloud Workload Elasticity for Optimizing Carbon-Efficiency" Proceedings of the ACM on Measurement and Analysis of Computing Systems , v.7 , 2023 https://doi.org/10.1145/3626788 Citation Details
Hanafy, Walid A. and Liang, Qianlin and Bashir, Noman and Souza, Abel and Irwin, David and Shenoy, Prashant "Going Green for Less Green: Optimizing the Cost of Reducing Cloud Carbon Emissions" Proceedings of ACM ASPLOS Conference , 2024 https://doi.org/10.1145/3620666.3651374 Citation Details
(Showing: 1 - 10 of 13)

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page