Award Abstract # 2212256
OAC Core: A Scalable and Deployable Container Orchestration Cyber Infrastructure Toolkit for Deploying Big Data Analytics Applications in Public Cloud

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: VIRGINIA POLYTECHNIC INSTITUTE & STATE UNIVERSITY
Initial Amendment Date: June 24, 2022
Latest Amendment Date: June 24, 2022
Award Number: 2212256
Award Instrument: Standard Grant
Program Manager: Varun Chandola
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: July 1, 2022
End Date: March 31, 2023 (Estimated)
Total Intended Award Amount: $324,275.00
Total Awarded Amount to Date: $324,275.00
Funds Obligated to Date: FY 2022 = $0.00
History of Investigator:
  • Liting Hu (Principal Investigator)
    liting@ucsc.edu
Recipient Sponsored Research Office: Virginia Polytechnic Institute and State University
300 TURNER ST NW
BLACKSBURG
VA  US  24060-3359
(540)231-5281
Sponsor Congressional District: 09
Primary Place of Performance: Virginia Polytechnic Institute and State University
620 Drillfield Drive
BLACKSBURG
VA  US  24061-1050
Primary Place of Performance
Congressional District:
09
Unique Entity Identifier (UEI): QDE5UHE5XD16
Parent UEI: X6KEFGLHSJX7
NSF Program(s): OAC-Advanced Cyberinfrast Core
Primary Program Source: 01002223DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 026Z, 7923, 9102
Program Element Code(s): 090Y00
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Today many big data analytics applications (e.g., fraud detection, social data analytics, education, climate modeling, epidemiology, and finance) need to process enormous datasets from geographically distributed locations. An emerging trend is to host these big data analytics applications in the public cloud. They can be packaged to run in a lightweight isolated execution environment (containers) and deployed on computing resources rented from public cloud providers, which can be updated and scaled seamlessly. However, the complex inter-container correlations and the heterogeneity of hardware resources pose significant challenges in managing these big data analytics applications in the public cloud. This project enables the easy deployment of containerized big data analytics applications in the public cloud and provides cloud providers with insights to better tune their systems for current and future big data workloads.

The goal of this project is to develop a scalable and deployable cyber infrastructure (CI) container orchestration toolkit for deploying large numbers of containerized big data analytics applications on heterogeneous nodes in state-of-the-art public multi/hybrid-cloud. This project spans three complementary thrusts: (i) a novel ?black-box? lightweight tool is implemented, which detects inter-container correlations for containerized big data analytics applications in a non-intrusive manner via hierarchical clustering and co-occurrence analysis; (ii) a novel scalable container scheduler is implemented, which deploys containerized big data analytics applications on heterogeneous nodes in the public cloud in a correlation-aware manner; and (iii) the system is implemented on open-source container orchestration tools and validated by subjecting it to experimentation on both the lab-based prototype and the practical, real-world data centers. In addition to its technical contributions, this project involves various educational and outreach activities as well. The results of the research are integrated into the undergraduate and graduate systems courses. Finally, the toolkit, source code, datasets, and course materials developed in this project are documented and open-sourced.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Liu, Jianshu and Wang, Qingyang and Liu, Jianshu and Zhang, Shungeng and Hu, Liting and Silva, Dilma Da "Sora: A Latency Sensitive Approach for Microservice Soft Resource Adaption" 24th ACM/IFIP International Middleware Conference , 2023 Citation Details
Xu, Hailu and Lin, Pei-Hung and Emani, Murali and Hu, Liting and Liao, Chunhua "XUnified: A Framework for Guiding Optimal Use of GPU Unified Memory" IEEE Access , v.10 , 2022 https://doi.org/10.1109/ACCESS.2022.3196008 Citation Details
Xu, Hailu and Liu, Pinchao and Ahmed, Sarker Tanzir and Da Silva, Dilma and Hu, Liting "Adaptive Fragment-based Parallel State Recovery for Stream Processing Systems" IEEE Transactions on Parallel and Distributed Systems , 2023 https://doi.org/10.1109/TPDS.2023.3251997 Citation Details
Xu, Hailu and Liu, Pinchao and Guan, Boyuan and Wang, Qingyang and Da Silva, Dilma and Hu, Liting "Achieving Online and Scalable Information Integrity by Harnessing Social Spam Correlations" IEEE Access , v.11 , 2023 https://doi.org/10.1109/ACCESS.2023.3236604 Citation Details

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page