Award Abstract # 1730128
II-EN: Collaborative Research: Enhancing the Parasol Experimental Testbed for Sustainable Computing

NSF Org: CNS
Division Of Computer and Network Systems
Recipient: THE RESEARCH FOUNDATION FOR THE STATE UNIVERSITY OF NEW YORK
Initial Amendment Date: June 8, 2017
Latest Amendment Date: June 8, 2017
Award Number: 1730128
Award Instrument: Standard Grant
Program Manager: Marilyn McClure
mmcclure@nsf.gov
 (703)292-5197
CNS
 Division Of Computer and Network Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: July 1, 2017
End Date: June 30, 2022 (Estimated)
Total Intended Award Amount: $24,215.00
Total Awarded Amount to Date: $24,215.00
Funds Obligated to Date: FY 2017 = $24,215.00
History of Investigator:
  • Anshul Gandhi (Principal Investigator)
    anshul@cs.stonybrook.edu
  • Zhenhua Liu (Co-Principal Investigator)
Recipient Sponsored Research Office: SUNY at Stony Brook
W5510 FRANKS MELVILLE MEMORIAL LIBRARY
STONY BROOK
NY  US  11794-0001
(631)632-9949
Sponsor Congressional District: 01
Primary Place of Performance: SUNY at Stony Brook
NY  US  11794-3362
Primary Place of Performance
Congressional District:
01
Unique Entity Identifier (UEI): M746VC6XMNH9
Parent UEI: M746VC6XMNH9
NSF Program(s): CCRI-CISE Cmnty Rsrch Infrstrc
Primary Program Source: 01001718DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7359
Program Element Code(s): 735900
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

This project will enhance an experimental datacenter for sustainable computing. Datacenters consume vast amounts of energy, totaling about 1.8% of the US electricity usage in 2014. Thus, the energy efficiency, energy-related costs, and overall sustainability of datacenters are of critical concerns. NSF funded experimental green datacenter called Parasol has previously demonstrated that the combination of green design and intelligent software management systems can lead to significant reductions in energy consumption, carbon emission, and cost. The enhanced version of this project will update energy sources, network technologies and management software.

Running real experiments in live conditions using Parasol led to findings that were not possible in simulation. This proposal seeks to update and enhance Parasol with current and next generation power-efficient servers, improve network connectivity and integrate software-defined networking (SDN) and Wi-Fi capabilities, increase solar energy generation capacity, add a low emission fuel cell power source, diversify energy storage, and improve the cooling system to advance green computing. The PIs will need to update and enhance Parasol's current software stack for monitoring, programmatic control, and remote access for the new hardware enhancements. Specific research goals are resource management in green datacenters, that includes coordinated workload, cooling, and energy scheduling against environmental and load variability to maximize the benefits of green datacenters and to help improve grid power management leveraging accelerators such as GPUs and deep learning hardware, which promise excellent performance/watt ratios.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 12)
Cao, Yi and Jain, Arpit and Sharma, Kriti and Balasubramanian, Aruna and Gandhi, Anshul "When to use and when not to use BBR: An empirical analysis and evaluation study" IMC '19: Proceedings of the Internet Measurement Conference , 2019 10.1145/3355369.3355579 Citation Details
Comden, Joshua and Yao, Sijie and Chen, Niangjun and Xing, Haipeng and Liu, Zhenhua "Online Optimization in Cloud Resource Provisioning: Predictions, Regrets, and Algorithms" Proceedings of the ACM on Measurement and Analysis of Computing Systems , v.3 , 2019 https://doi.org/10.1145/3322205.3311087 Citation Details
Comden, Joshua and Yao, Sijie and Chen, Niangjun and Xing, Haipeng and Liu, Zhenhua "Online Optimization in Cloud Resource Provisioning: Predictions, Regrets, and Algorithms" Proceedings of the ACM on Measurement and Analysis of Computing Systems , 2019 Citation Details
Gandhi, A. and Ghose, K. and Gopalan, K. and Hussain, S. and Lee, D. and Liu, Y. and Liu, Z. and McDaniel, P. and Mu, S. and Zadok, E. "Metrics for Sustainability in Data Centers" Proceedings of the 1st Workshop on Sustainable Computer Systems Design and Implementation (HotCarbon'22) , 2022 Citation Details
Hafeez, Ubaid U. and Gandhi, A. "SLO-Aware Space-Time GPU Sharing for DL Workloads" Non-archival poster presentation in the 13th ACM Symposium on Cloud Computing , 2022 Citation Details
Hafeez, Ubaid U and Sun, Xiao and Gandhi, Anshul and Liu, Zhenhua "Towards Optimal Placement and Scheduling of DNN Operations with Pesto" Proceedings of the 22nd International Middleware Conference , 2021 https://doi.org/10.1145/3464298.3476132 Citation Details
Hafeez, Ubaid U and Wajahat, M and Gandhi, A. "ElMem: Towards an Elastic Memcached System" Proceedings of the International Conference on Distributed Computing Systems , 2018 Citation Details
Javadi, Seyyed Ahmad and Gupta, Harsh and Manhas, Robin and Sahu, Shweta and Gandhi, Anshul "EASY: Efficient Segment Assignment Strategy for Reducing Tail Latencies in Pinot" 2018 IEEE 38th International Conference on Distributed Computing Systems , 2018 10.1109/ICDCS.2018.00144 Citation Details
Le, Tan N. and Sun, Xiao and Chowdhury, Mosharaf and Liu, Zhenhua "AlloX: compute allocation in hybrid clusters" Eurosys 2020 , 2020 10.1145/3342195.3387547 Citation Details
Maghakian, Jessica and Comden, Joshua and Liu, Zhenhua "Online optimization in the Non-Stationary Cloud: Change Point Detection for Resource Provisioning (Invited Paper)" 2019 53rd Annual Conference on Information Sciences and Systems (CISS) , 2019 10.1109/CISS.2019.8692890 Citation Details
Shang, Xiaojun and Liu, Zhenhua and Yang, Yuanyuan "Online Service Function Chain Placement for Cost-effectiveness and Network Congestion Control" IEEE Transactions on Computers , 2020 https://doi.org/10.1109/TC.2020.3035991 Citation Details
(Showing: 1 - 10 of 12)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The Parasol “green” micro-datacenter was built to provide an isolated and instrumented testbed in support of research on sustainable computing. It has proven to be an effective instrument for research involving power/energy, cooling, and their coordination with and impact on resource management, scheduling, and system/application performance and dependability. As cloud computing evolves rapidly to include “edge” computing, Parasol will also provide an effective platform for studying edge datacenters and their uses to support novel application services. Unfortunately, Parasol was aging and its utility as a research instrument was degrading as infrastructure components and computing equipment started to fail.

The major goal of this project is to update and enhance Parasol to extend its lifetime and increase its capabilities.  Specifically, the goals for the project are to (1) populate the enhanced testbed with state-of-the-art heterogeneous servers, hardware accelerators, and improved network connectivity; (2) repair and enhance Parasol’s infrastructure with special emphasis on the cooling system; (3) update and enhance Parasol’s software stack for monitoring, programmatic control, and remote access so that the testbed can be used by a significantly expanded set of researchers from Rutgers, Ohio State, and Stony Brook; and, (4) continue to collect data from Parasol’s operation for use in simulation and analytical studies of green datacenters.

All of the above goals have been achieved at the conclusion of the project. Parasol is now equipped with state of the art heterogeneous computing servers and network switches, connected to a heterogeneous cluster in a machine room, and is fully operational. The renovated instrument will enable the PIs to carry out a rich research agenda that explores the entire hardware/software ecosystem in green (edge) datacenters, as well as interactions between green (edge) datacenters and external systems. Specific research areas include: (1) Resource management, including issues such as coordinated workload, cooling, and energy scheduling in the presence of environmental and load variability to maximize the benefits of green datacenters. (2) Scheduling of emerging edge datacenter applications such as object recognition and video processing on heterogeneous hardware, including CPUs, GPUs, and video encoding/decoding accelerators, and in the context of green edge datacenters. (3) Developing analytic models to manage accelerators and processing speeds across whole racks as energy budgets vary—i.e., computational sprinting. (4) Using green edge data and computation centers in distributed application workflows that span mobile, edge, in-transit, and cloud computing. (5) Exploring adaptive automatic configuration management frameworks that can help users and applications adapt to changing runtime conditions for appropriate tradeoff between application quality and resource availability and consumption. (6) Investigating scheduling, placement, and sharing of GPU resources to improve GPU utilization and reduce latency when execution DNN training and inference jobs on GPU-equipped servers. (7) Developing algorithms for the dynamic resource management of virtual machines in datacenters in an online manner and incorporating changepoint detection into the online resource management algorithms. (8) Developing algorithms for scheduling jobs over interchangeable resources such as CPUs and GPUs to optimize performance while providing fairness among users sharing the cluster.

In fact, as the renovation of Parasol was done in several steps, the PIs have already successfully used the instrument in a number of projects. This has led to a number of publications along the research directions described above, as well as open source products and data available through the PIs’ websites.

 


Last Modified: 11/03/2022
Modified by: Anshul Gandhi

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page