Award Abstract # 1421643
CSR: Small: Collaborative Research: Adaptive Memory Resource Management in a Data Center - A Transfer Learning Approach

NSF Org: CNS
Division Of Computer and Network Systems
Recipient: WESTERN MICHIGAN UNIVERSITY
Initial Amendment Date: September 4, 2014
Latest Amendment Date: June 29, 2015
Award Number: 1421643
Award Instrument: Standard Grant
Program Manager: Marilyn McClure
mmcclure@nsf.gov
 (703)292-5197
CNS
 Division Of Computer and Network Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2014
End Date: September 30, 2019 (Estimated)
Total Intended Award Amount: $100,000.00
Total Awarded Amount to Date: $112,000.00
Funds Obligated to Date: FY 2014 = $100,000.00
FY 2015 = $12,000.00
History of Investigator:
  • Steven Carr (Principal Investigator)
    steve.carr@wmich.edu
Recipient Sponsored Research Office: Western Michigan University
1903 W MICHIGAN AVE
KALAMAZOO
MI  US  49008-5200
(269)387-8298
Sponsor Congressional District: 04
Primary Place of Performance: Western Michigan University
1903 W. Michigan Ave.
Kalamazoo
MI  US  49008-5200
Primary Place of Performance
Congressional District:
04
Unique Entity Identifier (UEI): J7WULLYGFRH1
Parent UEI:
NSF Program(s): CSR-Computer Systems Research
Primary Program Source: 01001415DB NSF RESEARCH & RELATED ACTIVIT
01001516DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7923, 9178, 9251
Program Element Code(s): 735400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Cloud computing has become a dominant scalable computing platform for both online services and conventional data-intensive computing (examples include Amazon's EC2, Microsoft's Azure, IBM's SmartCloud, etc.). Cloud computing data centers share computing resources among a large set of users, providing a cost effective means to allow users access to computational power and data storage not practical for an individual. A data center often has to over-commit its resources to meet Quality of Service contracts. The data center software needs to effectively manage its resources to meet the demands of users submitting a variety of applications, without any prior knowledge of these applications.

This work is focused on the issue of management of memory resources in a data center. Recent progress in transfer learning methods inspires this work in the creation of dynamic models to predict the cache and memory requirements of an application. The project has four main tasks: (i) an investigation into how recent advancements in transfer learning can help solve data center resource management problems, (ii) development of a dynamic cache predictor using on-the-fly virtual machine measurements, (iii) creation of a dynamic memory predictor using runtime characteristics of a virtual machine, and (iv) development of a unified resource management scheme creating a set of heuristics that dynamically adjust cache and memory allocation to fulfill Quality of Service goals. In tasks (i)-(iii), transfer learning methods are employed and explored to facilitate the transfer of knowledge and models to new system environments and applications based on extensive training on existing systems and benchmark applications. The prediction models and management scheme will be evaluated on common benchmarks including SPEC WEB and CloudSuite 2.0. The results of this research will have broad impact on the design and implementation of cloud computing data centers. The results will help improve resource utilization, boost system throughput, and improve predication performance in a cloud computing virtualization system. Additionally, the methods designed and knowledge they impart will advance understanding in both systems research and machine learning.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Hao LiJialiang ChangZijiang YangSteve Carr "Memory Distance Measurement for Concurrent Programs" The 30th International Workshop on Languages and Compilers for Parallel Computing (LCPC 2017) , 2017
Nasser AlsaediSteve CarrAlvis Fong "Applying Supervised Learning to the Static Prediction of Locality-Pattern Complexity in Scientific Code." The 2018 International Conference on Machine Learning and Its Applications. , 2018

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Cloud computing has become a dominant for online services. To guarantee Quality of Service (QoS), A cloud center usually commits resources than necessary for time of high demand. This project focused on the effective management of machine resources in order to allow a cloud center to utilize fewer machines and achieve the same QoS.

Specifically, this research developed automatic techniques to 
	
	1) Predict the memory resources needed by an application from the source code using machine learning, and
	2) Use that prediction to manage the resources used by the application at run-time to achieve QoS

This research has advanced our understanding of both systems and machine learning as they apply to source code. The methods developed in this research will help improve resource utilization, boost system throughput, and provide predictable performance. Our research may thus benefit a broad range of end users
of cloud computing. 

This project helped fund the education of one Ph.D. student. The nature of this interdisciplinary research provided the student a deep insight
into two fundamental areas in computer science: computer systems and machine learning. In addition, an undergraduate student, and future graduate student in computer science, developed and understanding of the same areas as part of a Research Experience for Undergraduates supplemental award.

Last Modified: 12/11/2019
Modified by: Steven M Carr

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page