
NSF Org: |
CNS Division Of Computer and Network Systems |
Recipient: |
|
Initial Amendment Date: | September 4, 2014 |
Latest Amendment Date: | June 29, 2015 |
Award Number: | 1421643 |
Award Instrument: | Standard Grant |
Program Manager: |
Marilyn McClure
mmcclure@nsf.gov (703)292-5197 CNS Division Of Computer and Network Systems CSE Directorate for Computer and Information Science and Engineering |
Start Date: | October 1, 2014 |
End Date: | September 30, 2019 (Estimated) |
Total Intended Award Amount: | $100,000.00 |
Total Awarded Amount to Date: | $112,000.00 |
Funds Obligated to Date: |
FY 2015 = $12,000.00 |
History of Investigator: |
|
Recipient Sponsored Research Office: |
1903 W MICHIGAN AVE KALAMAZOO MI US 49008-5200 (269)387-8298 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
1903 W. Michigan Ave. Kalamazoo MI US 49008-5200 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | CSR-Computer Systems Research |
Primary Program Source: |
01001516DB NSF RESEARCH & RELATED ACTIVIT |
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
Cloud computing has become a dominant scalable computing platform for both online services and conventional data-intensive computing (examples include Amazon's EC2, Microsoft's Azure, IBM's SmartCloud, etc.). Cloud computing data centers share computing resources among a large set of users, providing a cost effective means to allow users access to computational power and data storage not practical for an individual. A data center often has to over-commit its resources to meet Quality of Service contracts. The data center software needs to effectively manage its resources to meet the demands of users submitting a variety of applications, without any prior knowledge of these applications.
This work is focused on the issue of management of memory resources in a data center. Recent progress in transfer learning methods inspires this work in the creation of dynamic models to predict the cache and memory requirements of an application. The project has four main tasks: (i) an investigation into how recent advancements in transfer learning can help solve data center resource management problems, (ii) development of a dynamic cache predictor using on-the-fly virtual machine measurements, (iii) creation of a dynamic memory predictor using runtime characteristics of a virtual machine, and (iv) development of a unified resource management scheme creating a set of heuristics that dynamically adjust cache and memory allocation to fulfill Quality of Service goals. In tasks (i)-(iii), transfer learning methods are employed and explored to facilitate the transfer of knowledge and models to new system environments and applications based on extensive training on existing systems and benchmark applications. The prediction models and management scheme will be evaluated on common benchmarks including SPEC WEB and CloudSuite 2.0. The results of this research will have broad impact on the design and implementation of cloud computing data centers. The results will help improve resource utilization, boost system throughput, and improve predication performance in a cloud computing virtualization system. Additionally, the methods designed and knowledge they impart will advance understanding in both systems research and machine learning.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
Cloud computing has become a dominant for online services. To guarantee Quality of Service (QoS), A cloud center usually commits resources than necessary for time of high demand. This project focused on the effective management of machine resources in order to allow a cloud center to utilize fewer machines and achieve the same QoS.
Specifically, this research developed automatic techniques to
1) Predict the memory resources needed by an application from the source code using machine learning, and
2) Use that prediction to manage the resources used by the application at run-time to achieve QoS
This research has advanced our understanding of both systems and machine learning as they apply to source code. The methods developed in this research will help improve resource utilization, boost system throughput, and provide predictable performance. Our research may thus benefit a broad range of end users
of cloud computing.
This project helped fund the education of one Ph.D. student. The nature of this interdisciplinary research provided the student a deep insight
into two fundamental areas in computer science: computer systems and machine learning. In addition, an undergraduate student, and future graduate student in computer science, developed and understanding of the same areas as part of a Research Experience for Undergraduates supplemental award.
Last Modified: 12/11/2019
Modified by: Steven M Carr
Please report errors in award information by writing to: awardsearch@nsf.gov.