Award Abstract # 1319405
CSR: Small: The Pocket Datacenter

NSF Org: CNS
Division Of Computer and Network Systems
Recipient: UNIVERSITY OF WISCONSIN SYSTEM
Initial Amendment Date: June 24, 2013
Latest Amendment Date: June 24, 2013
Award Number: 1319405
Award Instrument: Standard Grant
Program Manager: Marilyn McClure
mmcclure@nsf.gov
 (703)292-5197
CNS
 Division Of Computer and Network Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: September 1, 2013
End Date: August 31, 2017 (Estimated)
Total Intended Award Amount: $499,413.00
Total Awarded Amount to Date: $499,413.00
Funds Obligated to Date: FY 2013 = $499,413.00
History of Investigator:
  • Remzi Arpaci-Dusseau (Principal Investigator)
    remzi@cs.wisc.edu
  • Andrea Arpaci-Dusseau (Co-Principal Investigator)
Recipient Sponsored Research Office: University of Wisconsin-Madison
21 N PARK ST STE 6301
MADISON
WI  US  53715-1218
(608)262-3822
Sponsor Congressional District: 02
Primary Place of Performance: University of Wisconsin-Madison
21 North Park Street
Madison
WI  US  53715-1218
Primary Place of Performance
Congressional District:
02
Unique Entity Identifier (UEI): LCLSJAGTNZQ7
Parent UEI:
NSF Program(s): CSR-Computer Systems Research
Primary Program Source: 01001314DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7923
Program Element Code(s): 735400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Research and development in cloud-based distributed systems is currently
hindered due to the incredible difficulty of running such systems at
scale. The Wisconsin Pocket Datacenter (PoD) project directly addresses this
dilemma by enabling researchers and developers to run large-scale distributed
systems on modest hardware resources. The key novel idea behind PoD is data
elision, which removes actual application data from the underlying disks,
memories, and network transfers of the distributed system under test. PoD thus
enables many virtual instances of such a system to be run upon just a few
physical machines, thus making testing, debugging, and in general development
simpler, faster, and less resource-intensive.

Cloud-based distributed systems represent a critically-important piece of
modern computing infrastructure. As users and corporations migrate their data
and computation into such facilities, it is increasingly important to ensure
that such systems operate correctly and efficiently. PoD enables developers to
debug existing systems and test new designs without excessive hardware
resources; with such a tool in their workbench, system developers will be able
to more readily build and test the next generation of cloud-based systems to
support the needs of a growing percentage of society.

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Large-scale storage systems are central to the modern information economy. Behind every important Internet service, including Google's Gmail, Facebook's social network, etc., lies the data that drives each service; thus, the scalable storage systems that store the vast quantities of data inside each service, and specifically their performance and reliability characteristics, are of extreme importance.

The "Pocket Datacenter" Project at the University of Wisconsin-Madison addressed these central problems in the design and implementation of large-scale computer systems. How can researchers and practitioners analyze existing large-scale storage systems so as to better understand their deficiencies? How can these results help drive the creation of new and improved storage systems?

The research done within the confines of the project advanced the state of the art in systems evaluation techniques in numerous critical directions. For example, the Chopper tool was developed and can pinpoint high tail latencies in file systems, using advanced statistical methods based on Latin Hypercube sampling. In addition, a new approach to reducing the high costs of translation in modern SSD-based storage systems was developed with the File System Devirtualizer (FSDV). A third example is the extension of previous emulation technology; although originally developed to analyze storage systems, here the basic idea was applied to improve storage performance in virtualized environments with a technique called System-Call Interception (SCI). A fourth advancement came in the form of a tool to analyze the robustness of existing distributed storage systems to correlated failures called the Protocol-Aware Crash Explorer (PACE). Finally, an extensive SSD simulation environment called WiscSim was created, enabling the deep analysis of the modern storage systems stack, finding many flaws in existing designs as well as suggesting new directions.

Overall, the Pocket Datacenter project has advanced our knowledge of how high-end storage systems should be designed, as well as our understanding of how to build tools to perform such analyses. The end goal of helping to realize significant improvements in existing systems, and helping to shape the future design and implementation of such systems, was thus achieved.


Last Modified: 11/29/2017
Modified by: Remzi H Arpaci-Dusseau

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page