Award Abstract # 1250886
BIGDATA: Small: DCM: Data Management for Analytics Applications on Modern Architecture

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: UNIVERSITY OF WISCONSIN SYSTEM
Initial Amendment Date: June 11, 2013
Latest Amendment Date: June 11, 2013
Award Number: 1250886
Award Instrument: Standard Grant
Program Manager: Sylvia Spengler
sspengle@nsf.gov
 (703)292-7347
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: June 1, 2013
End Date: May 31, 2018 (Estimated)
Total Intended Award Amount: $680,916.00
Total Awarded Amount to Date: $680,916.00
Funds Obligated to Date: FY 2013 = $680,916.00
History of Investigator:
  • Jignesh Patel (Principal Investigator)
    jignesh@cmu.edu
Recipient Sponsored Research Office: University of Wisconsin-Madison
21 N PARK ST STE 6301
MADISON
WI  US  53715-1218
(608)262-3822
Sponsor Congressional District: 02
Primary Place of Performance: University of Wisconsin-Madison
21 North Park Street
Madison
WI  US  53715-1218
Primary Place of Performance
Congressional District:
02
Unique Entity Identifier (UEI): LCLSJAGTNZQ7
Parent UEI:
NSF Program(s): Big Data Science &Engineering
Primary Program Source: 01001314DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7433, 7923, 8083
Program Element Code(s): 808300
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

We are now in the midst of the big data revolution where enterprise services are increasingly being driven by operational and business models that are powered by data analysis. A key part in making big data successful is ensuring that basic data processing primitives can execute efficiently on large and every increasing volumes of data. However, data processing kernels today largely employ techniques that have been designed about three decades ago, and are now out of touch with modern hardware that has made a fundamental technological shift. First, driven by power consumption characteristics, modern processors now have multiple processing units (called cores) fabricated in a single chip. In contrast, processors just a few years ago were single core. Second, traditionally the storage media for data has been the magnetic hard disk. Now, data has started to move nearly permanently to higher levels of the memory hierarchy, and more specifically to main memory. The goal of this project is to rethink key aspects of data processing techniques for the modern many-core and main memory hardware environment. The research approach is to design, implement and evaluate various methods for data kernels that can be used to store and process data efficiently. In other words, the key focus is on producing data kernels that "run at the speed of modern hardware." Thus, this project aims to have a broad impact on the big data ecosystem by developing faster, cheaper and more energy-efficient data kernels.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 25)
Craig Chasseur, Yinan Li, Jignesh M. Patel "Enabling JSON Document Stores in Relational Systems" WebDB , 2013
Harshad Deshmukh, Hakan Memisoglu, Jignesh M. Patel "Adaptive Concurrent Query Execution Framework for an Analytical In-Memory Database System" IEEE BigData Congress , 2017
Craig Chasseur and Jignesh M. Patel "Design and Evaluation of Storage Organizations for Read-Optimized Main Memory Databases" Proceedings of the VLDB Endowment (PVLDB), and to appear in the VLDB 2014 conference , v.6 , 2013 , p.http://ww
Craig Chasseur, Jignesh M. Patel "Design and Evaluation of Storage Organizations for Read-Optimized Main Memory Databases" PVLDB , v.6 , 2013
H. V. Jagadish, Johannes Gehrke, Alexandros Labrinidis, Yannis Papakonstantinou, Jignesh M. Patel, Raghu Ramakrishnan, Cyrus Shahabi "Big data and its technical challenges" Commun. ACM , v.57 , 2014 , p.86
H. V. Jagadish, Johannes Gehrke, Alexandros Labrinidis, Yannis Papakonstantinou, Jignesh M. Patel, Raghu Ramakrishnan, Cyrus Shahabi "Big data and its technical challenges" Commun. ACM , v.57 , 2014
Jason Power, Yinan Li, Mark D. Hill, Jignesh M. Patel, David A. Wood "Implications of Emerging 3D GPU Architecture on the Scan Primitive" SIGMOD Record , v.44 , 2015
Jianqiao Zhu and Navneet Potti and Saket Saurabh and Jignesh M. Patel "Looking Ahead Makes Query Plans Robust" {PVLDB} , v.10 , 2017 , p.889--900
Jianqiao Zhu, Navneet Potti, Saket Saurabh, Jignesh M. Patel "Looking Ahead Makes Query Plans Robust" PVLDB , v.10 , 2017
Jignesh M. Patel ":Operational NoSQL Systems: What's New and What's Next?" IEEE Computer , v.49 , 2016
Jignesh M. Patel "Operational NoSQL Systems: What's New and What's Next?" {IEEE} Computer , v.49 , 2016 , p.23--30 10.1109/MC.2016.118
(Showing: 1 - 10 of 25)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The move towards data-driven decision making in enterprises coupled with the growth in data volumes requires taking a fresh look at key mechanims that power data analytics platform. This fresh look is especially critical as the hardware used in modern servers uses compute and storage methods that have architecturally very different propoerties than those from a decade ago, which implies that traditional methods for data processing do not effectively exploit the full potential of modern hardware. The overall goal of this project was to design, implement and evaluate more efficient, and thus cost-effective, ways to carry out data analytics than existing methods.

Various data organization algoritms were developed as part of this project, contributing to the key components of the intellectual merit contributions of this project. Other intellectual contributions include the development of various algorithms for common data processing tasks such as searching though large datasets and combining informatoin from two or more differnent datasets. Methods were also developed to speed up machine learning programs by using static analysis methods. Overall the intellectual merits have lead to the creation of more efficient ways to carry out complex data analytics tasks.

The broader impacts of this work consists of taking ideas from this research and developing into an actual platform that has been open-sourced. Work under this project has been published in top-tier confereneces, and this project has contributed to the training of five Ph.D. students. 


Last Modified: 07/04/2018
Modified by: Jignesh M Patel

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page