Award Abstract # 0910859
DC: Large: Collaborative Research: ASTERIX: A Highly Scalable Parallel Platform for Semistructured Data Management and Analysis

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: REGENTS OF THE UNIVERSITY OF CALIFORNIA AT RIVERSIDE
Initial Amendment Date: August 15, 2009
Latest Amendment Date: August 15, 2009
Award Number: 0910859
Award Instrument: Standard Grant
Program Manager: Frank Olken
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: August 15, 2009
End Date: July 31, 2013 (Estimated)
Total Intended Award Amount: $429,261.00
Total Awarded Amount to Date: $429,261.00
Funds Obligated to Date: FY 2009 = $429,261.00
History of Investigator:
  • Vassilis Tsotras (Principal Investigator)
    tsotras@cs.ucr.edu
Recipient Sponsored Research Office: University of California-Riverside
200 UNIVERSITY OFC BUILDING
RIVERSIDE
CA  US  92521-0001
(951)827-5535
Sponsor Congressional District: 39
Primary Place of Performance: University of California-Riverside
200 UNIVERSITY OFC BUILDING
RIVERSIDE
CA  US  92521-0001
Primary Place of Performance Congressional District: 39
Unique Entity Identifier (UEI): MR5QC5FCAVH5
Parent UEI:
NSF Program(s): Info Integration & Informatics, DATA-INTENSIVE COMPUTING
Primary Program Source: 01000910DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): HPCC, 7793, 7925, 9215
Program Element Code(s): 736400, 779300
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

The evolution of the "human Web," powered by HTML and HTTP, has revolutionized the way that people find information, buy things, communicate, and collaborate. Web services and semi-structured data formats are having a similar impact on the "machine Web." XML is enriching our ability to find and interchange information today; industry verticals have created XML-based data exchange standards; and XML backbones have gained adoption in support of service-oriented architectures and software-as-a-service initiatives. Other semi-structured formats, like JSON, are playing similar roles, and XML is increasingly being used for its original purpose of semantic document markup. As a result, the world will soon be awash in a sea of semi-structured information.

The ASTERIX project is developing new technologies for ingesting, storing, managing, indexing, querying, analyzing, and subscribing to vast quantities of semi-structured information. The project is combining ideas from three distinct areas - semi-structured data, parallel databases, and data-intensive computing - to create a next-generation, open-source software platform that scales by running on large, shared-nothing computing clusters. ASTERIX targets a wide range of semi-structured information, from "data" use cases - where information is well tagged and highly regular - to "content" use cases - where data is irregular and much of each datum is textual. ASTERIX is taking an open stance on data formats and addressing research issues including highly scalable data storage and indexing, semi-structured query processing on very large clusters, and the merging of parallel database techniques with today's data-intensive computing techniques to support performant yet declarative solutions to the problem of analyzing semi-structured information.

Project website: http://asterix.ics.uci.edu/

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH


Mariam Salloum, Xin Luna Dong, Divesh Srivastava, Vassilis J. Tsotras, "Online Ordering of Overlapping Data Sources," Proceedings of the VLDB Endowment, v.7, 2013, p.133

