
NSF Org: |
OAC Office of Advanced Cyberinfrastructure (OAC) |
Recipient: |
|
Initial Amendment Date: | September 8, 2014 |
Latest Amendment Date: | September 8, 2014 |
Award Number: | 1440715 |
Award Instrument: | Standard Grant |
Program Manager: |
Kevin Thompson
kthompso@nsf.gov (703)292-4220 OAC Office of Advanced Cyberinfrastructure (OAC) CSE Directorate for Computer and Information Science and Engineering |
Start Date: | October 1, 2014 |
End Date: | September 30, 2017 (Estimated) |
Total Intended Award Amount: | $853,658.00 |
Total Awarded Amount to Date: | $853,658.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
104 AIRPORT DR STE 2200 CHAPEL HILL NC US 27599-5023 (919)966-3411 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
100 Europa Dr. Chapel Hill NC US 27517-2398 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | Information Technology Researc |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
Science collaborations today are multi-institutional and multi-disciplinary, requiring processing of large datasets that are distributed among their participants. They also require advanced computing and networking infrastructure to process the data and arrive at results. A fundamental problem with the way these collaborations operate today lies in the fact that they are structured around the ownership of data or necessary pieces of infrastructure, rather than around the talents and capabilities of participating organizations. This project is developing mechanisms for mapping collaborations onto the new type of dynamically configurable, deeply networked institutional cloud infrastructure, where data is a first-class resource. By integrating data and resource management into a single system RADII aims at improving the productivity of research scientists and reducing time to discovery and operating costs.
RADII leverages two previous NSF investments: dynamic infrastructure technologies developed for NSF GENI initiative and the iRODS data grid system for maintaining large distributed datasets. The project is developing software tools to represent science collaborations using formal modeling mechanisms, map data processes, computations and storage onto physical infrastructure and flexibly manage the underlying infrastructure to optimize its utilization across multiple collaborations. The project uses several examples of collaborations from genomics to demonstrate how such collaborations can map onto infrastructure in a more flexible and cost effective way compared to today.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
Project RADII integrated two technologies, namely rule-based data management via integrated Rule-Oriented-Data-System (iRODS) and network-infrastructure-as-a-service (NIaaS) via Open Resource Control Architecture (ORCA) into a collaborative research platform that allows scientists (1) to describe data-centric collaborations including infrastructure and data governance requirements into novel infrastructure “appliance’ artifacts using a high-level language, (2) maps high-level requirements of the infrastructure into dynamic virtual infrastructure using ORCA, (3) maps data policies into rules that are enforced by iRODS, (4) manages the collaboration throughout its life-cycle including changes to data policies and infrastructure. RADII offered scientists with a turn-key experience to provision collaborative data-centric infrastructure while providing fined-grained control over its capabilities. Furthermore, in RADII we investigated novel Software-Defined-Networking (SDN) based features to meet application performance requirements. A prototype integration of RADII with HydroShare --a data sharing platform serving more than 1,800 hydrologists and funded by the NSF SI2-SSI program—demonstrated its potential to provide novel compute and storage infrastructure (public/on-premise cloud and public cyberinfrastructure) to existing scientific communities. Also, a network integration of an on-premise Cloud at the University of California San Diego (UCSD) and ExoGENI showed how RADII provides end-to-end network connectivity within and outside ExoGENI infrastructure.
The project took a novel approach to bridge data and networked management technologies. This work was well received by the networking and data science community and led to a successful workshop “Resource Aware Datacentric Collaborative Infrastructure” at IEEE BigData, 2015. We also published our work at BigData 2016 and NOMS 2018. The project funded the research work of multiple master and doctoral students. Finally, RADII was demonstrated at SC15 and SC17 and served as the foundation of new NSF funded projects.
The software components developed in RADII have been adopted in the software stack of SciDAS, a big data model implementation funded by the NSF CC* Data program. More specifically, through the integration of iRODS with networking infrastructure (SDN), we are able to optimize network transfers based on attributes encoded in the metadata to support novel capabilities such as network-aware caching. Similarly, the concept of infrastructure ‘appliance’ serves as a reproducibility artifact used in SciDAS and other emerging projects to describe complex infrastructure, applications and experiments using a high-level language that non-technical experts can understand.
Last Modified: 02/02/2018
Modified by: Claris Castillo
Please report errors in award information by writing to: awardsearch@nsf.gov.