Award Abstract # 1541335
CC*DNI DIBBs: Multi-Institutional Open Storage Research InfraStructure (MI-OSiRIS)

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: REGENTS OF THE UNIVERSITY OF MICHIGAN
Initial Amendment Date: August 11, 2015
Latest Amendment Date: June 6, 2019
Award Number: 1541335
Award Instrument: Cooperative Agreement
Program Manager: Amy Walton
awalton@nsf.gov
 (703)292-4538
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: September 1, 2015
End Date: August 31, 2021 (Estimated)
Total Intended Award Amount: $4,918,411.00
Total Awarded Amount to Date: $4,958,411.00
Funds Obligated to Date: FY 2015 = $4,918,411.00
FY 2018 = $40,000.00
History of Investigator:
  • Shawn McKee (Principal Investigator)
    smckee@umich.edu
  • Kenneth Merz (Co-Principal Investigator)
  • Douglas Swany (Co-Principal Investigator)
  • Patrick Gossman (Co-Principal Investigator)
Recipient Sponsored Research Office: Regents of the University of Michigan - Ann Arbor
1109 GEDDES AVE STE 3300
ANN ARBOR
MI  US  48109-1015
(734)763-6438
Sponsor Congressional District: 06
Primary Place of Performance: University of Michigan Ann Arbor
450 Church St.
Ann Arbor
MI  US  48109-1040
Primary Place of Performance
Congressional District:
06
Unique Entity Identifier (UEI): GNJ7BBP73WE9
Parent UEI:
NSF Program(s): Data Cyberinfrastructure
Primary Program Source: 01001516DB NSF RESEARCH & RELATED ACTIVIT
01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 062Z, 7433, 8048, 9251
Program Element Code(s): 772600
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Every field of science generates and utilizes data in various forms: programs, instrument outputs, papers, notes, applications, simulations, video and audio recordings, etc. The continuing and evolving challenge for scientists is how to store, access, transform, manage and curate the variety of data required to effectively conduct their research, and transparently share it with other researchers across campus or at other institutions. The MI-OSiRIS project is addressing that challenge by combining an object-based software-defined storage technology with a monitored, managed network infrastructure to give scientists a distributed storage system which allows them to directly access their data from resources at any of the participating institutions. Furthermore, MI-OSiRIS utilizes each institution's existing authentication infrastructure to allow scientists to provide controlled access to their data across all participating institutions. By documenting and publishing designs, code, and operational experiences, the MI-OSiRIS project serves as a replicable model for supporting data-intensive, multi-institutional science collaborations.

MI-OSiRIS implements a Ceph-based petabyte-scale distributed data system by deploying object storage servers at each participating institution, connecting them via a managed high speed network, and distributing data based on the specific requirements of each science research domain. Ceph, an open source storage platform, supports multiple data access methods (traditional file, native object, and block), and allows configuration of access, replication, distribution, and integrity on a per-research-domain basis. MI-OSiRIS is built on low-cost, commodity hardware and can deliver gigabytes per second of I/O bandwidth per node. The system monitors and manages the network paths between its partner institutions, science users and Ceph storage components by strategically deploying perfSONAR instances which have been augmented with a network discovery, monitoring, and management platform (Network Management Abstraction Layer). Globus Online servers provide access to data from outside MI-OSiRIS. In addition, MI-OSiRIS leverages Ceph's software defined storage aspects to automate some data-lifecycle management tasks.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Jeremy Musser, Ezra Kissel, Martin Swany, Joe Breen, Jason Stidd, Shawn McKee and Ben Meekhof "Applying OSiRIS NMAL to Network Slices on SLATE" Proceedings of CHEP 2019 , 2020 https://doi.org/10.6084/m9.figshare.12743336.v1
Shawn McKee, Kenneth M. Merz Jr., Andrew Keen, Ben Meekhof, Ezra Kissel and Michael Thompson "OSiRIS: A Distributed Storage and Networking Project Update" CHEP 2019 Proceedings , 2020 https://doi.org/10.6084/m9.figshare.12894920

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The Open Storage Research Infrastructure (OSiRIS; NSF grant #1541335 ) project has provided a distributed storage infrastructure across the three largest research Universities in the state of Michigan since 2016.  It uses off-the-shelf hardware and open source software to contain acquisition costs.  OSiRIS has deployed over 13 Petabytes of raw storage distributed equally between the University of Michigan, Michigan State University and Wayne State University, using the open source Ceph storage system, advanced networking and institutional identity management to deliver a reliable and robust data storage and sharing platform which includes data replication across sites.  

 

During the 6 years of the project, OSiRIS has fostered collaboration within and across institutions by hosting petabytes of data from 16 different science groups, making scientific datasets accessible and usable far beyond the state of Michigan and even internationally.  OSiRIS has enabled its users to easily access and share large amounts of data within and beyond their scientific collaborations.  The project has documented its tools, technologies and methods in a whitepaper which allows others to recreate their own version of OSiRIS.   At project’s end, the three host Universities have committed to continuing to operate OSiRIS for at least another year, with future support contingent on the value the project brings to the institution’s faculty and researchers. 


 


Last Modified: 12/29/2021
Modified by: Shawn Mckee

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page