
NSF Org: |
OAC Office of Advanced Cyberinfrastructure (OAC) |
Recipient: |
|
Initial Amendment Date: | August 11, 2015 |
Latest Amendment Date: | June 6, 2019 |
Award Number: | 1541335 |
Award Instrument: | Cooperative Agreement |
Program Manager: |
Amy Walton
awalton@nsf.gov (703)292-4538 OAC Office of Advanced Cyberinfrastructure (OAC) CSE Directorate for Computer and Information Science and Engineering |
Start Date: | September 1, 2015 |
End Date: | August 31, 2021 (Estimated) |
Total Intended Award Amount: | $4,918,411.00 |
Total Awarded Amount to Date: | $4,958,411.00 |
Funds Obligated to Date: |
FY 2018 = $40,000.00 |
History of Investigator: |
|
Recipient Sponsored Research Office: |
1109 GEDDES AVE STE 3300 ANN ARBOR MI US 48109-1015 (734)763-6438 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
450 Church St. Ann Arbor MI US 48109-1040 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | Data Cyberinfrastructure |
Primary Program Source: |
01001819DB NSF RESEARCH & RELATED ACTIVIT |
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
Every field of science generates and utilizes data in various forms: programs, instrument outputs, papers, notes, applications, simulations, video and audio recordings, etc. The continuing and evolving challenge for scientists is how to store, access, transform, manage and curate the variety of data required to effectively conduct their research, and transparently share it with other researchers across campus or at other institutions. The MI-OSiRIS project is addressing that challenge by combining an object-based software-defined storage technology with a monitored, managed network infrastructure to give scientists a distributed storage system which allows them to directly access their data from resources at any of the participating institutions. Furthermore, MI-OSiRIS utilizes each institution's existing authentication infrastructure to allow scientists to provide controlled access to their data across all participating institutions. By documenting and publishing designs, code, and operational experiences, the MI-OSiRIS project serves as a replicable model for supporting data-intensive, multi-institutional science collaborations.
MI-OSiRIS implements a Ceph-based petabyte-scale distributed data system by deploying object storage servers at each participating institution, connecting them via a managed high speed network, and distributing data based on the specific requirements of each science research domain. Ceph, an open source storage platform, supports multiple data access methods (traditional file, native object, and block), and allows configuration of access, replication, distribution, and integrity on a per-research-domain basis. MI-OSiRIS is built on low-cost, commodity hardware and can deliver gigabytes per second of I/O bandwidth per node. The system monitors and manages the network paths between its partner institutions, science users and Ceph storage components by strategically deploying perfSONAR instances which have been augmented with a network discovery, monitoring, and management platform (Network Management Abstraction Layer). Globus Online servers provide access to data from outside MI-OSiRIS. In addition, MI-OSiRIS leverages Ceph's software defined storage aspects to automate some data-lifecycle management tasks.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
The Open Storage Research Infrastructure (OSiRIS; NSF grant #1541335 ) project has provided a distributed storage infrastructure across the three largest research Universities in the state of Michigan since 2016. It uses off-the-shelf hardware and open source software to contain acquisition costs. OSiRIS has deployed over 13 Petabytes of raw storage distributed equally between the University of Michigan, Michigan State University and Wayne State University, using the open source Ceph storage system, advanced networking and institutional identity management to deliver a reliable and robust data storage and sharing platform which includes data replication across sites.
During the 6 years of the project, OSiRIS has fostered collaboration within and across institutions by hosting petabytes of data from 16 different science groups, making scientific datasets accessible and usable far beyond the state of Michigan and even internationally. OSiRIS has enabled its users to easily access and share large amounts of data within and beyond their scientific collaborations. The project has documented its tools, technologies and methods in a whitepaper which allows others to recreate their own version of OSiRIS. At project’s end, the three host Universities have committed to continuing to operate OSiRIS for at least another year, with future support contingent on the value the project brings to the institution’s faculty and researchers.
Last Modified: 12/29/2021
Modified by: Shawn Mckee
Please report errors in award information by writing to: awardsearch@nsf.gov.