Award Abstract # 0940824
DataNet Full Proposal: Sustainable Environment through Actionable Data (SEAD)

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: REGENTS OF THE UNIVERSITY OF MICHIGAN
Initial Amendment Date: September 27, 2011
Latest Amendment Date: April 26, 2017
Award Number: 0940824
Award Instrument: Cooperative Agreement
Program Manager: Amy Walton
awalton@nsf.gov
 (703)292-4538
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2011
End Date: September 30, 2017 (Estimated)
Total Intended Award Amount: $8,000,000.00
Total Awarded Amount to Date: $8,074,920.00
Funds Obligated to Date: FY 2011 = $3,503,839.00
FY 2012 = $80,000.00

FY 2013 = $416,161.00

FY 2014 = $2,074,920.00

FY 2015 = $2,000,000.00
History of Investigator:
  • Margaret Hedstrom (Principal Investigator)
    hedstrom@umich.edu
  • Praveen Kumar (Co-Principal Investigator)
  • George Alter (Co-Principal Investigator)
  • Beth Plale (Co-Principal Investigator)
  • James Myers (Co-Principal Investigator)
Recipient Sponsored Research Office: Regents of the University of Michigan - Ann Arbor
1109 GEDDES AVE STE 3300
ANN ARBOR
MI  US  48109-1015
(734)763-6438
Sponsor Congressional District: 06
Primary Place of Performance: Regents of the University of Michigan - Ann Arbor
1109 GEDDES AVE STE 3300
ANN ARBOR
MI  US  48109-1015
Primary Place of Performance
Congressional District:
06
Unique Entity Identifier (UEI): GNJ7BBP73WE9
Parent UEI:
NSF Program(s): Data Cyberinfrastructure
Primary Program Source: 01001213DB NSF RESEARCH & RELATED ACTIVIT
01001415DB NSF RESEARCH & RELATED ACTIVIT

01001314DB NSF RESEARCH & RELATED ACTIVIT

01001516DB NSF RESEARCH & RELATED ACTIVIT

01001112DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7433, 7726
Program Element Code(s): 772600
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Abstract:
Award Number ? 0940824
Title: DataNet Full Proposal: Sustainable Environment through Actionable Data (SEAD)

The universities of Michigan, Indiana, and Illinois propose a DataNet partnership called Sustainable Environment through Actionable Data (SEAD). SEAD will enable new modalities of sustainability science - the study of dynamic interactions between nature and society. Advancing the science of sustainability requires integration of social science, natural science, and environmental data at multiple spatial and temporal scales that is rich in local and location-specific observations; referenced for regional, national, and global comparability and scale; and integrated to enable end users to detect interactions among multiple phenomena. SEAD will respond to the expressed needs of sustainability science researchers for long-term management of heterogeneous data by developing new capabilities for data integration, dissemination, and long-term preservation. SEAD will provide researchers with tools for active curation and use social networking to engage data producers and users in community curation, gradually shifting curatorial and collection development responsibilities from professional curators to the producer and user communities. Our focus is on the "long tail" of social and environmental data: derived data products, data collections from individual PI's and small group investigations, and data sets of local, regional or topical significance that are critical to sustainability science but are of limited value until they can be referenced geo-spatially and temporally, combined with related data and observations, and modeled consistently. SEAD will make data accessible to diverse users, including domain scientists, local, national and international policy makers, manufacturers of sustainable technologies, citizen scientists, and informed consumers. SEAD will take advantage of existing robust digital library and institutional repository (IR) infrastructures at the three universities for access, storage, and preservation to ensure wide accessibility of data, linkages between data and scientific publications, and persistence.

SEAD will serve researchers efficiently and in a financially sustainable way via active curation, make innovative use of social networking, integrate data with existing digital library infrastructures, and provide synthesis services that significantly increase the research and societal value of data. Our work will establish a new active curation paradigm that can be readily integrated into the scientific workflow and that leverages social networking technologies to engage the science community in data curation. Our research program will produce novel solutions to the synthesis of heterogeneous data across different levels of spatio-temporal granularity and scope; management of logical contexts and data models; appropriate sharing of data with privacy and proprietary restrictions; and preservation through emulation and migration-based-technologies and policies for distributed stewardship. Our cyberinfrastructure development work will support a network of repositories that functions on several levels: locally through integration of SEAD data into campus digital library/repository infrastructures, inter-institutionally through a model for distributed data curation and storage, and nationally and internationally by extending our approach to other IRs, other DataNet Partners, sensor and observational networks, and topical data archives. Our financial sustainability plan will identify appropriate incentive mechanisms and business models based on a tight coupling of preservation and access services with research library managed IR infrastructure and ongoing involvement of scientists and users.

SEAD will build national and global capabilities for science-informed sustainability policy and planning in land use, natural resource management, agriculture, energy, economic development, "green" manufacturing, and related areas where critical decisions will be made in the next decade. The project will engage the community that preserves and shares scientific data, thus enhancing the public investment in scientific research and making taxpayer funded data widely available and easier to use which will provide high-value cost-effective curation and preservation capabilities through partnerships with other "small science" domains.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Dharma Akmon "NSF DataNet Partners Update" Bulletin of the Association for Information Science and Technology , 2014
Inna Kouper, Yu Luo, Isuru Suriarachchi, Beth Plale "Provenance Enriched PID Kernel Information as OAI-ORE Map Replacement for SEAD Research Objects" ACM/IEEE Joint Conference on Digital Libraries (JCDL) , 2017
Isuru Suriarachchi and Beth Plale "Crossing Analytics Systems: A Case for Integrated Provenance in Data Lakes" IEEE 12th International Conference on eScience , 2016
Kumar, Praveen "{Hydrocomplexity: Addressing water security and emergent environmental risks}" Water Resources Research , 2015 , p.5827--583
Mostafa M. Elag, Praveen Kumar, Luigi Marini, James Myers, Margaret Hedstrom, Beth Plale "Identification and characterization of information-networks in long-tail data collections" Environmental Modeling and Software , v.94 , 2016 DOI: 10.1016/j.envsoft.2017.03.032
Myers, James "{Towards Sustainable Curation and Preservation: The SEAD Project?s Data Services Approach}" eScience, IT4RIs Workshop, September 2015 , 2015
Myers, J., Hedstrom, M., Akmon, D., Payette, S., Plate, B., Kouper, I., McCaulay, S., McDonald, R., Suriarachchi, I., Varadharaju, A., Kumar, P., Elag, M., Lee, J., Kooper, R., Marini, L. "Towards sustainable curation and preservation: The SEAD project?s data services approach." Proceedings - 11th IEEE International Conference on eScience, eScience 2015 , 2015 , p.485 10.1109/eScience.2015.56
Robert H. McDonald, Kavitha Chandrasekar, Inna Kouper, Stacy Konkiel, Margaret L. Hedstrom, James Myers, Praveen Kumar "SEAD Virtual Archive: Building a Federation of Institutional Repositories for Long-Term Data Preservation in Sustainability Science" International Journal of Digital Curation , v.8 , 2013 , p.172 doi:10.2218/ijdc.v8i2.281
Suriarachchi, Isuru "{Komadu: A Capture and Visualization System for Scientific Data Provenance}" Journal of Open Research Software , 2015

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The SEAD project developed and implemented a new approach to sharing and archiving data that makes it easier for individual researchers and small projects to take advantage of secure storage, easily add metadata, control access to active data, and publish data.  The project created a first-of-its-kind hosted, self-service capability for publishing large amounts of data using custom metadata without the need for project-specific cyberinfrastructure.  It enabled the publication of more than 50 data collections, including millions of files and terabytes of data, and it provided services to more than 40 groups (and more 200 registered users) over 6 years.  The project’s contributions to cyberinfrastructure for access to and long-term preservation of research data include development and implementation of an Active and Social Curation model and supporting technical infrastructure.  The SEAD project designed its active project spaces so that people working on projects could share their data with other project participants, comment on and annotate data, upload new versions of data sets, add and correct metadata associated with the data, and establish relationships among files. The project developed a curation and publication workflow that supports rich semantic annotation of datasets, and folder- and file-level components of datasets so that they can be transferred to repositories and preserved using best-practice standards for packaging (Library of Congress’ BagIT), metadata serialization (Open Archives Initiative Object Reuse and Exchange serialized as JSON-LD), and common metadata (i.e. Dublin Core for bibliographic information, and W3C Prov for provenance).  These contributions are documented in 60 publications and they were further disseminated by members of the SEAD team through 116 presentations, including 23 posters, at conferences and professional meetings; and at 38 workshops. In the final stage of the project responsibility for maintenance and further development of the SEAD code was transferred to the National Data Service.  This will enable other repositories in the future to ingest data from SEAD project sites.

 


Last Modified: 12/28/2017
Modified by: Margaret L Hedstrom

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page