Award Abstract # 1443047
CIF21 DIBBs: Domain-Aware Management of Heterogeneous Workflows: Active Data Management for Gravitational-Wave Science Workflows

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: SYRACUSE UNIVERSITY
Initial Amendment Date: August 13, 2014
Latest Amendment Date: August 16, 2018
Award Number: 1443047
Award Instrument: Continuing Grant
Program Manager: Amy Walton
awalton@nsf.gov
 (703)292-4538
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2014
End Date: September 30, 2019 (Estimated)
Total Intended Award Amount: $900,000.00
Total Awarded Amount to Date: $1,078,712.00
Funds Obligated to Date: FY 2014 = $600,000.00
FY 2015 = $150,000.00

FY 2016 = $150,000.00

FY 2018 = $178,712.00
History of Investigator:
  • Duncan Brown (Principal Investigator)
    dabrown@syr.edu
  • Ewa Deelman (Co-Principal Investigator)
  • Jian Qin (Co-Principal Investigator)
  • Peter Couvares (Co-Principal Investigator)
Recipient Sponsored Research Office: Syracuse University
900 S CROUSE AVE
SYRACUSE
NY  US  13244-4407
(315)443-2807
Sponsor Congressional District: 22
Primary Place of Performance: Syracuse University
Syracuse
NY  US  13244-1200
Primary Place of Performance
Congressional District:
22
Unique Entity Identifier (UEI): C4BXLBC11LC6
Parent UEI:
NSF Program(s): PHYSICS AT THE INFO FRONTIER,
Data Cyberinfrastructure
Primary Program Source: 01001819DB NSF RESEARCH & RELATED ACTIVIT
01001617DB NSF RESEARCH & RELATED ACTIVIT

01001516DB NSF RESEARCH & RELATED ACTIVIT

01001415DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 062Z, 7569, 7433, 8048, 8084
Program Element Code(s): 755300, 772600
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Analysis and management of large data sets are vital for progress in the data-intensive realm of scientific research and education. Scientists are producing, analyzing, storing and retrieving massive amounts of data. The anticipated growth in the analysis of scientific data raises complex issues of stewardship, curation and long-term access. Scientific data is tracked and described by metadata. This award will fund the design, development, and deployment of metadata-aware workflows to enable the management of large data sets produced by scientific analysis. Scientific workflows for data analysis are used by a broad community of scientists including astronomy, biology, ecology, and physics. Making workflows metadata-aware is an important step towards making scientific results easier to share, to reuse, and to support reproducibility. This project will pilot new workflow tools using data from the Laser Interferometer Gravitational-wave Observatory (LIGO), a data-intensive project at the frontiers of astrophysics. The goal of LIGO is to use gravitational waves---ripples in the fabric of spacetime---to explore the physics of black holes and understand the nature of gravity.

Efficient methods for accessing and mining the large data sets generated by LIGO's diverse gravitational-wave searches are critical to the overall success of gravitational-wave physics and astronomy. Providing these capabilities will maximize existing NSF investments in LIGO, support new modes of collaboration within the LIGO Scientific Collaboration, and better enable scientists to explain their results to a wider community, including the critical issue of data and analysis provenance for LIGO's first detections. The interdisciplinary collaboration involved in this project brings together computational and informatics theories and methods to solve data and workflow management problems in gravitational-wave physics. The research generated from this project will make a significant contribution to the theory and methods in identification of science requirements, metadata modeling, eScience workflow management, data provenance, reproducibility, data discovery and analysis. The LIGO scientists participating in this project will ensure that the needs of the community are met. The cyberinfrastructure and data-management scientists will ensure that the software products are well-designed and that the work funded by this award is useful to a broader community.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 19)
Abbott, Benjamin P. and others "{GW170104: Observation of a 50-Solar-Mass Binary Black Hole Coalescence at Redshift 0.2}" Phys. Rev. Lett. , v.118 , 2017 , p.221101 10.1103/PhysRevLett.118.221101
Abbott, Benjamin P. and others "{GW170814: A Three-Detector Observation of Gravitational Waves from a Binary Black Hole Coalescence}" Phys. Rev. Lett. , v.119 , 2017 , p.141101 10.1103/PhysRevLett.119.141101
Abbott, Benjamin P. and others "{GW170817: Observation of Gravitational Waves from a Binary Neutron Star Inspiral}" Phys. Rev. Lett. , v.119 , 2017 , p.161101 10.1103/PhysRevLett.119.161101
Abbott, Benjamin P. and others "Upper limits on the rates of binary neutron star and neutron-star--black-hole mergers from Advanced LIGO's first observing run" The Astrophysical Journal , v.832 , 2016 , p.L21 10.3847/2041-8205/832/2/L21
Abbott, B. P. and others "Binary Black Hole Mergers in the first Advanced LIGO Observing Run" Phys. Rev. X , v.6 , 2016 , p.041015 10.1103/PhysRevX.6.041015
Abbott, B. P. and others "Characterization of transient noise in Advanced LIGO relevant to gravitational wave signal GW150914" Class. Quant. Grav. , v.33 , 2016 , p.134001 10.1088/0264-9381/33/13/134001
Abbott, B. P. and others "{Gravitational Waves and Gamma-rays from a Binary Neutron Star Merger: GW170817 and GRB 170817A}" Astrophys. J. , v.848 , 2017 , p.L13 10.3847/2041-8213/aa920c
Abbott, B. P. and others "GW150914: First results from the search for binary black hole coalescence with Advanced LIGO" Phys. Rev. , v.D93 , 2016 , p.122003 10.1103/PhysRevD.93.122003
Abbott, B. P. and others "GW151226: Observation of Gravitational Waves from a 22-Solar-Mass Binary Black Hole Coalescence" Phys. Rev. Lett. , v.116 , 2016 , p.241103 10.1103/PhysRevLett.116.241103
Abbott, B. P. and others "Observation of Gravitational Waves from a Binary Black Hole Merger" Phys. Rev. Lett. , v.116 , 2016 , p.061102 10.1103/PhysRevLett.116.061102
Abbott, B. P. and others "{GW170608: Observation of a 19-solar-mass Binary Black Hole Coalescence}" Astrophys. J. , v.851 , 2017 , p.L35 10.3847/2041-8213/aa9f0c
(Showing: 1 - 10 of 19)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Large-scale scientific workflows are essential to LIGO’s discoveries. To detect gravitational waves, LIGO data must be filtered through hundreds of thousands of signal models. This is repeated many times using simulated signals to measure the search’s efficiency and to diagnose and fix problems with the detectors. Searches are also run multiple times to tune the scientific parameters for maximum sensitivity. Analyses are run by teams of scientists in distributed locations and are executed using heterogeneous computing environments. LIGO was an early adopter of the Pegasus Workflow Management System (WMS)  and HTCondor for its binary black hole searches. This project built on the widely-used Pegasus WMS to address the problems encountered in large-scale, distributed scientific analysis. Advanced made possible by this project allowed Pegasus to manage LIGO workflow runs that were instrumental in the first direct detection of gravitational waves from colliding black holes, and the subsequent detection of colliding neutron stars observed both in gravitational wave and visible light spectrums.

The project team collaborated to develop new data management techniques in Pegasus and improved data access for LIGO workflows. This allowed LIGO to seamlessly execute large LIGO workflows across the  LIGO Data Grid and other NSF funded nation-wide computing infrastructures including the Open Science Grid (OSG) and XSEDE. We improved the Pegasus workflow monitoring dashboard for multi-user access, and improved visualization of workflow status and progress. These new capabilities proved important tools for both system administrators and the scientists running the workflows. The tools we developed provided valuable insight into the monitoring and error analysis of the workflows executed  to detect gravitational waves. A new web dashboard built on top of distributed data stores enables scientists and system administrators to get a holistic, global overview of the compact analysis workflows being run, monitor computational resource use, and identify trends and errors as they occur. With DIBBS-related improvements LIGO improved the turnaround of its offline binary search analysis from many weeks to only a few days. This speed increase proved essential for LIGO Scientific Collaboration to confirm the discoveries of GW150914 and GW170817, as well as other low-latency alerts that are generated and distributed to the astronomy community.

This award provided training for doctoral students who have subsequently moved to positions at national laboratories and in industry, furthering the development of the nation's STEM workforce.


Last Modified: 01/30/2020
Modified by: Duncan A Brown

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page