Skip to feedback

Award Abstract # 2313122
Collaborative Research: OAC Core: Topology-Aware Data Compression for Scientific Analysis and Visualization

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: UNIVERSITY OF KENTUCKY RESEARCH FOUNDATION, THE
Initial Amendment Date: August 24, 2023
Latest Amendment Date: August 24, 2023
Award Number: 2313122
Award Instrument: Standard Grant
Program Manager: Sharmistha Bagchi-Sen
shabagch@nsf.gov
 (703)292-8104
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: September 1, 2023
End Date: August 31, 2026 (Estimated)
Total Intended Award Amount: $201,765.00
Total Awarded Amount to Date: $201,765.00
Funds Obligated to Date: FY 2023 = $201,765.00
History of Investigator:
  • Xin Liang (Principal Investigator)
    xliang@uky.edu
Recipient Sponsored Research Office: University of Kentucky Research Foundation
500 S LIMESTONE
LEXINGTON
KY  US  40526-0001
(859)257-9420
Sponsor Congressional District: 06
Primary Place of Performance: University of Kentucky Research Foundation
500 S LIMESTONE 109 KINKEAD HALL
LEXINGTON
KY  US  40526-0001
Primary Place of Performance
Congressional District:
06
Unique Entity Identifier (UEI): H1HYA8Z1NTM5
Parent UEI:
NSF Program(s): OAC-Advanced Cyberinfrast Core
Primary Program Source: 01002324DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 026Z, 7923, 9102, 9150
Program Element Code(s): 090Y00
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Today's large-scale simulations are producing vast amounts of data that are revolutionizing scientific thinking and practices. For instance, a fusion simulation can produce 200 petabytes of data in a single run, while a climate simulation can generate 260 terabytes of data every 16 seconds with a 1 square kilometer resolution. As the disparity between data generation rates and available I/O bandwidths continues to grow, data storage and movement are becoming significant bottlenecks for extreme-scale scientific simulations in terms of in situ and post hoc analysis and visualization. The disparity necessitates data compression, which compresses large-scale simulations data in situ, and decompresses data in situ and/or post hoc for analysis and exploration. On the other hand, a critical step in extracting insight from large-scale simulations involves the definition, extraction, and evaluation of features of interest. Topological data analysis has provided powerful tools to capture features from scientific data in turbulent combustion, astronomy, climate science, computational physics and chemistry, and ecology. While lossy compression is leveraged to address the big data challenges, most existing lossy compressors are agnostic of and thus fail to preserve topological features that are essential to scientific discoveries. This project aims to research and develop advanced lossy compression techniques and software that preserve topological features in data for in situ and post hoc analysis and visualization at extreme scales. The success of this project will promote scientific research on driving applications in cosmology, climate, and fusion by enabling efficient and effective compression for scientific data, and the impact scales to other science and engineering disciplines. Furthermore, the research products of this project will be integrated into visualization and parallel processing curricula, disseminated via research and training workshops, and used to attract underrepresented students for broadening participation in computing.

This project tackles the data compression, analysis, and visualization needs in extreme-scale scientific simulations by developing a suite of topology-aware data compression algorithms for scalar field and vector field data. Such algorithms effectively reduce the size of data while preserving critical features defined by topological notions. This project will define and enforce topology-aware constraints over advanced lossy compression algorithms. Such capabilities have not been studied systematically within today?s data compression paradigm. This project will impact specific fields, including computational science, data analysis, data compression, and visualization, and the broader scientific community. The research products of this project will be delivered as publicly available software to significantly advance the research cyberinfrastructure for current and upcoming exascale systems. This project will foster novel discoveries in multiple scientific disciplines beyond cosmology, climate, and fusion by enabling efficient and effective compression on a wide range of platforms.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Jian, Zizhe and Di, Sheng and Liu, Jinyang and Zhao, Kai and Liang, Xin and Xu, Haiying and Underwood, Robert and Wu, Shixun and Huang, Jiajun and Chen, Zizhong and Cappello, Franck "CliZ: Optimizing Lossy Compression for Climate Datasets with Adaptive Fine-tuned Data Prediction" , 2024 https://doi.org/10.1109/IPDPS57955.2024.00044 Citation Details
Li, Yuxiao and Liang, Xin and Wang, Bei and Qiu, Yongfeng and Yan, Lin and Guo, Hanqi "MSz: An Efficient Parallel Algorithm for Correcting Morse-Smale Segmentations in Error-Bounded Lossy Compressors" IEEE Transactions on Visualization and Computer Graphics , v.31 , 2025 https://doi.org/10.1109/TVCG.2024.3456337 Citation Details
Ren, Congrong and Liang, Xin and Guo, Hanqi "A PredictionTraversal Approach for Compressing Scientific Data on Unstructured Meshes with Bounded Error" Computer Graphics Forum , v.43 , 2024 https://doi.org/10.1111/cgf.15097 Citation Details
Xia, Mingze and Di, Sheng and Cappello, Franck and Jiao, Pu and Zhao, Kai and Liu, Jinyang and Wu, Xuan and Liang, Xin and Guo, Hanqi "Preserving Topological Feature with Sign-of-Determinant Predicates in Lossy Compression: A Case Study of Vector Field Critical Points" , 2024 https://doi.org/10.1109/ICDE60146.2024.00378 Citation Details

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page