Award Abstract # 1838024
BIGDATA: IA: Collaborative Research: Protecting Yourself from Wildfire Smoke: Big Data Driven Adaptive Air Quality Prediction Methodologies

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: BOARD OF REGENTS OF THE NEVADA SYSTEM OF HIGHER ED
Initial Amendment Date: September 7, 2018
Latest Amendment Date: July 26, 2022
Award Number: 1838024
Award Instrument: Standard Grant
Program Manager: Sylvia Spengler
sspengle@nsf.gov
 (703)292-7347
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: January 1, 2019
End Date: December 31, 2023 (Estimated)
Total Intended Award Amount: $684,689.00
Total Awarded Amount to Date: $684,689.00
Funds Obligated to Date: FY 2018 = $684,689.00
History of Investigator:
  • Lei Yang (Principal Investigator)
    leiy@unr.edu
  • Heather Holmes (Co-Principal Investigator)
  • Feng Yan (Former Principal Investigator)
  • Lei Yang (Former Co-Principal Investigator)
Recipient Sponsored Research Office: Board of Regents, NSHE, obo University of Nevada, Reno
1664 N VIRGINIA ST # 285
RENO
NV  US  89557-0001
(775)784-4040
Sponsor Congressional District: 02
Primary Place of Performance: Board of Regents, NSHE, obo University of Nevada, Reno
1664 North Virginia Street
Reno
NV  US  89557-0001
Primary Place of Performance
Congressional District:
02
Unique Entity Identifier (UEI): WLDGTNCFFJZ3
Parent UEI: WLDGTNCFFJZ3
NSF Program(s): Big Data Science &Engineering
Primary Program Source: 01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 062Z, 8083, 9150
Program Element Code(s): 808300
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

The objective of this project is to develop a framework to achieve real-time smoke transport prediction and air quality forecasting. Wildfire smoke can transport very fast and pose significant health hazards to communities. State-of-the-art smoke forecasting models typically have infrequent updates and provide predictions with a coarse spatial resolution due to spatiotemporal resolution limitations of input data and the tremendous computational power required to simulate atmospheric conditions. This project will develop real-time smoke transport and air quality prediction methodologies with better spatial resolution for improving the scalability and efficiency of the underlying data processing system to enable timely air quality alerts. While this project is applied towards smoke transport and air quality prediction, this work can be generalized to solve many other big data problems that require such design. The principal investigators will use the materials and topics from this project to enhance education by creating new big data analytics related courses and developing a Big Data Minor program at the University of Nevada, Reno. The project will also provide opportunities to engage more students from underrepresented groups and impact the education of several students, via K-12 outreach and mentoring undergraduate and graduate students.

The intellectual merit of this research is in establishing a novel big data driven air quality prediction for wildfire smoke to provide timely and effective health alerts. The planned new prediction methodology will integrate the novel Gaussian Markov Random Field based real-time spatiotemporal prediction with statistical-based long-term spatiotemporal prediction. To tackle the challenge of missing high-resolution data, a data fusion methodology is planned to integrate fine-grained image data collected from camera networks with air pollution monitoring data to increase data resolution. A Deep Neural Network based smoke density detection process will extract air quality information from camera image data. The planned novel signature time-series based prediction methodology will open opportunities to process larger amounts of spatiotemporal data using limited resources. By identifying critical data based on spatiotemporal properties, the project will develop a communication framework that enables efficient camera data transfer. Efficient parallel and distributed data processing is utterly important to support processing large scale data in real time. The planned decomposition-based parallelization methodology and a performance model driven scheduling framework will enable efficient dynamic computing resource management, which is key to the success of this project.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 66)
A. Ali, R. Pinciroli "BATCH: Machine Learning Inference Serving on Serverless Platforms with Adaptive Batching" 2020 SC20: International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Atlanta, GA, US, 2020 pp. 972-986. doi: 10.1109/SC41405.2020.00073 , v.1 , 2020 https://doi.org/10.1109/SC41405.2020.00073 Citation Details
Ali, Ahsan and Pinciroli, Riccardo and Yan, Feng and Smirni, Evgenia "CEDULE: A Scheduling Framework for Burstable Performance in Cloud Computing" 2018 IEEE International Conference on Autonomic Computing (ICAC) , 2018 10.1109/ICAC.2018.00024 Citation Details
Ali, Ahsan and Pinciroli, Riccardo and Yan, Feng and Smirni, Evgenia "It's not a Sprint, it's a Marathon: Stretching Multi-resource Burstable Performance in Public Clouds" Proceedings of the 20th International Middleware Conference (Middleware 2019) , 2019 10.1145/3366626.3368130 Citation Details
Ali, Ahsan and Pinciroli, Riccardo and Yan, Feng and Smirni, Evgenia "Optimizing inference serving on serverless platforms" Proceedings of the VLDB Endowment , v.15 , 2022 https://doi.org/10.14778/3547305.3547313 Citation Details
Al-Mamun, Abdullah and Yan, Feng and Zhao, Dongfang "BAASH: Lightweight, Efficient, and Reliable Blockchain-As-A-Service for HPC Systems" 2021 International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2021) , 2021 Citation Details
Al-Mamun, Abdullah and Yan, Feng and Zhao, Dongfang "SciChain: Blockchain-enabled Lightweight and Efficient Data Provenance for Reproducible Scientific Computing" 2021 IEEE 37th International Conference on Data Engineering (ICDE) , 2021 https://doi.org/10.1109/ICDE51399.2021.00166 Citation Details
Bai, Youhui and Li, Cheng and Zhou, Quan and Yi, Jun and Gong, Ping and Yan, Feng and Chen, Ruichuan and Xu, Yinlong "Gradient Compression Supercharged High-Performance Data Parallel DNN Training" The 28th ACM Symposium on Operating Systems Principles (SOSP 2021) , 2021 Citation Details
Chai, Zheng and Ali, Ahsan and Zawad, Syed and Truex, Stacey and Anwar, Ali and Baracaldo, Nathalie and Zhou, Yi and Ludwig, Heiko and Yan, Feng and Cheng, Yue "TiFL: A Tier-based Federated Learning System" Proceedings of the 29th International Symposium on High-Performance Parallel and Distributed Computing (HPDC 20) , 2020 10.1145/3369583.3392686 Citation Details
Cheng, H.-P. and Zhang, T. and Zhang, Y. and Li, S. and Liang, F. and Yan, F. and Li, M. and Chandra, V. and Li, H. and Chen, Y. "NASGEM: Neural Architecture Search via Graph Embedding Method" AAAI Conference on Artificial Intelligence (AAAI 2021) , 2021 Citation Details
Cheng, Hsin-Pai* and Huang, Yuanjun* and Guo, Xuyang* and Yan, Feng and Wen, Wei and Li, Hai and Chen, Yiran (*Equal "Differentiable Fine-grained Quantization for Deep Neural Network Compression" NIPS 2018 Workshop on Compact Deep Neural Networks with Industrial Applications (CDNNRIA) , 2018 Citation Details
Cheng, Hsin-Pai and Liang, Feng and Li, Meng and Cheng, Bowen and Yan, Feng and Li, Hai and Chandra, Vikas and Chen, Yiran "ScaleNAS: Multi-Path One-Shot NAS for Scale-Aware High-Resolution Representation" Proceedings of the AutoML Conference 2022 (co-located with ICML 2022) (AutoML 2022) , 2022 Citation Details
(Showing: 1 - 10 of 66)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Climate change has intensified the magnitude and frequency of wildfires and millions of Americans live in counties affected by wildfire smoke conditions, which pose significant health hazards to communities. This project aims to develop methodologies and tools for supporting real-time smoke transport and air quality prediction with better spatial resolution for improving the scalability and efficiency of the underlying data processing system to enable timely air quality alerts.

Under this common theme, the PIs and participants have completed the proposed research tasks. Research findings include a labeled wildfire smoke dataset with an open-sourced wildfire smoke detection benchmark, a camera data transportation framework to efficiently transmit camera data in real time, and agile parallel and distributed data processing methodologies to efficiently process vast amounts of data for real-time prediction. The results are disseminated to communities through presentations in scientific disciplinary meetings and peer-reviewed publications. The software and source codes for corresponding research outcomes have been released through Github, Bitbucket, and web portal. Moreover, the results are also integrated into the courses at both the undergraduate and graduate levels at UNR.

The research findings are expected to solve the big data challenge in smoke transport and air quality prediction using both domain knowledge and computing technologies and can be generalized to solve many other big data problems that require such design. Another major accomplishment of this project is to integrate research with educational activities and provide research assistantship and research project topics to both graduate and undergraduate students. Graduates supported by this project joined Microsoft Research, IBM Research, MathWorks, faculty at UNR, Oak Ridge National Laboratory, and Argonne National Laboratory.

 


Last Modified: 03/14/2024
Modified by: Lei Yang

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page