Award Abstract # 1443080
CIF21 DIBBs: Scalable Capabilities for Spatial Data Synthesis

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: UNIVERSITY OF ILLINOIS
Initial Amendment Date: September 4, 2014
Latest Amendment Date: August 15, 2019
Award Number: 1443080
Award Instrument: Standard Grant
Program Manager: Amy Walton
awalton@nsf.gov
 (703)292-4538
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2014
End Date: September 30, 2020 (Estimated)
Total Intended Award Amount: $1,499,998.00
Total Awarded Amount to Date: $1,499,998.00
Funds Obligated to Date: FY 2014 = $1,499,998.00
History of Investigator:
  • Shaowen Wang (Principal Investigator)
    shaowen@illinois.edu
  • Katarzyna Keahey (Co-Principal Investigator)
  • Anand Padmanabhan (Co-Principal Investigator)
Recipient Sponsored Research Office: University of Illinois at Urbana-Champaign
506 S WRIGHT ST
URBANA
IL  US  61801-3620
(217)333-2187
Sponsor Congressional District: 13
Primary Place of Performance: University of Illinois at Urbana-Champaign
506 S. Wright Street
Urbana
IL  US  61801-3620
Primary Place of Performance
Congressional District:
13
Unique Entity Identifier (UEI): Y8CWNJRCNN91
Parent UEI: V2PHZ2CSCH63
NSF Program(s): Geography and Spatial Sciences,
Data Cyberinfrastructure
Primary Program Source: 01001415DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7433, 8048
Program Element Code(s): 135200, 772600
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

This project will develop a set of tools for spatial data synthesis through scalable data aggregation and integration based on cloud computing, CyberGIS, and other existing tools. Many scientific problems require the aggregation and integration of large and varied spatial data from a multitude of sources, yet existing approaches and software cannot effectively synthesize the enormous amounts of spatial data that often are available. This project will resolve problems associated with the use of massive spatial data, thus facilitating work dependent on this type of data for scientific problem solving, such as research on population dynamics and urban sustainability. Learning materials derived from the research activities will be openly accessible through the CyberGIS Science Gateway. Targeted massive open online course development will provide inexpensive and efficient ways to teaching students about the capabilities and underlying scientific principles of spatial data synthesis. A summer school will be offered during the second half of the project to provide a more focused and in-depth training event.

This research project will create scalable capabilities for spatial data synthesis enabled by cloud computing and CyberGIS. The project will begin by developing the capabilities for solving specific scientific problems and then move on to engage a broader community for validating and improving the core capabilities. The research will incorporate two interrelated themes: (1) measuring urban sustainability based on a number of social, environmental, and physical factors and processes; and (2) examining population dynamics by synthesizing multiple population data sources with social media data. Spatial data synthesis capabilities that the project will provide include extracting metadata and dealing with problems of spatial references and units. The project also will develop a fundamental capability to characterize uncertainty in data and its propagation.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 42)
Armstrong, Marc P. and Wang, Shaowen and Zhang, Zhe "The Internet of Things and fast data streams: prospects for geospatial data science in emerging information ecosystems" Cartography and Geographic Information Science , v.46 , 2018 10.1080/15230406.2018.1503973 Citation Details
Armstrong, M. P., Wang, S., and Zhang, Z "The Internet of Things and Fast Data Streams: Prospects for Geospatial Data Science in Emerging Information Ecosystems" Cartography and Geographic Information Science , 2018
Cai, Y., Guan, K., Lobell, D., Potgieter, A. B, Wang, S., Peng, J., Xu, T., Asseng, S., Zhang, Y., You, L., and Peng, B. "Integrating Satellite and Climate Data to Predict Wheat Yield in Australia Using Machine Learning Approaches" Agricultural and Forest Meteorology , 2019 https://doi.org/10.1016/j.agrformet.2019.03.010
Feng Liu, Kate Keahey, Pierre Riteau, Jon Weissman "Dynamically Negotiating Capacity Between On-demand and Batch Clusters." Supercomputing'17 Conference , 2017
Fu, Qiaobin, Nicholas Timkovich, Pierre Riteau, and Kate Keahey "A Step towards Hadoop Dynamic Scaling" The 20th IEEE International Conference on High Performance Computing and Communications (HPCC-2018) , 2018
Gao, Y., Wang, S., Padmanabhan, A., Yin, J., and Cao, G. "Mapping Spatiotemporal Patterns of Events Using Social Media: A Case Study of Influenza Trends" International Journal of Geographical Information Science , 2018
Helwig, N. E., Gao, Y., Wang, S., and Ma, P. "Analyzing Spatiotemporal Trends in Social Media Data via Smoothing Spline Analysis of Variance" Spatial Statistics , 2015 DOI: 10.1016/j.spasta.2015.09.002
Hu, Hao and Lin, Tao and Wang, Shaowen and Rodriguez, Luis F "A cyberGIS approach to uncertainty and sensitivity analysis in biomass supply chain optimization" Applied energy , v.203 , 2017 , p.26--40
Hu, Hao and Lin, Tao and Wang, Shaowen and Rodriguez, Luis F "A cyberGIS approach to uncertainty and sensitivity analysis in biomass supply chain optimization" Applied Energy , v.203 , 2017 , p.26--40 https://doi.org/10.1016/j.apenergy.2017.03.107
Hu, Hao and Yin, Dandong and Liu, Yan Y. and Terstriep, Jeff and Hong, Xingchen and Wendel, Jeff and Wang, Shaowen "TopoLens: Building a CyberGIS community data service for enhancing the usability of high-resolution national topographic datasets" Concurrency and Computation: Practice and Experience , v.0 , 2018 , p.e4682 10.1002/cpe.4682
Jeong, Myeong-Hun and Yin, Junjun and Wang, Shaowen "Outlier Detection and Comparison of Origin-Destination Flows Using Data Depth" LIPIcs-Leibniz International Proceedings in Informatics , v.114 , 2018
(Showing: 1 - 10 of 42)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The central goal of the NSF project: CIF21 DIBBs: Scalable Capabilities for Spatial Data Synthesis (Award #1443080) was to develop a set of capabilities for spatial data synthesis through scalable data aggregation and integration based on cloud computing, cyberGIS, and other cutting-edge cyberinfrastructure tools. 

The intellectual merit of the project is centered on not only the innovation of the spatial data synthesis capabilities, but also the enabling power of such capabilities for significantly advancing data-intensive geospatial problem-solving within various research and education domains. Specifically, the novel CyberGISX platform prototyped by the project addresses many of the challenges associated with using cutting-edge cyberGIS and data science capabilities for non-expert audience, including software installations, dependencies management and scaling to heterogeneous computing and cloud platforms. The CyberGISX platform has not only made it easy to share the knowledge about using the spatial data synthesis capabilities developed by the project, but also aids with the reproducibility and replicability of research activities thereby promoting open science practice. The project also has contributed to making extensive scientific advances, including for example studies of (a) population dynamics and urban sustainability by synthesizing social media and spatial data; (b) emergency evaluation using data-intensive agent-based modeling; and (c) assessment of spatial accessibility of healthcare resources such as ICU units and ventilators responding to the COVID-19 pandemic. The broad and significant research efforts made by the project have positively impacted the following interdisciplinary fields: 1) advanced cyberinfrastructure and geographic information science; 2) urban and social sciences; 3) emergency management; and 4) human-environment and geographic sciences.

The work done under this award has made tremendous broader impacts. The CyberGISX platform and spatial data synthesis capabilities created by the project have been made available to the general public, and broad science and engineering communities. Capabilities and tools on CyberGISX have been incorporated into a number of undergraduate and graduate courses which have helped to enhance and empower interdisciplinary and transdisciplinary education. The project provides scalable and easy to access spatial data synthesis capabilities for tackling computation and data challenges associated with using geospatial big data for research, which has benefited a number of disciplines. Examples of such impacts include (a) social science communities, through data-intensive social media research; (b) Earth and environment science communities, through the development of LiDAR processing capabilities leveraging a collaboration with the U.S. Geological Survey; (c) health geography community through timely research on spatial accessibility of healthcare resources during the COVID-19 pandemic; and (d) emergency management communities through agent-based evacuation modeling responding to disasters and groundbreaking research on flood mapping. The crosscutting nature of the project bridging advanced cyberinfrastructure and many domain science communities has proven to be greatly and mutually beneficial to the associated research and education communities.

This project has placed a major emphasis on education and workforce development, which is important to the future development of both advanced cyberinfrastructure and data-intensive geospatial sciences and technologies. The project was instrumental in holding two summer schools conducted in 2017 and 2019 each of which brought more than 30 participants together from across the country to learn cutting-edge cyberGIS and geospatial data science. The summer schools were conducted with active collaboration with the AAG (American Association of Geographers) and UCGIS (University Consortium of Geographic Information Sciences) which helped engage diverse participants. A number of hands-on workshops and training courses were offered on multiple campuses and at various conferences and meetings with diverse and underrepresented participants engaged. Many course modules and computational notebooks were developed to teach both graduate and undergraduate students cyberGIS-enabled spatial data synthesis capabilities that have been pioneered by the project. An online MOOC course on Foundations of CyberGIS and Geospatial Data Science has been developed and offered on Coursera. Overall, the project has created novel spatial data synthesis capabilities and associated tools that drastically lower the barriers to harnessing geospatial big data and advanced cyberinfrastructure in many geospatial-related communities for conducting open and reproducible research and education.


Last Modified: 12/14/2020
Modified by: Shaowen Wang

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page