Award Abstract # 1935984
Mid-Scale RI-1: SAGE: A Software-Defined Sensor Network

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: NORTHWESTERN UNIVERSITY
Initial Amendment Date: September 17, 2019
Latest Amendment Date: August 11, 2023
Award Number: 1935984
Award Instrument: Standard Grant
Program Manager: Kevin Thompson
kthompso@nsf.gov
 (703)292-4220
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2019
End Date: February 29, 2024 (Estimated)
Total Intended Award Amount: $9,026,927.00
Total Awarded Amount to Date: $9,026,927.00
Funds Obligated to Date: FY 2019 = $9,026,927.00
History of Investigator:
  • Peter Beckman (Principal Investigator)
    peter.beckman@northwestern.edu
  • Ilkay Altintas (Co-Principal Investigator)
  • Eugene Kelly (Co-Principal Investigator)
  • Charles Catlett (Co-Principal Investigator)
  • Scott Collis (Co-Principal Investigator)
Recipient Sponsored Research Office: Northwestern University
633 CLARK ST
EVANSTON
IL  US  60208-0001
(312)503-7955
Sponsor Congressional District: 09
Primary Place of Performance: Northwestern University
2220 Campus Drive
Evanston
IL  US  60208-3120
Primary Place of Performance
Congressional District:
09
Unique Entity Identifier (UEI): EXZVPWZBLUE8
Parent UEI:
NSF Program(s): Information Technology Researc,
CYBERINFRASTRUCTURE
Primary Program Source: 01001920DB NSF RESEARCH & RELATED ACTIVIT
01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s):
Program Element Code(s): 164000, 723100
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Distributed, intelligent sensor networks that can collect and analyze data are essential to scientists seeking to understand the impacts of global urbanization, natural disasters such as flooding and wildfires, and climate change on natural ecosystems and city infrastructure. Sage is a pilot project that assembles sensor nodes to support machine learning frameworks and deploy them for rigorous testing in environmental testbeds in California, Colorado, and Kansas and in urban environments in Illinois and Texas. The reusable cyberinfrastructure running on these testbeds will give climate, traffic, and ecosystem scientists new data for building models to study these coupled systems. The software components developed in Sage are open source and provide an open architecture that will enable scientists from a wide range of fields to build their own intelligent sensor networks. The toolkit also extends the current educational curriculum used in Chicago and will inspire young people - with an emphasis on women and minorities -- to pursue science, technology, and mathematics careers by providing a platform for students to explore measurement-based science questions related to the natural and built environments.

The Sage project designs and builds new reusable software components and cyberinfrastructure services to enable deployment of intelligent environmental sensors. Geographically distributed sensor systems that include cameras, microphones, and weather and air quality stations can generate such large volumes of data that fast and efficient analysis is best performed by an embedded computer connected directly to the sensor. This project explores new techniques for applying machine learning algorithms to data from such intelligent sensors and builds reusable software that can run programs within the embedded computer and transmit the results over the network to central computer servers. The Sage project maintains links to computer source code, open hardware design documents, and sensor specifications, as well as both the raw and calibrated sensor data collected from all the testbed nodes at the website http://wa8.gl. The data is also be hosted in the cloud to facilitate easy data analysis. All project data is maintained for five years after the project ends.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Raut, Bhupendra A. and Muradyan, Paytsar and Sankaran, Rajesh and Jackson, Robert C. and Park, Seongha and Shahkarami, Sean A. and Dematties, Dario and Kim, Yongho and Swantek, Joseph and Conrad, Neal and Gerlach, Wolfgang and Shemyakin, Sergey and Beckma "Optimizing cloud motion estimation on the edge with phase correlation and optical flow" Atmospheric Measurement Techniques , v.16 , 2023 https://doi.org/10.5194/amt-16-1195-2023 Citation Details
Dematties, Dario and Raut, Bhupendra A. and Park, Seongha and Jackson, Robert C. and Shahkarami, Sean and Kim, Yongho and Sankaran, Rajesh and Beckman, Pete and Collis, Scott M. and Ferrier, Nicola "Lets Unleash the Network Judgment: A Self-Supervised Approach for Cloud Image Analysis" Artificial Intelligence for the Earth Systems , 2023 https://doi.org/10.1175/AIES-D-22-0063.1 Citation Details
Jackson, Robert_C and Raut, Bhupendra_A and Dematties, Dario and Collis, Scott_M and Ferrier, Nicola and Beckman, Pete and Sankaran, Rajesh and Kim, Yongho and Park, Seongha and Shahkarami, Sean and Newsom, Rob "ARMing the Edge: Designing Edge ComputingCapable Machine Learning Algorithms to Target ARM Doppler Lidar Processing" Artificial Intelligence for the Earth Systems , v.2 , 2023 https://doi.org/10.1175/AIES-D-22-0062.1 Citation Details
Park, Seongha and Kim, Yongho and Ferrier, Nicola J. and Collis, Scott M. and Sankaran, Rajesh and Beckman, Pete H. "Prediction of Solar Irradiance and Photovoltaic Solar Energy Product Based on Cloud Coverage Estimation Using Machine Learning Methods" Atmosphere , v.12 , 2021 https://doi.org/10.3390/atmos12030395 Citation Details
Yongho Kim, Seongha Park "Goal-driven scheduling model in edge computing for smart city applications" Journal of parallel and distributed computing , v.167 , 2022 https://doi.org/https://doi.org/10.1016/j.jpdc.2022.04.024 Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Sage pioneered and demonstrated cyberinfrastructure to support scientific user code running securely on remote systems "at the edge."  This enabled students and scientists to process full resolution instrument and sensor data, using AI to create new scientific measurements. One user might use camera input and an AI model to count pedestrians; another might use the inputs to classify weather patterns. This flexible, programmable approach to processing data, beginning directly at the sensor, represents a computing continuum run-time system linking cloud computing to the edge.

 

The Sage open source software framework integrated and operated a national fleet of over 120 AI-enabled nodes. The resulting body of software, community code development processes, and scientific experiments strengthened and deepened the Sage software reliability, ease-of-use, and critical system functions including cybersecurity, reliable data movement, and edge application management.

 

Sage also created open hardware designs for AI computation at the edge, including both outdoor-hardened and commodity rack-mounted nodes deployed across the country, providing the scientific community with a robust reference platform for supporting AI computation linked to their selection of instruments and sensors.   

 

Partnering with ARM Research, Sage built the edge programming environment using robust, industry standard software layers including software containers (Docker), process orchestration (Kubernetes), AI and computer vision toolkits (TensorFlow, PyTorch, and OpenCV), Python notebooks (Jupyter), and reliable messaging (RabbitMQ). Integrating these cloud-native software tools into the edge AI environment provides users with a familiar AI programming environment. Security features built into the Linux operating system enabled novel, advanced Sage cybersecurity policies to ensure, for example, that user codes could introduce network vulnerabilities, relying exclusively on Sage encrypted data messaging layers.  All of the code developed in the partnership is open source and published on GitHub.

 

Sage also created an open Edge Code Repository (ECR) to build and store reusable codes to be deployed to the edge, empowering students to develop new or improved features atop tested applications, and scientists to share AI codes and techniques. Each code in the ECR is automatically built and tested in the cloud, with full documentation. Concurrently, the Sage portal provides tools for integrating new sensors, viewing live data streams, downloading datasets, and scheduling AI@Edge jobs---deepening the catalog of integrated services for Sage users.

 

Early edge-AI adopter communities–essential to validating the Sage cyberinfrastructure, architecture, software interfaces, and usability–represent diverse domains including computer science, climate and atmospheric science, engineering, biology, microbiology, ecology, and sociology---spanning five NSF directorates. Each provided unique design input and usability feedback as Sage was built and deployed, contributing to a unique national AI cyberinfrastructure.  Disciplinary partners include the U.S. National Ecological Observation Network (NEON) that samples the biosphere at a continental scale; the urban science community launched by the Array of Things to measure urban environments, air quality, and human activity; the Atmospheric Radiation Measurement (ARM) User Facility measuring key climate processes; the High Performance Wireless Research and Education Network (HPWREN) that provides real-time measurements of natural phenomena in southern California, from wildfires to earthquakes; the Oregon Hazards Laboratory (OHAZ) that participates in the ShakeAlert and ALERTWildfire network; the Nature Conservancy that measures wildlife; and the Great Lakes Indian Fish and Wildlife Commission (GLIFWC) to study wild rice in partnership with Ojibwe tribes. These partners leveraged Sage and AI@Edge to provide new, advanced data products for research not previously possible.

 

Today, Sage is a robust, distributed national network of nodes deployed in both harsh outdoor settings and in protected instrument huts.  Within each partner’s privacy policy and legal constraints, Sage collects image and audio training data, creating training libraries  for collaborators developing AI models.  These libraries have been used to train AI models for such tasks as detecting floods, estimating snow coverage, and estimating biodiversity with audio. More than 1 billion data records and over 25 terabytes of training data have been collected. Online tutorials, sample codes, and a dynamic online community (via discussion forums and Slack) have enabled users to download and install Sage without assistance from the core development team.  Users have added new instruments to Sage, such as lidar, software-defined radios, an amateur radio telescope, an ultrasonic anemometer, and camera pan-tilt units to explore AI for cyberphysical control systems.

 

Sage will continue to impact science.  Computer scientists can explore novel AI approaches and approaches for evaluating AI model performance, while cientists in other disciplines can focus on developing and using machine learning algorithms to analyze high-volume sensor data streams, relying on Sage critical cyberinfrastructure for data movement, networking, cybersecurity, and compute resources.  Dozens of students have contributed to Sage, gaining research skills at the nexus of cyberinfrastructure and domain science applications, inspiring the pursuit of STEM careers by providing a platform for exploring measurement-based science questions related to the natural and built environments, creating applications that focus on social or humanitarian goals relevant to them.


Last Modified: 05/29/2024
Modified by: Peter H Beckman

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page