Award Abstract # 1910017
SHF: Small: Collaborative Research: Fuzzing Cyber-Physical System Development Tool Chains with Deep Learning (DeepFuzz-CPS)

NSF Org: CCF
Division of Computing and Communication Foundations
Recipient: VANDERBILT UNIVERSITY
Initial Amendment Date: July 29, 2019
Latest Amendment Date: July 29, 2019
Award Number: 1910017
Award Instrument: Standard Grant
Program Manager: Sol Greenspan
CCF
 Division of Computing and Communication Foundations
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2019
End Date: September 30, 2023 (Estimated)
Total Intended Award Amount: $248,445.00
Total Awarded Amount to Date: $248,445.00
Funds Obligated to Date: FY 2019 = $248,445.00
History of Investigator:
  • Taylor Johnson (Principal Investigator)
    taylor.johnson@gmail.com
Recipient Sponsored Research Office: Vanderbilt University
110 21ST AVE S
NASHVILLE
TN  US  37203-2416
(615)322-2631
Sponsor Congressional District: 05
Primary Place of Performance: Vanderbilt University
1025 16th Avenue South, Room 401
Nashville
TN  US  37212-1234
Primary Place of Performance
Congressional District:
05
Unique Entity Identifier (UEI): GTNBNWXJ12D5
Parent UEI: K9AHBDTKCB55
NSF Program(s): Software & Hardware Foundation,
CPS-Cyber-Physical Systems
Primary Program Source: 01001920DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7918, 7923, 7944
Program Element Code(s): 779800, 791800
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Developing a modern technical product such as a car, plane, or a complex medical device includes designing the complex interplay between sensors (which measure physical product and environment state) and actuators (such as small electric motors that control the product). To design this interplay, engineers rely on complex design software tools. This project will address two problems these engineers face. (1) First, little systematic knowledge of the design tools or the resulting designs is available to guide engineers. For example, little is known about how basic design properties (such as various design size measures) relate to design quality attributes (such as design complexity and comprehensibility). This project will thus collect and analyze a large number of publicly available designs to build such knowledge. (2) Second, since the design tools are complex they can contain software bugs. These bugs may in turn silently introduce bugs into widely-deployed safety-critical systems, since product control software generated from designs is often deployed in safety-critical environments. Bugs in such systems often lead to costly product recalls and may have serious consequences. This project will thus develop techniques for automatically finding software bugs in such design tools.

This project consists of the following three major components. (1) First, this project will build the largest curated corpus of publicly available cyber-physical system models and related artifacts. Preliminary results analyzing this corpus both confirms and contradicts earlier findings that are based on significantly fewer models, suggesting the utility of a large corpus for future research. (2) Second, to side-step the age-old problem of missing complete formal specifications of cyber-physical system tool chains, this project instead will design a novel scheme to infer the cyber-physical system language validity rules via deep learning from the project's model corpus. Sampling the deep learner will enable generating additional models for the researchers' existing differential cyber-physical system tool chain testing infrastructure. (3) Third, this project will supplement the deep learner's training set via the first systematic cyber-physical system-model mutation scheme based on equivalence modulo inputs. Initial experiments have found several bugs in a commercial cyber-physical system tool chain that have been confirmed by the vendor of the tool chain.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 14)
Bao, Tianshu and Jia, Xiaowei and Zwart, Jacob and Sadler, Jeffrey and Appling, Alison and Oliver, Samantha and Johnson, Taylor T. "Partial Differential Equation Driven Dynamic Graph Networks for Predicting Stream Water Temperature" 2021 IEEE International Conference on Data Mining (ICDM) , 2021 https://doi.org/10.1109/ICDM51629.2021.00011 Citation Details
Brix, Christopher and Müller, Mark Niklas and Bak, Stanley and Johnson, Taylor T. and Liu, Changliu "First three years of the international verification of neural networks competition (VNN-COMP)" International Journal on Software Tools for Technology Transfer , 2023 https://doi.org/10.1007/s10009-023-00703-4 Citation Details
Diego Manzanas Lopez and Sung Woo Choi and Hoang-Dung Tran and Taylor T. Johnson "NNV 2.0: The Neural Network Verification Tool" Computer Aided Verification , 2023 Citation Details
Hamilton, Nathaniel and Musau, Patrick and Lopez, Diego Manzanas and Johnson, Taylor T. "Zero-Shot Policy Transfer in Autonomous Racing: Reinforcement Learning vs Imitation Learning" 2022 IEEE International Conference on Assured Autonomy (ICAA) , 2022 https://doi.org/10.1109/ICAA52185.2022.00011 Citation Details
Manzanas Lopez, D. and Musau, P. and Hamilton, N.P. and Johnson, T.T. "Reachability Analysis of a General Class of Neural Ordinary Differential Equations" Formal Modeling and Analysis of Timed Systems. FORMATS 2022 , v.13465 , 2022 https://doi.org/10.1007/978-3-031-15839-1_15 Citation Details
Manzanas_Lopez, Diego and Althoff, Matthias and Benet, Luis and Blab, Clemens and Forets, Marcelo and Jia, Yuhao and Johnson, Taylor T and Kranzl, Manuel and Ladner, Tobias and Linauer, Lukas and Neubauer, Philipp and Neubauer, Sophie and Schilling, Chris "ARCH-COMP24 Category Report: Artificial Intelligence and Neural Network Control Systems (AINNCS) for Continuous and Hybrid Systems Plants" , 2024 https://doi.org/10.29007/mxld Citation Details
Manzanas Lopez, Diego and Johnson, Taylor T. and Bak, Stanley and Tran, Hoang-Dung and Hobbs, Kerianne L. "Evaluation of Neural Network Verification Methods for Air-to-Air Collision Avoidance" Journal of Air Transportation , v.31 , 2023 https://doi.org/10.2514/1.D0255 Citation Details
Musau, Patrick and Hamilton, Nathaniel and Lopez, Diego Manzanas and Robinette, Preston and Johnson, Taylor T. "On Using Real-Time Reachability for the Safety Assurance of Machine Learning Controllers" 2022 IEEE International Conference on Assured Autonomy (ICAA) , 2022 https://doi.org/10.1109/ICAA52185.2022.00010 Citation Details
Neelanjana Pal and Diego Manzanas Lopez and Taylor T Johnson "Robustness Verification of Deep Neural Networks Using Star-Based Reachability Analysis with Variable-Length Time Series Input" Formal Methods for Industrial Critical Systems. FMICS 2023 , 2023 Citation Details
Pal, Neelanjana and Lee, Seojin and Johnson, Taylor "Benchmark: Formal Verification of Semantic Segmentation Neural Networks" AISoLA 2023 , 2023 Citation Details
Tran, H and Pal, N and Musau, P and Lopez, D and Hamilton, N and Yang, X and Bak, S and Johnson, T "Robustness Verification of Semantic Segmentation Neural Networks Using Relaxed Reachability" International Conference on Computer Aided Verification , 2021 Citation Details
(Showing: 1 - 10 of 14)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Cyber-physical systems (CPS) such as motor vehicles, medical devices, and aerospace systems increasingly are built with design automation tool chains that encompass design-time modeling through implementation and deployment of auto-generated code to embedded computing devices. All stages of this process (simulation, code generation, compilation for the target hardware, etc.) have the potential to introduce software defects and bugs in the resulting software executed during operation of the CPS. Given these risks, this project investigated three major objectives. Objective A is to build and maintain a large-scale CPS model corpus and analyze model metrics and modeling practice. Objectives B and C generate new CPS models via deep learning (C) and equivalence modulo input (EMI)-based mutation (B), as input for differential testing to find CPS tool chain bugs. We evaluate these objectives using the MathWorks' Simulink/Stateflow, a prototypical and widely-used CPS development environment.

The outcomes of the project include numerous peer reviewed publications in prestigious venues, education of student researchers, and engagement with relevant industry partners to transition the methodologies and technologies developed. For example, the EMI fuzzing methods and deep learning fuzzing methods found novel confirmed code generation bugs in the CPS tool chains. Given the further emergence of generative artificial intelligence (AI) and other deep learning techniques over the project, such generative models were explored for their capability to perform more sophisticated fuzzing than some earlier randomized differential testing based approaches. The project also investigated utilizing software engineering and verification techniques for deep learning systems themselves as utilized within CPS, given the increasing usage of such AI and machine learning (ML) techniques within CPS for tasks ranging from perception to planning and control. Events were organized in support of these activities within the research community, such as the International Verification of Neural Networks Competition (VNN-COMP) and the International Competition on Verifying Continuous and Hybrid Systems (ARCH-COMP), particularly the category on Artificial Intelligence and Neural Network Control Systems (AINNCS), as well as a tutorial at the ACM SIGBED International Conference on Embedded Software (EMSOFT). Overall, the intellectual merit and broader impacts outcomes and results of the project may lead to increased usage of these novel rigorous software engineering techniques in the design of CPS by utilizing AI techniques for testing and verification, and also enhanced safety and trustworthiness of AI, particularly as used within CPS.


Last Modified: 01/29/2024
Modified by: Taylor T Johnson

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page