Award Abstract # 1741306
BIGDATA: IA: Predictive Analytics of Driver's Engagement for Injury Prevention

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: DREXEL UNIVERSITY
Initial Amendment Date: August 17, 2017
Latest Amendment Date: June 13, 2022
Award Number: 1741306
Award Instrument: Standard Grant
Program Manager: Wendy Nilsen
wnilsen@nsf.gov
 (703)292-2568
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: January 1, 2018
End Date: December 31, 2023 (Estimated)
Total Intended Award Amount: $999,993.00
Total Awarded Amount to Date: $1,015,993.00
Funds Obligated to Date: FY 2017 = $999,993.00
FY 2019 = $16,000.00
History of Investigator:
  • Bhupesh Shetty (Principal Investigator)
    bhupesh.shetty@drexel.edu
  • Helen Loeb (Co-Principal Investigator)
  • Weimao Ke (Co-Principal Investigator)
  • Santiago Ontanon (Co-Principal Investigator)
  • Sheila Klauer (Co-Principal Investigator)
  • Christopher Yang (Former Principal Investigator)
  • Jonathan Antin (Former Co-Principal Investigator)
Recipient Sponsored Research Office: Drexel University
3141 CHESTNUT ST
PHILADELPHIA
PA  US  19104-2875
(215)895-6342
Sponsor Congressional District: 03
Primary Place of Performance: Drexel University
3141 Chestnut Street
Philadelphia
PA  US  19104-2875
Primary Place of Performance
Congressional District:
03
Unique Entity Identifier (UEI): XF3XM9642N96
Parent UEI:
NSF Program(s): Big Data Science &Engineering
Primary Program Source: 01001718DB NSF RESEARCH & RELATED ACTIVIT
01001920DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7364, 7433, 8083, 9251
Program Element Code(s): 808300
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Over 30,000 people are killed in motor vehicle crashes on US roadways every year. Driver distraction from secondary in-vehicle activities, particularly among young drivers, has emerged as a major cause of motor vehicle crashes. A substantial amount of research has been focused on analyzing small sets of naturalistic driving or simulated data to study a small number of features for detecting the driver's engagement. However, many meaningful dependencies and patterns can only be discovered by large collections of data. In this project, the goal is leveraging the two petabytes federal database of naturalistic driving data to develop predictive analytics for detecting a driver's disengagement from the driving tasks in order to provide alerts to drivers and reduce the risk of motor vehicle crashes.

In this project, data pre-processing techniques are investigated for the large volume of heterogeneous data with over 100 variables in Strategic Highway Research Program 2 (SHRP 2). Two scalable predictive analytics algorithm families based on instance-based learning and heterogeneous network mining for predictive modeling are developed. In addition, a novel distributed computing infrastructure to support the scalable predictive analytics in performing pattern mining of driving behavior analysis, modeling, and prediction are developed. The research outcomes of this project shed a significant amount insight into current work of injury prevention due to motor vehicle crashes. The project extends the capability of machine learning, sensor informatics, and driving behavior analytics. The integrated education plan includes incorporating the research findings in courses offered at the Master of Science program in Health Informatics. The outreach plan involves organizing workshops, conferences, and seminars to disseminate the research outcomes.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 21)
Bruce G. Simons-Morton, Pnina Gershon "Crash rates over time among younger and older drivers in the SHRP 2 naturalistic driving study" Journal of safety research , v.73 , 2020 https://doi.org/2020.03.001 Citation Details
Chang, Chia-Hsuan and Monselise, Michal and Yang, Christopher C. "What Are People Concerned About During the Pandemic? Detecting Evolving Topics about COVID-19 from Twitter" Journal of Healthcare Informatics Research , v.5 , 2021 https://doi.org/10.1007/s41666-020-00083-3 Citation Details
Ehsani, Johnathon P. and Gershon, Pnina and Grant, Brydon J. and Zhu, Chunming and Klauer, Sheila G. and Dingus, Tom A. and Simons-Morton, Bruce G. "Learner Driver Experience and Teenagers Crash Risk During the First Year of Independent Driving" JAMA pediatrics , v.174 , 2020 https://doi.org/10.1001/jamapediatrics.2020.0208 Citation Details
Jazayeri, Ali and Capan, Muge and Ivy, Julie and Arnold, Ryan and Yang, Christopher C. "Proximity of Cellular and Physiological Response Failures in Sepsis" IEEE Journal of Biomedical and Health Informatics , v.25 , 2021 https://doi.org/10.1109/JBHI.2021.3098428 Citation Details
Jazayeri, Ali and Martinez, John Ray and Loeb, Helen S. and Yang, Christopher C. "The Impact of driver distraction and secondary tasks with and without other co-occurring driving behaviors on the level of road traffic crashes" Accident Analysis & Prevention , v.153 , 2021 https://doi.org/10.1016/j.aap.2021.106010 Citation Details
Jazayeri, Ali and Yang, Chris "Frequent Subgraph Mining Algorithms in Static and Temporal Graph-Transaction Settings: A Survey" IEEE Transactions on Big Data , 2022 https://doi.org/10.1109/TBDATA.2021.3072001 Citation Details
Jazayeri, Ali and Yang, Christopher C "Motif discovery algorithms in static and temporal networks: A survey" Journal of Complex Networks , v.8 , 2020 https://doi.org/10.1093/comnet/cnaa031 Citation Details
Jazayeri, Ali and Yang, Christopher C. "Frequent Pattern Mining in Continuous-Time Temporal Networks" IEEE Transactions on Pattern Analysis and Machine Intelligence , v.46 , 2024 https://doi.org/10.1109/TPAMI.2023.3324799 Citation Details
Liang, Ou Stella and Yang, Christopher C. "Determining the risk of driver-at-fault events associated with common distraction types using naturalistic driving data" Journal of Safety Research , v.79 , 2021 https://doi.org/10.1016/j.jsr.2021.08.003 Citation Details
Liang, Ou Stella and Yang, Christopher C. "How are different sources of distraction associated with at-fault crashes among drivers of different age gender groups?" Accident Analysis & Prevention , v.165 , 2022 https://doi.org/10.1016/j.aap.2021.106505 Citation Details
Liang, Ou Stella and Yang, Christopher C. "Mental health conditions and unsafe driving behaviors: A naturalistic driving study on ADHD and depression" Journal of Safety Research , v.82 , 2022 https://doi.org/10.1016/j.jsr.2022.05.014 Citation Details
(Showing: 1 - 10 of 21)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The goal of this project is increasing road safety by understanding and predicting driver engagement via the analysis of 4 Petabyte of SHRP 2 naturalistic driving data.  In this project, we first conduct a thorough data pre-processing, including reviewing variables, feature selection, data cleaning, de-duplication, filtering, format conversion, data integration and fusion, and data normalization, to ensure the data quality for predictive analytics.  We develop heterogeneous network mining and sophiscated statistical analysis to construct a global picture of driving behavior analysis focusing on driver engagement.  The temporal heterogeneous network mining are developed to extract the frequent patterns.  Novel representation is proposed to preserve the temporal aspects of the heterogeneous networks.  Constrained interval graphs are introduced and algorithms are developed to extract a complete set of frequent temporal patterns with isomorphism.  These frequent patterns are tested in the prediction model.  Evolution of temporal pattern in heterogeneous network, such as growing, diminishing, merging, or spliting, are identified  to understand the changes of patterns in sensor variables and their associations.  These patterns are helpful in the explanations of the prediction. We integrate a distributed and asynchronous computing infrastructure to support the distributed searching and asynchronous parallel graph computing.   Aggressive driving behavior are identified and analzyed with t-SNE to identify clusters time-series sensor data and determine four patterns, namely (1) stop and go driving, (2) changing lanes abruptly, (3) driving too fast for road condition, and (4) not properly maintaining lane.  A transfer learning with fairness optimization model has also been developed to optimize the fairness between different demographic groups and minimize the loss of performance.  Such model also provides explanations by identifying the important features that affect the fairness between groups.  Six PhD students and one postdoctoral student were trained in this project.  Four out of the six Phd students are female and one female student is an African American.  Two PhD students have public health or nursing backgrounds.  The interdisciplinary training has broadened their knowledge in health and computing.  

 


Last Modified: 01/16/2024
Modified by: Bhupesh   Shetty

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page