Traffic Safety Measures Using Multiple Streams Real Time Data

Similar documents
Laboratory for Energy Smart Systems (LESS)

Seed-Project - Transportation Safety Resource

Nondestructive Evaluation of Four Sister Bridges in Virginia Using Manual NDE Technologies and Robotic Platform RABIT

Laboratory Evaluation of Vestoplast Modified Hot Mix Asphalt (HMA)

Modeling of Interaction Between Steel and Concrete in Continuously Reinforced Concrete Pavements

Improved Road Condition Reporting

FINAL REPORT November Center for Advanced Infrastructure and Transportation 100 Brett Road, Piscataway, NJ and

Utilization of a Pneumatic Tube Mixing Technique for Processing and Stabilization of Contaminated Dredged Material

PRODUCT DESCRIPTIONS AND METRICS

PRELIMINARY REVIEW COPY

NTC Program Progress Performance Report (PPPR) Information Form

NYSDOT Waterproof HMA Mix Verification

NUTC RE252. Acquisition of Equipment for Composite Manufacturing Laboratory

Engineering Management Consulting Services. Submitted by. Janet Hansen Center for Advanced Infrastructure & Transportation Rutgers University

Origin-Destination Trips and Skims Matrices

Laboratory Evaluation of Honeywell Polymer vs SBS Polymer Modified Asphalt Mixtures

TRUCK TRIP ESTIMATION BY FACILITIES USING ARC-INFO

Predicting dangerous road segments using in-vehicle telematics.

Big Data in Transportation: Opportunities and Challenges

UTC R170. Hydrogen Flammability Limits and Implications on Fire Safety of Transportation Vehicles. Umit O. Koylu

NUTC R203 BEHAVIOR OF HIGH-PERFORMANCE CONCRETE IN STRUCTURAL APPLICATIONS. Jared Brewe

Forensic Testing of Stone Matrix Asphalt Rubber Material

University of Kentucky 1 Southeastern Transportation Center Safety Performance in a Connected Vehicle Environment. FIGURE 1: V2V and V2I Connectivity

Inorganic Protective Coatings and Fiber Reinforced Polymers Demonstration Project: Route 47 Wildwood Drawbridge Bridge House Rehabilitations

USDOT Connected Vehicle Research Program Vehicle-to-Vehicle Safety Application Research Plan

Greater Yellowstone Rural ITS Priority Corridor Project. Task 5. ITS Vision Working Paper

Strategic Highway Research Program (SHRP) 2. Revised Safety Research Plan: Making a Significant Improvement in Highway Safety.

NUTC R203. Modeling of Laser Cladding with Application to Fuel Cell Manufacturing. Zhiqiang Fan

Development of a Presence Assessment Tool for Iowa s Pavement Marking Management System

Empirical Analysis of Crowdsourced Google Travel Time Data for Sustainable Transportation Applications

Senda BOUKEF IBM Watson and Cloud Platform portfolio architect, MEA Watson / Presentation Title / Date

Improving Urban Mobility Through Urban Analytics Using Electronic Smart Card Data

BIG DATA & ANALYTICS WHAT SHOULD YOU BE DOING? DWAYNE INGRAM SVP, BUSINESS OPERATIONS

Truck Route Choice Modeling using Large Streams of GPS Data

OPTIMUM Project. Multi-source Big Data Fusion Driven Proactivity for Intelligent Mobility. Name. event location date. Organisation

Artificial Intelligence and Machine Learning in IoT

Data Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence

Marketing & Big Data

Integrated Track Infrastructure Assessment Tools

About Us WORKING OFFICES. Istanbul Turkey Riyadh / Saudi Arabia Belgrade / Serbia Podgorica / Montenegro Shenzhen / China

RFID Applications for Tolling and Congestion Management

Jeffrey S. Weidner. Project Engineer - Intelligent Infrastructure Systems. Researcher Engineer and Ph.D. Student - Drexel University

Optimize Asset Performance with Industrial IoT and Analytics. Operator, Industrial IoT, Uptime, Predictive Maintenance, IBM

KENTUCKY TRANSPORTATION CENTER

Automating Variable Speeds and Traveler Information with Real-Time Traffic and Weather

Becoming an Intelligent Enterprise to Make Money Out of Data

Transportation Analytics. Sanjay Ranka University of Florida February 9, 2017

BRASIL PLAN ESTRATÉGICO SMART TRANSPORT STRATEGIC PLAN

A Quadcopter with Heterogeneous Sensors for Autonomous Bridge Inspection

Smart telematics solutions

CONDITION MONITORING SOLUTIONS

A VISION-BASED APPROACH TO STUDY DRIVER BEHAVIOR IN WORK ZONE AREAS

Artificial Intelligence and Its Impact on the Global Travel Industry

Improving Financial Performance with Predictive Analytics

The Use of Cell Phone Network Data in Traffic Data Collection and Long-Haul Truckshed (Geographic Extent) Tracking

NUTC R308. Fiber Reinforced Cementitious Matrix (FRCM) Composites for Reinforced Concrete Strengthening. Lesley Sneed

IoT ANALYTICS IN THE ENTERPRISE WITH FUNL

The Rise of Engineering-Driven Analytics. Richard Rovner VP Marketing

The usage of Big Data mechanisms and Artificial Intelligence Methods in modern Omnichannel marketing and sales

Evaluation of Congestion Pricing for Management Highway in Seattle

Jack Weast. Principal Engineer, Chief Systems Engineer. Automated Driving Group, Intel

Preparing our Roadways for Connected and Automated Vehicles. 70th Annual Ohio Transportation Engineering Conference Bob Edelstein

Innovations in Central Data Management for Truck Compliance and Mobility - Vehicle Information in Motion

Tapping on the Power of Big Data to Improve the Operation of Power Systems

Autonome Voertuigen hebben de toekomst. Taco Olthoff

Big Data & Analytics for Wind O&M: Opportunities, Trends and Challenges in the Industrial Internet

Central Data Repository for Traffic Data Collection in Rural Areas and Corridors Supporting Freight Mobility

chief data officer s guide to artificial intelligence

The Rise of Engineering-Driven Analytics

Data Analysis in the Internet of Things: IoT capabilities with MATLAB/Simulink

Industry Innovations with SAP Leonardo: Mill Products Industry

Transportation Research and Education at the University of Virginia. Dr. Brian Smith Dr. Jose Gomez Dr. Lindsay Ivey-Burden

Finding a Reliable Route between Two Points in a Transportation Network Using GIS

HOW TO USE AI IN BUSINESS

Intelligence That Moves the World

Better Perspectives with Big Data in City of Atlanta

Impact of Variable Pricing on Temporal Distribution of Travel Demand

The Internet of Everything and the Research on Big Data. Angelo E. M. Ciarlini Research Head, Brazil R&D Center

Pointer Telocation Ltd. (Translation of registrant's name into English)

P3 CRITICAL EMERGING TECHNOLOGIES (PRELIMINARY ANALYSIS) Authors: Jia Li Andrea Hall Kristie Chin C. M. Walton

Business Energy Council

Internet OF Things: Transformation In The Digital Era

Analytic Alphabet Soup: IoT, AI & ESP Big Data Analytics is a game changer in our Connected World

Data-driven Network Models for Analyzing Multi-modal Transportation Systems

11. Contract or Grant No. Austin, TX

Driving Innovation in the Smart Car

Approved for public release,

WISCONSIN DOT PUTTING RESEARCH TO WORK

AVHS encompass a wide and complex variety of technologies, such as artificial intelligence,

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics

Page 1 of 5 Copyright 2017 Platformatics, Inc. CA-MRCS-DS MSML

Survey paper on Smart Parking System based on Internet of Things

Operations in the 21st Century DOT Meeting Customers Needs and Expectations

Concept of Operations

NUTC R203. Safety Risks of Hydrogen Fuel for Applications in Transportation Vehicles. Shravan K. Vudumu

CFIRE. Evaluation of Green House Gas Emissions Models. CFIRE November 2014

UTC ETT190. GeoMo 2007 Subgrade Stabilization - Foundations & Pavements. Ronaldo Luna

CONNECTED LOGISTICS. Denis SENPERE Inspirage Europe Vejle, October 2 nd Inspirage / Oracle All rights reserved. Courtesy Alstom / Amtrak

The future of mobility, innovative infrastructure and private enterprise

RECOGNIZING USER INTENTIONS IN REAL-TIME

Transcription:

CAIT-UTC-055 Traffic Safety Measures Using Multiple Streams Real Time Data FINAL REPORT January 2017 Submitted by: Mohsen Jafari Professor of Industrial and Systems Engineering Center for Advanced Infrastructure and Transportation 100 Brett Road Piscataway, NJ 08854 External Project Manager Patricia Ott MBO Engineering, LLC In cooperation with Rutgers, The State University of New Jersey And U.S. Department of Transportation Federal Highway Administration

Disclaimer Statement The contents of this report reflect the views of the authors, who are responsible for the facts and the accuracy of the information presented herein. This document is disseminated under the sponsorship of the Department of Transportation, University Transportation Centers Program, in the interest of information exchange. The U.S. Government assumes no liability for the contents or use thereof. The Center for Advanced Infrastructure and Transportation (CAIT) is a Tier I UTC Consortium led by Rutgers, The State University. Members of the consortium are the University of Delaware, Utah State University, Columbia University, New Jersey Institute of Technology, Princeton University, University of Texas at El Paso, University of Virginia and Virginia Polytechnic Institute. The Center is funded by the U.S. Department of Transportation.

1. Report No. CAIT-UTC-055 2. Government Accession No. 3. Recipient s Catalog No. 4. Title and Subtitle Traffic Safety Measures Using Multiple Stream Real Time data 7. Author(s) Mohsen A Jafari TECHNICAL REPORT STANDARD TITLE PAGE 5. Report Date 1/4/2017 6. Performing Organization Code CAIT/Rutgers University 8. Performing Organization Report No. CAIT-UTC-055 9. Performing Organization Name and Address Department of Industrial and Systems Engineering 96 Frelinghuysen Rd. Piscataway NJ 08854 10. Work Unit No. 11. Contract or Grant No. DTRT12-G-UTC16 12. Sponsoring Agency Name and Address 13. Type of Report and Period Covered Center for Advanced Infrastructure and Transportation Rutgers, The State University of New Jersey 100 Brett Road Piscataway, NJ 08854 15. Supplementary Notes U.S. Department of Transportation/Research and Innovative Technology Administration 1200 New Jersey Avenue, SE Washington, DC 20590-0001 Final Report 4/1/2015 9/1/2016 14. Sponsoring Agency Code 16. Abstract Traffic crashes and accidents result from many complex factors, but at a basic level, they are conflicts among vehicles and/or other road users. Roadway conditions, traffic signals, weather, traffic flow, drivers' behavior and health of vehicles on roadways are among significant factors that influence these conflicts. With the enormous advances in connected vehicles technology, the Internet of Things (IOT) and smart cars, the opportunities for more advanced safety techniques that are proactive and customizable to individual drivers are becoming more realizable. The main objective of this project is to build advanced analytics to estimate a composite traffic safety risk measure that change temporally and spatially, and take into account driver behavior, roadway quality conditions and historical safety characteristics of roadways. Our vision is that with smart cars and smart roadways, a travel plan for a given driver will be associated with a safety risk profile composed of these risk estimates that are sampled in time and change whenever one or more of the underlying data streams change. This project will focus more on the development of such a methodology and less on how it should be implemented and calibrated for different applications. 17. Key Words Traffic safety, predictive models, Drive behavior, smartphone apps 19. Security Classification (of this report) Unclassified 18. Distribution Statement No restriction 20. Security Classification (of this page) Unclassified 21. No. of Pages Total # 8 22. Price Form DOT F 1700.7 (8-69)

Table of Contents DESCRIPTION OF THE PROBLEM... 1 APPROACH... 1 METHODOLOGY... 1 FINDINGS... 3 CONCLUSIONS... 4 List of Figures Figure 1 A General Data Collection Model red boxes are primary candidates 2

DESCRIPTION OF THE PROBLEM The problem of interest is to build a predictive model to estimate a composite traffic safety risk measure that changes temporally and spatially, and takes into account driver behavior, roadway quality conditions and historical safety characteristics of roadways. A subsequent problem formulation will use this predictive model to sample safety risks at some equidistant time epochs during a trip. These sampled risks are then compiled into a risk profile for that trip. We are also interested in the statistical analysis of these risk profiles to better understand one s driving patterns, and by devising appropriate filtering schemes, we like to correlate these driving patterns to location, time, weather and other useful parameters. These risk profiles, if aggregated over location parameters, can also give us useful information on the road safety from an infrastructure point of view. Last but not least, safety risk predictions can be used toward the development of Advanced Driver Assistance System (ADAS). APPROACH Prior to this project, our team had experimented with 100-car driver s behavioral study and had built a predictive model using the data that is publically available from this study. The funding from this project was used to acquire a larger dataset that is available from publically available SHARP2 database. We did not have sufficient funds to acquire the whole dataset, and for that reason we had to employ a filtering mechanism that closely matched our predication requirements. Despite our best efforts in devising the filtering scheme we were not able to obtain all the data that was necessary for our analysis. For instance, it was quite important for us to receive video data especially at times of accidents (crash or near miss). But, due to security and confidentiality reasons, this data was not made available to us. Our funds was not sufficient either to obtain a dataset that contained trips per a defined set of drivers, hence, some aspects of driver behavior could not be included in our modeling scheme. Our dataset, however, included baseline, near miss and crash data for random drivers, stamped with time and events. It included time series data in order of 10-15 seconds prior to an event (e.g., crash) and 10 seconds after the event. This allowed us to extract real useful behavioral information at the time of events. The new dataset enabled us to significantly expand our safety risk prediction model and to develop a new model that explains driver reaction time on the basis using kinematics and statistical data. This is an important finding, which allows one to customize driver alerts according to driver s own characteristics and surrounding circumstances. Computation of reaction time and safety risk require a database architecture that was also planned and developed under this project. METHODOLOGY Our work pursued the following objectives: Development of appropriate data models and a database using multiple data streams. We used the data fields from Naturalistic Driver study and combined it with data from Straight Line Diagram (SLD) and historical crash data. Figure 1 illustrates the general data framework used for our data modeling. We adopted a feature engineering technique to prepare data from model building in the next step. 1

Figure 1 A General Data Collection Model red boxes are primary candidates 2

Development of a safety risk predictive model This model is intended to predict safety risk score of a driver at sampled time epochs. The overall risk score is a function of different driving conditions, such as crash, near miss and normal driving. One can further increase the model resolution by creating subcategories of each. For instance, there may be several subcategories of normal driving depending on safety critical applications. One can also combine several risk categories together and reduce driving states to unsafe and normal. Here we assumed three categories of crash, near miss and normal driving. Our methodology includes the estimation of each of these probabilities and a function that combines them into a risk score. In the absence of sufficient data one may use Probability of crash as a risk score. With the availability of data from drivers one will be able to update the model parameters and also extend the model to include additional data variables. Therefore, risk score modeling is a dynamic process that will improve as more data becomes available. Given a sample of real-time driving events one can train a Generalized Linear Model (GLM) with regularization (elastic-net) and estimate the regression coefficients. Preliminary architecture for an on-board driver alert system Our database supports the risk data collected from the above prediction model. The risk score model can be programmed offline and run from a cloud based server at sampled times. It can also be programmed by an APP and run from a smartphone inside vehicle. We have adopted a two-tried approach. A full prediction model estimation is programmed using R in an offline cloud based setting. The model parameter estimates are then programmed using a tabular structure in an APP. This APP also collects driver information (ID, age, etc.) from the database. A driver ID and GPS origin and destination define a trip. More complex trip models can be built at some later phases. The APP starts at the GPS origin. At intervals defined by the driver (e.g., default can be set to every 10 to 15 seconds) the risk score value (using either predictive or rule based approach) is calculated and stored locally in the smartphone. The sampled valued are also graphed (y-axis is risk and X-axis is the distance travelled). We refer to these graphs by driver risk profile over a trip. These risk profiles are uploaded to a server at the completion of the trip along with other pertinent information (such as some important driving patterns, etc.). FINDINGS Partially supported by this funding, we developed a novel data-driven approach to real-time traffic safety risk prediction. We quantified traffic safety risk as the likelihood of adverse driving outcomes. We proposed using the elastic net regularized multinomial logistic regression with a built-in variable selection and shrinkage mechanism and a cost-sensitive loss function for imbalanced data. To the best of our knowledge, this work is the first attempt toward applying the elastic net model to traffic accident related research. We introduced five measures of goodness (lack of goodness) to evaluate the performance of the prediction models, namely miss-classification, Type I, Type II, Upper Off-diagonal Triangular and Lower Offdiagonal Triangular error rates to take into account the sensitivity and specificity of the classifier in identifying cases in the minority classes. To evaluate the prediction performance of the prediction models, we used 10-fold cross validation, and used a subset of SHRP 2 NDS safety data to show the applicability of our proposed approach. We also developed a data preparation and feature engineering approach, which included the following steps. First, we introduced a three-class, i.e. crash, near-crash, and normal-driving, prediction model. Then, by breaking down the normal-driving class, we further illustrated that the model can easily accommodate different resolutions of driving outcome. 3

The developed predictive models can be used to support a multitude of applications, including Advanced Driver Assistance System (ADAS), safety risk profiling of drivers, and safety risk scoring of roadway segments for dynamic hotspot analysis. For example, the models can be incorporated into a data-driven collision warning system to warn drivers of critical events and/or unsafe driving situations. The design of such a system and its practical issues, including sensors acquisition, in-vehicle perception, onboard processing and the HMI design, can be the subject of future works. In this study, in selecting the final models, we preferred a higher type-i error in exchange for a lower type-ii error since the cost of the latter can be loss of lives. Since there is a tradeoff between type I and II errors, the effort to reduce one generally results in increasing the other. One approach to simultaneously reduce type I and II errors in an imbalanced data setting is to collect more data to increase the sample sizes of minority classes. In fact, in this study we have only used one third of the SHRP 2 data. Including more crash and near-crash cases most likely improve the prediction performance of the models. Another approach to tackle this problem is to use re-sampling methods such as bootstrapping which can be the subject of a future work. Additionally, with more data and provided that there are enough samples per specific crash type, separate models can be developed to predict specific types of collisions. This, in turn, will raise the issue of how to prioritize different crash-type warnings at a given time in an ADAS. Finally, advanced machine learning algorithms such as deep learning can be a potential alternative to improve the performance of the safety risk model as big players like Google, Nvidia, Microsoft and IBM have already invested heavily in new projects around this breakthrough method. The funding from this project partially supported a PhD thesis, which was completed and defended on January 17, 2017. A technical article was also accepted and schedule for publication: "Traffic Safety Risk Prediction Using Driver Behavior and Roadway Information Data" Nasim Arbabzadeh and Mohsen A Jafari, to appear in IEEE Transactions on Intelligent Transportation Systems, 2017. We also built a preliminary version of a smartphone APP was tested in limited basis by our team. Additional work will be necessary to build the alpha version of the APP and run additional field testing. CONCLUSIONS Future roadways will have a mix of autonomous and automated vehicles with regular vehicles that require human operators. To ensure the safety of all the road users in such a network, it is necessary to enhance the performance of the present Advanced Driver Assistance System (ADAS) for lower classes of vehicles. Real-time driving safety risk prediction is an essential element of an ADAS. In this study, we developed a novel data-driven approach to predict traffic safety risk that can be customized to individual drivers by including driver-specific variables. In particular, we used the elastic net regularized multinomial logistic regression and data from the second Strategic Highway Research Program (SHRP 2) Naturalistic Driving Study (NDS) to build these predictive models. We rigorously examined the variables in the dataset and performed data preparation and feature engineering steps to enhance the prediction performance with respect to model predictors. Two versions of the model were developed according to the level of warnings that the model can generate based on driving conditions. 4