A Procedure to Determine When Safety Performance Functions Should Be Recalibrated

Similar documents
Geni Bahar, P.E. Describe key concepts and applications

Evaluating Design Alternatives using Crash Prediction Methods from the Highway Safety Manual

An Introduction to the. Safety Manual

The sections below include examples of HSM implementation activities undertaken by various transportation agencies.

Calibrating Crash Modification Factors for Wyoming-Specific Conditions: Application of the Highway Safety Manual - Part D

Status of Highway Safety Manual Implementation. October 2015

THE HIGHWAY SAFETY MANUAL

SafetyAnalyst: The Georgia Experience

Role and Application of Accident Modification Factors (AMFs) within the Highway Design Process

How to Choose Between Calibrating SPFs from the HSM and Developing Jurisdiction-Specific SPFs

Interactive Highway Safety Design Model (IHSDM) Traffic Records Forum Biloxi, MS October 28,

Development of a Decision Making Process for the HSM Implementation

Modeling Safety in Utah

Interactive Highway Safety Design Model (IHSDM) AASHTO Subcommittee on Design July 21, 2009

TRANSPORTATION RESEARCH BOARD. Resurfacing, Restoration, and Rehabilitation Projects: Geometric Design. Tuesday, October 2, :00-3:30 PM ET

SAFETY EFFECTS OF FOUR-LANE TO THREE-LANE CONVERSIONS

New Analytical Tools for Safety Management of Urban and Suburban Arterials. Douglas W. Harwood Midwest Research Institute Kansas City, MO

Estimation of Average Daily Traffic on Low Volume Roads in Alabama

A random forests approach to prioritize Highway Safety Manual (HSM) variables for data collection

Overcoming Highway Safety Manual Implementation Challenges

Eng. Benjamín Colucci-Ríos, PhD, PE, PTOE, FITE, API, JD

Safety Hot Spot Analysis Methodology in Utah Grant G. Schultz, Ph.D., P.E., PTOE; and Jacob S. Farnsworth, E.I.T.

How to Utilize Your Data to Provide Information to Decision Makers in Traffic Safety ATSIP August 2nd

Statewide Roundabout Guidance

Safety evaluation of multilane arterials in Florida

TRAFFIC STUDY GUIDELINES

NCHRP 3-88 Guidelines for Ramp and Interchange Spacing

Modeling the Safety Effect of Access and Signal Density on. Suburban Arterials: Using Macro Level Analysis Method

Urban Street Safety, Operation, and Reliability

Highway Safety Analysis Tools for Engineers

SAFETY IMPACT OF HIGH FRICTION SURFACE TREATMENT INSTALLATIONS IN PENNSYLVANIA

Comparison Between Traditional Methods and the EB SPF Method for Identifying High Crash Risk Intersections

Comparing Roundabout Capacity Analysis Methods, or How the Selection of Analysis Method Can Affect the Design

Thesis: Development of a Mode Choice Model Considering Heterogeneous Traffic of Dhaka City B.S. Civil Engineering, 2008

TRAFFIC SAFETY EVALUATION. Using the Highway Safety Manual and the Interactive Highway Safety Design Model. I 15 Dry Lakes Design Exception

Innovative Tools and Techniques in Identifying Highway Safety Improvement Projects

Data-Driven Safety Analysis (DDSA) Implementing Safety Innovations Every Day Counts 3

Safety Effects of Street Illuminance on Urban Roadways

Updating RoadHAT: Collision Diagram Builder and HSM Elements

Urban Street Reliability Engine User Guide

Urban Street Reliability Engine User Guide

REAL-TIME CRASH RISK ANALYSIS OF URBAN ARTERIALS INCORPORATING BLUETOOTH, WEATHER, AND ADAPTIVE SIGNAL CONTROL DATA

NCHRP Report 687 Guidelines for Ramp and Interchange Spacing

INTERACTIVE HIGHWAY SAFETY DESIGN MODEL (IHSDM)

RELATIONSHIP BETWEEN TRAFFIC FLOW AND SAFETY OF FREEWAYS IN CHINA: A CASE STUDY OF JINGJINTANG FREEWAY

Volume to Capacity Estimation of Signalized Road Networks for Metropolitan Transportation Planning. Hiron Fernando, BSCE. A Thesis CIVIL ENGINEERING

TxDOT X-Ramps vs Diamonds: Safety and Operational Issues for Conversion

Final Report SAFETY. Craig Lyon. Gabrielle Renner

HIGHWAY DAMAGE DUE TO MOVEMENT OF WIND TURBINE COMPONENTS

BEFORE AND AFTER STUDY: EVALUATING THE SAFETY EFFECTS OF A THRU-U-TURN INTERSECTION

Identification of Crash Types, Risk Factors, and Countermeasures to Implement Systemic Safety

Title : Evaluation of Geometric Elements and Speeds using Probability Theory. Author : Heungun Oh. Abstract

Model Construction and Calibration Technical Documentation Draft

Automated Statewide Highway Intersection Safety Data Collection and Evaluation Strategy

GUIDE FOR THE PREPARATION OF TRAFFIC IMPACT STUDIES

HSM APPLICATION USING IHSDM (I-12 TO BUSH) April Renard, P.E. LADOTD Highway Safety

100 Introduction Functional Classification Traffic Data Terrain & Locale Design & Legal Speed...

Bigelow, Hans, and Phares 1. Bridge Rail and Approach Railing For Low-Volume Roads In Iowa Resubmission Date: October 29, 2010

APPENDIX D ACCESS MANAGEMENT GUIDELINES

An Automated System for Prioritizing Highway Improvement Locations and Analyzing Project Alternatives

The New Highway Capacity Manual 6 th Edition It s Not Your Father s HCM

The Highway Economic Requirements System (HERS): How HERS Works

Safety Evaluation of Diverging Diamond Interchanges in Missouri

CHAPTER 2: MODELING METHODOLOGY

ROAD ACCIDENT DATA BASE: THE CASE OF CACADU DISTRICT MUNICIPALITY ABSTRACT

Safety Evaluation of Diverging Diamond Interchanges in Missouri

9. TRAVEL FORECAST MODEL DEVELOPMENT

HIGHWAY SAFETY MANUAL WORKSHOP

Developing Direct Demand AADT Forecasting Models for Small and Medium Sized Urban Communities

Crash Analysis Tools 5/12/2011. Overview. Crash Reporting. Crash Data Crash Analysis High Crash Location Prioritization

A RISK-BASED OPTIMIZATION METHODOLOGY FOR MANAGING COUNTY PAVED ROADS

University of Kentucky 1 Southeastern Transportation Center Safety Performance in a Connected Vehicle Environment. FIGURE 1: V2V and V2I Connectivity

Emergency Incident Management, Benefits and Operational Issues

RESEARCH RESULTS DIGEST July 1999 Number 242

Highway Safety Manual Implementation. Priscilla Tobias, P.E. State Safety Engineer Illinois Department of Transportation

Application of Road Safety Audits to Urban Streets

GIS-BASED NON-SIGNALIZED INTERSECTION DATA INVENTORY TOOL TO IMPROVE TRAFFIC SAFETY

MPC-459 April 1, July 31, 2017

DEVELOPMENT AND APPLICATION

LOCATION AND DESIGN DIVISION

HIGH RISK RURAL ROADS (HRRR)

FDOT D7 Local Agency Traffic Safety Academy September 21, FHWA - Kevin Burgess, Chimai Ngo, Esther Strawder, and Jim Thorne

AASHTO s Highway Safety Manual and Performance Measures / Targets

USER BENEFIT ANALYSIS FOR HIGHWAYS MANUAL

Highway Safety Manual Lead State Peer Exchange Irvine, California

GENERAL PLAN UPDATE. Joint Study Session April 13, 2017

The Management in Access Management. John M. Broemmelsiek, PE U.S. Department of Transportation Federal Highway Administration

Predicting Travel Time Reliability: Methods and Implications for Practice

B HOT MIX, HOT LAID ASPHALTIC CONCRETE OPSS 313

USING TWOPAS SIMULATION MODEL TO PROVIDE DESIGN AND OPERATIONS INFORMATION ON THE PERFORMANCE OF IDAHO S TWO-LANE HIGHWAYS

Traffic Safety Analysis of Utah County Highways using SafetyAnalyst. Peter Crawford Kelly

Construction Related User Delay Costs The Case of the Crowchild Trail Bridge Rehabilitation in Calgary

A Network Demand Model for Rural Bypass Planning. Paper Number

Improved Methods for Superelevation Distribution: I. Single Curve

Model Performance Measures for State Traffic Record Systems

Alternative High Occupancy/Toll Lane Pricing Strategies and their Effect on Market Share

Highway Safety Manual Lead State Third Peer Exchange Nashville, Tennessee

DEVELOPMENT AND ANALYSIS OF AN INTERSECTION SAFETY PRIORITIZATION METHOD FOR THE STATE OF DELAWARE. Anna Duryea

MPC-480 June 9, Project Title: A Comprehensive Safety Assessment Methodology for Innovative Geometric Designs. University:

Recommended Roadway Plan Section 3 Existing Facilities & System Performance

Transcription:

A Procedure to Determine When Safety Performance Functions Should Be Recalibrated Mohammadali Shirazi Zachry Department of Civil Engineering Texas A&M University, College Station, TX 77843, United States Email: alishirazi@tamu.edu Srinivas Reddy Geedipally*, Ph.D. Assistant Research Engineer Texas A&M Transportation Institute Tel. (817) 462-0519 Email: srinivas-g@tti.tamu.edu Dominique Lord, Ph.D. Professor, Zachry Department of Civil Engineering Texas A&M University, College Station, TX 77843, United States Tel. (979) 458-3949 Email: d-lord@tamu.edu *Corresponding Author 1

ABSTRACT Crash prediction models or Safety Performance Functions can be used for estimating the number of crashes and evaluating roadway safety. Developing a new model can be a difficult task and requires a significant amount of time and energy. To simplify the process, the Highway Safety Manual provides Safety Performance Functions for conducting different types of safety analyses for several facilities. However, since data collected from a few selected states for a specific period of time were considered for fitting and validating these models, they are required to be calibrated to the conditions of the new jurisdiction, and as well need to be revisited for recalibration over time. Therefore, the analyst may need to know when or how often the models are recommended to be recalibrated. This paper addresses this question and documents recommendations that are based on the general characteristics of data. The proposed procedure only require (1) the total number of crashes (2) the average traffic flow and (3) the total segment length (or number of intersections) in the network. The method was validated with different empirical datasets collected in Texas and Michigan. The results show that the proposed procedure provides useful information about when recalibration is recommended. 1. INTRODUCTION Crash prediction models can be used to estimate the number of crashes, screen the effects of variables on crashes, identify hotspots and in general evaluate roadway safety. Developing a new model can demand a great deal of time, energy and resources. In order to simplify the process, the Highway Safety Manual (HSM) (AASHTO, 2010) provides crash prediction models (CPMs) or Safety Performance Functions (SPFs) for conducting different types of safety analyses for several categories of facilities. These SPFs were fitted and validated with data collected from a limited number of states. Therefore, for a new jurisdiction, the safety analyst needs to either develop a jurisdiction-specific model (see Lu et al., 2014; Young and Park, 2013) or calibrate the HSM SPFs to the jurisdiction conditions. For the later approach, the HSM provides a calibration procedure to account for both the randomness of crash data and the effect of the factors that were not or could not be considered in predictive models, such as the weather conditions or driver behavior. The HSM recommends collecting crash and site characteristics data from a randomly selected sample that includes 30-50 sites with total of at least 100 crashes per year. [note: several researchers have raised issues about the HSM one-size-fits-all sample size recommendation (see 2

Banihashemi, 2012; Alluri et al., 2016; Lord et al., 2016). For better guidelines on the required sample size, interested readers are referred to Bahar (2014) and Shirazi et al. (2016).] Then, the calibration factor (C-factor) is estimated by taking the ratio of the number of observed crashes to the number of unadjusted predicted crashes as follows: where, C= calibration factor; C N, N, N, N, CMF CMF N, = the observed number of crashes at site i; N, =the unadjusted predicted number of crashes at site i; N, = the predicted number of crashes at site i at the base conditions; and, CMF = the j-th crash modification factor at site i. (1) The C-factor is multiplied as a scalar term to the SPF from the HSM for the type of facility being analyzed. Since the release of the HSM, several states such as Oregon (Xie et al., 2011), Utah (Brimley et al., 2012), Illinois (Williamson and Zhou, 2012), Alabama (Mehta and Lou, 2013), and Maryland (Shin et al., 2014), among others, calibrated the HSM models to local conditions. It was widely reported in these studies that the calibration procedure is a time consuming task due to limitations which are mostly pertained to the data collection, readiness and completeness. Moreover, the procedure is not a one-time task. Since the characteristics of crash data (associated with continuous change in roadway, vehicle and driver characteristics) are likely to change as time goes on, prediction models should also be recalibrated (or updated) over time (see Wood et al, 2013; Connor et al, 2013). On the one hand, recalibrating the predictive models is inevitable as the characteristics of crash data are likely to change over time or temporarily. On the other hand, although more efficient than fitting a new model, recalibrating the predictive models can still be a challenging task. Furthermore, the effort put into recalibrating models could be wasted if it was found that a recalibration had not been required at that point of time. Consequently, the analyst or an agency may need to know when or how often SPFs should be recalibrated to avoid unnecessary calibration 3

efforts or expenses. The HSM recommends calibrating the models every 2 or 3 years, regardless of the crash data characteristics and their variation over time. However, this recommendation should be reviewed as crash data are random and their characteristics can vary substantially from one period of time to the next and this randomness can influence the recalibration decision. The objective of this study is therefore to provide a simple and convenient procedure (or recommendations) to determine when the analyst is advised to recalibrate the predictive models. The proposed procedure is based on general characteristics of data that will be used for recalibration of the predictive models. In order to use the recommendations, the analyst needs to secure only three sources of information for the period of data that will be used for recalibration. These sources are: 1) the total number of crashes 2) the traffic flow average and 3) the total segment length (or the total number of intersections). Once the information is secured, the recommendations can simply be used to make the recalibration decision. 2. RECALIBRATION RECOMMENDATIONS This section is divided into two parts. The first part describes the characteristics of the proposed method. The second part covers the procedure to use the recommendations for segment and intersection models. 2.1 Methodology As discussed above, the characteristics of crash data vary over time. This variation can influence the recalibration decision. Based on Equation (1), the value of the C-factor is a function of the observed number of crashes. Therefore, it may initially seem that a substantial change in total observed number of crashes is an indicator for recalibrating a model. However, this may not necessarily be the case, as sometimes the change in the number of crashes can simply be attributed to a comparable change in network traffic flow. An increase (or decrease) in number of crashes can be caused by one of these two factors: (1) a fluctuation in traffic flow, or (2) a change in crash risk (or a decrease or increase in safety). It can be argued that if the change in the number of crashes is pertained to comparable traffic flow fluctuation on the network, the analyst does not need to recalibrate the predictive models. On the other hand, if the increase (or decrease) in the number of 4

crashes is attributed to a particular change that induce a safety issue (or safety improvement) ceteris paribus, the model should be recalibrated. We use the HSM base models [Note: the flow only models with all other variables at the base conditions are referred to as base models] to account for the effect the traffic flow on time-to-time variation of the crash data and estimate the number of crashes caused by changing the mean value of the traffic flow, either Average Daily Traffic (ADT) or Annual Average Daily Traffic (AADT). The typical HSM base model to estimate the number of crashes on roadway segments is as follows: N, e (2) where F and L respectively denote the traffic flow (AADT or ADT) and the segment length at the site ". The parameters b and b denote the intercept and the traffic flow coefficient, respectively. The functional form shown in Equation (2) is kept the same. However, instead of using the Flow value at each individual site, the mean value of the traffic flow in the network (F ) is considered in the model to approximately estimate the overall number of crashes as a function of variable F. This is shown in Equation (3): N, (3) The parameter C (which is referred to as the C- proxy in this study) is calculated as follows: C N, e (4) In other words, N, C (5) e L Therefore, N, C (6) e L 5

Hence, where, C N (7) e L C = C- proxy; N = the total number of crashes on the network; F = the mean value of the traffic flow (AADT or ADT) on the network; and, L = the total combined length of all sites. As shown in Equation (7), only three sources of information at the network level are required to calculate the C-proxy: (1) the total number of crashes, (2) the mean value of the traffic flow, and (3) the total segment length. Recall that it was argued that if the number of crashes increases or decreases due to comparable traffic flow fluctuations, the analyst does not require to put effort on recalibration of the predictive model (i.e., find a new calibration factor). This can adequately be addressed by calculating the C- proxy periodically. In this case, if the C-proxy does not change significantly from one recalibration period to the next, the analyst will not be required to recalibrate the predictive model. On the other hand, a significant change in C-proxy indicates a variation in crash data that has not just been pertained to comparable traffic flow fluctuation. Hence, recalibration is advised. In order to use the procedure, the analyst is required to calculate the C-proxy periodically and compare it with the C-proxy that was estimated in the reference year [Note: the latest year that the predictive model was recalibrated is referred to as the reference year]. If the relative difference between the current C-proxy (C ) and reference-year C-proxy (C ) is less than a certain threshold, the analyst can ignore the recalibration and keep the current model; otherwise, the analyst is recommended to recalibrate the predictive model. In this study, we set the threshold to 10 percent. However, it should be pointed out that the procedure works for any threshold. The threshold is in fact a function of or a balance between the accuracy sought and the effort undertaken by the analyst (i.e., how often does the analyst want to recalibrate the model). Further discussion about the threshold is provided below. 6

2.2. Instructions to Use the Procedure This section summarizes the instructions for using the proposed procedure for roadway segment and intersection models. Before presenting the instructions, it is worth pointing out that the proposed method serves only as an approximation process. Some other factors can also influence the recalibration decision. For example, if significant system-wide safety improvements happened in the analyzed jurisdiction, the analyst would be required to update the models regardless of what the method states. Besides, the analyst is advised to recalibrate the models whenever it is possible and adequate resources are available. 2.2.1 Segment Models The general steps to calculate the C-proxy and use the procedure for roadway segment models are as follows: Step 1. Find the total number of crashes (N ) and the total segment length (L on the network facility. Step 2. Find the mean value of the traffic flow [the mean value of ADT or AADT (ADT, or AADT )] on the facility. Note that if the mean value of the ADT or AADT is not available to the analyst, it is advised to calculate the mean value by randomly collecting ADT from a limited number of sites that provide the overall representation of the network. Step 3. Consider the roadway segment base SPF (i.e., the model with all CMFs at the base condition) from the HSM. Let b and b denote the intercept and the coefficient of traffic flow respectively. Find N as: N e (8) Step 4. Find the C using the following equation: 7

C N (9) N L Step 5. Find the variable e as follows: e C C C 100 (10) where C denote the C that was calculated in the reference year. Step 6. If e 10%, the model needs to be recalibrated; hence, calibrate the model and set the current C as the new C. Otherwise, keep the current C and use the calibration factor that was estimated in the reference year. 2.2.2 Intersection models A similar procedure with small modifications can be proposed to investigate if the intersection models are required to be recalibrated. The general steps to calculate the C-proxy and use the recalibration procedure for intersection models are as follows: Step 1. Find the total number of crashes (N ) and the total number of intersections (N) on the network facility. Step 2. Find the average traffic flow on the major road (F ) and minor road (F ). Note: If the mean value of traffic flows is not available, it is advised to find the mean value by randomly collecting the traffic flows for a limited number of intersections that provide the overall representation of the intersection type. Step 3. Consider the intersection base SPF from the HSM. Let b, b and b denote the intercept, the coefficient of the traffic flow on the major road, and the coefficient of the traffic flow on the minor road respectively. Find the variable N as: 8

N e (11) Step 4. Find the C using the Equation (9). Note: for this case, in Equation (9), the total length variable (L ) should be replaced with the total number of intersections (N). Step 5. Find e using the Equation (10). Step 6. If e 10%, the model needs to be recalibrated; hence, calibrate the model and set the current C as the new C. Otherwise, keep the current C and use the calibration factor that was estimated in the reference year. 3. VALIDATING THE PROCEDURE WITH OBSERVED DATASETS In this section, the proposed procedure is evaluated for several facilities in Texas and Michigan. For each example, the C-proxy is calculated and the procedure is applied to determine when recalibration is needed. Then, the procedure is validated by calculating the change in C-factor over time and comparing the results with the recommended outcome of the procedure. Recall that the C-proxy is computed given three sources of information at the network level as shown in Equation (7); on the other hand, the C-factor is estimated by collecting detailed data at individual sites as shown in Equation (1). First, the procedure is validated for the Texas urban four-lane divided arterials. The crash data for this facility were collected in three-year frequency from 2007 to 2014. This means, for example, if the traffic and geometric data in 2009 are considered then the crash data in this dataset will include crashes occurred in a three year period from 2007 to 2009. All variables in the dataset met the base conditions except the median width. Thus, the model includes one CMF for the median width. Next, the same procedure is examined for Texas rural multilane divided segments. The crash data were collected in three-year frequency from 2007 to 2014. For this dataset, the SPF included three CMF modifications: lane width, right shoulder width and the median width. Finally, the recommendations are evaluated for four-legged signalized intersections in Michigan. The crash data were collected in one-year frequency from 2008 to 2012. For this dataset, the SPF included four CMF modifications: left turn lane, right turn lane, right turn on red and lighting. 9

3.1 Texas Urban Four-Lane Divided Arterials For this facility, crash data were collected in three-year frequency. Except the median width, all other variables met the base conditions. For the purpose of analysis, it is assumed that the agency calibrated the model in 2009 using data that were collected from 2007 to 2009. Although the calibration will be conducted after year 2009 is completed, for convenience purpose, the calibration year is referred to as 2009. This year is set as the reference year and it s C-proxy as the reference C-proxy (i.e., C =1.055). Table 1 shows the results of applying the proposed procedure to this dataset. <TABLE 1 here> In 2010, the analyst needs to use data from 2008 to 2010 and find the C-proxy (C. The relative difference between C and C is equal to 5.0 percent, which is less than 10 percent, so the method does not recommend the recalibration. To verify the decision, the calibration factor (Equation 1) in 2010 is also calculated and shown in the table. As it is shown, the difference between the calibration factor in 2010 and the one in the reference year is equal to 3.5 percent which is relatively small; hence, the decision based on the proposed procedure is confirmed by checking the change in C-factor. In 2011, the analyst needs to use the three-year data from 2009 to 2011 to calculate the C- proxy in 2011 (C ). Recall that recalibration was not recommended in 2010. Therefore, 2009 is still the reference year. The C-proxy in 2011 is compared to the C. Since the C-proxy is changed by more than 10 percent, the model is recommended for recalibration. Therefore, the model is recalibrated and the reference year and C are modified accordingly. This decision is also supported by calculations based on the change in C-factor between those years. In 2012, the data from 2010 to 2012 is used to find the C-proxy. The C-proxy is changed by 10 percent; hence, recalibration is recommended. To verify the procedure outcome, the calibration factor is also calculated. As shown in Table 1, in this year, the calibration factor is also increased by about 10 percent compared to the reference year; therefore, the outcome of the procedure is verified. The model is recalibrated and the reference year is set to 2011 and C is modified accordingly. 10

The same procedure is repeated for subsequent years. As indicated in Table 1, the model, again, is needed for recalibration in 2014 once the analyst observes an 18 percent change in C-proxy. Figure 1 shows the value of the C-proxy in different periods for the analyzed data. In this figure, points are marked in red if recalibration is recommended, and in green otherwise. <FIGURE 1 here> As discussed above, we used a threshold of 10 percent for our analysis. In cases when the safety analyst would prefer to have a more accurate C-factor, a lower threshold, such as 5 percent, could be used. However, the model would then need to be calibrated more frequently. On the other hand, by choosing a higher threshold, such as 15 percent, a less accurate C-factor will be used, but the analyst will place less effort on calibrating models over time. In the end, the analyst needs to balance the recalibration process between accuracy and the effort or resources needed for recalibrating the models. 3.2 Texas Multi-Lane Divided Rural Segments Texas multi-lane divided rural segments data were collected from 2007 to 2014 in three-year frequency. This dataset represents a scenario where multiple variables do not meet base conditions. The SPF included three CMFs: lane width, right shoulder width and median width. Table 2 shows the recalibration decisions made for this dataset. For the purpose of analysis, it is assumed that the model was calibrated in 2009 using data from 2007 to 2009. Table 2 shows the results of applying the method to this dataset. <TABLE 2 here> As it is indicated in Table 2, until 2013, no significant change in C-proxy is observed and values remain below the 10 percent threshold; hence, for four years, recalibration is not recommended. The small change in calibration factors verifies this decision. In 2013, however, the C-proxy is increased by 10.0 percent compared to the reference C-proxy; therefore, recalibration is recommended. The significant increase (10.1 percent) in the calibration factor verifies the decision based on the proposed recommendations. In 2014, since the change in C-proxy is more than 10 percent, the model again is recommended for recalibration, even though the SPF was recalibrated in the previous year. Figure 2 shows the value of the C-proxy in different periods for 11

the analyzed data. Similar to the Figure 1, points marked in red indicate a recalibration decision, and green otherwise. <FIGURE 2 here> 3.3 Michigan Four-Legged Signalized Intersections To validate the proposed procedure for intersections, the crash data collected at four-legged signalized intersections in Michigan are used. The data were collected from 2008 to 2012 in oneyear frequency. For this dataset, the SPF included four CMF modifications: left turn lane, right turn lane, right turn on red and lighting. For the purpose of analysis, it was assumed that the predictive model was calibrated in 2008. Therefore, the reference year was set to 2008, and the C- proxy in 2008 was set as the reference C-proxy (C =1.346). Next, the recalibration need was investigated for subsequent years from 2009 to 2012 in one-year time periods. Table 3 presents the results of applying the proposed procedure to this dataset. For each year, the C-proxy was calculated and reported in Table 3. The new C-proxy was compared to the C-proxy in reference year. <TABLE 3 here> As shown in the table, since the C-proxy change is less than 10 percent for all periods of data, no recalibration is required for any of the subsequent years until 2012. To validate the results, the C-factors were also calculated and reported in the table. As indicated in Table 3, the change in C- factors in subsequent years is also negligible. This verifies the decision based on the proposed recommendations. 4. SUMMARY AND CONCLUSIONS Although less taxing than creating new models, the calibration of predictive models can still be a time-consuming and expensive task. Therefore, the analyst needs to know when or how often the models should be recalibrated to avoid unnecessary efforts and expenses. The HSM recommends deriving calibration factors at least every two or three years. However, there is no research to support this recommendation. In this research, we have documented a procedure to determine when the predictive models are advised to be recalibrated. The proposed procedure was successfully validated by applying them to several observed datasets in Texas and Michigan. The 12

procedure is straightforward and can be used for recalibrating any predictive models, irrespective of the type of facility, such as intersections and roadway segments. The only sources of information needed for segment models are: (1) the total number of crashes, (2) the mean value of ADT or AADT and (3) the total segment length. For intersection models, the only needed information are (1) the total number of crash data, (2) the average traffic flow on major and minor roads and (3) the number of intersections. The agency is required to secure these values periodically and follow the steps stated in the recommendations to make a decision on recalibration of the predictive models. Although the study recommendations are applicable to any threshold, further research is needed to determine the appropriate or statistically-oriented threshold for each facility being analyzed. In that context, although beyond the scope of this project, the uncertainty associated with the product of models and CMFs should be examined for determining the threshold for deciding when a model needs to be recalibrated (Lord, 2008; Bahar, 2014). ACKNOWLEDGMENTS The authors would like to thank the ATLAS Center for funding this project. The views expressed by the authors of this paper do not necessarily reflect the views of the Center. REFRENCES AASHTO, 2010. Highway Safety Manual (1st Edition). American Association of State Highways and Transportation Officials Transportation Research Board of the National Academies, Washington, DC. Alluri, P., Saha, D., & Gan, A. (2016). Minimum sample sizes for estimating reliable Highway Safety Manual (HSM) calibration factors. Journal of Transportation Safety & Security, 8(1), 56-74. Bahar, G. (2014). User s Guide to Develop Highway Safety Manual Safety Performance Function Calibration Factors. National Cooperative Highway Research Program, Transportation Research Board, Washington, D.C. Banihashemi, M. (2012). Sensitivity Analysis of Data Set Sizes for Highway Safety Manual Calibration Factors. Transportation Research Record 2279: Journal of the Transportation Research Board, pp. 75 81. Brimley, B. K., M. Saito, and G. G. Schultz (2012). Calibration of Highway Safety Manual Safety Performance Function. Transportation Research Record 2279: Journal of the Transportation Research Board, pp. 82 89. 13

Connors, R. D., M. Maher, A. Wood, L. Mountain, and K. Ropkins (2013). Methodology for Fitting and Updating Predictive Accident Models with Trend. Accident Analysis & Prevention, 56, pp. 82 94. Lord, D. (2008) Methodology for Estimating the Variance and Confidence Intervals of the Estimate of the Product of Baseline Models and AMFs. Accident Analysis & Prevention, Vol. 40, No. 3, pp. 1013-1017. Lord, D., Geedipally, S.R., Shirazi, M. (2016) Improved Guidelines for Estimating the Highway Safety Manual Calibration Factors. Final Report ATLAS-2015-10. Texas A&M Transportation Institute, College Station, TX. Lu, J., Haleem, K., Alluri, P., Gan, A., & Liu, K. (2014). Developing local safety performance functions versus calculating calibration factors for Safety Analyst applications: a Florida case study. Safety science, 65, 93-105. Mehta, G., and Y. Lou (2013). Calibration and Development of Safety Performance Functions for Alabama. Transportation Research Record 2398: Journal of the Transportation Research Board, pp.75 82. Shin, H., Y. J. Lee, and S. Dadvar (2014). The Development of Local Calibration Factors for Implementing the Highway Safety Manual in Maryland (No. MD-14-SP209B4J). Maryland State Highway Administration, Hanover, MD. Shirazi, M., D. Lord, S.R. Geedipally (2016). Sample-size guidelines for recalibrating crash prediction models: Recommendation for the Highway Safety Manual. Accident Analysis & Prevention, 93, pp.160-168. Williamson, M., and H. Zhou (2012). Develop Calibration Factors for Crash Prediction Models for Rural Two-Lane Roadways in Illinois. Procedia-Social and Behavioral Sciences, 43, pp. 330 338. Wood, A. G., L. J. Mountain, R. D. Connors, M. J. Maher, and K. Ropkins (2013). Updating Outdated Predictive Accident Models. Accident Analysis & Prevention, 55, pp: 54 66. Xie, F., K. Gladhill, K. K. Dixon, and C.M. Monsere (2011). Calibration of Highway Safety Manual Predictive Models for Oregon State Highways. Transportation Research Record 2241: Journal of the Transportation Research Board, pp. 19 28. Young, J. and Park, P.Y. (2013). Benefits of small municipalities using jurisdiction-specific safety performance functions rather than the Highway Safety Manual's calibrated or uncalibrated safety performance functions. Canadian Journal of Civil Engineering, 40(6), pp.517-527. 14

Table 1. Recalibration for Texas Urban Four-Lane Divided Arterials. Period 1 2 3 4 5 6 Year 2007-2009 2008-2010 2009-2011 2010-2012 2011-2013 2012-2014 Total Crashes 8,573 8,140 7,613 8,613 9,114 10,072 Avg. Traffic Flow 17,508 17,375 17,355 17,452 17,286 17,346 Total Length (Mile) 1,050.3 1,060.2 1,083.2 1,105.9 1,105.9 1,105.2 C-proxy 1.055 1.002 0.919 1.011 1.083 1.192 Reference C-proxy a - 1.055 (2009) b 1.055 (2009) 0.919 (2011) 1.011 (2012) 1.011 (2012) Change in C-proxy (%) c - 5.0 12.9 10.0 7.2 18.0 Recalibration Needed? - No Yes Yes No Yes -factor 1.011 0.976 0.887 0.978 1.039 1.116 Change in C-factor (%) - 3.5 12.3 10.2 6.3 14.1 a The number in parenthesis indicates the reference year (the time that the model was recalibrated). b The time that the reference year was changed for the first time is marked in bold. C The underlined numbers show the cases when recalibration was recommended. 15

Table 2. Recalibration for the Texas Multi-Lane Divided Rural Segments. Period 1 2 3 4 5 6 Year 2007-2009 2008-2010 2009-2011 2010-2012 2011-2013 2012-2014 Total Crashes 1,974 1,993 1,988 2,175 2,118 2,511 Avg. Traffic Flow 8,065 7,763 7,706 7,888 7,516 7,555 Total Length (Mile) 489.6 511.1 518.5 539.2 514.1 535.8 C-proxy 0.891 0.897 0.888 0.912 0.980 1.109 Reference C-proxy a - 0.891 (2009) b 0.891 (2009) 0.891 (2009) 0.891 (2009) 0.980 (2013) Change in C-proxy (%) c - 0.7 0.3 2.4 10.0 13.2 Recalibration Needed? - No No No Yes Yes -factor 0.961 0.961 0.946 0.994 1.058 1.120 Change in C-factor (%) - 0.0 1.6 3.4 10.1 5.9 a The number in parenthesis indicates the reference year (the time that the model was recalibrated). b The time that the reference year was changed for the first time is marked in bold. C The underlined numbers show the cases when recalibration was recommended. 16

Table 3. Recalibration for the Michigan Four-Legged Signalized Intersections. Period 1 2 3 4 5 Year 2008 2009 2010 2011 2012 Total Crashes 2,855 2,825 2,937 2,900 2,850 Avg. Major Flow 20,889 20,997 21,455 21,111 21,542 Avg. Minor Flow 8,781 8,832 9,034 8,890 9,056 No. of Intersections 349 349 349 348 345 C-proxy 1.346 1.323 1.338 1.352 1.307 Reference C-proxy a - 1.346 (2008) b 1.346 (2008) 1.346 (2008) 1.346 (2008) Change in C-proxy (%) - 1.7 0.6 0.5 2.9 Recalibration Needed? - No No No No -factor 1.827 1.802 1.825 1.844 1.785 Change in C-factor (%) - 1.4 0.1 0.9 2.3 a The number in parenthesis indicates the reference year (the time that the model was recalibrated). b The time that the reference year was changed for the first time is marked in bold. 17

1.25 1.20 1.15 C-Proxy 1.10 1.05 18.0% 1.00 0.95 12.9% 10.0% 0.90 0.85 1 2 3 4 5 6 Period Figure 1. The Value of C-proxy in Each Period of Analysis for Texas Four-Lane Divided Arterials. 18

1.15 1.10 1.05 13.2% C-Proxy 1.00 0.95 0.90 10.0% 0.85 0.80 1 2 3 4 5 6 Period Figure 2. The Value of C-proxy in Each Period of Analysis for Texas Multi-Lane Divided Rural Segments. 19