ASSESSING PROBABILITY DISTRIBUTIONS FROM DATA

Size: px
Start display at page:

Download "ASSESSING PROBABILITY DISTRIBUTIONS FROM DATA"

Transcription

1 ASSESSING PROBABILITY DISTRIBUTIONS FROM DATA INTRODUCTION VICTOR RICHMOND R. JOSE McDonough School of Business, Georgetown University, Washington, D.C. The task of assessing probabilities for uncertain events can be cognitively challenging even for the most skilled forecasters. This is most evident when one has limited prior experience with such events. The presence of data, however, can ease the burden on the decision maker by providing a good starting point for understanding the nature of the uncertainty, and such data hopefully would lead to accurate and reliable assessments. In many cases, it is almost impossible to find information directly related to the uncertainty of concern. Some related data, however, are typically available that could be used to improve our understanding of the uncertainty. For example, consider an engineer who is studying the failure time of a certain component. She is mainly concerned about whether this component will last until some ordered spares arrive, in order to prevent an disruption in the company s operations. She may study data on the same component or a closely related product for which information is available. This information may be used to generate a probability assessment for this component since it seems to be a reasonable assumption that the life of this component would be similar to that of other identical or similarly designed items. On the other hand, consider an insurance company which routinely assesses the likelihood that a customer would be involved in a motor vehicle accident. It would be a realistic assumption to say that based on certain demographics such as age, occupation, and driving experience, some individuals are more likely than others to be involved in an accident. Here, the related information can be used to help predict whether or not the customer would be involved in an accident say, in the next year. In these examples, data become useful in the probability assessment process. In the engineering example, assessments can be made by using a historical approach, that is, probabilities are assigned to events based on the assumption that past information is representative of current and future behavior. It works on the premise that the earlier events are drawn from a common distribution or process and that the event being assessed will follow as well. In the insurance example, however, it may no longer be plausible to assume that the chance of a particular individual getting into a motor vehicle accident is a simple random draw from the accident history of everyone in the population. Here, additional explanatory variables are used to make inferences about the event probability. ASSESSMENTS USING HISTORICAL DATA In the historical data approach, a probability assessment is made by generating a distribution based on past information and using this for analysis related to the uncertainty of interest. Framed as such, a large literature in statistics is available and has been devoted to address this problem. Parametric Approach Traditionally, constructing probability distributions from data often starts by considering a subset of probability measures that are indexed by some parameter θ R n.by considering functions characterized by this parameter θ, the assessment task is somewhat easier since the focus is then simply on estimating appropriate values for θ. Choosing a Model. Often, problems can be approached in many different ways and Wiley Encyclopedia of Operations Research and Management Science, edited by James J. Cochran Copyright 2010 John Wiley & Sons, Inc. 1

2 2 ASSESSING PROBABILITY DISTRIBUTIONS FROM DATA choosing an appropriate model can be somewhat confusing. Though there is no acid test to clearly determine which distribution or parametric family is appropriate in every occasion, research [1] suggests that considering simple questionnaires about the uncertainty (such as the one listed below) can be helpful in limiting the set of distributions that we need to consider. 1. Nature of the Sample Space. Is the random variable discrete, continuous or mixed (i.e., can it take only discrete values, is it continuous over some interval(s) or both)? Are we dealing with a univariate or multivariate distribution? 2. Bounds. Is the range of values bounded above? Or below? If so, what are reasonable upper and lower bounds? 3. Shape. Is the distribution symmetric? If it is skewed, in which direction is it skewed? 4. Concentration. What range of values is of primary interest to us? Is the modeling of the tails critical? Is the distribution unimodal? Or multimodal? 5. Underlying Process and Dependencies. Is there an underlying process? Do we expect the variable to be dependent on other variables? Are the variables correlated? Are other forms of dependencies known? Despite the fact that questionnaires like this one can be used to limit the parametric families that we should consider, it often does not single out a particular parametric family. Often, we still have to choose among several distributions. Working with a Model. For each of the models which we might consider, the next task for the decision maker is to generate the probability assessments that he or she needs. Consider our engineering example: suppose the records show that the last 15 units of the same component used in the plant lasted 12, 14, 15, 6, 9, 10, 11, 12, 14, 13, 13, 10, 14, 9, and 18 days. If the spares are scheduled to arrive in 10 days, then we can use this to estimate the probability that there will be a disruption. If we assume that the data we have observed comes from independent draws from a common distribution with the same probability of lasting less than 10 days, then we can say that a binomial distribution (i.e., f (x) = θ x (1 θ) n x ) might be appropriate for this. Having decided on a parametric model would then limit the problem to estimating an appropriate value for θ. Suppose that over a period of time or a series of experiments, you were able to collect a random sample X 1,..., X n coming from a distribution function F(x; θ), where θ is an unknown parameter belonging to some set R m (m 1). There are many ways to estimate θ. Perhaps the most commonly used approach used today is the maximum likelihood (ML) approach. Under this method, the data observed is assumed to be generated by some joint distribution L (characterized by the parameter θ) called a likelihood function L(θ) = f (x 1,..., x n θ). (1) The estimation is then made by finding the ˆθ, which maximizes the value of Equation (1), that is, ˆθ = arg max L(θ). (2) θ The estimate in Equation (2) can be interpreted as the parameter that gives the highest probability (or the one that makes it most likely ) that the data comes from the said process. A large literature has shown many interesting and useful statistical properties for the ML estimator, making it desirable to many researchers and practitioners. However, one challenge still encountered in this approach is that solving Equation (2) for certain functions can be computationally difficult. Going back to our example, it can be verified that the ML approach would yield an estimate for θ of 12/15 = 80%. Intuitively, this makes sense since 3 instances of the 15 were less than the delivery time of 10 days. Alternative estimation techniques which may yield significantly different results, are also available. Lehmann and Casella [2] provide a detailed discussion on different statistical estimation techniques.

3 ASSESSING PROBABILITY DISTRIBUTIONS FROM DATA 3 Naturally, an alternative model could also make the estimates significantly different. For example, if we assume that the lifetimes of the component follow say, a normal distribution, then the estimate for the mean and standard deviation using a momentmatching approach would be 12 and 2.95 respectively. This would mean that the probability that the component survives before the spare arrives would be P(Component Life 10) = P(Z 0.678) = 75.1%. An alternative approach to solving for point estimates is computing for the posterior distribution of the parameter θ and using this information to generate an estimate for the parameter. Given prior information on θ encoded in the distribution f (θ), Bayes rule allows us to compute the posterior distribution of θ as follows: f (θ x) = f (x 1,..., x n θ) f (θ) f (x1,..., x n θ) f (θ)dθ. (3) Typically, in many instances where no informationonθ is available, the statistical literature often relies on the use of noninformative (or diffuse) priors. Once a posterior distribution is attained, we can then use this for analysis. Retrospectively, we can also use this to make probabilistic statements about the uncertainty. The posterior mean of θ could be used as an estimate for θ, similar to what the ML approach tries to achieve. Other statistics (such as the median or other specific quantiles) could be used as well, depending on the problem. Compared to the earlier approach, this Bayesian treatment is a more careful analysis about the event uncertainty since the inherent uncertainties in θ is incorporated in the analysis through the prior distribution f (θ). Selecting among Models. If only one model is considered appropriate, then there is no ambiguity in which probability estimate to use. However, when multiple models are considered, we often have to choose which one should be used and reported. The typical approach in practice is to compare models based on their fit with the empirical data. Models are ranked using goodness-of-fit statistics. Some commonly used statistics include Pearson s chisquared (χ 2 ), Kolmogorov Smirnov (KS), and Anderson Darling (AD). D Agostino and Stephens [3] provide a more detailed treatise on this subject. Current statistical softwares have the ability to fit data into certain parametric families. In addition, users have the option of ranking the distributions based on fit using some common goodness-of-fit statistics. For example, Crystal Ball, a commonly used Monte Carlo simulation package, estimates parameters for distributions using an ML approach and users have the option to rank distributions based on either the χ 2, KS, or AD statistic. Figure 1 shows a sample output generated by Crystal Ball for a small financial data set. For these data-fitting techniques, one important issue that often happens in practice is dealing with small data sets. Research [4] has shown that for several well-known distributions, small differences in the estimates have a large effect on decision and risk analysis studies, especially when the uncertainty has a high relative standard deviation. Extra caution is needed when relying on assessments and analysis that are generated from such small data sets. Nonparametric Approach Once in a while, an individual may not have sufficient information to select a class of parametric families to use. In these cases, making certain assumptions on the functional form of the distribution might lead to large errors instead of improvements in the model. In a risk analysis study [5], it was shown that incorrect substitutions in functional forms of probability distributions generated errors that can grow by a factor on the order of n for a data set of n measurements. An alternative to forcing a choice for a parametric model is to allow the empirical data to determine the distribution on its own. In this nonparametric approach, the empirical distribution is used as the probability distribution in the analysis. This practice is commonly done in Monte Carlo simulation for determining some distribution when none of the standard distributions used seem to

4 4 ASSESSING PROBABILITY DISTRIBUTIONS FROM DATA Figure 1. Data-fitting option of Crystal Ball. Figure 2. Using empirical distributions in a Monte Carlo setting. be appropriate. Many Monte Carlo and statistical software packages now allow users to use empirical distributions in their analysis. Figure 2 shows the tool in Crystal Ball that could be used to create a custom distribution. Though the freedom from choosing a parametric family seems to be a better alternative, the nonparametric approach has its own setback. The main disadvantage of this approach is that the quality and reliability of the analysis exponentially decreases as the number of data points drop. To a lesser extent, tractability may also provide some problems but with the presence of modern computing this is not a big issue unless the dimension or size of the problem is large. ASSESSMENTS USING EXPLANATORY VARIABLES A different and equally valid approach in using data to assess probability distributions is to use historical and current information to predict certain events. For example, consider a firm that has decided to provide a new type of small short-term loans to its customers.

5 ASSESSING PROBABILITY DISTRIBUTIONS FROM DATA 5 In this case, one useful assessment question that might be asked by this firm is how likely is it that an individual will default a loan if one gets approved. In this case, historical information on default rates may not be as useful since customers with different backgrounds and profiles may not be likely to default their loans at the same rate. If there is historical data on loans of a similar type, then some analyses could be done to estimate the probability that an individual would default. In particular, information such as age, household income, level of education, amount of credit card debt, and current credit rating score might be useful in predicting the default rates for each individual applicant. Tools from Categorical Data Analysis One class of models that could be used to analyze explanatory variables is called discrete choice models, which are models that predict what category an uncertain event would materialize as a function of any number of variables that are believed to have an influence on the event. Binary Models. Consider a Bernoulli random variable Y i, which depends on a set of explanatory variables (X i1,..., X in ). An initial model that could be considered is a linear probability model, which tries to linearly model the probability through the explanatory variables, that is, p i E(Y i ) = β 0 + β 1 X i1 + +β n X in. (4) Two problems are typical in an approach such as this. First, the estimate that you might get may fall outside the interval [0,1]. The other issue is that the event probability may not be linearly related to the explanatory variable. Typically, you may expect that as you go to the extremities, changes in the probability would start to become smaller. An alternative approach which addresses these issues and has also been popular in the literature is logistic regression. Logistic regressions are models that fit the log-odds ratio (or logit ) of a binomial random variable as a linear function of the predictor variables {X ij } n j=1,thatis, ( ) pi logit(p i ) = ln 1 p i = β 0 + β 1 X i1 + +β n X in. (5) Equation (5) is equivalent to p i = P(Y i = 1) = exp (β 0 +β 1 X i1 + +βnx in ). (6) The term logistic comes from the fact that Equation (6) is similar in form to a logistic curve/function which has the form f (x) = (1 + exp( x)) 1. Figure 3 shows an example comparing a linear probability model with a logistic regression. To solve for the coefficients in a logistic model, ML estimation is performed. In certain instances the ML estimation procedure can be computationally tedious, especially for Probability Linear model Logit model Dependent variable Figure 3. A comparison of the linear probability and logit models.

6 6 ASSESSING PROBABILITY DISTRIBUTIONS FROM DATA large problems. However, standard statistical software such as Stata, SAS, and SPSS now have a logistic regression tool built into their standard packages; that does these computations quickly and efficiently. Multinomial Models. In cases where there are more than two possible states or categories, the binary model can be extended to the multinomial case. If Y i can take on the values {1, 2,..., m}, then we can write the logit equations as logit(p ij ) = β 0j + β 1j X i1 + +β nj X in, (7) for j i. Since we also know that j p ij = 1, we only need m 1 equations for Equation (7). Hence, one category (WLOG say state 1) is typically chosen to be the reference state; that is, the log-odds ratio is always compared to the first state making things easier to compare or mathematically, [ P(Yi = j) ] logit(p ij ) = ln. (8) P(Y i = 1) Then, Equation (7) can be rewritten as p i1 = P(Y i = 1) 1 =, m 1 + exp(β 0k + β 1k X i1 + +β nk X in ) k=2 (9) p ij = P(Y i = j) exp(β 0j + β 1j X i1 + +β nj X in ) = m 1 + exp(β 0k + β 1k X i1 + +β nk X in ) k=2 for j 1. (10) Again, many software packages available today incorporate a multinomial logistic regression tool. In addition, other multinomial logistic models (ordered state space models, conditional logistic models) are available in these packages. An excellent discussion of these more sophisticated models is provided in Refs 6 and 7. Tools from Time Series Models and Forecasting Alternatively, we can also use some inferential techniques to generate probability assessments over continuous intervals. Consider a fashion retailer who is trying to provide probabilities for sales for the next few quarters. Given the presence of past data, projections for future revenue streams can be generated say, by using some time series forecasting technique that could be used to provide a predictive distribution for future sales. Figure 4 provides an example of a data series where seasonal patterns are quite evident; draws from historical values might be misleading because of the growth trend that has been experienced by this firm. Moreover, one may be interested in using other predictors outside the historical data for making predictions. Generalized linear modelsmightbeusedtopredictsay,thesales, using other variables such as the advertising Figure 4. A sample time series forecast output from a statistical package. 0 Q3/96 Q3/99 Q3/02 Q3/05 Q3/08 Q3/11

7 ASSESSING PROBABILITY DISTRIBUTIONS FROM DATA 7 budget, lagged sales, market share, economic indicators, and the like. The predictive distribution of the forecast can then be used as the probability distribution for the unknown quantity. Here, the emphasis is not on the point forecast generated by the model but rather the probabilistic information that is contained in the predictive distribution. SOME CONSIDERATIONS The use of data can be very useful in determining an appropriate probability distribution for an uncertain event. Historical data about the uncertainty can provide a glimpse of the past behavior of the uncertainties being studied, while related information may provide a good basis for making inferences about these uncertainties. Despite the many advantages of using data, one must still be cautious of falling into the following traps that are common in this process: 1. Framing Traps. Frame blindness refers to the tendency of approaching a problem in an inappropriate manner because one has already accepted a specific framework with little thought leading to the discounting of better alternatives [8]. This narrowness in perspective often blinds decision makers. A common tendency is to lock in on a data set and discard any other theory or model that may reject this data set before doing a reasonable exploration of options. This can be due to a number of reasons such as having an initial cost associated with the data set, the data set being accepted as a common standard in the past, or perhaps its being attributable to a specific individual/source/group. 2. Illusion of Objectivity. One particular phenomenon with the use of data is the illusion of objectivity [9]. This refers to the distortion in the stakeholder s understanding of the situation by making him or her believe that the assessments made using the data are purely objective and free from any input from individuals. In many cases, the data used in the generation of probability assessments are devoid of human inputs but the choice of which data to use and how to model the data (including, for example, distributional assumptions or choice of a model in a regression setting) often involves a decision maker who ascertains why the use of such data over others is appropriate. 3. Purity of Data. Related to the notion of objectivity is the purity of data phenomenon, where decision makers often eliminate all sources of information that are attributable to individuals. In many instances however, information from individuals can be very useful and insightful. Research has shown that combining forecasts (see Bayesian Aggregation of Experts Forecasts; Combining Forecasts.) can lead to improvements in forecasting accuracy and this is true regardless of whether the forecasts come from individuals, groups, models or data. Data can be a powerful tool in providing us more information about uncertainties that we have limited information on. They are useful as long as we keep in mind that data can never be used without a sprinkling of caution and a good dose of critical thought. REFERENCES 1. Lipton J, Shaw WD, Holmes J, et al. Short communication: Selecting input distributions for use in Monte Carlo simulations. Regul Toxicol Pharm 1995;21: Lehmann EL, Casella G. Theory of point estimation, Springer texts in statistics. 2nd ed. New York: Springer; D Agostino RB, Stephens M. Statistics: a series of textbooks and monographs. Volume 68, Goodness-of-fit techniques: New York: Marcel Dekker; Haas C. Importance of distributional form in characterizing inputs in Monte Carlo risk assessments. Risk Anal 1997;17(1): Seiler FA, Alvarez JL. On the selection of distributions for stochastic variables. Risk Anal 1996;16(1):5 18.

8 8 ASSESSING PROBABILITY DISTRIBUTIONS FROM DATA 6. Agresti A. Categorical data analysis. 2nd ed. Wiley series in probability and statistics. Hoboken (NJ): Wiley-Interscience; Hosmer DW, Lemeshow S. Applied logistic regression. Wiley series in probability and statistics. Hoboken (NJ): Wiley-Interscience; Russo JE, Schoemaker PJH. Decision traps: ten barriers to brilliant decision making and how to overcome them. New York: Doubleday; Berger JO, Berry DA. Statistical analysis and the illusion of objectivity. Am Sci 1988; 76(2): FURTHER READING Frey HC, Burmaster DE. Methods for characterizing variability and uncertainty: comparison of bootstrap simulation and likelihood-based approaches. Risk Anal 1999;19(1): Greene WH. Econometric analysis. 6th ed. New Jersey: Prentice Hall; Hamed M, Bedient P. On the effect of probability distributions of input variables in public health risk assessment. Risk Anal 1997;17(1): Hamilton JD. Time series analysis. New Jersey: Princeton University Press; Hattis D, Burmaster DE. Assessment of variability and uncertainty distributions for practical risk analysis. Risk Anal 1994;14(5): Kleinbaum DG, Klein M. Logistic regression. 2nd ed. New York: Springer; Lloyd CJ. Statistical analysis of categorical data. New York: Wiley Interscience; Thompson KM. Software review of distribution fitting programs: Crystal Ball and BestFit Add- In Hum Ecol Risk Assess 1999; 5(3):

Distinguish between different types of numerical data and different data collection processes.

Distinguish between different types of numerical data and different data collection processes. Level: Diploma in Business Learning Outcomes 1.1 1.3 Distinguish between different types of numerical data and different data collection processes. Introduce the course by defining statistics and explaining

More information

CREDIT RISK MODELLING Using SAS

CREDIT RISK MODELLING Using SAS Basic Modelling Concepts Advance Credit Risk Model Development Scorecard Model Development Credit Risk Regulatory Guidelines 70 HOURS Practical Learning Live Online Classroom Weekends DexLab Certified

More information

Model Validation of a Credit Scorecard Using Bootstrap Method

Model Validation of a Credit Scorecard Using Bootstrap Method IOSR Journal of Economics and Finance (IOSR-JEF) e-issn: 2321-5933, p-issn: 2321-5925.Volume 3, Issue 3. (Mar-Apr. 2014), PP 64-68 Model Validation of a Credit Scorecard Using Bootstrap Method Dilsha M

More information

Incorporating DSM Uncertainty and Flexibility into Integrated Resource Planning

Incorporating DSM Uncertainty and Flexibility into Integrated Resource Planning Incorporating DSM Uncertainty and Flexibility into Integrated Resource Planning Eric W. Hildebrandt, RCG/Hagler, Bailly, Inc. Robert M. Wirtshafter, University of Pennsylvania This paper presents an approach

More information

Introduction to Business Research 3

Introduction to Business Research 3 Synopsis Introduction to Business Research 3 1. Orientation By the time the candidate has completed this module, he or she should understand: what has to be submitted for the viva voce examination; what

More information

Learning Structured Preferences

Learning Structured Preferences Learning Structured Preferences Leon Bergen 1, Owain R. Evans, Joshua B. Tenenbaum {bergen, owain, jbt}@mit.edu Department of Brain and Cognitive Sciences Massachusetts Institute of Technology Cambridge,

More information

Introduction to Analytics Tools Data Models Problem solving with analytics

Introduction to Analytics Tools Data Models Problem solving with analytics Introduction to Analytics Tools Data Models Problem solving with analytics Analytics is the use of: data, information technology, statistical analysis, quantitative methods, and mathematical or computer-based

More information

Market mechanisms and stochastic programming

Market mechanisms and stochastic programming Market mechanisms and stochastic programming Kjetil K. Haugen and Stein W. Wallace Molde University College, Servicebox 8, N-6405 Molde, Norway E-mail: Kjetil.Haugen/Stein.W.Wallace@himolde.no 18.12.01

More information

Near-Balanced Incomplete Block Designs with An Application to Poster Competitions

Near-Balanced Incomplete Block Designs with An Application to Poster Competitions Near-Balanced Incomplete Block Designs with An Application to Poster Competitions arxiv:1806.00034v1 [stat.ap] 31 May 2018 Xiaoyue Niu and James L. Rosenberger Department of Statistics, The Pennsylvania

More information

Industrial Engineering Prof. Inderdeep Singh Department of Mechanical and Industrial Engineering Indian Institute of Technology, Roorkee

Industrial Engineering Prof. Inderdeep Singh Department of Mechanical and Industrial Engineering Indian Institute of Technology, Roorkee Industrial Engineering Prof. Inderdeep Singh Department of Mechanical and Industrial Engineering Indian Institute of Technology, Roorkee Module - 04 Lecture - 04 Sales Forecasting I A very warm welcome

More information

GETTING STARTED WITH PROC LOGISTIC

GETTING STARTED WITH PROC LOGISTIC PAPER 255-25 GETTING STARTED WITH PROC LOGISTIC Andrew H. Karp Sierra Information Services, Inc. USA Introduction Logistic Regression is an increasingly popular analytic tool. Used to predict the probability

More information

On Distribution Asset Management: Development of Replacement Strategies

On Distribution Asset Management: Development of Replacement Strategies IEEE PES PowerAfrica 27 Conference and Exposition Johannesburg, South Africa, 16-2 July 27 On Distribution Asset Management: Development of Replacement Strategies 1 Miroslav Begovic, Fellow, IEEE, 1 Joshua

More information

A CUSTOMER-PREFERENCE UNCERTAINTY MODEL FOR DECISION-ANALYTIC CONCEPT SELECTION

A CUSTOMER-PREFERENCE UNCERTAINTY MODEL FOR DECISION-ANALYTIC CONCEPT SELECTION Proceedings of the 4th Annual ISC Research Symposium ISCRS 2 April 2, 2, Rolla, Missouri A CUSTOMER-PREFERENCE UNCERTAINTY MODEL FOR DECISION-ANALYTIC CONCEPT SELECTION ABSTRACT Analysis of customer preferences

More information

SAS/STAT 14.1 User s Guide. Introduction to Categorical Data Analysis Procedures

SAS/STAT 14.1 User s Guide. Introduction to Categorical Data Analysis Procedures SAS/STAT 14.1 User s Guide Introduction to Categorical Data Analysis Procedures This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual

More information

Decision Support and Business Intelligence Systems

Decision Support and Business Intelligence Systems Decision Support and Business Intelligence Systems (9 th Ed., Prentice Hall) Chapter 4: Modeling and Analysis Learning Objectives Understand the basic concepts of management support system (MSS) modeling

More information

Statistics, Data Analysis, and Decision Modeling

Statistics, Data Analysis, and Decision Modeling - ' 'li* Statistics, Data Analysis, and Decision Modeling T H I R D E D I T I O N James R. Evans University of Cincinnati PEARSON Prentice Hall Upper Saddle River, New Jersey 07458 CONTENTS Preface xv

More information

Getting Started with OptQuest

Getting Started with OptQuest Getting Started with OptQuest What OptQuest does Futura Apartments model example Portfolio Allocation model example Defining decision variables in Crystal Ball Running OptQuest Specifying decision variable

More information

In any particular situation, dealing with environmental risk involves two steps:

In any particular situation, dealing with environmental risk involves two steps: Presented by: Gargi Bhatt 04/27/2005 ENVIRONMENTAL RISK ASSESSMENT Introduction: The field of Environmental statistics is one of the rapid growths at the moment. Environmental decision making is prevalent

More information

Chapter Six- Selecting the Best Innovation Model by Using Multiple Regression

Chapter Six- Selecting the Best Innovation Model by Using Multiple Regression Chapter Six- Selecting the Best Innovation Model by Using Multiple Regression 6.1 Introduction In the previous chapter, the detailed results of FA were presented and discussed. As a result, fourteen factors

More information

Joint Adoption of Conservation Agricultural Practices by Row Crop Producers in Alabama

Joint Adoption of Conservation Agricultural Practices by Row Crop Producers in Alabama Joint Adoption of Conservation Agricultural Practices by Row Crop Producers in Alabama Jason S. Bergtold, Agricultural Economist, USDA-ARS-NSDL, Auburn, AL Manik Anand, Graduate Student, Auburn University,

More information

THE LEAD PROFILE AND OTHER NON-PARAMETRIC TOOLS TO EVALUATE SURVEY SERIES AS LEADING INDICATORS

THE LEAD PROFILE AND OTHER NON-PARAMETRIC TOOLS TO EVALUATE SURVEY SERIES AS LEADING INDICATORS THE LEAD PROFILE AND OTHER NON-PARAMETRIC TOOLS TO EVALUATE SURVEY SERIES AS LEADING INDICATORS Anirvan Banerji New York 24th CIRET Conference Wellington, New Zealand March 17-20, 1999 Geoffrey H. Moore,

More information

THREE LEVEL HIERARCHICAL BAYESIAN ESTIMATION IN CONJOINT PROCESS

THREE LEVEL HIERARCHICAL BAYESIAN ESTIMATION IN CONJOINT PROCESS Please cite this article as: Paweł Kopciuszewski, Three level hierarchical Bayesian estimation in conjoint process, Scientific Research of the Institute of Mathematics and Computer Science, 2006, Volume

More information

TIMEBOXING PLANNING: BUFFERED MOSCOW RULES

TIMEBOXING PLANNING: BUFFERED MOSCOW RULES TIMEBOXING PLANNING: BUFFERED MOSCOW RULES EDUARDO MIRANDA, INSTITUTE FOR SOFTWARE RESEARCH, CARNEGIE MELLON UNIVERSITY, SEPTEMBER 2011 ABSTRACT Time boxing is a management technique which prioritizes

More information

Use Multi-Stage Model to Target the Most Valuable Customers

Use Multi-Stage Model to Target the Most Valuable Customers ABSTRACT MWSUG 2016 - Paper AA21 Use Multi-Stage Model to Target the Most Valuable Customers Chao Xu, Alliance Data Systems, Columbus, OH Jing Ren, Alliance Data Systems, Columbus, OH Hongying Yang, Alliance

More information

STATISTICAL TECHNIQUES. Data Analysis and Modelling

STATISTICAL TECHNIQUES. Data Analysis and Modelling STATISTICAL TECHNIQUES Data Analysis and Modelling DATA ANALYSIS & MODELLING Data collection and presentation Many of us probably some of the methods involved in collecting raw data. Once the data has

More information

Introduction to Categorical Data Analysis Procedures (Chapter)

Introduction to Categorical Data Analysis Procedures (Chapter) SAS/STAT 12.1 User s Guide Introduction to Categorical Data Analysis Procedures (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 12.1 User s Guide. The correct bibliographic

More information

Credit Risk Models Cross-Validation Is There Any Added Value?

Credit Risk Models Cross-Validation Is There Any Added Value? Credit Risk Models Cross-Validation Is There Any Added Value? Croatian Quants Day Zagreb, June 6, 2014 Vili Krainz vili.krainz@rba.hr The views expressed during this presentation are solely those of the

More information

Empirical Exercise Handout

Empirical Exercise Handout Empirical Exercise Handout Ec-970 International Corporate Governance Harvard University March 2, 2004 Due Date The new due date for empirical paper is Wednesday, March 24 at the beginning of class. Description

More information

An Analysis of Cointegration: Investigation of the Cost-Price Squeeze in Agriculture

An Analysis of Cointegration: Investigation of the Cost-Price Squeeze in Agriculture An Analysis of Cointegration: Investigation of the Cost-Price Squeeze in Agriculture Jody L. Campiche Henry L. Bryant Agricultural and Food Policy Center Agricultural and Food Policy Center Department

More information

GETTING STARTED WITH PROC LOGISTIC

GETTING STARTED WITH PROC LOGISTIC GETTING STARTED WITH PROC LOGISTIC Andrew H. Karp Sierra Information Services and University of California, Berkeley Extension Division Introduction Logistic Regression is an increasingly popular analytic

More information

Non-Operational Stockpile Reliability Prediction Methods Using Logistic Regression Techniques

Non-Operational Stockpile Reliability Prediction Methods Using Logistic Regression Techniques Non-Operational Stockpile Reliability Prediction Methods Using Logistic Regression Techniques Lou Gullo Raytheon Missile Systems October, 2013 Copyright 2013 Raytheon Company. All rights reserved. Customer

More information

WRITTEN PRELIMINARY Ph.D. EXAMINATION. Department of Applied Economics. University of Minnesota. June 16, 2014 MANAGERIAL, FINANCIAL, MARKETING

WRITTEN PRELIMINARY Ph.D. EXAMINATION. Department of Applied Economics. University of Minnesota. June 16, 2014 MANAGERIAL, FINANCIAL, MARKETING WRITTEN PRELIMINARY Ph.D. EXAMINATION Department of Applied Economics University of Minnesota June 16, 2014 MANAGERIAL, FINANCIAL, MARKETING AND PRODUCTION ECONOMICS FIELD Instructions: Write your code

More information

Center for Demography and Ecology

Center for Demography and Ecology Center for Demography and Ecology University of Wisconsin-Madison A Comparative Evaluation of Selected Statistical Software for Computing Multinomial Models Nancy McDermott CDE Working Paper No. 95-01

More information

Forecasting Intermodal Competition in a Multimodal Environment

Forecasting Intermodal Competition in a Multimodal Environment TRANSPORTATION RESEARCH RECORD 1139 15 Forecasting Intermodal Competition in a Multimodal Environment KEVIN NEELS AND JOSEPH MATHER In this paper, the problem or accurately describing patterns of intermodal

More information

Examination of Cross Validation techniques and the biases they reduce.

Examination of Cross Validation techniques and the biases they reduce. Examination of Cross Validation techniques and the biases they reduce. Dr. Jon Starkweather, Research and Statistical Support consultant. The current article continues from last month s brief examples

More information

NEW CURRICULUM WEF 2013 M. SC/ PG DIPLOMA IN OPERATIONAL RESEARCH

NEW CURRICULUM WEF 2013 M. SC/ PG DIPLOMA IN OPERATIONAL RESEARCH NEW CURRICULUM WEF 2013 M. SC/ PG DIPLOMA IN OPERATIONAL RESEARCH Compulsory Course Course Name of the Course Credits Evaluation % CA WE MA 5610 Basic Statistics 3 40 10 60 10 MA 5620 Operational Research

More information

Folia Oeconomica Stetinensia DOI: /foli FORECASTING RANDOMLY DISTRIBUTED ZERO-INFLATED TIME SERIES

Folia Oeconomica Stetinensia DOI: /foli FORECASTING RANDOMLY DISTRIBUTED ZERO-INFLATED TIME SERIES Folia Oeconomica Stetinensia DOI: 10.1515/foli-2017-0001 FORECASTING RANDOMLY DISTRIBUTED ZERO-INFLATED TIME SERIES Mariusz Doszyń, Ph.D., Associate Prof. University of Szczecin Faculty of Economics and

More information

Control Charts for Customer Satisfaction Surveys

Control Charts for Customer Satisfaction Surveys Control Charts for Customer Satisfaction Surveys Robert Kushler Department of Mathematics and Statistics, Oakland University Gary Radka RDA Group ABSTRACT Periodic customer satisfaction surveys are used

More information

Sawtooth Software. Sample Size Issues for Conjoint Analysis Studies RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.

Sawtooth Software. Sample Size Issues for Conjoint Analysis Studies RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc. Sawtooth Software RESEARCH PAPER SERIES Sample Size Issues for Conjoint Analysis Studies Bryan Orme, Sawtooth Software, Inc. 1998 Copyright 1998-2001, Sawtooth Software, Inc. 530 W. Fir St. Sequim, WA

More information

Identification of the essential genes in the M. tuberculosis genome by random transposon mutagenesis

Identification of the essential genes in the M. tuberculosis genome by random transposon mutagenesis Identification of the essential genes in the M. tuberculosis genome by random transposon mutagenesis Karl W Broman Department of Biostatistics Johns Hopkins Bloomberg School of Public Health www.biostat.jhsph.edu/

More information

Progress Report: Predicting Which Recommended Content Users Click Stanley Jacob, Lingjie Kong

Progress Report: Predicting Which Recommended Content Users Click Stanley Jacob, Lingjie Kong Progress Report: Predicting Which Recommended Content Users Click Stanley Jacob, Lingjie Kong Machine learning models can be used to predict which recommended content users will click on a given website.

More information

BOOTSTRAPPING AND CONFIDENCE INTERVALS

BOOTSTRAPPING AND CONFIDENCE INTERVALS BOOTSTRAPPING AND CONFIDENCE INTERVALS Fill out your reading report on Learning Suite MPA 630: Data Science for Public Management November 8, 2018 PLAN FOR TODAY Why are we even doing this? Confidence

More information

Introduction to Quantitative Genomics / Genetics

Introduction to Quantitative Genomics / Genetics Introduction to Quantitative Genomics / Genetics BTRY 7210: Topics in Quantitative Genomics and Genetics September 10, 2008 Jason G. Mezey Outline History and Intuition. Statistical Framework. Current

More information

RELIABILITY ANALYSIS OF THERMAL POWER GENERATING UNITS BASED ON WORKING HOURS

RELIABILITY ANALYSIS OF THERMAL POWER GENERATING UNITS BASED ON WORKING HOURS Journal of Reliability and Statistical Studies; ISSN (Print): 0974-8024, (Online):2229-5666 Vol. 7, Issue 1 (2014): 113-123 RELIABILITY ANALYSIS OF THERMAL POWER GENERATING UNITS BASED ON WORKING HOURS

More information

Republic of the Philippines LEYTE NORMAL UNIVERSITY Paterno St., Tacloban City. Course Syllabus in FD nd Semester SY:

Republic of the Philippines LEYTE NORMAL UNIVERSITY Paterno St., Tacloban City. Course Syllabus in FD nd Semester SY: Republic of the Philippines LEYTE NORMAL UNIVERSITY Paterno St., Tacloban City Course Syllabus in FD 602 2 nd Semester SY: 2015-2016 I. COURSE CODE : FD 602 II. COURSE TITLE : Advanced Statistics I III.

More information

Unit 14: Introduction to the Use of Bayesian Methods for Reliability Data

Unit 14: Introduction to the Use of Bayesian Methods for Reliability Data Unit 14: Introduction to the Use of Bayesian Methods for Reliability Data Ramón V. León Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and

More information

Unit 14: Introduction to the Use of Bayesian Methods for Reliability Data. Ramón V. León

Unit 14: Introduction to the Use of Bayesian Methods for Reliability Data. Ramón V. León Unit 14: Introduction to the Use of Bayesian Methods for Reliability Data Ramón V. León Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and

More information

Chapter 3. Basic Statistical Concepts: II. Data Preparation and Screening. Overview. Data preparation. Data screening. Score reliability and validity

Chapter 3. Basic Statistical Concepts: II. Data Preparation and Screening. Overview. Data preparation. Data screening. Score reliability and validity Chapter 3 Basic Statistical Concepts: II. Data Preparation and Screening To repeat what others have said, requires education; to challenge it, requires brains. Overview Mary Pettibone Poole Data preparation

More information

International Journal of Scientific & Engineering Research, Volume 8, Issue 3, March ISSN

International Journal of Scientific & Engineering Research, Volume 8, Issue 3, March ISSN International Journal of Scientific & Engineering Research, Volume 8, Issue 3, March-2017 381 On empirical power of univariate normality testsunder symmetric, asymmetric and scaled distributions Muyombya

More information

COMPARISON OF LOGISTIC REGRESSION MODEL AND MARS CLASSIFICATION RESULTS ON BINARY RESPONSE FOR TEKNISI AHLI BBPLK SERANG TRAINING GRADUATES STATUS

COMPARISON OF LOGISTIC REGRESSION MODEL AND MARS CLASSIFICATION RESULTS ON BINARY RESPONSE FOR TEKNISI AHLI BBPLK SERANG TRAINING GRADUATES STATUS International Journal of Humanities, Religion and Social Science ISSN : 2548-5725 Volume 2, Issue 1 2017 www.doarj.org COMPARISON OF LOGISTIC REGRESSION MODEL AND MARS CLASSIFICATION RESULTS ON BINARY

More information

Estimating Discrete Choice Models of Demand. Data

Estimating Discrete Choice Models of Demand. Data Estimating Discrete Choice Models of Demand Aggregate (market) aggregate (market-level) quantity prices + characteristics (+advertising) distribution of demographics (optional) sample from distribution

More information

Kuhn-Tucker Estimation of Recreation Demand A Study of Temporal Stability

Kuhn-Tucker Estimation of Recreation Demand A Study of Temporal Stability Kuhn-Tucker Estimation of Recreation Demand A Study of Temporal Stability Subhra Bhattacharjee, Catherine L. Kling and Joseph A. Herriges Iowa State University Contact: subhra@iastate.edu Selected Paper

More information

DRAFT FOR DISCUSSION AND REVIEW, NOT TO BE CITED OR QUOTED

DRAFT FOR DISCUSSION AND REVIEW, NOT TO BE CITED OR QUOTED DRAFT FOR DISCUSSION AND REVIEW, NOT TO BE CITED OR QUOTED USING MISSPECIFICATION TESTS ON WITHIN-SAMPLE MODELS AND CONSIDERING THE TRADE-OFF BETWEEN LEVEL AND SIGNIFICANCE AND POWER TO PROMOTE WELL-CALIBRATED,

More information

The Nature of Econometrics

The Nature of Econometrics Lecture 1: The Nature of Econometrics and Economic Data (Ch 1) The Nature of Econometrics General speaking, econometrics is a discipline that develops statistical methods for analyzing economic relationships,

More information

MEMBERS OF EMPLOYMENT EQUITY GROUPS: PERCEPTIONS OF MERIT AND FAIRNESS IN STAFFING ACTIVITIES

MEMBERS OF EMPLOYMENT EQUITY GROUPS: PERCEPTIONS OF MERIT AND FAIRNESS IN STAFFING ACTIVITIES MEMBERS OF EMPLOYMENT EQUITY GROUPS: PERCEPTIONS OF MERIT AND FAIRNESS IN STAFFING ACTIVITIES A STATISTICAL STUDY CONDUCTED BY THE PUBLIC SERVICE COMMISSION OF CANADA MARCH 2014 Public Service Commission

More information

Discriminant Analysis Applications and Software Support

Discriminant Analysis Applications and Software Support Mirko Savić Dejan Brcanov Stojanka Dakić Discriminant Analysis Applications and Stware Support Article Info:, Vol. 3 (2008), No. 1, pp. 029-033 Received 12 Januar 2008 Accepted 24 April 2008 UDC 311.42:004

More information

Appendix of Sequential Search with Refinement: Model and Application with Click-stream Data

Appendix of Sequential Search with Refinement: Model and Application with Click-stream Data Appendix of Sequential Search with Refinement: Model and Application with Click-stream Data Yuxin Chen and Song Yao A1. Robustness Tests of An Alternative Information Structure One assumption we have pertaining

More information

Survey Research (and some additional ideas)

Survey Research (and some additional ideas) Survey Research (and some additional ideas) Jeffrey K. Lazo lazo@ucar.edu National Center for Atmospheric Research Boulder, Colorado, USA Total Economic Value Use values Direct use (timber, other forest

More information

Factors Affecting Customers Attitude among Foreign and Domestic Footwear Products

Factors Affecting Customers Attitude among Foreign and Domestic Footwear Products Factors Affecting Customers Attitude among Foreign and Domestic Footwear Products Zemenu Aynadis 1 1 (Department of Marketing Management, College of Business and Economics, Mekelle University, Ethiopia)

More information

MBF1413 Quantitative Methods

MBF1413 Quantitative Methods MBF1413 Quantitative Methods Prepared by Dr Khairul Anuar 1: Introduction to Quantitative Methods www.notes638.wordpress.com Assessment Two assignments Assignment 1 -individual 30% Assignment 2 -individual

More information

Advanced Tutorials. SESUG '95 Proceedings GETTING STARTED WITH PROC LOGISTIC

Advanced Tutorials. SESUG '95 Proceedings GETTING STARTED WITH PROC LOGISTIC GETTING STARTED WITH PROC LOGISTIC Andrew H. Karp Sierra Information Services and University of California, Berkeley Extension Division Introduction Logistic Regression is an increasingly popular analytic

More information

IBM SPSS Statistics Editions

IBM SPSS Statistics Editions Editions Get the analytical power you need for better decision-making Why use IBM SPSS Statistics? is the world s leading statistical software. It enables you to quickly dig deeper into your data, making

More information

A Parametric Bootstrapping Approach to Forecast Intermittent Demand

A Parametric Bootstrapping Approach to Forecast Intermittent Demand Proceedings of the 2008 Industrial Engineering Research Conference J. Fowler and S. Mason, eds. A Parametric Bootstrapping Approach to Forecast Intermittent Demand Vijith Varghese, Manuel Rossetti Department

More information

Structural versus Reduced Form

Structural versus Reduced Form Econometric Analysis: Hausman and Leonard (2002) and Hosken et al (2011) Class 6 1 Structural versus Reduced Form Empirical papers can be broadly classified as: Structural: Empirical specification based

More information

Logistic Regression with Expert Intervention

Logistic Regression with Expert Intervention Smart Cities Symposium Prague 2016 1 Logistic Regression with Expert Intervention Pavla Pecherková and Ivan Nagy Abstract This paper deals with problem of analysis of traffic data. A traffic network has

More information

A Smart Approach to Analyzing Smart Meter Data

A Smart Approach to Analyzing Smart Meter Data A Smart Approach to Analyzing Smart Meter Data Ted Helvoigt, Evergreen Economics (Lead Author) Steve Grover, Evergreen Economics John Cornwell, Evergreen Economics Sarah Monohon, Evergreen Economics ABSTRACT

More information

Revision confidence limits for recent data on trend levels, trend growth rates and seasonally adjusted levels

Revision confidence limits for recent data on trend levels, trend growth rates and seasonally adjusted levels W O R K I N G P A P E R S A N D S T U D I E S ISSN 1725-4825 Revision confidence limits for recent data on trend levels, trend growth rates and seasonally adjusted levels Conference on seasonality, seasonal

More information

Thus, there are two points to keep in mind when analyzing risk:

Thus, there are two points to keep in mind when analyzing risk: One-Minute Spotlight WHAT IS RISK? Uncertainty about a situation can often indicate risk, which is the possibility of loss, damage, or any other undesirable event. Most people desire low risk, which would

More information

Comparison of Intervals. Dr. Brad Warner Maj. Jim Rutledge. Department of Mathematical Sciences

Comparison of Intervals. Dr. Brad Warner Maj. Jim Rutledge. Department of Mathematical Sciences USAFA-TR-98-1 Comparison of Intervals Dr. Brad Warner Maj. Jim Rutledge Department of Mathematical Sciences United States Air Force Academy Colorado Springs, Colorado 80840 JANUARY 1998 APPROVED FOR PUBLIC

More information

LOSS DISTRIBUTION ESTIMATION, EXTERNAL DATA

LOSS DISTRIBUTION ESTIMATION, EXTERNAL DATA LOSS DISTRIBUTION ESTIMATION, EXTERNAL DATA AND MODEL AVERAGING Ethan Cohen-Cole Federal Reserve Bank of Boston Working Paper No. QAU07-8 Todd Prono Federal Reserve Bank of Boston This paper can be downloaded

More information

Standard Formula, Internal Models and Scope of Validation. Example IM Calibrations Interest Rates Equities. Self-sufficiency

Standard Formula, Internal Models and Scope of Validation. Example IM Calibrations Interest Rates Equities. Self-sufficiency How to Calibrate Risk Distributions Andrew D Smith (andrewdsmith8@deloitte.co.uk) 2 December 2011 2010 The Actuarial Profession www.actuaries.org.uk Agenda Standard Formula, Internal Models and Scope of

More information

Capacity and Complexity Control in Predicting the Spread Between Borrowing and Lending Interest Rates

Capacity and Complexity Control in Predicting the Spread Between Borrowing and Lending Interest Rates From: KDD-95 Proceedings. Copyright 1995, AAAI (www.aaai.org). All rights reserved. Capacity and Complexity Control in Predicting the Spread Between Borrowing and Lending Interest Rates Corinna Cortes

More information

Practice Final Exam STCC204

Practice Final Exam STCC204 Practice Final Exam STCC24 The following are the types of questions you can expect on the final exam. There are 24 questions on this practice exam, so it should give you a good indication of the length

More information

Uncertainty in transport models. IDA workshop 7th May 2014

Uncertainty in transport models. IDA workshop 7th May 2014 Uncertainty in transport models IDA workshop 7th May 2014 Presentation outline Rationale Uncertainty in transport models How uncertainty is quantified Case study General comments 2 DTU Transport, Technical

More information

CHAPTER VI FUZZY MODEL

CHAPTER VI FUZZY MODEL CHAPTER VI This chapter covers development of a model using fuzzy logic to relate degree of recording/evaluation with the effectiveness of financial decision making and validate the model vis-à-vis results

More information

Active Learning for Conjoint Analysis

Active Learning for Conjoint Analysis Peter I. Frazier Shane G. Henderson snp32@cornell.edu pf98@cornell.edu sgh9@cornell.edu School of Operations Research and Information Engineering Cornell University November 1, 2015 Learning User s Preferences

More information

MONTE CARLO RISK AND DECISION ANALYSIS

MONTE CARLO RISK AND DECISION ANALYSIS MONTE CARLO RISK AND DECISION ANALYSIS M. Ragheb /7/ INTRODUCTION The risk generated and the ensuing economic impact of incorrect values used in decision-making models is incalculable. The models could

More information

Copyrights 2018 by International Financial Modeling Institute

Copyrights 2018 by International Financial Modeling Institute 1 Copyrights 2018 by International Financial Modeling Institute Last Update: June 2018 2 PFM Body of Knowledge About PFM BOK Professional Financial Modeler (PFM) Certification Program has embodied strong

More information

Chapter 1 Introduction: The Role of Statistics in Engineering

Chapter 1 Introduction: The Role of Statistics in Engineering 1 Chapter 1 Introduction: The Role of Statistics in Engineering Example: The manufacturer of a medical laser used in ophthalmic surgery wants to be able to quote quality characteristics of the laser to

More information

Chapter 1. An Introduction to Econometrics. 1.1 Why Study Econometrics?

Chapter 1. An Introduction to Econometrics. 1.1 Why Study Econometrics? Chapter 1 An Introduction to Econometrics 1.1 Why Study Econometrics? Econometrics is a set of research tools also employed in the business disciplines of accounting, finance, marketing and management.

More information

Choosing Optimal Designs for Pair-Wise Metric Conjoint Experiments

Choosing Optimal Designs for Pair-Wise Metric Conjoint Experiments Seoul Journal of Business Volume 10, Number 1 (June 2004) Choosing Optimal Designs for Pair-Wise Metric Conjoint Experiments Jin Gyo Kim* Sloan School of Management MIT, MA USA Abstract This paper presents

More information

Appendix A Mixed-Effects Models 1. LONGITUDINAL HIERARCHICAL LINEAR MODELS

Appendix A Mixed-Effects Models 1. LONGITUDINAL HIERARCHICAL LINEAR MODELS Appendix A Mixed-Effects Models 1. LONGITUDINAL HIERARCHICAL LINEAR MODELS Hierarchical Linear Models (HLM) provide a flexible and powerful approach when studying response effects that vary by groups.

More information

Estimation of haplotypes

Estimation of haplotypes Estimation of haplotypes Cavan Reilly October 7, 2013 Table of contents Bayesian methods for haplotype estimation Testing for haplotype trait associations Haplotype trend regression Haplotype associations

More information

BAYESIAN INFORMATION UPDATE FOR RETAILER S TWO-STAGE INVENTORY MODEL UNDER DEMAND UNCERTAINTY

BAYESIAN INFORMATION UPDATE FOR RETAILER S TWO-STAGE INVENTORY MODEL UNDER DEMAND UNCERTAINTY BAYESIAN INFORMATION UPDATE FOR RETAILER S TWO-STAGE INVENTORY MODEL UNDER DEMAND UNCERTAINTY Mohammad Anwar Rahman Industrial Engineering Technology The University of Southern Mississippi Email: mohammad.rahman@usm.edu

More information

Are Energy Audits Worth It? Teasing Apart the Role of Audits in Driving Customer Efficiency Actions

Are Energy Audits Worth It? Teasing Apart the Role of Audits in Driving Customer Efficiency Actions Are Energy Audits Worth It? Teasing Apart the Role of Audits in Driving Customer Efficiency Actions Joan Mancuso, Quantum Consulting Inc. Mary Dimit, Pacific Gas and Electric Co. Historically, audit programs

More information

Chapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition

Chapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition Chapter 4: Foundations for inference OpenIntro Statistics, 2nd Edition Variability in estimates 1 Variability in estimates Application exercise Sampling distributions - via CLT 2 Confidence intervals 3

More information

Utilizing Optimization Techniques to Enhance Cost and Schedule Risk Analysis

Utilizing Optimization Techniques to Enhance Cost and Schedule Risk Analysis 1 Utilizing Optimization Techniques to Enhance Cost and Schedule Risk Analysis Colin Smith, Brandon Herzog SCEA 2012 2 Table of Contents Introduction to Optimization Optimization and Uncertainty Analysis

More information

Harbingers of Failure: Online Appendix

Harbingers of Failure: Online Appendix Harbingers of Failure: Online Appendix Eric Anderson Northwestern University Kellogg School of Management Song Lin MIT Sloan School of Management Duncan Simester MIT Sloan School of Management Catherine

More information

Prospect theory and investment decision behavior: A review

Prospect theory and investment decision behavior: A review 2018 International Conference on Education Technology and Social Sciences (ETSOCS 2018) Prospect theory and investment decision behavior: A review Wen Wan1, a 1 Xi`an gao xin NO.1 high school, Shan`xi

More information

Machine Learning Logistic Regression Hamid R. Rabiee Spring 2015

Machine Learning Logistic Regression Hamid R. Rabiee Spring 2015 Machine Learning Logistic Regression Hamid R. Rabiee Spring 2015 http://ce.sharif.edu/courses/93-94/2/ce717-1 / Agenda Probabilistic Classification Introduction to Logistic regression Binary logistic regression

More information

Sawtooth Software. Brief Managerial Overview of Conjoint (Trade-Off) Analysis RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.

Sawtooth Software. Brief Managerial Overview of Conjoint (Trade-Off) Analysis RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc. Sawtooth Software RESEARCH PAPER SERIES Brief Managerial Overview of Conjoint (Trade-Off) Analysis Bryan Orme, Sawtooth Software, Inc. Copyright 2003, Sawtooth Software, Inc. 530 W. Fir St. Sequim, WA

More information

IBM SPSS Statistics. Editions. Get the analytical power you need for better decision making. Why use IBM SPSS Statistics? IBM Analytics Solution Brief

IBM SPSS Statistics. Editions. Get the analytical power you need for better decision making. Why use IBM SPSS Statistics? IBM Analytics Solution Brief Editions Get the analytical power you need for better decision making Why use IBM SPSS Statistics? is the world s leading statistical software. It enables you to quickly dig deeper into your data, making

More information

INTRODUCTION BACKGROUND. Paper

INTRODUCTION BACKGROUND. Paper Paper 354-2008 Small Improvements Causing Substantial Savings - Forecasting Intermittent Demand Data Using SAS Forecast Server Michael Leonard, Bruce Elsheimer, Meredith John, Udo Sglavo SAS Institute

More information

Balance Scorecard Application to Predict Business Success with Logistic Regression

Balance Scorecard Application to Predict Business Success with Logistic Regression 12 Journal of Advances in Economics and Finance, Vol. 3, No.1, February 2018 https://dx.doi.org/10.22606/jaef.2018.31002 Balance Scorecard Application to Predict Business Success with Logistic Regression

More information

+? Mean +? No change -? Mean -? No Change. *? Mean *? Std *? Transformations & Data Cleaning. Transformations

+? Mean +? No change -? Mean -? No Change. *? Mean *? Std *? Transformations & Data Cleaning. Transformations Transformations Transformations & Data Cleaning Linear & non-linear transformations 2-kinds of Z-scores Identifying Outliers & Influential Cases Univariate Outlier Analyses -- trimming vs. Winsorizing

More information

Opening SPSS 6/18/2013. Lesson: Quantitative Data Analysis part -I. The Four Windows: Data Editor. The Four Windows: Output Viewer

Opening SPSS 6/18/2013. Lesson: Quantitative Data Analysis part -I. The Four Windows: Data Editor. The Four Windows: Output Viewer Lesson: Quantitative Data Analysis part -I Research Methodology - COMC/CMOE/ COMT 41543 The Four Windows: Data Editor Data Editor Spreadsheet-like system for defining, entering, editing, and displaying

More information

Predicting Customer Churn Using CLV in Insurance Industry

Predicting Customer Churn Using CLV in Insurance Industry Shiraz Journal of System Management Vol. 2, No. 1, Ser. 5, (2014), 39-49 Predicting Customer Churn Using CLV in Insurance Industry Vahid Dust Mohammadi Department of Industrial Engineering, Tarbit Modares

More information

USING EXPECTATIONS DATA TO INFER MANAGERIAL OBJECTIVES AND CHOICES. Tat Y. Chan,* Barton H. Hamilton,* and Christopher Makler** November 1, 2006

USING EXPECTATIONS DATA TO INFER MANAGERIAL OBJECTIVES AND CHOICES. Tat Y. Chan,* Barton H. Hamilton,* and Christopher Makler** November 1, 2006 USING EXPECTATIONS DATA TO INFER MANAGERIAL OBJECTIVES AND CHOICES Tat Y. Chan,* Barton H. Hamilton,* and Christopher Makler** November 1, 6 VERY PRELIMINARY AND INCOMPLETE. PLEASE DO NOT CITE OR QUOTE!

More information