Gasoline Consumption Analysis

Size: px
Start display at page:

Download "Gasoline Consumption Analysis"

Transcription

1 Gasoline Consumption Analysis One of the most basic topics in economics is the supply/demand curve. Simply put, the supply offered for sale of a commodity is directly related to its price, while the demand for purchase of a commodity is inversely related to its price. Can this pattern be quantified with real data? (original data source: Economic Report of the President) The following analysis is based on figures for gasoline consumption in the United States from 1960 through First, here is a scatter plot of gasoline consumption per capita versus the price index for gasoline. The reason we take log to the prices is that, according to economic theory, coefficients from a log/log fit can be viewed as elasticity Scatterplot of loggpc vs logpg 1.05 loggpc logpg There is apparently a positive relationship here, not an inverse one! Is economic theory all wrong? Well, no. We haven t taken into account the fact that potential determinants of gasoline consumption other than price have also changed over the years. We can do this using multiple regression. Variables that are used are as follows: logpg : Logged price index for gasoline logi : Logged per capita real disposable income logpnc : Logged price index for new cars logpuc : Logged price index for used cars logppt : Logged price index for public transportation logpd : Logged aggregate price index for durable goods logpn : Logged aggregate price index for nondurable goods logps : Logged aggregate price index for consumer services

2 First, we notice that these variables are highly correlated with each other. A regression using all of the variables will exhibit high multicollinearity, and will presumably include several redundant variables. Correlations: logpg, logi, logpnc, logpuc, logppt, logpd, logpn, logps logpg logi logpnc logpuc logppt logpd logpn logi logpnc logpuc logppt logpd logpn logps Here is the regression output. Just as we suspected, we have several redundant variables (ones with the t statistics being insignificant). Note the extremely high VIF values, by the way. Regression Analysis: loggpc versus logpg, logi,... loggpc = logpg logi logpnc logpuc logppt logpd logpn logps Constant logpg logi logpnc logpuc logppt logpd logpn logps S = R-Sq = 98.9% R-Sq(adj) = 98.6% This leads us to the general problem of model selection. Omitting important effects (underfitting) reduces predictive power, biases estimates of effects for included predictors, and results in less understanding of the process being studied. Including unnecessary effects (overfitting) complicates descriptions of the process, and tends to lead to poorer predictions because of the additional unnecessary noise. Model selection is as much of an art as a science, and involves subjective decision-making. We have already seen how hypothesis tests (e.g. t-tests) can be used to try to decide which predictors are needed in a model. Unfortunately, there are several reasons why such tests are not adequate for the task of choosing the appropriate model to use. First, hypothesis tests don t necessarily answer the question a data analyst is most interested in. With a large enough sample, almost any estimated slope will be significantly different from zero, but that doesn t mean that the predictor provides additional useful predictive power. Similarly, in small samples, important effects might not be statistically significant at typical levels simply because of insufficient data. That is, there is a clear distinction between statistical significance and practical importance.

3 A second important point is that when predictors are related to one another (as is the case for these data), t-test can provide very misleading indications of the importance of a predictor. For example, consider a two-predictor situation where the predictors are each highly correlated with the target variable, and are also highly correlated with each other. In this situation it is likely that the t-statistic for each predictor will be relatively small. This is not an inappropriate result, as t-statistics test whether a predictor adds significant predictive power given the other variables in the model. For each predictor, given the other, the predictor adds little of importance (being highly correlated with each other, one is redundant in the presence of the other). Unfortunately this means that the t-statistics are useless in identifying that either predictor alone provides great predictive power. What is needed is a strategy for determining a best model (or even better, a set of best models) among a larger class of candidate models. In doing this, we need to be clear as to what we mean by a best model. The underlying premise is that any model is viewed as an approximation of reality. In this sense, there is no true model at all (or, perhaps, what is basically the same, the true model is too complex to be useful). Our goal is not to find the true model, but rather to find a model, or set of models, that best balances fit and simplicity (the so-called principle of parsimony, using as few parameters as possible while still adequately representing the relationships in the data). This should result in a model that provides useful descriptions of the process being studied from estimated parameters, and which can be used for predictions of future events. An effective way to do this is using a best subsets regression. Here is the output: Best Subsets Regression: loggpc versus logpg, logi,... Response is loggpc l l l l o o o l l l o l g g g o o o g o P P P g g g Mallows P g N U P P P P Vars R-Sq R-Sq(adj) C-p S G I C C T D N S X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X Since we are interested in estimating a demand function, we would like to consider models with the gasoline price variable logpg included. We would like to find the number of predictors where adding any more doesn t improve the fit very much; that is, where the R 2 levels off. One way to make this decision is to try to find a place where the standard error of the estimate begins to level off (since at that point the width of prediction intervals won t change very much if more predictors are used). That appears to be at the five predictor model. The adjusted R 2, on the other hand, is maximized at the seven predictor model (note that the adjusted R 2 attempts to

4 explicitly represent the tradeoff between fit and complexity of the model by penalizing R 2 with a term that is a function of the number of predictors p). The entry under C-p refers to Mallows Cp, a criterion introduced in the late 1960 s that was devised to explicitly balance the desires for improved fit and simplicity of the model by being an estimate of the error that would come from using the model for prediction. Cp has the form Cp = SSE/s 2 n + 2p + 2, where SSE is the Residual SS for the model being examined, p is the number of predictors in that model, and s 2 is the Residual MS based on using all of the candidate predicting variables. Minimization of Cp forces a tradeoff between fidelity to the data (smaller Residual SS) and simplicity of the model (smaller p). Two operational rules for Cp that have been suggested are to choose the model with smallest Cp, or smallest p such that Cp = p+1 or less. Both rules recommend the seven variable model. For this dataset, the different methods agree that the models to consider have somewhere between five and seven predictors. Seven predictors seems too many here (given the gain in R 2 of 0.1% when going from six to seven predictors), and the t statistic for logppt in this model is not significantly different from zero (logpuc is marginal): Regression Analysis: loggpc versus logpg, logi,... loggpc = logpg logi logpuc logppt logpd logpn logps Constant logpg logi logpuc logppt logpd logpn logps S = R-Sq = 98.9% R-Sq(adj) = 98.7% The best six predictor model is the same as above, except that the insignificant logppt is dropped out. The t statistic for logpuc is still marginal, but by this output this model is not unreasonable: Regression Analysis: loggpc versus logpg, logi,... loggpc = logpg logi logpuc logpd logpn logps Constant logpg logi logpuc logpd logpn logps S = R-Sq = 98.8% R-Sq(adj) = 98.6%

5 Ultimately, when you have competing models that are this similar, it becomes a matter of personal preference. We could also drop logpuc and look at the resulting five predictor model. Regression Analysis: loggpc versus logpg, logi, logpd, logpn, logps loggpc = logpg logi logpd logpn logps Constant logpg logi logpd logpn logps S = R-Sq = 98.6% R-Sq(adj) = 98.4% All of the variables are significant, but also show a lot of collinearity. The key thing to recognize from this model is that the coefficient for the price of gasoline (logpg) in the equation is Recall what this means: given that income, durable goods price index, nondurable goods price index, and consumer services price index are held fixed, a one unit increase in the logged gasoline price index is associated with a.4985 unit decrease in logged gasoline consumption. This, of course, is the inverse relationship we are looking for in a demand curve. As noted earlier, in this log/log model, this value represents the estimated price elasticity of gasoline, saying that a 1% increase in price is associated with a 0.5% decrease in consumption. The high variance inflation factors lead to a larger standard error of the coefficient for logpg. In fact, a 95% confidence interval for the decrease in logged gasoline consumption associated with a one unit increase in the logged price index is ( , ). There is an important issue regarding model selection that cannot be ignored. The act of looking at all of the regressions (as best subsets does) will cause an inflation of measures of the strength of the regression, since apparent relationships that are due to random chance will be identified as a true relationship. For example, in a sample of 400 observations with 50 potential predictors that are pure noise, best subsets regression is very likely to identify a model with R 2 at least 50%! This problem is sometimes given the names of data snooping. For this reason, it is not a good idea to just throw every possible variable in, and then let the computer choose the ones to use. Rather, there should be a very thorough pre examination of the data, where irrelevant and redundant variables are weeded out before any statistics are run. Even better, if you have enough data, some should be held out of the analysis; then, the model built on the first part of the sample can be validated on the held out portion. If it s not possible to validate your model on new data, a simple adjustment to the standard error of the estimate can help. The value reported above,.00828, is based on the number of predictors in the chosen model (that is, 5). A better choice is to use the number of predictors in the most complex model considered in the model selection, or in this case 10. So, an adjusted version of the standard error of the estimate divides by , rather than , or sqrt( /25) = Thus, a better rough 95% prediction interval for logged per capita demand when using this model is ±(2)(.00907) = ±.0181, rather than ±(2)(.00828) = ±.0166, with the former roughly 9% wider than the latter. Diagnostics and residual plots look okay here.

6 Finally, we can try to make an adjusted demand versus gasoline price index plot. That is, the relationship between demand for gasoline and price after all of the other variables have been taken into account. Here s how we do it: (1) Do a regression of the dependent variable (here loggpc) on all of the predictors except the one you re interested in (here logpg). Save the residuals (call it RES1). We can call this variable the adjusted gasoline consumption. (2) Do a regression of the predicting variable you re interested in (here logpg) on the other predicting variables. Save the residuals (call it RES2). We can call this variable the adjusted gasoline price index. (3) The partial correlation is simply the correlation between RES1 and RES2. (4) If you regressed RES1 on RES2 you d notice something remarkable the coefficient for RES2 is precisely the same as the coefficient for the predictor of interest in the full multiple regression model (that is, the coefficient for RES2 in this case is exactly ). (5) If you d like a graphical representation of the adjusted relationship between the predictor of interest and the target variable, taking the other predictors into account, just do a scatter plot of RES1 on RES2. Thus, here is the (adjusted) demand curve for gasoline: 0.03 Scatterplot of RES1 vs RES RES RES

Untangling Correlated Predictors with Principle Components

Untangling Correlated Predictors with Principle Components Untangling Correlated Predictors with Principle Components David R. Roberts, Marriott International, Potomac MD Introduction: Often when building a mathematical model, one can encounter predictor variables

More information

Semester 2, 2015/2016

Semester 2, 2015/2016 ECN 3202 APPLIED ECONOMETRICS 3. MULTIPLE REGRESSION B Mr. Sydney Armstrong Lecturer 1 The University of Guyana 1 Semester 2, 2015/2016 MODEL SPECIFICATION What happens if we omit a relevant variable?

More information

Chapter 5 Regression

Chapter 5 Regression Chapter 5 Regression Topics to be covered in this chapter: Regression Fitted Line Plots Residual Plots Regression The scatterplot below shows that there is a linear relationship between the percent x of

More information

Tutorial 3 - Sessions 6to8:

Tutorial 3 - Sessions 6to8: Simona Abis Tutorial 3 - Sessions 6to8: Session 7 - Pricing with market power: Essentials: Firms have market power, hence face downward sloping demand curves They can influence market price Charging a

More information

BUS105 Statistics. Tutor Marked Assignment. Total Marks: 45; Weightage: 15%

BUS105 Statistics. Tutor Marked Assignment. Total Marks: 45; Weightage: 15% BUS105 Statistics Tutor Marked Assignment Total Marks: 45; Weightage: 15% Objectives a) Reinforcing your learning, at home and in class b) Identifying the topics that you have problems with so that your

More information

Chapter 3. Table of Contents. Introduction. Empirical Methods for Demand Analysis

Chapter 3. Table of Contents. Introduction. Empirical Methods for Demand Analysis Chapter 3 Empirical Methods for Demand Analysis Table of Contents 3.1 Elasticity 3.2 Regression Analysis 3.3 Properties & Significance of Coefficients 3.4 Regression Specification 3.5 Forecasting 3-2 Introduction

More information

Model Building Process Part 2: Factor Assumptions

Model Building Process Part 2: Factor Assumptions Model Building Process Part 2: Factor Assumptions Authored by: Sarah Burke, PhD 17 July 2018 Revised 6 September 2018 The goal of the STAT COE is to assist in developing rigorous, defensible test strategies

More information

Sawtooth Software. Sample Size Issues for Conjoint Analysis Studies RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.

Sawtooth Software. Sample Size Issues for Conjoint Analysis Studies RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc. Sawtooth Software RESEARCH PAPER SERIES Sample Size Issues for Conjoint Analysis Studies Bryan Orme, Sawtooth Software, Inc. 1998 Copyright 1998-2001, Sawtooth Software, Inc. 530 W. Fir St. Sequim, WA

More information

Acknowledgments. Data Mining. Examples. Overview. Colleagues

Acknowledgments. Data Mining. Examples. Overview. Colleagues Data Mining A regression modeler s view on where it is and where it s likely to go. Bob Stine Department of Statistics The School of the Univ of Pennsylvania March 30, 2006 Acknowledgments Colleagues Dean

More information

Categorical Predictors, Building Regression Models

Categorical Predictors, Building Regression Models Fall Semester, 2001 Statistics 621 Lecture 9 Robert Stine 1 Categorical Predictors, Building Regression Models Preliminaries Supplemental notes on main Stat 621 web page Steps in building a regression

More information

Correlation and Simple. Linear Regression. Scenario. Defining Correlation

Correlation and Simple. Linear Regression. Scenario. Defining Correlation Linear Regression Scenario Let s imagine that we work in a real estate business and we re attempting to understand whether there s any association between the square footage of a house and it s final selling

More information

Regression Analysis I & II

Regression Analysis I & II Data for this session is available in Data Regression I & II Regression Analysis I & II Quantitative Methods for Business Skander Esseghaier 1 In this session, you will learn: How to read and interpret

More information

Introduction to Labour Economics. Professor H.J. Schuetze Economics 370. What is Labour Economics?

Introduction to Labour Economics. Professor H.J. Schuetze Economics 370. What is Labour Economics? Introduction to Labour Economics Professor H.J. Schuetze Economics 370 What is Labour Economics? Let s begin by looking at what economics is in general Study of interactions between decision makers, which

More information

and Forecasting CONCEPTS

and Forecasting CONCEPTS 6 Demand Estimation and Forecasting CONCEPTS Regression Method Scatter Diagram Linear Regression Regression Line Independent Variable Explanatory Variable Dependent Variable Intercept Coefficient Slope

More information

ST7002 Optional Regression Project. Postgraduate Diploma in Statistics. Trinity College Dublin. Sarah Mechan. FAO: Prof.

ST7002 Optional Regression Project. Postgraduate Diploma in Statistics. Trinity College Dublin. Sarah Mechan. FAO: Prof. ST72 Optional Regression Project Postgraduate Diploma in Statistics Trinity College Dublin Sarah Mechan FAO: Prof. John Haslett School of Computer Science & Statistics Section 1 Introduction This report

More information

Marketing research departments seldom have

Marketing research departments seldom have Unconventional Wisdom By Terry Grapentine Marketing research departments seldom have the time or resources to undertake their own research on research. Consequently, some issues surrounding measurement

More information

Urban Transportation Planning Prof Dr. V. Thamizh Arasan Department of Civil Engineering Indian Institute Of Technology, Madras

Urban Transportation Planning Prof Dr. V. Thamizh Arasan Department of Civil Engineering Indian Institute Of Technology, Madras Urban Transportation Planning Prof Dr. V. Thamizh Arasan Department of Civil Engineering Indian Institute Of Technology, Madras Lecture No. # 14 Modal Split Analysis Contd. This is lecture 14 on urban

More information

STAT 350 (Spring 2016) Homework 12 Online 1

STAT 350 (Spring 2016) Homework 12 Online 1 STAT 350 (Spring 2016) Homework 12 Online 1 1. In simple linear regression, both the t and F tests can be used as model utility tests. 2. The sample correlation coefficient is a measure of the strength

More information

CS 147: Computer Systems Performance Analysis

CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis Approaching Performance Projects CS 147: Computer Systems Performance Analysis Approaching Performance Projects 1 / 35 Overview Overview Overview Planning

More information

Time Series Analysis in the Social Sciences

Time Series Analysis in the Social Sciences one Time Series Analysis in the Social Sciences in the social sciences, data are usually collected across space, that is, across countries, cities, and so on. Sometimes, however, data are collected across

More information

Regression diagnostics

Regression diagnostics Regression diagnostics Biometry 755 Spring 2009 Regression diagnostics p. 1/48 Introduction Every statistical method is developed based on assumptions. The validity of results derived from a given method

More information

Confidence Intervals

Confidence Intervals Confidence Intervals Example 1: How prevalent is sports gambling in America? 2007 Gallup poll took a random sample of 1027 adult Americans. 17% of the sampled adults had gambled on sports in the past year.

More information

Linear model to forecast sales from past data of Rossmann drug Store

Linear model to forecast sales from past data of Rossmann drug Store Abstract Linear model to forecast sales from past data of Rossmann drug Store Group id: G3 Recent years, the explosive growth in data results in the need to develop new tools to process data into knowledge

More information

DEMAND CURVE AS A CONSTRAINT FOR BUSINESSES

DEMAND CURVE AS A CONSTRAINT FOR BUSINESSES 1Demand and rofit Seeking 8 Demand is important for two reasons. First, demand acts as a constraint on what business firms are able to do. In particular, the demand curve forces firms to accept lower sales

More information

Managerial Economics

Managerial Economics Managerial Economics Estimating Demand Functions Rudolf Winter-Ebmer Johannes Kepler University Linz Winter Term 2014 Winter-Ebmer, Managerial Economics: Unit 2 - Demand Estimation 1 / 21 Why do you need

More information

CHAPTER 1 A CASE STUDY FOR REGRESSION ANALYSIS

CHAPTER 1 A CASE STUDY FOR REGRESSION ANALYSIS CHAPTER 1 AIR POLLUTION AND PUBLIC HEALTH: A CASE STUDY FOR REGRESSION ANALYSIS 1 2 Fundamental Equations Simple linear regression: y i = α + βx i + ɛ i, i = 1,, n Multiple regression: y i = p j=1 x ij

More information

Chapter Six- Selecting the Best Innovation Model by Using Multiple Regression

Chapter Six- Selecting the Best Innovation Model by Using Multiple Regression Chapter Six- Selecting the Best Innovation Model by Using Multiple Regression 6.1 Introduction In the previous chapter, the detailed results of FA were presented and discussed. As a result, fourteen factors

More information

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney Name: Intro to Statistics for the Social Sciences Lab Session: Spring, 2015, Dr. Suzanne Delaney CID Number: _ Homework #22 You have been hired as a statistical consultant by Donald who is a used car dealer

More information

ECONOMICS 103. Topic 5: Externalities. External costs and benefits

ECONOMICS 103. Topic 5: Externalities. External costs and benefits ECONOMICS 103 External costs and benefits Topic 5: Externalities Marginal external costs, marginal external benefits Marginal social costs and benefits v marginal private costs and benefits Environmental

More information

Common Mistakes in Performance Evaluation

Common Mistakes in Performance Evaluation Common Mistakes in Performance Evaluation The Art of Computer Systems Performance Analysis By Raj Jain Adel Nadjaran Toosi Wise men learn by other men s mistake, fools by their own. H. G. Wells No Goals

More information

Two Way ANOVA. Turkheimer PSYC 771. Page 1 Two-Way ANOVA

Two Way ANOVA. Turkheimer PSYC 771. Page 1 Two-Way ANOVA Page 1 Two Way ANOVA Two way ANOVA is conceptually like multiple regression, in that we are trying to simulateously assess the effects of more than one X variable on Y. But just as in One Way ANOVA, the

More information

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 02 Data Mining Process Welcome to the lecture 2 of

More information

Quadratic Regressions Group Acitivity 2 Business Project Week #4

Quadratic Regressions Group Acitivity 2 Business Project Week #4 Quadratic Regressions Group Acitivity 2 Business Project Week #4 In activity 1 we created a scatter plot on the calculator using a table of values that were given. Some of you were able to create a linear

More information

Notes on Intertemporal Consumption Choice

Notes on Intertemporal Consumption Choice Paul A Johnson Economics 304 Advanced Topics in Macroeconomics Notes on Intertemporal Consumption Choice A: The Two-Period Model Consider an individual who faces the problem of allocating their available

More information

Why do we need a model?

Why do we need a model? Modeling Basics Bu eğitim sunumları İstanbul Kalkınma Ajansı nın 2016 yılı Yenilikçi ve Yaratıcı İstanbul Mali Destek Programı kapsamında yürütülmekte olan TR10/16/YNY/0036 no lu İstanbul Big Data Eğitim

More information

Chapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition

Chapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition Chapter 4: Foundations for inference OpenIntro Statistics, 2nd Edition Variability in estimates 1 Variability in estimates Application exercise Sampling distributions - via CLT 2 Confidence intervals 3

More information

Chapter 3. Database and Research Methodology

Chapter 3. Database and Research Methodology Chapter 3 Database and Research Methodology In research, the research plan needs to be cautiously designed to yield results that are as objective as realistic. It is the main part of a grant application

More information

Techniques for Understanding Consumer Demand and Behavior

Techniques for Understanding Consumer Demand and Behavior Economics for Managers by Paul Farnham Chapter 4: Techniques for Understanding Consumer Demand and Behavior 4.1 Getting Information About Consumer Behavior Expert opinion Consumer surveys Test marketing

More information

= = Name: Lab Session: CID Number: The database can be found on our class website: Donald s used car data

= = Name: Lab Session: CID Number: The database can be found on our class website: Donald s used car data Intro to Statistics for the Social Sciences Fall, 2017, Dr. Suzanne Delaney Extra Credit Assignment Instructions: You have been hired as a statistical consultant by Donald who is a used car dealer to help

More information

The SPSS Sample Problem To demonstrate these concepts, we will work the sample problem for logistic regression in SPSS Professional Statistics 7.5, pa

The SPSS Sample Problem To demonstrate these concepts, we will work the sample problem for logistic regression in SPSS Professional Statistics 7.5, pa The SPSS Sample Problem To demonstrate these concepts, we will work the sample problem for logistic regression in SPSS Professional Statistics 7.5, pages 37-64. The description of the problem can be found

More information

Lecture Private Information, Adverse Selection and Market Failure

Lecture Private Information, Adverse Selection and Market Failure Lecture Private Information, Adverse Selection and Market Failure David Autor 14.03 Fall 2004 1 Private Information, Adverse Selection and Market Failure It used to be thought that markets for information

More information

Continuous Improvement Toolkit

Continuous Improvement Toolkit Continuous Improvement Toolkit Regression (Introduction) Managing Risk PDPC Pros and Cons Importance-Urgency Mapping RACI Matrix Stakeholders Analysis FMEA RAID Logs Break-even Analysis Cost -Benefit Analysis

More information

CHAPTER 5 RESULTS AND ANALYSIS

CHAPTER 5 RESULTS AND ANALYSIS CHAPTER 5 RESULTS AND ANALYSIS This chapter exhibits an extensive data analysis and the results of the statistical testing. Data analysis is done using factor analysis, regression analysis, reliability

More information

4.3 Nonparametric Tests cont...

4.3 Nonparametric Tests cont... Class #14 Wednesday 2 March 2011 What did we cover last time? Hypothesis Testing Types Student s t-test - practical equations Effective degrees of freedom Parametric Tests Chi squared test Kolmogorov-Smirnov

More information

Week 11: Collinearity

Week 11: Collinearity Week 11: Collinearity Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Regression and holding other

More information

Chapter 19. Confidence Intervals for Proportions. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Chapter 19. Confidence Intervals for Proportions. Copyright 2012, 2008, 2005 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions Copyright 2012, 2008, 2005 Pearson Education, Inc. Standard Error Both of the sampling distributions we ve looked at are Normal. For proportions For means

More information

Forecasting with Seasonality Version 1.6

Forecasting with Seasonality Version 1.6 Forecasting with Seasonality Version 1.6 Dr. Ron Tibben-Lembke Sept 3, 2006 Forecasting with seasonality and a trend is obviously more difficult than forecasting for a trend or for seasonality by itself,

More information

Econometric Analysis Dr. Sobel

Econometric Analysis Dr. Sobel Econometric Analysis Dr. Sobel Econometrics Session 1: 1. Building a data set Which software - usually best to use Microsoft Excel (XLS format) but CSV is also okay Variable names (first row only, 15 character

More information

Problem Points Score USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT

Problem Points Score USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT STAT 512 EXAM I STAT 512 Name (7 pts) Problem Points Score 1 40 2 25 3 28 USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT WRITE LEGIBLY. ANYTHING UNREADABLE WILL NOT BE GRADED GOOD LUCK!!!!

More information

An Assessment of the ISM Manufacturing Price Index for Inflation Forecasting

An Assessment of the ISM Manufacturing Price Index for Inflation Forecasting ECONOMIC COMMENTARY Number 2018-05 May 24, 2018 An Assessment of the ISM Manufacturing Price Index for Inflation Forecasting Mark Bognanni and Tristan Young* The Institute for Supply Management produces

More information

STATISTICAL TECHNIQUES. Data Analysis and Modelling

STATISTICAL TECHNIQUES. Data Analysis and Modelling STATISTICAL TECHNIQUES Data Analysis and Modelling DATA ANALYSIS & MODELLING Data collection and presentation Many of us probably some of the methods involved in collecting raw data. Once the data has

More information

Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian. Preliminary Data Screening

Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian. Preliminary Data Screening r's age when 1st child born 2 4 6 Density.2.4.6.8 Density.5.1 Sociology 774: Regression Models for Categorical Data Instructor: Natasha Sarkisian Preliminary Data Screening A. Examining Univariate Normality

More information

BOOTSTRAPPING AND CONFIDENCE INTERVALS

BOOTSTRAPPING AND CONFIDENCE INTERVALS BOOTSTRAPPING AND CONFIDENCE INTERVALS Fill out your reading report on Learning Suite MPA 630: Data Science for Public Management November 8, 2018 PLAN FOR TODAY Why are we even doing this? Confidence

More information

Producer Theory - Monopoly

Producer Theory - Monopoly Producer Theory - Monopoly Mark Dean Lecture Notes for Fall 2009 Introductory Microeconomics - Brown University 1 Introduction Up until now, we have assumed that all the agents in our economies are price

More information

How to Use Excel for Regression Analysis MtRoyal Version 2016RevA *

How to Use Excel for Regression Analysis MtRoyal Version 2016RevA * OpenStax-CNX module: m63578 1 How to Use Excel for Regression Analysis MtRoyal Version 2016RevA * Lyryx Learning Based on How to Use Excel for Regression Analysis BSTA 200 Humber College Version 2016RevA

More information

14.27 Economics and E-Commerce Fall 14. Lecture 2 - Review

14.27 Economics and E-Commerce Fall 14. Lecture 2 - Review 14.27 Economics and E-Commerce Fall 14 Lecture 2 - Review Prof. Sara Ellison MIT OpenCourseWare 4 basic things I want to get across in this review: Monopoly pricing function of demand, elasticity, & marginal

More information

Discussion Solution Mollusks and Litter Decomposition

Discussion Solution Mollusks and Litter Decomposition Discussion Solution Mollusks and Litter Decomposition. Is the rate of litter decomposition affected by the presence of mollusks? 2. Does the effect of mollusks on litter decomposition differ among the

More information

MARK SCHEME for the May/June 2010 question paper for the guidance of teachers 9772 ECONOMICS. 9772/02 Paper 2 (Essays), maximum raw mark 75

MARK SCHEME for the May/June 2010 question paper for the guidance of teachers 9772 ECONOMICS. 9772/02 Paper 2 (Essays), maximum raw mark 75 UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS Pre-U Certificate www.xtremepapers.com MARK SCHEME for the May/June 2010 question paper for the guidance of teachers 9772 ECONOMICS 9772/02 Paper 2 (Essays),

More information

Delivering Value Why Else Are You Doing The Project?

Delivering Value Why Else Are You Doing The Project? Delivering Value Why Else Are You Doing The Project? THOUGHT LEADERSHIP WHITE PAPER In partnership with By Andy Jordan, PMP, ProjectManagement.com Research Analyst Projects are the way that an organization

More information

Chapter 9. Regression Wisdom. Copyright 2010 Pearson Education, Inc.

Chapter 9. Regression Wisdom. Copyright 2010 Pearson Education, Inc. Chapter 9 Regression Wisdom Copyright 2010 Pearson Education, Inc. Getting the Bends Linear regression only works for linear models. (That sounds obvious, but when you fit a regression, you can t take

More information

Business Quantitative Analysis [QU1] Examination Blueprint

Business Quantitative Analysis [QU1] Examination Blueprint Business Quantitative Analysis [QU1] Examination Blueprint 2014-2015 Purpose The Business Quantitative Analysis [QU1] examination has been constructed using an examination blueprint. The blueprint, also

More information

Viability Analysis of an Enterprise I

Viability Analysis of an Enterprise I IJAMT 71 Viability Analysis of an Enterprise I Charles Andoh, University of Ghana Business School, West Africa Abstract The study derives the conditions for the profitability of any enterprise. So long

More information

Test Codes: QEA and QEB (Both are of 'Descriptive' type) (Junior Research Fellowship in Quantitative Economics)

Test Codes: QEA and QEB (Both are of 'Descriptive' type) (Junior Research Fellowship in Quantitative Economics) Test Codes: QEA and QEB (Both are of 'Descriptive' type) (Junior Research Fellowship in Quantitative Economics) The candidates for Junior Research Fellowship in Quantitative Economics are required to take

More information

CHAPTER 4 EMPIRICAL ANALYSIS AND DISCUSSION

CHAPTER 4 EMPIRICAL ANALYSIS AND DISCUSSION 79 CHAPTER 4 EMPIRICAL ANALYSIS AND DISCUSSION 4.1 SAMPLE SELECTION AND DESCRIPTIVE STATISTICS The study begins by identifying industrial sectors heavily reliant on intellectual capital. The data covers

More information

Model Selection, Evaluation, Diagnosis

Model Selection, Evaluation, Diagnosis Model Selection, Evaluation, Diagnosis INFO-4604, Applied Machine Learning University of Colorado Boulder October 31 November 2, 2017 Prof. Michael Paul Today How do you estimate how well your classifier

More information

Procedure 3. Statistical Requirements for CEC Test Methods

Procedure 3. Statistical Requirements for CEC Test Methods Procedure 3 Issue 3 19th March 2014 Statistical Requirements for CEC Test Methods 1. Purpose...2 2. Overview...2 3. Processes...3 i) Repeatability/Discrimination...3 ii) Pilot inter-laboratory programme(s)...4

More information

Analysis of Factors that Affect Productivity of Enterprise Software Projects

Analysis of Factors that Affect Productivity of Enterprise Software Projects Analysis of Factors that Affect Productivity of Enterprise Software Projects Tsuneo Furuyama Scool of Science, Tokai University Kitakaname 4-1-1, Hiratsuka city, Japan furuyama@tokai-u.jp Abstract Analysis

More information

ECON Chapter 5 - Labour Economics. Maggie Jones

ECON Chapter 5 - Labour Economics. Maggie Jones ECON 370 - Chapter 5 - Labour Economics Maggie Jones Demand for Labour in Competitive Labour Markets Demand for Labour in Competitive Labour Markets The principles that determine the demand for any factor

More information

Estimating Discrete Choice Models of Demand. Data

Estimating Discrete Choice Models of Demand. Data Estimating Discrete Choice Models of Demand Aggregate (market) aggregate (market-level) quantity prices + characteristics (+advertising) distribution of demographics (optional) sample from distribution

More information

Common Mistakes and How to Avoid Them

Common Mistakes and How to Avoid Them Common Mistakes and How to Avoid Them 2-1 Overview Common Mistakes in Evaluation Checklist for Avoiding Common Mistakes A Systematic Approach to Performance Evaluation Case Study: Remote Pipes vs RPC 2-2

More information

Reply to the Referees John J. Seater 15 April 2008

Reply to the Referees John J. Seater 15 April 2008 Reply to the Referees John J. Seater 15 April 2008 I thank the two referees for their thoughtful comments. Both obviously read my paper carefully and have provided well-considered comments. Naturally,

More information

Differences Between High-, Medium-, and Low-Profit Cow-Calf Producers: An Analysis of Kansas Farm Management Association Cow-Calf Enterprise

Differences Between High-, Medium-, and Low-Profit Cow-Calf Producers: An Analysis of Kansas Farm Management Association Cow-Calf Enterprise Differences Between High-, Medium-, and Low-Profit Cow-Calf Producers: An Analysis of 2010-2014 Kansas Farm Management Association Cow-Calf Enterprise Dustin L. Pendell (dpendell@ksu.edu), Youngjune Kim

More information

Transport Economics. Energy Summer School, 2017 Selena Sheng Energy Centre, Business School, UoA 27/02/2017

Transport Economics. Energy Summer School, 2017 Selena Sheng Energy Centre, Business School, UoA 27/02/2017 Transport Economics Energy Summer School, 2017 Selena Sheng Energy Centre, Business School, UoA 27/02/2017 Transport Economics Transport demand Transport supply Congestion pricing Econometric modelling

More information

Empirical Exercise Handout

Empirical Exercise Handout Empirical Exercise Handout Ec-970 International Corporate Governance Harvard University March 2, 2004 Due Date The new due date for empirical paper is Wednesday, March 24 at the beginning of class. Description

More information

as explained in [2, p. 4], households are indexed by an index parameter ι ranging over an interval [0, l], implying there are uncountably many

as explained in [2, p. 4], households are indexed by an index parameter ι ranging over an interval [0, l], implying there are uncountably many THE HOUSEHOLD SECTOR IN THE SMETS-WOUTERS DSGE MODEL: COMPARISONS WITH THE STANDARD OPTIMAL GROWTH MODEL L. Tesfatsion, Econ 502, Fall 2014 Last Revised: 19 November 2014 Basic References: [1] ** L. Tesfatsion,

More information

Lesson-9. Elasticity of Supply and Demand

Lesson-9. Elasticity of Supply and Demand Lesson-9 Elasticity of Supply and Demand Price Elasticity Businesses know that they face demand curves, but rarely do they know what these curves look like. Yet sometimes a business needs to have a good

More information

ADVANCED DATA ANALYTICS

ADVANCED DATA ANALYTICS ADVANCED DATA ANALYTICS MBB essay by Marcel Suszka 17 AUGUSTUS 2018 PROJECTSONE De Corridor 12L 3621 ZB Breukelen MBB Essay Advanced Data Analytics Outline This essay is about a statistical research for

More information

Presented at the 2013 ICEAA Professional Development & Training Workshop - ODASA-CE Software

Presented at the 2013 ICEAA Professional Development & Training Workshop -  ODASA-CE Software ODASA-CE Software Growth Research ICEAA 2013 Authors: Lauren Nolte, Kevin Cincotta, Eric Lofgren, Remmie Arnold Sponsor: Shawn Collins, ODASA-CE Outline 1. Background & Purpose Software Estimating Process

More information

Outliers and Their Effect on Distribution Assessment

Outliers and Their Effect on Distribution Assessment Outliers and Their Effect on Distribution Assessment Larry Bartkus September 2016 Topics of Discussion What is an Outlier? (Definition) How can we use outliers Analysis of Outlying Observation The Standards

More information

R-SQUARED RESID. MEAN SQUARE (MSE) 1.885E+07 ADJUSTED R-SQUARED STANDARD ERROR OF ESTIMATE

R-SQUARED RESID. MEAN SQUARE (MSE) 1.885E+07 ADJUSTED R-SQUARED STANDARD ERROR OF ESTIMATE These are additional sample problems for the exam of 2012.APR.11. The problems have numbers 6, 7, and 8. This is not Minitab output, so you ll find a few extra challenges. 6. The following information

More information

Habitat as a predictor of Warbler usage.

Habitat as a predictor of Warbler usage. 1. Introduction: Habitat as a predictor of Warbler usage. Carl James Schwarz (cschwarz@stat.sfu.ca) Over the past few years, the BC MOE has completed detailed habitat evaluations at 223 Black-throated

More information

Eco 300 Intermediate Micro

Eco 300 Intermediate Micro Eco 300 Intermediate Micro Instructor: Amalia Jerison Office Hours: T 12:00-1:00, Th 12:00-1:00, and by appointment BA 127A, aj4575@albany.edu A. Jerison (BA 127A) Eco 300 Spring 2010 1 / 61 Monopoly Market

More information

Differences Between High-, Medium-, and Low-Profit Cow-Calf Producers: An Analysis of Kansas Farm Management Association Cow-Calf Enterprise

Differences Between High-, Medium-, and Low-Profit Cow-Calf Producers: An Analysis of Kansas Farm Management Association Cow-Calf Enterprise Differences Between High-, Medium-, and Low-Profit Cow-Calf Producers: An Analysis of 2012-2016 Kansas Farm Management Association Cow-Calf Enterprise Dustin L. Pendell (dpendell@ksu.edu) and Kevin L.

More information

Examiner s report F5 Performance Management December 2017

Examiner s report F5 Performance Management December 2017 Examiner s report F5 Performance Management December 2017 General comments The F5 Performance Management exam is offered in both computer-based (CBE) and paper formats. The structure is the same in both

More information

Introduction to Software Metrics

Introduction to Software Metrics Introduction to Software Metrics Outline Today we begin looking at measurement of software quality using software metrics We ll look at: What are software quality metrics? Some basic measurement theory

More information

HIMSS ME-PI Community. Quick Tour. Sigma Score Calculation Worksheet INSTRUCTIONS

HIMSS ME-PI Community. Quick Tour. Sigma Score Calculation Worksheet INSTRUCTIONS HIMSS ME-PI Community Sigma Score Calculation Worksheet INSTRUCTIONS Quick Tour Let s start with a quick tour of the Excel spreadsheet. There are six worksheets in the spreadsheet. Sigma Score (Snapshot)

More information

Operations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras

Operations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras Operations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras Module - 1 Lecture - 7 Aggregate Planning, Dynamic Programming, Backordering

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Univariate Statistics Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved Table of Contents PAGE Creating a Data File...3 1. Creating

More information

You can find the consultant s raw data here:

You can find the consultant s raw data here: Problem Set 1 Econ 475 Spring 2014 Arik Levinson, Georgetown University 1 [Travel Cost] A US city with a vibrant tourist industry has an industrial accident (a spill ) The mayor wants to sue the company

More information

A few firms Imperfect Competition Oligopoly. Figure 8.1: Market structures

A few firms Imperfect Competition Oligopoly. Figure 8.1: Market structures 8.1 Setup Monopoly is a single firm producing a particular commodity. It can affect the market by changing the quantity; via the (inverse) demand function p (q). The tradeoff: either sell a lot cheaply,

More information

Modelling buyer behaviour - 2 Rate-frequency models

Modelling buyer behaviour - 2 Rate-frequency models Publishing Date: May 1993. 1993. All rights reserved. Copyright rests with the author. No part of this article may be reproduced without written permission from the author. Modelling buyer behaviour -

More information

+? Mean +? No change -? Mean -? No Change. *? Mean *? Std *? Transformations & Data Cleaning. Transformations

+? Mean +? No change -? Mean -? No Change. *? Mean *? Std *? Transformations & Data Cleaning. Transformations Transformations Transformations & Data Cleaning Linear & non-linear transformations 2-kinds of Z-scores Identifying Outliers & Influential Cases Univariate Outlier Analyses -- trimming vs. Winsorizing

More information

All Possible Mixed Model Selection - A User-friendly SAS Macro Application

All Possible Mixed Model Selection - A User-friendly SAS Macro Application All Possible Mixed Model Selection - A User-friendly SAS Macro Application George C J Fernandez, University of Nevada - Reno, Reno NV 89557 ABSTRACT A user-friendly SAS macro application to perform all

More information

Shewhart and the Probability Approach. The difference is much greater than how we compute the limits

Shewhart and the Probability Approach. The difference is much greater than how we compute the limits Quality Digest Daily, November 2, 2015 Manuscript 287 The difference is much greater than how we compute the limits Donald J. Wheeler & Henry R. Neave In theory, there is no difference between theory and

More information

CHAPTER 19 HEAT ENGINES AND REFRIGERATORS

CHAPTER 19 HEAT ENGINES AND REFRIGERATORS CHAPTER 19 HEAT ENGINES AND REFRIGERATORS In this chapter, we will examine devices that turn heat into work. I swear, I haven t confused you people for Mechanical Engineering majors! Biological processes

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH Combination Chemotherapy: DESIGNING CLINICAL RESEARCH Use of a combination of two therapeutics, especially drug-drug combination (called combination

More information

Transportation Economics and Decision Making. L e c t u r e - 8

Transportation Economics and Decision Making. L e c t u r e - 8 Transportation Economics and Decision Making L e c t u r e - 8 Travel Behavior Many practical transportation policy issues are concerned with choice of mode Example: the gain or loss of transit revenue

More information

ESTIMATION OF FLOW-DURATION CURVES AT UNGAGED SITES IN SOUTHERN NEW ENGLAND. By Stacey A. Archfield, Richard M. Vogel, and Sara L.

ESTIMATION OF FLOW-DURATION CURVES AT UNGAGED SITES IN SOUTHERN NEW ENGLAND. By Stacey A. Archfield, Richard M. Vogel, and Sara L. ESTIMATION OF FLOW-DURATION CURVES AT UNGAGED SITES IN SOUTHERN NEW ENGLAND By Stacey A. Archfield, Richard M. Vogel, and Sara L. Brandt Department of Civil and Environmental Engineering, Tufts University,

More information

Linear Regression Analysis of Gross Output Value of Farming, Forestry, Animal Husbandry and Fishery Industries

Linear Regression Analysis of Gross Output Value of Farming, Forestry, Animal Husbandry and Fishery Industries 1106 Proceedings of the 8th International Conference on Innovation & Management Linear Regression Analysis of Gross Output Value of Farming, Forestry, Animal Husbandry and Fishery Industries Liu Haime,

More information

Control Charts for Customer Satisfaction Surveys

Control Charts for Customer Satisfaction Surveys Control Charts for Customer Satisfaction Surveys Robert Kushler Department of Mathematics and Statistics, Oakland University Gary Radka RDA Group ABSTRACT Periodic customer satisfaction surveys are used

More information