Chapter 2. Linear model. Put some concreteness on problem. The Bivariate Regression Model. Sample of n observations, labeled as i=1,2,..

Size: px
Start display at page:

Download "Chapter 2. Linear model. Put some concreteness on problem. The Bivariate Regression Model. Sample of n observations, labeled as i=1,2,.."

Transcription

1 Linear model Chapter 2 The Bivariate Regression Model Sample of n observations, labeled as i=1,2,..n y i = + x i 1 + i and 1 are population values represent the true relationship between x and y Unfortunately these values are unknown The job of the researcher is to estimate these values 1 2 Put some concreteness on problem Notice that if we differentiate y with respect to x, we obtain y/ x = 1 1 represents how much y will change for a fixed change in x Increase in income for more education Change in crime or bankruptcy when casinos are opened Increase in test score if you study more Suppose a state is experiencing a significant budget shortfall Short-term solution raise tax on cigarettes by 35 cents/pack Problem a tax hike will reduce consumption (theory of demand) Question for state as taxes are raised, how much will cigarette consumption fall 3 4 1

2 Benefits and Costs of Model Suppose y is a state s per capita consumption of cigarettes x represents taxes on cigarettes Question how much will y fall if x is increased by 35 cents/pack? Note there are many reasons why people smoke cost is but one of them Placed more structure on the model, therefore we can obtain precise statements about the relationship between x and y These statements will be true so long as the hypothesized relationship is true As you place more structure on any model, the chance that the assumptions of the model are correct declines. 5 6 Data Data on state consumption/taxes, states x 2 years = 12 observations Y = per capita consumption Per capita packs per year Scatter Plot: Real taxes per pack vs. Per capita cigarette consumption 51 states, , 12 observations X = tax (State + Federal) in real cents per pack 2 dollars Real tax per pack (State + Federal, 2 $) 8 2

3 3 Scatter Plot: Real taxes per pack vs. Per capita cigarette consumption 51 states, , 12 observations What is i? Per capita packs per year Real tax per pack (State + Federal, 2 $) 9 There are many factors that determine a state s level of cigarette consumption Some of these factors we can measure, but for what ever reason, we do not have data Education, age, income, etc. Some of these factors we cannot measure Dislike of cigarettes, anti-smoking sentiment of your friends/neighbors/relatives i identified what we cannot measure in our model 1 Current smoking rates By demographic group Think of a difference way draw a vertical line at any tax level (e.g., 4 cents). Notice that at this level, there are multiple values of Y that are present Therefore on average, higher taxes will reduce consumption, but it cannot explain all of consumption across states Adults 2.8% Gender Males 23.5% Females 18.1% Age group % % % Race White 2.7% AA 21.1% AI/AN 26.9% Asian 1.7% Hispanic origin Hispanic 14.5% Non hispanic 21.9% Education < HS 28.9% HS 26.4% Some col. 22.1% College+ 8.2% Family Income <$2K 3.3% $2-$35K 27.2% $35-$55K 22.1% $55-$75K 18.3% $>75K 13.5%

4 Performance in the Olympics We can however estimate values of i by estimating values of β and 1. Estimates have hats : ˆ and ˆ 1 Our goal, is to choose values for ˆ and ˆ 1in an optimal way. Requires minimizing some function of the estimated errors associated with the model 13 Medal count in the Olympics is a simple measure of output Countries vary by Size Resources How is performance once we control for these attributes? 14 Ranking by Total Medal Count Ranking by Medals/1 million People medals~k name country medals United States of America USA 14 2 People's Republic of China CHI 88 3 Russian Federation RUS 82 4 Great Britain GB 65 5 Germany GER Japan JAP 38 7 Australia AUS 35 8 France FRE 34 9 Republic of Korea SKOR 28 9 Italy ITA m~a_rank name country medals medals~a Grenada GREN Jamaica JAM Trinidad and Tobago T&T New Zealand NZEL Bahamas BAH Slovenia SLOVE Mongolia MONG Hungary HUN Montenegro MONT Denmark DEN

5 5 ln(medals per 1 Million) vs ln(gdp per capita) 5 ln(medals per 1 Million) vs ln(gdp per capita) Greneda 4 4 Jamaica NZ 3 3 Australia ln(medals per 1 million) ln(medals per 1 million) China GB USA India ln(gdp per capita) ln(gdp per capita) 18 5 ln(medals per 1 Million) vs ln(gdp per capita) Greneda ln(medals per 1 million) India Jamaica NZ Australia GB USA China ln(gdp per capita) 19 Given linear model yi xi 1 i We can predict an level of consumption given parameter values y ˆ ˆ ˆi x i 1 The predicted value will not always be accurate sometimes we will over or under predict the true value Because of the linear relationship between x and y, predictions will lie along a line 2 5

6 y Difference between the actual & predicted value y 1 y ˆ x ˆ ˆi i 1 y ˆ ˆ ˆ ˆ i yi yi xi 1 i if y ˆ ˆ i yi i you underpredict (you did better than expected) ŷ 2 ŷ 1 y 2 ˆ i ˆ i If y ˆ ˆ i yi i you overpredict (you did worse than expected) x 1 x 2 x ln(medals per 1 million) ln(medals per 1 Million) vs ln(gdp per capita) Greneda Jamaica NZ Australia GB USA China Estimation Estimated errors measure what we don t know Want to minimize these errors as much as possible There are N errors in each model Need to select a criteria to somehow minimize all these errors -3 India ln(gdp per capita)

7 Criteria: Least squares Y let ˆ and ˆ be candidate values for 1 the parameters. The estimated error is then y ˆ x ˆ ˆi i i 1 f(x) Objective :minthe sum of squared errors n n 2 ˆ ˆ 2 ˆ i ( i i 1) i 1 i 1 SSE y x 25 X 2 X 1 X 26 Cigarette example Data available on web page state_cig_data.dta Already in a Stata data file To use, Download to a folder Change directory to the folder type use state_cig_data. * dscribe data. desc Contains data from state_cig_data.dta obs: 1,2 vars: 7 28 Aug 28 15:39 size: 29,58 (99.8% of memory free) - storage display value variable name type format label variable label - state str2 %9s 2-digit state code year int %8.g year state_tax float %9.g state tax in cents per pack retail_price float %9.g average retail price, nominal federal_tax byte %8.g federal tax in cents per pack packs_pc float %9.g packs of cigarettes per capita cpi float %9.g consumer price index, 2=

8 . * generate real taxes. gen real_tax=(state_tax+federal_tax)/cpi.. * get means of real tax and per capita consumption. sum packs_pc real_tax Generate real tax By dividing by CPI Variable Obs Mean Std. Dev. Min Max packs_pc real_tax ˆ ˆ ˆ xy y.6115( ) ˆ x. * get correlation coefficient. corr packs_pc real_tax (obs=12) packs_pc real_tax packs_pc 1. real_tax Means Std deviations ˆ y x ˆ 16.6 (56.5)(.937) Correlation coefficient 29 3 Run regression in stata SSM, SSE, SST R 2. * get regression estimate. reg packs_pc real_tax Source SS df MS Number of obs = F( 1, 118) = Model Prob > F =. Residual R-squared = Adj R-squared =.3733 Total Root MSE = Notice that SSE + SSM = SST R 2 = SSM/SST = 351.4/ =.3739 packs_pc Coef. Std. Err. t P> t [95% Conf. Interval] real_tax _cons β and β

9 Scatter Plot: Real taxes per pack vs. Per capita cigarette consumption 51 states, , 12 observations Scatter Plot: Real taxes per pack vs. Per capita cigarette consumption 51 states, , 12 observations Per capita packs per year Per capita packs per year Real tax per pack (State + Federal, 2 $) Real tax per pack (State + Federal, 2 $) Using the results. * output residuals. predict error_packs_pc, residuals Output residuals ˆ For every penny increase in taxes, per capita consumption falls by.94 packs per year A 35 cent increase in taxes will reduce consumption by (35)(.94) = 32 packs per person per year.. * show that the means of the errors are zero. sum error_packs_pc Variable Obs Mean Std. Dev. Min Max error_pack~c e * get correlation between x and error. corr error_packs_pc real_tax (obs=12) error_~c real_tax error_pack~c 1. real_tax Predicted error Has zero mean Predicted error not Correlated with x

10 Example 2 Do better performing teams have higher attendance? Data on wins and average attendance/game for 24 baseball season 3 observations attendance.dta - storage display value variable name type format label variable label - team str13 %13s team city attendance long %12.g avg attendance per game wins int %8.g wins during year -. * get means of wins and payroll. sum wins attendance Variable Obs Mean Std. Dev. Min Max wins attendance * get detailed data for attendance. sum attendance, detail avg attendance per game Percentiles Smallest 1% % % Obs 3 25% Sum of Wgt. 3 5% Mean Largest Std. Dev % % Variance 8.68e+7 95% Skewness % Kurtosis * run simple regression. reg attendance wins Source SS df MS Number of obs = F( 1, 28) = 8.89 Model Prob > F =.59 Residual 1.911e R-squared = Adj R-squared =.2139 Total e Root MSE = attendance Coef. Std. Err. t P> t [95% Conf. Interval] wins _cons ˆ for every addition win, attendance increase by 31 / game. An addition1 wins put 31.5 more people in seats 4 1

11 . * output predicted attendance. predict pred_att, xb.. list team wins attendance pred_att team wins attend~e pred_att Seattle New York-AL Baltimore Boston Cleveland Texas Anaheim Oakland Minnesota Chicago--AL Construct a new variable, The predicted value of Y Predicted attendance Plot: Actual and Predicted Attendance 45 degree line Consider Seattle Y p = (31.5)(93) = Actual attendance 42 Example 3: Education and Earnings Stylized fact: log wages or earnings is linear in education (above a certain range) Interpreted as a return to education Theoretical models why this would be the case Linear model: y=ln(weekly wages) endogenous variable x=years of education exogenous factor y i = + x i 1 + i 43 Notice that 1 has a different interpretation 1 =dy/dx In this case, y=ln(wages) dln(wages)/dx = (1/wages)dWages/dX dwages/wages = % change in changes (change in wages over base wages) when the endogenous variable is a natural log, 1 =dy/dx is interpreted as % change in y for a unit change in x 44 11

12 Data cps87.dta 19,96 observations from 1987 Current Population Survey on Full time (>3 hours) Males Aged % of sample 8% 7% 6% 5% 4% 3% 2% Weekly Earnings, Adult Males, 2 Census 1% 45 % Weekly Earnings 46 6% ln(wekly Earnings), Adult Males, 2 Census Average ln(weekly Wages) vs Educ % of Sample 5% 4% 3% 2% 1% Average ln(weekly Wages) Years of Education % ln(weekly Earnings)

13 ln(weekly Wages) ln(weekly Wages) vs. Education Years of Education. * construct ln weekly earnings. gen ln_weekly_earn=ln(weekly_earn).. * get descriptive information. sum weekly_earn ln_weekly_earn years_educ Generate ln(weekly earnings) Variable Obs Mean Std. Dev. Min Max weekly_earn ln_weekly_~n years_educ Means of the variables * run simple regression. reg ln_weekly_earn years_educ Source SS df MS Number of obs = F( 1, 1994) = Model Prob > F =. Residual R-squared = Adj R-squared =.163 Total Root MSE = ln_weekly_~n Coef. Std. Err. t P> t [95% Conf. Interval] years_educ _cons Example 4:London Olympics * generate measure of medals per person; * divide by 1,, people; gen medals_capita=medals/(population/1); label var medals_capita "medals per 1,, people"; * take natural ln of medals_capita and gdp_capita; gen ln_medals_capita=ln(medals_capita); gen ln_gdp_capita=ln(gdp_capita); * regress ln(medals/population) on ln(gdp/population); reg ln_medals_capita ln_gdp_capita;

14 Example 4: London Olympics. * regress ln(medals/population) on ln(gdp/population);. reg ln_medals_capita ln_gdp_capita; Source SS df MS Number of obs = F( 1, 83) = Model Prob > =.1 F Residual R-squared = Adj R-squared =.1677 Total Root MSE = ln_medals_~a Coef. Std. Err. t P> t [95% Conf. Interval] ln_gdp_cap~a _cons ln medals per 1 million people ln(medals/population) vs ln(gdp/population) GREN JAM T&TBAH NZEL MONG MONT HUN SLOVE GEO LITH CROA EST DEN AUS CUB BEL AZE CYPGBIRE NETH QUA ARM LATV CZE KAZ SLOVA BAH SWE TAI NOR MOLD SERB GABRUS SKOR PR FRECAN GER FIN UKR BOTS ROM ITA SWI ESP USA SING TUN JAP NKOR KEN BULG POL BEL KUW DREP IRAN COL GRE TAJ UZE SAFR HK ARG ETH BRA POR GUAT CHI MALA MEX TUR AFGH THAI MORO VEN SAUD UGA EGY ALG IND ln of medals per 1,, Fitted values INDO ln of gdp per capita ln medals per 1 million people ETH ln(medals/population) vs ln(gdp/population) NKOR IND EGY INDO JAM CUB CHI GREN MEX VEN SKOR SAUD AUS GB QUA NOR CAN USA ln of gdp per capita ln of medals per 1,, Fitted values 55 14

The Multivariate Regression Model

The Multivariate Regression Model The Multivariate Regression Model Example Determinants of College GPA Sample of 4 Freshman Collect data on College GPA (4.0 scale) Look at importance of ACT Consider the following model CGPA ACT i 0 i

More information

ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2011

ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2011 ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2011 Instructions: Answer all five (5) questions. Point totals for each question are given in parentheses. The parts within each

More information

Florida. Difference-in-Difference Models 8/23/2016

Florida. Difference-in-Difference Models 8/23/2016 Florida Difference-in-Difference Models Bill Evans Health Economics 8/25/1997, State of Florida settles out of court in their suits against tobacco manufacturers Awarded $13 billion over 25 years Use $200m

More information

Application: Effects of Job Training Program (Data are the Dehejia and Wahba (1999) version of Lalonde (1986).)

Application: Effects of Job Training Program (Data are the Dehejia and Wahba (1999) version of Lalonde (1986).) Application: Effects of Job Training Program (Data are the Dehejia and Wahba (1999) version of Lalonde (1986).) There are two data sets; each as the same treatment group of 185 men. JTRAIN2 includes 260

More information

SOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian Two Wave Panel Data Analysis

SOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian Two Wave Panel Data Analysis SOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian Two Wave Panel Data Analysis In any longitudinal analysis, we can distinguish between analyzing trends vs individual change that is, model

More information

ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2014

ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2014 ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2014 Instructions: Answer all five (5) questions. Point totals for each question are given in parentheses. The parts within each

More information

Midterm Exam. Friday the 29th of October, 2010

Midterm Exam. Friday the 29th of October, 2010 Midterm Exam Friday the 29th of October, 2010 Name: General Comments: This exam is closed book. However, you may use two pages, front and back, of notes and formulas. Write your answers on the exam sheets.

More information

* STATA.OUTPUT -- Chapter 5

* STATA.OUTPUT -- Chapter 5 * STATA.OUTPUT -- Chapter 5.*bwt/confounder example.infile bwt smk gest using bwt.data.correlate (obs=754) bwt smk gest -------------+----- bwt 1.0000 smk -0.1381 1.0000 gest 0.3629 0.0000 1.0000.regress

More information

HAS SUSTAINED GROWTH DECOUPLED FROM INDUSTRIALIZATION? Dani Rodrik February 2014

HAS SUSTAINED GROWTH DECOUPLED FROM INDUSTRIALIZATION? Dani Rodrik February 2014 HAS SUSTAINED GROWTH DECOUPLED FROM INDUSTRIALIZATION? Dani Rodrik February 2014 Recent growth performance in the developing world Growth performance of country groups since 1980 10.0% 1980-1990 8.0% 1990-2000

More information

Table. XTMIXED Procedure in STATA with Output Systolic Blood Pressure, use "k:mydirectory,

Table. XTMIXED Procedure in STATA with Output Systolic Blood Pressure, use k:mydirectory, Table XTMIXED Procedure in STATA with Output Systolic Blood Pressure, 2001. use "k:mydirectory,. xtmixed sbp nage20 nage30 nage40 nage50 nage70 nage80 nage90 winter male dept2 edu_bachelor median_household_income

More information

PSC 508. Jim Battista. Dummies. Univ. at Buffalo, SUNY. Jim Battista PSC 508

PSC 508. Jim Battista. Dummies. Univ. at Buffalo, SUNY. Jim Battista PSC 508 PSC 508 Jim Battista Univ. at Buffalo, SUNY Dummies Dummy variables Sometimes we want to include categorical variables in our models Numerical variables that don t necessarily have any inherent order and

More information

Categorical Data Analysis

Categorical Data Analysis Categorical Data Analysis Hsueh-Sheng Wu Center for Family and Demographic Research October 4, 200 Outline What are categorical variables? When do we need categorical data analysis? Some methods for categorical

More information

OECD experience with strenghthening fiscal capacities for subnational public investment

OECD experience with strenghthening fiscal capacities for subnational public investment Leopolis Hall Prospekt Svobody, L viv, Ukraine 22 June 2017 OECD experience with strenghthening fiscal capacities for subnational public investment Isabelle CHATRY Project manager, subnational finances

More information

Example Analysis with STATA

Example Analysis with STATA Example Analysis with STATA Exploratory Data Analysis Means and Variance by Time and Group Correlation Individual Series Derived Variable Analysis Fitting a Line to Each Subject Summarizing Slopes by Group

More information

Guideline on evaluating the impact of policies -Quantitative approach-

Guideline on evaluating the impact of policies -Quantitative approach- Guideline on evaluating the impact of policies -Quantitative approach- 1 2 3 1 The term treatment derives from the medical sciences and has more meaning when is used in that context. However, this term

More information

Example Analysis with STATA

Example Analysis with STATA Example Analysis with STATA Exploratory Data Analysis Means and Variance by Time and Group Correlation Individual Series Derived Variable Analysis Fitting a Line to Each Subject Summarizing Slopes by Group

More information

Group Comparisons: Using What If Scenarios to Decompose Differences Across Groups

Group Comparisons: Using What If Scenarios to Decompose Differences Across Groups Group Comparisons: Using What If Scenarios to Decompose Differences Across Groups Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 15, 2015 We saw that the

More information

Soci Statistics for Sociologists

Soci Statistics for Sociologists University of North Carolina Chapel Hill Soci708-001 Statistics for Sociologists Fall 2009 Professor François Nielsen Stata Commands for Module 11 Multiple Regression For further information on any command

More information

for var trstprl trstlgl trstplc trstplt trstep: reg X trust10 stfeco yrbrn hinctnt edulvl pltcare polint wrkprty

for var trstprl trstlgl trstplc trstplt trstep: reg X trust10 stfeco yrbrn hinctnt edulvl pltcare polint wrkprty for var trstprl trstlgl trstplc trstplt trstep: reg X trust10 stfeco yrbrn hinctnt edulvl pltcare polint wrkprty -> reg trstprl trust10 stfeco yrbrn hinctnt edulvl pltcare polint wrkprty Source SS df MS

More information

You can find the consultant s raw data here:

You can find the consultant s raw data here: Problem Set 1 Econ 475 Spring 2014 Arik Levinson, Georgetown University 1 [Travel Cost] A US city with a vibrant tourist industry has an industrial accident (a spill ) The mayor wants to sue the company

More information

Introduction of STATA

Introduction of STATA Introduction of STATA News: There is an introductory course on STATA offered by CIS Description: Intro to STATA On Tue, Feb 13th from 4:00pm to 5:30pm in CIT 269 Seats left: 4 Windows, 7 Macintosh For

More information

Bios 312 Midterm: Appendix of Results March 1, Race of mother: Coded as 0==black, 1==Asian, 2==White. . table race white

Bios 312 Midterm: Appendix of Results March 1, Race of mother: Coded as 0==black, 1==Asian, 2==White. . table race white Appendix. Use these results to answer 2012 Midterm questions Dataset Description Data on 526 infants with very low (

More information

Notes on PS2

Notes on PS2 17.871 - Notes on PS2 Mike Sances MIT April 2, 2012 Mike Sances (MIT) 17.871 - Notes on PS2 April 2, 2012 1 / 9 Interpreting Regression: Coecient regress success_rate dist Source SS df MS Number of obs

More information

Why Bother? What Matters? If we have data, let s look at data. If all we have are opinions, let s go with mine - Jim Barksdale, former CEO of Netscape

Why Bother? What Matters? If we have data, let s look at data. If all we have are opinions, let s go with mine - Jim Barksdale, former CEO of Netscape Our objective: a scientific approach to measuring digital money readiness and quantifying associated benefits 1 Why Bother? 2 What Matters? If we have data, let s look at data. If all we have are opinions,

More information

The study obtains the following results: Homework #2 Basics of Logistic Regression Page 1. . version 13.1

The study obtains the following results: Homework #2 Basics of Logistic Regression Page 1. . version 13.1 Soc 73994, Homework #2: Basics of Logistic Regression Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 14, 2018 All answers should be typed and mailed to

More information

Week 10: Heteroskedasticity

Week 10: Heteroskedasticity Week 10: Heteroskedasticity Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline The problem of (conditional)

More information

Eco311, Final Exam, Fall 2017 Prof. Bill Even. Your Name (Please print) Directions. Each question is worth 4 points unless indicated otherwise.

Eco311, Final Exam, Fall 2017 Prof. Bill Even. Your Name (Please print) Directions. Each question is worth 4 points unless indicated otherwise. Your Name (Please print) Directions Each question is worth 4 points unless indicated otherwise. Place all answers in the space provided below or within each question. Round all numerical answers to the

More information

Applied Econometrics

Applied Econometrics Applied Econometrics Lecture 3 Nathaniel Higgins ERS and JHU 20 September 2010 Outline of today s lecture Schedule and Due Dates Making OLS make sense Uncorrelated X s Correlated X s Omitted variable bias

More information

Foley Retreat Research Methods Workshop: Introduction to Hierarchical Modeling

Foley Retreat Research Methods Workshop: Introduction to Hierarchical Modeling Foley Retreat Research Methods Workshop: Introduction to Hierarchical Modeling Amber Barnato MD MPH MS University of Pittsburgh Scott Halpern MD PhD University of Pennsylvania Learning objectives 1. List

More information

Lecture 2a: Model building I

Lecture 2a: Model building I Epidemiology/Biostats VHM 812/802 Course Winter 2015, Atlantic Veterinary College, PEI Javier Sanchez Lecture 2a: Model building I Index Page Predictors (X variables)...2 Categorical predictors...2 Indicator

More information

Chapter 2 Part 1B. Measures of Location. September 4, 2008

Chapter 2 Part 1B. Measures of Location. September 4, 2008 Chapter 2 Part 1B Measures of Location September 4, 2008 Class will meet in the Auditorium except for Tuesday, October 21 when we meet in 102a. Skill set you should have by the time we complete Chapter

More information

Unit 2 Regression and Correlation 2 of 2 - Practice Problems SOLUTIONS Stata Users

Unit 2 Regression and Correlation 2 of 2 - Practice Problems SOLUTIONS Stata Users Unit 2 Regression and Correlation 2 of 2 - Practice Problems SOLUTIONS Stata Users Data Set for this Assignment: Download from the course website: Stata Users: framingham_1000.dta Source: Levy (1999) National

More information

İnsan Tunalı November 29, 2018 Econ 511: Econometrics I. ANSWERS TO ASSIGNMENT 10: Part II STATA Supplement

İnsan Tunalı November 29, 2018 Econ 511: Econometrics I. ANSWERS TO ASSIGNMENT 10: Part II STATA Supplement İnsan Tunalı November 29, 2018 Econ 511: Econometrics I STATA Exercise 1 ANSWERS TO ASSIGNMENT 10: Part II STATA Supplement TASK 1: --- name: log: g:\econ511\heter_housinglog log type: text opened

More information

The Effect of Occupational Danger on Individuals Wage Rates. A fundamental problem confronting the implementation of many healthcare

The Effect of Occupational Danger on Individuals Wage Rates. A fundamental problem confronting the implementation of many healthcare The Effect of Occupational Danger on Individuals Wage Rates Jonathan Lee Econ 170-001 Spring 2003 PID: 703969503 A fundamental problem confronting the implementation of many healthcare policies is the

More information

Multilevel/ Mixed Effects Models: A Brief Overview

Multilevel/ Mixed Effects Models: A Brief Overview Multilevel/ Mixed Effects Models: A Brief Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised March 27, 2018 These notes borrow very heavily, often/usually

More information

APPENDIX C Questions from Hofstede s survey used to identify individualism (source Exhibit 5.11 in Hofstede (2001)):

APPENDIX C Questions from Hofstede s survey used to identify individualism (source Exhibit 5.11 in Hofstede (2001)): APPENDIX C Questions from Hofstede s survey used to identify individualism (source Exhibit 5.11 in Hofstede (2001)): 1. Have challenging work to do work from which you can get a personal sense of accomplishment

More information

Three Step Approach. Our new unified meta-theory theory of Competence. Cultural Differences affect business in many different ways. A model of culture

Three Step Approach. Our new unified meta-theory theory of Competence. Cultural Differences affect business in many different ways. A model of culture Three Step Approach Managing Diversity in Global Business Dimensions & Process Awareness Increase awareness of one s own cultural perspective. Respect Appreciate cultural differences. Reconciliation Resolve

More information

Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian. Preliminary Data Screening

Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian. Preliminary Data Screening r's age when 1st child born 2 4 6 Density.2.4.6.8 Density.5.1 Sociology 774: Regression Models for Categorical Data Instructor: Natasha Sarkisian Preliminary Data Screening A. Examining Univariate Normality

More information

17.871: PS3 Key. Part I

17.871: PS3 Key. Part I 17.871: PS3 Key Part I. use "cces12.dta", clear. reg CC424 CC334A [aweight=v103] if CC334A!= 8 & CC424 < 6 // Need to remove values that do not fit on the linear scale. This entails discarding all respondents

More information

Week 11: Collinearity

Week 11: Collinearity Week 11: Collinearity Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Regression and holding other

More information

Analyzing CHIS Data Using Stata

Analyzing CHIS Data Using Stata Analyzing CHIS Data Using Stata Christine Wells UCLA IDRE Statistical Consulting Group February 2014 Christine Wells Analyzing CHIS Data Using Stata 1/ 34 The variables bmi p: BMI povll2: Poverty level

More information

COMPARING MODEL ESTIMATES: THE LINEAR PROBABILITY MODEL AND LOGISTIC REGRESSION

COMPARING MODEL ESTIMATES: THE LINEAR PROBABILITY MODEL AND LOGISTIC REGRESSION PLS 802 Spring 2018 Professor Jacoby COMPARING MODEL ESTIMATES: THE LINEAR PROBABILITY MODEL AND LOGISTIC REGRESSION This handout shows the log of a STATA session that compares alternative estimates of

More information

Exploring Functional Forms: NBA Shots. NBA Shots 2011: Success v. Distance. . bcuse nbashots11

Exploring Functional Forms: NBA Shots. NBA Shots 2011: Success v. Distance. . bcuse nbashots11 NBA Shots 2011: Success v. Distance. bcuse nbashots11 Contains data from http://fmwww.bc.edu/ec-p/data/wooldridge/nbashots11.dta obs: 199,119 vars: 15 25 Oct 2012 09:08 size: 24,690,756 ------------- storage

More information

All analysis examples presented can be done in Stata 10.1 and are included in this chapter s output.

All analysis examples presented can be done in Stata 10.1 and are included in this chapter s output. Chapter 9 Stata v10.1 Analysis Examples Syntax and Output General Notes on Stata 10.1 Given that this tool is used throughout the ASDA textbook this chapter includes only the syntax and output for the

More information

Topics in Biostatistics Categorical Data Analysis and Logistic Regression, part 2. B. Rosner, 5/09/17

Topics in Biostatistics Categorical Data Analysis and Logistic Regression, part 2. B. Rosner, 5/09/17 Topics in Biostatistics Categorical Data Analysis and Logistic Regression, part 2 B. Rosner, 5/09/17 1 Outline 1. Testing for effect modification in logistic regression analyses 2. Conditional logistic

More information

. *increase the memory or there will problems. set memory 40m (40960k)

. *increase the memory or there will problems. set memory 40m (40960k) Exploratory Data Analysis on the Correlation Structure In longitudinal data analysis (and multi-level data analysis) we model two key components of the data: 1. Mean structure. Correlation structure (after

More information

The relationship between innovation and economic growth in emerging economies

The relationship between innovation and economic growth in emerging economies Mladen Vuckovic The relationship between innovation and economic growth in emerging economies 130 - Organizational Response To Globally Driven Institutional Changes Abstract This paper will investigate

More information

Two Way ANOVA. Turkheimer PSYC 771. Page 1 Two-Way ANOVA

Two Way ANOVA. Turkheimer PSYC 771. Page 1 Two-Way ANOVA Page 1 Two Way ANOVA Two way ANOVA is conceptually like multiple regression, in that we are trying to simulateously assess the effects of more than one X variable on Y. But just as in One Way ANOVA, the

More information

PREDICTIVE MODEL OF TOTAL INCOME FROM SALARIES/WAGES IN THE CONTEXT OF PASAY CITY

PREDICTIVE MODEL OF TOTAL INCOME FROM SALARIES/WAGES IN THE CONTEXT OF PASAY CITY Page22 PREDICTIVE MODEL OF TOTAL INCOME FROM SALARIES/WAGES IN THE CONTEXT OF PASAY CITY Wilson Cordova wilson.cordova@cksc.edu.ph Chiang Kai Shek College, Philippines Abstract There are varied sources

More information

log: F:\stata_parthenope_01.smcl opened on: 17 Mar 2012, 18:21:56

log: F:\stata_parthenope_01.smcl opened on: 17 Mar 2012, 18:21:56 log: F:\stata_parthenope_01.smcl opened on: 17 Mar 2012, 18:21:56 (20 cities >100k pop). de obs: 20 20 cities >100k pop vars: 13 size: 1,040 storage display value variable name type format label variable

More information

Timing Production Runs

Timing Production Runs Class 7 Categorical Factors with Two or More Levels 189 Timing Production Runs ProdTime.jmp An analysis has shown that the time required in minutes to complete a production run increases with the number

More information

Work in the Age of Robots and AI

Work in the Age of Robots and AI Work in the Age of Robots and AI Christopher Pissarides Regius Professor London School of Economics IZA 20 th Anniversary Celebration Berlin, 28-29 June 2018 Road map 1. Robots, AI and their industrial

More information

Lecture 3: Producer-Level Analysis; French Firms

Lecture 3: Producer-Level Analysis; French Firms Lecture 3: Producer-Level Analysis; French Firms Samuel Kortum April 13, 2007 An Anatomy of International Trade Joint project with Jonathan Eaton and Francis Kramarz. Technologies Countries: i =1,...N.

More information

ICT AND DEMAND FOR LABOUR: DOES AGE MATTER?

ICT AND DEMAND FOR LABOUR: DOES AGE MATTER? ICT AND DEMAND FOR LABOUR: DOES AGE MATTER? Or: a bit of an outsider s view Michael Polder, OECD-STI/EAS Expert consultation on Active ageing in the digital economy 1-2 September 215, Harris-Manchester

More information

TRENDS IN FOREIGN TRADE OF THE PRIMARY AND SECONDARY WOOD PRODUCTS. Maja Moro, Darko Motik, Ksenija Šegotić, Andreja Pirc Barčić

TRENDS IN FOREIGN TRADE OF THE PRIMARY AND SECONDARY WOOD PRODUCTS. Maja Moro, Darko Motik, Ksenija Šegotić, Andreja Pirc Barčić TRENDS IN FOREIGN TRADE OF THE PRIMARY AND SECONDARY WOOD PRODUCTS Maja Moro, Darko Motik, Ksenija Šegotić, Andreja Pirc Barčić INTRODUCTION The imminent entry of Croatia into the full membership of the

More information

Unit 6: Simple Linear Regression Lecture 2: Outliers and inference

Unit 6: Simple Linear Regression Lecture 2: Outliers and inference Unit 6: Simple Linear Regression Lecture 2: Outliers and inference Statistics 101 Thomas Leininger June 18, 2013 Types of outliers in linear regression Types of outliers How do(es) the outlier(s) influence

More information

Plan of the presentation

Plan of the presentation Organisation for Economic Co-operation and Development Over-Qualified or Under-Skilled? Glenda Quintini OECD DELSA, Employment Analysis and Policy Division OECD Directorate for Employment, Labour and Social

More information

AcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved.

AcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved. AcaStat How To Guide AcaStat Software Copyright 2016, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents Frequencies... 3 List Variables... 4 Descriptives... 5 Explore Means...

More information

This is a quick-and-dirty example for some syntax and output from pscore and psmatch2.

This is a quick-and-dirty example for some syntax and output from pscore and psmatch2. This is a quick-and-dirty example for some syntax and output from pscore and psmatch2. It is critical that when you run your own analyses, you generate your own syntax. Both of these procedures have very

More information

Biostatistics 208. Lecture 1: Overview & Linear Regression Intro.

Biostatistics 208. Lecture 1: Overview & Linear Regression Intro. Biostatistics 208 Lecture 1: Overview & Linear Regression Intro. Steve Shiboski Division of Biostatistics, UCSF January 8, 2019 1 Organization Office hours by appointment (Mission Hall 2540) E-mail to

More information

Automobility in Brazil, Russia, India, and China A collaborative study by the RAND Corporation and the Institute for Mobility Research

Automobility in Brazil, Russia, India, and China A collaborative study by the RAND Corporation and the Institute for Mobility Research Automobility in Brazil, Russia, India, and China A collaborative study by the RAND Corporation and the Institute for Mobility Research Charlene Rohr Presentation to the Konrad Adenauer Foundation 18 February

More information

ECON Introductory Econometrics Seminar 6

ECON Introductory Econometrics Seminar 6 ECON4150 - Introductory Econometrics Seminar 6 Stock and Watson EE10.1 April 28, 2015 Stock and Watson EE10.1 ECON4150 - Introductory Econometrics Seminar 6 April 28, 2015 1 / 21 Guns data set Some U.S.

More information

Trunkierte Regression: simulierte Daten

Trunkierte Regression: simulierte Daten Trunkierte Regression: simulierte Daten * Datengenerierung set seed 26091952 set obs 48 obs was 0, now 48 gen age=_n+17 gen yhat=2000+200*(age-18) gen wage = yhat + 2000*invnorm(uniform()) replace wage=max(0,wage)

More information

!! NOTE: SAS Institute Inc., SAS Campus Drive, Cary, NC USA ! NOTE: The SAS System used:!

!! NOTE: SAS Institute Inc., SAS Campus Drive, Cary, NC USA ! NOTE: The SAS System used:! 1 The SAS System NOTE: Copyright (c) 2002-2010 by SAS Institute Inc., Cary, NC, USA. NOTE: SAS (r) Proprietary Software 9.3 (TS1M0) Licensed to UNIVERSITY OF TORONTO/COMPUTING & COMMUNICATIONS, Site 70072784.

More information

Building a Dynamic European Labour Market

Building a Dynamic European Labour Market Building a Dynamic European Labour Market Christopher A Pissarides London School of Economics Jobs for Europe: The Employment Policy Conference Brussels, 6-7 September 2012 1 A dynamic labour market A

More information

Compartmental Pharmacokinetic Analysis. Dr Julie Simpson

Compartmental Pharmacokinetic Analysis. Dr Julie Simpson Compartmental Pharmacokinetic Analysis Dr Julie Simpson Email: julieas@unimelb.edu.au BACKGROUND Describes how the drug concentration changes over time using physiological parameters. Gut compartment Absorption,

More information

ROBUST ESTIMATION OF STANDARD ERRORS

ROBUST ESTIMATION OF STANDARD ERRORS ROBUST ESTIMATION OF STANDARD ERRORS -- log: Z:\LDA\DataLDA\sitka_Lab8.log log type: text opened on: 18 Feb 2004, 11:29:17. ****The observed mean responses in each of the 4 chambers; for 1988 and 1989.

More information

Interpreting and Visualizing Regression models with Stata Margins and Marginsplot. Boriana Pratt May 2017

Interpreting and Visualizing Regression models with Stata Margins and Marginsplot. Boriana Pratt May 2017 Interpreting and Visualizing Regression models with Stata Margins and Marginsplot Boriana Pratt May 2017 Interpreting regression models Often regression results are presented in a table format, which makes

More information

Nonparametric Synthetic Control Method for program evaluation: Model and Stata implementation

Nonparametric Synthetic Control Method for program evaluation: Model and Stata implementation 23rd London Stata Users Group Meeting 7-8 September 2017 Cass Business School, London, UK Nonparametric Synthetic Control Method for program evaluation: Model and Stata implementation Giovanni Cerulli

More information

Diego Restuccia. University of Toronto and NBER

Diego Restuccia. University of Toronto and NBER Discussion of Are There Too Many Farms in the World? Labor-Market Transaction Costs, Machine Capacities and Optimal Farm Size, by Foster and Rosenzweig Diego Restuccia University of Toronto and NBER NBER

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Exam: ECON4137 Applied Micro Econometrics Date of exam: Thursday, May 31, 2018 Grades are given: June 15, 2018 Time for exam: 09.00 to 12.00 The problem set covers

More information

Structural Transformation and Growth

Structural Transformation and Growth Structural Transformation and Growth Diego Restuccia University of Toronto and NBER University of Oslo August 4-8, 27 Restuccia Macro Growth and Development University of Oslo / 49 Overview Structural

More information

IAEA Activities in LTO Area

IAEA Activities in LTO Area Activities in LTO Area Robert Krivanek R.Krivanek@iaea.org Project manager for LTO Division of Nuclear Installation Safety International Atomic Energy Agency Contents 1. SALTO (Safety Aspects of LTO) Peer

More information

The International Communications Market Post

The International Communications Market Post The International Communications Market 2016 7 7 Post 179 Contents 7.1 Post: overview and key market developments 181 Overview 181 Parcel volumes continue to grow 183 Consumers in most countries shop online

More information

Read and Describe the SENIC Data

Read and Describe the SENIC Data Read and Describe the SENIC Data If the data come in an Excel spreadsheet (very common), blanks are ideal for missing values. The spreadsheet must be.xls, not.xlsx. Beware of trying to read a.csv file

More information

Milk Data Analysis. 1. Objective: analyzing protein milk data using STATA.

Milk Data Analysis. 1. Objective: analyzing protein milk data using STATA. 1. Objective: analyzing protein milk data using STATA. 2. Dataset: Protein milk data set (in the class website) Data description: Percentage protein content of milk samples at weekly intervals from each

More information

DEVELOPING COUNTRIES PARTICIPATION IN GLOBAL VALUE CHAINS IMPLICATIONS FOR TRADE AND TRADE- RELATED POLICIES SELECTED PRELIMINARY RESULTS

DEVELOPING COUNTRIES PARTICIPATION IN GLOBAL VALUE CHAINS IMPLICATIONS FOR TRADE AND TRADE- RELATED POLICIES SELECTED PRELIMINARY RESULTS DEVELOPING COUNTRIES PARTICIPATION IN GLOBAL VALUE CHAINS IMPLICATIONS FOR TRADE AND TRADE- RELATED POLICIES SELECTED PRELIMINARY RESULTS By Przemek Kowalski, OECD Trade and Agriculture Directorate Asian

More information

Using Stata 11 & higher for Logistic Regression Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised March 28, 2015

Using Stata 11 & higher for Logistic Regression Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised March 28, 2015 Using Stata 11 & higher for Logistic Regression Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised March 28, 2015 NOTE: The routines spost13, lrdrop1, and extremes

More information

ECON Introductory Econometrics Seminar 9

ECON Introductory Econometrics Seminar 9 ECON4150 - Introductory Econometrics Seminar 9 Stock and Watson EE13.1 May 4, 2015 Stock and Watson EE13.1 ECON4150 - Introductory Econometrics Seminar 9 May 4, 2015 1 / 18 Empirical exercise E13.1: Data

More information

SUGGESTED SOLUTIONS Winter Problem Set #1: The results are attached below.

SUGGESTED SOLUTIONS Winter Problem Set #1: The results are attached below. 450-2 Winter 2008 Problem Set #1: SUGGESTED SOLUTIONS The results are attached below. 1. The balanced panel contains larger firms (sales 120-130% bigger than the full sample on average), which are more

More information

Problem Points Score USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT

Problem Points Score USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT STAT 512 EXAM I STAT 512 Name (7 pts) Problem Points Score 1 40 2 25 3 28 USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT WRITE LEGIBLY. ANYTHING UNREADABLE WILL NOT BE GRADED GOOD LUCK!!!!

More information

Stata v 12 Illustration. One Way Analysis of Variance

Stata v 12 Illustration. One Way Analysis of Variance Stata v 12 Illustration Page 1. Preliminary Download anovaplot.. 2. Descriptives Graphs. 3. Descriptives Numerical 4. Assessment of Normality.. 5. Analysis of Variance Model Estimation.. 6. Tests of Equality

More information

Stata Program Notes Biostatistics: A Guide to Design, Analysis, and Discovery Second Edition Chapter 12: Analysis of Variance

Stata Program Notes Biostatistics: A Guide to Design, Analysis, and Discovery Second Edition Chapter 12: Analysis of Variance Stata Program Notes Biostatistics: A Guide to Design, Analysis, and Discovery Second Edition Chapter 12: Analysis of Variance Program Note 12.1 - One-Way ANOVA and Multiple Comparisons The Stata command

More information

Factors Affecting Internet Development

Factors Affecting Internet Development Factors Affecting Internet Development 1 Factors Affecting Internet Development Prepared For: Dr. Abdul H. Chowdhury Business Statistics BUS 511 MBA Program, Summer 2009 Prepared By: Shahpar Sultana ID

More information

Intensive and Extensive Land Supply Response to Agricultural Crop Prices. Zabid Iqbal. December 6, 2017

Intensive and Extensive Land Supply Response to Agricultural Crop Prices. Zabid Iqbal. December 6, 2017 Intensive and Extensive Land Supply Response to Agricultural Crop Prices Zabid Iqbal December 6, 2017 1 Intensive and Extensive Land Supply Response to Agricultural Crop Prices Outline Contribution Background/Motivation/Existing

More information

What can be Gained From International Analyses? An Economic Perspective

What can be Gained From International Analyses? An Economic Perspective What can be Gained From International Analyses? An Economic Perspective Guido Schwerdt (University of Konstanz and Ifo Institut) Kickoff of the European Training Network OCCAM TU Dortmund University 22

More information

F u = t n+1, t f = 1994, 2005

F u = t n+1, t f = 1994, 2005 Forecasting an Electric Utility's Emissions Using SAS/AF and SAS/STAT Software: A Linear Analysis Md. Azharul Islam, The Ohio State University, Columbus, Ohio. David Wang, The Public Utilities Commission

More information

Number of obs = R-squared = Root MSE = Adj R-squared =

Number of obs = R-squared = Root MSE = Adj R-squared = Appendix for the details of statistical test results Statistical Package used:stata/se 11.1 1. ANOVA result with dependent variable: current level of happiness, independent variables: sexs, ages, and survey

More information

DAY 2 Advanced comparison of methods of measurements

DAY 2 Advanced comparison of methods of measurements EVALUATION AND COMPARISON OF METHODS OF MEASUREMENTS DAY Advanced comparison of methods of measurements Niels Trolle Andersen and Mogens Erlandsen mogens@biostat.au.dk Department of Biostatistics DAY xtmixed:

More information

Examples of Using Stata v11.0 with JRR replicate weights Provided in the NHANES data set

Examples of Using Stata v11.0 with JRR replicate weights Provided in the NHANES data set Examples of Using Stata v110 with JRR replicate weights Provided in the NHANES 1999-2000 data set This document is designed to illustrate comparisons of methods to use JRR replicate weights sometimes provided

More information

Regression Analysis I & II

Regression Analysis I & II Data for this session is available in Data Regression I & II Regression Analysis I & II Quantitative Methods for Business Skander Esseghaier 1 In this session, you will learn: How to read and interpret

More information

ADVANCED ECONOMETRICS I

ADVANCED ECONOMETRICS I ADVANCED ECONOMETRICS I Practice Exercises (1/2) Instructor: Joaquim J. S. Ramalho E.mail: jjsro@iscte-iul.pt Personal Website: http://home.iscte-iul.pt/~jjsro Office: D5.10 Course Website: http://home.iscte-iul.pt/~jjsro/advancedeconometricsi.htm

More information

Codebook for Regional Authority Index (RAI)

Codebook for Regional Authority Index (RAI) Citation for the dataset: Codebook for Regional Authority Index (RAI) Country Scores Dataset version July 2015 Hooghe, Liesbet, Gary Marks, Arjan H. Schakel, Sandra Chapman Osterkatz, Sara Niedzwiecki,

More information

4.3 Nonparametric Tests cont...

4.3 Nonparametric Tests cont... Class #14 Wednesday 2 March 2011 What did we cover last time? Hypothesis Testing Types Student s t-test - practical equations Effective degrees of freedom Parametric Tests Chi squared test Kolmogorov-Smirnov

More information

ARMENIA ELECTRICITY DEMAND FORECAST

ARMENIA ELECTRICITY DEMAND FORECAST ARMENIA ELECTRICITY DEMAND FORECAST ASSISTANCE TO ENERGY SECTOR TO STRENGTHEN ENERGY SECURITY AND REGIONAL INTEGRATION CONTRACT NUMBER EPP-I-08-03-00008-00 NOVEMBER 2012 This publication was produced for

More information

12.C Agricultural Production Targeting

12.C Agricultural Production Targeting 12.C Agricultural Production Targeting Robert A. McDougall 12.C.1 Background Agricultural production targeting is a procedure applied to certain I-O tables before the main data construction phase. Except

More information

Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS

Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 22, 2015 This handout steals heavily

More information

Hofstede's Dimensions of Culture

Hofstede's Dimensions of Culture Hofstede's Dimensions of Culture Geert H. Hofstede was born on October 2, 1928 in Haarlem, the Netherlands. He received his M.Sc. from the Delft Institute of Technology in 1953, his Ph.D. (cum laude) from

More information

A.O. Baranov, V.N. Pavlov, Yu.M. Slepenkova

A.O. Baranov, V.N. Pavlov, Yu.M. Slepenkova 25 th INFORUM World Conference Riga, Latvia, 28 August 1 September 2017 Construction of the Dynamic Input Output Model of Russian Economy with a Human Capital Block and Problems of Its Information Support

More information

Center for Demography and Ecology

Center for Demography and Ecology Center for Demography and Ecology University of Wisconsin-Madison A Comparative Evaluation of Selected Statistical Software for Computing Multinomial Models Nancy McDermott CDE Working Paper No. 95-01

More information