Chapter 2. Linear model. Put some concreteness on problem. The Bivariate Regression Model. Sample of n observations, labeled as i=1,2,..

Similar documents
The Multivariate Regression Model

ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2011

Florida. Difference-in-Difference Models 8/23/2016

Application: Effects of Job Training Program (Data are the Dehejia and Wahba (1999) version of Lalonde (1986).)

SOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian Two Wave Panel Data Analysis

ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2014

Midterm Exam. Friday the 29th of October, 2010

* STATA.OUTPUT -- Chapter 5

HAS SUSTAINED GROWTH DECOUPLED FROM INDUSTRIALIZATION? Dani Rodrik February 2014

Table. XTMIXED Procedure in STATA with Output Systolic Blood Pressure, use "k:mydirectory,

PSC 508. Jim Battista. Dummies. Univ. at Buffalo, SUNY. Jim Battista PSC 508

Categorical Data Analysis

OECD experience with strenghthening fiscal capacities for subnational public investment

Example Analysis with STATA

Guideline on evaluating the impact of policies -Quantitative approach-

Example Analysis with STATA

Group Comparisons: Using What If Scenarios to Decompose Differences Across Groups

Soci Statistics for Sociologists

for var trstprl trstlgl trstplc trstplt trstep: reg X trust10 stfeco yrbrn hinctnt edulvl pltcare polint wrkprty

You can find the consultant s raw data here:

Introduction of STATA

Bios 312 Midterm: Appendix of Results March 1, Race of mother: Coded as 0==black, 1==Asian, 2==White. . table race white

Notes on PS2

Why Bother? What Matters? If we have data, let s look at data. If all we have are opinions, let s go with mine - Jim Barksdale, former CEO of Netscape

The study obtains the following results: Homework #2 Basics of Logistic Regression Page 1. . version 13.1

Week 10: Heteroskedasticity

Eco311, Final Exam, Fall 2017 Prof. Bill Even. Your Name (Please print) Directions. Each question is worth 4 points unless indicated otherwise.

Applied Econometrics

Foley Retreat Research Methods Workshop: Introduction to Hierarchical Modeling

Lecture 2a: Model building I

Chapter 2 Part 1B. Measures of Location. September 4, 2008

Unit 2 Regression and Correlation 2 of 2 - Practice Problems SOLUTIONS Stata Users

İnsan Tunalı November 29, 2018 Econ 511: Econometrics I. ANSWERS TO ASSIGNMENT 10: Part II STATA Supplement

The Effect of Occupational Danger on Individuals Wage Rates. A fundamental problem confronting the implementation of many healthcare

Multilevel/ Mixed Effects Models: A Brief Overview

APPENDIX C Questions from Hofstede s survey used to identify individualism (source Exhibit 5.11 in Hofstede (2001)):

Three Step Approach. Our new unified meta-theory theory of Competence. Cultural Differences affect business in many different ways. A model of culture

Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian. Preliminary Data Screening

17.871: PS3 Key. Part I

Week 11: Collinearity

Analyzing CHIS Data Using Stata

COMPARING MODEL ESTIMATES: THE LINEAR PROBABILITY MODEL AND LOGISTIC REGRESSION

Exploring Functional Forms: NBA Shots. NBA Shots 2011: Success v. Distance. . bcuse nbashots11

All analysis examples presented can be done in Stata 10.1 and are included in this chapter s output.

Topics in Biostatistics Categorical Data Analysis and Logistic Regression, part 2. B. Rosner, 5/09/17

. *increase the memory or there will problems. set memory 40m (40960k)

The relationship between innovation and economic growth in emerging economies

Two Way ANOVA. Turkheimer PSYC 771. Page 1 Two-Way ANOVA

PREDICTIVE MODEL OF TOTAL INCOME FROM SALARIES/WAGES IN THE CONTEXT OF PASAY CITY

log: F:\stata_parthenope_01.smcl opened on: 17 Mar 2012, 18:21:56

Timing Production Runs

Work in the Age of Robots and AI

Lecture 3: Producer-Level Analysis; French Firms

ICT AND DEMAND FOR LABOUR: DOES AGE MATTER?

TRENDS IN FOREIGN TRADE OF THE PRIMARY AND SECONDARY WOOD PRODUCTS. Maja Moro, Darko Motik, Ksenija Šegotić, Andreja Pirc Barčić

Unit 6: Simple Linear Regression Lecture 2: Outliers and inference

Plan of the presentation

AcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved.

This is a quick-and-dirty example for some syntax and output from pscore and psmatch2.

Biostatistics 208. Lecture 1: Overview & Linear Regression Intro.

Automobility in Brazil, Russia, India, and China A collaborative study by the RAND Corporation and the Institute for Mobility Research

ECON Introductory Econometrics Seminar 6

Trunkierte Regression: simulierte Daten

!! NOTE: SAS Institute Inc., SAS Campus Drive, Cary, NC USA ! NOTE: The SAS System used:!

Building a Dynamic European Labour Market

Compartmental Pharmacokinetic Analysis. Dr Julie Simpson

ROBUST ESTIMATION OF STANDARD ERRORS

Interpreting and Visualizing Regression models with Stata Margins and Marginsplot. Boriana Pratt May 2017

Nonparametric Synthetic Control Method for program evaluation: Model and Stata implementation

Diego Restuccia. University of Toronto and NBER

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Structural Transformation and Growth

IAEA Activities in LTO Area

The International Communications Market Post

Read and Describe the SENIC Data

Milk Data Analysis. 1. Objective: analyzing protein milk data using STATA.

DEVELOPING COUNTRIES PARTICIPATION IN GLOBAL VALUE CHAINS IMPLICATIONS FOR TRADE AND TRADE- RELATED POLICIES SELECTED PRELIMINARY RESULTS

Using Stata 11 & higher for Logistic Regression Richard Williams, University of Notre Dame, Last revised March 28, 2015

ECON Introductory Econometrics Seminar 9

SUGGESTED SOLUTIONS Winter Problem Set #1: The results are attached below.

Problem Points Score USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT

Stata v 12 Illustration. One Way Analysis of Variance

Stata Program Notes Biostatistics: A Guide to Design, Analysis, and Discovery Second Edition Chapter 12: Analysis of Variance

Factors Affecting Internet Development

Intensive and Extensive Land Supply Response to Agricultural Crop Prices. Zabid Iqbal. December 6, 2017

What can be Gained From International Analyses? An Economic Perspective

F u = t n+1, t f = 1994, 2005

Number of obs = R-squared = Root MSE = Adj R-squared =

DAY 2 Advanced comparison of methods of measurements

Examples of Using Stata v11.0 with JRR replicate weights Provided in the NHANES data set

Regression Analysis I & II

ADVANCED ECONOMETRICS I

Codebook for Regional Authority Index (RAI)

4.3 Nonparametric Tests cont...

ARMENIA ELECTRICITY DEMAND FORECAST

12.C Agricultural Production Targeting

Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS

Hofstede's Dimensions of Culture

A.O. Baranov, V.N. Pavlov, Yu.M. Slepenkova

Center for Demography and Ecology

Transcription:

Linear model Chapter 2 The Bivariate Regression Model Sample of n observations, labeled as i=1,2,..n y i = + x i 1 + i and 1 are population values represent the true relationship between x and y Unfortunately these values are unknown The job of the researcher is to estimate these values 1 2 Put some concreteness on problem Notice that if we differentiate y with respect to x, we obtain y/ x = 1 1 represents how much y will change for a fixed change in x Increase in income for more education Change in crime or bankruptcy when casinos are opened Increase in test score if you study more Suppose a state is experiencing a significant budget shortfall Short-term solution raise tax on cigarettes by 35 cents/pack Problem a tax hike will reduce consumption (theory of demand) Question for state as taxes are raised, how much will cigarette consumption fall 3 4 1

Benefits and Costs of Model Suppose y is a state s per capita consumption of cigarettes x represents taxes on cigarettes Question how much will y fall if x is increased by 35 cents/pack? Note there are many reasons why people smoke cost is but one of them Placed more structure on the model, therefore we can obtain precise statements about the relationship between x and y These statements will be true so long as the hypothesized relationship is true As you place more structure on any model, the chance that the assumptions of the model are correct declines. 5 6 Data Data on state consumption/taxes, 1981-2 51 states x 2 years = 12 observations Y = per capita consumption Per capita packs per year 3 25 2 15 1 Scatter Plot: Real taxes per pack vs. Per capita cigarette consumption 51 states, 1981-2, 12 observations X = tax (State + Federal) in real cents per pack 2 dollars 7 5 2 4 6 8 1 12 14 16 Real tax per pack (State + Federal, 2 $) 8 2

3 Scatter Plot: Real taxes per pack vs. Per capita cigarette consumption 51 states, 1981-2, 12 observations What is i? Per capita packs per year 25 2 15 1 5 2 4 6 8 1 12 14 16 Real tax per pack (State + Federal, 2 $) 9 There are many factors that determine a state s level of cigarette consumption Some of these factors we can measure, but for what ever reason, we do not have data Education, age, income, etc. Some of these factors we cannot measure Dislike of cigarettes, anti-smoking sentiment of your friends/neighbors/relatives i identified what we cannot measure in our model 1 Current smoking rates By demographic group Think of a difference way draw a vertical line at any tax level (e.g., 4 cents). Notice that at this level, there are multiple values of Y that are present Therefore on average, higher taxes will reduce consumption, but it cannot explain all of consumption across states Adults 2.8% Gender Males 23.5% Females 18.1% Age group 18-44 23.6% 45-64 21.8% 75+ 5.7% Race White 2.7% AA 21.1% AI/AN 26.9% Asian 1.7% Hispanic origin Hispanic 14.5% Non hispanic 21.9% Education < HS 28.9% HS 26.4% Some col. 22.1% College+ 8.2% Family Income <$2K 3.3% $2-$35K 27.2% $35-$55K 22.1% $55-$75K 18.3% $>75K 13.5% 11 12 3

Performance in the Olympics We can however estimate values of i by estimating values of β and 1. Estimates have hats : ˆ and ˆ 1 Our goal, is to choose values for ˆ and ˆ 1in an optimal way. Requires minimizing some function of the estimated errors associated with the model 13 Medal count in the Olympics is a simple measure of output Countries vary by Size Resources How is performance once we control for these attributes? 14 Ranking by Total Medal Count Ranking by Medals/1 million People +----------------------------------------------------------+ medals~k name country medals ---------------------------------------------------------- 1 United States of America USA 14 2 People's Republic of China CHI 88 3 Russian Federation RUS 82 4 Great Britain GB 65 5 Germany GER 44 ---------------------------------------------------------- 6 Japan JAP 38 7 Australia AUS 35 8 France FRE 34 9 Republic of Korea SKOR 28 9 Italy ITA 28 ---------------------------------------------------------- 15 +--------------------------------------------------------------+ m~a_rank name country medals medals~a -------------------------------------------------------------- 1 Grenada GREN 1 95.2381 2 Jamaica JAM 12 44.34873 3 Trinidad and Tobago T&T 4 3.3556 4 New Zealand NZEL 13 29.31665 5 Bahamas BAH 1 28.27591 -------------------------------------------------------------- 6 Slovenia SLOVE 4 19.43748 7 Mongolia MONG 5 17.5887 8 Hungary HUN 17 17.6485 9 Montenegro MONT 1 16.12828 1 Denmark DEN 9 16.11529 -------------------------------------------------------------- 16 4

5 ln(medals per 1 Million) vs ln(gdp per capita) 5 ln(medals per 1 Million) vs ln(gdp per capita) Greneda 4 4 Jamaica NZ 3 3 Australia ln(medals per 1 million) 2 1-1 ln(medals per 1 million) 2 1-1 China GB USA -2-2 -3-3 India -4 5 6 7 8 9 1 11 12 ln(gdp per capita) 17-4 5 6 7 8 9 1 11 12 ln(gdp per capita) 18 5 ln(medals per 1 Million) vs ln(gdp per capita) Greneda ln(medals per 1 million) 4 3 2 1-1 -2-3 India Jamaica NZ Australia GB USA China -4 5 6 7 8 9 1 11 12 ln(gdp per capita) 19 Given linear model yi xi 1 i We can predict an level of consumption given parameter values y ˆ ˆ ˆi x i 1 The predicted value will not always be accurate sometimes we will over or under predict the true value Because of the linear relationship between x and y, predictions will lie along a line 2 5

y Difference between the actual & predicted value y 1 y ˆ x ˆ ˆi i 1 y ˆ ˆ ˆ ˆ i yi yi xi 1 i if y ˆ ˆ i yi i you underpredict (you did better than expected) ŷ 2 ŷ 1 y 2 ˆ i ˆ i If y ˆ ˆ i yi i you overpredict (you did worse than expected) x 1 x 2 x 21 22 ln(medals per 1 million) 5 4 3 2 1-1 -2 ln(medals per 1 Million) vs ln(gdp per capita) Greneda Jamaica NZ Australia GB USA China Estimation Estimated errors measure what we don t know Want to minimize these errors as much as possible There are N errors in each model Need to select a criteria to somehow minimize all these errors -3 India -4 5 6 7 8 9 1 11 12 ln(gdp per capita) 23 24 6

Criteria: Least squares Y let ˆ and ˆ be candidate values for 1 the parameters. The estimated error is then y ˆ x ˆ ˆi i i 1 f(x) Objective :minthe sum of squared errors n n 2 ˆ ˆ 2 ˆ i ( i i 1) i 1 i 1 SSE y x 25 X 2 X 1 X 26 Cigarette example Data available on web page state_cig_data.dta Already in a Stata data file To use, Download to a folder Change directory to the folder type use state_cig_data. * dscribe data. desc Contains data from state_cig_data.dta obs: 1,2 vars: 7 28 Aug 28 15:39 size: 29,58 (99.8% of memory free) - storage display value variable name type format label variable label - state str2 %9s 2-digit state code year int %8.g year state_tax float %9.g state tax in cents per pack retail_price float %9.g average retail price, nominal federal_tax byte %8.g federal tax in cents per pack packs_pc float %9.g packs of cigarettes per capita cpi float %9.g consumer price index, 2=1. - 27 28 7

. * generate real taxes. gen real_tax=(state_tax+federal_tax)/cpi.. * get means of real tax and per capita consumption. sum packs_pc real_tax Generate real tax By dividing by CPI Variable Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- packs_pc 12 16.621 28.29377 31.9 245.4 real_tax 12 56.5339 18.45741 17.84456 145 ˆ ˆ ˆ xy y.6115(28.29377) 1.937 ˆ 18.45741 x. * get correlation coefficient. corr packs_pc real_tax (obs=12) packs_pc real_tax -------------+------------------ packs_pc 1. real_tax -.6115 1. Means Std deviations ˆ y x ˆ 16.6 (56.5)(.937) 159.1 1 Correlation coefficient 29 3 Run regression in stata SSM, SSE, SST R 2. * get regression estimate. reg packs_pc real_tax Source SS df MS Number of obs = 12 -------------+------------------------------ F( 1, 118) = 67.95 Model 351.398 1 351.398 Prob > F =. Residual 51736.995 118 51.76282 R-squared =.3739 -------------+------------------------------ Adj R-squared =.3733 Total 815747.392 119 8.537186 Root MSE = 22.399 Notice that SSE + SSM = SST R 2 = SSM/SST = 351.4/815747.4 =.3739 packs_pc Coef. Std. Err. t P> t [95% Conf. Interval] -------------+---------------------------------------------------------------- real_tax -.9373448.3816-24.66. -1.11944 -.862746 _cons 159.1434 2.243373 7.94. 154.7412 163.5456 β and β 1 31 32 8

Scatter Plot: Real taxes per pack vs. Per capita cigarette consumption 51 states, 1981-2, 12 observations Scatter Plot: Real taxes per pack vs. Per capita cigarette consumption 51 states, 1981-2, 12 observations 3 3 25 25 Per capita packs per year 2 15 1 Per capita packs per year 2 15 1 5 5 2 4 6 8 1 12 14 16 Real tax per pack (State + Federal, 2 $) 2 4 6 8 1 12 14 16 Real tax per pack (State + Federal, 2 $) 33 34 Using the results. * output residuals. predict error_packs_pc, residuals Output residuals ˆ 1.9373 For every penny increase in taxes, per capita consumption falls by.94 packs per year A 35 cent increase in taxes will reduce consumption by (35)(.94) = 32 packs per person per year.. * show that the means of the errors are zero. sum error_packs_pc Variable Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- error_pack~c 12 9.24e-9 22.38781-57.57952 121.775.. * get correlation between x and error. corr error_packs_pc real_tax (obs=12) error_~c real_tax -------------+------------------ error_pack~c 1. real_tax -. 1. Predicted error Has zero mean Predicted error not Correlated with x 35 36 9

Example 2 Do better performing teams have higher attendance? Data on wins and average attendance/game for 24 baseball season 3 observations attendance.dta - storage display value variable name type format label variable label - team str13 %13s team city attendance long %12.g avg attendance per game wins int %8.g wins during year -. * get means of wins and payroll. sum wins attendance Variable Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- wins 3 8.83333 14.75334 55 13 attendance 3 28157.3 9317.7 131 43712 37 38. * get detailed data for attendance. sum attendance, detail avg attendance per game ------------------------------------------------------------- Percentiles Smallest 1% 131 131 5% 138 138 1% 15169.5 13157 Obs 3 25% 273 17182 Sum of Wgt. 3 5% 28934.5 Mean 28157.3 Largest Std. Dev. 9317.7 75% 34527 39494 9% 39828.5 4163 Variance 8.68e+7 95% 43323 43323 Skewness -.243352 99% 43712 43712 Kurtosis 2.256339 39. * run simple regression. reg attendance wins Source SS df MS Number of obs = 3 -------------+------------------------------ F( 1, 28) = 8.89 Model 6678457 1 6678457 Prob > F =.59 Residual 1.911e+9 28 6824936.1 R-squared =.241 -------------+------------------------------ Adj R-squared =.2139 Total 2.5178e+9 29 86819537.6 Root MSE = 8261.3 attendance Coef. Std. Err. t P> t [95% Conf. Interval] -------------+---------------------------------------------------------------- wins 31.473 13.9825 2.98.6 97.4894 523.457 _cons 395.14 8539.57.36.72-14397.25 2587.53 ˆ 1 31.5... for every addition win, attendance increase by 31 / game. An addition1 wins put 31.5 more people in seats 4 1

. * output predicted attendance. predict pred_att, xb.. list team wins attendance pred_att +--------------------------------------------+ team wins attend~e pred_att -------------------------------------------- 1. Seattle 93 43712 31929.54 2. New York-AL 13 43323 353.2 3. Baltimore 67 33122 23868.31 4. Boston 93 32717 31929.54 5. Cleveland 74 3238 2638.64 -------------------------------------------- 6. Texas 72 2945 25418.55 7. Anaheim 99 28464 33789.83 8. Oakland 13 26788 353.2 9. Minnesota 94 23759 32239.59 1. Chicago--AL 81 273 2828.97 -------------------------------------------- Construct a new variable, The predicted value of Y Predicted attendance 5 4 3 2 1 Plot: Actual and Predicted Attendance 45 degree line Consider Seattle Y p = 395.1 + (31.5)(93) = 31929 41 1 2 3 4 5 Actual attendance 42 Example 3: Education and Earnings Stylized fact: log wages or earnings is linear in education (above a certain range) Interpreted as a return to education Theoretical models why this would be the case Linear model: y=ln(weekly wages) endogenous variable x=years of education exogenous factor y i = + x i 1 + i 43 Notice that 1 has a different interpretation 1 =dy/dx In this case, y=ln(wages) dln(wages)/dx = (1/wages)dWages/dX dwages/wages = % change in changes (change in wages over base wages) when the endogenous variable is a natural log, 1 =dy/dx is interpreted as % change in y for a unit change in x 44 11

Data cps87.dta 19,96 observations from 1987 Current Population Survey on Full time (>3 hours) Males Aged 21-64 % of sample 8% 7% 6% 5% 4% 3% 2% Weekly Earnings, Adult Males, 2 Census 1% 45 % 1 2 3 4 5 6 7 8 9 1111213141516171825 Weekly Earnings 46 6% ln(wekly Earnings), Adult Males, 2 Census Average ln(weekly Wages) vs Educ % of Sample 5% 4% 3% 2% 1% Average ln(weekly Wages) 6.8 6.6 6.4 6.2 6. 5.8 5.6 5.4 5.2 5. 2 4 6 8 1 12 14 16 18 2 Years of Education % 4.15 4.35 4.55 4.75 4.95 5.15 5.35 5.55 5.75 5.95 6.15 6.35 6.55 6.75 6.95 7.15 7.35 ln(weekly Earnings) 47 48 12

ln(weekly Wages) ln(weekly Wages) vs. Education 7.5 7. 6.5 6. 5.5 5. 4.5 4. 2 4 6 8 1 12 14 16 18 2 Years of Education. * construct ln weekly earnings. gen ln_weekly_earn=ln(weekly_earn).. * get descriptive information. sum weekly_earn ln_weekly_earn years_educ Generate ln(weekly earnings) Variable Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- weekly_earn 1996 488.264 236.4713 6 999 ln_weekly_~n 1996 6.6737.51347 4.94345 6.96755 years_educ 1996 13.16126 2.795234 18 Means of the variables 49 5. * run simple regression. reg ln_weekly_earn years_educ Source SS df MS Number of obs = 1996 -------------+------------------------------ F( 1, 1994) = 3877.62 Model 854.2855 1 854.2855 Prob > F =. Residual 4385.5814 1994.2231397 R-squared =.1631 -------------+------------------------------ Adj R-squared =.163 Total 5239.33869 1995.263217216 Root MSE =.46937 ln_weekly_~n Coef. Std. Err. t P> t [95% Conf. Interval] -------------+---------------------------------------------------------------- years_educ.741141.1192 62.27..717813.76447 _cons 5.91872.16138 317.97. 5.6484 5.123261 Example 4:London Olympics * generate measure of medals per person; * divide by 1,, people; gen medals_capita=medals/(population/1); label var medals_capita "medals per 1,, people"; * take natural ln of medals_capita and gdp_capita; gen ln_medals_capita=ln(medals_capita); gen ln_gdp_capita=ln(gdp_capita); * regress ln(medals/population) on ln(gdp/population); reg ln_medals_capita ln_gdp_capita; 51 52 13

Example 4: London Olympics. * regress ln(medals/population) on ln(gdp/population);. reg ln_medals_capita ln_gdp_capita; Source SS df MS Number of obs = 85 -------------+------------------------------ F( 1, 83) = 17.93 Model 31.8688335 1 31.8688335 Prob > =.1 F Residual 147.537169 83 1.77755625 R-squared =.1776 -------------+------------------------------ Adj R-squared =.1677 Total 179.462 84 2.13578574 Root MSE = 1.3333 ln_medals_~a Coef. Std. Err. t P> t [95% Conf. Interval] -------------+---------------------------------------------------------------- ln_gdp_cap~a.472596.111622 4.23..2493614.6911579 _cons -3.111996 1.4655-2.97.4-5.193543-1.345 ln medals per 1 million people -4-2 2 4 ln(medals/population) vs ln(gdp/population) GREN JAM T&TBAH NZEL MONG MONT HUN SLOVE GEO LITH CROA EST DEN AUS CUB BEL AZE CYPGBIRE NETH QUA ARM LATV CZE KAZ SLOVA BAH SWE TAI NOR MOLD SERB GABRUS SKOR PR FRECAN GER FIN UKR BOTS ROM ITA SWI ESP USA SING TUN JAP NKOR KEN BULG POL BEL KUW DREP IRAN COL GRE TAJ UZE SAFR HK ARG ETH BRA POR GUAT CHI MALA MEX TUR AFGH THAI MORO VEN SAUD UGA EGY ALG IND ln of medals per 1,, Fitted values 53 54 INDO 6 8 1 12 ln of gdp per capita ln medals per 1 million people -4-2 2 4 ETH ln(medals/population) vs ln(gdp/population) NKOR IND EGY INDO JAM CUB CHI GREN MEX VEN SKOR SAUD AUS GB QUA NOR CAN USA 6 8 1 12 ln of gdp per capita ln of medals per 1,, Fitted values 55 14