The Multivariate Regression Model
|
|
- Baldric Jones
- 6 years ago
- Views:
Transcription
1 The Multivariate Regression Model Example Determinants of College GPA Sample of 4 Freshman Collect data on College GPA (4.0 scale) Look at importance of ACT Consider the following model CGPA ACT i 0 i i ACT 4 tests English/math/reading/science reasoning Composite scores from -36 Average score in 000 was Movement from to represents 7 percentage points in the distribution (56 th to 63th percentile) 3 College GPA Scatter Plot: ACT Score and College GPA ACT 4
2 College GPA Scatter Plot: ACT Score and College GPA. *run regression with one variable. reg college_gpa act Source SS df MS Number of obs = F(, 39) = 6. Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = college_gpa Coef. Std. Err. t P> t [95% Conf. Interval] act _cons Interpret the result: ACT 5 6 Is this an accurate estimate of (CGPA)/ (ACT)? ACT is but one measure of ability Noisy measure at best Are there other measures available? Consider another model (Think of this as the true model) CGPA ACT HSGPA i 0 i i i College GPA Scatter Plot: HS GPA and College GPA HS GPA 8
3 Scatter Plot: HS GPA and College GPA Scatter Plot: HS GPA and College GPA College GPA 3.0 College GPA HS GPA HS GPA 9 0 *run synthetic regression of hs_gpa on act reg hs_gpa act. * get correlations between key variables. corr college_gpa act hs_gpa (obs=4) colleg~a act hs_gpa college_gpa.0000 act hs_gpa Source SS df MS Number of obs = F(, 39) = 8.88 Model Prob > F = Residual R-squared = Adj R-squared = 0.3 Total Root MSE = hs_gpa Coef. Std. Err. t P> t [95% Conf. Interval] act _cons
4 , x (7) E[ ] ˆ x i 0 i i ˆ n i ( x x )( x x ) i i n i ( x x ) i we anticipate that 0 and we have shown that ˆ 0 E[ ˆ ] then E[ ] On average, the value we estimated in the False model will be greater than the one in the true model 3 4. * run multivariate regression. reg college_gpa act hs_gpa Source SS df MS Number of obs = F(, 38) = 4.78 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE =.3403 college_gpa Coef. Std. Err. t P> t [95% Conf. Interval] act hs_gpa _cons The coefficient on ACT in the false model was The coefficient in the True model is the coefficient falls by 77% Example : Class Size and Performance Data from 40 schools in CA Outcome is average on state test for reading and math in 6 th grade Average scores around 650 for state Key covariate: student/teacher ratio SCORE STR i 0 i i 5 6 4
5 Scatter Plot: Student Teacher Ratio vs. Average Test Scores Scatter Plot: Student Teacher Ratio vs. Average Test Scores Average Test Score Average Test Score Student-Teacher ratio Student-Teacher ratio 7 8. * run regression with one variable. reg average_score student_teacher Source SS df MS Number of obs = F(, 48) =.58 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = 8.58 average_sc~e Coef. Std. Err. t P> t [95% Conf. Interval] student_te~r _cons A one student increase in class size will reduce average Scores by.8 points Omitted variables Class size is but one covariate we could add Consider others that might be correlated with X that are omitted from model Example: % ESL These students tend to score lower on tests If they are also more or less likely to be in more crowded schools, then results could be biased Increasing a class size by 5 will reduce average scores By 5(.8)=.4 which is.4/654=.07 or by.7% 9 0 5
6 SCORE STR ESL i 0 i i i Think of this as the true model, E[ ] ˆ x x i 0 i i (7) ˆ n i ( x x )( x x ) i i n i ( x x ) i Scatter Plot: % ESL vs. Average Test Scores Scatter Plot: % ESL vs. Average Test Scores Average Test Score Average Test Score % ESL % ESL 3 4 6
7 Scatter Plot: Student Teacher Ratio vs. % ESL Scatter Plot: Student Teacher Ratio vs. % ESL % ESL Student-Teacher ratio 5 % ESL Student-Teacher ratio 6 averag~e studen~r esl_pct average_sc~e.0000 student_te~r esl_pct and we have shown that ˆ 0 E[ ] ˆ then E[ ] we anticipate that 0 On average, the value we estimated in the False model will be smaller than the one in the true model 7 8 7
8 Think of the prediction this way ˆ E[ ] ˆ β 0 ( ) ( )( ) 9 In the single variable model, the Student/teacher ratio is picking up two effects Larger class sizes reduce performance ESL students are more likely to be in more crowded schools, and they that tend to have lower scores Therefore, the model without ESL will estimate a too large of a negative number 30. * run multivariate regression. reg average_score student_teacher esl_pct Source SS df MS Number of obs = F(, 47) = 55.0 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = average_sc~e Coef. Std. Err. t P> t [95% Conf. Interval] student_te~r esl_pct _cons student increase in class size reduces test scores by 5(.) = 5.5 which is 5.5/654= or.8% -- half the Estimate impact as before A one percentage point increase in % ESL in school Will reduce average scores by.64 points 3. * demonstrate the partialing out. * nature of mv regressions.. * run a regression of STR on ESL. * output the residuals. reg student_teacher esl Source SS df MS Number of obs = F(, 48) = 5.5 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE =.8604 student_te~r Coef. Std. Err. t P> t [95% Conf. Interval] esl_pct _cons * output residuals. predict res_str, residual 3 8
9 . * run a regression of test scores. * on the student_teacher residuals. reg average_score res_str Source SS df MS Number of obs = F(, 48) = 4. Model Prob > F = 0.0 Residual R-squared = Adj R-squared = 0.00 Total Root MSE = average_sc~e Coef. Std. Err. t P> t [95% Conf. Interva res_str _cons Exact same number as before 33 9
Notes on PS2
17.871 - Notes on PS2 Mike Sances MIT April 2, 2012 Mike Sances (MIT) 17.871 - Notes on PS2 April 2, 2012 1 / 9 Interpreting Regression: Coecient regress success_rate dist Source SS df MS Number of obs
More informationECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2011
ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2011 Instructions: Answer all five (5) questions. Point totals for each question are given in parentheses. The parts within each
More informationSoci Statistics for Sociologists
University of North Carolina Chapel Hill Soci708-001 Statistics for Sociologists Fall 2009 Professor François Nielsen Stata Commands for Module 11 Multiple Regression For further information on any command
More information* STATA.OUTPUT -- Chapter 5
* STATA.OUTPUT -- Chapter 5.*bwt/confounder example.infile bwt smk gest using bwt.data.correlate (obs=754) bwt smk gest -------------+----- bwt 1.0000 smk -0.1381 1.0000 gest 0.3629 0.0000 1.0000.regress
More informationLecture 2a: Model building I
Epidemiology/Biostats VHM 812/802 Course Winter 2015, Atlantic Veterinary College, PEI Javier Sanchez Lecture 2a: Model building I Index Page Predictors (X variables)...2 Categorical predictors...2 Indicator
More information17.871: PS3 Key. Part I
17.871: PS3 Key Part I. use "cces12.dta", clear. reg CC424 CC334A [aweight=v103] if CC334A!= 8 & CC424 < 6 // Need to remove values that do not fit on the linear scale. This entails discarding all respondents
More informationSOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian Two Wave Panel Data Analysis
SOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian Two Wave Panel Data Analysis In any longitudinal analysis, we can distinguish between analyzing trends vs individual change that is, model
More informationWeek 11: Collinearity
Week 11: Collinearity Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Regression and holding other
More informationPSC 508. Jim Battista. Dummies. Univ. at Buffalo, SUNY. Jim Battista PSC 508
PSC 508 Jim Battista Univ. at Buffalo, SUNY Dummies Dummy variables Sometimes we want to include categorical variables in our models Numerical variables that don t necessarily have any inherent order and
More informationEco311, Final Exam, Fall 2017 Prof. Bill Even. Your Name (Please print) Directions. Each question is worth 4 points unless indicated otherwise.
Your Name (Please print) Directions Each question is worth 4 points unless indicated otherwise. Place all answers in the space provided below or within each question. Round all numerical answers to the
More informationECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2014
ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics May 2014 Instructions: Answer all five (5) questions. Point totals for each question are given in parentheses. The parts within each
More informationCompartmental Pharmacokinetic Analysis. Dr Julie Simpson
Compartmental Pharmacokinetic Analysis Dr Julie Simpson Email: julieas@unimelb.edu.au BACKGROUND Describes how the drug concentration changes over time using physiological parameters. Gut compartment Absorption,
More informationMidterm Exam. Friday the 29th of October, 2010
Midterm Exam Friday the 29th of October, 2010 Name: General Comments: This exam is closed book. However, you may use two pages, front and back, of notes and formulas. Write your answers on the exam sheets.
More informationBios 312 Midterm: Appendix of Results March 1, Race of mother: Coded as 0==black, 1==Asian, 2==White. . table race white
Appendix. Use these results to answer 2012 Midterm questions Dataset Description Data on 526 infants with very low (
More informationApplied Econometrics
Applied Econometrics Lecture 3 Nathaniel Higgins ERS and JHU 20 September 2010 Outline of today s lecture Schedule and Due Dates Making OLS make sense Uncorrelated X s Correlated X s Omitted variable bias
More informationYou can find the consultant s raw data here:
Problem Set 1 Econ 475 Spring 2014 Arik Levinson, Georgetown University 1 [Travel Cost] A US city with a vibrant tourist industry has an industrial accident (a spill ) The mayor wants to sue the company
More informationCHECKING INFLUENCE DIAGNOSTICS IN THE OCCUPATIONAL PRESTIGE DATA
PLS 802 Spring 2018 Professor Jacoby CHECKING INFLUENCE DIAGNOSTICS IN THE OCCUPATIONAL PRESTIGE DATA This handout shows the log from a Stata session that examines the Duncan Occupational Prestige data
More informationExample Analysis with STATA
Example Analysis with STATA Exploratory Data Analysis Means and Variance by Time and Group Correlation Individual Series Derived Variable Analysis Fitting a Line to Each Subject Summarizing Slopes by Group
More informationExample Analysis with STATA
Example Analysis with STATA Exploratory Data Analysis Means and Variance by Time and Group Correlation Individual Series Derived Variable Analysis Fitting a Line to Each Subject Summarizing Slopes by Group
More informationWeek 10: Heteroskedasticity
Week 10: Heteroskedasticity Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline The problem of (conditional)
More informationFlorida. Difference-in-Difference Models 8/23/2016
Florida Difference-in-Difference Models Bill Evans Health Economics 8/25/1997, State of Florida settles out of court in their suits against tobacco manufacturers Awarded $13 billion over 25 years Use $200m
More informationApplication: Effects of Job Training Program (Data are the Dehejia and Wahba (1999) version of Lalonde (1986).)
Application: Effects of Job Training Program (Data are the Dehejia and Wahba (1999) version of Lalonde (1986).) There are two data sets; each as the same treatment group of 185 men. JTRAIN2 includes 260
More informationInterpreting and Visualizing Regression models with Stata Margins and Marginsplot. Boriana Pratt May 2017
Interpreting and Visualizing Regression models with Stata Margins and Marginsplot Boriana Pratt May 2017 Interpreting regression models Often regression results are presented in a table format, which makes
More informationROBUST ESTIMATION OF STANDARD ERRORS
ROBUST ESTIMATION OF STANDARD ERRORS -- log: Z:\LDA\DataLDA\sitka_Lab8.log log type: text opened on: 18 Feb 2004, 11:29:17. ****The observed mean responses in each of the 4 chambers; for 1988 and 1989.
More informationThe Effect of Occupational Danger on Individuals Wage Rates. A fundamental problem confronting the implementation of many healthcare
The Effect of Occupational Danger on Individuals Wage Rates Jonathan Lee Econ 170-001 Spring 2003 PID: 703969503 A fundamental problem confronting the implementation of many healthcare policies is the
More informationPREDICTIVE MODEL OF TOTAL INCOME FROM SALARIES/WAGES IN THE CONTEXT OF PASAY CITY
Page22 PREDICTIVE MODEL OF TOTAL INCOME FROM SALARIES/WAGES IN THE CONTEXT OF PASAY CITY Wilson Cordova wilson.cordova@cksc.edu.ph Chiang Kai Shek College, Philippines Abstract There are varied sources
More informationİnsan Tunalı November 29, 2018 Econ 511: Econometrics I. ANSWERS TO ASSIGNMENT 10: Part II STATA Supplement
İnsan Tunalı November 29, 2018 Econ 511: Econometrics I STATA Exercise 1 ANSWERS TO ASSIGNMENT 10: Part II STATA Supplement TASK 1: --- name: log: g:\econ511\heter_housinglog log type: text opened
More informationThe study obtains the following results: Homework #2 Basics of Logistic Regression Page 1. . version 13.1
Soc 73994, Homework #2: Basics of Logistic Regression Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 14, 2018 All answers should be typed and mailed to
More informationGroup Comparisons: Using What If Scenarios to Decompose Differences Across Groups
Group Comparisons: Using What If Scenarios to Decompose Differences Across Groups Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 15, 2015 We saw that the
More informationTrunkierte Regression: simulierte Daten
Trunkierte Regression: simulierte Daten * Datengenerierung set seed 26091952 set obs 48 obs was 0, now 48 gen age=_n+17 gen yhat=2000+200*(age-18) gen wage = yhat + 2000*invnorm(uniform()) replace wage=max(0,wage)
More informationlog: F:\stata_parthenope_01.smcl opened on: 17 Mar 2012, 18:21:56
log: F:\stata_parthenope_01.smcl opened on: 17 Mar 2012, 18:21:56 (20 cities >100k pop). de obs: 20 20 cities >100k pop vars: 13 size: 1,040 storage display value variable name type format label variable
More information. *increase the memory or there will problems. set memory 40m (40960k)
Exploratory Data Analysis on the Correlation Structure In longitudinal data analysis (and multi-level data analysis) we model two key components of the data: 1. Mean structure. Correlation structure (after
More informationUnit 6: Simple Linear Regression Lecture 2: Outliers and inference
Unit 6: Simple Linear Regression Lecture 2: Outliers and inference Statistics 101 Thomas Leininger June 18, 2013 Types of outliers in linear regression Types of outliers How do(es) the outlier(s) influence
More informationfor var trstprl trstlgl trstplc trstplt trstep: reg X trust10 stfeco yrbrn hinctnt edulvl pltcare polint wrkprty
for var trstprl trstlgl trstplc trstplt trstep: reg X trust10 stfeco yrbrn hinctnt edulvl pltcare polint wrkprty -> reg trstprl trust10 stfeco yrbrn hinctnt edulvl pltcare polint wrkprty Source SS df MS
More information(LDA lecture 4/15/08: Transition model for binary data. -- TL)
(LDA lecture 4/5/08: Transition model for binary data -- TL) (updated 4/24/2008) log: G:\public_html\courses\LDA2008\Data\CTQ2log log type: text opened on: 5 Apr 2008, 2:27:54 *** read in data ******************************************************
More informationSurvey commands in STATA
Survey commands in STATA Carlo Azzarri DECRG Sample survey: Albania 2005 LSMS 4 strata (Central, Coastal, Mountain, Tirana) 455 Primary Sampling Units (PSU) 8 HHs by PSU * 455 = 3,640 HHs svy command:
More informationSociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian. Preliminary Data Screening
r's age when 1st child born 2 4 6 Density.2.4.6.8 Density.5.1 Sociology 774: Regression Models for Categorical Data Instructor: Natasha Sarkisian Preliminary Data Screening A. Examining Univariate Normality
More informationA.O. Baranov, V.N. Pavlov, Yu.M. Slepenkova
25 th INFORUM World Conference Riga, Latvia, 28 August 1 September 2017 Construction of the Dynamic Input Output Model of Russian Economy with a Human Capital Block and Problems of Its Information Support
More informationECON Introductory Econometrics Seminar 9
ECON4150 - Introductory Econometrics Seminar 9 Stock and Watson EE13.1 May 4, 2015 Stock and Watson EE13.1 ECON4150 - Introductory Econometrics Seminar 9 May 4, 2015 1 / 18 Empirical exercise E13.1: Data
More informationFoley Retreat Research Methods Workshop: Introduction to Hierarchical Modeling
Foley Retreat Research Methods Workshop: Introduction to Hierarchical Modeling Amber Barnato MD MPH MS University of Pittsburgh Scott Halpern MD PhD University of Pennsylvania Learning objectives 1. List
More informationMilk Data Analysis. 1. Objective: analyzing protein milk data using STATA.
1. Objective: analyzing protein milk data using STATA. 2. Dataset: Protein milk data set (in the class website) Data description: Percentage protein content of milk samples at weekly intervals from each
More informationNumber of obs = R-squared = Root MSE = Adj R-squared =
Appendix for the details of statistical test results Statistical Package used:stata/se 11.1 1. ANOVA result with dependent variable: current level of happiness, independent variables: sexs, ages, and survey
More informationThis example demonstrates the use of the Stata 11.1 sgmediation command with survey correction and a subpopulation indicator.
Analysis Example-Stata 11.0 sgmediation Command with Survey Data Correction March 25, 2011 This example demonstrates the use of the Stata 11.1 sgmediation command with survey correction and a subpopulation
More informationSUGGESTED SOLUTIONS Winter Problem Set #1: The results are attached below.
450-2 Winter 2008 Problem Set #1: SUGGESTED SOLUTIONS The results are attached below. 1. The balanced panel contains larger firms (sales 120-130% bigger than the full sample on average), which are more
More informationInteractions made easy
Interactions made easy André Charlett Neville Q Verlander Health Protection Agency Centre for Infections Motivation Scientific staff within institute using Stata to fit many types of regression models
More informationChapter 5 Regression
Chapter 5 Regression Topics to be covered in this chapter: Regression Fitted Line Plots Residual Plots Regression The scatterplot below shows that there is a linear relationship between the percent x of
More informationBiostatistics 208. Lecture 1: Overview & Linear Regression Intro.
Biostatistics 208 Lecture 1: Overview & Linear Regression Intro. Steve Shiboski Division of Biostatistics, UCSF January 8, 2019 1 Organization Office hours by appointment (Mission Hall 2540) E-mail to
More informationChecking the model. Linearity. Normality. Constant variance. Influential points. Covariate overlap
Checking the model Linearity Normality Constant variance Influential points Covariate overlap 1 Checking the model: linearity Average value of outcome initially assumed to be linear function of continuous
More information(February draft)
For an International NGO Background statistics, cross tabs, summaries, graphs, t-tests and regression analysis for Nepal response survey data (February 2017 - draft) Contents Confidence level/statistical
More informationGuideline on evaluating the impact of policies -Quantitative approach-
Guideline on evaluating the impact of policies -Quantitative approach- 1 2 3 1 The term treatment derives from the medical sciences and has more meaning when is used in that context. However, this term
More informationWhy Are Electricity Prices in RTOs Increasingly Expensive?
ROBERT F. MCCULLOUGH, JR. MANAGING PARTNER Date: To: From: Subject: McCullough Research Clients Robert McCullough Heidi Schramm Why Are Electricity Prices in RTOs Increasingly Expensive? For the last two
More informationExploring Functional Forms: NBA Shots. NBA Shots 2011: Success v. Distance. . bcuse nbashots11
NBA Shots 2011: Success v. Distance. bcuse nbashots11 Contains data from http://fmwww.bc.edu/ec-p/data/wooldridge/nbashots11.dta obs: 199,119 vars: 15 25 Oct 2012 09:08 size: 24,690,756 ------------- storage
More informationCategorical Data Analysis
Categorical Data Analysis Hsueh-Sheng Wu Center for Family and Demographic Research October 4, 200 Outline What are categorical variables? When do we need categorical data analysis? Some methods for categorical
More informationComputer Handout Two
Computer Handout Two /******* senic2.sas ***********/ %include 'senicdef.sas'; /* Effectively, Copy the file senicdef.sas to here */ title2 'Elementary statistical tests'; proc freq; title3 'Use proc freq
More informationThe Multivariate Dustbin
UCLA Statistical Consulting Group (Ret.) Stata Conference Baltimore - July 28, 2017 Back in graduate school... My advisor told me that the future of data analysis was multivariate. By multivariate he meant...
More informationUnit 2 Regression and Correlation 2 of 2 - Practice Problems SOLUTIONS Stata Users
Unit 2 Regression and Correlation 2 of 2 - Practice Problems SOLUTIONS Stata Users Data Set for this Assignment: Download from the course website: Stata Users: framingham_1000.dta Source: Levy (1999) National
More informationCOMPARING MODEL ESTIMATES: THE LINEAR PROBABILITY MODEL AND LOGISTIC REGRESSION
PLS 802 Spring 2018 Professor Jacoby COMPARING MODEL ESTIMATES: THE LINEAR PROBABILITY MODEL AND LOGISTIC REGRESSION This handout shows the log of a STATA session that compares alternative estimates of
More informationECON Introductory Econometrics Seminar 6
ECON4150 - Introductory Econometrics Seminar 6 Stock and Watson EE10.1 April 28, 2015 Stock and Watson EE10.1 ECON4150 - Introductory Econometrics Seminar 6 April 28, 2015 1 / 21 Guns data set Some U.S.
More informationThe Servitization of Manufacturing: A Longitudinal Study of Global Trends. Professor Andy Neely Director, Cambridge Service Alliance
The Servitization of Manufacturing: A Longitudinal Study of Global Trends Professor Andy Neely Director, Cambridge Service Alliance The world of manufacturing is changing The shift to service based competitive
More informationUnit 5 Logistic Regression Homework #7 Practice Problems. SOLUTIONS Stata version
Unit 5 Logistic Regression Homework #7 Practice Problems SOLUTIONS Stata version Before You Begin Download STATA data set illeetvilaine.dta from the course website page, ASSIGNMENTS (Homeworks and Exams)
More informationIntroduction of STATA
Introduction of STATA News: There is an introductory course on STATA offered by CIS Description: Intro to STATA On Tue, Feb 13th from 4:00pm to 5:30pm in CIT 269 Seats left: 4 Windows, 7 Macintosh For
More informationChapter 2. Linear model. Put some concreteness on problem. The Bivariate Regression Model. Sample of n observations, labeled as i=1,2,..
Linear model Chapter 2 The Bivariate Regression Model Sample of n observations, labeled as i=1,2,..n y i = + x i 1 + i and 1 are population values represent the true relationship between x and y Unfortunately
More informationExamples of Using Stata v11.0 with JRR replicate weights Provided in the NHANES data set
Examples of Using Stata v110 with JRR replicate weights Provided in the NHANES 1999-2000 data set This document is designed to illustrate comparisons of methods to use JRR replicate weights sometimes provided
More informationProblem Points Score USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT
STAT 512 EXAM I STAT 512 Name (7 pts) Problem Points Score 1 40 2 25 3 28 USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT WRITE LEGIBLY. ANYTHING UNREADABLE WILL NOT BE GRADED GOOD LUCK!!!!
More informationMultilevel/ Mixed Effects Models: A Brief Overview
Multilevel/ Mixed Effects Models: A Brief Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised March 27, 2018 These notes borrow very heavily, often/usually
More informationStata Program Notes Biostatistics: A Guide to Design, Analysis, and Discovery Second Edition Chapter 12: Analysis of Variance
Stata Program Notes Biostatistics: A Guide to Design, Analysis, and Discovery Second Edition Chapter 12: Analysis of Variance Program Note 12.1 - One-Way ANOVA and Multiple Comparisons The Stata command
More informationStata v 12 Illustration. One Way Analysis of Variance
Stata v 12 Illustration Page 1. Preliminary Download anovaplot.. 2. Descriptives Graphs. 3. Descriptives Numerical 4. Assessment of Normality.. 5. Analysis of Variance Model Estimation.. 6. Tests of Equality
More informationFinal Exam Spring Bread-and-Butter Edition
Final Exam Spring 1996 Bread-and-Butter Edition An advantage of the general linear model approach or the neoclassical approach used in Judd & McClelland (1989) is the ability to generate and test complex
More informationTable. XTMIXED Procedure in STATA with Output Systolic Blood Pressure, use "k:mydirectory,
Table XTMIXED Procedure in STATA with Output Systolic Blood Pressure, 2001. use "k:mydirectory,. xtmixed sbp nage20 nage30 nage40 nage50 nage70 nage80 nage90 winter male dept2 edu_bachelor median_household_income
More informationSHIPPING COST, SPOT RATE AND EBAY AUCTIONS: A STRUCTURED EQUATION MODELING APPROACH
SHIPPING COST, SPOT RATE AND EBAY AUCTIONS: A STRUCTURED EQUATION MODELING APPROACH Ossama Elhadary, DBA City University of New York Abstract In this paper, the author used a Structured Equation Modeling
More informationAnalyzing CHIS Data Using Stata
Analyzing CHIS Data Using Stata Christine Wells UCLA IDRE Statistical Consulting Group February 2014 Christine Wells Analyzing CHIS Data Using Stata 1/ 34 The variables bmi p: BMI povll2: Poverty level
More informationBUS105 Statistics. Tutor Marked Assignment. Total Marks: 45; Weightage: 15%
BUS105 Statistics Tutor Marked Assignment Total Marks: 45; Weightage: 15% Objectives a) Reinforcing your learning, at home and in class b) Identifying the topics that you have problems with so that your
More informationTabulate and plot measures of association after restricted cubic spline models
Tabulate and plot measures of association after restricted cubic spline models Nicola Orsini Institute of Environmental Medicine Karolinska Institutet 3 rd Nordic and Baltic countries Stata Users Group
More informationDo not turn over until you are told to do so by the Invigilator.
UNIVERSITY OF EAST ANGLIA School of Economcs Man Seres PG Examnaton 016-17 FINANCIAL ECONOMETRICS ECO-7009A Tme allowed: HOURS Answer ALL FOUR questons. Queston 1 carres a weght of 5%; queston carres 0%;
More informationElementary tests. proc ttest; title3 'Two-sample t-test: Does consumption depend on Damper Type?'; class damper; var dampin dampout diff ;
Elementary tests /********************** heat2.sas *****************************/ title2 'Standard elementary tests'; options pagesize=35; %include 'heatread.sas'; /* Basically the data step from heat1.sas
More informationBIOSTATS 640 Spring 2017 Stata v14 Unit 2: Regression & Correlation. Stata version 14
Stata version 14 Illustration Simple and Multiple Linear Regression February 2017 I- Simple Linear Regression.... 1. Introduction to Example... 2. Preliminaries: Descriptives.. 3. Model Fitting (Estimation)
More informationRegression Analysis I & II
Data for this session is available in Data Regression I & II Regression Analysis I & II Quantitative Methods for Business Skander Esseghaier 1 In this session, you will learn: How to read and interpret
More information(R) / / / / / / / / / / / / Statistics/Data Analysis
Series de Tiempo FE-UNAM Thursday September 20 14:47:14 2012 Page 1 (R) / / / / / / / / / / / / Statistics/Data Analysis User: Prof. Juan Francisco Islas{space -4} Project: UNIDAD II ----------- name:
More informationTiming Production Runs
Class 7 Categorical Factors with Two or More Levels 189 Timing Production Runs ProdTime.jmp An analysis has shown that the time required in minutes to complete a production run increases with the number
More informationTopics in Biostatistics Categorical Data Analysis and Logistic Regression, part 2. B. Rosner, 5/09/17
Topics in Biostatistics Categorical Data Analysis and Logistic Regression, part 2 B. Rosner, 5/09/17 1 Outline 1. Testing for effect modification in logistic regression analyses 2. Conditional logistic
More informationUsing Stata 11 & higher for Logistic Regression Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised March 28, 2015
Using Stata 11 & higher for Logistic Regression Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised March 28, 2015 NOTE: The routines spost13, lrdrop1, and extremes
More informationNon-linear Pricing of Paid Content Products
18 th Bled econference eintegration in Action Bled, Slovenia, June 6-8, 2005 Non-linear Pricing of Paid Content Products Florian Stahl University of St. Gallen, Switzerland mail@florian-stahl.com Fabian
More informationExperiment Outcome &Literature Review. Presented by Fang Liyu
Experiment Outcome &Literature Review Presented by Fang Liyu Experiment outcome 1. Data from JD Sample size: 1) Data contains 3325 products in 8 days 2) There are 2000-3000 missing values in each data
More informationThis is a quick-and-dirty example for some syntax and output from pscore and psmatch2.
This is a quick-and-dirty example for some syntax and output from pscore and psmatch2. It is critical that when you run your own analyses, you generate your own syntax. Both of these procedures have very
More informationSTAT 350 (Spring 2016) Homework 12 Online 1
STAT 350 (Spring 2016) Homework 12 Online 1 1. In simple linear regression, both the t and F tests can be used as model utility tests. 2. The sample correlation coefficient is a measure of the strength
More informationOutliers Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 7, 2016
Outliers Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 7, 206 These notes draw heavily from several sources, including Fox s Regression Diagnostics; Pindyck
More informationProbability Of Booking
Axis Title Web Social Analytics Air France Assignment 1 Spring 216 Shuhua Zhu Assignment 1 Question 1: CTR TCR NET REVEAVE. COSROA AVE. REV PROB COUNT 451 451 451 451 459 368 451 MAX 2.% 9.% $549,524 $1.
More informationPsych 5741/5751: Data Analysis University of Boulder Gary McClelland & Charles Judd
Second Mid-Term Exam Multiple Regression Question A: Public policy analysts are interested in understanding how and why individuals come to develop the opinions they do of various public policy issues.
More informationThe SAS System 1. RM-ANOVA analysis of sheep data assuming circularity 2
The SAS System 1 Obs no2 sheep time y 1 1 1 time1 2.197 2 1 1 time2 2.442 3 1 1 time3 2.542 4 1 1 time4 2.241 5 1 1 time5 1.960 6 1 1 time6 1.988 7 1 2 time1 1.932 8 1 2 time2 2.526 9 1 2 time3 2.526 10
More informationMultiple Imputation and Multiple Regression with SAS and IBM SPSS
Multiple Imputation and Multiple Regression with SAS and IBM SPSS See IntroQ Questionnaire for a description of the survey used to generate the data used here. *** Mult-Imput_M-Reg.sas ***; options pageno=min
More informationLongitudinal Data Analysis, p.12
Biostatistics 140624 2011 EXAM STATA LOG ( NEEDED TO ANSWER EXAM QUESTIONS) Multiple Linear Regression, p2 Longitudinal Data Analysis, p12 Multiple Logistic Regression, p20 Ordered Logistic Regression,
More informationLogistic Regression, Part III: Hypothesis Testing, Comparisons to OLS
Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 22, 2015 This handout steals heavily
More informationADVANCED ECONOMETRICS I
ADVANCED ECONOMETRICS I Practice Exercises (1/2) Instructor: Joaquim J. S. Ramalho E.mail: jjsro@iscte-iul.pt Personal Website: http://home.iscte-iul.pt/~jjsro Office: D5.10 Course Website: http://home.iscte-iul.pt/~jjsro/advancedeconometricsi.htm
More informationThe Servitization of Manufacturing: A Longitudinal Study of Global Trends. Professor Andy Neely Director, Cambridge Service Alliance
The Servitization of Manufacturing: A Longitudinal Study of Global Trends Professor Andy Neely Director, Cambridge Service Alliance The world of manufacturing is changing The shift to service based competitive
More informationEXPERIMENTAL INVESTIGATIONS ON FRICTION WELDING PROCESS FOR DISSIMILAR MATERIALS USING DESIGN OF EXPERIMENTS
137 Chapter 6 EXPERIMENTAL INVESTIGATIONS ON FRICTION WELDING PROCESS FOR DISSIMILAR MATERIALS USING DESIGN OF EXPERIMENTS 6.1 INTRODUCTION In the present section of research, three important aspects are
More informationPubHlth 640 Intermediate Biostatistics Unit 2 Regression and Correlation
PubHlth 640 Intermediate Biostatistics Unit 2 Regression and Correlation Multiple Linear Regression Software: Stata v 10.1 Human p53 and Breast Cancer Risk Source: Matthews et al. Parity Induced Protection
More informationAll analysis examples presented can be done in Stata 10.1 and are included in this chapter s output.
Chapter 9 Stata v10.1 Analysis Examples Syntax and Output General Notes on Stata 10.1 Given that this tool is used throughout the ASDA textbook this chapter includes only the syntax and output for the
More informationHomework 1: Who s Your Daddy? Is He Rich Like Me?
Homework 1: Who s Your Daddy? Is He Rich Like Me? 36-402, Spring 2015 Due at 11:59 pm on Tuesday, 20 January 2015 GENERAL INSTRUCTIONS: You may submit either (1) a single PDF, containing all your written
More informationWhat Factors Influence Seat Belt Usage Rates in the United States?: A Meta-analysis
University of Kentucky UKnowledge MPA/MPP Capstone Projects Martin School of Public Policy and Administration 2006 What Factors Influence Seat Belt Usage Rates in the United States?: A Meta-analysis Tiffany
More informationEFA in a CFA Framework
EFA in a CFA Framework 2012 San Diego Stata Conference Phil Ender UCLA Statistical Consulting Group Institute for Digital Research & Education July 26, 2012 Phil Ender EFA in a CFA Framework Disclaimer
More information