RESIDUAL FILES IN HLM OVERVIEW

Size: px
Start display at page:

Download "RESIDUAL FILES IN HLM OVERVIEW"

Transcription

1 HLML09_ RESIDUAL FILES IN HLM Ralph B. Taylor All materials copyright (c) by Ralph B. Taylor OVERVIEW Residual files in multilevel models where people are grouped into some type of cluster (neighborhoods, organizations, ect.) are useful for several general purposes including: looking at results for an individual or a group. For example, in an ANOVA model you might want to see how far above or below the grand mean (G00) various individual neighborhoods are scoring (EBINTRCP), or you might want to look at the predicted score in a model for an individual. The residual files contain diagnostic information about how well your model is performing. You want to look at some of these. This is analogous to looking at the structure of your residuals in an OLS regression model. Describing the relationship between different model parameters. For example, in a model with varying slopes, in which you also are trying to predict group means, you can look at the relationship between a neighborhood s (residual or predicted) score on the outcome, and its predicted slope for a variable. They also are useful models of variation over time, of course. Perhaps most of interest is using the level 2 residual file to capture predicted scores of individuals for whom you have repeated observations over time. HOW TO CREATE You create a residual file by issuing the command for one. This is under the basic settings menu. specifications. If you are doing a 3 level model, you can request level 1 and/or level 2 and/or 3 level residual files. With a 2 level model you can request level 1 and/or level 2 residual files. NOTE in box at bottom you need to NAME the residual file. Tip. To keep these straight, I always end my residual file name with blabla_r.sav. I know that every file ending in _r is a residual file. It is a good idea if blabla corresponds to the name of the hlm run itself HLML_RESIDUALS Page 1

2 Tip. If you are going to be generating both level 1 and level 2 residual files, suggest you develop a strategy for keeping these straight like blabla_l2r.sav and blabla_l1r.sav. By default it is going to save these as SPSS type files. WHAT IS IN A LEVEL 2 FILE FOR A GROUPING HLM PROBLEM [NOTE the specifics of this illustration are geared to a data file with respondents in 45 Philadelphia neighborhoods.] First, there is a row of data for each Level 2 unit. So if you have 45 neighborhoods you should have 45 records. THE VARIABLES IN THE LEVEL 2 RESIDUAL FILE Tip. You will not see all these variables calculated for every problem, depending on things like weights, whether you are running a generalized model, and things of that nature. Do not be alarmed if it tells you it cannot calculate certain things. It looks like if analyses are weighted, you cannot get some of these diagnostics. ID$ NJ ID number for the L-2 unit number of L-1 cases in corresponding L-2 unit Variation across Level 2 Units CHIPCT "expected values of the order statistics for a sample size J selected from a population that is distributed Chi square(v)" where v=number of random effects per unit (HLM v. 4 manual, p. 32) These are expected values for lack of fit, for the overall model, for each Jth unit. These can be tested against the chi square distribution with one degree of freedom. In other words if you have a significant (> = p <.05) chi square, it is telling you that the observed values did not match up with the expected values, across a range of parameters in the model. If you have a significant chi squared value, or maybe even two, do not be alarmed. If you had 100 level 2 groups in your model, just by chance how many of these chi squared statistics do you think would be significant? HLML_RESIDUALS Page 2

3 MDIST These are Mahalanobis distances (D squared).d 2 is used in a wide variety of contexts, for example, as a measure of psychometric distance in a multi-variable hyperspace, or as a measure of effect size when there are multiple variables of interest. 1 In HLM what it is doing for each varying parameter is going into each group; seeing how sizable the discrepancy or distance of the residuals in a multivariate space are relative to the variation (distances) of the predicted scores; 2 such that a larger score means a larger lack of fit, as with the chi squared; and adding up across all the random parameters. The question about how do you find the statistical significance of D 2 complicated according to some. See Hess et al. (2007) for background. Raudenbush & Bryk (Hierarchical Linear Models, 2 nd edition, 2002: 274), however, suggest that with sufficiently large samples at level 1, this statistic will have a chi squared distribution with Q+1 degrees of freedom when the data are normal where Q are the random effects in the model. So if you have just varying intercepts, and no varying slopes, Q+1 = 2. The question, of course, is what is sufficiently large. Tip. D 2 can be plotted, in a P-P or Q-Q plot, to see if it is relatively normal. It should be, especially if there are a lot of level 2 units. In short, with D 2 the individual variables tell you about specific groups that are not fitting the model well (discrepancies between observed scores on a random effect like the intercept, and the predicted scores on a random effect like the intercept, and about the overall distribution of the Level 2 units. Remaining (Residual) Variation Within Each Level 2 Unit In a level 2 residual file these next three measures are measures of within group remaining residual variation. LNTOTVAR is the "natural log of total standard deviation within each L-2 unit" on residual Y scores. (V 4 manual, 32). 1 Hess, M. R., Hogarty, K. Y., Ferron, J. M., & Kromrey, J. D. (2007). Interval Estimates of Multivariate Effect Sizes: Coverage and Interval Width Estimates Under Variance Heterogeneity and Nonnormality. Educational and Psychological Measurement, 67(1), Raudenbush & Bryk (2002: 274) HLML_RESIDUALS Page 3

4 Tip. These can be negative, suggesting standard deviations of less than one. OLSRSVAR If you do an OLS regression model for each group, and log the standard deviation of the residuals, you get OLSRSVAR: the "natural log of residual standard deviation within each unit based on its least squares regression." (v 4. manual, p. 33) In other words, the standard deviation of raw (unstandardized) OLS residuals in each level 2 unit, logged. If you did OLS, this is your measure of residual variance for each level 2 unit. MDRSVAR When HLM has fitted its best model, you can get the same residual variation, logged. This is MDRSVAR: the "natural logarithm of the residual standard deviation from the final fitted fixed effects model." (v 4. manual, p. 33) In other words, the log of the standard deviation of the residuals after HLM has fit its best model. Tip. MDRSVAR should be plotted in a Q-Q plot or a P-P plot to see if it is relatively normal (Raudenbush & Bryk 2002: ). You should do this. Background on P-P / Q-Q plots: if you want more background Hamilton, Chapter 1. 3 Tip. Raudenbush & Bryk (2002: 264) recommend looking at a stem and leaf plot of MDRSVAR. In their HSB data they found some schools where respondents were particularly homogeneous (see Figure 9.2 on p. 266). They suggest adding additional Level 1 variables to try and remove at least some of the residual heterogeneity. (264). Predicted and Residual Intercepts and Slopes The Example The following parameter come from an example problem completed with unweighted PHMC 2006 survey data for Philadelphia only. Files _01.hlm _01.txt _01_l2_r.sav hlm problem setup hlm output file level 2 (neighborhoods) residual file 3 Hamilton, L. C. (1992). Regression with graphics. Pacific Grove, Calif.: Brooks/Cole. HLML_RESIDUALS Page 4

5 _01_l1r.sav level 1 (individuals) residual file Outcome = distrust; improve (2 = not work together, 1 = yes work together) has varying slope Level-1 Model: Y = B0 + B1*(RESPAGE) + B2*(IMPROVE) + B3*(FEMALE_D) + B4*(HS_LESS) + B5*(BLACK_D) + B6*(LATINO_D) + B7*(LT2XPOV) + R Level-2 Model B0 = G00 + G01*(STATUS) + U0 B1 = G10 B2 = G20 + U2 B3 = G30 B4 = G40 B5 = G50 B6 = G60 B7 = G70 In this example there is one set of residuals of interest: the residuals of the intercept. There is another random effect of interest: the variations in the slope of the IMPROVE variable. You can see the value of each of these for each level 2 unit. EBINTRCP Empirical Bayes best estimate of the residual L-2 value. These are your estimates of U0J. These are the discrepancies between the predicted score for each neighborhood and the EB estimate of that score: (EB estimate of intercept predicted intercept) G00 Tip. You probably want to check and be sure that your level 2 residuals (EBINTRCP) are uncorrelated with the level 2 predictors already entered in your model. (B&R p. 269). This is analogous to looking in OLS to be sure that e does not correlate with X. We expect homoscedastic residuals dispersed randomly around zero. (p. 269). You also can see if any of the level 2 residuals correlate with a level 2 predictor not yet entered. ECINTRCP These are the same as above except that the grand mean (G00) has been added in, so this equals (EB estimate of intercept predicted intercept) OLINTRCP OLS estimates of L-2 residual value: U0J. Note these have an average of zero, as HLML_RESIDUALS Page 5

6 they must. Note these have an average of zero, as residuals must axiomatically. Note the standard deviation of EBINTRCP is probably smaller than the standard deviation of OLINTCRP; the estimates are somewhat shrunken. Tip. If you graph EBINTRCP against OLINTRCP you will find that the latter is smaller relative to the former. How much the EBs shrink is a function of the remaining reliability of the U0js are people in the group still agreeing a lot with each other about the remaining discrepancies from the predicted values. FVINTRCP. In the Level 2 file these are predicted group scores. Very very useful. Tip. Sort the file based on these predicted scores and think about what places are high and what are low, and the implications. In this model a higher predicted score means more distrust. Tip. Look at the scatterplot of FVINTRCP (x) with EBINTRCP (y). There should not be too much of a correlation between the two; positive and negative residuals should be equally likely to appear all along the various predicted values, more or less. The residuals may bulge more in some places than others, but ignore unless extreme. If you allowed your slopes to vary, as we did in this model, then you will get some interesting information. Here we allowed IMPROVE (1 = yes worked together with neighbors, 2 = no did not) to have varying impacts across neighborhoods. There are three parameters of interest. FVIMPROV. EBIMPROV. OLIMPROV. This is the average slope and corresponds to the value you see in the output, G20 =.356. This is each neighborhood s Empirical Bayes estimated deviation around from the average slope, each neighborhood s U2j. The variance of these slopes adds up to T22. This is the OLS estimate of the deviation between a neighborhood s slope and the average slope across all neighborhoods. Tip. It is not a bad idea to take a look at the OLS slope deviations compared to the EB slope deviations, to see how much they have shrunken, and to see, overall, what kinds of adjustments HLM has made here. HLML_RESIDUALS Page 6

7 ECIMPROV This is each neighborhood s predicted slope (G20 + U20) on improve Tip. Not a bad idea to take a look at the relationship between a neighborhood s predicted slope and it s predicted score. Useful information. Tip. Also extremely useful to look at each neighborhood s deviation from the grand intercept and deviation from the grand slope. Gives you a sense of how hard HLM is working! EBZAGE Empirical Bayes estimate of the discrepancy between the average slope for z scored age, and the slope in each L-2 unit. This is: U*1J Will be dramatically shrunken when compared to OLS estimate of same which is: OLZAGE OLS estimate of same: U1J HLML_RESIDUALS Page 7

Hierarchical Linear Modeling: A Primer 1 (Measures Within People) R. C. Gardner Department of Psychology

Hierarchical Linear Modeling: A Primer 1 (Measures Within People) R. C. Gardner Department of Psychology Hierarchical Linear Modeling: A Primer 1 (Measures Within People) R. C. Gardner Department of Psychology As noted previously, Hierarchical Linear Modeling (HLM) can be considered a particular instance

More information

Multilevel Modeling Tenko Raykov, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania

Multilevel Modeling Tenko Raykov, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania Multilevel Modeling Tenko Raykov, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania Multilevel Modeling Part 1 Introduction, Basic and Intermediate Modeling Issues Tenko Raykov Michigan

More information

Getting Started with HLM 5. For Windows

Getting Started with HLM 5. For Windows For Windows Updated: August 2012 Table of Contents Section 1: Overview... 3 1.1 About this Document... 3 1.2 Introduction to HLM... 3 1.3 Accessing HLM... 3 1.4 Getting Help with HLM... 3 Section 2: Accessing

More information

Latent Growth Curve Analysis. Daniel Boduszek Department of Behavioural and Social Sciences University of Huddersfield, UK

Latent Growth Curve Analysis. Daniel Boduszek Department of Behavioural and Social Sciences University of Huddersfield, UK Latent Growth Curve Analysis Daniel Boduszek Department of Behavioural and Social Sciences University of Huddersfield, UK d.boduszek@hud.ac.uk Outline Introduction to Latent Growth Analysis Amos procedure

More information

SCENARIO: We are interested in studying the relationship between the amount of corruption in a country and the quality of their economy.

SCENARIO: We are interested in studying the relationship between the amount of corruption in a country and the quality of their economy. Introduction to SPSS Center for Teaching, Research and Learning Research Support Group American University, Washington, D.C. Hurst Hall 203 rsg@american.edu (202) 885-3862 This workshop is designed to

More information

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney Name: Intro to Statistics for the Social Sciences Lab Session: Spring, 2015, Dr. Suzanne Delaney CID Number: _ Homework #22 You have been hired as a statistical consultant by Donald who is a used car dealer

More information

Using SPSS for Linear Regression

Using SPSS for Linear Regression Using SPSS for Linear Regression This tutorial will show you how to use SPSS version 12.0 to perform linear regression. You will use SPSS to determine the linear regression equation. This tutorial assumes

More information

Timing Production Runs

Timing Production Runs Class 7 Categorical Factors with Two or More Levels 189 Timing Production Runs ProdTime.jmp An analysis has shown that the time required in minutes to complete a production run increases with the number

More information

The SPSS Sample Problem To demonstrate these concepts, we will work the sample problem for logistic regression in SPSS Professional Statistics 7.5, pa

The SPSS Sample Problem To demonstrate these concepts, we will work the sample problem for logistic regression in SPSS Professional Statistics 7.5, pa The SPSS Sample Problem To demonstrate these concepts, we will work the sample problem for logistic regression in SPSS Professional Statistics 7.5, pages 37-64. The description of the problem can be found

More information

= = Name: Lab Session: CID Number: The database can be found on our class website: Donald s used car data

= = Name: Lab Session: CID Number: The database can be found on our class website: Donald s used car data Intro to Statistics for the Social Sciences Fall, 2017, Dr. Suzanne Delaney Extra Credit Assignment Instructions: You have been hired as a statistical consultant by Donald who is a used car dealer to help

More information

How to Get More Value from Your Survey Data

How to Get More Value from Your Survey Data Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................3

More information

STAT 2300: Unit 1 Learning Objectives Spring 2019

STAT 2300: Unit 1 Learning Objectives Spring 2019 STAT 2300: Unit 1 Learning Objectives Spring 2019 Unit tests are written to evaluate student comprehension, acquisition, and synthesis of these skills. The problems listed as Assigned MyStatLab Problems

More information

The Big Picture: Four Domains of HLM. Today s Discussion. 3 Background Concepts. 1) Nested Data. Introduction to Hierarchical Linear Modeling (HLM)

The Big Picture: Four Domains of HLM. Today s Discussion. 3 Background Concepts. 1) Nested Data. Introduction to Hierarchical Linear Modeling (HLM) Center for dvanced Study of Teaching CSTL and Learning Introduction to Hierarchical Linear Modeling (HLM) The Big Picture: Four Domains of HLM Part 1 Specifying Equations Part I: Concepts and Equations

More information

Multilevel Modeling and Cross-Cultural Research

Multilevel Modeling and Cross-Cultural Research 11 Multilevel Modeling and Cross-Cultural Research john b. nezlek Cross-cultural psychologists, and other scholars who are interested in the joint effects of cultural and individual-level constructs, often

More information

Appendix A Mixed-Effects Models 1. LONGITUDINAL HIERARCHICAL LINEAR MODELS

Appendix A Mixed-Effects Models 1. LONGITUDINAL HIERARCHICAL LINEAR MODELS Appendix A Mixed-Effects Models 1. LONGITUDINAL HIERARCHICAL LINEAR MODELS Hierarchical Linear Models (HLM) provide a flexible and powerful approach when studying response effects that vary by groups.

More information

JMP TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING

JMP TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING JMP TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING INTRODUCTION JMP software provides introductory statistics in a package designed to let students visually explore data in an interactive way with

More information

STATISTICS PART Instructor: Dr. Samir Safi Name:

STATISTICS PART Instructor: Dr. Samir Safi Name: STATISTICS PART Instructor: Dr. Samir Safi Name: ID Number: Question #1: (20 Points) For each of the situations described below, state the sample(s) type the statistical technique that you believe is the

More information

Using Mapmaker/QTL for QTL mapping

Using Mapmaker/QTL for QTL mapping Using Mapmaker/QTL for QTL mapping M. Maheswaran Tamil Nadu Agriculture University, Coimbatore Mapmaker/QTL overview A number of methods have been developed to map genes controlling quantitatively measured

More information

4.3 Nonparametric Tests cont...

4.3 Nonparametric Tests cont... Class #14 Wednesday 2 March 2011 What did we cover last time? Hypothesis Testing Types Student s t-test - practical equations Effective degrees of freedom Parametric Tests Chi squared test Kolmogorov-Smirnov

More information

Correlation and Simple. Linear Regression. Scenario. Defining Correlation

Correlation and Simple. Linear Regression. Scenario. Defining Correlation Linear Regression Scenario Let s imagine that we work in a real estate business and we re attempting to understand whether there s any association between the square footage of a house and it s final selling

More information

Chapter 3. Basic Statistical Concepts: II. Data Preparation and Screening. Overview. Data preparation. Data screening. Score reliability and validity

Chapter 3. Basic Statistical Concepts: II. Data Preparation and Screening. Overview. Data preparation. Data screening. Score reliability and validity Chapter 3 Basic Statistical Concepts: II. Data Preparation and Screening To repeat what others have said, requires education; to challenge it, requires brains. Overview Mary Pettibone Poole Data preparation

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Univariate Statistics Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved Table of Contents PAGE Creating a Data File...3 1. Creating

More information

Business Quantitative Analysis [QU1] Examination Blueprint

Business Quantitative Analysis [QU1] Examination Blueprint Business Quantitative Analysis [QU1] Examination Blueprint 2014-2015 Purpose The Business Quantitative Analysis [QU1] examination has been constructed using an examination blueprint. The blueprint, also

More information

Two Way ANOVA. Turkheimer PSYC 771. Page 1 Two-Way ANOVA

Two Way ANOVA. Turkheimer PSYC 771. Page 1 Two-Way ANOVA Page 1 Two Way ANOVA Two way ANOVA is conceptually like multiple regression, in that we are trying to simulateously assess the effects of more than one X variable on Y. But just as in One Way ANOVA, the

More information

Spreadsheets in Education (ejsie)

Spreadsheets in Education (ejsie) Spreadsheets in Education (ejsie) Volume 2, Issue 2 2005 Article 5 Forecasting with Excel: Suggestions for Managers Scott Nadler John F. Kros East Carolina University, nadlers@mail.ecu.edu East Carolina

More information

CHAPTER 5 RESULTS AND ANALYSIS

CHAPTER 5 RESULTS AND ANALYSIS CHAPTER 5 RESULTS AND ANALYSIS This chapter exhibits an extensive data analysis and the results of the statistical testing. Data analysis is done using factor analysis, regression analysis, reliability

More information

SPSS 14: quick guide

SPSS 14: quick guide SPSS 14: quick guide Edition 2, November 2007 If you would like this document in an alternative format please ask staff for help. On request we can provide documents with a different size and style of

More information

Tutorial #7: LC Segmentation with Ratings-based Conjoint Data

Tutorial #7: LC Segmentation with Ratings-based Conjoint Data Tutorial #7: LC Segmentation with Ratings-based Conjoint Data This tutorial shows how to use the Latent GOLD Choice program when the scale type of the dependent variable corresponds to a Rating as opposed

More information

AP Statistics Scope & Sequence

AP Statistics Scope & Sequence AP Statistics Scope & Sequence Grading Period Unit Title Learning Targets Throughout the School Year First Grading Period *Apply mathematics to problems in everyday life *Use a problem-solving model that

More information

Categorical Variables, Part 2

Categorical Variables, Part 2 Spring, 000 - - Categorical Variables, Part Project Analysis for Today First multiple regression Interpreting categorical predictors and their interactions in the first multiple regression model fit in

More information

Displaying Bivariate Numerical Data

Displaying Bivariate Numerical Data Price ($ 000's) OPIM 303, Managerial Statistics H Guy Williams, 2006 Displaying Bivariate Numerical Data 250.000 Price / Square Footage 200.000 150.000 100.000 50.000 - - 500 1,000 1,500 2,000 2,500 3,000

More information

CHAPTER 8 T Tests. A number of t tests are available, including: The One-Sample T Test The Paired-Samples Test The Independent-Samples T Test

CHAPTER 8 T Tests. A number of t tests are available, including: The One-Sample T Test The Paired-Samples Test The Independent-Samples T Test CHAPTER 8 T Tests A number of t tests are available, including: The One-Sample T Test The Paired-Samples Test The Independent-Samples T Test 8.1. One-Sample T Test The One-Sample T Test procedure: Tests

More information

Multiple Regression. Dr. Tom Pierce Department of Psychology Radford University

Multiple Regression. Dr. Tom Pierce Department of Psychology Radford University Multiple Regression Dr. Tom Pierce Department of Psychology Radford University In the previous chapter we talked about regression as a technique for using a person s score on one variable to make a best

More information

Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian. Preliminary Data Screening

Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian. Preliminary Data Screening r's age when 1st child born 2 4 6 Density.2.4.6.8 Density.5.1 Sociology 774: Regression Models for Categorical Data Instructor: Natasha Sarkisian Preliminary Data Screening A. Examining Univariate Normality

More information

Examining Turnover in Open Source Software Projects Using Logistic Hierarchical Linear Modeling Approach

Examining Turnover in Open Source Software Projects Using Logistic Hierarchical Linear Modeling Approach Examining Turnover in Open Source Software Projects Using Logistic Hierarchical Linear Modeling Approach Pratyush N Sharma 1, John Hulland 2, and Sherae Daniel 1 1 University of Pittsburgh, Joseph M Katz

More information

5 CHAPTER: DATA COLLECTION AND ANALYSIS

5 CHAPTER: DATA COLLECTION AND ANALYSIS 5 CHAPTER: DATA COLLECTION AND ANALYSIS 5.1 INTRODUCTION This chapter will have a discussion on the data collection for this study and detail analysis of the collected data from the sample out of target

More information

Gasoline Consumption Analysis

Gasoline Consumption Analysis Gasoline Consumption Analysis One of the most basic topics in economics is the supply/demand curve. Simply put, the supply offered for sale of a commodity is directly related to its price, while the demand

More information

Chapter 5 Regression

Chapter 5 Regression Chapter 5 Regression Topics to be covered in this chapter: Regression Fitted Line Plots Residual Plots Regression The scatterplot below shows that there is a linear relationship between the percent x of

More information

FUNDAMENTALS OF QUALITY CONTROL AND IMPROVEMENT

FUNDAMENTALS OF QUALITY CONTROL AND IMPROVEMENT FUNDAMENTALS OF QUALITY CONTROL AND IMPROVEMENT Third Edition AMITAVA MITRA Auburn University College of Business Auburn, Alabama WILEY A JOHN WILEY & SONS, INC., PUBLICATION PREFACE xix PARTI PHILOSOPHY

More information

ANALYSING QUANTITATIVE DATA

ANALYSING QUANTITATIVE DATA 9 ANALYSING QUANTITATIVE DATA Although, of course, there are other software packages that can be used for quantitative data analysis, including Microsoft Excel, SPSS is perhaps the one most commonly subscribed

More information

SAS/STAT 13.1 User s Guide. Introduction to Multivariate Procedures

SAS/STAT 13.1 User s Guide. Introduction to Multivariate Procedures SAS/STAT 13.1 User s Guide Introduction to Multivariate Procedures This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is

More information

What is the general form of a regression equation? What is the difference between y and ŷ?

What is the general form of a regression equation? What is the difference between y and ŷ? 3. Least-Squares Regression What is the general form of a regression equation? What is the difference between y and ŷ? Example: Tapping on cans Don t you hate it when you open a can of soda and some of

More information

Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6

Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6 Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6 Overview This tutorial will demonstrate how to: Summarize core exon-level data to produce gene-level data Perform exploratory analysis

More information

Problem Points Score USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT

Problem Points Score USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT STAT 512 EXAM I STAT 512 Name (7 pts) Problem Points Score 1 40 2 25 3 28 USE YOUR TIME WISELY SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT WRITE LEGIBLY. ANYTHING UNREADABLE WILL NOT BE GRADED GOOD LUCK!!!!

More information

Foley Retreat Research Methods Workshop: Introduction to Hierarchical Modeling

Foley Retreat Research Methods Workshop: Introduction to Hierarchical Modeling Foley Retreat Research Methods Workshop: Introduction to Hierarchical Modeling Amber Barnato MD MPH MS University of Pittsburgh Scott Halpern MD PhD University of Pennsylvania Learning objectives 1. List

More information

Module 7: Multilevel Models for Binary Responses. Practical. Introduction to the Bangladesh Demographic and Health Survey 2004 Dataset.

Module 7: Multilevel Models for Binary Responses. Practical. Introduction to the Bangladesh Demographic and Health Survey 2004 Dataset. Module 7: Multilevel Models for Binary Responses Most of the sections within this module have online quizzes for you to test your understanding. To find the quizzes: Pre-requisites Modules 1-6 Contents

More information

Department of Sociology King s University College Sociology 302b: Section 570/571 Research Methodology in Empirical Sociology Winter 2006

Department of Sociology King s University College Sociology 302b: Section 570/571 Research Methodology in Empirical Sociology Winter 2006 Department of Sociology King s University College Sociology 302b: Section 570/571 Research Methodology in Empirical Sociology Winter 2006 Computer assignment #3 DUE Wednesday MARCH 29 th (in class) Regression

More information

STEP 1: ANOVA WITH ALL FACTORS

STEP 1: ANOVA WITH ALL FACTORS DOE ANALYSIS FOR THE Rz value We take all the terms and perform the factor analysis. These are the significant factors as yielded by ANOVA. STEP 1: ANOVA WITH ALL FACTORS Let us see the design points which

More information

Statistics 201 Summary of Tools and Techniques

Statistics 201 Summary of Tools and Techniques Statistics 201 Summary of Tools and Techniques This document summarizes the many tools and techniques that you will be exposed to in STAT 201. The details of how to do these procedures is intentionally

More information

Continuous Improvement Toolkit

Continuous Improvement Toolkit Continuous Improvement Toolkit Regression (Introduction) Managing Risk PDPC Pros and Cons Importance-Urgency Mapping RACI Matrix Stakeholders Analysis FMEA RAID Logs Break-even Analysis Cost -Benefit Analysis

More information

Measurement Systems Analysis

Measurement Systems Analysis Measurement Systems Analysis Components and Acceptance Criteria Rev: 11/06/2012 Purpose To understand key concepts of measurement systems analysis To understand potential sources of measurement error and

More information

CS130 Software Tools. Fall 2010 Regression and Excel

CS130 Software Tools. Fall 2010 Regression and Excel Software Tools Regression and Excel 1 Regression Analysis (Part 1) Regression Analysis The statistical crystal ball Regression analysis is a form of statistical analysis used for forecasting. We are going

More information

Module 1: Fundamentals of Data Analysis

Module 1: Fundamentals of Data Analysis Using Statistical Data to Make Decisions Module 1: Fundamentals of Data Analysis Dr. Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business S tatistics are an important

More information

Only check assumptions when I explicitly ask you to.

Only check assumptions when I explicitly ask you to. Name: Check one: Section 1 (Tues-Thurs. 10:30-noon): Section 2 (Tues-Thurs. 1:30-3:00): Statistics 102 Midterm #2 November 15, 2001 6-8pm This exam is closed book. You may have two pages of notes. You

More information

Introduction to Multivariate Procedures (Book Excerpt)

Introduction to Multivariate Procedures (Book Excerpt) SAS/STAT 9.22 User s Guide Introduction to Multivariate Procedures (Book Excerpt) SAS Documentation This document is an individual chapter from SAS/STAT 9.22 User s Guide. The correct bibliographic citation

More information

Europ. Pharm., 5th Ed. (2005), Ch. 5.3, A completely randomised (0,4,4,4)-design - An in-vitro assay of influenza vaccines

Europ. Pharm., 5th Ed. (2005), Ch. 5.3, A completely randomised (0,4,4,4)-design - An in-vitro assay of influenza vaccines Page 1 of 6 Europ. Pharm., 5th Ed. (2005), Ch. 5.3, 5.2.2. - A completely randomised (0,4,4,4)-design - An in-vitro assay of influenza Document-Id: Document-63 (Revision:3) Document Type: Quantitative

More information

AcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved.

AcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved. AcaStat How To Guide AcaStat Software Copyright 2016, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents Frequencies... 3 List Variables... 4 Descriptives... 5 Explore Means...

More information

ARIMA LAB ECONOMIC TIME SERIES MODELING FORECAST Swedish Private Consumption version 1.1

ARIMA LAB ECONOMIC TIME SERIES MODELING FORECAST Swedish Private Consumption version 1.1 Bo Sjo 2011-11-10 (Updated) ARIMA LAB ECONOMIC TIME SERIES MODELING FORECAST Swedish Private Consumption version 1.1 Send in a written report to bosjo@liu.se before Wednesday November 25, 2012. 1 1. Introduction

More information

Package ConjointChecks

Package ConjointChecks Package ConjointChecks August 29, 2016 Title A package to check the cancellation axioms of conjoint measurement. Version 0.0.9 Date 2012-12-14 Author Ben Domingue Maintainer Ben

More information

Unit 2 Regression and Correlation 2 of 2 - Practice Problems SOLUTIONS Stata Users

Unit 2 Regression and Correlation 2 of 2 - Practice Problems SOLUTIONS Stata Users Unit 2 Regression and Correlation 2 of 2 - Practice Problems SOLUTIONS Stata Users Data Set for this Assignment: Download from the course website: Stata Users: framingham_1000.dta Source: Levy (1999) National

More information

y x where x age and y height r = 0.994

y x where x age and y height r = 0.994 Unit 2 (Chapters 7 9) Review Packet Answer Key Use the data below for questions 1 through 14 Below is data concerning the mean height of Kalama children. A scientist wanted to look at the effect that age

More information

An Introduction to Latent Growth Curve Modeling (LGC Modeling): A Resource Packet for Participants

An Introduction to Latent Growth Curve Modeling (LGC Modeling): A Resource Packet for Participants AEA 2014: Latent Growth Curve Modeling 1 An Introduction to Latent Growth Curve Modeling (LGC Modeling): A Resource Packet for Participants Innovative Statistical Solutions for Evaluation Data: Latent

More information

Habitat as a predictor of Warbler usage.

Habitat as a predictor of Warbler usage. 1. Introduction: Habitat as a predictor of Warbler usage. Carl James Schwarz (cschwarz@stat.sfu.ca) Over the past few years, the BC MOE has completed detailed habitat evaluations at 223 Black-throated

More information

Telecommunications Churn Analysis Using Cox Regression

Telecommunications Churn Analysis Using Cox Regression Telecommunications Churn Analysis Using Cox Regression Introduction As part of its efforts to increase customer loyalty and reduce churn, a telecommunications company is interested in modeling the "time

More information

Develop Innovative Methods in Secondary Analyses of Child Welfare Databases -- Children s Bureau Discretionary Grants Program Grantee s Final Report

Develop Innovative Methods in Secondary Analyses of Child Welfare Databases -- Children s Bureau Discretionary Grants Program Grantee s Final Report Develop Innovative Methods in Secondary Analyses of Child Welfare Databases -- Children s Bureau Discretionary Grants Program Grantee s Final Report I. Executive Summary Shenyang Guo, Ph.D., Principal

More information

Discriminant models for high-throughput proteomics mass spectrometer data

Discriminant models for high-throughput proteomics mass spectrometer data Proteomics 2003, 3, 1699 1703 DOI 10.1002/pmic.200300518 1699 Short Communication Parul V. Purohit David M. Rocke Center for Image Processing and Integrated Computing, University of California, Davis,

More information

Two-way CASE ANOVA Using SPSS

Two-way CASE ANOVA Using SPSS Two-way CASE ANOVA Using SPSS Copyright 2015, J. Toby Mordkoff Premises 1. We want the most external validity with respect to the subjects. Therefore, we want to maximize the number of subjects that are

More information

CHAPTER 10 ASDA ANALYSIS EXAMPLES REPLICATION IVEware

CHAPTER 10 ASDA ANALYSIS EXAMPLES REPLICATION IVEware CHAPTER 10 ASDA ANALYSIS EXAMPLES REPLICATION IVEware GENERAL NOTES ABOUT ANALYSIS EXAMPLES REPLICATION These examples are intended to provide guidance on how to use the commands/procedures for analysis

More information

1. Contingency Table (Cross Tabulation Table)

1. Contingency Table (Cross Tabulation Table) II. Descriptive Statistics C. Bivariate Data In this section Contingency Table (Cross Tabulation Table) Box and Whisker Plot Line Graph Scatter Plot 1. Contingency Table (Cross Tabulation Table) Bivariate

More information

Statistical Modelling for Social Scientists. Manchester University. January 20, 21 and 24, Modelling categorical variables using logit models

Statistical Modelling for Social Scientists. Manchester University. January 20, 21 and 24, Modelling categorical variables using logit models Statistical Modelling for Social Scientists Manchester University January 20, 21 and 24, 2011 Graeme Hutcheson, University of Manchester Modelling categorical variables using logit models Software commands

More information

Biostat Exam 10/7/03 Coverage: StatPrimer 1 4

Biostat Exam 10/7/03 Coverage: StatPrimer 1 4 Biostat Exam 10/7/03 Coverage: StatPrimer 1 4 Part A (Closed Book) INSTRUCTIONS Write your name in the usual location (back of last page, near the staple), and nowhere else. Turn in your Lab Workbook at

More information

Physi ViewCalc. Operator Instructions

Physi ViewCalc. Operator Instructions Operator Instructions May 2009 Microsoft Excel is a registered trademark of Microsoft Corporation. Copyright Micromeritics Instrument Corporation 2009. Product Description Product Description The tool

More information

Know Your Data (Chapter 2)

Know Your Data (Chapter 2) Let s Get Started! Know Your Data (Chapter 2) Now we each have a time series whose future values we are interested in forecasting. The next step is to become thoroughly familiar with the construction of

More information

Multilevel/ Mixed Effects Models: A Brief Overview

Multilevel/ Mixed Effects Models: A Brief Overview Multilevel/ Mixed Effects Models: A Brief Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised March 27, 2018 These notes borrow very heavily, often/usually

More information

Example Analysis with STATA

Example Analysis with STATA Example Analysis with STATA Exploratory Data Analysis Means and Variance by Time and Group Correlation Individual Series Derived Variable Analysis Fitting a Line to Each Subject Summarizing Slopes by Group

More information

EFFICACY OF ROBUST REGRESSION APPLIED TO FRACTIONAL FACTORIAL TREATMENT STRUCTURES MICHAEL MCCANTS

EFFICACY OF ROBUST REGRESSION APPLIED TO FRACTIONAL FACTORIAL TREATMENT STRUCTURES MICHAEL MCCANTS EFFICACY OF ROBUST REGRESSION APPLIED TO FRACTIONAL FACTORIAL TREATMENT STRUCTURES by MICHAEL MCCANTS B.A., WINONA STATE UNIVERSITY, 2007 B.S., WINONA STATE UNIVERSITY, 2008 A THESIS submitted in partial

More information

Stata Program Notes Biostatistics: A Guide to Design, Analysis, and Discovery Second Edition Chapter 12: Analysis of Variance

Stata Program Notes Biostatistics: A Guide to Design, Analysis, and Discovery Second Edition Chapter 12: Analysis of Variance Stata Program Notes Biostatistics: A Guide to Design, Analysis, and Discovery Second Edition Chapter 12: Analysis of Variance Program Note 12.1 - One-Way ANOVA and Multiple Comparisons The Stata command

More information

This is an important debate

This is an important debate 1 of 5 8/3/2009 2:46 PM Create account Login Research-based policy analysis and commentary from leading economists Andrew K. Rose 1 August 2009 Less synchronised business cycles would be good news for

More information

STAT 350 (Spring 2016) Homework 12 Online 1

STAT 350 (Spring 2016) Homework 12 Online 1 STAT 350 (Spring 2016) Homework 12 Online 1 1. In simple linear regression, both the t and F tests can be used as model utility tests. 2. The sample correlation coefficient is a measure of the strength

More information

Example Analysis with STATA

Example Analysis with STATA Example Analysis with STATA Exploratory Data Analysis Means and Variance by Time and Group Correlation Individual Series Derived Variable Analysis Fitting a Line to Each Subject Summarizing Slopes by Group

More information

Archives of Scientific Psychology Reporting Questionnaire for Manuscripts Describing Primary Data Collections

Archives of Scientific Psychology Reporting Questionnaire for Manuscripts Describing Primary Data Collections (Based on APA Journal Article Reporting Standards JARS Questionnaire) 1 Archives of Scientific Psychology Reporting Questionnaire for Manuscripts Describing Primary Data Collections JARS: ALL: These questions

More information

FUNDAMENTALS OF QUALITY CONTROL AND IMPROVEMENT. Fourth Edition. AMITAVA MITRA Auburn University College of Business Auburn, Alabama.

FUNDAMENTALS OF QUALITY CONTROL AND IMPROVEMENT. Fourth Edition. AMITAVA MITRA Auburn University College of Business Auburn, Alabama. FUNDAMENTALS OF QUALITY CONTROL AND IMPROVEMENT Fourth Edition AMITAVA MITRA Auburn University College of Business Auburn, Alabama WlLEY CONTENTS PREFACE ABOUT THE COMPANION WEBSITE PART I PHILOSOPHY AND

More information

Lab 1: A review of linear models

Lab 1: A review of linear models Lab 1: A review of linear models The purpose of this lab is to help you review basic statistical methods in linear models and understanding the implementation of these methods in R. In general, we need

More information

Statistical Modelling for Business and Management. J.E. Cairnes School of Business & Economics National University of Ireland Galway.

Statistical Modelling for Business and Management. J.E. Cairnes School of Business & Economics National University of Ireland Galway. Statistical Modelling for Business and Management J.E. Cairnes School of Business & Economics National University of Ireland Galway June 28 30, 2010 Graeme Hutcheson, University of Manchester Luiz Moutinho,

More information

Segmentation and Targeting

Segmentation and Targeting Segmentation and Targeting Outline The segmentation-targeting-positioning (STP) framework Segmentation The concept of market segmentation Managing the segmentation process Deriving market segments and

More information

Statistics, Data Analysis, and Decision Modeling

Statistics, Data Analysis, and Decision Modeling - ' 'li* Statistics, Data Analysis, and Decision Modeling T H I R D E D I T I O N James R. Evans University of Cincinnati PEARSON Prentice Hall Upper Saddle River, New Jersey 07458 CONTENTS Preface xv

More information

Introduction to Hierarchical Linear Models ICPSR 2009

Introduction to Hierarchical Linear Models ICPSR 2009 Introduction to Hierarchical Linear Models ICPSR 2009 Instructors: Aline G. Sayer, University of Massachusetts Amherst Natalya Verbitsky Savitz, Mathematica Policy Research Course Description The hierarchical

More information

Informed Decision-Making in Exploratory Factor Analysis

Informed Decision-Making in Exploratory Factor Analysis Informed Decision-Making in Exploratory Factor Analysis Melissa Cater Louisiana State University AgCenter Harrisonburg, LA mcater@agcenter.lsu.edu Krisanna Machtmes Louisiana State University Baton Rouge,

More information

Using Excel s Analysis ToolPak Add-In

Using Excel s Analysis ToolPak Add-In Using Excel s Analysis ToolPak Add-In Bijay Lal Pradhan, PhD Introduction I have a strong opinions that we can perform different quantitative analysis, including statistical analysis, in Excel. It is powerful,

More information

Multilevel Mixed-Effects Generalized Linear Models. Prof. Dr. Luiz Paulo Fávero Prof. Dr. Matheus Albergaria

Multilevel Mixed-Effects Generalized Linear Models. Prof. Dr. Luiz Paulo Fávero Prof. Dr. Matheus Albergaria Multilevel Mixed-Effects Generalized Linear Models in aaaa Prof. Dr. Luiz Paulo Fávero Prof. Dr. Matheus Albergaria SUMMARY - Theoretical Fundamentals of Multilevel Models. - Estimation of Multilevel Mixed-Effects

More information

Creative Commons Attribution-NonCommercial-Share Alike License

Creative Commons Attribution-NonCommercial-Share Alike License Author: Brenda Gunderson, Ph.D., 2015 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution- NonCommercial-Share Alike 3.0 Unported License:

More information

Super-marketing. A Data Investigation. A note to teachers:

Super-marketing. A Data Investigation. A note to teachers: Super-marketing A Data Investigation A note to teachers: This is a simple data investigation requiring interpretation of data, completion of stem and leaf plots, generation of box plots and analysis of

More information

Vital Attributes of Information Technology Service Consultants in Incident Management

Vital Attributes of Information Technology Service Consultants in Incident Management Vital Attributes of Information Technology Service Consultants in Incident Management Rumel V. Atienza. 1, Mc Robart Steeler G. Dela Peña 2 and Matea Alvyn Trinidad 3 1,2 De La Salle University, 241 Taft

More information

MEANS AND ENDS: A COMPARATIVE STUDY OF EMPIRICAL METHODS FOR INVESTIGATING GOVERNANCE AND PERFORMANCE

MEANS AND ENDS: A COMPARATIVE STUDY OF EMPIRICAL METHODS FOR INVESTIGATING GOVERNANCE AND PERFORMANCE MEANS AND ENDS: A COMPARATIVE STUDY OF EMPIRICAL METHODS FOR INVESTIGATING GOVERNANCE AND PERFORMANCE Carolyn J. Heinrich and Laurence E. Lynn, Jr. The University of Chicago DRAFT September 1999 Prepared

More information

emergent group-level effects U.S. Army Medical Research Unit-Europe Walter Reed Army Institute of Research

emergent group-level effects U.S. Army Medical Research Unit-Europe Walter Reed Army Institute of Research The impact of within-group conflict on perceptions of the organizations: Evidence for emergent group-level effects Paul D. Bliese, Jeffrey L. Thomas, Amy B. Adler, & Kathleen M. Wright U.S. Army Medical

More information

Chapter 10 Regression Analysis

Chapter 10 Regression Analysis Chapter 10 Regression Analysis Goal: To become familiar with how to use Excel 2007/2010 for Correlation and Regression. Instructions: You will be using CORREL, FORECAST and Regression. CORREL and FORECAST

More information

Lecture (chapter 7): Estimation procedures

Lecture (chapter 7): Estimation procedures Lecture (chapter 7): Estimation procedures Ernesto F. L. Amaral February 19 21, 2018 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015. Statistics: A Tool for Social Research.

More information

Chapter 11. Multiple-Sample SEM. Overview. Rationale of multiple-sample SEM. Multiple-sample path analysis. Multiple-sample CFA.

Chapter 11. Multiple-Sample SEM. Overview. Rationale of multiple-sample SEM. Multiple-sample path analysis. Multiple-sample CFA. Chapter 11 Multiple-Sample SEM Facts do not cease to exist because they are ignored. Overview Aldous Huxley Rationale of multiple-sample SEM Multiple-sample path analysis Multiple-sample CFA Extensions

More information

Introduction to Business Research 3

Introduction to Business Research 3 Synopsis Introduction to Business Research 3 1. Orientation By the time the candidate has completed this module, he or she should understand: what has to be submitted for the viva voce examination; what

More information

SPSS Guide Page 1 of 13

SPSS Guide Page 1 of 13 SPSS Guide Page 1 of 13 A Guide to SPSS for Public Affairs Students This is intended as a handy how-to guide for most of what you might want to do in SPSS. First, here is what a typical data set might

More information