Examples of Using Stata v11.0 with JRR replicate weights Provided in the NHANES data set

Similar documents
Survey commands in STATA

CHAPTER 11 ASDA ANALYSIS EXAMPLES REPLICATION-SUDAAN

SUDAAN Analysis Example Replication C5

Interpreting and Visualizing Regression models with Stata Margins and Marginsplot. Boriana Pratt May 2017

SUDAAN Analysis Example Replication C6

ECON Introductory Econometrics Seminar 6

Compartmental Pharmacokinetic Analysis. Dr Julie Simpson

This is a quick-and-dirty example for some syntax and output from pscore and psmatch2.

The Multivariate Regression Model

Introduction of STATA

Propensity Scores for Multiple Treatments

Week 10: Heteroskedasticity

Stata v 12 Illustration. One Way Analysis of Variance

Using Weights in the Analysis of Survey Data

Using Stata 11 & higher for Logistic Regression Richard Williams, University of Notre Dame, Last revised March 28, 2015

The relationship between innovation and economic growth in emerging economies

Advanced Analytics through the credit cycle

*STATA.OUTPUT -- Chapter 13

EFA in a CFA Framework

A HYPOTHETICAL US SPORTS GAMBLING MARKET. Richard A. McGowan & John F. Mahon

ROBUST ESTIMATION OF STANDARD ERRORS

Appendix C: Lab Guide for Stata

BIOSTATS 640 Spring 2017 Stata v14 Unit 2: Regression & Correlation. Stata version 14

Introduction to Survey Data Analysis. Linda K. Owens, PhD. Assistant Director for Sampling & Analysis

Module 15: Multilevel Modelling of Repeated Measures Data. Stata Practical

for var trstprl trstlgl trstplc trstplt trstep: reg X trust10 stfeco yrbrn hinctnt edulvl pltcare polint wrkprty

The Multivariate Dustbin

Post-Estimation Commands for MLogit Richard Williams, University of Notre Dame, Last revised February 13, 2017

Correlated Random Effects Panel Data Models

Two pieces of evidence on gender discrimination in the Italian University system

Technical note The treatment effect: Comparing the ESR and PSM methods with an artificial example By: Araar, A.: April 2015:

SHIPPING COST, SPOT RATE AND EBAY AUCTIONS: A STRUCTURED EQUATION MODELING APPROACH

Full-time schooling, part-time schooling, and wages: returns and risks in Portugal

[DataSet3] C:\Program Files\SPSSInc\Statistics17\Samples\English\GSS93 subset.sav. Auto-Clustering. Ratio of BIC Changes b

Regression Analysis I & II

Full-time schooling, part-time schooling, and wages: returns and risks in Portugal

CHAPTER Activity Cost Behavior

Calculating Absolute Rate Differences and Relative Rate Ratios in SAS/SUDAAN and STATA. Ashley Hirai, PhD May 20, 2014

CROSSTAB Example #8. This example illustrates the variety of hypotheses and test statistics now available on the TEST statement in CROSSTAB.

The Effect of Prepayment on Energy Use. By: Michael Ozog, Ph.D. Integral Analytics, Inc.

3. Stated choice analyses

Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS

Questionnaire. (3) (3) Bachelor s degree (3) Clerk (3) Third. (6) Other (specify) (6) Other (specify)

Unit 6: Simple Linear Regression Lecture 2: Outliers and inference

Semiparametric analysis of case control genetic data in the presence of environmental factors

Measuring Impact of Increase in High-Skilled Workers on the Livelihoods of Medium- and Low-Skilled Workers

What Factors Influence Seat Belt Usage Rates in the United States?: A Meta-analysis

Non-linear Pricing of Paid Content Products

Chapter 2. Linear model. Put some concreteness on problem. The Bivariate Regression Model. Sample of n observations, labeled as i=1,2,..

AcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved.

MULTILOG Example #3. SUDAAN Statements and Results Illustrated. Input Data Set(s): IRONSUD.SSD. Example. Solution

Outliers Richard Williams, University of Notre Dame, Last revised April 7, 2016

remained are mine. 1 I would like to thank Dr Nguyen Thang, Director of Forecast and Analysis for his helpful comments. All errors

F u = t n+1, t f = 1994, 2005

Telecommunications Churn Analysis Using Cox Regression

Title. Syntax. Description. xt Introduction to xt commands

Evaluating options for securing camel breeding stock in production systems vulnerable to climate variability in Northern Kenya Mumina G.

BIO 226: Applied Longitudinal Analysis. Homework 2 Solutions Due Thursday, February 21, 2013 [100 points]

Sylvain Tremblay SAS Canada

R. Creedon, J. Krstic, R. Mann, K. Ruffini, M. Skuodis, K. Smula, M. Vlekke

GHG Fluxes Calculation. By Ahmad S. Mat Su Dec. 10, 2012

Drivers of Sustainable Land Management in Eastern Africa

demographic of respondent include gender, age group, position and level of education.

Online Appendix Stuck in the Adoption Funnel: The Effect of Interruptions in the Adoption Process on Usage

SQL*LIMS Stability Analytics Software

The impact of workforce on industrialization in Algeria with the existence of expenditure in education as a variable mediator from 1970 to 2014

Research Article Minimum Porosity Formation in Pressure Die Casting by Taguchi Method

Psych 5741/5751: Data Analysis University of Boulder Gary McClelland & Charles Judd

Tougher rules on mandatory substitution brought forth price drop in the Swedish pharmaceutical market

FIRM S SPECIFIC ATTRIBUTES AND VOLUNTARY ENVIRONMENTAL DISCLOSURE IN NIGERIA: EVIDENCE FROM LISTED MANUFACTURING COMPANIES

Gender Wage Gaps in Greece 1 - an examination among University Graduates

FROM: Dan Rubado, Evaluation Project Manager, Energy Trust of Oregon; Phil Degens, Evaluation Manager, Energy Trust of Oregon

An Econometric Assessment of Electricity Demand in the United States Using Panel Data and the Impact of Retail Competition on Prices

Examining the Short-Run Price Elasticity of Gasoline Demand in the United States

Measuring Parallelism and Relative Potency In Well-Behaved and Ill-Behaved Cell-Based Bioassays

Please respond to each of the following attitude statement using the scale below:

Measuring the Impact of a Comprehensive Health and Wellness Initiative

A Comparative Assessment of Disclosure Risk and Data Quality between MASSC and Other Statistical Disclosure Limitation Methods

Multiple Imputation and Multiple Regression with SAS and IBM SPSS

Understanding and accounting for product

Survey of Income and Program Participation 2001 Wave 8 Food Security Data File Technical Documentation and User Notes

Improved Water Supply Forecasts for the Kootenay Basin

1 of :01 PM

Relationship between Consumption, Labor Supply and Fertility - Theory and Evidence from Japan

Need Additional Statistics in Your Report? - ODS OUTPUT to the Rescue!

A Latent Class Analysis of agricultural technology adoption behavior in Uganda: Implications for Optimal Targeting

FACTORS THAT DETERMINE THE STATED WILLINGNESS TO PAY FOR AIR POLLUTION: A CASE OF BOPHELONG TOWNSHIP

Market Sentiment Helps Explain the Price of Bitcoin

The Underlying Factors of Regional U.S. Hotel Market Resiliency Post 9/11

Biodiversity, Ecosystem Conservation and Poverty Reduction: A Contingent Valuation Approach in Grande Riviere

Bangladesh - CGAP Smallholder Household Survey 2016, Building the Evidence Base on The Agricultural and Financial Lives of Smallholder Households

CHAPTER. Activity Cost Behavior

What can a CIE tell us about the origins of negative treatment effects of a training programme

Probability Of Booking

The Importance of Service Encounter Social Exchange Elements for Creating Positive Word of Mouth within the Australian Public Hotel Industry

1. Department of Design and Environmental Analysis, Cornell University; Energy Technology Area, Lawrence Berkeley National Laboratory

Palestinian Central Bureau of Statistics Economic Statistics Directorate

Timing Production Runs

Mohammad Abdallah Fayad Altawalbeh, Firas Na el Rawhi Hashem, Khalid Ali Ahmad Alduneibat. Tafilah Technical University, Jordan

Transcription:

Examples of Using Stata v110 with JRR replicate weights Provided in the NHANES 1999-2000 data set This document is designed to illustrate comparisons of methods to use JRR replicate weights sometimes provided by NCHS and other distributors of public release data sets The NHANES 1999-2000 provided JRR replicate weights along with masked strata and PSU variables Demonstrated is how to use the JRR replicate weights directly in Stata along with a comparison of results using both the Taylor Series linearization approach and use of the JRR option in Stata where the replicates are developed by the Stata program Stata Do File: ******************************************************************************************************************** * Use of JRR replicate weights with NHANES 1999-2000 dataset * Comparison to Linearization and JRR with "Masked Weights" ******************************************************************************************************************** * read in dataset with demo variables and blood pressure readings use "F:\brahms\summerclasses\complexanalysis2007\demo_bpx_nhanes9900dta", clear * do surveyset with 52 replicate weights svyset _n [pweight=wtint2yr], jkrweight(wtmrep01 wtmrep02 wtmrep03 wtmrep04 wtmrep05 wtmrep06 wtmrep07 wtmrep08 wtmrep09 /// wtmrep10 wtmrep11 wtmrep12 wtmrep13 wtmrep14 wtmrep15 wtmrep16 wtmrep17 wtmrep18 wtmrep19 /// wtmrep20 wtmrep21 wtmrep22 wtmrep23 wtmrep24 wtmrep25 wtmrep26 wtmrep27 wtmrep28 wtmrep29 /// wtmrep30 wtmrep31 wtmrep32 wtmrep33 wtmrep34 wtmrep35 wtmrep36 wtmrep37 wtmrep38 wtmrep39 /// wtmrep40 wtmrep41 wtmrep42 wtmrep43 wtmrep44 wtmrep45 wtmrep46 wtmrep47 wtmrep48 wtmrep49 /// wtmrep50 wtmrep51 wtmrep52) vce(linearized) svydes * run analyses using the JRR replicate weights provided by project staff svy, vce(jackknife): mean bpxdi1 * note that riagendr =1 = male and =2= female svy, vce(jackknife): tabulate riagendr, cell se svy, vce(jackknife): regress bpxdi1 iriagendr * reset svy using masked stratum and psu variables svyset sdmvpsu [pweight=wtint2yr], strata(sdmvstra) vce(linearized) svydes * use of linearized approach svy, vce(linearized): mean bpxdi1 svy, vce(linearized): regress bpxdi1 iriagendr * use of JRR approach with strata and psu variables and Stata creates the JRR replicates for us svy, vce(jackknife): mean bpxdi1 svy, vce(jackknife): regress bpxdi1 iriagendr

Output : * read in dataset with demo variables and blood pressure readings use "F:\brahms\summerclasses\complexanalysis2007\demo_bpx_nhanes9900dta", clear * do surveyset with 52 replicate weights svyset _n [pweight=wtint2yr], jkrweight(wtmrep01 wtmrep02 wtmrep03 wtmrep04 wtmrep05 wtmrep06 wtmrep07 wtmrep08 wtmrep09 /// > wtmrep10 wtmrep11 wtmrep12 wtmrep13 wtmrep14 wtmrep15 wtmrep16 wtmrep17 wtmrep18 wtmrep19 /// > wtmrep20 wtmrep21 wtmrep22 wtmrep23 wtmrep24 wtmrep25 wtmrep26 wtmrep27 wtmrep28 wtmrep29 /// > wtmrep30 wtmrep31 wtmrep32 wtmrep33 wtmrep34 wtmrep35 wtmrep36 wtmrep37 wtmrep38 wtmrep39 /// > wtmrep40 wtmrep41 wtmrep42 wtmrep43 wtmrep44 wtmrep45 wtmrep46 wtmrep47 wtmrep48 wtmrep49 /// > wtmrep50 wtmrep51 wtmrep52) vce(linearized) jkrweight: wtmrep01 wtmrep02 wtmrep03 wtmrep04 wtmrep05 wtmrep06 wtmrep07 wtmrep08 wtmrep09 wtmrep10 wtmrep11 wtmrep12 wtmrep13 wtmrep14 wtmrep15 wtmrep16 wtmrep17 wtmrep18 wtmrep19 wtmrep20 wtmrep21 wtmrep22 wtmrep23 wtmrep24 wtmrep25 wtmrep26 wtmrep27 wtmrep28 wtmrep29 wtmrep30 wtmrep31 wtmrep32 wtmrep33 wtmrep34 wtmrep35 wtmrep36 wtmrep37 wtmrep38 wtmrep39 wtmrep40 wtmrep41 wtmrep42 wtmrep43 wtmrep44 wtmrep45 wtmrep46 wtmrep47 wtmrep48 wtmrep49 wtmrep50 wtmrep51 wtmrep52 Strata 1: <one> SU 1: <observations> svydes Survey: Describing stage 1 sampling units jkrweight: wtmrep01 wtmrep02 wtmrep03 wtmrep04 wtmrep05 wtmrep06 wtmrep07 wtmrep08 wtmrep09 wtmrep10 wtmrep11 wtmrep12 wtmrep13 wtmrep14 wtmrep15 wtmrep16 wtmrep17 wtmrep18 wtmrep19 wtmrep20 wtmrep21 wtmrep22 wtmrep23 wtmrep24 wtmrep25 wtmrep26 wtmrep27 wtmrep28 wtmrep29 wtmrep30 wtmrep31 wtmrep32 wtmrep33 wtmrep34 wtmrep35 wtmrep36 wtmrep37 wtmrep38 wtmrep39 wtmrep40 wtmrep41 wtmrep42 wtmrep43 wtmrep44 wtmrep45 wtmrep46 wtmrep47 wtmrep48 wtmrep49 wtmrep50 wtmrep51 wtmrep52 Strata 1: <one> SU 1: <observations> #Obs per Unit ---------------------------- Stratum #Units #Obs min mean max 1 9282 9282 1 10 1 1 9282 9282 1 10 1

* run analyses using the JRR replicate weights provided by project staff svy, vce(jackknife): mean bpxdi1 Jackknife replications (52) 50 Number of strata = 1 Number of obs = 6457 Population size = 194157303 Mean Std Err [95% Conf Interval] bpxdi1 7022135 5179801 6918146 7126123 * note that riagendr =1 = male and =2= female svy, vce(jackknife): tabulate riagendr, cell se (running tabulate on estimation sample) Number of strata = 1 Number of obs = 9282 Population size = 250273131 ------------------------------------ gender - adjudicat ed proportions se ----------+------------------------- 1 4915 0014 2 5085 0014 Total 1 ------------------------------------ Key: proportions = cell proportions se = jackknife standard errors of cell proportions svy, vce(jackknife): regress bpxdi1 iriagendr Jackknife replications (52) 50 Number of strata = 1 Number of obs = 6457 Population size = 194157303 F( 1, 51) = 7951 Prob > F = 00000 R-squared = 00127 bpxdi1 Coef Std Err t P> t [95% Conf Interval] -------------+-- 2riagendr -3050762 3421358-892 0000-3737628 -2363895 _cons 7177298 5124574 14006 0000 7074418 7280178

* reset svy using masked stratum and psu variables svyset sdmvpsu [pweight=wtint2yr], strata(sdmvstra) vce(linearized) Strata 1: sdmvstra SU 1: sdmvpsu svydes Survey: Describing stage 1 sampling units Strata 1: sdmvstra SU 1: sdmvpsu #Obs per Unit ---------------------------- Stratum #Units #Obs min mean max 1 3 983 287 3277 355 2 2 650 272 3250 378 3 2 759 370 3795 389 4 2 812 337 4060 475 5 2 700 346 3500 354 6 2 689 326 3445 363 7 2 798 361 3990 437 8 2 685 291 3425 394 9 2 644 315 3220 329 10 2 574 266 2870 308 11 2 683 300 3415 383 12 2 763 365 3815 398 13 2 542 268 2710 274 13 27 9282 266 3438 475 * use of linearized approach svy, vce(linearized): mean bpxdi1 Linearized Mean Std Err [95% Conf Interval] bpxdi1 7022135 4736565 6920545 7123724

svy, vce(linearized): regress bpxdi1 iriagendr F( 1, 14) = 4162 Prob > F = 00000 R-squared = 00127 Linearized bpxdi1 Coef Std Err t P> t [95% Conf Interval] -------------+-- 2riagendr -3050762 4728907-645 0000-4065011 -2036512 _cons 7177298 4612186 15562 0000 7078377 727622 * use of JRR approach with strata and psu variables and Stata creates the JRR replicates for us svy, vce(jackknife): mean bpxdi1 Jackknife replications (27) Replications = 27 Mean Std Err [95% Conf Interval] bpxdi1 7022135 4746321 6920336 7123933 svy, vce(jackknife): regress bpxdi1 iriagendr Jackknife replications (27) Replications = 27 F( 1, 14) = 4155 Prob > F = 00000 R-squared = 00127 bpxdi1 Coef Std Err t P> t [95% Conf Interval] -------------+-- 2riagendr -3050762 4732821-645 0000-4065851 -2035673 _cons 7177298 4627265 15511 0000 7078053 7276543