- Stratified Samples - Systematic Samples - Samples can vary - Standard Error



From last time: A sample is a small collection we observe and assume is representative of a larger population. Example: You haven't seen all of Vancouver; you've only seen a small part of it. It would be infeasible to see all of Vancouver. When someone asks you "How is Vancouver?", you infer to the whole population of Vancouver places using your sample.

From last time: A sample is random if every member of the population has an equal chance of being in the sample. Your Vancouver sample is not random. You're more likely to have seen Production Station than 93rd St. in Surrey.

From last time: A simple random sample (SRS) is one where the chances of being in the sample are independent. Your Vancouver sample is not an SRS because if you've seen 93rd St., you're more likely to have also seen 94th St.

A common sampling method that is random but not SRS is stratified sampling. To stratify something means to divide it into groups. (Geologically, into layers.)

To do stratified sampling, first split the population into different groups or strata. Often this is done naturally.

Possible strata: sections of a course, gender, income level, grads/undergrads, any sort of category like that. Then, randomly select some of the strata. Unless you're doing something fancy like multiple layers, the strata are selected using SRS.

Within each stratum, select members of the population using SRS. If the strata are different sizes, select samples from them proportional to their sizes.

Example: Quality testing of milk. A government agency wants to check if the milk from a company is up to code. There are several trucks leaving the plant today; each truck is a stratum (the singular of strata). The agency selects some of the trucks with SRS.

Each truck is carrying many jugs of milk, and some jugs from each selected truck are chosen by SRS. One of the trucks is twice as big as the others, so twice as many jugs are sampled from that one. Therefore every jug has an equal chance of being sampled.
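The equal-chance claim can be checked by multiplying the chance a truck is selected by the chance a jug is selected from that truck. The sketch below uses made-up counts (6 trucks, 3 chosen, 100 or 200 jugs per truck, 10% of jugs tested); none of these numbers come from the example, and the function name is illustrative only.

```python
from fractions import Fraction

def inclusion_probability(trucks_total, trucks_chosen, jugs_on_truck, jugs_sampled):
    """Chance that one particular jug ends up in the sample."""
    # Stage 1: SRS of trucks, so every truck has the same chance of selection.
    p_truck = Fraction(trucks_chosen, trucks_total)
    # Stage 2: SRS of jugs within a selected truck.
    p_jug_given_truck = Fraction(jugs_sampled, jugs_on_truck)
    return p_truck * p_jug_given_truck

# Hypothetical counts: regular trucks carry 100 jugs, the big truck carries 200.
regular = inclusion_probability(6, 3, jugs_on_truck=100, jugs_sampled=10)
big = inclusion_probability(6, 3, jugs_on_truck=200, jugs_sampled=20)
print(regular, big)  # 1/20 1/20
```

Sampling twice as many jugs from the truck that is twice as big keeps the within-truck fraction the same, which is what makes the two probabilities match.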

Say they tested 50 jugs of milk from a total of 5 trucks. That's a lot easier than stopping 50 trucks and testing 1 jug each. This is part of the appeal of stratified sampling. Another appeal is that you can choose EVERY stratum. (A stratum's chance of being picked by SRS becomes 1.)

Example: Employment survey. A large company wants information about its workforce of 1000 full-time employees and 500 part-time employees. The company chooses both strata and uses SRS to select 80 from the full-time stratum and 40 from the part-time stratum. 8% of each stratum is sampled this way.
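The employment survey can be sketched in a few lines of Python. This is only an illustration: the employee IDs are invented, the function name is hypothetical, and `random.sample` stands in for "select by SRS within a stratum".

```python
import random

def proportional_stratified_sample(strata, fraction, seed=None):
    """Draw an SRS of the same fraction from every stratum."""
    rng = random.Random(seed)
    sample = {}
    for name, members in strata.items():
        k = round(len(members) * fraction)      # sample size proportional to stratum size
        sample[name] = rng.sample(members, k)   # SRS without replacement within the stratum
    return sample

# Hypothetical employee IDs; only the stratum sizes come from the example.
strata = {
    "full_time": [f"FT{i}" for i in range(1000)],
    "part_time": [f"PT{i}" for i in range(500)],
}
sample = proportional_stratified_sample(strata, fraction=0.08, seed=1)
sizes = {name: len(chosen) for name, chosen in sample.items()}
print(sizes)  # {'full_time': 80, 'part_time': 40}
```

Because every stratum is used and the same 8% is drawn from each, every employee has the same chance of being selected.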

Samples can vary. Not every sample will be the same. My Vancouver sample is different from yours, which will be different from that of the person sitting next to you. You've all seen different parts of the city; you've all observed a different set of members of the population.

If samples are different, then their means are going to be different too. But, no matter how many times you take a sample, it's always from the same population. So the sample mean $\bar{x}$ can change, but the population mean $\mu$ is always the same (unknown) number.

The sample mean $\bar{x}$, on average, is going to be the population mean $\mu$. (The average of $\bar{x}$ is $\mu$.) The standard deviation of $\bar{x}$ is the standard error: $\sigma_{\bar{x}} = \sigma / \sqrt{n}$

The typical amount that sample means differ from the true mean is the standard error. Technically, it's the standard error of the mean, because you can have standard errors of other things too, but we'll only look at the standard error of the mean.
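A quick simulation can make this concrete. The sketch below repeatedly draws samples from a normal population and compares the spread of the sample means with the population standard deviation divided by the square root of n. The population values (mean 20, standard deviation 5), the sample size of 25, and the number of repeats are arbitrary choices for illustration.

```python
import random
import statistics

def simulate_sample_means(mu, sigma, n, repeats, seed=0):
    """Take `repeats` samples of size n and return the mean of each one."""
    rng = random.Random(seed)
    means = []
    for _ in range(repeats):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        means.append(statistics.fmean(sample))
    return means

mu, sigma, n = 20.0, 5.0, 25   # illustrative population and sample size
means = simulate_sample_means(mu, sigma, n, repeats=20000)

print(statistics.fmean(means))   # close to mu: sample means average out to the true mean
print(statistics.stdev(means))   # close to sigma / sqrt(n): the standard error
print(sigma / n ** 0.5)          # 1.0
```

Each individual sample mean differs from 20, but the collection of them is centred on 20 and is much less spread out than single observations are.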

The standard error is our main tool for reducing the uncertainty of our sample mean. n is the sample size. The larger n gets, the smaller $\sigma_{\bar{x}} = \sigma / \sqrt{n}$ gets. In other words, a bigger sample gives you a better estimate of the population mean.

This should be intuitive: if you take a bigger sample, you have more information about the population.

This is important because it gives us some measure of control over the statistics we get. We can't do that with the standard deviation.

Say the government agency from before knows that in regular milk, the amount of calcium is normally distributed with mean 20 mg/L and standard deviation 5 mg/L. If it samples one 1 L bottle of regular milk, the result will have a standard error of 5 mg/L. If it samples four 1 L bottles of milk, the mean calcium concentration will have a standard error of $5 / \sqrt{4} = 2.5$ mg/L.

If the agency samples 25 one-litre bottles, the average calcium per bottle is going to be a lot closer to the true mean of 20 mg/L than it was with 4 bottles. The sample mean of 25 bottles will have a standard error of $5 / \sqrt{25} = 1$ mg/L, even though the standard deviation of a single bottle is 5 mg/L.
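The shrinking standard error in the milk example can be checked with a few lines of Python. This is a minimal sketch; the helper name `standard_error` is an illustrative choice, and it assumes the standard error of the mean is the standard deviation divided by the square root of n.

```python
import math

def standard_error(sigma, n):
    """Standard error of the mean for a sample of size n."""
    return sigma / math.sqrt(n)

sigma = 5.0  # standard deviation of calcium in a single bottle, in mg/L
for n in (1, 4, 25):
    print(n, standard_error(sigma, n))
# 1 5.0
# 4 2.5
# 25 1.0
```

Note that cutting the standard error in half takes four times as many bottles, not twice as many.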

Why does this happen? Consider: Which is more likely, one bottle being above the mean, or a whole lot of bottles? In a large sample, the bottles above the mean are going to balance out with the bottles below the mean. As you get more and more bottles, the closer to a 50-50 balance you would expect.

As we get closer to that 50-50 balance, the sample mean will tend to be closer and closer to the true mean. Since we become more sure of where the sample mean will be, we say it becomes less variable.

It's why elevators can set these limits: It's 68 kg/person, and lots of people weigh more than 68 kg. But how often will you get a group of 26 averaging more than 68 kg/person?

Practice example: Suppose the average age when smokers begin is 17 years old with a standard deviation of 2 years. What's the standard error of the mean from a sample of 16 smokers? What's the standard error of the mean of 100 smokers?
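A worked version of the practice example, assuming the standard error of the mean is the population standard deviation divided by the square root of the sample size:

$$\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}: \qquad \frac{2}{\sqrt{16}} = 0.5 \text{ years}, \qquad \frac{2}{\sqrt{100}} = 0.2 \text{ years}$$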

The centre of the sample mean's distribution never changes with sample size; it's always centred around the true mean of 17.

We can expand our definition of z-score from something that pertains to single values to something that pertains to sample means. It's still (value minus mean) / (standard deviation of the value), but since the value is a sample mean instead of a single value, it has a different standard deviation: the standard error.

Consider again the smokers starting at a mean age of 17 years with a standard deviation of 2 years. What's the z-score of a single smoker if he starts at 18 years? What's the z-score of a sample of 16 smokers if their mean is 18 years?
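Both z-scores can be computed with one small function. This is a sketch, assuming a mean starting age of 17 and a standard deviation of 2 years as in the practice example; the function name and the default of n=1 for a single value are illustrative choices.

```python
import math

def z_score(value, mu, sigma, n=1):
    """z-score of a single value (n=1) or of the mean of a sample of size n."""
    standard_error = sigma / math.sqrt(n)
    return (value - mu) / standard_error

print(z_score(18, mu=17, sigma=2))         # single smoker: 0.5
print(z_score(18, mu=17, sigma=2, n=16))   # mean of 16 smokers: 2.0
```

The same one-year gap from the mean is unremarkable for one person but fairly unusual for the average of 16 people.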

Instead of finding the standard error first, we can put it all into one formula. (Just another option.)

What's the chance of getting a sample of 100 smokers who started at an average of 18 years or older?
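One way to work this out is sketched below, using Python's standard-library `statistics.NormalDist`. It assumes a mean starting age of 17 with a standard deviation of 2 years, as in the practice example, and that the mean of 100 smokers is approximately normal; the function name is illustrative.

```python
import math
from statistics import NormalDist

def prob_sample_mean_at_least(threshold, mu, sigma, n):
    """Return (z, P(sample mean >= threshold)), assuming a normal sample mean."""
    standard_error = sigma / math.sqrt(n)
    z = (threshold - mu) / standard_error
    upper_tail = 1.0 - NormalDist().cdf(z)   # area to the right of z
    return z, upper_tail

z, p = prob_sample_mean_at_least(18, mu=17, sigma=2, n=100)
print(z)   # 5.0
print(p)   # a tiny probability, on the order of 3 in 10 million
```

A z-score of 5 is far out in the tail, so a sample of 100 averaging 18 or older would be extremely rare if the true mean really is 17.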

Common Question: How do I know which z-score formula to use? This one or the one from chapter 5? Answer: Look for an indication that you're dealing with a sample. If it's giving you an n (sample size), use it. Pro-Tip: Use this new one by default. If you can't find n, you probably have a sample of size 1, so use n=1.

When you use a sample of size 1, the standard error z becomes the standard deviation z. When n=1, $\sqrt{n} = 1$, so $z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}} = \frac{x - \mu}{\sigma}$.

In other terms: Use the formula with square root n when you have an n. Use the original z-score formula when it's just a single value. If you don't know, use the square root n formula, because you'll still get the right answer; you'll just waste some effort.

Finally: Why would we ever deal with standard error? Parameters are usually unknown. In less contrived situations, we wouldn't know what the true mean was, but the larger our sample, the better our idea of that true mean.

On Monday - More standard error, now with proportion data! - Law of large numbers - End of Midterm 1 exam material.