Frequently asked questions about the assignment. Introduction to tophat. Probability, conditional probability, marginal, and Bayes rule


1 Stat 203 Week 3 Agenda: Frequently asked questions about the assignment. Introduction to tophat. Probability, conditional probability, marginal probability, and Bayes' rule. Lots of probability examples. Correlation vs association. Scatterplots.

2 Assignment/Notes Frequently Asked Questions
Q: Where is the 'change dir' function for R on a Mac?
A: Misc -> Change Working Directory
Q: Does this have to be typed, or can it be written?
A: Typed is better, but written works if it's scanned into a PDF format using an app like TinyScanner.
Q: Do we have to do all these questions?
A: No, only the graded ones.
Q: R for all of them?
A: No, only the ones with stars (and datasets listed).

3 Q: My answer key says the answer for 2.31 (guineapigs) is different from what my computer says.
A: Ignore the answer key; I removed 12 of the data points.
Q: There's no dataset for 3.23, but it wants a histogram.
A: Only a frequency table is given, not the data. The histogram can be drawn by hand.
Q: For the 3sd rule, it says 3*15...
A: That should be 3*25

4 Document camera: Venn Diagrams

5 Document camera: Mutually exclusive

6 <break question 1> Let T be the event that someone tests positive for a disease. Let D be the event that they have the disease. We would assume that having the disease makes you more likely to test positive. So let Pr(T | D) = 0.90, Pr(T | not D) = 0.30, and finally Pr(D) = 0.20. What is the probability that someone tests positive AND has the disease?
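The question above can be worked through numerically. The sketch below is only an illustration: it reads the slide's two test probabilities as conditional probabilities, Pr(T | D) and Pr(T | not D), and the variable names are made up for this example. It multiplies along each branch to get the joint probabilities, adds them for the marginal Pr(T), and then applies Bayes' rule.

```python
# Values from the slide, read as conditional probabilities (an assumption).
p_d = 0.20               # Pr(D): has the disease
p_t_given_d = 0.90       # Pr(T | D): tests positive, given disease
p_t_given_not_d = 0.30   # Pr(T | not D): tests positive, given no disease

# Joint probabilities: multiply the condition's probability by the conditional.
p_t_and_d = p_t_given_d * p_d
p_t_and_not_d = p_t_given_not_d * (1 - p_d)

# Marginal probability of a positive test: add the two ways it can happen.
p_t = p_t_and_d + p_t_and_not_d

# Bayes' rule: Pr(D | T) = Pr(T and D) / Pr(T)
p_d_given_t = p_t_and_d / p_t

print(f"Pr(T and D) = {p_t_and_d:.2f}")
print(f"Pr(T)       = {p_t:.2f}")
print(f"Pr(D | T)   = {p_d_given_t:.3f}")
```

Under these numbers, fewer than half of the people who test positive actually have the disease, because most people do not have it to begin with.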

7 The union / Or operator. If we take two events that never happen together, the probability of one event OR the other happening is the two probabilities added together. Pr( Vancouver OR Toronto is voted the best city) = Pr( Vancouver is best) + Pr(Toronto is best). They can't both be the best city, so these events never happen together. Another term for never happening together is mutually exclusive.

8 Example: A lottery machine picks a single number from 1 to 49. Pr( Machine picks 1 or 2) = Pr( Picks 1) + Pr(Picks 2) = 1/49 + 1/49 = 2/49. The (simplified) one-or-the-other formula is... Pr(A OR B) = Pr(A) + Pr(B) when A and B can't happen together. We could also have written Pr( Picks 2 or less) = 2/49

9 For that matter, we could have written Pr( Picks 3 or less) = Pr(Picks 1) + Pr(Picks 2) + Pr(Picks 3) = 1/49 + 1/49 + 1/49 = 3/49. Or even... Pr( Machine picks 10 or less) = (10 numbers that are 10 or less) / (49 numbers in total) = 10/49

10 What happens when the two events CAN happen together? In other words, what happens when events A and B are NOT mutually exclusive? We can't just add the chances of the two events because some outcomes are going to get double counted. For example, in a 52 card deck of cards, what is the chance of getting a King OR a Heart? There are 4 kings, and there are 13 hearts. But there are only 16 cards that are either a king OR a heart.

11 Hearts: A, 2, 3, 4, 5, 6, 7, 8, 9, 10, J, Q, K. Other kings: K, K, K. If we were to add the probabilities as if they were mutually exclusive, we would overestimate the total probability. Pr(King) = 4/52, Pr(Heart) = 13/52, Pr(King) + Pr(Heart) = 17/52, when we know by counting that Pr(King OR Heart) = 16/52.

12 So where is the difference coming from? If we add the two probabilities directly, the king of hearts is counted in both sets. The FULL formula for finding Pr(A or B) is...

13 Pr(A or B) = Pr(A) + Pr(B) - Pr(A and B), where Pr(A) + Pr(B) counts the outcomes from both sets, and - Pr(A and B) removes one copy of each 'double counted' outcome. Pr(King or Heart) = 4/52 + 13/52 - 1/52 = 16/52. If you don't know whether two events are mutually exclusive, which formula should you use? Always use the full formula. If A and B are mutually exclusive, then Pr(A and B) = 0, so subtracting Pr(A and B) won't change anything.
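The full formula can be checked by brute force. The following sketch (an illustration only; the helper name `pr` and the deck layout are choices made for this example) builds a 52 card deck, counts the cards that are a king or a heart directly, and compares that with Pr(King) + Pr(Heart) - Pr(King and Heart).

```python
from fractions import Fraction
from itertools import product

ranks = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
suits = ["hearts", "diamonds", "clubs", "spades"]
deck = list(product(ranks, suits))  # 52 (rank, suit) pairs


def pr(event):
    """Probability of an event when every card is equally likely."""
    favourable = sum(1 for card in deck if event(card))
    return Fraction(favourable, len(deck))


def is_king(card):
    return card[0] == "K"


def is_heart(card):
    return card[1] == "hearts"


p_king = pr(is_king)                                   # 4/52
p_heart = pr(is_heart)                                 # 13/52
p_both = pr(lambda c: is_king(c) and is_heart(c))      # 1/52 (king of hearts)

p_union_counted = pr(lambda c: is_king(c) or is_heart(c))
p_union_formula = p_king + p_heart - p_both

print(p_union_counted == p_union_formula)  # the two approaches agree
```

Note that `Fraction` reduces 16/52 to lowest terms when printed, so the value shows as 4/13.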

14 The 'addition only' formula is just a convenient shortcut. <break question 2> Two six-sided dice are rolled. (Rolls are independent) Pr( First die rolls a 3) = <break question 3> Two six-sided dice are rolled. (Rolls are independent) Pr( Both dice roll 3s) =

15 <break question 4> Two six-sided dice are rolled. (Rolls are independent) Pr( At least one die rolls a 3) = (Hint: Mutually exclusive???)
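One way to check answers to the dice questions is to list all 36 equally likely outcomes and count. This is a sketch for checking work, not the method expected in class; the helper name `pr` is made up for this example.

```python
from fractions import Fraction
from itertools import product

# Every (first die, second die) pair; independence means all 36 are equally likely.
rolls = list(product(range(1, 7), repeat=2))


def pr(event):
    """Probability of an event as favourable outcomes over total outcomes."""
    return Fraction(sum(1 for roll in rolls if event(roll)), len(rolls))


p_first_is_3 = pr(lambda roll: roll[0] == 3)
p_second_is_3 = pr(lambda roll: roll[1] == 3)
p_both_3 = pr(lambda roll: roll == (3, 3))
p_at_least_one_3 = pr(lambda roll: 3 in roll)

# The two events are NOT mutually exclusive, so the full formula is needed.
p_formula = p_first_is_3 + p_second_is_3 - p_both_3

print(p_first_is_3, p_both_3, p_at_least_one_3)
```

Simply adding 1/6 + 1/6 would count the (3, 3) outcome twice, which is why the hint asks about mutual exclusivity.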

16 In case you were wondering, there are dice of other than six-sides. (for interest) Why is the one on the right problematic?

17 Sometimes the collection of events A or B is written A U B. The U stands for union. A union is a collection of something, so A U B is the collection of all possible outcomes that are in either event A or B (or both).

18 Now let's talk about babies.

19 <break question 4> Every baby is going to be born pre-term, normal, or late. What is the chance that a baby will be born at the normal time? Pr(Pre-Term) = 0.12 Pr(Late) = 0.08

20 How did we know that Pr(Normal) would be 0.80? Recall Pr(certainty) = 1, and we are certain that every baby has to fit into exactly one category: pre-term, normal, or late. So Pr(normal) = 1 - Pr(pre-term) - Pr(late) = 1 - 0.12 - 0.08 = 0.80

21 <break question 5> What is the chance that any given baby will be born underweight AND pre-term? Pr(Pre-Term) = 0.12, Pr(Late) = 0.08, Pr(Underweight | Pre-Term) = 0.60, Pr(Underweight | Normal) = 0.20, Pr(Underweight | Late) = 0.05

22 Document camera slide: Finding Pr(underweight AND pre-term) using a tree.
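The tree calculation can be sketched in a few lines. This example assumes the three underweight probabilities on the previous slide are conditional on birth timing, and uses Pr(Normal) = 0.80 from earlier; the dictionary names are made up for illustration. Each branch of the tree is one multiplication, and adding the three branch results gives the overall chance of being underweight.

```python
# First level of the tree: birth timing.
p_timing = {"pre-term": 0.12, "normal": 0.80, "late": 0.08}

# Second level: Pr(Underweight | timing), read from the slide as conditionals.
p_underweight_given = {"pre-term": 0.60, "normal": 0.20, "late": 0.05}

# Multiply along each branch: Pr(Underweight AND timing).
p_joint = {
    timing: p_timing[timing] * p_underweight_given[timing]
    for timing in p_timing
}

# Add the branches: overall Pr(Underweight).
p_underweight = sum(p_joint.values())

for timing, p in p_joint.items():
    print(f"Pr(Underweight AND {timing}) = {p:.3f}")
print(f"Pr(Underweight) = {p_underweight:.3f}")
```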

23 Correlation vs association. Association refers to any sort of trend between any two variables. Correlations are a specific type of association. Correlation refers to a trend (usually linear) between any two variables of interval data pertaining to the same set of observations. In each case 'trend' just means 'happens together'.

24 Examples of association: Health science is more popular amongst women, computer science is more popular amongst men. There is an association between field of study and gender. Lifetime incomes of post-secondary graduates are higher than those of high school graduates. There is a (positive) association between education level and lifetime income.

25 Examples of correlation*: The weight of bearded dragons increases with the head-to-tail length of bearded dragons. This is a positive correlation. Country by country, life expectancy at birth increases as the income-per-capita increases. This is a positive correlation. Heating costs decrease as outdoor temperature increases. This is a negative correlation. *Some examples have a non-linear component; we will revisit these later. The most common graph to show two sets of interval data together is the scatter plot.

26 Each dot represents a subject. In Length vs. Weight, each dot is a dragon.

27 The height of the dot represents the length of the dragon. How far it is to the right represents the weight of the dragon.

28 The dragon for this dot is 18cm long, and weighs 700g. There is an obvious upward trend in the graph. This shows a positive correlation.

29 The negative correlation between heating cost and outdoor temperature can be shown the same way.

30 The lack of correlation between two variables can also be shown in a scatterplot.

31

32 Basil is happy(?) to be a data point. Break question 6: (answers at slides end)

33 Break question 6: The fee charged for any sort of event service (catering, flowers, space rental) is greater when the provider knows that it's for a wedding. This is an example of a ______ between event type and service cost. a) Non-association. b) Association, but not a correlation. c) Positive correlation. d) Negative correlation.

34 Break question 7: All else being equal, a vehicle with less mass (weight, usually) consumes less gasoline per kilometre. This is an example of a ______ between vehicle mass (kg) and gasoline consumption (L/km). a) Non-association. b) Association, but not a correlation. c) Positive correlation. d) Negative correlation.

35 Break question 8: In other words, a vehicle with less mass can travel further on a given amount of gasoline. This is an example of a ______ between vehicle mass (kg) and gasoline EFFICIENCY (km/L). a) Non-association. b) Association, but not a correlation. c) Positive correlation. d) Negative correlation. Why?

36 Question 6: Event type is a categorical variable. For a correlation, BOTH variables must be numerical. Question 7: As the mass INCREASES, the need for fuel INCREASES. Since these increase together, there is a positive correlation. Question 8: As mass INCREASES, fuel efficiency DECREASES, so this is a negative correlation. Consider the units you're using when finding a correlation.

37 Pearson coefficient. Pearson's correlation coefficient measures the strength and direction of a linear trend between two numerical variables (usually continuous, but not always). It is the most popular correlation measure and is considered the default option. If someone refers to 'the correlation', it's almost always the Pearson correlation coefficient, much like how the mean is the default meaning of 'average'. The Pearson correlation is always a value between -1 and 1 that tells how strong a correlation is and in what direction.

38 The stronger a correlation, the farther the coefficient is from zero (and the closer it is to 1 or -1)

39 Positive correlations have positive coefficients r. Negative correlations have negative coefficients r. The stronger the negative correlation, the closer it is to -1. A perfect correlation, one in which all the values fit perfectly on a line, has a correlation 1 (for positive) or -1 (for negative).
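To make the coefficient concrete, here is a minimal sketch that computes r from its usual definition: the sum of products of deviations from the means, divided by the square root of the product of the two sums of squared deviations. The function name `pearson_r` and both data sets are invented for illustration (they are not course data); the first set lies exactly on a downward line, the second has some scatter.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Made-up outdoor temperatures (degrees C) and heating costs ($).
temps = [-10, -5, 0, 5, 10]
costs_on_a_line = [200 - 8 * t for t in temps]   # perfectly linear, decreasing
costs_scattered = [285, 230, 210, 150, 125]      # same trend, with scatter

r_perfect = pearson_r(temps, costs_on_a_line)
r_scattered = pearson_r(temps, costs_scattered)

print(f"perfect negative: r = {r_perfect:.3f}")
print(f"with scatter:     r = {r_scattered:.3f}")
```

The scattered data still gives a strong negative r, but it is slightly closer to zero than the perfectly linear case.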

40 If there is no correlation at all, r will have a value of zero.

41 However, since r is from a sample, it will vary like everything else from a sample. Instead of zero, it usually has some value close to zero on either side.

42 But sometimes it doesn't come together right.

43 Scatterplots show the interaction between two variables, and Pearson's correlation coefficient shows the strength and direction of the linear trend in that interaction. Pearson's correlation does NOT, however, indicate the slope of that linear relationship, only whether it is negative or positive.

44 It is also not an appropriate measure to describe non-linear relationships between variables.
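Both points can be seen with small invented data sets. In this sketch (the numbers and names are made up for illustration), the second y-list is the first multiplied by ten, so its slope is ten times steeper, yet r is the same. The third y-list follows a perfect U-shaped curve, and r comes out as zero even though y is completely determined by x.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length lists of numbers."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


xs = [1, 2, 3, 4, 5]
ys_shallow = [1.5, 1.5, 3.0, 4.5, 4.5]        # roughly y = x, with scatter
ys_steep = [10 * y for y in ys_shallow]       # same pattern, ten times the slope
ys_curve = [(x - 3) ** 2 for x in xs]         # perfect U shape: 4, 1, 0, 1, 4

r_shallow = pearson_r(xs, ys_shallow)
r_steep = pearson_r(xs, ys_steep)
r_curve = pearson_r(xs, ys_curve)

print(f"shallow slope: r = {r_shallow:.3f}")
print(f"steep slope:   r = {r_steep:.3f}")
print(f"U-shaped:      r = {r_curve:.3f}")
```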

45 In real world contexts, the most common form of non-linear relationship is a curvilinear one. (See: Gapminder World)

46 Life expectancy increases with the logarithm of income, not linearly with income. (See: Gapminder World)
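A small numerical sketch of this idea follows. The incomes and life expectancies below are invented so that life expectancy rises by a fixed amount every time income doubles; they are not Gapminder figures. On such data, r computed against raw income understates the relationship, while r computed against the logarithm of income is 1.

```python
from math import log2, sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length lists of numbers."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Invented data: income doubles at each step, life expectancy gains 5 years.
incomes = [500 * 2 ** k for k in range(7)]                  # 500 ... 32000
life_expectancy = [40 + 5 * log2(income / 500) for income in incomes]

r_raw = pearson_r(incomes, life_expectancy)
r_log = pearson_r([log2(income) for income in incomes], life_expectancy)

print(f"r using income:      {r_raw:.3f}")
print(f"r using log(income): {r_log:.3f}")
```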

47 In this case, the issue is one of diminishing returns. In other cases, a curvilinear relationship is the result of multiple competing factors.

48 (For interest: The optimum difficulty is called the Zone of Proximal Development, or ZPD)

49 Mathematically, non-linear means messy.

50 SAMPLING One of the primary goals of statistics is to make statistical inferences about a population. A population is a blanket term for anything that is too large or difficult to measure directly. Population Examples: - All the people living in Burnaby. - All the water in Deer Lake. - All the inpatients at VGH.

51 However, in a statistical sense, a population must be a well-defined set of the same type of thing. Population Non-Examples: The city of Burnaby (what in the city? People? Cars?) All the water around (around where? One lake? In the sky?) Statistical inferences are made about parameters, which are numeric quantities of interest relating to the population.

52 Parameter examples: - The average concentration of pollutant in Deer Lake (measured as ppm or mg/L), - The average number of days that VGH inpatients have been admitted to the hospital, - The proportion of VGH inpatients that have signs of pneumonia. Parameters also need to be well-defined variables.

53 Parameter non-examples: - How sick people at VGH are right now ('sick' is ill-defined). - Whether or not the patient in VGH Room 1234 has cancer (does not pertain to a population). Parameters are very difficult or impossible to measure directly because they involve the whole population. However, we can take a sample of a population and directly measure a statistic from that.

54 To repeat: Statistics describe Samples. Parameters describe Populations.

55 There are many ways to take a sample, but in this class we will assume that every sample is a simple random sample (SRS) unless it is stated otherwise. In a simple random sample, each member of the population has an equal chance of being selected, and every possible sample has an equal chance of being selected.

56 If the parameter of interest is the proportion with pneumonia signs and the population is inpatients at VGH, we can take a simple random sample (SRS) of 10 of the inpatients. If we find that 6/10 of the patients have signs of pneumonia, then we could make an inference that 0.60 of all the inpatients have signs of pneumonia.
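The following simulation sketches what this inference looks like in practice. Everything about the population here is hypothetical: it assumes 200 inpatients of whom exactly 120 (0.60) have signs of pneumonia, which in real life we would not know. `random.sample` draws without replacement so that every set of 10 patients is equally likely, which is what an SRS requires.

```python
import random

rng = random.Random(56)  # fixed seed so the sketch is repeatable

# Hypothetical population: 1 = signs of pneumonia, 0 = no signs.
population = [1] * 120 + [0] * 80
true_proportion = sum(population) / len(population)  # the parameter (0.60)


def sample_proportion(size=10):
    """Take one SRS of the given size and return the sample proportion."""
    sample = rng.sample(population, size)
    return sum(sample) / size


# One sample gives one statistic; repeating shows how much the statistic varies.
estimates = [sample_proportion() for _ in range(1000)]
mean_estimate = sum(estimates) / len(estimates)

print(f"parameter: {true_proportion:.2f}")
print(f"first five sample proportions: {estimates[:5]}")
print(f"average of 1000 sample proportions: {mean_estimate:.3f}")
```

Individual samples of 10 can land well away from 0.60, but the estimates centre on the true proportion.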

57 In all of these sample examples, we're making one really big assumption: The sample is representative of the population. This lets us take the sample and generalize it to the whole population. e.g. A car we tested consumed 7.8L/100km, so we assume that most cars of the same model and year will have similar mileage. *cough VW cough*

58 Not all sampling is simple. It can be as fascinating as it is ugly.

59 Simple random sampling (SRS) is just one of many sampling options. Other options involve mathematical complications, but have their own practical advantages. These methods include... Non-random/convenience samples: Sampling the most convenient members of the population. This is the easiest, but least statistically valid method.

60 Stratified samples / two-level sampling: Splitting up the population into groups/strata first, sampling random groups, and using SRS within each group. Sometimes this is easier than SRS and gives similar results, especially when sampling over a large geographic area.

61 Quota sampling: Choosing beforehand the number of people from each group you want, and using SRS until that number from each group is met. Usually, for cost reasons, observations beyond each group's quota are rejected. Systematic samples: Taking observations from a population over time at fixed intervals (e.g. at 11am each day), used to account for non-independence between observations that are near in time.

62 Snowball / Network / Recruitment / Respondent driven samples: Selecting a small 'seed' group of a population and having members of that seed group recruit new people into the sample. Useful when a group is difficult to find, but well connected, such as the homeless or endangered species.

63 Transect samples Travelling along a selected geographical path (i.e. a transect) and taking every observed population member along the path as the sample. Useful in field-based sciences such as forestry and environmental restoration.

64 Example: Opinion Polls. Opinion polls are done by choosing phone numbers at random and calling them. This is: Simple Random Sample (SRS), because there's no structure or plan, and every number is equally likely to be dialed.

65 Example: Class opinion. I try to get an opinion from the class by asking the front row. This is: Non-Random!! Why is non-random bad in this case? People in the front of the class tend to be more engaged in the material and less likely to slumber. Engaged people are over-represented.

66 Polls on webpages and social media are self-selected. This means people are choosing for themselves to respond, rather than being randomly chosen.

67 A simple random sample (SRS) is one where the chances of being in a sample are independent.

68 To stratify something means to divide it into groups (geologically, into layers). To do stratified sampling, first split the population into different groups or strata. Often this is done naturally. Possible strata: sections of a course, gender, income level, grads/undergrads, or any sort of category like that. Then, randomly select some of the strata.

69 The strata are selected using SRS. Within each selected stratum, select members of the population using SRS.

70 Example: Quality testing of milk. A government agency wants to check if the milk from a company is up to code. There are several trucks leaving the plant today; each truck is a stratum (the singular of strata). The agency selects some of the trucks with SRS.

71 Each truck is carrying many jugs of milk, some jugs from each truck are selected by SRS.

72 Say they tested 50 jugs of milk from a total of 5 trucks (10 jugs each). That's a lot easier than stopping 50 trucks and testing 1 jug each. This is the appeal of stratified sampling.
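The milk example can be sketched as two rounds of simple random sampling. The fleet size (20 trucks) and load size (100 jugs per truck) below are invented for illustration; only the 5 trucks and 10 jugs per truck come from the slide. Other texts often call this kind of two-level design cluster or two-stage sampling.

```python
import random

rng = random.Random(70)  # fixed seed so the sketch is repeatable

# Hypothetical fleet: 20 trucks, each carrying 100 labelled jugs.
trucks = {
    f"truck_{t:02d}": [f"truck_{t:02d}_jug_{j:03d}" for j in range(1, 101)]
    for t in range(1, 21)
}

# Level 1: SRS of 5 trucks out of the fleet.
chosen_trucks = rng.sample(sorted(trucks), k=5)

# Level 2: SRS of 10 jugs within each chosen truck.
jugs_to_test = {truck: rng.sample(trucks[truck], k=10) for truck in chosen_trucks}

total_jugs = sum(len(jugs) for jugs in jugs_to_test.values())
print(f"trucks stopped: {len(chosen_trucks)}, jugs tested: {total_jugs}")
```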

73 Example: Employment survey. A large company wants information about its workforce of 1000 full-time employees and 500 part-time employees. The company includes both strata and uses SRS to select 80 from the full-time stratum and 40 from the part-time stratum. 8% of each stratum is sampled this way.
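The employment survey can be sketched the same way, except that here every stratum is included and the sample size in each is proportional to the stratum's size. The employee labels are placeholders invented for this example; the stratum sizes and sample sizes are the ones on the slide.

```python
import random

rng = random.Random(73)  # fixed seed so the sketch is repeatable

# Placeholder employee IDs for each stratum (sizes from the slide).
strata = {
    "full-time": [f"FT{i:04d}" for i in range(1, 1001)],
    "part-time": [f"PT{i:04d}" for i in range(1, 501)],
}
sample_sizes = {"full-time": 80, "part-time": 40}

# SRS within every stratum.
samples = {
    name: rng.sample(members, k=sample_sizes[name])
    for name, members in strata.items()
}

for name, chosen in samples.items():
    share = len(chosen) / len(strata[name])
    print(f"{name}: sampled {len(chosen)} of {len(strata[name])} ({share:.0%})")
```

Because both strata are sampled at the same rate, full-time employees make up two thirds of the sample, just as they make up two thirds of the workforce.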


More information

Welcome to this IBM podcast. What is product. line engineering? I'm Angelique Matheny with IBM. It's not

Welcome to this IBM podcast. What is product. line engineering? I'm Angelique Matheny with IBM. It's not IBM Podcast [ MUSIC ] MATHENY: Welcome to this IBM podcast. What is product line engineering? I'm Angelique Matheny with IBM. It's not easy to build a smarter product. Now try to build more than one at

More information

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney Name: Intro to Statistics for the Social Sciences Lab Session: Spring, 2015, Dr. Suzanne Delaney CID Number: _ Homework #22 You have been hired as a statistical consultant by Donald who is a used car dealer

More information

e-learning Student Guide

e-learning Student Guide e-learning Student Guide Basic Statistics Student Guide Copyright TQG - 2004 Page 1 of 16 The material in this guide was written as a supplement for use with the Basic Statistics e-learning curriculum

More information

IT Audit Process. Michael Romeu-Lugo MBA, CISA November 4, IT Audit Process. Prof. Mike Romeu

IT Audit Process. Michael Romeu-Lugo MBA, CISA November 4, IT Audit Process. Prof. Mike Romeu Michael Romeu-Lugo MBA, CISA November 4, 2015 1 Audit Sampling Audit Sampling is the application of an audit procedure to less than 100% of the target population for the purpose of drawing a general conclusion

More information

See below for an example on gym use in a large apartment complex: Men Women Total Men Women Total Used gym Used gym

See below for an example on gym use in a large apartment complex: Men Women Total Men Women Total Used gym Used gym Business Stats / Math 1111 Learning Centre Contingency Tables When dealing with probability, it is helpful to think about the sample space when approaching a problem. The sample space is a complete list

More information

Unit 6 Multiple Representations: Using Linear Relations and their Multiple Representations

Unit 6 Multiple Representations: Using Linear Relations and their Multiple Representations Unit 6 Multiple Representations: Using Linear Relations and their Multiple Representations Lesson Outline BIG PICTURE Grade 9 Applied Students will: determine solutions to linear equations by a variety

More information

Statistics Year 1 (AS) Unit Test 1: Statistical Sampling

Statistics Year 1 (AS) Unit Test 1: Statistical Sampling Statistics Year 1 (AS) Unit Test 1: Statistical Sampling 1 a Explain what is meant by a census. b Write down two disadvantages of using a census rather than a sample. Each circuit board produced at GC

More information

Bivariate Data Notes

Bivariate Data Notes Bivariate Data Notes Like all investigations, a Bivariate Data investigation should follow the statistical enquiry cycle or PPDAC. Each part of the PPDAC cycle plays an important part in the investigation

More information

Survey Sampling. Situation: Population of N individuals (or items) e.g. students at this university light bulbs produced by a company on one day

Survey Sampling. Situation: Population of N individuals (or items) e.g. students at this university light bulbs produced by a company on one day Survey Sampling Situation: Population of N individuals (or items) e.g. students at this university light bulbs produced by a company on one day Seek information about population e.g. amount of money students

More information

Chapter 1 Introduction: The Role of Statistics in Engineering

Chapter 1 Introduction: The Role of Statistics in Engineering 1 Chapter 1 Introduction: The Role of Statistics in Engineering Example: The manufacturer of a medical laser used in ophthalmic surgery wants to be able to quote quality characteristics of the laser to

More information

Chapter 1 Data and Descriptive Statistics

Chapter 1 Data and Descriptive Statistics 1.1 Introduction Chapter 1 Data and Descriptive Statistics Statistics is the art and science of collecting, summarizing, analyzing and interpreting data. The field of statistics can be broadly divided

More information

Day 1: Confidence Intervals, Center and Spread (CLT, Variability of Sample Mean) Day 2: Regression, Regression Inference, Classification

Day 1: Confidence Intervals, Center and Spread (CLT, Variability of Sample Mean) Day 2: Regression, Regression Inference, Classification Data 8, Final Review Review schedule: - Day 1: Confidence Intervals, Center and Spread (CLT, Variability of Sample Mean) Day 2: Regression, Regression Inference, Classification Your friendly reviewers

More information

CHAPTER 1 INTRODUCTION TO STATISTICS

CHAPTER 1 INTRODUCTION TO STATISTICS DUM 2413 STATISTICS & PROBABILITY CHAPTER 1 INTRODUCTION TO STATISTICS PREPARED BY: DR. CHUAN ZUN LIANG; DR. NORATIKAH ABU; DR. SITI ZANARIAH SATARI FACULTY OF INDUSTRIAL SCIENCES & TECHNOLOGY chuanzl@ump.edu.my;

More information

Urban Transportation Planning Prof Dr. V. Thamizh Arasan Department of Civil Engineering Indian Institute Of Technology, Madras

Urban Transportation Planning Prof Dr. V. Thamizh Arasan Department of Civil Engineering Indian Institute Of Technology, Madras Urban Transportation Planning Prof Dr. V. Thamizh Arasan Department of Civil Engineering Indian Institute Of Technology, Madras Lecture No. # 14 Modal Split Analysis Contd. This is lecture 14 on urban

More information

B. Statistical Considerations

B. Statistical Considerations Because the quadrats are being placed with their long side parallel to the x-axis, the coordinates are ordered first by the x-axis and next by the y-axis. Thus the new order is as follows: x-axis y-axis

More information

FAQ: Collecting and Analyzing Data

FAQ: Collecting and Analyzing Data Question 1: How do you choose a tool for collecting data? Answer 1: A variety of tools exist for collecting data, including direct observation or interviews, surveys, questionnaires, and experiments. Choice

More information

1. What is a key difference between an Affinity Diagram and other tools?

1. What is a key difference between an Affinity Diagram and other tools? 1) AFFINITY DIAGRAM 1. What is a key difference between an Affinity Diagram and other tools? Affinity Diagram builds the hierarchy 'bottom-up', starting from the basic elements and working up, as opposed

More information

AP Stats ~ Lesson 8A: Confidence Intervals OBJECTIVES:

AP Stats ~ Lesson 8A: Confidence Intervals OBJECTIVES: AP Stats ~ Lesson 8A: Confidence Intervals OBJECTIVES: DETERMINE the point estimate and margin of error from a confidence interval. INTERPRET a confidence interval in context. INTERPRET a confidence level

More information

CHAPTER 21A. What is a Confidence Interval?

CHAPTER 21A. What is a Confidence Interval? CHAPTER 21A What is a Confidence Interval? RECALL Parameter fixed, unknown number that describes the population Statistic known value calculated from a sample a statistic is used to estimate a parameter

More information

Chapter 19. Confidence Intervals for Proportions. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Chapter 19. Confidence Intervals for Proportions. Copyright 2012, 2008, 2005 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions Copyright 2012, 2008, 2005 Pearson Education, Inc. Standard Error Both of the sampling distributions we ve looked at are Normal. For proportions For means

More information

Midterm Review Summer 2009 Chapters 1 7 Stat 111

Midterm Review Summer 2009 Chapters 1 7 Stat 111 Midterm Review Summer 2009 Chapters 1 7 Stat 111 Name FORM A Directions: Read each question carefully and answer as clearly as possible. 1. A book store wants to estimate the proportion of its customers

More information

Chapter Eleven. Sampling Foundations

Chapter Eleven. Sampling Foundations Chapter Eleven Sampling Foundations Chapter Objectives Define and distinguish between sampling and census studies Discuss when to use a probability versus a nonprobability sampling method and implement

More information

ASIC speaks on Improving and Maintaining Audit Quality & The Role of Others

ASIC speaks on Improving and Maintaining Audit Quality & The Role of Others CPA Australia Podcast - Episode 16 - Transcript ASIC speaks on Improving and Maintaining Audit Quality & The Role of Others INTRO: Hello and welcome to the CPA Australia Podcast your source for business,

More information

Unit 6 Day 5 Notes Population and Sampling

Unit 6 Day 5 Notes Population and Sampling AFM Unit 6 Day 5 Notes Population and Sampling Name Date Population and Sampling The entire set of individuals or objects in which we are interested in is the population. Subset of population is a sample

More information

Iowa State University Economics 101 Microeconomics Principles Prof. Kilkenny Spring First Exam February 25, 2005

Iowa State University Economics 101 Microeconomics Principles Prof. Kilkenny Spring First Exam February 25, 2005 Iowa State University Economics 101 Microeconomics Principles Prof. Kilkenny Spring 2005 First Exam February 25, 2005 General Instructions: 1. Write your name here: 2. USE a #2 PENCIL: Write your name,

More information

Unit 6 Day 5 Notes Population and Sampling

Unit 6 Day 5 Notes Population and Sampling AFM Unit 6 Day 5 Notes Population and Sampling Name Date Population and Sampling The entire set of individuals or objects in which we are interested in is the population. Subset of population is a sample

More information

ECONOMICS 103. Topic 3: Supply, Demand & Equilibrium

ECONOMICS 103. Topic 3: Supply, Demand & Equilibrium ECONOMICS 103 Topic 3: Supply, Demand & Equilibrium Assumptions of the competitive market model: all agents are price takers, homogeneous products. Demand & supply: determinants of demand & supply, demand

More information

LECTURE 17: MULTIVARIABLE REGRESSIONS I

LECTURE 17: MULTIVARIABLE REGRESSIONS I David Youngberg BSAD 210 Montgomery College LECTURE 17: MULTIVARIABLE REGRESSIONS I I. What Determines a House s Price? a. Open Data Set 6 to help us answer this question. You ll see pricing data for homes

More information

DO NOT POST THESE ANSWERS ONLINE BFW Publishers Chapter 4

DO NOT POST THESE ANSWERS ONLINE BFW Publishers Chapter 4 Chapter 4 Section 4.1 Check Your Understanding, page 213: 1. The company inspector is using a convenience sample. This could lead him to overestimate the quality of the oranges if the farmer puts the best

More information

Classroom Probability Simulations Using R: Margin of Error in a Public Opinion Poll

Classroom Probability Simulations Using R: Margin of Error in a Public Opinion Poll Classroom Probability Simulations Using R: Margin of Error in a Public Opinion Poll Outstanding Professor Address, Part 2 Fall Faculty Convocation (9/21/04) Bruce E. Trumbo Department of Statistics, CSU

More information

MATH20 S PRE-CALCULUS UNIT H: PROBABIITY AND STATISTICS CLASS NOTES

MATH20 S PRE-CALCULUS UNIT H: PROBABIITY AND STATISTICS CLASS NOTES 1 MATH20 S PRE-CALCULUS UNIT H: PROBABIITY AND STATISTICS CLASS NOTES 1. A considerable portion of this material is a review of Grade 9 and prior grades. Regardless; Probability and Statistics plays such

More information

Week 4 Lecture 10 We have been examining the question of equal pay for equal work for several weeks now; but have been somewhat frustrated with the

Week 4 Lecture 10 We have been examining the question of equal pay for equal work for several weeks now; but have been somewhat frustrated with the Week 4 Lecture 10 We have been examining the question of equal pay for equal work for several weeks now; but have been somewhat frustrated with the equal work part. We suspect that salary varies with grade

More information

The following content is provided under a Creative Commons license. Your support will help

The following content is provided under a Creative Commons license. Your support will help MITOCW Lecture 11 The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for free. To make a

More information

Introduction to Control Charts

Introduction to Control Charts Introduction to Control Charts Highlights Control charts can help you prevent defects before they happen. The control chart tells you how the process is behaving over time. It's the process talking to

More information

Statistics 201 Summary of Tools and Techniques

Statistics 201 Summary of Tools and Techniques Statistics 201 Summary of Tools and Techniques This document summarizes the many tools and techniques that you will be exposed to in STAT 201. The details of how to do these procedures is intentionally

More information

Test Date: A. Get none of the 5 questions correct. B. Get all of the questions wrong. C. Get at least one question wrong

Test Date: A. Get none of the 5 questions correct. B. Get all of the questions wrong. C. Get at least one question wrong Review! Probability Chapter 14-15 Complete each problem in the seconds column. Record answers to the side questions in the 2nd column completely. Name: Test Date: Class: AP Statistics Period : Chapter

More information

Math 1 Variable Manipulation Part 8 Working with Data

Math 1 Variable Manipulation Part 8 Working with Data Name: Math 1 Variable Manipulation Part 8 Working with Data Date: 1 INTERPRETING DATA USING NUMBER LINE PLOTS Data can be represented in various visual forms including dot plots, histograms, and box plots.

More information

Math 1 Variable Manipulation Part 8 Working with Data

Math 1 Variable Manipulation Part 8 Working with Data Math 1 Variable Manipulation Part 8 Working with Data 1 INTERPRETING DATA USING NUMBER LINE PLOTS Data can be represented in various visual forms including dot plots, histograms, and box plots. Suppose

More information

Instagram Traffic for E-Commerce

Instagram Traffic for E-Commerce Instagram Traffic for E-Commerce Instagram Basics Instagram is a mobile social network created that is geared strictly to images and videos. Think of it like Twitter but for images and videos. The social

More information

INSTRUCTIONS. In this experiment, we will create a market in which you will act as bidders in a sequence of auctions.

INSTRUCTIONS. In this experiment, we will create a market in which you will act as bidders in a sequence of auctions. INSTRUCTIONS This is an experiment in the economics of market decision making. Various research organizations have provided funds for conducting this research. The instructions are simple, and if you follow

More information

Chapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition

Chapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition Chapter 4: Foundations for inference OpenIntro Statistics, 2nd Edition Variability in estimates 1 Variability in estimates Application exercise Sampling distributions - via CLT 2 Confidence intervals 3

More information

Project and Production Management Prof. Arun Kanda Department of Mechanical Engineering Indian Institute of Technology, Delhi

Project and Production Management Prof. Arun Kanda Department of Mechanical Engineering Indian Institute of Technology, Delhi Project and Production Management Prof. Arun Kanda Department of Mechanical Engineering Indian Institute of Technology, Delhi Lecture - 31 Layout Planning In the last lecture we were talking about some

More information

BUS105 Statistics. Tutor Marked Assignment. Total Marks: 45; Weightage: 15%

BUS105 Statistics. Tutor Marked Assignment. Total Marks: 45; Weightage: 15% BUS105 Statistics Tutor Marked Assignment Total Marks: 45; Weightage: 15% Objectives a) Reinforcing your learning, at home and in class b) Identifying the topics that you have problems with so that your

More information

Applying Regression Techniques For Predictive Analytics Paviya George Chemparathy

Applying Regression Techniques For Predictive Analytics Paviya George Chemparathy Applying Regression Techniques For Predictive Analytics Paviya George Chemparathy AGENDA 1. Introduction 2. Use Cases 3. Popular Algorithms 4. Typical Approach 5. Case Study 2016 SAPIENT GLOBAL MARKETS

More information

RESEARCH APTITUDE SAMPLING TECHNIQUES PART-1 FOR: UGC- NET/JRF PAPER-1

RESEARCH APTITUDE SAMPLING TECHNIQUES PART-1 FOR: UGC- NET/JRF PAPER-1 RESEARCH APTITUDE SAMPLING TECHNIQUES PART-1 FOR: UGC- NET/JRF PAPER-1 Meaning of Sample? WHAT IS IT? qa sample is a group of people objects or items taken from a larger population as a representative

More information

Why and How Should the Government Subsidize Education? February 13, 2008

Why and How Should the Government Subsidize Education? February 13, 2008 Why and How Should the Government Subsidize Education? February 13, 2008 Human Capital Model To keep things simple and flexible, lets use the following version of the human capital model 2 periods everyone

More information

INDUSTRIAL ENGINEERING

INDUSTRIAL ENGINEERING 1 P a g e AND OPERATION RESEARCH 1 BREAK EVEN ANALYSIS Introduction 5 Costs involved in production 5 Assumptions 5 Break- Even Point 6 Plotting Break even chart 7 Margin of safety 9 Effect of parameters

More information

Planning Your Success With Jimmy Petruzzi

Planning Your Success With Jimmy Petruzzi Planning Your Success With Jimmy Petruzzi www.nlp-trainingcourses.com Good time management, as we have already hinted at, is largely about good planning and knowing how to set effective priorities. Understanding

More information

A new framework for digital publishing decisions 95. Alastair Dryburgh. Alastair Dryburgh 2003

A new framework for digital publishing decisions 95. Alastair Dryburgh. Alastair Dryburgh 2003 A new framework for digital publishing decisions 95 Learned Publishing (2003)16, 95 101 Introduction In my previous article 1 I looked in detail at how developments in publishing were creating problems

More information

How Consumers React to Gas Prices. Insights from NACS Consumer Fuels Surveys on consumer perceptions related to gas prices and the economy overall

How Consumers React to Gas Prices. Insights from NACS Consumer Fuels Surveys on consumer perceptions related to gas prices and the economy overall How Consumers React to Gas Prices Insights from NACS Consumer Fuels Surveys on consumer perceptions related to gas prices and the economy overall May 2018 How Consumers React to Gas Prices Nearly 40 million

More information

Why the Units We Evaluate Should be Randomly Selected

Why the Units We Evaluate Should be Randomly Selected FRPA Resource Evaluation Program Scientifically Valid Evaluations of Forest Practices under the Forest and Range Practices Act Why the Units We Evaluate Should be Randomly Selected Prepared by Wendy Bergerud,

More information

Lesson-8. Equilibrium of Supply and Demand

Lesson-8. Equilibrium of Supply and Demand Introduction to Equilibrium Lesson-8 Equilibrium of Supply and Demand In economic theory, the interaction of supply and demand is understood as equilibrium. We may think of demand as a force tending to

More information