How to do Statistics in Excel

Size: px
Start display at page:

Download "How to do Statistics in Excel"

Transcription

1 How to do Statistics in Excel Introduction The best answer is Don t. Get someone else to do it. But since there is no American who has a life and also likes statistics, this may not be easy. And it likely won t be cheap. On the other hand, you could do it yourself. Admittedly, that is not a fun choice. This tutorial, though, may reduce the pain and even enable you to understand what question you are asking that needs statistics. The most common flaw in reasoning that people have when they turn to statistics is that they don t quite know how numbers are going to help them answer their question. And that causes fear and avoidance. Overview. There is help. Statistics is a branch of Probability theory. Probability is actually the underlying base of knowledge in all the sciences, including social sciences. We seldom can say, This causes that, with certainty. We can, though, say When this happens, the odds are that this other thing is going to happen. And the odds may be pretty good. How good is the question statistics will help answer. (But remember E. G. Bulwer-Lytton s comment, Fate laughs at probabilities, or in other words, the improbable MUST occur at some time.) (S.A. 5/14 80) To do statistics, you do not need to understand them, any more than you have to be able to fix a car in order to drive. You do, though, have to know or at least suspect when they can mislead you. As British Prime Minister Benjamin Disraeli allegedly said, There are lies, damned lies and statistics. (1) What do you study in statistics? A lot of numbers with something in common. Usually they come from a larger set of numbers called a population. The numbers are associated with each member of the population in some way that matters. For instance, students GPAs. You choose a group containing less than all the population to study, usually because the population is too large or inaccessible to study everything in it. The smaller group is called a sample. If the properties of the total population matter, then the properties of a sample can sometimes give you a good approximation of the properties of the whole population. So, if you want to know something about a lot of things, take some of the population and measure the thing you care about. Most likely what you learn from the sample will tell you something fairly reliable about the whole population. At this point, if somebody says anything about the binomial distribution or the Normal distribution, smile at them and thank them and go away from them. Unless they volunteer to do this job for you. Note: The spreadsheets listed in the tutorial will not work on web servers. If you want them, me at rfarnham@rxf3.com and I will send them. 1

2 What Excel has Too much. And it all looks frightening. Don t try to learn it, you are not and don t want to be a spreadsheet jockey. But the nice thing is that people have created spreadsheets that solve tasks you might need solved, so your job is reduced to finding something someone has built for you. Now come the examples. Some populations have most of their members near a central value, e.g. humans have height and there is an Average height. If you are a sauna manufacturer, the saunas must have higher ceilings in Scandinavia than in Japan. But knowing this central value or Average for height does not help you pick the height of your saunas made for Scandinavia, because bigger costs more and you don t want the cost to turn away buyers just because you made them big enough for the tallest Scandinavians. So, you need to know how tall the tallest are, and how many are that tall. Since you are interested in only the extreme values in the population, specifically the tallest, you don t need to bother with complicated statistical concerns such as Normal Distribution and Central Tendencies. Instead use: Non-Parametric Statistics. (The name comes because all the statistics you have learned stressed the importance of a Normal Frequency Distribution, the shape of which is specified by four Parameters, values that define some properties of the Normal Distribution. You don t need any of that for this statistical task.) 1. Start Excel and load the file named Sort. xslx The column of numbers represents 500 measurements of the height of male Scandinavians. Some are very tall, but not many. How many can you not accomodate in order to keep your costs down, while still keeping most people as potential customers? You need to know how many are how tall, so sort the heights in this column, highest to lowest. To do this, click on the green shaded choice at the top right labeled Sort 2

3 The box that appears offers you choices. You could choose what you want, but it is set now to sort the values in Columns 1. Click OK. If you want to sell saunas in Scandinavia that will accommodate 97 percent of the population, you are done. Even if the frequency distribution of heights is Normal and you could get the answer another way, the answer you got is still the right answer. If that s all you needed to know, go make saunas and don t spend any more time on statistics. (For a copy of any of the spreadsheets, me at rfarnham@rxf3.com and I will send it. They can be re-used with different numbers of your choice.) 3

4 Parametric statistics Central tendencies Now comes this Normal Distribution stuff. Among the properties of populations, there is often a tendency of values to cluster more in the middle than at the extremes. This central tendency is what we saw in the Scandinavian study, but we did not need it. Many times, though, this central tendency and the spread of values around it are valuable information. The measures you will use, then, are: Average, take all the heights of everybody and divide the sum by the number of people. Median, the height with the most people. Dispersion, = how many are close to the Average and how many are very different. Excel has the tools to find these numbers if you have the data about the population, or sample of the population, but remember you do not want to do it, you want someone to do it for you. I live to serve, so I have made a spreadsheet that does it for you. You still have to type in the numbers for the heights, but the calculations are automatic. 2. Load the file called FreqDistr.xlsx (usually also named a Bell Curve ) and if you want it personally, send me an at rfarnham@uncc.edu and I will send it to you. Why the name? Because the number of people with heights near the central value (Median or Average, depending) should be more than the number of people who are much taller or shorter. If we make a graph ( histogram ) where we plot the number of people at each height ( Frequency ) using the data from the first example, it should look like this: The frequency of each height is in the table Freq Distr. The histogram above graphs that data. 4

5 Now the value with the most people is visible (the Median), and an idea of the dispersion around the central values is visible. We need a measure for this dispersion, named the Standard Deviation and it is calculated for you below. (You probably heard from someone that the use of a Standard Deviation depends on whether the frequency distribution is Normal. I told you already not to go near that person. But now I will discuss it as a matter of personal privilege.) Figure 4 Skewness and Kurtosis can be used as tests for the likelihood that the distribution is Normal. The four parameters of the Normal Frequency Distribution are: Average. In a population with a Normal distribution, the Average and the Median ought to be about the same. Standard Deviation. If the population has a Normal distribution, the Standard Deviation has a useful property, discussed below. Skewness. This should be around zero in a population with a Normal distribution. Kurtosis. This should also be about zero in a population with a Normal distribution. If your distribution has the correct Skewness and Kurtosis, it probably is a Normal distribution. Do not worry about how these things are calculated. Just think of Excel as magic. The graph above does not show a very Normal distribution. Heights are in fact not Normally distributed. Further, Scandinavians consist of two populations, Teutonic and Lapp, who are shorter on average. Here is a truer picture of the distribution of heights in Sweden. 5

6 3. Load the file named FreqDistr2pops.xlsx Note: The Average and Median are close The Skewness is close to zero. The Kurtosis is a little low. But the distribution is NOT Normal. There are in fact two populations shown, Teutonics and the shorter Lapps, i.e. two populations distributions of heights have been mingled. Lapps Teutonics This shows why you have to always be suspicious of statistics. The problem is always in the assumptions, and one might have thought heights were a single population until one looks at the graph. Eyes are scientific instruments. But in sauna manufacturing, all you needed was the top three percent value and didn t care if more than one population was present. 6

7 Let s put this stuff to work. 4. Load the spreadsheet named QC.xlsx Example: If you have a machine that measures something, it will not always give you the exact same answer on the same item. There is always a little change in the machine each time that leads to a slightly different result. It is caused by things like a minor temperature or humidity change or a truck passing by and rattling the floor, and that is called Random Error. It is not Systematic Error, which it would be if you could find what caused the error and fix it. Random Error always has a Normal Distribution, and the Normal distribution has special properties. The most important property of a Normal distribution is that the number of values within: 1 Standard Deviation from the Average will be 68% of the total number of values. 2 Standard Deviations will be 95 percent of the total number of measurements. 3 Standard Deviations will be 99.7 percent. This is useful. Suppose you have a Control specimen of known value, say 100, that you are going to make the machine measure three every day, and after some days doing that you then calculate the Average and Standard Deviation of those measurements, you know pretty much how accurate the measurements can be with this machine. If you need more accuracy, you will have to get a different machine. Remember, this assumes a Normal Distribution of Random Error, which is true by definition. If the distribution of Error is not Normal, then there is Systematic Error, and you should fix the machine. Here is an example of Random Error: 7

8 8

9 We now have measures of the machine s accuracy and its inherent error. To use this, we follow the machine s analyses for the next 30 days, to see if it starts to show more error than before: +2 SD Average (Mean) - 2 SD (Green lines represent 2 Standard Deviations above and below the mean.) But the next 23 days: 9

10 Correlation: Sometimes you are measuring things with one instrument, but then you move to another place and are stuck with a different machine that is still reliable but does give different numerical measurements from the old one. For instance, the old instrument measures pressure in pounds per square inch and the new one measures it in kilopascals by a different method. How do you know if the second one is as accurate as the old one? You compare their performance by measuring the same item on each one. But there is still Random Error in each measurement from each machine, so you need to measure how close their measurements are the same items on average and then decide if that is close enough. This process is called Regression Analysis. The most common form is Linear Regression, in which the assumption is that Random Error on one machine is a multiple of the Random Error on the other machine. So if the second machine gives a measurement 2.4 times the measurement of the first machine, then we can easily use the second machine and divide its answers by 2.4 to find out what the first machine would have gotten. Here is a tool to do this: 5. Load the file named Statistics Lin Rgrs begin.xlsx, remembering that the graph of a linear relationship will be straight: This is the empty template. 10

11 This is one populated with measurement data from an old machine and from a newer one: First, this is Linear Regression, and the Regression Line certainly looks linear. Remember, that means the results on the new machine are reliably some multiple of the results on the old machine, and that multiplier is the Slope above, While the calculations are complex and hidden, the regression line is characterized by this Slope multiplier and the Intercept, the number the new machine would show if the old machine showed zero. The new machine could certainly be used, although the expected Reference Ranges that define normal would have to be multiplied by in order to be interpreted correctly. This is done for you at the upper right. 6. Load the file named Statistics Lin Rgrs Tilt.xlsx 11

12 Note that the higher value points seem to fall below the Regression Line. Uh Oh, something is not Linear. That means that at the upper levels, the results on the new machine are NOT multiples of those on the old machine, and our elegant calculations for new Reference Ranges are wrong. The clients also will have difficulty interpreting these non-linear results in terms of their prior experience with the old machine s results. They might not like that. The machine is non-returnable. Can we use it somehow? The graph looks like it starts to slant around 20 units on the old machine, 32 on the new one. Below that it looks linear. Check this, by erasing all the points that are above 20 on the old machine. (This has been done for you below, and the points deleted are encircled in red.) Good. We can report results that are less than 32 on the new machine with confidence, our clients can reliably interpret those results with the new reference ranges, and if we get results higher than 32 on the new machine, we can dilute the test specimen in half and then measure it. If the result is now in the linear range under 32, then we can double that diluted result and report that value safely. We will of course badmouth the machine among our peers, but it is not a total loss. 7. Load the file named Lin Rgrs Outlier begin.xlsx ) 12

13 Two points are clearly way outside the Regression Line. Is this data usable? Remember the question you want an answer to: Do the machines reliably correlate? The data shows that in at least two cases one or the other machine has significantly erred. Are you asking if the machines always give the same answer on the same item? Not at all, there are better tests than this to answer that question. So the best way to find the correlation is to remove these points that cloud the issue, since they are clearly errors. 8. Do it now. Hover the mouse arrow over the erroneous points (called Outliers ) and the data item will be shown. Look in the data list and delete those points. Result is below: Great straight line. The machines have a linear relationship, if we know one machine s result then we know the other machine s as well. Work done. Discarding data is always risky in statistical analysis and must be done judiciously (= correctly ) but there are times when it needs to be done or the true underlying relationships are obscured. Always ask yourself first if doing this might misleadingly affect the answer you seek. Or, better, find someone who knows. Conclusion: There are many other tasks needing statistics, and one of the most common not mentioned here is the task of telling whether one population is different from each other, e.g did the patients taking the medicine do any better than the ones with the same disease who didn t take the medicine? Someone likely has told you that this is a test of the Null Hypothesis. Avoid them. But find someone somewhere who has built a spreadsheet application like the ones here, and borrow it, using it like the car you drive but can t fix. Or you may have more than two variables to deal with, with each having an effect on the other. This needs a method called Analysis of Variance, or ANOVA. Don t look it up in a book, for you will not be able to function for a week. At some point you encounter tasks that you yourself simply cannot do. So look helpless, beg, or just tell your boss you can t do it. That s better than trying to do it and thinking you were successful when you weren t. Some people have been known to try prayer, some alcohol, some Mountain Dew or Red Bull. Results vary. Good luck. 13

14 Glossary: Average. Divide the sum of all data items by the number of items. Synonym: Mean Correlation: A mathematical relationship between populations, such that when you pick a measurement from the first, you have a reliable approximation of what the correlating measurement should be on an item from the second population. Dispersion: How many data items are close to the Average and how many are very different. Linear Regression: An attempt to find a linear relationship between two populations of data. Linear means that one item is the multiple of the other, together with an added constant number. Mean: The data item that has equal numbers of other measurements above it as below it. Non-Parametric Statistics: Analysis of data that does not have a Normal distribution. Normal: An alternate definition of a valued and respected normal English Word. The Normal Distribution is not particularly normal, but as a definition in Statistics, it has some useful features. Parametric statistics: Analysis of data that has a Normal distribution. The parameters are Average, Standard Deviation, Skewness and Kurtosis, and they must meet specific numerical criteria to establish that the population does in fact have a Normal Distribution. Random Error: Error whichis unpredictable and for which no explanation can be found. It mathematically will average out to zero, and is a property of a system that cannot be changed Reference Range: A property of a population such that measurements outside the Reference Range are defined as abnormal. Standard Deviation: If the population has a Normal distribution, the Standard Deviation has a useful property, discussed below. Systematic Error: Error that is due to a change in the system and which persists, implying a duty to find the source and fix it. Quicksand terms: (If you act like you know them, you will be given unwanted tasks.) Analysis of Variance: An attempt to find correlations for populations that may have an effect on measurements of the other populations. binomial distribution: Do not seek this knowledge. Kurtosis: Normal: A measure of the Peakedness of the frequency distribution curve of a population. An alternate definition of a valued and respected normal English Word. It is a common frequency distribution that is neither normal or abnormal. Get over it. It does have some useful properties, as discussed in the text Null Hypothesis: The statement that two samples of data represent the same population. Parameter: A calculable property of a data population. There are four, called moments, and are useful for inference about a population that iis Normally distributed. Skewness: A measure of the asymmetrical slant of the frequency distribution curve of a population. Both skewness and kurtosis should be also be about zero in a population with a Normal distribution. But this alone is not absolute proof the distribution is Normal. There are other tests for this. 14

15 Illustrations: Kurtosis is the degree of peakedness of a distribution. A normal distribution is a mesokurtic distribution. A pure leptokurtic distribution has a higher peak than the normal distribution and has heavier tails. A pure platykurtic distribution has a lower peak than a normal distribution and lighter tails. Skewness is the degree of departure from symmetry of a distribution. A positively skewed distribution has a tail which is pulled in the positive direction. A negatively skewed distribution has a tail which is pulled in the negative direction. 15

Descriptive Statistics Tutorial

Descriptive Statistics Tutorial Descriptive Statistics Tutorial Measures of central tendency Mean, Median, and Mode Statistics is an important aspect of most fields of science and toxicology is certainly no exception. The rationale behind

More information

Chapter 1 Data and Descriptive Statistics

Chapter 1 Data and Descriptive Statistics 1.1 Introduction Chapter 1 Data and Descriptive Statistics Statistics is the art and science of collecting, summarizing, analyzing and interpreting data. The field of statistics can be broadly divided

More information

Marginal Costing Q.8

Marginal Costing Q.8 Marginal Costing. 2008 Q.8 Break-Even Point. Before tackling a marginal costing question, it s first of all crucial that you understand what is meant by break-even point. What this means is that a firm

More information

Day 1: Confidence Intervals, Center and Spread (CLT, Variability of Sample Mean) Day 2: Regression, Regression Inference, Classification

Day 1: Confidence Intervals, Center and Spread (CLT, Variability of Sample Mean) Day 2: Regression, Regression Inference, Classification Data 8, Final Review Review schedule: - Day 1: Confidence Intervals, Center and Spread (CLT, Variability of Sample Mean) Day 2: Regression, Regression Inference, Classification Your friendly reviewers

More information

THE NORMAL CURVE AND SAMPLES:

THE NORMAL CURVE AND SAMPLES: -69- &KDSWHU THE NORMAL CURVE AND SAMPLES: SAMPLING DISTRIBUTIONS A picture of an ideal normal distribution is shown below. The horizontal axis is calibrated in z-scores in terms of standard deviation

More information

Forecasting Introduction Version 1.7

Forecasting Introduction Version 1.7 Forecasting Introduction Version 1.7 Dr. Ron Tibben-Lembke Sept. 3, 2006 This introduction will cover basic forecasting methods, how to set the parameters of those methods, and how to measure forecast

More information

Physics 141 Plotting on a Spreadsheet

Physics 141 Plotting on a Spreadsheet Physics 141 Plotting on a Spreadsheet Version: Fall 2018 Matthew J. Moelter (edited by Jonathan Fernsler and Jodi L. Christiansen) Department of Physics California Polytechnic State University San Luis

More information

Chapter 10 Regression Analysis

Chapter 10 Regression Analysis Chapter 10 Regression Analysis Goal: To become familiar with how to use Excel 2007/2010 for Correlation and Regression. Instructions: You will be using CORREL, FORECAST and Regression. CORREL and FORECAST

More information

Untangling Correlated Predictors with Principle Components

Untangling Correlated Predictors with Principle Components Untangling Correlated Predictors with Principle Components David R. Roberts, Marriott International, Potomac MD Introduction: Often when building a mathematical model, one can encounter predictor variables

More information

LECTURE 17: MULTIVARIABLE REGRESSIONS I

LECTURE 17: MULTIVARIABLE REGRESSIONS I David Youngberg BSAD 210 Montgomery College LECTURE 17: MULTIVARIABLE REGRESSIONS I I. What Determines a House s Price? a. Open Data Set 6 to help us answer this question. You ll see pricing data for homes

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Univariate Statistics Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved Table of Contents PAGE Creating a Data File...3 1. Creating

More information

Make sure to listen to this audio: as you go through this handout, to get maximum value.

Make sure to listen to this audio:  as you go through this handout, to get maximum value. Seven Steps to Fearless Marketing The Keys to Attracting more Clients with Less Struggle and Effort By Robert Middleton Action Plan Marketing 1 Make sure to listen to this audio: www.marketingball.com/tc/ftc.mp3

More information

Using Living Cookbook & the MyPoints Spreadsheet How the Honey Do List Guy is Losing Weight

Using Living Cookbook & the MyPoints Spreadsheet How the Honey Do List Guy is Losing Weight Using Living Cookbook & the MyPoints Spreadsheet How the Honey Do List Guy is Losing Weight Background We both knew we were too heavy, but we d accepted the current wisdom, Accept yourself as you are you

More information

Gush vs. Bore: A Look at the Statistics of Sampling

Gush vs. Bore: A Look at the Statistics of Sampling Gush vs. Bore: A Look at the Statistics of Sampling Open the Fathom file Random_Samples.ftm. Imagine that in a nation somewhere nearby, a presidential election will soon be held with two candidates named

More information

QUESTION 2 What conclusion is most correct about the Experimental Design shown here with the response in the far right column?

QUESTION 2 What conclusion is most correct about the Experimental Design shown here with the response in the far right column? QUESTION 1 When a Belt Poka-Yoke's a defect out of the process entirely then she should track the activity with a robust SPC system on the characteristic of interest in the defect as an early warning system.

More information

Chapter 2 Part 1B. Measures of Location. September 4, 2008

Chapter 2 Part 1B. Measures of Location. September 4, 2008 Chapter 2 Part 1B Measures of Location September 4, 2008 Class will meet in the Auditorium except for Tuesday, October 21 when we meet in 102a. Skill set you should have by the time we complete Chapter

More information

Using Excel s Analysis ToolPak Add-In

Using Excel s Analysis ToolPak Add-In Using Excel s Analysis ToolPak Add-In Bijay Lal Pradhan, PhD Introduction I have a strong opinions that we can perform different quantitative analysis, including statistical analysis, in Excel. It is powerful,

More information

Instructions. AIRBUS A3XX: Developing the World s Largest Commercial Aircraft

Instructions. AIRBUS A3XX: Developing the World s Largest Commercial Aircraft Instructions AIRBUS A3XX: Developing the World s Largest Commercial Aircraft In this case, you will be analyzing the strategic interaction between Airbus and Boeing, the two leading producers of large

More information

1. Contingency Table (Cross Tabulation Table)

1. Contingency Table (Cross Tabulation Table) II. Descriptive Statistics C. Bivariate Data In this section Contingency Table (Cross Tabulation Table) Box and Whisker Plot Line Graph Scatter Plot 1. Contingency Table (Cross Tabulation Table) Bivariate

More information

Chapter 8 Script. Welcome to Chapter 8, Are Your Curves Normal? Probability and Why It Counts.

Chapter 8 Script. Welcome to Chapter 8, Are Your Curves Normal? Probability and Why It Counts. Chapter 8 Script Slide 1 Are Your Curves Normal? Probability and Why It Counts Hi Jed Utsinger again. Welcome to Chapter 8, Are Your Curves Normal? Probability and Why It Counts. Now, I don t want any

More information

Bar graph or Histogram? (Both allow you to compare groups.)

Bar graph or Histogram? (Both allow you to compare groups.) Bar graph or Histogram? (Both allow you to compare groups.) We want to compare total revenues of five different companies. Key question: What is the revenue for each company? Bar graph We want to compare

More information

Episode 105: Opening the Faucet with Google AdWords

Episode 105: Opening the Faucet with Google AdWords Episode 105: Opening the Faucet with Google AdWords Guest: Niki Fielding This is Prove It. A podcast for impatient business owners, overwhelmed marketers, and PR practitioners with no time for podcasts.

More information

Distinguish between different types of numerical data and different data collection processes.

Distinguish between different types of numerical data and different data collection processes. Level: Diploma in Business Learning Outcomes 1.1 1.3 Distinguish between different types of numerical data and different data collection processes. Introduce the course by defining statistics and explaining

More information

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney Name: Intro to Statistics for the Social Sciences Lab Session: Spring, 2015, Dr. Suzanne Delaney CID Number: _ Homework #22 You have been hired as a statistical consultant by Donald who is a used car dealer

More information

Correlation and Simple. Linear Regression. Scenario. Defining Correlation

Correlation and Simple. Linear Regression. Scenario. Defining Correlation Linear Regression Scenario Let s imagine that we work in a real estate business and we re attempting to understand whether there s any association between the square footage of a house and it s final selling

More information

CHAPTER 4. Labeling Methods for Identifying Outliers

CHAPTER 4. Labeling Methods for Identifying Outliers CHAPTER 4 Labeling Methods for Identifying Outliers 4.1 Introduction Data mining is the extraction of hidden predictive knowledge s from large databases. Outlier detection is one of the powerful techniques

More information

How to Use Excel for Regression Analysis MtRoyal Version 2016RevA *

How to Use Excel for Regression Analysis MtRoyal Version 2016RevA * OpenStax-CNX module: m63578 1 How to Use Excel for Regression Analysis MtRoyal Version 2016RevA * Lyryx Learning Based on How to Use Excel for Regression Analysis BSTA 200 Humber College Version 2016RevA

More information

Also, big thank you to fellow TA Enoch Hill for edits and some additions to the guide.

Also, big thank you to fellow TA Enoch Hill for edits and some additions to the guide. Hello class, once again, here s an unofficial guide to the sample midterm. Please use this with caution, since 1) I am prone to error so incorrect explanations are entirely possible and 2) you should do

More information

Communicate and Collaborate with Visual Studio Team System 2008

Communicate and Collaborate with Visual Studio Team System 2008 Communicate and Collaborate with Visual Studio Team System 2008 White Paper May 2008 For the latest information, please see www.microsoft.com/teamsystem This is a preliminary document and may be changed

More information

SPSS Guide Page 1 of 13

SPSS Guide Page 1 of 13 SPSS Guide Page 1 of 13 A Guide to SPSS for Public Affairs Students This is intended as a handy how-to guide for most of what you might want to do in SPSS. First, here is what a typical data set might

More information

Capability on Aggregate Processes

Capability on Aggregate Processes Capability on Aggregate Processes CVJ Systems AWD Systems Trans Axle Solutions edrive Systems The Problem Fixture 1 Fixture 2 Horizontal Mach With one machine and a couple of fixtures, it s a pretty easy

More information

STAT 2300: Unit 1 Learning Objectives Spring 2019

STAT 2300: Unit 1 Learning Objectives Spring 2019 STAT 2300: Unit 1 Learning Objectives Spring 2019 Unit tests are written to evaluate student comprehension, acquisition, and synthesis of these skills. The problems listed as Assigned MyStatLab Problems

More information

CS 147: Computer Systems Performance Analysis

CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis Approaching Performance Projects CS 147: Computer Systems Performance Analysis Approaching Performance Projects 1 / 35 Overview Overview Overview Planning

More information

P rofit t (1 + i) t. V alue = t=0

P rofit t (1 + i) t. V alue = t=0 These notes correspond to Chapter 2 of the text. 1 Optimization A key concept in economics is that of optimization. It s a tool that can be used for many applications, but for now we will use it for pro

More information

Modelling buyer behaviour - 2 Rate-frequency models

Modelling buyer behaviour - 2 Rate-frequency models Publishing Date: May 1993. 1993. All rights reserved. Copyright rests with the author. No part of this article may be reproduced without written permission from the author. Modelling buyer behaviour -

More information

Introduction to Statistics. Measures of Central Tendency

Introduction to Statistics. Measures of Central Tendency Introduction to Statistics Measures of Central Tendency Two Types of Statistics Descriptive statistics of a POPULATION Relevant notation (Greek): µ mean N population size sum Inferential statistics of

More information

Assignment 6. A Time Series Analysis 2006 Samuel L. Baker

Assignment 6. A Time Series Analysis 2006 Samuel L. Baker ASSIGNMENT 6 1 Going back in time (so we can use real data)... Assignment 6. A Time Series Analysis 2006 Samuel L. Baker Your job is to project Medicaid-paid deliveries of babies for 1991, based on the

More information

6) Consumer surplus is the red area in the following graph. It is 0.5*5*5=12.5. The answer is C.

6) Consumer surplus is the red area in the following graph. It is 0.5*5*5=12.5. The answer is C. These are solutions to Fall 2013 s Econ 1101 Midterm 1. No guarantees are made that this guide is error free, so please consult your TA or instructor if anything looks wrong. 1) If the price of sweeteners,

More information

A Bid for Every Auction. Implementing and Improving AdWords Automated Bidding

A Bid for Every Auction. Implementing and Improving AdWords Automated Bidding A Bid for Every Auction Implementing and Improving AdWords Automated Bidding Ten Automated Bidding Takeaways The Advantages of Auction-time Bidding 1 Bid to the user s context, as close to auction-time

More information

Introduction to Control Charts

Introduction to Control Charts Introduction to Control Charts Highlights Control charts can help you prevent defects before they happen. The control chart tells you how the process is behaving over time. It's the process talking to

More information

Week 1 Tuesday Hr 2 (Review 1) - Samples and Populations - Descriptive and Inferential Statistics - Normal and T distributions

Week 1 Tuesday Hr 2 (Review 1) - Samples and Populations - Descriptive and Inferential Statistics - Normal and T distributions Week 1 Tuesday Hr 2 (Review 1) - Samples and Populations - Descriptive and Inferential Statistics - Normal and T distributions One of the primary goals of statistics is to make statistical inferences on

More information

Unit QUAN Session 6. Introduction to Acceptance Sampling

Unit QUAN Session 6. Introduction to Acceptance Sampling Unit QUAN Session 6 Introduction to Acceptance Sampling MSc Strategic Quality Management Quantitative methods - Unit QUAN INTRODUCTION TO ACCEPTANCE SAMPLING Aims of Session To introduce the basic statistical

More information

Chapter 5 Notes Page 1

Chapter 5 Notes Page 1 Chapter 5 Notes Page 1 COST BEHAVIOR When dealing with costs, it helps for you to determine what drives the cost in question. A Cost Driver (also called Cost Base) is an activity that is associated with,

More information

Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur

Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Lecture - 19 Tutorial - ANOVA (Contd.) (Refer Slide

More information

10.2 Correlation. Plotting paired data points leads to a scatterplot. Each data pair becomes one dot in the scatterplot.

10.2 Correlation. Plotting paired data points leads to a scatterplot. Each data pair becomes one dot in the scatterplot. 10.2 Correlation Note: You will be tested only on material covered in these class notes. You may use your textbook as supplemental reading. At the end of this document you will find practice problems similar

More information

We all know that what you measure gets done. But what if you are. measuring the wrong things and doing the wrong things as a result?

We all know that what you measure gets done. But what if you are. measuring the wrong things and doing the wrong things as a result? We all know that what you measure gets done. But what if you are measuring the wrong things and doing the wrong things as a result? Michaela Rebbeck talks to Alex Knight about operational measures that

More information

Two Way ANOVA. Turkheimer PSYC 771. Page 1 Two-Way ANOVA

Two Way ANOVA. Turkheimer PSYC 771. Page 1 Two-Way ANOVA Page 1 Two Way ANOVA Two way ANOVA is conceptually like multiple regression, in that we are trying to simulateously assess the effects of more than one X variable on Y. But just as in One Way ANOVA, the

More information

Introduction to Statistics. Measures of Central Tendency and Dispersion

Introduction to Statistics. Measures of Central Tendency and Dispersion Introduction to Statistics Measures of Central Tendency and Dispersion The phrase descriptive statistics is used generically in place of measures of central tendency and dispersion for inferential statistics.

More information

= = Name: Lab Session: CID Number: The database can be found on our class website: Donald s used car data

= = Name: Lab Session: CID Number: The database can be found on our class website: Donald s used car data Intro to Statistics for the Social Sciences Fall, 2017, Dr. Suzanne Delaney Extra Credit Assignment Instructions: You have been hired as a statistical consultant by Donald who is a used car dealer to help

More information

Launch Store. University

Launch Store. University Launch Store University Importing and Exporting CSVs In this lesson, you will learn how to: What is a CSV? How to export a CSV document of your products from control panel How to import a CSV into your

More information

PRINCIPLES AND APPLICATIONS OF SPECIAL EDUCATION ASSESSMENT

PRINCIPLES AND APPLICATIONS OF SPECIAL EDUCATION ASSESSMENT PRINCIPLES AND APPLICATIONS OF SPECIAL EDUCATION ASSESSMENT CLASS 3: DESCRIPTIVE STATISTICS & RELIABILITY AND VALIDITY FEBRUARY 2, 2015 OBJECTIVES Define basic terminology used in assessment, such as validity,

More information

Chapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition

Chapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition Chapter 4: Foundations for inference OpenIntro Statistics, 2nd Edition Variability in estimates 1 Variability in estimates Application exercise Sampling distributions - via CLT 2 Confidence intervals 3

More information

CHAPTER 5 FIRM PRODUCTION, COST, AND REVENUE

CHAPTER 5 FIRM PRODUCTION, COST, AND REVENUE CHAPTER 5 FIRM PRODUCTION, COST, AND REVENUE CHAPTER OBJECTIVES You will find in this chapter models that will help you understand the relationship between production and costs and the relationship between

More information

A Fundamental Error Find out what consumers think about your product, not your messaging.

A Fundamental Error Find out what consumers think about your product, not your messaging. A Fundamental Error Find out what consumers think about your product, not your messaging. Marketers and researchers make a fundamental error almost universally when developing and testing marketing communications.

More information

Getting Started with OptQuest

Getting Started with OptQuest Getting Started with OptQuest What OptQuest does Futura Apartments model example Portfolio Allocation model example Defining decision variables in Crystal Ball Running OptQuest Specifying decision variable

More information

Forecasting with Seasonality Version 1.6

Forecasting with Seasonality Version 1.6 Forecasting with Seasonality Version 1.6 Dr. Ron Tibben-Lembke Sept 3, 2006 Forecasting with seasonality and a trend is obviously more difficult than forecasting for a trend or for seasonality by itself,

More information

Chapter Six- Selecting the Best Innovation Model by Using Multiple Regression

Chapter Six- Selecting the Best Innovation Model by Using Multiple Regression Chapter Six- Selecting the Best Innovation Model by Using Multiple Regression 6.1 Introduction In the previous chapter, the detailed results of FA were presented and discussed. As a result, fourteen factors

More information

Utilizing Data Science To Assess Software Quality

Utilizing Data Science To Assess Software Quality Utilizing Data Science To Assess Software Quality Renato Martins renatopmartins@gmail.com Abstract Data Science is a "hip" term these days. Most people automatically associate Data Science with financial

More information

Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of

Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of numbers. Also, students will understand why some measures

More information

Mathematics in Contemporary Society - Chapter 5 (Spring 2018)

Mathematics in Contemporary Society - Chapter 5 (Spring 2018) City University of New York (CUNY) CUNY Academic Works Open Educational Resources Queensborough Community College Spring 218 Mathematics in Contemporary Society - Chapter (Spring 218) Patrick J. Wallach

More information

Managerial Accounting Prof. Dr. Varadraj Bapat Department of School of Management Indian Institute of Technology, Bombay

Managerial Accounting Prof. Dr. Varadraj Bapat Department of School of Management Indian Institute of Technology, Bombay Managerial Accounting Prof. Dr. Varadraj Bapat Department of School of Management Indian Institute of Technology, Bombay Lecture - 32 Standard Costing, Mix, Yield, Sales and Fixed Overhead Variances The

More information

STAT/MATH Chapter3. Statistical Methods in Practice. Averages and Variation 1/27/2017. Measures of Central Tendency: Mode, Median, and Mean

STAT/MATH Chapter3. Statistical Methods in Practice. Averages and Variation 1/27/2017. Measures of Central Tendency: Mode, Median, and Mean STAT/MATH 3379 Statistical Methods in Practice Dr. Ananda Manage Associate Professor of Statistics Department of Mathematics & Statistics SHSU 1 Chapter3 Averages and Variation Copyright Cengage Learning.

More information

How to Keep the Delivery of Your Pricing and Other Sales Information Simple! Module 4

How to Keep the Delivery of Your Pricing and Other Sales Information Simple! Module 4 How to Keep the Delivery of Your Pricing and Other Sales Information Simple! Module 4 In Module 1 we talked about the Myths of Pricing and then we went over the basics of gathering your financial information

More information

DOWNLOAD PDF MANUAL ON PRESENTATION OF DATA AND CONTROL CHART ANALYSIS

DOWNLOAD PDF MANUAL ON PRESENTATION OF DATA AND CONTROL CHART ANALYSIS Chapter 1 : ASTM manual on presentation of data and control chart analysis ( edition) Open Library tis Atmh s Manual on Presentation of Data and Control Chart Analysis is the ninth edition of the Astm

More information

Measurement and sampling

Measurement and sampling Name: Instructions: (1) Answer questions in your blue book. Number each response. (2) Write your name on the cover of your blue book (and only on the cover). (3) You are allowed to use your calculator

More information

MAS187/AEF258. University of Newcastle upon Tyne

MAS187/AEF258. University of Newcastle upon Tyne MAS187/AEF258 University of Newcastle upon Tyne 2005-6 Contents 1 Collecting and Presenting Data 5 1.1 Introduction...................................... 5 1.1.1 Examples...................................

More information

Module - 01 Lecture - 03 Descriptive Statistics: Graphical Approaches

Module - 01 Lecture - 03 Descriptive Statistics: Graphical Approaches Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B. Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institution of Technology, Madras

More information

ANALYSING QUANTITATIVE DATA

ANALYSING QUANTITATIVE DATA 9 ANALYSING QUANTITATIVE DATA Although, of course, there are other software packages that can be used for quantitative data analysis, including Microsoft Excel, SPSS is perhaps the one most commonly subscribed

More information

Before getting into the details of our engagement, I d like to ask, what does your company do?

Before getting into the details of our engagement, I d like to ask, what does your company do? Before getting into the details of our engagement, I d like to ask, what does your company do? We re a multilevel marketing company, with sales of around two-hundred million dollars a year. We deal with

More information

Excel #2: No magic numbers

Excel #2: No magic numbers Excel #2: No magic numbers This lesson comes from programmers who long ago learned that everything entered into code must be defined and documented. Placing numbers into an equation is dangerous because

More information

The future of outbound is precision dialling How to optimise your outbound contact activities

The future of outbound is precision dialling How to optimise your outbound contact activities The future of outbound is precision dialling How to optimise your outbound contact activities Rostrvm Solutions Limited May 2013 Dukes Court, Duke Street Woking, Surrey GU21 5RT enquiries@rostrvm.com www.rostrvm.com

More information

STAB22 section 2.1. Figure 1: Scatterplot of price vs. size for Mocha Frappuccino

STAB22 section 2.1. Figure 1: Scatterplot of price vs. size for Mocha Frappuccino STAB22 section 2.1 2.2 We re changing dog breed (categorical) to breed size (quantitative). This would enable us to see how life span depends on breed size (if it does), which we could assess by drawing

More information

Multiple Regression. Dr. Tom Pierce Department of Psychology Radford University

Multiple Regression. Dr. Tom Pierce Department of Psychology Radford University Multiple Regression Dr. Tom Pierce Department of Psychology Radford University In the previous chapter we talked about regression as a technique for using a person s score on one variable to make a best

More information

SCENARIO: We are interested in studying the relationship between the amount of corruption in a country and the quality of their economy.

SCENARIO: We are interested in studying the relationship between the amount of corruption in a country and the quality of their economy. Introduction to SPSS Center for Teaching, Research and Learning Research Support Group American University, Washington, D.C. Hurst Hall 203 rsg@american.edu (202) 885-3862 This workshop is designed to

More information

HIMSS ME-PI Community. Quick Tour. Sigma Score Calculation Worksheet INSTRUCTIONS

HIMSS ME-PI Community. Quick Tour. Sigma Score Calculation Worksheet INSTRUCTIONS HIMSS ME-PI Community Sigma Score Calculation Worksheet INSTRUCTIONS Quick Tour Let s start with a quick tour of the Excel spreadsheet. There are six worksheets in the spreadsheet. Sigma Score (Snapshot)

More information

Concepts for Using TC2000/TCnet PCFs

Concepts for Using TC2000/TCnet PCFs 2004 Jim Cooper by Concepts for Using TC2000/TCnet PCFs Concepts for Using TC2000/TCnet PCFs 1 What is a PCF? 1 Why would I want to use a PCF? 1 What if I m no good at programming or math? 2 How do I make

More information

Workshop #2: Evolution

Workshop #2: Evolution The DNA Files: Workshops and Activities The DNA Files workshops are an outreach component of The DNA Files public radio documentary series produced by SoundVision Productions with funding from the National

More information

KING ABDULAZIZ UNIVERSITY FACULTY OF COMPUTING & INFORMATION TECHNOLOGY DEPARTMENT OF INFORMATION SYSTEM. Lab 1- Introduction

KING ABDULAZIZ UNIVERSITY FACULTY OF COMPUTING & INFORMATION TECHNOLOGY DEPARTMENT OF INFORMATION SYSTEM. Lab 1- Introduction Lab 1- Introduction Objective: We will start with some basic concept of DSS. And also we will start today the WHAT-IF analysis technique for decision making. Activity Outcomes: What is what-if analysis

More information

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 02 Data Mining Process Welcome to the lecture 2 of

More information

Lecture-16. Data Tables, Scenarios & Goal Seek in Excel 2007

Lecture-16. Data Tables, Scenarios & Goal Seek in Excel 2007 Lecture-16 Data Tables, Scenarios & Goal Seek in Excel 2007 In Excel, a Data Table is a way to see different results by altering an input cell in your formula. As an example, we're going to alert the interest

More information

Overview. Presenter: Bill Cheney. Audience: Clinical Laboratory Professionals. Field Guide To Statistics for Blood Bankers

Overview. Presenter: Bill Cheney. Audience: Clinical Laboratory Professionals. Field Guide To Statistics for Blood Bankers Field Guide To Statistics for Blood Bankers A Basic Lesson in Understanding Data and P.A.C.E. Program: 605-022-09 Presenter: Bill Cheney Audience: Clinical Laboratory Professionals Overview Statistics

More information

The future of outbound is Precision Dialling How to optimise your outbound contact activities

The future of outbound is Precision Dialling How to optimise your outbound contact activities The future of outbound is Precision Dialling How to optimise your outbound contact activities Rostrvm Solutions Limited December 2015 Dukes Court, Duke Street Woking, Surrey GU21 5RT enquiries@rostrvm.com

More information

Putting non-service employees on the phones

Putting non-service employees on the phones Putting non-service employees on the phones For the article Vista Print puts its employees on the phones to Learn the Customer in the July issue of Customer Service Newsletter (CSN), editor Bill Keenan

More information

The Tyranny of Tools, or Wait, what were we trying to do in the first place? Bill Hathaway, MoreSteam

The Tyranny of Tools, or Wait, what were we trying to do in the first place? Bill Hathaway, MoreSteam The Tyranny of Tools, or Wait, what were we trying to do in the first place? Bill Hathaway, MoreSteam I think we would all agree that tools are necessary to get work done. They amplify our efforts, and

More information

SPSS 14: quick guide

SPSS 14: quick guide SPSS 14: quick guide Edition 2, November 2007 If you would like this document in an alternative format please ask staff for help. On request we can provide documents with a different size and style of

More information

Industrial Engineering Prof. Inderdeep Singh Department of Mechanical and Industrial Engineering Indian Institute of Technology, Roorkee

Industrial Engineering Prof. Inderdeep Singh Department of Mechanical and Industrial Engineering Indian Institute of Technology, Roorkee Industrial Engineering Prof. Inderdeep Singh Department of Mechanical and Industrial Engineering Indian Institute of Technology, Roorkee Module - 04 Lecture - 04 Sales Forecasting I A very warm welcome

More information

Number arrangements and wheels in LSA

Number arrangements and wheels in LSA Number arrangements and wheels in LSA Introduction LSA wheel window has had many improvements since its first implementation back in version 3. The latest version is by far the best yet and will almost

More information

Shewhart and the Probability Approach. The difference is much greater than how we compute the limits

Shewhart and the Probability Approach. The difference is much greater than how we compute the limits Quality Digest Daily, November 2, 2015 Manuscript 287 The difference is much greater than how we compute the limits Donald J. Wheeler & Henry R. Neave In theory, there is no difference between theory and

More information

How to Solve These Five Issues of. Marketing Strategy Creation?

How to Solve These Five Issues of. Marketing Strategy Creation? How to Solve These Five Issues of Marketing Strategy Creation? Opportunities multiply as they are seized. Sun Tzu 2 Introduction No matter whether you re running your mom-and-pop business or you consult

More information

Dr. Eli Goldratt Unplugged

Dr. Eli Goldratt Unplugged Dr. Eli Goldratt Unplugged This is part one of SCDigest editor Dan Gilmore s interview with Dr. Eli Goldratt, father of the Theory of Constraints, and author of The Goal and several other influential books

More information

Counting Culture: Seasonality in Northern Irish Arts and Culture

Counting Culture: Seasonality in Northern Irish Arts and Culture Counting Culture: Seasonality in Northern Irish Arts and Culture December 2017 Seasonality in Northern Irish Arts and Culture At thrive, we are in the lucky position of having access to a lot of data from

More information

HOW TO CREATE A FRENZY OF CUSTOMERS IN YOUR STORE

HOW TO CREATE A FRENZY OF CUSTOMERS IN YOUR STORE HOW TO CREATE A FRENZY OF CUSTOMERS IN YOUR STORE Free, Customized Traffic Scale Report will show you, very specifically, the benefits of awakening the Sleeping Buyer in your market. Click here to claim

More information

(800) Leader s Guide

(800) Leader s Guide www.edgetrainingsystems.com (800) 476 1405 Leader s Guide Copyright 2002 Edge Training Systems, Inc. All Rights Reserved This publication may not be reproduced, stored in a retrieval system, or transmitted

More information

The Real Estate Philosopher

The Real Estate Philosopher The Real Estate Philosopher Peter Drucker: Creating Customers Peter Drucker one of the great intellectual thinkers of the twentieth and twenty-first centuries asks a question: What is the purpose of a

More information

Because you re reading this book, you re likely familiar with the basics

Because you re reading this book, you re likely familiar with the basics Chapter 1 Beyond Number Crunching: The Art and Science of Data Analysis In This Chapter Realizing your role as a data analyst Avoiding statistical faux pas Delving into the jargon of Stats II Because you

More information

I m going to begin by showing you the basics of creating a table in Excel. And then later on we will get into more advanced applications using Excel.

I m going to begin by showing you the basics of creating a table in Excel. And then later on we will get into more advanced applications using Excel. I m going to begin by showing you the basics of creating a table in Excel. And then later on we will get into more advanced applications using Excel. If you had the choice of looking at this. 1 Or something

More information

STATISTICAL TECHNIQUES. Data Analysis and Modelling

STATISTICAL TECHNIQUES. Data Analysis and Modelling STATISTICAL TECHNIQUES Data Analysis and Modelling DATA ANALYSIS & MODELLING Data collection and presentation Many of us probably some of the methods involved in collecting raw data. Once the data has

More information

DEMAND CURVE AS A CONSTRAINT FOR BUSINESSES

DEMAND CURVE AS A CONSTRAINT FOR BUSINESSES 1Demand and rofit Seeking 8 Demand is important for two reasons. First, demand acts as a constraint on what business firms are able to do. In particular, the demand curve forces firms to accept lower sales

More information

Operations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology Madras

Operations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology Madras Operations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology Madras Lecture - 5 Aggregate Planning, Tabular Method, Linear Programming (Refer

More information