How to do Statistics in Excel
|
|
- Kelley Walsh
- 6 years ago
- Views:
Transcription
1 How to do Statistics in Excel Introduction The best answer is Don t. Get someone else to do it. But since there is no American who has a life and also likes statistics, this may not be easy. And it likely won t be cheap. On the other hand, you could do it yourself. Admittedly, that is not a fun choice. This tutorial, though, may reduce the pain and even enable you to understand what question you are asking that needs statistics. The most common flaw in reasoning that people have when they turn to statistics is that they don t quite know how numbers are going to help them answer their question. And that causes fear and avoidance. Overview. There is help. Statistics is a branch of Probability theory. Probability is actually the underlying base of knowledge in all the sciences, including social sciences. We seldom can say, This causes that, with certainty. We can, though, say When this happens, the odds are that this other thing is going to happen. And the odds may be pretty good. How good is the question statistics will help answer. (But remember E. G. Bulwer-Lytton s comment, Fate laughs at probabilities, or in other words, the improbable MUST occur at some time.) (S.A. 5/14 80) To do statistics, you do not need to understand them, any more than you have to be able to fix a car in order to drive. You do, though, have to know or at least suspect when they can mislead you. As British Prime Minister Benjamin Disraeli allegedly said, There are lies, damned lies and statistics. (1) What do you study in statistics? A lot of numbers with something in common. Usually they come from a larger set of numbers called a population. The numbers are associated with each member of the population in some way that matters. For instance, students GPAs. You choose a group containing less than all the population to study, usually because the population is too large or inaccessible to study everything in it. The smaller group is called a sample. If the properties of the total population matter, then the properties of a sample can sometimes give you a good approximation of the properties of the whole population. So, if you want to know something about a lot of things, take some of the population and measure the thing you care about. Most likely what you learn from the sample will tell you something fairly reliable about the whole population. At this point, if somebody says anything about the binomial distribution or the Normal distribution, smile at them and thank them and go away from them. Unless they volunteer to do this job for you. Note: The spreadsheets listed in the tutorial will not work on web servers. If you want them, me at rfarnham@rxf3.com and I will send them. 1
2 What Excel has Too much. And it all looks frightening. Don t try to learn it, you are not and don t want to be a spreadsheet jockey. But the nice thing is that people have created spreadsheets that solve tasks you might need solved, so your job is reduced to finding something someone has built for you. Now come the examples. Some populations have most of their members near a central value, e.g. humans have height and there is an Average height. If you are a sauna manufacturer, the saunas must have higher ceilings in Scandinavia than in Japan. But knowing this central value or Average for height does not help you pick the height of your saunas made for Scandinavia, because bigger costs more and you don t want the cost to turn away buyers just because you made them big enough for the tallest Scandinavians. So, you need to know how tall the tallest are, and how many are that tall. Since you are interested in only the extreme values in the population, specifically the tallest, you don t need to bother with complicated statistical concerns such as Normal Distribution and Central Tendencies. Instead use: Non-Parametric Statistics. (The name comes because all the statistics you have learned stressed the importance of a Normal Frequency Distribution, the shape of which is specified by four Parameters, values that define some properties of the Normal Distribution. You don t need any of that for this statistical task.) 1. Start Excel and load the file named Sort. xslx The column of numbers represents 500 measurements of the height of male Scandinavians. Some are very tall, but not many. How many can you not accomodate in order to keep your costs down, while still keeping most people as potential customers? You need to know how many are how tall, so sort the heights in this column, highest to lowest. To do this, click on the green shaded choice at the top right labeled Sort 2
3 The box that appears offers you choices. You could choose what you want, but it is set now to sort the values in Columns 1. Click OK. If you want to sell saunas in Scandinavia that will accommodate 97 percent of the population, you are done. Even if the frequency distribution of heights is Normal and you could get the answer another way, the answer you got is still the right answer. If that s all you needed to know, go make saunas and don t spend any more time on statistics. (For a copy of any of the spreadsheets, me at rfarnham@rxf3.com and I will send it. They can be re-used with different numbers of your choice.) 3
4 Parametric statistics Central tendencies Now comes this Normal Distribution stuff. Among the properties of populations, there is often a tendency of values to cluster more in the middle than at the extremes. This central tendency is what we saw in the Scandinavian study, but we did not need it. Many times, though, this central tendency and the spread of values around it are valuable information. The measures you will use, then, are: Average, take all the heights of everybody and divide the sum by the number of people. Median, the height with the most people. Dispersion, = how many are close to the Average and how many are very different. Excel has the tools to find these numbers if you have the data about the population, or sample of the population, but remember you do not want to do it, you want someone to do it for you. I live to serve, so I have made a spreadsheet that does it for you. You still have to type in the numbers for the heights, but the calculations are automatic. 2. Load the file called FreqDistr.xlsx (usually also named a Bell Curve ) and if you want it personally, send me an at rfarnham@uncc.edu and I will send it to you. Why the name? Because the number of people with heights near the central value (Median or Average, depending) should be more than the number of people who are much taller or shorter. If we make a graph ( histogram ) where we plot the number of people at each height ( Frequency ) using the data from the first example, it should look like this: The frequency of each height is in the table Freq Distr. The histogram above graphs that data. 4
5 Now the value with the most people is visible (the Median), and an idea of the dispersion around the central values is visible. We need a measure for this dispersion, named the Standard Deviation and it is calculated for you below. (You probably heard from someone that the use of a Standard Deviation depends on whether the frequency distribution is Normal. I told you already not to go near that person. But now I will discuss it as a matter of personal privilege.) Figure 4 Skewness and Kurtosis can be used as tests for the likelihood that the distribution is Normal. The four parameters of the Normal Frequency Distribution are: Average. In a population with a Normal distribution, the Average and the Median ought to be about the same. Standard Deviation. If the population has a Normal distribution, the Standard Deviation has a useful property, discussed below. Skewness. This should be around zero in a population with a Normal distribution. Kurtosis. This should also be about zero in a population with a Normal distribution. If your distribution has the correct Skewness and Kurtosis, it probably is a Normal distribution. Do not worry about how these things are calculated. Just think of Excel as magic. The graph above does not show a very Normal distribution. Heights are in fact not Normally distributed. Further, Scandinavians consist of two populations, Teutonic and Lapp, who are shorter on average. Here is a truer picture of the distribution of heights in Sweden. 5
6 3. Load the file named FreqDistr2pops.xlsx Note: The Average and Median are close The Skewness is close to zero. The Kurtosis is a little low. But the distribution is NOT Normal. There are in fact two populations shown, Teutonics and the shorter Lapps, i.e. two populations distributions of heights have been mingled. Lapps Teutonics This shows why you have to always be suspicious of statistics. The problem is always in the assumptions, and one might have thought heights were a single population until one looks at the graph. Eyes are scientific instruments. But in sauna manufacturing, all you needed was the top three percent value and didn t care if more than one population was present. 6
7 Let s put this stuff to work. 4. Load the spreadsheet named QC.xlsx Example: If you have a machine that measures something, it will not always give you the exact same answer on the same item. There is always a little change in the machine each time that leads to a slightly different result. It is caused by things like a minor temperature or humidity change or a truck passing by and rattling the floor, and that is called Random Error. It is not Systematic Error, which it would be if you could find what caused the error and fix it. Random Error always has a Normal Distribution, and the Normal distribution has special properties. The most important property of a Normal distribution is that the number of values within: 1 Standard Deviation from the Average will be 68% of the total number of values. 2 Standard Deviations will be 95 percent of the total number of measurements. 3 Standard Deviations will be 99.7 percent. This is useful. Suppose you have a Control specimen of known value, say 100, that you are going to make the machine measure three every day, and after some days doing that you then calculate the Average and Standard Deviation of those measurements, you know pretty much how accurate the measurements can be with this machine. If you need more accuracy, you will have to get a different machine. Remember, this assumes a Normal Distribution of Random Error, which is true by definition. If the distribution of Error is not Normal, then there is Systematic Error, and you should fix the machine. Here is an example of Random Error: 7
8 8
9 We now have measures of the machine s accuracy and its inherent error. To use this, we follow the machine s analyses for the next 30 days, to see if it starts to show more error than before: +2 SD Average (Mean) - 2 SD (Green lines represent 2 Standard Deviations above and below the mean.) But the next 23 days: 9
10 Correlation: Sometimes you are measuring things with one instrument, but then you move to another place and are stuck with a different machine that is still reliable but does give different numerical measurements from the old one. For instance, the old instrument measures pressure in pounds per square inch and the new one measures it in kilopascals by a different method. How do you know if the second one is as accurate as the old one? You compare their performance by measuring the same item on each one. But there is still Random Error in each measurement from each machine, so you need to measure how close their measurements are the same items on average and then decide if that is close enough. This process is called Regression Analysis. The most common form is Linear Regression, in which the assumption is that Random Error on one machine is a multiple of the Random Error on the other machine. So if the second machine gives a measurement 2.4 times the measurement of the first machine, then we can easily use the second machine and divide its answers by 2.4 to find out what the first machine would have gotten. Here is a tool to do this: 5. Load the file named Statistics Lin Rgrs begin.xlsx, remembering that the graph of a linear relationship will be straight: This is the empty template. 10
11 This is one populated with measurement data from an old machine and from a newer one: First, this is Linear Regression, and the Regression Line certainly looks linear. Remember, that means the results on the new machine are reliably some multiple of the results on the old machine, and that multiplier is the Slope above, While the calculations are complex and hidden, the regression line is characterized by this Slope multiplier and the Intercept, the number the new machine would show if the old machine showed zero. The new machine could certainly be used, although the expected Reference Ranges that define normal would have to be multiplied by in order to be interpreted correctly. This is done for you at the upper right. 6. Load the file named Statistics Lin Rgrs Tilt.xlsx 11
12 Note that the higher value points seem to fall below the Regression Line. Uh Oh, something is not Linear. That means that at the upper levels, the results on the new machine are NOT multiples of those on the old machine, and our elegant calculations for new Reference Ranges are wrong. The clients also will have difficulty interpreting these non-linear results in terms of their prior experience with the old machine s results. They might not like that. The machine is non-returnable. Can we use it somehow? The graph looks like it starts to slant around 20 units on the old machine, 32 on the new one. Below that it looks linear. Check this, by erasing all the points that are above 20 on the old machine. (This has been done for you below, and the points deleted are encircled in red.) Good. We can report results that are less than 32 on the new machine with confidence, our clients can reliably interpret those results with the new reference ranges, and if we get results higher than 32 on the new machine, we can dilute the test specimen in half and then measure it. If the result is now in the linear range under 32, then we can double that diluted result and report that value safely. We will of course badmouth the machine among our peers, but it is not a total loss. 7. Load the file named Lin Rgrs Outlier begin.xlsx ) 12
13 Two points are clearly way outside the Regression Line. Is this data usable? Remember the question you want an answer to: Do the machines reliably correlate? The data shows that in at least two cases one or the other machine has significantly erred. Are you asking if the machines always give the same answer on the same item? Not at all, there are better tests than this to answer that question. So the best way to find the correlation is to remove these points that cloud the issue, since they are clearly errors. 8. Do it now. Hover the mouse arrow over the erroneous points (called Outliers ) and the data item will be shown. Look in the data list and delete those points. Result is below: Great straight line. The machines have a linear relationship, if we know one machine s result then we know the other machine s as well. Work done. Discarding data is always risky in statistical analysis and must be done judiciously (= correctly ) but there are times when it needs to be done or the true underlying relationships are obscured. Always ask yourself first if doing this might misleadingly affect the answer you seek. Or, better, find someone who knows. Conclusion: There are many other tasks needing statistics, and one of the most common not mentioned here is the task of telling whether one population is different from each other, e.g did the patients taking the medicine do any better than the ones with the same disease who didn t take the medicine? Someone likely has told you that this is a test of the Null Hypothesis. Avoid them. But find someone somewhere who has built a spreadsheet application like the ones here, and borrow it, using it like the car you drive but can t fix. Or you may have more than two variables to deal with, with each having an effect on the other. This needs a method called Analysis of Variance, or ANOVA. Don t look it up in a book, for you will not be able to function for a week. At some point you encounter tasks that you yourself simply cannot do. So look helpless, beg, or just tell your boss you can t do it. That s better than trying to do it and thinking you were successful when you weren t. Some people have been known to try prayer, some alcohol, some Mountain Dew or Red Bull. Results vary. Good luck. 13
14 Glossary: Average. Divide the sum of all data items by the number of items. Synonym: Mean Correlation: A mathematical relationship between populations, such that when you pick a measurement from the first, you have a reliable approximation of what the correlating measurement should be on an item from the second population. Dispersion: How many data items are close to the Average and how many are very different. Linear Regression: An attempt to find a linear relationship between two populations of data. Linear means that one item is the multiple of the other, together with an added constant number. Mean: The data item that has equal numbers of other measurements above it as below it. Non-Parametric Statistics: Analysis of data that does not have a Normal distribution. Normal: An alternate definition of a valued and respected normal English Word. The Normal Distribution is not particularly normal, but as a definition in Statistics, it has some useful features. Parametric statistics: Analysis of data that has a Normal distribution. The parameters are Average, Standard Deviation, Skewness and Kurtosis, and they must meet specific numerical criteria to establish that the population does in fact have a Normal Distribution. Random Error: Error whichis unpredictable and for which no explanation can be found. It mathematically will average out to zero, and is a property of a system that cannot be changed Reference Range: A property of a population such that measurements outside the Reference Range are defined as abnormal. Standard Deviation: If the population has a Normal distribution, the Standard Deviation has a useful property, discussed below. Systematic Error: Error that is due to a change in the system and which persists, implying a duty to find the source and fix it. Quicksand terms: (If you act like you know them, you will be given unwanted tasks.) Analysis of Variance: An attempt to find correlations for populations that may have an effect on measurements of the other populations. binomial distribution: Do not seek this knowledge. Kurtosis: Normal: A measure of the Peakedness of the frequency distribution curve of a population. An alternate definition of a valued and respected normal English Word. It is a common frequency distribution that is neither normal or abnormal. Get over it. It does have some useful properties, as discussed in the text Null Hypothesis: The statement that two samples of data represent the same population. Parameter: A calculable property of a data population. There are four, called moments, and are useful for inference about a population that iis Normally distributed. Skewness: A measure of the asymmetrical slant of the frequency distribution curve of a population. Both skewness and kurtosis should be also be about zero in a population with a Normal distribution. But this alone is not absolute proof the distribution is Normal. There are other tests for this. 14
15 Illustrations: Kurtosis is the degree of peakedness of a distribution. A normal distribution is a mesokurtic distribution. A pure leptokurtic distribution has a higher peak than the normal distribution and has heavier tails. A pure platykurtic distribution has a lower peak than a normal distribution and lighter tails. Skewness is the degree of departure from symmetry of a distribution. A positively skewed distribution has a tail which is pulled in the positive direction. A negatively skewed distribution has a tail which is pulled in the negative direction. 15
Descriptive Statistics Tutorial
Descriptive Statistics Tutorial Measures of central tendency Mean, Median, and Mode Statistics is an important aspect of most fields of science and toxicology is certainly no exception. The rationale behind
More informationChapter 1 Data and Descriptive Statistics
1.1 Introduction Chapter 1 Data and Descriptive Statistics Statistics is the art and science of collecting, summarizing, analyzing and interpreting data. The field of statistics can be broadly divided
More informationMarginal Costing Q.8
Marginal Costing. 2008 Q.8 Break-Even Point. Before tackling a marginal costing question, it s first of all crucial that you understand what is meant by break-even point. What this means is that a firm
More informationDay 1: Confidence Intervals, Center and Spread (CLT, Variability of Sample Mean) Day 2: Regression, Regression Inference, Classification
Data 8, Final Review Review schedule: - Day 1: Confidence Intervals, Center and Spread (CLT, Variability of Sample Mean) Day 2: Regression, Regression Inference, Classification Your friendly reviewers
More informationTHE NORMAL CURVE AND SAMPLES:
-69- &KDSWHU THE NORMAL CURVE AND SAMPLES: SAMPLING DISTRIBUTIONS A picture of an ideal normal distribution is shown below. The horizontal axis is calibrated in z-scores in terms of standard deviation
More informationForecasting Introduction Version 1.7
Forecasting Introduction Version 1.7 Dr. Ron Tibben-Lembke Sept. 3, 2006 This introduction will cover basic forecasting methods, how to set the parameters of those methods, and how to measure forecast
More informationPhysics 141 Plotting on a Spreadsheet
Physics 141 Plotting on a Spreadsheet Version: Fall 2018 Matthew J. Moelter (edited by Jonathan Fernsler and Jodi L. Christiansen) Department of Physics California Polytechnic State University San Luis
More informationChapter 10 Regression Analysis
Chapter 10 Regression Analysis Goal: To become familiar with how to use Excel 2007/2010 for Correlation and Regression. Instructions: You will be using CORREL, FORECAST and Regression. CORREL and FORECAST
More informationUntangling Correlated Predictors with Principle Components
Untangling Correlated Predictors with Principle Components David R. Roberts, Marriott International, Potomac MD Introduction: Often when building a mathematical model, one can encounter predictor variables
More informationLECTURE 17: MULTIVARIABLE REGRESSIONS I
David Youngberg BSAD 210 Montgomery College LECTURE 17: MULTIVARIABLE REGRESSIONS I I. What Determines a House s Price? a. Open Data Set 6 to help us answer this question. You ll see pricing data for homes
More informationThe Dummy s Guide to Data Analysis Using SPSS
The Dummy s Guide to Data Analysis Using SPSS Univariate Statistics Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved Table of Contents PAGE Creating a Data File...3 1. Creating
More informationMake sure to listen to this audio: as you go through this handout, to get maximum value.
Seven Steps to Fearless Marketing The Keys to Attracting more Clients with Less Struggle and Effort By Robert Middleton Action Plan Marketing 1 Make sure to listen to this audio: www.marketingball.com/tc/ftc.mp3
More informationUsing Living Cookbook & the MyPoints Spreadsheet How the Honey Do List Guy is Losing Weight
Using Living Cookbook & the MyPoints Spreadsheet How the Honey Do List Guy is Losing Weight Background We both knew we were too heavy, but we d accepted the current wisdom, Accept yourself as you are you
More informationGush vs. Bore: A Look at the Statistics of Sampling
Gush vs. Bore: A Look at the Statistics of Sampling Open the Fathom file Random_Samples.ftm. Imagine that in a nation somewhere nearby, a presidential election will soon be held with two candidates named
More informationQUESTION 2 What conclusion is most correct about the Experimental Design shown here with the response in the far right column?
QUESTION 1 When a Belt Poka-Yoke's a defect out of the process entirely then she should track the activity with a robust SPC system on the characteristic of interest in the defect as an early warning system.
More informationChapter 2 Part 1B. Measures of Location. September 4, 2008
Chapter 2 Part 1B Measures of Location September 4, 2008 Class will meet in the Auditorium except for Tuesday, October 21 when we meet in 102a. Skill set you should have by the time we complete Chapter
More informationUsing Excel s Analysis ToolPak Add-In
Using Excel s Analysis ToolPak Add-In Bijay Lal Pradhan, PhD Introduction I have a strong opinions that we can perform different quantitative analysis, including statistical analysis, in Excel. It is powerful,
More informationInstructions. AIRBUS A3XX: Developing the World s Largest Commercial Aircraft
Instructions AIRBUS A3XX: Developing the World s Largest Commercial Aircraft In this case, you will be analyzing the strategic interaction between Airbus and Boeing, the two leading producers of large
More information1. Contingency Table (Cross Tabulation Table)
II. Descriptive Statistics C. Bivariate Data In this section Contingency Table (Cross Tabulation Table) Box and Whisker Plot Line Graph Scatter Plot 1. Contingency Table (Cross Tabulation Table) Bivariate
More informationChapter 8 Script. Welcome to Chapter 8, Are Your Curves Normal? Probability and Why It Counts.
Chapter 8 Script Slide 1 Are Your Curves Normal? Probability and Why It Counts Hi Jed Utsinger again. Welcome to Chapter 8, Are Your Curves Normal? Probability and Why It Counts. Now, I don t want any
More informationBar graph or Histogram? (Both allow you to compare groups.)
Bar graph or Histogram? (Both allow you to compare groups.) We want to compare total revenues of five different companies. Key question: What is the revenue for each company? Bar graph We want to compare
More informationEpisode 105: Opening the Faucet with Google AdWords
Episode 105: Opening the Faucet with Google AdWords Guest: Niki Fielding This is Prove It. A podcast for impatient business owners, overwhelmed marketers, and PR practitioners with no time for podcasts.
More informationDistinguish between different types of numerical data and different data collection processes.
Level: Diploma in Business Learning Outcomes 1.1 1.3 Distinguish between different types of numerical data and different data collection processes. Introduce the course by defining statistics and explaining
More information= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney
Name: Intro to Statistics for the Social Sciences Lab Session: Spring, 2015, Dr. Suzanne Delaney CID Number: _ Homework #22 You have been hired as a statistical consultant by Donald who is a used car dealer
More informationCorrelation and Simple. Linear Regression. Scenario. Defining Correlation
Linear Regression Scenario Let s imagine that we work in a real estate business and we re attempting to understand whether there s any association between the square footage of a house and it s final selling
More informationCHAPTER 4. Labeling Methods for Identifying Outliers
CHAPTER 4 Labeling Methods for Identifying Outliers 4.1 Introduction Data mining is the extraction of hidden predictive knowledge s from large databases. Outlier detection is one of the powerful techniques
More informationHow to Use Excel for Regression Analysis MtRoyal Version 2016RevA *
OpenStax-CNX module: m63578 1 How to Use Excel for Regression Analysis MtRoyal Version 2016RevA * Lyryx Learning Based on How to Use Excel for Regression Analysis BSTA 200 Humber College Version 2016RevA
More informationAlso, big thank you to fellow TA Enoch Hill for edits and some additions to the guide.
Hello class, once again, here s an unofficial guide to the sample midterm. Please use this with caution, since 1) I am prone to error so incorrect explanations are entirely possible and 2) you should do
More informationCommunicate and Collaborate with Visual Studio Team System 2008
Communicate and Collaborate with Visual Studio Team System 2008 White Paper May 2008 For the latest information, please see www.microsoft.com/teamsystem This is a preliminary document and may be changed
More informationSPSS Guide Page 1 of 13
SPSS Guide Page 1 of 13 A Guide to SPSS for Public Affairs Students This is intended as a handy how-to guide for most of what you might want to do in SPSS. First, here is what a typical data set might
More informationCapability on Aggregate Processes
Capability on Aggregate Processes CVJ Systems AWD Systems Trans Axle Solutions edrive Systems The Problem Fixture 1 Fixture 2 Horizontal Mach With one machine and a couple of fixtures, it s a pretty easy
More informationSTAT 2300: Unit 1 Learning Objectives Spring 2019
STAT 2300: Unit 1 Learning Objectives Spring 2019 Unit tests are written to evaluate student comprehension, acquisition, and synthesis of these skills. The problems listed as Assigned MyStatLab Problems
More informationCS 147: Computer Systems Performance Analysis
CS 147: Computer Systems Performance Analysis Approaching Performance Projects CS 147: Computer Systems Performance Analysis Approaching Performance Projects 1 / 35 Overview Overview Overview Planning
More informationP rofit t (1 + i) t. V alue = t=0
These notes correspond to Chapter 2 of the text. 1 Optimization A key concept in economics is that of optimization. It s a tool that can be used for many applications, but for now we will use it for pro
More informationModelling buyer behaviour - 2 Rate-frequency models
Publishing Date: May 1993. 1993. All rights reserved. Copyright rests with the author. No part of this article may be reproduced without written permission from the author. Modelling buyer behaviour -
More informationIntroduction to Statistics. Measures of Central Tendency
Introduction to Statistics Measures of Central Tendency Two Types of Statistics Descriptive statistics of a POPULATION Relevant notation (Greek): µ mean N population size sum Inferential statistics of
More informationAssignment 6. A Time Series Analysis 2006 Samuel L. Baker
ASSIGNMENT 6 1 Going back in time (so we can use real data)... Assignment 6. A Time Series Analysis 2006 Samuel L. Baker Your job is to project Medicaid-paid deliveries of babies for 1991, based on the
More information6) Consumer surplus is the red area in the following graph. It is 0.5*5*5=12.5. The answer is C.
These are solutions to Fall 2013 s Econ 1101 Midterm 1. No guarantees are made that this guide is error free, so please consult your TA or instructor if anything looks wrong. 1) If the price of sweeteners,
More informationA Bid for Every Auction. Implementing and Improving AdWords Automated Bidding
A Bid for Every Auction Implementing and Improving AdWords Automated Bidding Ten Automated Bidding Takeaways The Advantages of Auction-time Bidding 1 Bid to the user s context, as close to auction-time
More informationIntroduction to Control Charts
Introduction to Control Charts Highlights Control charts can help you prevent defects before they happen. The control chart tells you how the process is behaving over time. It's the process talking to
More informationWeek 1 Tuesday Hr 2 (Review 1) - Samples and Populations - Descriptive and Inferential Statistics - Normal and T distributions
Week 1 Tuesday Hr 2 (Review 1) - Samples and Populations - Descriptive and Inferential Statistics - Normal and T distributions One of the primary goals of statistics is to make statistical inferences on
More informationUnit QUAN Session 6. Introduction to Acceptance Sampling
Unit QUAN Session 6 Introduction to Acceptance Sampling MSc Strategic Quality Management Quantitative methods - Unit QUAN INTRODUCTION TO ACCEPTANCE SAMPLING Aims of Session To introduce the basic statistical
More informationChapter 5 Notes Page 1
Chapter 5 Notes Page 1 COST BEHAVIOR When dealing with costs, it helps for you to determine what drives the cost in question. A Cost Driver (also called Cost Base) is an activity that is associated with,
More informationApplied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur
Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Lecture - 19 Tutorial - ANOVA (Contd.) (Refer Slide
More information10.2 Correlation. Plotting paired data points leads to a scatterplot. Each data pair becomes one dot in the scatterplot.
10.2 Correlation Note: You will be tested only on material covered in these class notes. You may use your textbook as supplemental reading. At the end of this document you will find practice problems similar
More informationWe all know that what you measure gets done. But what if you are. measuring the wrong things and doing the wrong things as a result?
We all know that what you measure gets done. But what if you are measuring the wrong things and doing the wrong things as a result? Michaela Rebbeck talks to Alex Knight about operational measures that
More informationTwo Way ANOVA. Turkheimer PSYC 771. Page 1 Two-Way ANOVA
Page 1 Two Way ANOVA Two way ANOVA is conceptually like multiple regression, in that we are trying to simulateously assess the effects of more than one X variable on Y. But just as in One Way ANOVA, the
More informationIntroduction to Statistics. Measures of Central Tendency and Dispersion
Introduction to Statistics Measures of Central Tendency and Dispersion The phrase descriptive statistics is used generically in place of measures of central tendency and dispersion for inferential statistics.
More information= = Name: Lab Session: CID Number: The database can be found on our class website: Donald s used car data
Intro to Statistics for the Social Sciences Fall, 2017, Dr. Suzanne Delaney Extra Credit Assignment Instructions: You have been hired as a statistical consultant by Donald who is a used car dealer to help
More informationLaunch Store. University
Launch Store University Importing and Exporting CSVs In this lesson, you will learn how to: What is a CSV? How to export a CSV document of your products from control panel How to import a CSV into your
More informationPRINCIPLES AND APPLICATIONS OF SPECIAL EDUCATION ASSESSMENT
PRINCIPLES AND APPLICATIONS OF SPECIAL EDUCATION ASSESSMENT CLASS 3: DESCRIPTIVE STATISTICS & RELIABILITY AND VALIDITY FEBRUARY 2, 2015 OBJECTIVES Define basic terminology used in assessment, such as validity,
More informationChapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition
Chapter 4: Foundations for inference OpenIntro Statistics, 2nd Edition Variability in estimates 1 Variability in estimates Application exercise Sampling distributions - via CLT 2 Confidence intervals 3
More informationCHAPTER 5 FIRM PRODUCTION, COST, AND REVENUE
CHAPTER 5 FIRM PRODUCTION, COST, AND REVENUE CHAPTER OBJECTIVES You will find in this chapter models that will help you understand the relationship between production and costs and the relationship between
More informationA Fundamental Error Find out what consumers think about your product, not your messaging.
A Fundamental Error Find out what consumers think about your product, not your messaging. Marketers and researchers make a fundamental error almost universally when developing and testing marketing communications.
More informationGetting Started with OptQuest
Getting Started with OptQuest What OptQuest does Futura Apartments model example Portfolio Allocation model example Defining decision variables in Crystal Ball Running OptQuest Specifying decision variable
More informationForecasting with Seasonality Version 1.6
Forecasting with Seasonality Version 1.6 Dr. Ron Tibben-Lembke Sept 3, 2006 Forecasting with seasonality and a trend is obviously more difficult than forecasting for a trend or for seasonality by itself,
More informationChapter Six- Selecting the Best Innovation Model by Using Multiple Regression
Chapter Six- Selecting the Best Innovation Model by Using Multiple Regression 6.1 Introduction In the previous chapter, the detailed results of FA were presented and discussed. As a result, fourteen factors
More informationUtilizing Data Science To Assess Software Quality
Utilizing Data Science To Assess Software Quality Renato Martins renatopmartins@gmail.com Abstract Data Science is a "hip" term these days. Most people automatically associate Data Science with financial
More informationStudents will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of
Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of numbers. Also, students will understand why some measures
More informationMathematics in Contemporary Society - Chapter 5 (Spring 2018)
City University of New York (CUNY) CUNY Academic Works Open Educational Resources Queensborough Community College Spring 218 Mathematics in Contemporary Society - Chapter (Spring 218) Patrick J. Wallach
More informationManagerial Accounting Prof. Dr. Varadraj Bapat Department of School of Management Indian Institute of Technology, Bombay
Managerial Accounting Prof. Dr. Varadraj Bapat Department of School of Management Indian Institute of Technology, Bombay Lecture - 32 Standard Costing, Mix, Yield, Sales and Fixed Overhead Variances The
More informationSTAT/MATH Chapter3. Statistical Methods in Practice. Averages and Variation 1/27/2017. Measures of Central Tendency: Mode, Median, and Mean
STAT/MATH 3379 Statistical Methods in Practice Dr. Ananda Manage Associate Professor of Statistics Department of Mathematics & Statistics SHSU 1 Chapter3 Averages and Variation Copyright Cengage Learning.
More informationHow to Keep the Delivery of Your Pricing and Other Sales Information Simple! Module 4
How to Keep the Delivery of Your Pricing and Other Sales Information Simple! Module 4 In Module 1 we talked about the Myths of Pricing and then we went over the basics of gathering your financial information
More informationDOWNLOAD PDF MANUAL ON PRESENTATION OF DATA AND CONTROL CHART ANALYSIS
Chapter 1 : ASTM manual on presentation of data and control chart analysis ( edition) Open Library tis Atmh s Manual on Presentation of Data and Control Chart Analysis is the ninth edition of the Astm
More informationMeasurement and sampling
Name: Instructions: (1) Answer questions in your blue book. Number each response. (2) Write your name on the cover of your blue book (and only on the cover). (3) You are allowed to use your calculator
More informationMAS187/AEF258. University of Newcastle upon Tyne
MAS187/AEF258 University of Newcastle upon Tyne 2005-6 Contents 1 Collecting and Presenting Data 5 1.1 Introduction...................................... 5 1.1.1 Examples...................................
More informationModule - 01 Lecture - 03 Descriptive Statistics: Graphical Approaches
Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B. Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institution of Technology, Madras
More informationANALYSING QUANTITATIVE DATA
9 ANALYSING QUANTITATIVE DATA Although, of course, there are other software packages that can be used for quantitative data analysis, including Microsoft Excel, SPSS is perhaps the one most commonly subscribed
More informationBefore getting into the details of our engagement, I d like to ask, what does your company do?
Before getting into the details of our engagement, I d like to ask, what does your company do? We re a multilevel marketing company, with sales of around two-hundred million dollars a year. We deal with
More informationExcel #2: No magic numbers
Excel #2: No magic numbers This lesson comes from programmers who long ago learned that everything entered into code must be defined and documented. Placing numbers into an equation is dangerous because
More informationThe future of outbound is precision dialling How to optimise your outbound contact activities
The future of outbound is precision dialling How to optimise your outbound contact activities Rostrvm Solutions Limited May 2013 Dukes Court, Duke Street Woking, Surrey GU21 5RT enquiries@rostrvm.com www.rostrvm.com
More informationSTAB22 section 2.1. Figure 1: Scatterplot of price vs. size for Mocha Frappuccino
STAB22 section 2.1 2.2 We re changing dog breed (categorical) to breed size (quantitative). This would enable us to see how life span depends on breed size (if it does), which we could assess by drawing
More informationMultiple Regression. Dr. Tom Pierce Department of Psychology Radford University
Multiple Regression Dr. Tom Pierce Department of Psychology Radford University In the previous chapter we talked about regression as a technique for using a person s score on one variable to make a best
More informationSCENARIO: We are interested in studying the relationship between the amount of corruption in a country and the quality of their economy.
Introduction to SPSS Center for Teaching, Research and Learning Research Support Group American University, Washington, D.C. Hurst Hall 203 rsg@american.edu (202) 885-3862 This workshop is designed to
More informationHIMSS ME-PI Community. Quick Tour. Sigma Score Calculation Worksheet INSTRUCTIONS
HIMSS ME-PI Community Sigma Score Calculation Worksheet INSTRUCTIONS Quick Tour Let s start with a quick tour of the Excel spreadsheet. There are six worksheets in the spreadsheet. Sigma Score (Snapshot)
More informationConcepts for Using TC2000/TCnet PCFs
2004 Jim Cooper by Concepts for Using TC2000/TCnet PCFs Concepts for Using TC2000/TCnet PCFs 1 What is a PCF? 1 Why would I want to use a PCF? 1 What if I m no good at programming or math? 2 How do I make
More informationWorkshop #2: Evolution
The DNA Files: Workshops and Activities The DNA Files workshops are an outreach component of The DNA Files public radio documentary series produced by SoundVision Productions with funding from the National
More informationKING ABDULAZIZ UNIVERSITY FACULTY OF COMPUTING & INFORMATION TECHNOLOGY DEPARTMENT OF INFORMATION SYSTEM. Lab 1- Introduction
Lab 1- Introduction Objective: We will start with some basic concept of DSS. And also we will start today the WHAT-IF analysis technique for decision making. Activity Outcomes: What is what-if analysis
More informationBusiness Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee
Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 02 Data Mining Process Welcome to the lecture 2 of
More informationLecture-16. Data Tables, Scenarios & Goal Seek in Excel 2007
Lecture-16 Data Tables, Scenarios & Goal Seek in Excel 2007 In Excel, a Data Table is a way to see different results by altering an input cell in your formula. As an example, we're going to alert the interest
More informationOverview. Presenter: Bill Cheney. Audience: Clinical Laboratory Professionals. Field Guide To Statistics for Blood Bankers
Field Guide To Statistics for Blood Bankers A Basic Lesson in Understanding Data and P.A.C.E. Program: 605-022-09 Presenter: Bill Cheney Audience: Clinical Laboratory Professionals Overview Statistics
More informationThe future of outbound is Precision Dialling How to optimise your outbound contact activities
The future of outbound is Precision Dialling How to optimise your outbound contact activities Rostrvm Solutions Limited December 2015 Dukes Court, Duke Street Woking, Surrey GU21 5RT enquiries@rostrvm.com
More informationPutting non-service employees on the phones
Putting non-service employees on the phones For the article Vista Print puts its employees on the phones to Learn the Customer in the July issue of Customer Service Newsletter (CSN), editor Bill Keenan
More informationThe Tyranny of Tools, or Wait, what were we trying to do in the first place? Bill Hathaway, MoreSteam
The Tyranny of Tools, or Wait, what were we trying to do in the first place? Bill Hathaway, MoreSteam I think we would all agree that tools are necessary to get work done. They amplify our efforts, and
More informationSPSS 14: quick guide
SPSS 14: quick guide Edition 2, November 2007 If you would like this document in an alternative format please ask staff for help. On request we can provide documents with a different size and style of
More informationIndustrial Engineering Prof. Inderdeep Singh Department of Mechanical and Industrial Engineering Indian Institute of Technology, Roorkee
Industrial Engineering Prof. Inderdeep Singh Department of Mechanical and Industrial Engineering Indian Institute of Technology, Roorkee Module - 04 Lecture - 04 Sales Forecasting I A very warm welcome
More informationNumber arrangements and wheels in LSA
Number arrangements and wheels in LSA Introduction LSA wheel window has had many improvements since its first implementation back in version 3. The latest version is by far the best yet and will almost
More informationShewhart and the Probability Approach. The difference is much greater than how we compute the limits
Quality Digest Daily, November 2, 2015 Manuscript 287 The difference is much greater than how we compute the limits Donald J. Wheeler & Henry R. Neave In theory, there is no difference between theory and
More informationHow to Solve These Five Issues of. Marketing Strategy Creation?
How to Solve These Five Issues of Marketing Strategy Creation? Opportunities multiply as they are seized. Sun Tzu 2 Introduction No matter whether you re running your mom-and-pop business or you consult
More informationDr. Eli Goldratt Unplugged
Dr. Eli Goldratt Unplugged This is part one of SCDigest editor Dan Gilmore s interview with Dr. Eli Goldratt, father of the Theory of Constraints, and author of The Goal and several other influential books
More informationCounting Culture: Seasonality in Northern Irish Arts and Culture
Counting Culture: Seasonality in Northern Irish Arts and Culture December 2017 Seasonality in Northern Irish Arts and Culture At thrive, we are in the lucky position of having access to a lot of data from
More informationHOW TO CREATE A FRENZY OF CUSTOMERS IN YOUR STORE
HOW TO CREATE A FRENZY OF CUSTOMERS IN YOUR STORE Free, Customized Traffic Scale Report will show you, very specifically, the benefits of awakening the Sleeping Buyer in your market. Click here to claim
More information(800) Leader s Guide
www.edgetrainingsystems.com (800) 476 1405 Leader s Guide Copyright 2002 Edge Training Systems, Inc. All Rights Reserved This publication may not be reproduced, stored in a retrieval system, or transmitted
More informationThe Real Estate Philosopher
The Real Estate Philosopher Peter Drucker: Creating Customers Peter Drucker one of the great intellectual thinkers of the twentieth and twenty-first centuries asks a question: What is the purpose of a
More informationBecause you re reading this book, you re likely familiar with the basics
Chapter 1 Beyond Number Crunching: The Art and Science of Data Analysis In This Chapter Realizing your role as a data analyst Avoiding statistical faux pas Delving into the jargon of Stats II Because you
More informationI m going to begin by showing you the basics of creating a table in Excel. And then later on we will get into more advanced applications using Excel.
I m going to begin by showing you the basics of creating a table in Excel. And then later on we will get into more advanced applications using Excel. If you had the choice of looking at this. 1 Or something
More informationSTATISTICAL TECHNIQUES. Data Analysis and Modelling
STATISTICAL TECHNIQUES Data Analysis and Modelling DATA ANALYSIS & MODELLING Data collection and presentation Many of us probably some of the methods involved in collecting raw data. Once the data has
More informationDEMAND CURVE AS A CONSTRAINT FOR BUSINESSES
1Demand and rofit Seeking 8 Demand is important for two reasons. First, demand acts as a constraint on what business firms are able to do. In particular, the demand curve forces firms to accept lower sales
More informationOperations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology Madras
Operations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology Madras Lecture - 5 Aggregate Planning, Tabular Method, Linear Programming (Refer
More information