Section 1.1 Analyzing Categorical Data

Similar documents
Aug 1 9:38 AM. 1. Be able to determine the appropriate display for categorical variables.

Math 1 Variable Manipulation Part 8 Working with Data

Math 1 Variable Manipulation Part 8 Working with Data

A is used to answer questions about the quantity of what is being measured. A quantitative variable is comprised of numeric values.

CHAPTER FIVE CROSSTABS PROCEDURE

Making Sense of Data

STAT 206: Chapter 2 (Organizing and Visualizing Variables)

30 important questions of Data Interpretation - I (Quantitative Aptitude)

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 2 Organizing Data

What happened on the Titanic at 11:40 on the night of April 14, 1912, is

Variables and data types

SPSS 14: quick guide

STA Module 2A Organizing Data and Comparing Distributions (Part I)

Two-Way Tables ESSENTIAL QUESTION. How can you use two-way frequency tables to solve real-world problems? Real-World Video. my.hrw.

Chapter 2 Efficiency, Market & Government

ENTERTAINMENT HABITS SURVEY RESULTS MAY 2007

Lecture 10. Outline. 1-1 Introduction. 1-1 Introduction. 1-1 Introduction. Introduction to Statistics

Displaying Bivariate Numerical Data

1 Adda247 No. 1 APP for Banking & SSC Preparation Website: bankersadda.com sscadda.com store.adda247.com

A GUIDE TO GETTING SURVEY RESPONSES

Chapter 1 Data and Descriptive Statistics

Tables, graphs and maps

Data Analysis. Chapter1. Introduction 2. Section Section Section Chapter 1 Wrap-Up. Statistics: the Science and art of Data

CHAPTER 2: ORGANIZING AND VISUALIZING VARIABLES

CHAPTER 2: ORGANIZING AND VISUALIZING VARIABLES

Data familiarisation and description

The Dummy s Guide to Data Analysis Using SPSS

Animal Cloning. American Anti-Vivisection Society. Produced for. Prepared by. December 22, 2006

AQR Unit 2: Probability Contingency Table Problems. Name: Date:

MAT 1272 STATISTICS LESSON Organizing and Graphing Qualitative Data

An ordered array is an arrangement of data in either ascending or descending order.

Module - 01 Lecture - 03 Descriptive Statistics: Graphical Approaches

AudienceView How-To Guide: Understanding the Profiler

Why Learn Statistics?

1 PEW RESEARCH CENTER

Pub Entertainment With Smile - All rights reserved

ANGLIAN WATER S GENDER PAY GAP REPORT

Statistics 201 Summary of Tools and Techniques

Chapter 1. * Data = Organized collection of info. (numerical/symbolic) together w/ context.

0450 BUSINESS STUDIES

7115 BUSINESS STUDIES

Data Visualization. Prof.Sushila Aghav-Palwe

Repeated Measures ANOVA

An introduction to Audience Research. Christine Wilson Senior Director, Strategy & Planning Canadian Broadcasting Corporation

1. Contingency Table (Cross Tabulation Table)

Impact of Social Media on Purchasing Decision of Consumerwith Special Reference to Lahore, Pakistan

Gender Pay Report 2017

TERRA Environmental Research Institute

Chapter 8 Script. Welcome to Chapter 8, Are Your Curves Normal? Probability and Why It Counts.

Session 5. Organizing database

SOCIAL MEDIA PLATFORMS. Best ones too use for Acumen animation studios

HotFM Research Report

Chapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition

Case Report ISSUES RAISED

DATA MEMO. The average American internet user is not sure what podcasting is, what an RSS feed does, or what the term phishing means

Add Gender Indicators to a Monitoring and Evaluation Plan Activity 8.1: Measuring Gender Constructs

OREGON ELECTRICITY SURVEY

Choose 3 of the cartoons and write down what message you think they are trying to give.

Facebook Inception March 13 th, 2016

XL1A: Graph Nominal Frequency Data Using Excel2013 3/10/2017 V0E. Excel2013-Graph-Nominal0-Slides.pdf 1. Graph Nominal Frequency Data using Excel 2013

Analytics Transition Year Module 2

AMB201: MARKETING & AUDIENCE RESEARCH

Identifying, Selecting and Recruiting Members

Choose 3 of the cartoons and write down what message you think they are trying to give.

#netmedyouth. The The NET-MED Youth Project is funded by the European Union

1. What is a key difference between an Affinity Diagram and other tools?

To provide a framework and tools for planning, doing, checking and acting upon audits

LYFE MARKETING:MEET THE PUBLISHER

Mathematics in Contemporary Society - Chapter 5 (Spring 2018)

Chapter 2: Displaying and Describing Categorical Data Quiz A Name

Apple s Historical Advertisement

Lesson Plan One Energy Matters

Radio buttons. Tick Boxes. Drop down list. Spreadsheets Revision Booklet. Entering Data. Each cell can contain one of the following things

Marketing Data Analysis

Assessing Your Agency s Ethical Culture

Quantitative Methods. Presenting Data in Tables and Charts. Basic Business Statistics, 10e 2006 Prentice-Hall, Inc. Chap 2-1

Public Opinion Survey of Zanzibaris

When was ancient Athens? (Pg. 59) Athens at its height as an ancient democracy was between the years 400 to 300 BCE. BCE and CE whats the difference?

People Inc Free Report Templates P&A Software Solutions Report Templates

MAS187/AEF258. University of Newcastle upon Tyne

PRESENTING DATA ATM 16% Automated or live telephone 2% Drive-through service at branch 17% In person at branch 41% Internet 24% Banking Preference

Business Statistics: A Decision-Making Approach 7 th Edition

CivicScience Insight Report

Descriptive Statistics Tutorial

AudienceView How-To Guide: How to read a Digital Behaviour Report? Part 2 (Websites)

Lotteries Yukon s 2013 Household Survey and Web Survey Summary of Results

Course 5 Week 1 Assignment Document By Puneet Kapoor

Leadership Practices Inventory: LPI

GEODIS Wilson UK Ltd 2017 Gender Pay Report

Audience Research: Qualitative and Quantitative Methods. Ken LeClair Director of Research Canadian Broadcasting Corporation

CHAPTER 2: ORGANIZING AND VISUALIZING VARIABLES

Magazine s websites: usage & perception Yolanda Ausin ARI Spanish Magazine Association

SPSS Guide Page 1 of 13

Operational Policy CODE OF ADVERTISING STANDARDS 1. STATEMENT

Media Measurement. and Rates. Chapter 14 Advertising. Section Read to Learn List the components of media measurement.

Welcome to the Youth PQA Crash Course. This PowerPoint presentation is designed to guide you through using the Youth Program Quality Assessment (PQA)

An Overview of Mass Market Advertising

AP Statistics Test #1 (Chapter 1)

If you are using a survey: who will participate in your survey? Why did you decide on that? Explain

SURVEY ON PUBLIC OPINION ON THE LEVEL OF AWARENESS OF ENERGY EFFICIENCY

Transcription:

Section 1.1 Analyzing Categorical Data Categorical Variables place individuals into one of several groups or categories The values of a categorical variable are labels for the different categories The distribution of a categorical variable lists the count or percent of individuals who fall into each category. Radio Station Formats (Distribution of a categorical variable) The radio audience rating service Arbitron places the country s 13,838 radio stations into categories that describe the kinds of programs they broadcast. Here are two different tables showing the distribution of station formats: Individuals: Variable being measured: The on the left, which we call a frequency table, displays the counts (frequencies) of stations in each format category. On the right, we see a relative frequency table of the data that shows the percents (relative frequencies) of stations in each format category.

Displaying categorical data Frequency tables can be difficult to read. Sometimes is easier to analyze a distribution by displaying it with a bar graph or pie chart. Who Owns an MP3 Player? (Choosing the best graph to display the data) Portable MP3 music players, such as the Apple ipod, are popular but not equally popular with people of all ages. Here are the percents of all ages. Here are the percents of people in various age groups who own a portable MP3 player, according to an Arbitron survey of 1112 randomly selected people. PROBLEM: (a) Make a well-labeled bar graph to display the data. Describe what you see. (b) Would it be appropriate to make a pie chart for these data? Why or why not? Graphs: Good and Bad

Bar graphs compare several quantities by comparing the heights of bars that represent those quantities. This ad for DIRECTV has multiple problems. How many can you point out? What s wrong with the following graphs? Two-Way Tables and Marginal Distributions When a dataset involves two categorical variables, we begin by examining the counts or percents in various categories for one of the variables. DEFINITION: Two-way Table Two-way Table describes two categorical variables, organizing counts according to a row variable and a column variable.

I m Gonna Be Rich! A SURVEY OF 4826 RANDOMLY SELECTED YOUNG ADULTS (AGED 19 TO 25) ASKED, What do you think are the chances you will have much more than a middle-class income at age 30? The table below shows the responses, omitting a few people who refused to respond or who said they were already rich. What are the variables described by this two-way table? How many young adults were surveyed? DEFINITION: Marginal Distribution Two-Way Tables and Marginal Distributions The Marginal Distribution of one of the categorical variables in a two-way table of counts is the distribution of values of that variable among all individuals described by the table. Note: Percents are often more informative than counts, especially when comparing groups of different sizes. To examine a marginal distribution, 1) Use the data in the table to calculate the marginal distribution (in percents) of the row or column totals. 2) Make a graph to display the marginal distribution. I m Gonna Be Rich (Examining a marginal distribution) PROBLEM: (a) Use the data in the two-way table to calculate the marginal distribution (in percents) of opinions. (b) Make a graph to display the marginal distribution. Describe what you see.

CHECK YOUR UNDERSTANDING 1. Use the data in the two-way table to calculate the marginal distribution (in percents) of genders. 2. Make a graph to display the marginal distribution. Describe what you see. Relationships Between Categorical Variables Marginal distributions tell us nothing about the relationship between two variables. DEFINITION: Conditional Distribution A Conditional Distribution of a variable describes the values of that variable among individuals who have a specific value of another variable. To examine or compare conditional distributions, 1) Select the row(s) or column(s) of interest. 2) Use the data in the table to calculate the conditional distribution (in percents) of the row(s) or column(s). 3) Make a graph to display the conditional distribution. Use a side-by-side bar graph or segmented bar graph to compare distributions. I m Gonna Be Rich! (Calculating conditional distribution) PROBLEM: Calculate the conditional distribution of opinion among the men. Men Women Response Percent Response Percent Almost no chance Some chance A 50-50 chance A good chance Almost certain Almost no chance Some chance A 50-50 chance A good chance Almost certain

HOW TO ORGANIZE A STATISTICAL PROBLEM: A FOUR-STEP PROCESS : What s the question that you re trying to answer? : How will you go about answering the question? What statistical techniques does this problem call for? : Make graphs and carry out needed calculations. : Give your practical conclusion in the setting of the real-world problem. *** Statistics Problems Demand Consistency Superpowers (Conditional distribution and relationships) Based on the survey data, can we conclude that boys and girls differ in their preference of superpower? Give appropriate evidence to support your answer. Superpower Percent of females Percent of males Invisibility 15 15 Superstrength 3 20 Telepathy 34 6 Fly 31 21 Freeze time 17 38 Total 100 100 STATE: PLAN: DO: CONCLUDE: