DSC 201: Data Analysis & Visualization

Size: px
Start display at page:

Download "DSC 201: Data Analysis & Visualization"

Transcription

1 DSC 201: Data Analysis & Visualization Aggregation Dr. David Koop

2 Selection & Highlighting Selection: a user action (mouse, keyboard) on items, links, etc. Selection types: single vs. multiple, contiguous vs. non-contiguous Feedback is important! How? Change selected item's visual encoding - Change color: want to achieve visual popout - Add outline mark: allows original color to be preserved - Change size (line width) - Add motion: marching ants 2

3 Selection & Highlighting Selection: a user action (mouse, keyboard) on items, links, etc. Selection types: single vs. multiple, contiguous vs. non-contiguous Feedback is important! How? Change selected item's visual encoding - Change color: want to achieve visual popout - Add outline mark: allows original color to be preserved - Change size (line width) - Add motion: marching ants 2

4 line). Although it is just an example, using it you can already ask some quite interesting questions. If I am going to gamble whether Nasdaq 100 will gain or lose tomorrow what is my chance? Is Friday or Monday the most unlucky day for investors? Is spring better than winter to invest? Can you find the outliers? When did the outliers occur? Public data source: PiTrading.com. Try it out or check out these other examples. Juxtaposition and Coordinated Views US Venture Capital Landscape 2011 (choropleth chart, bubble chart) Major Canadian City Crime Stats (bubble overlay, bar chart, line chart) List of Community Contributed Examples Simple, Specific Chart Examples (source) Nasdaq 100 Index 1985/11/ /06/29 Yearly Performance (radius: fluctuation/index ratio, color: gain/loss) 150% 100% Index Gain % 50% 0% -50% % -150% -2,000-1,500-1, ,000 1,500 2,000 Index Gain Days by Gain/Loss Quarters Day of Week Days by Fluctuation(%) Loss(46%) Gain(53%) Q4 Q1 Mon Tue Wed 2,000 1,500 1,000 Q3 Q2 Thu Fri , % -20% -15% -10% -5% 0% 5% 10% 15% 20% 25% 7,000 6,000 5,000 [ Monthly Index Average Monthly Index Move 3

5 Multiple Views All Subset None Same Redundant Overview/ Detail Small Multiples Multiform Multiform, Overview/ Detail No Linkage [Munzner (ill. Maguire), 2014] 4

6 Superimposition 80 Temperature (ºF) 70 Austin 60 New York San Francisco October November December 2012 February March April May June July August September [M. Bostock, 5

7 Flattening the Sphere? [USGS Map Projections] 6

8 Choropleth Map: What are Marks and Channels? [M. Ericson, New York Times] 7

9 What are the Marks and Channels? [M. Ericson, New York Times] 8

10 House Races: More Geographic Data? [New York Times, 2010] 9

11 House Races: Maps Aren't Always Best [NYTimes] 10

12 Assignment 4 Visualization using Tableau Year as a Dimension 11

13 Aggregation: Histograms Very similar to bar charts Often shown without space between (continuity) Choice of number of bins - Important! - Viewers may infer different trends based on the layout Weight Class (lbs) [Munzner (ill. Maguire), 2014] 12

14 Boxplots Show distribution Single value (e.g. mean, max, min, quartiles) doesn't convey everything Created by John Tukey who grew up in New Bedford! Show spread and skew of data Best for unimodal data Variations like vase plot for multimodal data Aggregation here involves many different marks [Flowing Data] 13

15 Data Analysis Scenarios Often want to analyze data by some grouping: - Dogs vs. cats - Millennials vs. Gen-X vs. Baby Boomers - Physics vs. Chemistry Compute statistics based on those groupings - max, min, median Perform your own type of transformation (top-k, spread, etc.) Create visualizations for each group 14

16 Split-Apply-Combine Coined by H. Wickham, 2011 Similar to Map (split+apply) Reduce (combine) paradigm The Pattern: 1. Split the data by some grouping variable 2. Apply some function to each group independently 3. Combine the data into some output dataset The apply step is usually one of : - Aggregate - Transform - Filter [T. Brandt] 15

17 Split-Apply-Combine [W. McKinney, Python for Data Analysis] 16

18 Splitting by Variables.(sex).(age) name age sex John 13 Male Mary 15 Female Alice 14 Female Peter 13 Male Roger 14 Male Phyllis 13 Female name age sex John 13 Male Peter 13 Male Roger 14 Male name age sex Mary 15 Female Alice 14 Female Phyllis 13 Female name age sex John 13 Male Peter 13 Male Phyllis 13 Female name age sex Alice 14 Female Roger 14 Male name age sex Mary 15 Female [H. Wickham, 2011] 17

19 Apply+Combine: Counting.(sex).(age).(sex, age) sex value age value sex age value Male Male 13 2 Female Male Female 13 1 Female Female [H. Wickham, 2011] 18

20 In Pandas groupby method creates a GroupBy object groupby doesn't actually compute anything until there is an aggregation or we wish to examine the groups Choose keys (columns) to group by 19

21 Aggregation Operations: - size() - mean() - sum() May also wish to aggregate only certain subsets - Use square brackets with column names Can also write your own functions for aggregation and pass then to agg function - def peak_to_peak(arr): return arr.max() - arr.min() grouped.agg(peak_to_peak) 20

22 Optimized groupby methods Function name count sum mean median std, var min, max prod first, last Description Number of non-na values in the group Sum of non-na values Mean of non-na values Arithmetic median of non-na values Unbiased (n - 1 denominator) standard deviation and variance Minimum and maximum of non-na values Product of non-na values First and last non-na values [W. McKinney, Python for Data Analysis] 21

23 Iterating over groups for name, group in df.groupby('key1'): print(name) print(group) Can also.describe() groups 22

24 Example: Tipping Data tipping.ipynb 23

Chapter 3. Displaying and Summarizing Quantitative Data. 1 of 66 05/21/ :00 AM

Chapter 3. Displaying and Summarizing Quantitative Data.  1 of 66 05/21/ :00 AM Chapter 3 Displaying and Summarizing Quantitative Data D. Raffle 5/19/2015 1 of 66 05/21/2015 11:00 AM Intro In this chapter, we will discuss summarizing the distribution of numeric or quantitative variables.

More information

A is used to answer questions about the quantity of what is being measured. A quantitative variable is comprised of numeric values.

A is used to answer questions about the quantity of what is being measured. A quantitative variable is comprised of numeric values. Stats: Modeling the World Chapter 2 Chapter 2: Data What are data? In order to determine the context of data, consider the W s Who What (and in what units) When Where Why How There are two major ways to

More information

Social Studies 201 Fall Answers to Computer Problem Set 1 1. PRIORITY Priority for Federal Surplus

Social Studies 201 Fall Answers to Computer Problem Set 1 1. PRIORITY Priority for Federal Surplus Social Studies 1 Fall. Answers to Computer Problem Set 1 1 Social Studies 1 Fall Answers for Computer Problem Set 1 1. FREQUENCY DISTRIBUTIONS PRIORITY PRIORITY Priority for Federal Surplus 1 Reduce Debt

More information

Advanced Higher Statistics

Advanced Higher Statistics Advanced Higher Statistics 2018-19 Advanced Higher Statistics - 3 Unit Assessments - Prelim - Investigation - Final Exam (3 Hours) 1 Advanced Higher Statistics Handouts - Data Booklet - Course Outlines

More information

Chapter 1 Data and Descriptive Statistics

Chapter 1 Data and Descriptive Statistics 1.1 Introduction Chapter 1 Data and Descriptive Statistics Statistics is the art and science of collecting, summarizing, analyzing and interpreting data. The field of statistics can be broadly divided

More information

CEE3710: Uncertainty Analysis in Engineering

CEE3710: Uncertainty Analysis in Engineering CEE3710: Uncertainty Analysis in Engineering Lecture 1 September 6, 2017 Why do we need Probability and Statistics?? What is Uncertainty Analysis?? Ex. Consider the average (mean) height of females by

More information

STAT 2300: Unit 1 Learning Objectives Spring 2019

STAT 2300: Unit 1 Learning Objectives Spring 2019 STAT 2300: Unit 1 Learning Objectives Spring 2019 Unit tests are written to evaluate student comprehension, acquisition, and synthesis of these skills. The problems listed as Assigned MyStatLab Problems

More information

Data Visualization. Prof.Sushila Aghav-Palwe

Data Visualization. Prof.Sushila Aghav-Palwe Data Visualization By Prof.Sushila Aghav-Palwe Importance of Graphs in BI Business intelligence or BI is a technology-driven process that aims at collecting data and analyze it to extract actionable insights

More information

Section 9: Presenting and describing quantitative data

Section 9: Presenting and describing quantitative data Section 9: Presenting and describing quantitative data Australian Catholic University 2014 ALL RIGHTS RESERVED. No part of this work covered by the copyright herein may be reproduced or used in any form

More information

Biostatistics 208 Data Exploration

Biostatistics 208 Data Exploration Biostatistics 208 Data Exploration Dave Glidden Professor of Biostatistics Univ. of California, San Francisco January 8, 2008 http://www.biostat.ucsf.edu/biostat208 Organization Office hours by appointment

More information

Chapter 1. * Data = Organized collection of info. (numerical/symbolic) together w/ context.

Chapter 1. * Data = Organized collection of info. (numerical/symbolic) together w/ context. Chapter 1 Objectives (1) To understand the concept of data in statistics, (2) Learn to recognize its context & components, (3) Recognize the 2 basic variable types. Concept briefs: * Data = Organized collection

More information

Applying Regression Techniques For Predictive Analytics Paviya George Chemparathy

Applying Regression Techniques For Predictive Analytics Paviya George Chemparathy Applying Regression Techniques For Predictive Analytics Paviya George Chemparathy AGENDA 1. Introduction 2. Use Cases 3. Popular Algorithms 4. Typical Approach 5. Case Study 2016 SAPIENT GLOBAL MARKETS

More information

Biostat Exam 10/7/03 Coverage: StatPrimer 1 4

Biostat Exam 10/7/03 Coverage: StatPrimer 1 4 Biostat Exam 10/7/03 Coverage: StatPrimer 1 4 Part A (Closed Book) INSTRUCTIONS Write your name in the usual location (back of last page, near the staple), and nowhere else. Turn in your Lab Workbook at

More information

Table. XTMIXED Procedure in STATA with Output Systolic Blood Pressure, use "k:mydirectory,

Table. XTMIXED Procedure in STATA with Output Systolic Blood Pressure, use k:mydirectory, Table XTMIXED Procedure in STATA with Output Systolic Blood Pressure, 2001. use "k:mydirectory,. xtmixed sbp nage20 nage30 nage40 nage50 nage70 nage80 nage90 winter male dept2 edu_bachelor median_household_income

More information

Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of

Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of Students will understand the definition of mean, median, mode and standard deviation and be able to calculate these functions with given set of numbers. Also, students will understand why some measures

More information

Math 1 Variable Manipulation Part 8 Working with Data

Math 1 Variable Manipulation Part 8 Working with Data Name: Math 1 Variable Manipulation Part 8 Working with Data Date: 1 INTERPRETING DATA USING NUMBER LINE PLOTS Data can be represented in various visual forms including dot plots, histograms, and box plots.

More information

Math 1 Variable Manipulation Part 8 Working with Data

Math 1 Variable Manipulation Part 8 Working with Data Math 1 Variable Manipulation Part 8 Working with Data 1 INTERPRETING DATA USING NUMBER LINE PLOTS Data can be represented in various visual forms including dot plots, histograms, and box plots. Suppose

More information

Chart styles in Snap Survey Software. A summary of standard chart styles and advanced analysis options

Chart styles in Snap Survey Software. A summary of standard chart styles and advanced analysis options Chart styles in Snap Survey Software A summary of standard chart styles and advanced analysis options Table of Contents 3. About this report 4. How to view the charts used within this report 5. Bar Charts

More information

Analysis of Factors Affecting Resignations of University Employees

Analysis of Factors Affecting Resignations of University Employees Analysis of Factors Affecting Resignations of University Employees An exploratory study was conducted to identify factors influencing voluntary resignations at a large research university over the past

More information

Statistics 201 Summary of Tools and Techniques

Statistics 201 Summary of Tools and Techniques Statistics 201 Summary of Tools and Techniques This document summarizes the many tools and techniques that you will be exposed to in STAT 201. The details of how to do these procedures is intentionally

More information

Stat 411/511 MORE ON THE RANDOM SAMPLING MODEL. Charlotte Wickham. stat511.cwick.co.nz. Sep

Stat 411/511 MORE ON THE RANDOM SAMPLING MODEL. Charlotte Wickham. stat511.cwick.co.nz. Sep Stat 411/511 MORE ON THE RANDOM SAMPLING MODEL Sep 29 2015 Charlotte Wickham stat511.cwick.co.nz Announcements My office hours: Mondays 11am 255 Weniger Thursdays 3-5pm 3003 Cordley Help with Statistics

More information

Probability Of Booking

Probability Of Booking Axis Title Web Social Analytics Air France Assignment 1 Spring 216 Shuhua Zhu Assignment 1 Question 1: CTR TCR NET REVEAVE. COSROA AVE. REV PROB COUNT 451 451 451 451 459 368 451 MAX 2.% 9.% $549,524 $1.

More information

SPSS 14: quick guide

SPSS 14: quick guide SPSS 14: quick guide Edition 2, November 2007 If you would like this document in an alternative format please ask staff for help. On request we can provide documents with a different size and style of

More information

Dr. Allen Back. Aug. 26, 2016

Dr. Allen Back. Aug. 26, 2016 Dr. Allen Back Aug. 26, 2016 AP Stats vs. 1710 Some different emphases. AP Stats vs. 1710 Some different emphases. But generally comparable. AP Stats vs. 1710 Some different emphases. But generally comparable.

More information

Determining Effective Data Display with Charts

Determining Effective Data Display with Charts Determining Effective Data Display with Charts 1 Column Line Pie Stock XY (Scatter) Area Bubble Chart Types Covered 2 1 Visualizing Data 3 Data Graphics Principles 4 2 Data Graphics Principles Above all

More information

Measurement and sampling

Measurement and sampling Name: Instructions: (1) Answer questions in your blue book. Number each response. (2) Write your name on the cover of your blue book (and only on the cover). (3) You are allowed to use your calculator

More information

Elementary Statistics Lecture 2 Exploring Data with Graphical and Numerical Summaries

Elementary Statistics Lecture 2 Exploring Data with Graphical and Numerical Summaries Elementary Statistics Lecture 2 Exploring Data with Graphical and Numerical Summaries Chong Ma Department of Statistics University of South Carolina chongm@email.sc.edu Chong Ma (Statistics, USC) STAT

More information

BAR CHARTS. Display frequency distributions for nominal or ordinal data. Ej. Injury deaths of 100 children, ages 5-9, USA,

BAR CHARTS. Display frequency distributions for nominal or ordinal data. Ej. Injury deaths of 100 children, ages 5-9, USA, Graphs BAR CHARTS. Display frequency distributions for nominal or ordinal data. Ej. Injury deaths of 100 children, ages 5-9, USA, 1980-85. HISTOGRAMS. Display frequency distributions for continuous or

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY (formerly the Examinations of the Institute of Statisticians) ORDINARY CERTIFICATE IN STATISTICS, 2003

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY (formerly the Examinations of the Institute of Statisticians) ORDINARY CERTIFICATE IN STATISTICS, 2003 EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY (formerly the Examinations of the Institute of Statisticians) ORDINARY CERTIFICATE IN STATISTICS, 2003 Paper II Time Allowed: Three Hours Candidates may attempt

More information

Bar graph or Histogram? (Both allow you to compare groups.)

Bar graph or Histogram? (Both allow you to compare groups.) Bar graph or Histogram? (Both allow you to compare groups.) We want to compare total revenues of five different companies. Key question: What is the revenue for each company? Bar graph We want to compare

More information

Making Sense of Data

Making Sense of Data Tips for Revising Making Sense of Data Make sure you know what you will be tested on. The main topics are listed below. The examples show you what to do. Plan a revision timetable. Always revise actively

More information

Statistics Chapter 3 Triola (2014)

Statistics Chapter 3 Triola (2014) 3-1 Review and Preview Branches of statistics Descriptive Stats: is the branch of stats that involve the organization, summarization, and display of data Inferential Stats: is the branch of stats that

More information

Using Excel s Analysis ToolPak Add-In

Using Excel s Analysis ToolPak Add-In Using Excel s Analysis ToolPak Add-In Bijay Lal Pradhan, PhD Introduction I have a strong opinions that we can perform different quantitative analysis, including statistical analysis, in Excel. It is powerful,

More information

Chapter 2 Part 1B. Measures of Location. September 4, 2008

Chapter 2 Part 1B. Measures of Location. September 4, 2008 Chapter 2 Part 1B Measures of Location September 4, 2008 Class will meet in the Auditorium except for Tuesday, October 21 when we meet in 102a. Skill set you should have by the time we complete Chapter

More information

Module - 01 Lecture - 03 Descriptive Statistics: Graphical Approaches

Module - 01 Lecture - 03 Descriptive Statistics: Graphical Approaches Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B. Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institution of Technology, Madras

More information

CHAPTER 8 T Tests. A number of t tests are available, including: The One-Sample T Test The Paired-Samples Test The Independent-Samples T Test

CHAPTER 8 T Tests. A number of t tests are available, including: The One-Sample T Test The Paired-Samples Test The Independent-Samples T Test CHAPTER 8 T Tests A number of t tests are available, including: The One-Sample T Test The Paired-Samples Test The Independent-Samples T Test 8.1. One-Sample T Test The One-Sample T Test procedure: Tests

More information

AP Statistics Test #1 (Chapter 1)

AP Statistics Test #1 (Chapter 1) AP Statistics Test #1 (Chapter 1) Name Part I - Multiple Choice (Questions 1-20) - Circle the answer of your choice. 1. You measure the age, marital status and earned income of an SRS of 1463 women. The

More information

JMP TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING

JMP TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING JMP TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING INTRODUCTION JMP software provides introductory statistics in a package designed to let students visually explore data in an interactive way with

More information

Computing Descriptive Statistics Argosy University

Computing Descriptive Statistics Argosy University 2014 Argosy University 2 Computing Descriptive Statistics: Ever Wonder What Secrets They Hold? The Mean, Mode, Median, Variability, and Standard Deviation Introduction Before gaining an appreciation for

More information

Creating Simple Report from Excel

Creating Simple Report from Excel Creating Simple Report from Excel 1.1 Connect to Excel workbook 1. Select Connect Microsoft Excel. In the Open File dialog box, select the 2015 Sales.xlsx file. 2. The file will be loaded to Tableau, and

More information

Attachment 1. Categorical Summary of BMP Performance Data for Solids (TSS, TDS, and Turbidity) Contained in the International Stormwater BMP Database

Attachment 1. Categorical Summary of BMP Performance Data for Solids (TSS, TDS, and Turbidity) Contained in the International Stormwater BMP Database Attachment 1 Categorical Summary of BMP Performance Data for Solids (TSS, TDS, and Turbidity) Contained in the International Stormwater BMP Database Prepared by Geosyntec Consultants, Inc. Wright Water

More information

Indicators of Hydrologic Alteration (IHA) software, Version 7.1

Indicators of Hydrologic Alteration (IHA) software, Version 7.1 Indicators of Hydrologic Alteration (IHA) software, Version 7.1 IHA Software Analyzes hydrologic characteristics and their changes over time. Computes 67 ecologicallyrelevant flow statistics using daily

More information

Introduction to Statistics. Measures of Central Tendency

Introduction to Statistics. Measures of Central Tendency Introduction to Statistics Measures of Central Tendency Two Types of Statistics Descriptive statistics of a POPULATION Relevant notation (Greek): µ mean N population size sum Inferential statistics of

More information

CHAPTER 4. Labeling Methods for Identifying Outliers

CHAPTER 4. Labeling Methods for Identifying Outliers CHAPTER 4 Labeling Methods for Identifying Outliers 4.1 Introduction Data mining is the extraction of hidden predictive knowledge s from large databases. Outlier detection is one of the powerful techniques

More information

Result Tables The Result Table, which indicates chromosomal positions and annotated gene names, promoter regions and CpG islands, is the best way for

Result Tables The Result Table, which indicates chromosomal positions and annotated gene names, promoter regions and CpG islands, is the best way for Result Tables The Result Table, which indicates chromosomal positions and annotated gene names, promoter regions and CpG islands, is the best way for you to discover methylation changes at specific genomic

More information

Lecture 10. Outline. 1-1 Introduction. 1-1 Introduction. 1-1 Introduction. Introduction to Statistics

Lecture 10. Outline. 1-1 Introduction. 1-1 Introduction. 1-1 Introduction. Introduction to Statistics Outline Lecture 10 Introduction to 1-1 Introduction 1-2 Descriptive and Inferential 1-3 Variables and Types of Data 1-4 Sampling Techniques 1- Observational and Experimental Studies 1-6 Computers and Calculators

More information

Overview. Presenter: Bill Cheney. Audience: Clinical Laboratory Professionals. Field Guide To Statistics for Blood Bankers

Overview. Presenter: Bill Cheney. Audience: Clinical Laboratory Professionals. Field Guide To Statistics for Blood Bankers Field Guide To Statistics for Blood Bankers A Basic Lesson in Understanding Data and P.A.C.E. Program: 605-022-09 Presenter: Bill Cheney Audience: Clinical Laboratory Professionals Overview Statistics

More information

Introduction to Statistics. Measures of Central Tendency and Dispersion

Introduction to Statistics. Measures of Central Tendency and Dispersion Introduction to Statistics Measures of Central Tendency and Dispersion The phrase descriptive statistics is used generically in place of measures of central tendency and dispersion for inferential statistics.

More information

36.2. Exploring Data. Introduction. Prerequisites. Learning Outcomes

36.2. Exploring Data. Introduction. Prerequisites. Learning Outcomes Exploring Data 6. Introduction Techniques for exploring data to enable valid conclusions to be drawn are described in this Section. The diagrammatic methods of stem-and-leaf and box-and-whisker are given

More information

Data Visualization. Non-Programming approach to Visualize Data

Data Visualization. Non-Programming approach to Visualize Data Data Visualization Non-Programming approach to Visualize Data Dr. Omer Ayoub Senior Data Scientist, House of Mathematical and Statistical Sciences, King Abdul Aziz Univerrsity, Jeddah, Saudi Arabia Dr.

More information

Agence Nationale de la Météorologie du Sénégal

Agence Nationale de la Météorologie du Sénégal Using INSTAT software with daily rainfall data to estimate best sowing dates for farmers in Saint Louis of Senegal By Elarion Sambou Agence Nationale de la Météorologie du Sénégal (A.N.A.M.S.) BP 8257,

More information

Super-marketing. A Data Investigation. A note to teachers:

Super-marketing. A Data Investigation. A note to teachers: Super-marketing A Data Investigation A note to teachers: This is a simple data investigation requiring interpretation of data, completion of stem and leaf plots, generation of box plots and analysis of

More information

Week 1 Tuesday Hr 2 (Review 1) - Samples and Populations - Descriptive and Inferential Statistics - Normal and T distributions

Week 1 Tuesday Hr 2 (Review 1) - Samples and Populations - Descriptive and Inferential Statistics - Normal and T distributions Week 1 Tuesday Hr 2 (Review 1) - Samples and Populations - Descriptive and Inferential Statistics - Normal and T distributions One of the primary goals of statistics is to make statistical inferences on

More information

11-1 Descriptive Statistics

11-1 Descriptive Statistics For Exercises 1-4, complete each step. a. Construct a histogram and use it to describe the shape of the distribution. b. Summarize the center and spread of the data using either the mean and standard deviation

More information

Outliers and Their Effect on Distribution Assessment

Outliers and Their Effect on Distribution Assessment Outliers and Their Effect on Distribution Assessment Larry Bartkus September 2016 Topics of Discussion What is an Outlier? (Definition) How can we use outliers Analysis of Outlying Observation The Standards

More information

GYANVRIKSH INTERACTIVE PVT LTD III Foor, QZ Plaza, Opp Haveli Restaurant, Kothaguda, Kondapur Hyderabad , &

GYANVRIKSH INTERACTIVE PVT LTD III Foor, QZ Plaza, Opp Haveli Restaurant, Kothaguda, Kondapur Hyderabad , & Introduction and Getting Started (Desktop & Server) Why Tableau? History of Tableau Products of Tableau Tableau Desktop Tableau Server Tableau Public Tableau Online Tableau Reader Architecture of Tableau

More information

Lab 1: A review of linear models

Lab 1: A review of linear models Lab 1: A review of linear models The purpose of this lab is to help you review basic statistical methods in linear models and understanding the implementation of these methods in R. In general, we need

More information

points in a line over time.

points in a line over time. Chart types Published: 2018-07-07 Dashboard charts in the ExtraHop system offer multiple ways to visualize metric data, which can help you answer questions about your network behavior. You select a chart

More information

Employee Demographics: Exploring similarities and differences in the private and public sector

Employee Demographics: Exploring similarities and differences in the private and public sector Employee Demographics: Exploring similarities and differences in the private and public sector Abrar Al-Hasan, Jorge Christian, Hideki Kakuma, Li Wei Chen, Yavor Nikolov Course BUDT733 Professor Galit

More information

STA Module 2A Organizing Data and Comparing Distributions (Part I)

STA Module 2A Organizing Data and Comparing Distributions (Part I) STA 2023 Module 2A Organizing Data and Comparing Distributions (Part I) 1 Learning Objectives Upon completing this module, you should be able to: 1. Classify variables and data as either qualitative or

More information

MAS187/AEF258. University of Newcastle upon Tyne

MAS187/AEF258. University of Newcastle upon Tyne MAS187/AEF258 University of Newcastle upon Tyne 2005-6 Contents 1 Collecting and Presenting Data 5 1.1 Introduction...................................... 5 1.1.1 Examples...................................

More information

OBJECTIVES & COURSE OUTLINE COM116 ADVANCED EXCEL REPORTING 17 - CPE HOURS COURSE OBJECTIVES

OBJECTIVES & COURSE OUTLINE COM116 ADVANCED EXCEL REPORTING 17 - CPE HOURS COURSE OBJECTIVES OBJECTIVES & COURSE OUTLINE COM116 ADVANCED EXCEL REPORTING 17 - CPE HOURS COURSE OBJECTIVES Learn to simplify the reporting process. Learn about the functions that work together to automate many of the

More information

Fundamental Elements of Statistics

Fundamental Elements of Statistics Fundamental Elements of Statistics Slide Statistics the science of data Collection Evaluation (classification, summary, organization and analysis) Interpretation Slide Population Sample Sample: A subset

More information

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 2 Organizing Data

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 2 Organizing Data STA 2023 Module 2 Organizing Data Rev.F08 1 Learning Objectives Upon completing this module, you should be able to: 1. Classify variables and data as either qualitative or quantitative. 2. Distinguish

More information

Forecasting the revenue generated by ATM / CARD / CASH for EZTABLE to identify the potential / actual revenue from different payment systems.

Forecasting the revenue generated by ATM / CARD / CASH for EZTABLE to identify the potential / actual revenue from different payment systems. Forecasting the revenue generated by ATM / CARD / CASH for EZTABLE to identify the potential / actual revenue from different payment systems. Team #6 Member Sean, Xie 103078517 Sam, Wang 103078502 Lydia,

More information

AP Statistics Scope & Sequence

AP Statistics Scope & Sequence AP Statistics Scope & Sequence Grading Period Unit Title Learning Targets Throughout the School Year First Grading Period *Apply mathematics to problems in everyday life *Use a problem-solving model that

More information

Jobs and Skills in the Bay Area

Jobs and Skills in the Bay Area Jobs and Skills in the Bay Area Project Goals For my final project, I decided to work with a dataset of jobs and skills data that was collected as part of my Master s Final Project. In our Master s Final

More information

Simply Prepared ecourse. Module 4, Chapter 1. Worksheet

Simply Prepared ecourse. Module 4, Chapter 1. Worksheet Simply Prepared ecourse Module 4, Chapter 1 Goal #1 Determine how much water your family needs to store in order to meet Fema s minimum suggestion. Use the chart below if needed. Number of people in family:

More information

Week 13, 11/12/12-11/16/12, Notes: Quantitative Summaries, both Numerical and Graphical.

Week 13, 11/12/12-11/16/12, Notes: Quantitative Summaries, both Numerical and Graphical. Week 13, 11/12/12-11/16/12, Notes: Quantitative Summaries, both Numerical and Graphical. 1 Monday s, 11/12/12, notes: Numerical Summaries of Quantitative Varibles Chapter 3 of your textbook deals with

More information

AcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved.

AcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved. AcaStat How To Guide AcaStat Software Copyright 2016, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents Frequencies... 3 List Variables... 4 Descriptives... 5 Explore Means...

More information

STAT 430 SAS Examples SAS1 ==================== ssh tap sas913 (or sas82), sas

STAT 430 SAS Examples SAS1 ==================== ssh tap sas913 (or sas82), sas STAT 430 SAS Examples SAS1 ==================== ssh abc@glue.umd.edu, tap sas913 (or sas82), sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm 1. How to get the SAS window. General Introduction.

More information

Exam 1 - Practice Exam (Chapter 1,2,3)

Exam 1 - Practice Exam (Chapter 1,2,3) Exam 1 - Practice Exam (Chapter 1,2,3) (Test Bank Odds Ch 1-3) VERSION 2 TRUE/FALSE. Write 'T' if the statement is true and 'F' if the statement is false. 1) Statistics is a discipline that involves tools

More information

Statistics is the area of Math that is all about 1: collecting, 2: analysing and 3: reporting about data.

Statistics is the area of Math that is all about 1: collecting, 2: analysing and 3: reporting about data. Unit 2 Statistical Analysis Statistics is the area of Math that is all about 1: collecting, 2: analysing and 3: reporting about data. 1: Collecting data Collecting data is done by performing Surveys, either

More information

Identify sampling methods and recognize biased samples

Identify sampling methods and recognize biased samples 9-1 Samples and Surveys Identify sampling methods and recognize biased samples Vocabulary population (p. 462) sample (p. 462) biased sample (p. 463) random sample (p. 462) systematic sample (p. 462) stratified

More information

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney Name: Intro to Statistics for the Social Sciences Lab Session: Spring, 2015, Dr. Suzanne Delaney CID Number: _ Homework #22 You have been hired as a statistical consultant by Donald who is a used car dealer

More information

charts Every social media manager needs Andy Cotgreave Social Content Manager

charts Every social media manager needs Andy Cotgreave Social Content Manager 4 charts Every social media manager needs Andy Cotgreave Social Content Manager 1. Slope chart for growth of followers/reach This chart shows the performance on social networks for 4 companies between

More information

Demand Estimation Sub Committee. Seasonal Normal Review 2020: Review of CWV Formula

Demand Estimation Sub Committee. Seasonal Normal Review 2020: Review of CWV Formula Demand Estimation Sub Committee Seasonal Normal Review 2020: Review of CWV Formula 11 th February 2019 Overview During 2019 DESC are reviewing / revising the Composite Weather Variable (CWV) formula AND

More information

Introduction to descriptive statistics

Introduction to descriptive statistics Introduction to descriptive statistics Illustrated with XLSTAT Jean Paul Maalouf jpmaalouf@xlstat.com linkedin.com/in/jean-paul-maalouf www.xlstat.com Oct. 12, 2016 1 PLAN XLSTAT: who are we? Statistics:

More information

Lean Six Sigma Green Belt Supplement

Lean Six Sigma Green Belt Supplement Problem Solving and Process Improvement Tools and Techniques Guide Book Lean Six Sigma Green Belt Supplement Max Zornada, University of Adelaide Executive Education 7 th Floor, 10 Pultney Street, Adelaide,

More information

Exam 1 - Practice Exam (Chapter 1,2,3)

Exam 1 - Practice Exam (Chapter 1,2,3) Exam 1 - Practice Exam (Chapter 1,2,3) (Test Bank Odds Ch 1-3) TRUE/FALSE. Write 'T' if the statement is true and 'F' if the statement is false. 1) Statistics is a discipline that involves tools and techniques

More information

DIGITAL VERSION. Microsoft EXCEL Level 2 TRAINER APPROVED

DIGITAL VERSION. Microsoft EXCEL Level 2 TRAINER APPROVED DIGITAL VERSION Microsoft EXCEL 2013 Level 2 TRAINER APPROVED Module 4 Displaying Data Graphically Module Objectives Creating Charts and Graphs Modifying and Formatting Charts Advanced Charting Features

More information

What is Statistics? Stat Camp for the MBA Program. Where Is Statistics Needed? Where Is Statistics Needed?

What is Statistics? Stat Camp for the MBA Program. Where Is Statistics Needed? Where Is Statistics Needed? Stat Camp for the MBA Program Daniel Solow Lecture 1 Exploratory Data Analysis What is Statistics? Statistics is the art and science of collecting, analyzing, presenting and interpreting data, which are

More information

Define Metrics in Reports

Define Metrics in Reports in Reports Metrics are used to build reports. They are the numerical values displayed in reports. They represent aggregations of facts and attributes. A metric is a simple or complex calculation applied

More information

CREATE INSTANT VISIBILITY INTO KEY MANUFACTURING METRICS

CREATE INSTANT VISIBILITY INTO KEY MANUFACTURING METRICS CREATE INSTANT VISIBILITY INTO KEY MANUFACTURING METRICS The QualityWorX Dashboard provides the most comprehensive, easy-to-use reporting platform for production and quality management in the industry.

More information

Unravelling Airbnb Predicting Price for New Listing

Unravelling Airbnb Predicting Price for New Listing Unravelling Airbnb Predicting Price for New Listing Paridhi Choudhary H John Heinz III College Carnegie Mellon University Pittsburgh, PA 15213 paridhic@andrew.cmu.edu Aniket Jain H John Heinz III College

More information

Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction

Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction Paper SAS1774-2015 Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction ABSTRACT Xiangxiang Meng, Wayne Thompson, and Jennifer Ames, SAS Institute Inc. Predictions, including regressions

More information

INDUSTRIAL VISUALIZATION

INDUSTRIAL VISUALIZATION 1 INDUSTRIAL VISUALIZATION U of T, Info Vis Oct 31, 2016 From CBS News Show 60 Minutes: http://www.cbsnews.com/news/new-search-engine-exposes-the-dark-web/ From Wired.com Contents 2015 Uncharted Software

More information

AP Statistics Part 1 Review Test 2

AP Statistics Part 1 Review Test 2 Count Name AP Statistics Part 1 Review Test 2 1. You have a set of data that you suspect came from a normal distribution. In order to assess normality, you construct a normal probability plot. Which of

More information

Test Name: Test 1 Review

Test Name: Test 1 Review Test Name: Test 1 Review 1. Determine whether the statement describes a population or a sample. The heights of all the corn plants at Mr. Lonardo's greenhouse. 2. Determine whether the statement describes

More information

Excel 2011 Charts - Introduction Excel 2011 Series The University of Akron. Table of Contents COURSE OVERVIEW... 2

Excel 2011 Charts - Introduction Excel 2011 Series The University of Akron. Table of Contents COURSE OVERVIEW... 2 Table of Contents COURSE OVERVIEW... 2 DISCUSSION... 2 OBJECTIVES... 2 COURSE TOPICS... 2 LESSON 1: CREATE A CHART QUICK AND EASY... 3 DISCUSSION... 3 CREATE THE CHART... 4 Task A Create the Chart... 4

More information

Topic 1: Descriptive Statistics

Topic 1: Descriptive Statistics Topic 1: Descriptive Statistics Econ 245_Topic 1 page1 Reference: N.C &T.: Chapter 1 Objectives: Basic Statistical Definitions Methods of Displaying Data Definitions: S : a numerical piece of information

More information

Online Appendix for Are Online and Offline Prices Similar? Evidence from Multi-Channel Retailers

Online Appendix for Are Online and Offline Prices Similar? Evidence from Multi-Channel Retailers Online Appendix for Are Online and Offline Prices Similar? Evidence from Multi-Channel Retailers Alberto Cavallo MIT & NBER This version: August 29, 2016 A Appendix A.1 Price-level Comparison with Amazon.com

More information

Statistics Definitions ID1050 Quantitative & Qualitative Reasoning

Statistics Definitions ID1050 Quantitative & Qualitative Reasoning Statistics Definitions ID1050 Quantitative & Qualitative Reasoning Population vs. Sample We can use statistics when we wish to characterize some particular aspect of a group, merging each individual s

More information

OAG Sampling Guide for Performance Audits

OAG Sampling Guide for Performance Audits OAG Sampling Guide for Performance Audits February 2006 Paul Pilon Ce document est également disponible en français. Contents may not be reproduced for commercial purposes, but any other reproduction,

More information

DATA MANAGEMENT BASICS. July 19, 2017 September 8, 2017

DATA MANAGEMENT BASICS. July 19, 2017 September 8, 2017 DATA MANAGEMENT BASICS July 19, 2017 September 8, 2017 OBJECTIVES Enthuse evaluative thinking Encourage performance measurement Build knowledge of concepts related to: o Data collection o Data analysis

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Univariate Statistics Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved Table of Contents PAGE Creating a Data File...3 1. Creating

More information

Mathematics in Contemporary Society - Chapter 5 (Spring 2018)

Mathematics in Contemporary Society - Chapter 5 (Spring 2018) City University of New York (CUNY) CUNY Academic Works Open Educational Resources Queensborough Community College Spring 218 Mathematics in Contemporary Society - Chapter (Spring 218) Patrick J. Wallach

More information

1. Contingency Table (Cross Tabulation Table)

1. Contingency Table (Cross Tabulation Table) II. Descriptive Statistics C. Bivariate Data In this section Contingency Table (Cross Tabulation Table) Box and Whisker Plot Line Graph Scatter Plot 1. Contingency Table (Cross Tabulation Table) Bivariate

More information

Most Common mistakes Data Scientist make

Most Common mistakes Data Scientist make Tomaž Kaštrun SQLSaturday Parma November 2018 Most Common mistakes Data Scientist make Organizers GetLatestVersion.it Sponsors About (2.0.1) BI Developer and data analyst SQL Server, SAS, R, Python, C#,

More information

Ch. 16 Exploring, Displaying, and Examining Data

Ch. 16 Exploring, Displaying, and Examining Data TECH 646 Analysis of Research in Industry and Technology PART IV Analysis and Presentation of Data: Data Presentation and Description; Exploring, Displaying, and Examining Data; Hypothesis Testing; Measures

More information