Exploratory Data Analysis with #PrDC16

Size: px
Start display at page:

Download "Exploratory Data Analysis with #PrDC16"

Transcription

1 Exploratory Data Analysis with #PrDC16

2 Motivation The ability to take data to be able to understand it, to process it, to extract value from it, to visualize it, to communicate it that s going to be a hugely important skill in the next decades, because now we really do have essentially free and ubiquitous data. So the complimentary scarce factor is the ability to understand that data and extract value from it. Hal Varian, Google s Chief Economist The McKinsey Quarterly, Jan 2009

3 Source: Dice 2014 Tech Salary Survey Results

4 A Flood of Data is Coming... Source: Source: Wikipedia Sink or Swim

5 Overview Introduction to R Data Munging Descriptive Statistics Data Visualization Beyond R and EDA

6 How Does This Apply to Me? As a software developer, I often: Perform log file analysis Analyze software performance Analyze code metrics for code quality Detect anomalies in source data Transform or clean data files to make them usable Help decision makers make decisions based on data

7 About Me Independent software consultant Education B.S. in Computer Science B.A. in Philosophy Community Pluralsight Author ASPInsider Public Speaker Open-Source Software

8 SPONSORS

9 Introduction to R

10 What is R? Open source Language and environment Numerical and graphical analysis Cross platform

11 What is R? Active development Large user community Modular and extensible extensions and best of all

12 FREE

13 FREE

14 Source:

15

16 Source:

17

18 Code Demo

19 Data Munging

20 Data Munging Transforming data Raw data to usable data Data must be cleaned first

21 Data Munging Tasks Renaming variables Data type conversion Encoding, decoding or recoding data Merging data sets Transforming data Handling missing data (imputing) Handling anomalous values

22 Loading Data in R File-based data Web-based data Databases Statistical data And many more CSV XML

23 Cleaning Data This step is often the: Most difficult Most time consuming TIP: Record all steps

24

25

26 Column with wrong name Rows with missing values Runtime column has units Revenue in multiple scales Wrong file format

27 Code Demo

28

29 Descriptive Statistics

30 Descriptive Statistics Describe data Provides a summary aka: Summary statistics Movie Runtime Statistic Value (minutes) Minimum 38 1 st Quartile 93 Median 101 Mean rd Quartile 113 Maximum 219

31 Statistical Terms Observations Variables Qualitative variable Quantitative variable ID Date Customer Product Quantity John Pizza John Soda Jill Salad Jill Milk Miko Pizza Miko Soda Sam Pizza Sam Milk 1

32 Number of Variables Types of Analysis Number of variables Univariate Bivariate Multivariate Qualitative Univariate Analysis Quantitative Univariate Analysis Type of variables Qualitative Quantitative Qualitative Bivariate Analysis Qualitative & Quantitative Bivariate Analysis Quantitative Bivariate Analysis Multivariate Analysis Type of Variable(s)

33 Univariate Analysis One variable Qualitative Frequency Quantitative Central tendency Dispersion

34 Bivariate Analysis Qualitative Joint frequency Quantitative Two variables Predictor Outcome Measures Covariance Correlation

35

36 Cowboys & Space Invaders: The Musical Extended Edition

37 Code Demo

38 Cowboys & Space Invaders: The Musical Extended Edition

39 Data Visualization

40 Data Visualization Visual data representation For human pattern recognition Map dimensions to visual characteristics

41 Data Visualization ID Date Customer Product Quantity John Pizza John Soda Jill Salad Jill Milk Miko Pizza Miko Soda Sam Pizza Sam Milk 1

42 Number of Variables Types of Data Visualizations Number of variables Univariate Bivariate Multivariate Qualitative Univariate Analysis Quantitative Univariate Analysis Type of variable(s) Qualitative Quantitative Qualitative Bivariate Analysis Qualitative & Quantitative Bivariate Analysis Quantitative Bivariate Analysis Multivariate Analysis Type of Variable(s)

43 Cowboys & Space Invaders: The Musical Extended Edition

44 Code Demo

45 The Warlord of the Rings : An Unexpected Adventure Feature Length PG

46 Beyond R and EDA

47 This is just the tip of the iceberg! This is just the tip of the iceberg!

48 Advanced Data Analysis with R Cluster Analysis Statistical Modeling Dimensionality Reduction Analysis of Variance (ANOVA)

49 Source: Nathan Yau (

50 Data Mining and Machine Learning with R

51 Alternatives to R for EDA

52 Source: Microsoft

53

54

55

56

57

58 Where to Go Next R website: R Studio: Revolutions: Flowing Data: R-Blogger:

59

60 Conclusion

61 Conclusion Introduction to R Data munging Descriptive statistics Data visualization Beyond R & EDA

62 Feedback Feedback is very important to me! One thing you liked? One thing I could improve?

63 Contact Info Matthew Renze Website:

Data Visualization with #KCDC

Data Visualization with #KCDC Data Visualization with R @MatthewRenze #KCDC Overview Introduction to R Intro to Data Visualization Types of Data Visualizations Visualizing One Variable Visualizing Two Variables Visualizing

More information

DevSci: Better Software Through Data #KCDC2018

DevSci: Better Software Through Data #KCDC2018 DevSci: Better Software Through Data Science @MatthewRenze #KCDC2018 What is data science? Why is it important? How do I get started? Job Postings for Data Scientists Top-paying Tech Skills Skill

More information

Data Science: The Big #SQLServerUserGroupDubai

Data Science: The Big #SQLServerUserGroupDubai Data Science: The Big Picture @MatthewRenze #SQLServerUserGroupDubai Job Postings for Data Scientists Top-paying Tech Skills Skill 2016 Change Skill 2016 Change Source: Dice Salary Survey 2017 What

More information

Introduction to Agile and Scrum

Introduction to Agile and Scrum Introduction to Agile and Scrum Matthew Renze @matthewrenze COMS 309 - Software Development Practices Purpose Intro to Agile and Scrum Prepare you for the industry Questions and answers Overview Intro

More information

Essentials of Marketing Research

Essentials of Marketing Research A Essentials of Marketing Research Third Edition Joseph F. Hair, Jr. Kennesaw State University Mary Wolfinbarger Celsi California State University-Long Beach David J. Ortinau University of South Florida

More information

Being Smarter with Azure Machine Learning and R

Being Smarter with Azure Machine Learning and R Being Smarter with Azure Machine Learning and R Leila Etaati SQL Saturday #464, Melbourne 20 th February 2016 Housekeeping Mobile Phones please set to stun during sessions Evaluations complete online to

More information

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration KnowledgeSTUDIO Advanced Modeling for Better Decisions Companies that compete with analytics are looking for advanced analytical technologies that accelerate decision making and identify opportunities

More information

Why Agile? The Economics, Psychology, and Science of Agile s #PrairieCode

Why Agile? The Economics, Psychology, and Science of Agile s #PrairieCode Why Agile? The Economics, Psychology, and Science of Agile s Success @MatthewRenze #PrairieCode Purpose Explain why Agile practices are so successful Insights from Economics, Psychology, and Science Top

More information

POST GRADUATE PROGRAM IN DATA SCIENCE & MACHINE LEARNING (PGPDM)

POST GRADUATE PROGRAM IN DATA SCIENCE & MACHINE LEARNING (PGPDM) OUTLINE FOR THE POST GRADUATE PROGRAM IN DATA SCIENCE & MACHINE LEARNING (PGPDM) Module Subject Topics Learning outcomes Delivered by Exploratory & Visualization Framework Exploratory Data Collection and

More information

Agile Software Requirements. Matthew Renze Iowa State University COMS 409 Software Requirements

Agile Software Requirements. Matthew Renze Iowa State University COMS 409 Software Requirements Agile Software Requirements Matthew Renze Iowa State University COMS 409 Software Requirements Purpose Introduce you to Agile software development Discuss Agile software requirements Overview What is Agile?

More information

DATA MANAGEMENT BASICS. July 19, 2017 September 8, 2017

DATA MANAGEMENT BASICS. July 19, 2017 September 8, 2017 DATA MANAGEMENT BASICS July 19, 2017 September 8, 2017 OBJECTIVES Enthuse evaluative thinking Encourage performance measurement Build knowledge of concepts related to: o Data collection o Data analysis

More information

Brief Contents. digitalisiert durch: IDS Basel Bern. Essentials of marketing research 2013

Brief Contents. digitalisiert durch: IDS Basel Bern. Essentials of marketing research 2013 Brief Contents Part The and Value of Marketing Research Information 1 1 Marketing Research for Managerial Decision Making 2 2 The Marketing Research Process and Proposais 24 Part 2 Designing the Marketing

More information

SIDDHARTH INSTITUTE OF ENGINEERING & TECHNOLOGY (AUTONOMOUS) :: PUTTUR Siddharth Nagar, Narayanavanam Road QUESTION BANK (DESCRIPTIVE)

SIDDHARTH INSTITUTE OF ENGINEERING & TECHNOLOGY (AUTONOMOUS) :: PUTTUR Siddharth Nagar, Narayanavanam Road QUESTION BANK (DESCRIPTIVE) S SIDDHARTH INSTITUTE OF ENGINEERING & TECHNOLOGY (AUTONOMOUS) :: PUTTUR Siddharth Nagar, Narayanavanam Road 517583 QUESTION BANK (DESCRIPTIVE) Subject with Code : Course & Branch: MBA IYear I-Sem Regulation:

More information

VIII. STATISTICS. Part I

VIII. STATISTICS. Part I VIII. STATISTICS Part I IN THIS CHAPTER: An introduction to descriptive statistics Measures of central tendency: mean, median, and mode Measures of spread, dispersion, and variability: range, variance,

More information

Access to data sources for official statistics in The Netherlands. Barteld Braaksma

Access to data sources for official statistics in The Netherlands. Barteld Braaksma Access to data sources for official statistics in The Netherlands Barteld Braaksma Mission of the European Statistical System We provide the European Union, the world and the public with independent high

More information

Knowledge-Based Society by KaaS (KaaS: Knowledge as a Service)

Knowledge-Based Society by KaaS (KaaS: Knowledge as a Service) -Based Society by KaaS (KaaS: as a Service) October 23, 2008 Systems Development Laboratory, Hitachi Ltd. Akira Maeda, Ph.D Contents 1. ICT trends and their implications 2. ICT as a knowledge generation

More information

Exceed your business with SharePoint Server 2010

Exceed your business with SharePoint Server 2010 Exceed your business with SharePoint Server 2010 Alexandre Mendeiros alexandre.mendeiros@microsoft.com Alexandre Mendeiros Premier Field Engineer for SQL Server & Business Intelligence area. Working with

More information

Azure ML Studio. Overview for Data Engineers & Data Scientists

Azure ML Studio. Overview for Data Engineers & Data Scientists Azure ML Studio Overview for Data Engineers & Data Scientists Rakesh Soni, Big Data Practice Director Randi R. Ludwig, Ph.D., Data Scientist Daniel Lai, Data Scientist Intersys Company Summary Overview

More information

C3 Products + Services Overview

C3 Products + Services Overview C3 Products + Services Overview AI CLOUD PREDICTIVE ANALYTICS IoT Table of Contents C3 is a Computer Software Company 1 C3 PaaS Products 3 C3 SaaS Products 5 C3 Product Trials 6 C3 Center of Excellence

More information

How the growth of R helps data-driven organizations succeed

How the growth of R helps data-driven organizations succeed How the growth of R helps data-driven organizations succeed David Smith Chief Community Officer 7 th China R User Conference, May 2014 Agenda Introduction History of R Growth of R Applications of R The

More information

Advanced Job Daimler. Julian Leweling, Daimler AG

Advanced Job Daimler. Julian Leweling, Daimler AG Advanced Job Analytics @ Daimler Julian Leweling, Agenda From Job Ads to Knowledge: Advanced Job Analytics @ Daimler About Why KNIME? Our Inspiration Use Case KNIME Walkthrough Application Next steps Advanced

More information

Comparing Ecological Communities Part One: Classification

Comparing Ecological Communities Part One: Classification Comparing Ecological Communities Part One: Classification Reading Assignment: Ch. 15, GSF Review Community Ecology Lecture, Sept. 17 And GSF, Chapter 9 10/14/09 1 What are three basic ways vegetation can

More information

Advanced Higher Statistics

Advanced Higher Statistics Advanced Higher Statistics 2018-19 Advanced Higher Statistics - 3 Unit Assessments - Prelim - Investigation - Final Exam (3 Hours) 1 Advanced Higher Statistics Handouts - Data Booklet - Course Outlines

More information

Alexander Klein. ETL meets Azure

Alexander Klein. ETL meets Azure Alexander Klein ETL meets Azure Thanks to our sponsors: Who am I? Independent BI Consultant > 15 years experience of SQL Server Focus on Microsoft BI Stack & AI & Azure a.klein@consulting-bi.de @SQL_Alex

More information

Data Analytics Training Program using

Data Analytics Training Program using Data Analytics Training Program using In exclusive association with 1200+ Trainings 20,000+ Participants 10,000+ Brands 45+ Countries [Since 2009] Training partner for Who Is This Course For? Programers

More information

IBM SPSS Statistics Editions

IBM SPSS Statistics Editions Editions Get the analytical power you need for better decision-making Why use IBM SPSS Statistics? is the world s leading statistical software. It enables you to quickly dig deeper into your data, making

More information

A Smart Tool to analyze the Salary trends of H1-B Workers

A Smart Tool to analyze the Salary trends of H1-B Workers 1 A Smart Tool to analyze the Salary trends of H1-B Workers Akshay Poosarla, Ramya Vellore Ramesh Under the guidance of Prof.Meiliu Lu Abstract Limiting the H1-B visas is bad news for skilled workers in

More information

SAS Human Capital Management 5.1. Administrator s Guide

SAS Human Capital Management 5.1. Administrator s Guide SAS Human Capital Management 5.1 Administrator s Guide The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2009. SAS Human Capital Management 5.1: Administrator s Guide.

More information

TWO DAYS WORKSHOP ON PARAMETRIC TESTS UNIVARIATE, BIVARIATE AND MULTIVARIATE DATA ANALYSIS (ANOVA FAMILY)

TWO DAYS WORKSHOP ON PARAMETRIC TESTS UNIVARIATE, BIVARIATE AND MULTIVARIATE DATA ANALYSIS (ANOVA FAMILY) DEPARTMENT OF COMMERCE SCHOOL OF MANAGEMENT PONDICHERRY UNIVERSITY, PUDUCHERRY TWO DAYS WORKSHOP ON PARAMETRIC TESTS UNIVARIATE, BIVARIATE AND MULTIVARIATE DATA ANALYSIS (ANOVA FAMILY) 5th & 6th March,

More information

+? Mean +? No change -? Mean -? No Change. *? Mean *? Std *? Transformations & Data Cleaning. Transformations

+? Mean +? No change -? Mean -? No Change. *? Mean *? Std *? Transformations & Data Cleaning. Transformations Transformations Transformations & Data Cleaning Linear & non-linear transformations 2-kinds of Z-scores Identifying Outliers & Influential Cases Univariate Outlier Analyses -- trimming vs. Winsorizing

More information

Multi-Dimensional Representations -- How Many Dimensions? Cluster Stack Visualization for Market Segmentation Analysis

Multi-Dimensional Representations -- How Many Dimensions? Cluster Stack Visualization for Market Segmentation Analysis Multi-Dimensional Representations -- How Many Dimensions? Cluster Stack Visualization for Market Segmentation Analysis William Wright Oculus Info Inc. An earlier version of this paper appeared in the Symposium

More information

Council for Best Practices Meeting

Council for Best Practices Meeting Independent Insurance Agents & Brokers of America, Inc. 2017 Best Practices Symposium Council for Best Practices Meeting Tom Doran Paula Williams January 2017 Agenda 2 2016 Best Practices Study Findings

More information

Lecture 1: What is Data Mining?

Lecture 1: What is Data Mining? Lecture 1: What is Data Mining? Alan Lee Department of Statistics STATS 784 Lecture 1 July 26, 2017 Page 1/34 Outline Housekeeping Introduction Data mining projects Statistical learning References Page

More information

20. Alternative Ranking Using Criterium Decision Plus (Multi-attribute Utility Analysis)

20. Alternative Ranking Using Criterium Decision Plus (Multi-attribute Utility Analysis) The selection of the preferred alternative, using a structured decision process, begins with creation of the decision hierarchy. The hierarchy is the clear organization of the goal, listing of categories

More information

Carry out rule-based statistical analysis

Carry out rule-based statistical analysis Overview This unit is about carrying out a range of different types of statistical analysis for internal and external clients under supervision. Applicable NOS Unit SSC/ N 2101 Unit Code SSC/ N 2101 Unit

More information

Putting the Open in OpenROAD. Joe Kronk Director, Engineering

Putting the Open in OpenROAD. Joe Kronk Director, Engineering 1 Putting the Open in OpenROAD Joe Kronk Director, Engineering Abstract This presentation explores Ingres Corporation plans for putting the "Open" into OpenROAD Open sourcing OpenROAD gives the community

More information

Integrated Predictive Maintenance Platform Reduces Unscheduled Downtime and Improves Asset Utilization

Integrated Predictive Maintenance Platform Reduces Unscheduled Downtime and Improves Asset Utilization November 2017 Integrated Predictive Maintenance Platform Reduces Unscheduled Downtime and Improves Asset Utilization Abstract Applied Materials, the innovator of the SmartFactory Rx suite of software products,

More information

Like This Facebook Training

Like This Facebook Training Like This Facebook Training Introduction Day One Like This Facebook Training The purpose of the Like This Facebook Training is to give you the tools and the skill to use Facebook successfully from a business

More information

MARK2052: Marketing Research

MARK2052: Marketing Research MARK2052: Marketing Research Lecture 1 Research Process 3 Evolution of Marketing and Marketing Research 3 Marketing Research Roles 3 The Role of Marketing Research and the Research Process (Chapter 1)

More information

MARKETING RESEARCH AN APPLIED APPROACH FIFTH EDITION NARESH K. MALHOTRA DANIEL NUNAN DAVID F. BIRKS. W Pearson

MARKETING RESEARCH AN APPLIED APPROACH FIFTH EDITION NARESH K. MALHOTRA DANIEL NUNAN DAVID F. BIRKS. W Pearson MARKETING RESEARCH AN APPLIED APPROACH FIFTH EDITION NARESH K. MALHOTRA DANIEL NUNAN DAVID F. BIRKS W Pearson Marlow, England London New York Boston San Francisco Toronto Sydney Dubai Singapore Hong Kong

More information

TIME SERIES ANALYSIS (TSA) AND DEEP LEARNING

TIME SERIES ANALYSIS (TSA) AND DEEP LEARNING INNOVATION PLATFORM WHITE PAPER 1 Exponential growth of IoT Big Data is coming. This paper highlights improvements in the predictive capability of time series analysis when combined with deep learning

More information

Predicting the present with Google Trends. -Hyunyoung Choi -Hal Varian

Predicting the present with Google Trends. -Hyunyoung Choi -Hal Varian Predicting the present with Google Trends -Hyunyoung Choi -Hal Varian Outline Problem Statement Goal Methodology Analysis and Forecasting Evaluation Applications and Examples Summary and Future work Problem

More information

OPEN STANDARDS BENCHMARKING PRODUCT DEVELOPMENT MEASURE LIST

OPEN STANDARDS BENCHMARKING PRODUCT DEVELOPMENT MEASURE LIST ABOUT APQC's MEASURE LIST The APQC Open Standards Benchmarking measure list concisely lists all of the measures currently available for a specific survey. These measures are organized by research area

More information

PL Best Practice Managing Inventor ipart/iassemblies in Vault PDM

PL Best Practice Managing Inventor ipart/iassemblies in Vault PDM PL20989 - Best Practice Managing Inventor ipart/iassemblies in Vault PDM Markus Koechl Solutions Engineer PDM PLM Autodesk Central Europe Peter van Avondt Technical Specialist - Data Management Autodesk

More information

Multivariate analysis of marketing data - applications for bricolage market

Multivariate analysis of marketing data - applications for bricolage market Bulletin of Transilvania University of Braşov Series V: Economic Sciences Vol. 9 (58) No. 2 2016 Multivariate analysis of marketing data - applications for bricolage market Mihai FÂNARU 1 Abstract: By

More information

WIRED FOR INNOVATION HOW IT IS TRANSFORMING COMPETITION

WIRED FOR INNOVATION HOW IT IS TRANSFORMING COMPETITION WIRED FOR INNOVATION HOW IT IS TRANSFORMING COMPETITION Erik Brynjolfsson MIT Sloan School and MIT Center for Digital Business http://digital.mit.edu/erik Copyright Erik Brynjolfsson. Reproduction prohibited

More information

Next Presentation: I, Robot : Skyrocket Analytics with SAS Process Automation Presenter: Brian Sun

Next Presentation: I, Robot : Skyrocket Analytics with SAS Process Automation Presenter: Brian Sun Next Presentation: I, Robot : Skyrocket Analytics with SAS Process Automation Presenter: Brian Sun Picture Courtesy of Waj Hoda Brian has master s degrees in business administration (University of British

More information

Interactive & Actionable Data Visualisation With Power View

Interactive & Actionable Data Visualisation With Power View Interactive & Actionable Data Visualisation With Power View Julie Koesmarno MVP, MCT, MCSE Data Platform, MCSE Business Intelligence SQL Server / BI Consultant @MsSQLGirl http://www.mssqlgirl.com Julie

More information

Regression. Jay Benesch. Brief history context of RF fault analysis regression methods via JMP and R summary

Regression. Jay Benesch. Brief history context of RF fault analysis regression methods via JMP and R summary Regression Jay Benesch Brief history context of RF fault analysis regression methods via JMP and R summary History I Ordinary least squares regression invented by Gauss ~1795 but first publication Legendre

More information

part 1 Marketing Research and the Research Process 1 part 5 Data Analysis and Interpretation 349 2, Determine Research Design 57

part 1 Marketing Research and the Research Process 1 part 5 Data Analysis and Interpretation 349 2, Determine Research Design 57 B R I E F C O N T E N T S part 1 Marketing Research and the Research Process 1 part 1 Marketing Research: It's Everywhere! 2 2 Alternative Approaches to Marketing Intelligence 18 3 The Research Process

More information

Data Warehouse Part 2

Data Warehouse Part 2 Data Warehouse Part 2 Goals Assess your current information environment to provide and understanding of what you need to do to construct and build your data warehouse. Any time you get information in less

More information

DATA ANALYTICS WITH R, EXCEL & TABLEAU

DATA ANALYTICS WITH R, EXCEL & TABLEAU Learn. Do. Earn. DATA ANALYTICS WITH R, EXCEL & TABLEAU COURSE DETAILS centers@acadgild.com www.acadgild.com 90360 10796 Brief About this Course Data is the foundation for technology-driven digital age.

More information

TABLE OF CONTENTS ! +

TABLE OF CONTENTS ! + TABLE OF CONTENTS MEANINGFUL DASHBOARDS... 2 What is a Dashboard?... 2 Where are Dashboards Used?... 4 The Market View... 6 What Should You Look For?... 7 The Key Benefits... 9 Creating Meaningful Dashboards

More information

Building the In-Demand Skills for Analytics and Data Science Course Outline

Building the In-Demand Skills for Analytics and Data Science Course Outline Day 1 Module 1 - Predictive Analytics Concepts What and Why of Predictive Analytics o Predictive Analytics Defined o Business Value of Predictive Analytics The Foundation for Predictive Analytics o Statistical

More information

SQLStarter Intro to Data Science. Dave

SQLStarter Intro to Data Science. Dave SQLStarter Dave Leininger @DaveLeininger SQLStarter Dave Leininger WHO IS FUSION ALLIANCE? SQLStarter: What is Data Science? Why would I want to be a Data Scientist? What are the tools and technologies?

More information

Turning Insight Into Action

Turning Insight Into Action Turning Insight Into Action Presented by Mike Purcell Marketing Manager The explosion in available data makes consolidating decision systems an increasing challenge. However, too much data can be overwhelming

More information

Experimental Design Day 2

Experimental Design Day 2 Experimental Design Day 2 Experiment Graphics Exploratory Data Analysis Final analytic approach Experiments with a Single Factor Example: Determine the effects of temperature on process yields Case I:

More information

Data Mining Applications with R

Data Mining Applications with R Data Mining Applications with R Yanchang Zhao Senior Data Miner, RDataMining.com, Australia Associate Professor, Yonghua Cen Nanjing University of Science and Technology, China AMSTERDAM BOSTON HEIDELBERG

More information

Optimization and Visualization of Opinion Mining and Sentiments In Tourism Dashboard: A Case of Statics Centre Abu-Dhabi (SCAD)

Optimization and Visualization of Opinion Mining and Sentiments In Tourism Dashboard: A Case of Statics Centre Abu-Dhabi (SCAD) Optimization and Visualization of Opinion Mining and Sentiments In Tourism Dashboard: A Case of Statics Centre Abu-Dhabi (SCAD) Hamed Saif Albusaidi 1, Prakash Kumar Udupi 2, Vishal Dattana 3 1 Student,

More information

Sunnie Chung. Cleveland State University

Sunnie Chung. Cleveland State University Sunnie Chung Cleveland State University Data Scientist Big Data Processing Data Mining 2 INTERSECT of Computer Scientists and Statisticians with Knowledge of Data Mining AND Big data Processing Skills:

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Univariate Statistics Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved Table of Contents PAGE Creating a Data File...3 1. Creating

More information

Brian Macdonald Big Data & Analytics Specialist - Oracle

Brian Macdonald Big Data & Analytics Specialist - Oracle Brian Macdonald Big Data & Analytics Specialist - Oracle Improving Predictive Model Development Time with R and Oracle Big Data Discovery brian.macdonald@oracle.com Copyright 2015, Oracle and/or its affiliates.

More information

Evaluating Open Source Business Intelligence Tools using OSSpal Methodology

Evaluating Open Source Business Intelligence Tools using OSSpal Methodology Evaluating Open Source Business Intelligence Tools using OSSpal Methodology Tânia Ferreira 1, Isabel Pedrosa 2 and Jorge Bernardino 1,3 1 Polytechnic of Coimbra, Institute of Engineering of Coimbra ISEC,

More information

Motivating people: Getting beyond money

Motivating people: Getting beyond money 1 NOVEMBER 2009 Motivating people: Getting beyond money The economic slump offers business leaders a chance to more effectively reward talented employees by emphasizing nonfinancial motivators rather than

More information

A Visual Exploration Approach to Project Portfolio Management

A Visual Exploration Approach to Project Portfolio Management A Visual Exploration Approach to Project Portfolio Management Extended Dissertation Abstract for AMCIS 2007 Doctoral Consortium Guangzhi Zheng Department of Computer Information Systems J. Mack Robinson

More information

7 Steps to Data Blending for Predictive Analytics

7 Steps to Data Blending for Predictive Analytics 7 Steps to Data Blending for Predictive Analytics Evolution of Analytics Just as the volume and variety of data has grown significantly in recent years, so too has the expectations for analytics. No longer

More information

2015 IBM Corporation. IBM BigFix Inventory. Advantages of migrating from Software Use Analysis 2.2 to BigFix Inventory 9.x

2015 IBM Corporation. IBM BigFix Inventory. Advantages of migrating from Software Use Analysis 2.2 to BigFix Inventory 9.x 2015 IBM Corporation IBM BigFix Inventory Advantages of migrating from Software Use Analysis 2.2 to BigFix Inventory 9.x END OF SUPPORT Software Use Analysis 2.2 will withdraw support in April 2017. After

More information

CREDIT RISK MODELLING Using SAS

CREDIT RISK MODELLING Using SAS Basic Modelling Concepts Advance Credit Risk Model Development Scorecard Model Development Credit Risk Regulatory Guidelines 70 HOURS Practical Learning Live Online Classroom Weekends DexLab Certified

More information

Data Science 101: an introduction

Data Science 101: an introduction Data Science 101: an introduction Data Science Team Stanford University, Department of Statistics Sexy jobs I keep saying the sexy job in the next ten years will be statisticians. People think I m joking,

More information

Introduction to Statistics I

Introduction to Statistics I Introduction to Statistics I Keio University, Faculty of Economics Course Description and Introduction Simon Clinet (Keio University) Intro to Stats September 27, 2018 1 / 26 General information Instructor

More information

EDUCATION SECTOR ACTIVITIES

EDUCATION SECTOR ACTIVITIES S.No Activity Description EDUCATION SECTOR ACTIVITIES 1 Education Advisor Assists Higher Education providers, inside and outside the Free Zone, on various programs requirements, governmental collaborations

More information

Data Scientists in Software Teams: State of Art and Challenges [IEEE Transactions on Software Engineering, ICSE 2018 Journal First]

Data Scientists in Software Teams: State of Art and Challenges [IEEE Transactions on Software Engineering, ICSE 2018 Journal First] Data Scientists in Software Teams: State of Art and Challenges [IEEE Transactions on Software Engineering, ICSE 2018 Journal First] Miryung Kim University of California, Los Angeles Thomas Zimmermann,

More information

Business Model Canvas

Business Model Canvas Business Model Canvas 1-1 Customer Segments Defines the different groups of people or organizations to serve Separate segments if: Needs require and justify distinct offer Reached through different channels

More information

Operationalizing Your Analytics: Closing the gaps in the analytic value chain

Operationalizing Your Analytics: Closing the gaps in the analytic value chain TDWI Savannah Solution Summit October 2017 Operationalizing Your Analytics: Closing the gaps in the analytic value chain James Taylor, CEO tdwi.org/savannah17 @jamet123 #decisionmgt 2017 Decision Management

More information

IBM SPSS Statistics. Editions. Get the analytical power you need for better decision making. Why use IBM SPSS Statistics? IBM Analytics Solution Brief

IBM SPSS Statistics. Editions. Get the analytical power you need for better decision making. Why use IBM SPSS Statistics? IBM Analytics Solution Brief Editions Get the analytical power you need for better decision making Why use IBM SPSS Statistics? is the world s leading statistical software. It enables you to quickly dig deeper into your data, making

More information

Mining Time Series. Feb/13/2012

Mining Time Series. Feb/13/2012 Mining Time Series Feb/13/2012 What is Time Series Time series is a sequence of data points, measured typically at successive time points spaced at uniform time intervals. Time series mining comprises

More information

PASS4TEST. IT Certification Guaranteed, The Easy Way! We offer free update service for one year

PASS4TEST. IT Certification Guaranteed, The Easy Way!  We offer free update service for one year PASS4TEST \ We offer free update service for one year Exam : MB2-713 Title : Microsoft Dynamics CRM 2016 Sales Vendor : Microsoft Version : DEMO Get Latest & Valid MB2-713 Exam's Question and Answers 1

More information

5 CHAPTER: DATA COLLECTION AND ANALYSIS

5 CHAPTER: DATA COLLECTION AND ANALYSIS 5 CHAPTER: DATA COLLECTION AND ANALYSIS 5.1 INTRODUCTION This chapter will have a discussion on the data collection for this study and detail analysis of the collected data from the sample out of target

More information

Taxonomy Governance Best Practices

Taxonomy Governance Best Practices November 5, 2018 Copyright 2018 Taxonomy Strategies LLC and Semantic Staffing. All rights reserved. Taxonomy Strategies semantic STAFFING Taxonomy Governance Best Practices Joseph Busch, Principal Analyst

More information

Andrew Macdonald ILOG Technical Professional 2010 IBM Corporation

Andrew Macdonald ILOG Technical Professional 2010 IBM Corporation The value of IBM WebSphere ILOG BRMS Understanding the value of IBM WebSphere ILOG Business Rule Management Systems (BRMS). BRMS can be used to implement and manage change in a safe and predictable way

More information

Pre- launch Buzz Makes or Breaks Your App s Success

Pre- launch Buzz Makes or Breaks Your App s Success Pre- launch Buzz Makes or Breaks Your App s Success Product lifespan has become shorter and shorter in recent years, as manufacturers and developers struggle to give consumers the next big thing after

More information

Maximizing the potential of manufacturing data to advance biopharmaceutical production

Maximizing the potential of manufacturing data to advance biopharmaceutical production Maximizing the potential of manufacturing data to advance biopharmaceutical production Anja Zgodic 1 Ashley Lee, Paul Stey, Andras Zsom, Sam Bell 2 Tom Mistretta, Patrick Gammell 1 1. Amgen Process Development

More information

BIG DATA SKILLS: CHALLENGES FOR THE UNIVERSITY WORLD CREATING A NEW GENERATION OF DATA SCIENTISTS. Massimiliano Marcellino Bocconi University

BIG DATA SKILLS: CHALLENGES FOR THE UNIVERSITY WORLD CREATING A NEW GENERATION OF DATA SCIENTISTS. Massimiliano Marcellino Bocconi University BIG DATA SKILLS: CHALLENGES FOR THE UNIVERSITY WORLD CREATING A NEW GENERATION OF DATA SCIENTISTS Massimiliano Marcellino Bocconi University CES 2017 Seminar on the new generation of statisticians and

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

Q-Rapids: Quality-aware Rapid Software Development. Barcelona, June 2017

Q-Rapids: Quality-aware Rapid Software Development. Barcelona, June 2017 Barcelona, June 2017 The GESSI research group of the Universitat Politècnica de Catalunya (UPC) conducts research in many fields of software engineering, with particular emphasis on: Quality-aware Rapid

More information

Ch. 16 Exploring, Displaying, and Examining Data

Ch. 16 Exploring, Displaying, and Examining Data TECH 646 Analysis of Research in Industry and Technology PART IV Analysis and Presentation of Data: Data Presentation and Description; Exploring, Displaying, and Examining Data; Hypothesis Testing; Measures

More information

Visualizing Crowdfunding

Visualizing Crowdfunding Visualizing Crowdfunding Alexander Chao UC Berkeley B.A. Statistics 2015 2601 Channing Way Berkeley, Ca 94704 alexchao56@gmail.com ABSTRACT With websites such as Kickstarter and Indiegogo rising in popular

More information

Introduction to Big Data. By Dinesh Amatya

Introduction to Big Data. By Dinesh Amatya Introduction to Big Data By Dinesh Amatya Big Data Size of data? Technology? A method? Big data Big data refers to data sets whose size is beyond the ability of typical database software tools to capture,

More information

iq-training 2014 Course Catalog

iq-training 2014 Course Catalog iq-training 2014 Course Catalog Qualex's iq-training is the solution to your training needs; it is a solution for organizations that, before or after implementing SAS solutions, need additional support

More information

-SQA-SCOTTISH QUALIFICATIONS AUTHORITY HIGHER NATIONAL UNIT SPECIFICATION GENERAL INFORMATION

-SQA-SCOTTISH QUALIFICATIONS AUTHORITY HIGHER NATIONAL UNIT SPECIFICATION GENERAL INFORMATION -SQA-SCOTTISH QUALIFICATIONS AUTHORITY HIGHER NATIONAL UNIT SPECIFICATION GENERAL INFORMATION -Unit number- 7482149 -Unit title- INTRODUCTION TO STATISTICS FOR QUALITY ANALYSIS -Superclass category- -Date

More information

Application of Statistical Methods to Analyze Groundwater Quality

Application of Statistical Methods to Analyze Groundwater Quality Journal of Earth Sciences and Geotechnical Engineering, vol. 1, no.1, 2011, 1-7 ISSN: 1792-9040(print), 1792-9660 (online) International Scientific Press, 2011 Application of Statistical Methods to Analyze

More information

THE IMPACTS OF OPEN DATA: TOWARDS EX POST ASSESSMENT

THE IMPACTS OF OPEN DATA: TOWARDS EX POST ASSESSMENT 1 THE IMPACTS OF OPEN DATA: TOWARDS EX POST ASSESSMENT Heli Koski The Research Institute of the Finnish Economy 25/11/2015 Share-PSI 2.0 Workshop, Berlin The open-data revolution has not lived up to expectations.

More information

Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction

Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction Paper SAS1774-2015 Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction ABSTRACT Xiangxiang Meng, Wayne Thompson, and Jennifer Ames, SAS Institute Inc. Predictions, including regressions

More information

"The Next Era of Corporate Disclosure". Sabine Content, Deputy Director Corporate & Stakeholder Relations

The Next Era of Corporate Disclosure. Sabine Content, Deputy Director Corporate & Stakeholder Relations "The Next Era of Corporate Disclosure". Sabine Content, Deputy Director Corporate & Stakeholder Relations Agenda i. About GRI ii. Relevance of Sustainability and Reporting iii. Sustainability reporting

More information

Sponsorship Opportunities. San Francisco March 30- April 1, 2015

Sponsorship Opportunities. San Francisco March 30- April 1, 2015 Sponsorship Opportunities San Francisco March 30- April 1, 2015 1 Details WHO Managers Project Leaders, Directors, CXOs, Vice Presidents, Investors and Decision Makers of any kind involved with analytics

More information

Power BI for Data Science Integration and exploration capabilities

Power BI for Data Science Integration and exploration capabilities Power BI for Data Science Integration and exploration capabilities J AV I E R G U I L L E N C H A R LOT T E B I G R O U P C H A R LOT T E, N C - 2018 Power BI for Data Science exploration Different mindset

More information

Analytics for Banks. September 19, 2017

Analytics for Banks. September 19, 2017 Analytics for Banks September 19, 2017 Outline About AlgoAnalytics Problems we can solve for banks Our experience Technology Page 2 About AlgoAnalytics Analytics Consultancy Work at the intersection of

More information

2016 INFORMS International The Analytics Tool Kit: A Case Study with JMP Pro

2016 INFORMS International The Analytics Tool Kit: A Case Study with JMP Pro 2016 INFORMS International The Analytics Tool Kit: A Case Study with JMP Pro Mia Stephens mia.stephens@jmp.com http://bit.ly/1uygw57 Copyright 2010 SAS Institute Inc. All rights reserved. Background TQM

More information

Engineering Your Startup to Innovate at Scale. Randy linkedin.com/in/randyshoup

Engineering Your Startup to Innovate at Scale. Randy linkedin.com/in/randyshoup Engineering Your Startup to Innovate at Scale Randy Shoup @randyshoup linkedin.com/in/randyshoup Background VP Engineering at Stitch Fix o Combining Art and Science to revolutionize apparel retail Consulting

More information

6 Levels of Business Intelligence Value

6 Levels of Business Intelligence Value 6 Levels of Business Intelligence Value 6 Levels of BI Value In today s hyper-competitive market environment, business intelligence continues to be an area of investment and interest for businesses. The

More information