USING R IN SAS ENTERPRISE MINER EDMONTON USER GROUP

INTRODUCTION
Pat Valente, MA
Solution Specialist, Data Sciences at SAS.
Training in economics and statistics.
20 years of experience in business areas including finance, marketing, and logistics.
Well versed in the analytics and data challenges that exist throughout large organizations.
pat.valente@sas.com

AGENDA
SAS AND OPEN SOURCE
- Open source analytics in business
- Open Source Integration node
- Output modes
- Workflow examples to incorporate R models
- Careful considerations
- Questions

OPEN SOURCE INTEGRATION
THIS IS ACHIEVED WITH SAS ANALYTICS IN ACTION
SAS Analytics in Action:
- Data is about gathering data from the different data sources and locations, unifying it, and making it ready for modeling.
- Discovery is about having the flexibility to prototype analytical models to uncover business value.
- Deployment is about engineering enterprise-level solutions from those prototypes, with governance measures to ensure quality.

OPEN SOURCE INTEGRATION
SAS DOES IT BY INTEGRATING AND EXTENDING IT
- Integrate: Where do we integrate?
- Extend: Where do we extend?

THE OPEN SOURCE INTEGRATION NODE
- Enables the execution of R code within an Enterprise Miner workflow.
- Transfers data, metadata, and results automatically between Enterprise Miner and R.

THE OPEN SOURCE INTEGRATION NODE
- Facilitates multitasking in R.
- Generates text and graphical output from R.
- Integrates both supervised and unsupervised learning tasks.

PMML OUTPUT
Predictive Model Markup Language (PMML) is an open standard that enables certain R models to be translated into SAS DATA step code.
Currently supported R models include:
- Linear Models (lm)
- Multinomial Log-Linear Models (multinom (nnet))
- Generalized Linear Models (glm (stats))
- Decision Trees (rpart)
- Neural Networks (nnet)
- k-means Clustering (kmeans (stats))
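To make the idea of translating a model into score code concrete, here is a minimal sketch in plain R, run outside Enterprise Miner. It fits a glm (one of the listed model types) and shows that the fitted coefficients alone are enough to reproduce the model's predictions, which is the property that lets such a model be expressed as DATA step statements. The data set, the column names (income, age, purchased), and the generated DATA-step-style string are all hypothetical illustrations, not output of the Open Source Integration node.

```r
# Hypothetical training data standing in for the table Enterprise Miner passes to R.
set.seed(42)
train <- data.frame(
  income = rnorm(200, mean = 50, sd = 10),
  age    = rnorm(200, mean = 40, sd = 8)
)
train$purchased <- as.integer(train$income + rnorm(200, sd = 5) > 50)

# glm() from the stats package is on the slide's list of PMML-supported models.
model <- glm(purchased ~ income + age, data = train, family = binomial())
coefs <- coef(model)

# Score by hand from the coefficients alone, as exported score code would have to.
linear_predictor <- coefs[["(Intercept)"]] +
  coefs[["income"]] * train$income +
  coefs[["age"]] * train$age
manual_scores <- as.numeric(plogis(linear_predictor))

# The hand-computed scores should match R's own predictions.
stopifnot(isTRUE(all.equal(
  manual_scores,
  as.numeric(predict(model, type = "response"))
)))

# Illustration only: a hand-written DATA-step-style statement, not actual
# Enterprise Miner output.
sas_like_code <- sprintf(
  "_lp = %.6f + (%.6f)*income + (%.6f)*age; p_purchased = 1 / (1 + exp(-_lp));",
  coefs[["(Intercept)"]], coefs[["income"]], coefs[["age"]]
)
cat(sas_like_code, "\n")
```

Models whose predictions cannot be written down from a small, standard set of parameters in this way are the ones that would need a different output mode.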

PMML MODE

MERGE OUTPUT MODE
- Merge output mode enables integration with thousands of R packages that are not supported in PMML output mode.
- Variables created in R are merged with SAS Enterprise Miner data sources by the user.
- SAS DATA step code is not created.
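The merge step can be sketched in plain R as follows. This is an assumption-laden illustration run outside Enterprise Miner: loess() stands in for a model type that is not on the PMML list above, and the table em_data, the key column id, and the output column r_pred are hypothetical names. The point it shows is that the R side returns new variables keyed to the original rows, and those are joined back to the source data rather than converted to score code.

```r
# Hypothetical table standing in for an Enterprise Miner data source.
set.seed(1)
em_data <- data.frame(id = 1:50, x = runif(50, min = 0, max = 10))
em_data$y <- sin(em_data$x) + rnorm(50, sd = 0.2)

# loess() is used here only as an example of a model outside the PMML list.
fit <- loess(y ~ x, data = em_data)

# Variables created in R: one prediction per row, carried with its key.
r_output <- data.frame(id = em_data$id, r_pred = as.numeric(predict(fit)))

# Shuffle the rows to show that the join relies on the key, not on row order.
r_output <- r_output[sample(nrow(r_output)), ]

# Merge the R-created variable back onto the original data by key.
merged <- merge(em_data, r_output, by = "id")
head(merged)
```

Because no score code exists in this mode, new data would have to pass through R again to be scored.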

MERGE MODE

SOME PRECAUTIONS
Some items to consider when running R models in the Open Source Integration node:
- Missing values may be an issue
- Ensure categorical variables are not high in cardinality
- Memory issues
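The three precautions above can be checked with a few lines of base R before a model is fitted. This is only a sketch: the data frame df, its columns, and the max_levels threshold are hypothetical, and a suitable cardinality limit depends on the model and the data.

```r
# Hypothetical input table with a missing-value problem and an ID-like factor.
df <- data.frame(
  age         = c(34, NA, 51, 29, NA, 42),
  region      = factor(c("N", "S", "E", "W", "N", "S")),
  customer_id = factor(c("c1", "c2", "c3", "c4", "c5", "c6"))
)

# 1. Missing values: count NAs per column.
missing_counts <- colSums(is.na(df))
print(missing_counts)

# 2. Cardinality: count distinct levels of each categorical column.
is_categorical <- vapply(
  df,
  function(col) is.factor(col) || is.character(col),
  logical(1)
)
cardinality <- vapply(
  df[is_categorical],
  function(col) length(unique(col)),
  integer(1)
)
max_levels <- 5  # hypothetical threshold, chosen for this toy example
high_cardinality <- names(cardinality)[cardinality > max_levels]
print(high_cardinality)

# 3. Memory: report how much memory the table occupies in the R session.
print(format(object.size(df), units = "Kb"))
```

Columns flagged here can be imputed, grouped, or dropped in earlier Enterprise Miner nodes before the data reaches R.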

USE SAS TO INTEGRATE R
INTEGRATE R MODELS
Why?
- Model comparison
- Leverage R for new algorithms
- Ensemble modelling
- Generate score code
- Deploy R models
[Diagram label: SAS MODELS]
Copyright 2016, SAS Institute Inc. All rights reserved.

WHY BRING OPEN SOURCE TO SAS?
EXTEND: Model comparisons

QUESTIONS
sas.com

SUMMARY OF BENEFITS
Model Building in SAS Enterprise Miner
- Use the latest R packages for model building and comparison
Multi-Threaded Processing of Workflows
- SAS Enterprise Miner handles multi-threaded execution
- Use the Open Source Integration node in SAS Enterprise Miner in various flows simultaneously
Collaboration
- Many users can access the same Enterprise Miner diagram
Reusable Data Processing and Pre-Analysis
- Use Enterprise Miner functionality (e.g., data prep, pre-processing) in the nodes that precede the R models
Scoring
- Create supported models in R that can be converted into scoring code for operational deployment (e.g., in-database)