SAS Python R ** Best-of-all-Worlds Workshop ** Larry Orimoloye Global Technology Practice

Similar documents
SAS Machine Learning and other Analytics: Trends and Roadmap. Sascha Schubert Sberbank 8 Sep 2017

Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform

GOVERNMENT ANALYTICS LEADERSHIP FORUM SAS Canada & The Institute of Public Administration of Canada. April 26 The Shaw Centre

Deep Learning For Vision Analytics. SAS User Group Malaysia 3 rd May, 2018

SAS Viya. Примеры проектов на новой платформе. Copyright SAS Institute Inc. All rights reserved.

VDMML Enablement Session Data Science Jam Sessions

Brian Macdonald Big Data & Analytics Specialist - Oracle

Microsoft Azure Essentials

SAP Predictive Analytics Suite

USING R IN SAS ENTERPRISE MINER EDMONTON USER GROUP

Introducing Analytics with SAS Enterprise Miner. Matthew Stainer Business Analytics Consultant SAS Analytics & Innovation practice

Machine Learning For Enterprise: Beyond Open Source. April Jean-François Puget

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration

Analytics in Action transforming the way we use and consume information

Data Analytics for Semiconductor Manufacturing The MathWorks, Inc. 1

C3 Products + Services Overview

BRINGING AI TO ALL DEVELOPERS

Advanced Analytics in Azure

Preparing for the analytics economy Why and how Telecom operators should adapt

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

SAS FORUM RUSSIA Welcome

PORTFOLIO AND TECHNOLOGY DIRECTION ARMISTEAD SAPP & RANDY GUARD

Modern Analytics Architecture

Copyr i g ht 2012, SAS Ins titut e Inc. All rights res er ve d. ENTERPRISE MINER: ANALYTICAL MODEL DEVELOPMENT

Azure ML Studio. Overview for Data Engineers & Data Scientists

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

Transforming Analytics with Cloudera Data Science WorkBench

NICE Customer Engagement Analytics - Architecture Whitepaper

Context. The NEW data services from UST Global UST GLOBAL - A UNIQUE PARTNER. UST Global Data Services March 2018!1

Changing The Business landscape SAS and Open Source, Better Together. Dr Mark Chia, Head of Advanced Analytics, SAS

Mass-Scale, Automated Machine Learning and Model Deployment Using SAS Factory Miner and SAS Decision Manager

SAS BIG DATA ANALYTICS INCREASING YOUR COMPETITIVE EDGE

Data Analytics with MATLAB Adam Filion Application Engineer MathWorks

Data Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Amsterdam. (technical) Updates & demonstration. Robert Voermans Governance architect

COGNITIVE QA: LEVERAGE AI AND ANALYTICS FOR GREATER SPEED AND QUALITY. us.sogeti.com


Intel Public Sector 3

Sunnie Chung. Cleveland State University

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper

Azure Data Analytics & Machine Learning Seminar. Daire Cunningham: BI Practice Area Manager

Intel s Machine Learning Strategy. Gary Paek, HPC Marketing Manager, Intel Americas HPC User Forum, Tucson, AZ April 12, 2016

Sascha Schubert Product Manager Data Mining SAS EMEA Copyright 2005, SAS Institute Inc. All rights reserved.

Deep Dive into High Performance Machine Learning Procedures. Tuba Islam, Analytics CoE, SAS UK

2015 The MathWorks, Inc. 1

Making your Data Ready for AI

Cloudera, Inc. All rights reserved.

Machine Learning 101

DLT AnalyticsStack. Powering big data, analytics and data science strategies for government agencies

Making Data Science Simple

C3 IoT: Products + Services Overview

C3 IoT: Products + Services Overview

Business is being transformed by three trends

AGENDA. Oslo 30/

What s New. Bernd Wiswedel KNIME KNIME AG. All Rights Reserved.

Integrating Artificial Intelligence and Simulation Modeling

Enterprise-Scale MATLAB Applications

SmartCare. SPSS Workshop. Rick Durham - North American Advanced Analytics Channel Team IBM Corporation. Date: 5/28/2014

Data Science is a Team Sport and an Iterative Process

API Economy - making APIs part of new business models

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

KnowledgeSEEKER POWERFUL SEGMENTATION, STRATEGY DESIGN AND VISUALIZATION SOFTWARE

Is Machine Learning the future of the Business Intelligence?

INTRODUCTION TO R FOR DATA SCIENCE WITH R FOR DATA SCIENCE DATA SCIENCE ESSENTIALS INTRODUCTION TO PYTHON FOR DATA SCIENCE. Azure Machine Learning

Building data-driven applications with SAP Data Hub and Amazon Web Services

Our Emerging Offerings Differentiators In-focus

Databricks Cloud. A Primer

TDWI Analytics Fundamentals. Course Outline. Module One: Concepts of Analytics

By 2020, more than half of major new business processes and systems will incorporate some element of the IoT.

«ARE WE DISRUPTING OURSELVES?» Jörg Besier Managing Director, Accenture

Digital Transformation 2.0

Cloudera Data Science and Machine Learning. Robin Harrison, Account Executive David Kemp, Systems Engineer. Cloudera, Inc. All rights reserved.

Who is Databricks? Today, hundreds of organizations around the world use Databricks to build and power their production Spark applications.

What s new in MATLAB and Simulink

DATA SCIENCE: HYPE AND REALITY PATRICK HALL

Analytics in the Digital Economy data, experience, ideas & people. Juergen Hagedorn, Viktor Kehayov Product Management, SAP Analytics March 2017

Goya Deep Learning Inference Platform. Rev. 1.2 November 2018

How to develop Data Scientist Super Powers! Using Azure from R to scale and persist analytic workloads.. Simon Field

Big Data Analytics met Hadoop

What s new in Machine Learning across the Splunk Portfolio

National Occupational Standard

WELCOME TO SAS FOR MARKETING

Azure ML Data Camp. Ivan Kosyakov MTC Architect, Ph.D. Microsoft Technology Centers Microsoft Technology Centers. Experience the Microsoft Cloud

Insight is 20/20: The Importance of Analytics

Hadoop and Analytics at CERN IT CERN IT-DB

FINANCE + DEEP LEARNING SKYMIND / DEEPLEARNING4J 2015

IBM SPSS & Apache Spark

Architecture Overview for Data Analytics Deployments

Analytics for All Your Data: Cloud Essentials. Pervasive Insight in the World of Cloud

Deep Learning Acceleration with MATRIX: A Technical White Paper

Application Performance Management for Microsoft Azure and HDInsight

Salford Predictive Modeler. Powerful machine learning software for developing predictive, descriptive, and analytical models.

Oracle Enterprise Data Quality Product Roadmap and Statement of Direction. October 2016

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Data Driven Culture Enabled by Data Management

SAP Predictive Analytics Hands-On. Andreas Forster December 2015

Machine Learning 2.0 for the Uninitiated A PRACTICAL GUIDE TO IMPLEMENTING ML 2.0 SYSTEMS FOR THE ENTERPRISE.

Machine Learning with Google Cloud Platform

RAPIDS GPU POWERED MACHINE LEARNING

Transcription:

SAS Python R ** Best-of-all-Worlds Workshop ** Larry Orimoloye Global Technology Practice

Agenda

AGENDA SAS Big data analytics platform (9:15am 10:30am) Introduction SAS Analytics Lifecycle process SAS Viya SAS Viya Data Mining and Machine Learning Demo Break(30Minute) SAS Viya API (Python) (11am 12.15pm) Python crash course (Optional) How to connect to CAS & Load data using jupyter notebook Working with CasTable using jupyter notebook Using CASTable objects like a DataFrame Data exploration and summary statistics SAS VIYA & Python model: Best of both worlds Break(15 Minute)

AGENDA SAS Viya API (R ) (12.30pm 1.30pm) R crash course (Optional) How to connect to CAS & Load data using R studio Working with CasTable using R Studio Using CASTable objects like a DataFrame Data exploration and summary statistics SAS VIYA & R model: Best of both worlds Lunch (90 Minutes) Advanced topics (Optional) - (3pm 4pm) AI / NLP Questions (4pm 5pm)

The Analytical Lifecycle Data Science Explorative Innovation New Data Experiments Explore Prepare Discovery Ask Deploy Deployment Act Operations Automation Robust Actions Decisions Data Scientist, Business Units Model Evaluate IT, Business Analyst, Business Units DATA Data Warehouse / Data Lake DATA

Architecture Revolution: key drivers Powerful Adaptive Open Unified Improve time-to-answer with faster deployment and provisioning Self-Service with API and APPS Innovative algorithms Optimised for in-memory, instream, in-hadoop, in-db, incloud, in-device Built to be elastic and scalable Support industry open standards for Cloud and onprem deployment Consumption based pricing for selected product & solutions Access through SAS languages and non-sas languages 3 rd party applications integration via API & Services One centralised analytic environment Analytical lifecycle end-to-end Administration, management, delivery and execution all integrated Company Confidential For Internal Use Only

Parallel & Serial, Pub / Sub, Web Services, MQs Source-based Engines Customer Intelligence Analytics In-Stream In-Hadoop In-Database In-Memory Runtime Engine Cloud Analytics Services (CAS) UA UAA A UAA Microservices Data Source M gmt Fo lders etc BI GUIs C A S Mgmt Log An alytics GUIs Data Mgmt GUIs Q uery Gen En v M gr M odel M gmt Au d it Solutions APIs Risk Management! Fraud and Security Intelligence Business Visualization Data Management Platform

Advanced Analytics Products Suite on SAS Viya Data Mining & Machine Learning Text Analytics Forecasting Econometrics Optimization Interactive Programmatic Automated Modern Approachable

SAS Visual Data Mining and Machine Learning is an end-to-end machine learning solution on the most advanced analytics platform to date SAS Viya.

Visual Data Mining and Machine Learning Visual Analytics GUI Visual Analytics Visual Statistics Visual Data Mining and Machine Learning SAS Studio UI Baseline Procedures VS Procedures VDMML Procedures Python, Java, Lua, REST APIs Baseline Action sets VS Action sets VDMML Action sets Requires Visual Analytics Requires Visual Statistics

SAS Visual Data Mining and Machine Learning Assess Supervised Models Creates score code Decision Trees Generalized Linear Models K-means and K-modes Clustering Linear Regression Logistic Regression Nonlinear Regression Ordinary Least Squares Regression Partial Least Squares Regression Principal Component Analysis Quantile Regression Factorization Machines Gradient Boosting Neural Networks Random Forest Support Vector Machines Multi Threaded Data Step DS2 SQL Variable Binning Variable Cardinality Analysis Sampling and Partitioning Missing Value Imputation Variable Selection Transpose Image processing Network Analytics/Community Detection Boolean Rules Text Mining Autotuning

SAS Viya Data Mining and Machine Learning New Capabilities Gradient Boosting New distributed algorithm all new implementation Factorization Machines New distributed algorithm for large scale sparse data Recommendation Engine - primary business driver Auto-tuning capabilities Available for complex algorithms Decision Trees, Random Forests, Neural Networks, Gradient Boosting, Support Vector Machines, Factorization Machines High Frequency Analytics Support Vector Data Description, Robust PCA, Moving Windows PCA, Short Time Fourier Transform Image Analysis

SAS VIYA DATA MINING AND MACHINE LEARNING AUTOTUNING HYPERPARAMETERS Highly data dependent Related to model complexity Decision Trees Neural Networks Gradient Boosting Random Forest Support Vector Machines Factorization Machines Auto Tuning: Automate hyperparameters search and find the optimal set Maximize predictability on independent data set Aims to avoid over-fitting by controlling model complexity Creates more accurate models faster vs hand tuning SAS auto tuning leverages world class SAS optimization engines

Auto-tuning Machine Learning Models Approaches SAS supports four methods: Random search Latin hypercube sampling (LHS) Bayesian search Genetic algorithm (LHS + Optimization)

Visual Interfaces SAS Viya For Many Users GUI user Programming Interfaces SAS Coder Non-SAS Coder API Interfaces Developer

SAS Visual Analytics Reporting Build and share reports and dashboards that help your organization make intelligent decisions. Discovery Explore your data with visualizations that help you understand even the most complex data Self-Service Analytics The power of easy-tounderstand analytics, in the hands of more users and provide path towards maturity

Data Preparation Access to different data sources Table and column profiling Filter data and column transformations Visual joins

Visual Exploration Discover relationships, trends, outliers Smart auto charting Analytics driven visualizations Publish any visualization as a report object.

Self-Service Analytics Descriptive statistics Forecasting and scenario analysis Supervised and unsupervised learning Text analytics

Advance Analytics Highly scalable, in-memory analytical processing Integrated data mining and machine learning Modern machine learning algorithms Model development Model assessment and scoring

SAS Visual Data Mining and Machine Learning GUI Interface SAS Visual Analytics Machine Learning Forest Gradient Boosting Neural Networks Support Vector Machines Factorization Machines Statistics Linear Regression Logistic Regression GLM Regression Clustering (k-means) Decision Tree

SAS Visual Data Mining and Machine Learning Programmatic Interface - SAS Studio Web Interface Interactive Program Editor Snippets Tasks Program Generator

Programming Interfaces Open APIs Access to all analytical actions for programmers Script-Save-Share-Schedule Open API Integration Integration from Python, Java, Lua, R

SAS Viya and Python SAS Sripting Wrapper for Analytics Transfer (SWAT) for Python Access to SAS Viya from Python Integration of SAS Analytics in Python code Jupyter Notebook support Issue tracking and collaboration in development through GitHub project + +

SAS Viya and R SAS Scripting Wrapper for Analytics Transfer (SWAT) for R Access to SAS Viya from R Integration of SAS Analytics in R code R Studio and Jupyter Notebook support Issue tracking and collaboration in development through GitHub project

What Does SWAT Do in a Nutshell SAS Scripting Wrapper for Analytics Transfer (SWAT) packages are open source interfaces to CAS Python coders can have access to the SAS Cloud Analytic Services (CAS) engine (the centre piece of the SAS Viya framework) You can load and analyse large data sets using processing power of CAS engine (either on a physical server or on cloud) and execute workflows of CAS analytic actions from Python on the client side. The SWAT package mimics much of the API of the Pandas package so that using CAS feels familiar to Pandas users. Install client components from GitHub #pip install https://github.com/sassoftware/pythonswat/releases/download/vx.x.x/python-swat-x.x.x-platform.tar.gz Dependencies: 64-bit Python 2.7, 3.4, or 3.5 on Linux SAS Viya

SAS Viya Programming As viewed through Legos SAS Studio can include one or more procedures to create a task. Stack one or more action sets to create procedures Stack one or more actions to create action sets Each action has one or more parameter settings. Studio Task Procedure Action Set Action Action Parameters Actions are at the heart of every CAS procedure

SAS Viya : New Open Architecture Different Languages Same Power proc print data = hmeq (obs = 10); run; Workers Controller df = s.castable( hmeq ) df.head(10) Translated Action [table.fetch] table.name = hmeq from = 1 to = 10 df <- defcastable(s, hmeq ) head(df, 10)

Calling VDMML Analytics from Python

SAS on SAS Viya with SWAT Open API for Python for SAS Viya (CAS Engine) https://github.com/sassoftware/python-swat SAS Viya with SWAT R Open API for R for SAS Viya (CAS Engine) https://github.com/sassoftware/r-swat

SAS ENTERPRISE MINER VIYA BRIDGE NODE Increase the Reach Model comparison Testing new SAS Viya algorithms Simultaneous processing of all model trainings Model ensembles Integration with SAS Enterprise Miner Metadata Management, deployment und monitoring of SAS Viya models

Get Started with SAS Viya Enablement SAS Viya Virtual Learning Support your adoption Free resources Getting Started training Video Libraries e-learning Topics Include Administration Programming & Analytics SAS Visual Analytics Open Source Integration

SAS Visual Data Mining and Machine Learning Some highlights Custom pipelines Parallel execution SAS best practice pipelines (i.e. Rapid Predictive Modeler) Defining and sharing custom reusable nodes Ability to deploy models in to databases directly Score code generation and score code APIs Import SAS 9.4 score code into pipeline comparison SAS Code integration

DEMO Visual Data Mining and Machine Learning

Break 30minute (10:30-11AM)

SAS Viya API (Python) Timing (11-12:15PM) Hands-on session Python crash course (Optional) How to connect to CAS & Load data using jupyter notebook Working with CasTable using jupyter notebook Using CASTable objects like a DataFrame Data exploration and summary statistics SAS VIYA & Python model: Best of both worlds

Decentralized

Centralized Shared Data, Shared Computing

Demo Assets Click these!

Resources sas.com

Break 15minute (12:15-12:30PM)

SAS Viya API (R ) Timing (12:30-1:30PM) Hands-on session R crash course (Optional) How to connect to CAS & Load data using R Studio Working with CasTable using R Studio Using CASTable objects like a DataFrame Data exploration and summary statistics SAS VIYA & R model: Best of both worlds

Demo Assets Click these!

Lunch 90minute (13:30-15:00PM)

Artificial Intelligence/NLP (15:00-16:00PM)

Artificial Intelligence is the science of training systems to emulate human tasks through learning and automation.

CHALLENGE ARTIFICIAL INTELLIGENCES CHALLENGES Lack of Customer insights Human-like tasks are not automated Technology and supplier not reliable Lack of Banking Tailored applications Loads of Unused Unstructured data such as Text and Images Business Executive

CHALLENGE ARTIFICIAL INTELLIGENCE CHALLENGES Lack of Customer insights Human-like tasks are not automated Technology and supplier not reliable Lack of Banking Tailored applications Loads of Unused Unstructured data such as Text and Images Large unstructured data, poor quality & governance Increased integration complexity with open source Data Scientists/ Researchers are a scarce resource Time consuming analytical processes Analytics Practitioner

CHALLENGE ARTIFICIAL INTELLIGENCE CHALLENGES Lack of Customer insights Human-like tasks are not automated Technology and supplier not reliable Lack of Banking Tailored applications Loads of Unused Unstructured data such as Text and Images Large unstructured data, poor quality & governance Increased integration complexity with open source Data Scientists/ Researchers are a scarce resource Time consuming analytical processes Difficult and costly to develop and maintain No real-time capabilities Lack of Automation and Repeatability IT Executive

ADVANCING WITH ARTIFICIAL INTELLIGENCE: CHALLENGES & VISION CHALLENGE STRATEGY VISION Lack of Customer insights Human-like tasks are not automated Technology and supplier not reliable Repetitive learning and discovery through data Automate manual tasks Highly skilled people to setup and maintain Lack of Banking Tailored applications Loads of Unused Unstructured data such as Text and Images Large unstructured data, poor quality & governance Develop Governed and Tailored Applications Add new features to existing solutions Better value from investments in Analytics Increased integration complexity with open source Data Scientists/ Researchers are a scarce resource Time consuming analytical processes Scale Analytics (deep learning)process to cope with big data Self-learning models for better accuracy Improve customer insight Difficult and costly to develop and maintain No real-time capabilities Lack of Automation and Repeatability Get more out of data with increase accuracy Competitive advantage through the best models and data usage

Support Core Artificial Intelligence at SAS Machine Learning Natural Language Computer Vision Forecasting and Optimization Data Wrangling Visualization Decision Support Deployment

Some application of AI in banking Anti-fraud Customer Optimization Journey Credit Application Chatbot Real time Transaction Analysis Personalized Recommend ation Next Best Action Anti-money Laundering Algorithmic Trading

Insurance industry

Cancer treatment

Wild-Life conservation

Performance assessment

Movie Industry

Customer journey optimization- Reinforcement learning

Open discussion What are your AI use cases?

Deep Learning Deep forward networks Auto encoders Convolutional networks Recurrent networks Cognitive Computing Speech to Text Natural language interaction Natural language generation Biomedical image processing AI M E T H O D S Powered by Viya MPP & GPU processing Python, Lua, Java, CASL and REST

Deep Learning Toolkit Ships with VDMML license as CAS actions CAS Action, but also built into MS pipeline for Neural Nets with more than 5 layers DEEP FORWARD CONVOLUTIONAL CAS Actions Only AUTOENCODERS RECURRENT Company Confidential For Internal Use Only

SAS deep learning toolkits

Image Actions loadimages CAS flattenimagetable compareimages c1 c2 c3 c4 c5 c6 c7 c8 C9 208.0 220.0 225.0 232.0 237.0 244.0 250.0 254.0 255.0 _channel1 channel2 channel3 channel4 source_id reference_id_ 0.00245 0.00481 0.00147 1.0 sor.jpg ref.jpg saveimages matchimages summarizeimages augmentimages Column jpg minwidth maxwidth max3rdchannel 0 _image_ 1.0 704.0 704.0 255.0 colorjittering lighten invertpixels pyramidup rotateleft horizontalflip colorshifting darken sharpen pyramiddown rotateright verticalflip Continue to Next Page

Image Actions processimages action loadimages CAS processimages Convert_color Bilateral_filter threshold laplacian Canny_edge processimages contour resize get_patch sobel normalize rescale morphology Box_filter Gaussian_filter Median_filter Build_pyramid Contour image is combination of: Convert_color Bilateral_filter Threshold Laplacian Contours Add_constant Hist_equal_global Hist_equal_addl Mutation_vert Mutation_sharp

#analyticsx BioMedical C o p y r i ght 2016, S AS I ns ti tut e In c. A ll ri ght s res er ved.

Deep Learning Python APIs

Design Strategy

Deploy Streaming A.I. Sample Images Profile Labels: gender / outfit / company Channels Model Training Alerts / Reports/ Decisioning IoT Data (camera) Model Deployment @ Streaming Data Marketing Automation

X Streaming AI Train At-Rest & Score In-Stream Train & Score In-Stream Streaming Data Prep for AI SAS Machine Learning Models Forest, Gradient Boosting Factorization Machine Support Vector Machine Robust PCA Support Vector Data Description Text Analytics (ASTORE only) Deep Learning Bayesian networks X SAS Machine Learning Models Streaming K-Means Streaming DBSCAN Streaming Linear Regression Streaming Logistic Regression Streaming Support Vector Machine SAS Algorithms (among others) Streaming Text Tokenization and Text Vectorization Streaming Receiver Operating Characteristic Information Streaming Image Processing

Reinforcement Learning in CI360 Coming Soon in 2018 Allow marketers to automatically learn what sequences of content maximize conversion goal on a web page, also taking into account different user attributes.

Cognitive Computing

Cognitive Computing SAS Vision Enable more human-like interactions with our software solutions. Reason on input. Explain on output. Cognitive computing is based on self-learning systems that use machine learning techniques to perform specific, human-like tasks in an intelligent way.

Detecting Emotions Company Confidential For Internal Use Only

Validate Safety Compliance Hard Hat Demo Company Confidential For Internal Use Only

Image Classification Classifying images in real time Company Confidential For Internal Use Only

Thank You! sas.com