Salford Predictive Modeler. Powerful machine learning software for developing predictive, descriptive, and analytical models.

Similar documents
SPM 8.2. Salford Predictive Modeler

3 Ways to Improve Your Targeted Marketing with Analytics

Copyr i g ht 2012, SAS Ins titut e Inc. All rights res er ve d. ENTERPRISE MINER: ANALYTICAL MODEL DEVELOPMENT

Copyright 2013, SAS Institute Inc. All rights reserved.

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration

SAS BIG DATA ANALYTICS INCREASING YOUR COMPETITIVE EDGE

From Profit Driven Business Analytics. Full book available for purchase here.

Advanced Root Cause Analysis for Product Quality Improvement using Machine Learning in TIBCO Spotfire

IBM SPSS Decision Trees

KnowledgeSEEKER POWERFUL SEGMENTATION, STRATEGY DESIGN AND VISUALIZATION SOFTWARE

Effective CRM Using. Predictive Analytics. Antonios Chorianopoulos

Acknowledgments. Data Mining. Examples. Overview. Colleagues

Predictive Modeling using SAS. Principles and Best Practices CAROLYN OLSEN & DANIEL FUHRMANN

Using Decision Tree to predict repeat customers

Predicting Customer Purchase to Improve Bank Marketing Effectiveness

How to Get More Value from Your Survey Data

Analytics for Banks. September 19, 2017

Machine Learning 101

Data Science in a pricing process

Predicting Customer Loyalty Using Data Mining Techniques

Advanced Analytics for Fraud, Waste and Abuse Detection

Applying Regression Techniques For Predictive Analytics Paviya George Chemparathy

Data Mining in CRM THE CRM STRATEGY

Linear model to forecast sales from past data of Rossmann drug Store

ADVANCED DATA ANALYTICS

Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction

Introducing Analytics with SAS Enterprise Miner. Matthew Stainer Business Analytics Consultant SAS Analytics & Innovation practice

Deep Dive into High Performance Machine Learning Procedures. Tuba Islam, Analytics CoE, SAS UK

Preface to the third edition Preface to the first edition Acknowledgments

FINAL PROJECT REPORT IME672. Group Number 6

Bootcamp to Predictive Modelling with R - Statistical Track

Credit Card Marketing Classification Trees

SAS Business Knowledge Series

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

RECURRING REVENUE OPTIMIZATION: THE BASICS

Advanced Analytics for Fraud, Waste, and Abuse Detection WHITE PAPER

After completion of this unit you will be able to: Define data analytic and explain why it is important Outline the data analytic tools and

Getting Started with Machine Learning

Who Are My Best Customers?

Prediction and Interpretation for Machine Learning Regression Methods

Churn Prediction for Game Industry Based on Cohort Classification Ensemble

SAS Enterprise Miner 5.3 for Desktop

Predictive Analytics and Machine Learning: An Overview

From Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques. Full book available for purchase here.

An Executive s Guide to Predictive Data Modeling. An introductory look at how data modeling can drive better business decisions.

Uncover possibilities with predictive analytics

IBM SPSS Statistics Editions

PREDICTING EMPLOYEE ATTRITION THROUGH DATA MINING

IBM SPSS Statistics. Editions. Get the analytical power you need for better decision making. Why use IBM SPSS Statistics? IBM Analytics Solution Brief

KDD Challenge Orange Labs R&D

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

Azure ML Studio. Overview for Data Engineers & Data Scientists

Datameer for Data Preparation: Empowering Your Business Analysts

Ensemble Modeling. Toronto Data Mining Forum November 2017 Helen Ngo

Building the In-Demand Skills for Analytics and Data Science Course Outline

Olin Business School Master of Science in Customer Analytics (MSCA) Curriculum Academic Year. List of Courses by Semester

IBM Kenexa Talent Insights

IBM SPSS Modeler Personal

Chapter 13 Knowledge Discovery Systems: Systems That Create Knowledge

IBM SPSS Modeler Personal

Discover why your statistical intellect is in even more. demand now, than ever before. Natalie Mendes

IBM SPSS Statistics: What s New

Implementing Instant-Book and Improving Customer Service Satisfaction. Arturo Heyner Cano Bejar, Nick Danks Kellan Nguyen, Tonny Kuo

The real life of Data Scientists. or why we created the Data Science Studio

Drive Better Insights with Oracle Analytics Cloud

Speech Analytics Transcription Accuracy

Progress Report: Predicting Which Recommended Content Users Click Stanley Jacob, Lingjie Kong

Advanced analytics at your hands

IBM SPSS & Apache Spark

Approaching an Analytical Project. Tuba Islam, Analytics CoE, SAS UK

CSC-272 Exam #1 February 13, 2015

Business Customer Value Segmentation for strategic targeting in the utilities industry using SAS

AI Strategies in Retail

Achieving customer intimacy with IBM SPSS products

SAS Machine Learning and other Analytics: Trends and Roadmap. Sascha Schubert Sberbank 8 Sep 2017

Case studies in Data Mining & Knowledge Discovery

{saharonr, lastgift>35

CREDIT RISK MODELLING Using SAS

NEXT GENERATION PREDICATIVE ANALYTICS USING HP DISTRIBUTED R

TIBCO Industry Analytics: Consumer Packaged Goods and Retail Solutions

science and applications

Customer Lifecycle Management How Infogix Helps Enterprises Manage Opportunity and Risk throughout the Customer Lifecycle

POST GRADUATE PROGRAM IN DATA SCIENCE & MACHINE LEARNING (PGPDM)

Achieve Better Insight and Prediction with Data Mining

Predictive Analytics Using Support Vector Machine

Predicting Loyal Customers for Sellers on Tmall to Increase Return on Promoting Cost

Successful Selling: Acing Advanced Analytics to Drive Commercial Growth

How to build and deploy machine learning projects

Data Mining. Implementation & Applications. Jean-Paul Isson. Sr. Director Global BI & Predictive Analytics Monster Worldwide

Machine Learning 2.0 for the Uninitiated A PRACTICAL GUIDE TO IMPLEMENTING ML 2.0 SYSTEMS FOR THE ENTERPRISE.

MANAGING NEXT GENERATION ARTIFICIAL INTELLIGENCE IN BANKING

TNM033 Data Mining Practical Final Project Deadline: 17 of January, 2011

InfoSphere Warehousing 9.5

Putting Big Data & Analytics to Work!

Software Feature Sets. Powerful. Flexible. Intuitive. Alight feature sets. Driver-Based Planning & Analytics

SAP Predictive Analytics Suite

Big Data for the Pharmaceutical Industry

Practical Application of Predictive Analytics Michael Porter

Knowledge Discovery and Data Mining

WHY USE ANALYTICS RAID FMS FRAUD IN TELCOS ADVANCED FRAUD DETECTION. Fact Check. To Fight Fraud. Hands-On

Transcription:

Powerful machine learning software for developing predictive, descriptive, and analytical models.

The Company Minitab helps companies and institutions to spot trends, solve problems and discover valuable insights in data by delivering a comprehensive and best-in-class suite of data analysis and process improvement tools. Combined with unparalleled ease-of-use, Minitab makes it simpler than ever to get deep insights from data. Plus, a team of highly trained data analytic experts ensure that users get the most out of their analyses, enabling them to make better, faster and more accurate decisions. For over 40 years, Minitab has helped organizations drive cost containment, enhance quality, boost customer satisfaction and increase effectiveness. Thousands of businesses and institutions worldwide use Minitab Statistical Software, Companion by Minitab, and Quality Trainer to uncover flaws in their processes and improve them. In 2017, Minitab acquired Salford Systems, a leading provider of advanced analytics which delivers a suite of powerful machine learning, predictive analytics and modeling capabilities. (SPM ) is a powerful machine learning software suite for developing predictive, descriptive, and analytical models. SPM helps organizations take full advantage of their data to generate fact-based insights that drive business decisions. SPM s tools are known for their accuracy, ease-of-use, ability to handle large volumes of data, high-speed model development, robustness and reliability, and consistent delivery of accurate predictive models. Adding SPM s four principal modeling engines: CART, TreeNet, MARS and Random Forests to your current statistical package will allow your organization to handle more complicated data scenarios such as non-linear relationships, larger datasets, complex interaction among variables, missing data and extreme outliers. These tools enable both the novice and expert modeler to uncover complex interactions between predictors and nonlinear relationships to develop world-class predictive models. minitab.com/spm/ 1

SPM - Core Capabilities The core capabilities of SPM include classification, regression, survival analysis, missing value analysis, data binning and clustering/segmentation to cover a diverse array of machine learning and data science needs. Some of the largest corporations in the world use SPM s reliable and accurate models. Creating models Uncover complex interactions between predictors and complex nonlinear relationships to build more accurate models. SPM helps you select the most significant factors from a large pool of candidates. For new modelers, SPM does not require coding. Default settings provide a hard to beat baseline model, without time-consuming setup. Evaluating your data set Data errors? Missing values? Outliers? Non-normality? Not a problem. SPM algorithms are generally not affected by such errors as they reject training data points that vary too much with the existing model. SPM algorithms are robust to some of the assumptions of classical statistics. Missing Values: Automatically handles missing values without deleting rows or columns Unbalanced Data: Automatically upweights rare class, thus ensuring proper detection of rare events Automate Model: Quickly shows you the best SPM modeling engine Automate Model evaluates the ideal SPM methodology to quickly identify the ideal modeling engine for your data. For experts modelers, time-saving automation features reduce the time it takes to arrive at an accurate model, and enhance model scalability. SPM s pre-packaged scenarios minimize the time needed to find the best-model by automatic selection of tuning parameters. Automate Shaving: Variable reduction with minimal sacrifice to model accuracy CART Trees may treat outliers by isolating them in small terminal nodes which can limit their effect. Automate Target: Account for non-linear multivariate relationships, going beyond simple correlation More than 70 pre-packaged scenarios

Evaluation of your powerful predictive model Highly accurate and reliable predictions to enable decision-making SPM s models are defensible and easily interpretable internally to executive stakeholders and externally to regulators The ROC Curve allows you to visualize how well your model is doing versus random chance as shown by the diagonal line. This Life Curve shows that the model found defects without having to look at all the units coming off your manufacturing line. Fortunately, [SPM made it] very simple for us to hone in on the key predictors and be able to devise strategies to deal with those effectively. I m a believer that Minitab and SPM can work very effectively together. Master Black Belt, Global Operations for a top manufacturing organization Clicks not code minimizing the time for the best predictive model User-friendly modeling engines for analysts of all levels Built-in automation for model scalability and error reduction Clear visualization of the most significant insights Highly accurate and reliable predictions to enable decision-making

Minitab Machine Learning Techniques Minitab Statistical Software* CART Modeling Engine Classification and Regression trees Linear or polynormal relationships Few extreme outliers or missing values Supervised Data Complexity Complex relationships Larger data Learning Type Reduce Variables Type Unsupervised Goal Create Groupings Type CART has revolutionized the field of advanced analytics and the current era of data science. CART is one of the most important tools in modern machine learning. Other analytics providers have tried to replicate CART, but none have succeeded in terms of accuracy, performance, feature set, built-in automation and ease-of-use. TreeNet Modeling Engine Gradient Boosting Continuous Regression Target Type Two Categorical Number of Outcomes Single decision tree CART Results Equation Goal Predictive accuracy* Factor Analysis Principal Component Analysis Hierarchial (nested) groupings Cluster Variables Non-hierarchial groupings K-Means Clustering TreeNet modeling engine is s most flexible and powerful machine learning tool, responsible for at least a dozen prizes in major data mining competitions since its introduction in 2002. TreeNet adds the advantage of a degree of accuracy usually not attainable by a single model or by ensembles such as bagging or conventional boosting. Binary Logistic Regression Yes Ordinal Logistic Regression More than two Ordered Outcomes Feeling stuck or ready to learn more? No Nominal Logistic Regression MARS Many Variables (Wide) Random Forests Data Structure Many Observations TreeNet * ANY modeling technique could result in the best predictive accuracy for a specific data set, but most often TreeNet will be the most accurate. Random Forests Modeling Engine Breiman and Cutler s Random Forests Random Forests provides insights into data by generating methods that are applied after the classification trees are grown. Furthermore, it includes new technology for identifying clusters or segments in data as well as new methods for ranking the importance of variables. Random Forests is a collection of many CART trees that are not influenced by each other when constructed. The sum of the predictions made from decision trees determines the overall prediction of the forest. Random Forests is great at analyzing complex data structures embedded in small to moderate data sets containing less than 10,000 rows but potentially millions of columns. Highly-technical user support: We believe that expert and timely assistance is a critical part of our commitment to you. Access the help you need to use our software from trained data scientists and statisticians at www.minitab.com/support Online resources: Our library of online resources includes general introductory videos, software overviews, and tips & tricks Training program: Coming soon! *Minitab Statistical Software sold separately. MARS Modeling Engine Automatic Non-linear Regression MARS is ideal for users who prefer results similar to traditional regression while getting essential nonlinearities and interactions. The MARS methodology s approach to regression modeling effectively uncovers important data patterns and relationships that are difficult, if not impossible, for other regression methods to reveal.

Improved Decision Making SPM helps organizations looking to optimize revenue and minimize cost and risk through improved decisions across business industries and functions including: INDUSTRIES FUNCTIONAL AREAS MANUFACTURING FINANCIAL SERVICES HEALTH CARE & BIOMEDICAL OTHER INDUSTRIES CONTINUOUS IMPROVEMENT MARKETING Manufacturing Defects Credit Risk Patient Safety & Readmission Rates Actuarial Sciences Maximize Supply Chain Efficiencies Targeted Marketing Preventative Maintenance Fraud Prevention Medical Device Diagnostics Mining Operational Optimization Customer Retention & Churn Reengineering for Better Performance Customer Segmentation Pharma Research & Development Oil & Gas Operations & R&D Improving Work Processes Customer Satisfaction SPM can be used in industries and functions with data that can t be accurately modeled using traditional regression and classification approaches.

SPM in Action Finding the root cause of excessive variation Identifying which factors are contributing to the excessive variation in a manufacturing chemical process. Using the CART Modeling Engine, SPM s decision tree algorithm that takes categorical data to predict a qualitative value, historical data can be segmented into a set of yes/no rules. This segmentation splits the response (Y) variable into partitions based on the predictor (X) settings. In the example below, out of the many variables that were run in CART, one predictor production rate is a large contributor to a point falling outside the production control limits. Determining what factors are correlated to manufacturing defects Quickly uncover what factors in a manufacturing process are predictive of defects in production SPM s TreeNet Modeling Engine will generate the probability of a defect based on the variables (signals) supplied. Using the Automate Shaving feature in TreeNet, we can quickly narrow down 590 variables associated to the manufacturing process to 299 key factors that show a correlation with defects in production within minutes. If production rate <= 91.76, then the estimated probability of the process being out of control is relatively high (33%). If production rate > 91.76, then the process is likely in statistical control. Continually growing or pruning the CART tree will quickly identify additional causes of the excessive variability in this process. Once narrowed down to a few vital X s, controls can be put in place to reduce the chance of falling out of process specifications. TreeNet Modeling Engine can quickly narrow down the variables that make the most impact and visually illustrate the relationship between the likelihood of a defect and the identified variables to draft a preventative action plan. GAIN EFFICIENCY Dramatically improve the productivity of existing analytical teams. DECIDE Uncover significant insights to support data-driven decisions, gain competitive edge and improve efficiency. AUTOMATE Speed up analysis and automate portions of the analysis cycle.

SPM Licensing Options SPM offers annual licensing options for individual users and for license sharing across multiple users. Technical support is included for all currently licensed users through the life of the release. Multi-User Single-User Our flexible multi-user licensing option can be installed on an unlimited number of computers for concurrent use based on the number of seats purchased. A yearly subscription on one computer for smaller companies or a super user who needs unlimited access to the software. Users share these seats the number of seats is the total number of users that can access the software at the same time. SPM Awards and Achievements PAKDD, the annual Pacific-Asia Knowledge Discovery and Data Mining Competition Cross-selling task - financial dataset DMA Direct Marketing Association Analytics Challenge Predicting customer lifetime value to drive personalized customer interactions Healthcare response task Make-A-Wish Foundation targeting solution lapsed donor segments Targeted marketing task KDD Cup, the annual Data Mining and Knowledge Discovery competition Web analytics task SIG KDD Innovation Award to Drs. Leo Breiman and Jerome Friedman, creators of algorithms in SPM Outstanding technical contributions to the field of knowledge discovery in data and data mining that has helped further the theory and development of commercial systems Teradata Center for Customer Relationship Management (CRM) at Duke University Churn Modeling, CRM Contact us and try SPM free today minitab.com/spm/