SPM 8.2. Salford Predictive Modeler

Similar documents
Salford Predictive Modeler. Powerful machine learning software for developing predictive, descriptive, and analytical models.

Copyr i g ht 2012, SAS Ins titut e Inc. All rights res er ve d. ENTERPRISE MINER: ANALYTICAL MODEL DEVELOPMENT

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration

SAS BIG DATA ANALYTICS INCREASING YOUR COMPETITIVE EDGE

Introducing Analytics with SAS Enterprise Miner. Matthew Stainer Business Analytics Consultant SAS Analytics & Innovation practice

Week 1 Unit 4: Defining Project Success Criteria

Data Science in a pricing process

Copyright 2013, SAS Institute Inc. All rights reserved.

KnowledgeSEEKER POWERFUL SEGMENTATION, STRATEGY DESIGN AND VISUALIZATION SOFTWARE

IBM SPSS Decision Trees

SAS Enterprise Miner 5.3 for Desktop

Chapter 13 Knowledge Discovery Systems: Systems That Create Knowledge

3 Ways to Improve Your Targeted Marketing with Analytics

SAP Predictive Analytics Hands-On. Andreas Forster December 2015

IBM SPSS Modeler Personal

SAP Predictive Analytics Suite

Advanced Root Cause Analysis for Product Quality Improvement using Machine Learning in TIBCO Spotfire

From Profit Driven Business Analytics. Full book available for purchase here.

PREDICTING EMPLOYEE ATTRITION THROUGH DATA MINING

Achieve Better Insight and Prediction with Data Mining

IBM SPSS Modeler Personal

Applying Regression Techniques For Predictive Analytics Paviya George Chemparathy

GENERAL SERVICES ADMINISTRATION. Federal Supply Service. Authorized Federal Supply Schedule Price List

Machine Learning 101

Ensemble Modeling. Toronto Data Mining Forum November 2017 Helen Ngo

STATE OF THE ART ANALYTICS

Who Are My Best Customers?

Advanced analytics at your hands

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

Azure ML Studio. Overview for Data Engineers & Data Scientists

SAS Business Knowledge Series

NICE Customer Engagement Analytics - Architecture Whitepaper

2016 INFORMS International The Analytics Tool Kit: A Case Study with JMP Pro

Analytics for Banks. September 19, 2017

Deep Dive into High Performance Machine Learning Procedures. Tuba Islam, Analytics CoE, SAS UK

Predictive Modeling using SAS. Principles and Best Practices CAROLYN OLSEN & DANIEL FUHRMANN

Prediction and Interpretation for Machine Learning Regression Methods

FINAL PROJECT REPORT IME672. Group Number 6

Linear model to forecast sales from past data of Rossmann drug Store

Machine Learning 2.0 for the Uninitiated A PRACTICAL GUIDE TO IMPLEMENTING ML 2.0 SYSTEMS FOR THE ENTERPRISE.

Data Mining Applications with R

Data Mining. Implementation & Applications. Jean-Paul Isson. Sr. Director Global BI & Predictive Analytics Monster Worldwide

IBM SPSS Statistics: What s New

SAS Machine Learning and other Analytics: Trends and Roadmap. Sascha Schubert Sberbank 8 Sep 2017

DASI: Analytics in Practice and Academic Analytics Preparation

Predicting Customer Purchase to Improve Bank Marketing Effectiveness

Segmentation Modeling

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

POST GRADUATE PROGRAM IN DATA SCIENCE & MACHINE LEARNING (PGPDM)

Advanced Analytics for Fraud, Waste and Abuse Detection

SAS ANALYTICS IN ACTION APPROACHABLE ANALYTICS AND DECISIONS AT SCALE TUBA ISLAM, SAS GLOBAL TECHNOLOGY PRACTICE, ANALYTICS

Predictive Analytics for the Business Analyst. Fern Halper July 8,

Qlik Sense. Data Sheet. Transform Your Organization with Analytics

Today. Last time. Lecture 5: Discrimination (cont) Jane Fridlyand. Oct 13, 2005

IBM SPSS Statistics. Editions. Get the analytical power you need for better decision making. Why use IBM SPSS Statistics? IBM Analytics Solution Brief

NEXT GENERATION PREDICATIVE ANALYTICS USING HP DISTRIBUTED R

From Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques. Full book available for purchase here.

Preface to the third edition Preface to the first edition Acknowledgments

The real life of Data Scientists. or why we created the Data Science Studio

Actionable Intelligence that Accelerates Profitable Growth. An Introduction to Zilliant IQ for Salesforce

Predictive Analytics

Using SAS Enterprise Guide, SAS Enterprise Miner, and SAS Marketing Automation to Make a Collection Campaign Smarter

Practical Application of Predictive Analytics Michael Porter

Churn Prevention in Telecom Services Industry- A systematic approach to prevent B2B churn using SAS

IBM SPSS Statistics Editions

Data Mining in CRM THE CRM STRATEGY

Watts App: An Energy Analytics and Demand-Response Advisor Tool

Building the In-Demand Skills for Analytics and Data Science Course Outline

Using Decision Tree to predict repeat customers

Predictive Analytics With Oracle Data Mining

In-Memory Analytics: Get Faster, Better Insights from Big Data

PASW Modeler. Achieve Your Goals with Deep, Predictive Insight

IBM Incentive Compensation Management

Data Mining Technology and its Future Prospect

USING R IN SAS ENTERPRISE MINER EDMONTON USER GROUP

Effective CRM Using. Predictive Analytics. Antonios Chorianopoulos

Accelerated Data Blending and Analytics for Microsoft Power BI with Alteryx

Drive Better Insights with Oracle Analytics Cloud

DATA SCIENCE: HYPE AND REALITY PATRICK HALL

Advanced Analytics in the Consumer Goods Sector

Data mining and Renewable energy. Cindi Thompson

Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction

New Customer Acquisition Strategy

TECHNOLOGY BRIEF Intapp and Applied AI. THE INTAPP PROFESSIONAL SERVICES PLATFORM Unified Client Life Cycle

Probabilistic load and price forecasting: Team TOLOLO

Artificial Intelligence and Project Management: Beyond Human Imagination!

WHITE PAPER. Results Delivers Value

Forecasting Seasonal Footwear Demand Using Machine Learning. By Majd Kharfan & Vicky Chan, SCM 2018 Advisor: Tugba Efendigil

Knowledge Discovery and Data Mining

Watson Analytics Linda Dest March 24, 2016

Harnessing Predictive Analytics to Improve Customer Data Analysis and Reduce Fraud

PROVEN PRACTICES FOR PREDICTIVE MODELING

GOVERNMENT ANALYTICS LEADERSHIP FORUM SAS Canada & The Institute of Public Administration of Canada. April 26 The Shaw Centre

Forecasting in SAP Integrated Business Planning Webinar

Advanced Analytics for Fraud, Waste, and Abuse Detection WHITE PAPER

Approaching an Analytical Project. Tuba Islam, Analytics CoE, SAS UK

C3 Products + Services Overview

Top-down Forecasting Using a CRM Database Gino Rooney Tom Bauer

Predictive Analytics for Donor Management

Olin Business School Master of Science in Customer Analytics (MSCA) Curriculum Academic Year. List of Courses by Semester

Transcription:

SPM 8.2 Salford Predictive Modeler

SPM 8.2 The SPM Salford Predictive Modeler software suite is a highly accurate and ultra-fast platform for developing predictive, descriptive, and analytical models from databases and datasets of any size, complexity, or organization. The SPM software suite s data mining technologies span classification, regression, survival analysis, missing value analysis, data binning and clustering/segmentation to cover all aspects of your data science projects. SPM algorithms are considered to be essential in sophisticated data science circles. Features > > 70+ pre-packaged automation scenarios inspired by the way leading model analysts structure their work. > > Tools to relieve gruntwork, allowing the analyst to focus on the creative aspects of model development. > > Regression, Classification, and Logistic Regression enhanced to support massive datasets. Salford Predictive Modeler

The SPM software suite s automation accelerates the process of model building by conducting substantial portions of the model exploration and refinement process for the analyst. While the analyst is always in full control we optionally anticipate the analysts next best steps and package a complete set of results from alternative modeling strategies for easy review. Do in one day what normally requires a week or more using other systems! Product Versions ULTRA: The best of the best. For the modeler who must have access to leading edge technology and fastest run times including major advances in ensemble modeling, interaction detection and automation. PROEX: For the modeler who needs cutting-edge data mining technology, including extensive automation of modeling experiments typical for experienced data analysts and dozens of extensions to the Salford data mining engines. PRO: A true predictive modeling workbench designed for the professional data miner. Variety of supporting conventional statistical modeling tools, programming language, reporting services, and a modest selection of workflow automation options. BASIC: Literally the basics. Salford Systems award winning data mining engines without extensions or automation or surrounding statistical services, programming language, and sophisticated reporting. Designed for small budgets while still delivering our world famous engines. Features > > Performance improvements > > Descriptive statistics controls > > Model-based imputation or imputation using summary statistics > > Correlation analysis and Multidimensional Scaling > > Exact test partitioning fractions > > KEEP and EXCLUDE controls > > Expanded Set of Performance Statistics: Variance of ROC and Kolmogorov- Smirnov (K-5) Statistic > > New Time Series support mechanisms > > New automated model experiments > > New scoring distribution controls and visualization > > Recursive feature elimination

Engines CART Classification and Regression Trees CART software is the ultimate classification tree that has revolutionized the field of advanced analytics, and inaugurated the current era of data science. CART is one of the most important tools in modern data mining. Others have tried to copy CART, but no one has succeeded, as evidenced by accuracy, performance, feature set, built-in automation and ease of use. Designed for both non-technical and technical users, CART can quickly reveal important data relationships that could remain hidden using other analytical tools. Technically, CART is based on landmark mathematical theory introduced in 1984 by four world-renowned statisticians at Stanford University and the University of California, Berkeley. Salford Systems implementation of CART is the only decision tree software embodying the original proprietary code. The CART creators continue to collaborate with Salford Systems to continually enhance CART with proprietary advances. NewestFeatures > > Unsupervised learning for advanced anomaly detection > > Association analysis using hotspots > > Forced splits > > Scalable limits on bottom nodes > > Automated prior search > > Automated bias penalty Patented extensions to CART are specifically designed to enhance results for market research and web analytics. CART supports high-speed deployment, allowing Salford Systems models to predict and score in real time on a massive scale. Classification and Regression Trees

TreeNet Gradient Boosting TreeNet software is Salford Systems most flexible and powerful data mining tool, responsible for at least a dozen prizes in major data mining competitions since its introduction in 2002. SUPREME ACCURACY TreeNet adds the advantage of a degree of accuracy usually not attainable by a single model or by ensembles such as bagging or conventional boosting. As opposed to neural networks, TreeNet is not sensitive to data errors and needs no time-consuming data preparation, preprocessing or imputation of missing values. TreeNet s robustness extends to data contaminated with erroneous target labels. This type of data error can be very challenging for conventional data mining methods and will be catastrophic for conventional boosting. In contrast, TreeNet is generally immune to such errors as it dynamically rejects training data points too much at variance with the existing model. ADVANCED FEATURES Interaction detection statistics establish whether interactions of any kind are needed in a predictive model, and searches to discover specifically which interactions are important. The interaction detection system not only helps improve model performance (sometimes dramatically) but also assists in the discovery of valuable new segments and previously unrecognized patterns. NextGen > > Regularized Gradient Boosting (RGBOOST) > > Building Random Forests with TreeNet > > RuleLearner : TreeNet s accuracy plus the interpretability and transparency of regression and CART > > New loss functions including differential lift > > Interaction Strength Reporting: discover, measure, and plot the most important interactions > > Advanced Controls: subsampling and influence trimming > > Monotone constraints > > Vary the number of predictors > > Customized starting values > > Model building with TreeNet splines Gradient Boosting

MARS Automatic Non-linear Regression MARS software is ideal for users who prefer results in a form similar to traditional regression while capturing essential nonlinearities and interactions. The MARS approach to regression modeling effectively uncovers important data patterns and relationships that are difficult, if not impossible, for other regression methods to reveal. REGRESSION & CLASSIFICATION The MARS model is designed to predict numeric outcomes such as the average monthly bill of a mobile phone customer or the amount that a shopper is expected to spend in a web site visit. MARS is also capable of producing high quality classification models for a yes/no outcome. MARS performs variable selection, variable transformation, interaction detection, and selftesting, all automatically and at high speed. OUTSTANDING PERFORMANCE Areas where MARS has exhibited excellent performance include forecasting electricity demand for power generating companies, relating customer satisfaction scores to the engineering specifications of products, and presence/absence modeling in geographical information systems (GIS). Features > > Build a series of models varying the maximum number of basis functions (Automate BASIS) > > Build a series of models varying the smoothness parameter (Automate MINSPAN) > > Build a series of models varying the order of interactions (Automate INTERACTIONS) > > Build a series of models varying the modeling speed (Automate SPEED) > > Build a series of models using varying degree of penalty on added variables (Automate PENALTY MARS) Automatic Non-linear Regression

Engines Random Forest Breiman and Cutler s Random Forests Random Forests software is a bagging tool that leverages the power of multiple alternative analyses, randomization strategies, and ensemble learning. Its strengths are spotting outliers and anomalies in data, displaying clusters, predicting future outcomes, identifying important predictors, replacing missing values with imputations, and providing insightful graphics. CLUSTER AND SEGMENT Much of the insight provided by Random Forests is generated by methods applied after the trees are grown and include new technology for identifying clusters or segments in data as well as new methods for ranking the importance of variables. The method was developed by Leo Breiman and Adele Cutler of the University of California, Berkeley, and is licensed exclusively to Salford Systems. Ongoing research is being undertaken by Salford Systems in collaboration with Professor Adele Cutler, the surviving co-author of Random Forests. SUITED FOR WIDE DATASETS Random Forests is a collection of many CART trees that are not influenced by each other when constructed. The sum of the predictions made from decision trees determines the overall prediction of the forest. Random Forests is best suited for the analysis of complex data structures embedded in small to moderate data sets containing less than 10,000 rows but potentially millions of columns. Advantages > > Easy parameter optimization with automates > > Special RF clustering enhancements > > Extreme Trees: Special controls for random splitting > > Advanced missing value imputation options > > Penalty Configuration: allows a user to penalize the inclusion of particular variables Breiman and Cutler s Random Forests

THE COMPANY Salford Systems specializes in state-of-the-art machine learning technology designed to assist data scientists in all aspects of predictive model development. PEDIGREE Salford Systems technology was developed and enhanced in direct collaboration with original creators of modern machine learning and data mining. We maintain an active R&D program, leveraging our ties to leading universities. THE SOFTWARE Salford s tools are known for their ease of use, capability of working with large volumes of data, highspeed model development, robustness and reliability and consistent delivery of ultra-accurate models. Salford s modeling automation tools guide novice data scientists through the complex process of model development and help expert data scientists develop world-class predictive models. It s not just Salford Systems that s winning. Our customers using Salford Systems software and algorithms have been recognized for their winning approach to challenges in their industries. CUSTOMER AWARDS AND ACHIEVEMENTS PAKDD Data Mining Competition, the annual Pacific-Asia Knowledge Discovery and Data Mining data mining competition Cross-selling task, financial dataset DMA Direct Marketing Association Analytics Challenge Predicting customer lifetime value to drive personalized customer interactions Healthcare response task Make-A-Wish Foundation targeting solution lapsed donor segments Targeted marketing task SALFORD AWARDS AND ACHIEVEMENTS KDD Cup, the annual Data Mining and Knowledge Discovery competition Web analytics task SIG KDD Innovation Award to Drs. Leo Breiman and Jerome Friedman, creators of algorithms in SPM Outstanding technical contributions to the field of knowledge discovery in data and data mining that have had lasting impact in furthering the theory and/or development of commercial systems Teradata Center for Customer Relationship Management (CRM) at Duke University Churn Modeling, CRM CONTACT 9685 Via Excelencia, Suite 208, San Diego, CA 92126 Telephone: (619)543-8880 Fax: (619)543-8888 salford-systems.com SPM, Salford Predictive Modeler, CART, MARS, Tree Net, Random Forests, and RuleLearner, are all registered trademarks of Minitab, Inc.