Sascha Schubert, Product Manager Data Mining, SAS EMEA. Copyright 2005, SAS Institute Inc. All rights reserved.

Challenges for Data and Text Mining and How SAS Addresses Them
Sascha Schubert, Product Manager Data Mining, SAS EMEA

Predictive Analytics Process
1. Prepare Data
2. Develop Model (Analytical Training Set)
3. Deploy Model
4. Monitor Model
[Diagram labels: Transactional, Demographic, Operational, Financial, Unstructured, Other Domain Sources; Data Warehouse; Interactive, Batch or Real Time; Marketing or Risk; Decision Support System]

Successful Data Mining through Integration
1. Register Training Set
2. Retrieve Training Set
3. Register Results Package
4. Batch Scoring Plug-In
5a. Deploy Model
5b. Distribute Reports
[Diagram labels: Data Manager (Data Preparation, Deployment Services, Report Administration); Data Miner (Exploratory Analysis, Descriptive Segmentation, Predictive Modeling); Data Aggregation; Metadata Server; Model Development; Model Deployment; Model Management; Business Analyst; Application Developer; Manages Campaigns; Domain Expert; Evaluates Processes & ROI]

Data Sources: Specific Needs for Data Mining
- Data Volumes
  - Long history
  - More columns (observed and derived)
  - Different sources
- Data Format
  - Transactional data vs. customer history data
- Data Type
  - Database
  - Weblogs
  - Free Text

Answers to Data Challenges
- Provide data models for business-specific problems: SAS Industry Intelligence Solutions
- Create the required analytical data format from many different data formats: SAS ETL Solutions
- Create and store business-specific metadata for enterprise-wide use: SAS Metadata Server
- Provide flexible tools for interactive data preparation and selection: SAS Enterprise Miner

Integration: SAS Enterprise Miner - SAS ETL Studio
Data Preparation (ETL Studio)
- Define a Process Job to Create a Table
- Register Table to Metadata Server
- Create Data Mining Metadata as Part of Job
- Register DM Metadata to Metadata Server
Available Now

Analytical Data Preparation: Interactive Tools
- Transformations Builder
- Filter outliers interactively
- Principal Components node with results browser
Available in Autumn 2005

Text Mining Challenges
- Handle Bad Text Quality
  - Text cleaning
  - Fixing misspellings
  - Detecting all multi-word terms: sliding door, front seat
  - Deal with abbreviations/user-defined terms: Adj'd doors, call cust., i/m arm broken
- Visually Discover Concepts
  - Link terms to display concepts
Available in Autumn 2005
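
The cleaning steps named on this slide can be illustrated with a minimal Python sketch. It is not how SAS Text Miner works internally; the abbreviation dictionary, the vocabulary, the term list, and all function names are hypothetical and chosen only to mirror the slide's examples. Misspellings are matched against a small vocabulary using the standard library's `difflib`.

```python
from difflib import get_close_matches

# Hypothetical user-defined abbreviation dictionary (illustrative entry only).
ABBREVIATIONS = {"cust.": "customer"}

# Hypothetical domain vocabulary used to correct misspellings.
VOCABULARY = ["sliding", "door", "front", "seat"]

# Hypothetical list of known two-word terms, taken from the slide's examples.
MULTI_WORD_TERMS = {("sliding", "door"), ("front", "seat")}


def expand_abbreviations(tokens):
    """Replace every token that appears in the abbreviation dictionary."""
    return [ABBREVIATIONS.get(tok, tok) for tok in tokens]


def fix_misspellings(tokens, cutoff=0.8):
    """Map each token to its closest vocabulary word, if one is similar enough."""
    fixed = []
    for tok in tokens:
        match = get_close_matches(tok, VOCABULARY, n=1, cutoff=cutoff)
        fixed.append(match[0] if match else tok)
    return fixed


def merge_multi_word_terms(tokens):
    """Join adjacent tokens that form a known two-word term into one token."""
    merged, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) in MULTI_WORD_TERMS:
            merged.append(tokens[i] + "_" + tokens[i + 1])
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged


def clean(text):
    """Lowercase, split on whitespace, then apply the three cleaning steps."""
    tokens = text.lower().split()
    return merge_multi_word_terms(fix_misspellings(expand_abbreviations(tokens)))
```

For example, `clean("Call cust. about slidng door")` expands the abbreviation, corrects the misspelling, and treats "sliding door" as a single term.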

Integrate Analytical Modeling Algorithms
- Data miners always want new algorithms
  - SAS will support new algorithms such as SVM
- More important to combine existing techniques
  - Hybrid models
  - Ensemble models (bagging and boosting)
  - Combine different modeling techniques
- Integrate for predictive analytics
  - Web Path Analysis
  - Time Series Analysis
  - Market Basket Analysis
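
As an illustration of the ensemble idea mentioned above, here is a minimal bagging sketch in plain Python. It is not Enterprise Miner's implementation: the base learner is a deliberately simple one-variable threshold rule (a "stump"), and the function names and data layout are assumptions made for this example.

```python
import random
from statistics import mean


def fit_stump(rows):
    """Fit a threshold rule 'predict 1 when x > threshold'.

    rows is a non-empty list of (x, y) pairs with y in {0, 1}. Every observed
    x is tried as a threshold and the one with the fewest errors is kept.
    """
    best_threshold, best_errors = None, None
    for threshold, _ in rows:
        errors = sum((x > threshold) != bool(y) for x, y in rows)
        if best_errors is None or errors < best_errors:
            best_threshold, best_errors = threshold, errors
    return best_threshold


def bagged_stumps(rows, n_models=25, seed=0):
    """Bagging: fit one stump per bootstrap sample (drawn with replacement)."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        sample = [rng.choice(rows) for _ in rows]
        models.append(fit_stump(sample))
    return models


def predict_proba(models, x):
    """Ensemble score: the fraction of stumps that vote for class 1."""
    return mean(1.0 if x > threshold else 0.0 for threshold in models)
```

Averaging the votes of many models trained on resampled data tends to give a more stable score than any single model; boosting differs in that each new model is fitted with extra weight on the cases earlier models got wrong.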

Combine Different Modeling Techniques

Modeling Algorithms
- Integrate your own modeling techniques in SAS Enterprise Miner
- Can integrate ANY SAS model very easily
- Use the Extension facilities
  - Create new nodes easily based on SAS and XML
- SAS will provide a sharing platform for user-written SAS Enterprise Miner Extension Nodes
Available Now

Develop Customized Tools

Performance: Grid Computing
- Grid computing: a means to apply the resources from a collection of computers in a network and to harness all the compute power into a single project
- Available for model training in EM with EM 5.2 in Autumn 2005
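
The principle behind grid-based model training, independent training jobs farmed out to several workers with the results collected in one place, can be sketched as follows. This is only an analogy: threads on a single machine stand in for grid nodes, and `train_candidate` is a hypothetical placeholder for a real training job with a toy score.

```python
from concurrent.futures import ThreadPoolExecutor


def train_candidate(params):
    """Hypothetical stand-in for one model-training job.

    Returns (params, score); the toy score peaks when depth equals 4.
    """
    depth = params["depth"]
    score = 1.0 - abs(depth - 4) / 10.0
    return params, score


def train_on_workers(param_grid, max_workers=4):
    """Run every training job on a pool of workers and return the best result."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(train_candidate, param_grid))
    return max(results, key=lambda result: result[1])
```

Calling `train_on_workers([{"depth": d} for d in range(1, 8)])` trains the seven candidates concurrently and returns the best-scoring one.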

Enterprise Miner on SMP
[Diagram: SMP server]

Enterprise Miner on a Grid

Model Deployment
- Most important step in the process
- Often the most time-consuming task, with many manual steps involved
- Options: Batch, On-Demand, Real-Time

Ways to Deploy Data Mining Models in SAS
- Batch
  - Deploy EM SAS score code directly
  - Integrate SAS EM score code using the Mining Results plug-in in ETL Studio
- Interactive
  - Score within Enterprise Miner
  - Use Stored Processes to score a model on demand
  - Use the Scoring Task in Enterprise Guide 4
- Real-Time
  - Integrate score code with operational systems using SAS Integration Technologies
  - C or Java score code
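
The deployment modes listed above differ mainly in how the same scoring logic is invoked. The sketch below illustrates that with a hand-written logistic score function. It is not generated SAS score code; the coefficients, input names, and output field names are all hypothetical.

```python
import math

# Hypothetical coefficients standing in for exported score code.
INTERCEPT = -2.0
WEIGHTS = {"debt_ratio": 3.0, "delinquencies": 0.8}


def score(record):
    """Logistic score for one record (a dict of input values)."""
    z = INTERCEPT + sum(w * record.get(name, 0.0) for name, w in WEIGHTS.items())
    return 1.0 / (1.0 + math.exp(-z))


def score_batch(records):
    """Batch deployment: score a whole table and append the score column."""
    return [dict(record, p_default=score(record)) for record in records]


def score_on_demand(record, threshold=0.5):
    """On-demand or real-time deployment: score one record, return a decision."""
    p = score(record)
    return {"p_default": p, "flag": p >= threshold}
```

Keeping a single `score` function behind both entry points reflects the idea that one model should produce identical results whichever channel delivers it.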

Integration: SAS Enterprise Miner - SAS ETL Studio
1. Register Training Set
2. Retrieve Training Set
3. Register Results Package
4. Mining Results Transform
[Diagram labels: Data Manager (Data Preparation, Deployment Services, Report Administration); Data Miner (Exploratory Analysis, Descriptive Segmentation, Predictive Modeling); Data Aggregation; Metadata Server; Model Development; Model Deployment]

Integration: SAS Enterprise Miner - SAS ETL Studio
Batch Scoring (ETL Studio)
- Use the Mining Results plug-in to register EM models for scoring
- Define a Process for Batch Scoring
Available Now

Stored Processes for Scoring on Demand

Scoring SAS Enterprise Miner Models Interactively in Enterprise Guide 4.1
- Currently Early Adopter
- Production in October 2005
[Screenshot labels: HMEQ Scoring Model; Scoring Model Output Data]

SAS Model Manager
- New solution to address the gap between the model development and model scoring environments
- Addresses:
  - Increased amount of models and data
  - Model selection (Challenger, Champion, Retired)
  - Different computing environments for training and scoring
  - Multi-channel delivery: batch, interactive, on-demand
[Diagram labels: SAS Enterprise Miner; Model Development & Model Scoring; SAS Model Manager]
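
The Challenger, Champion, and Retired statuses can be pictured as a small state machine over registered models. The sketch below is an illustration only and not the SAS Model Manager API; the class names, the `validation_score` field, and the promotion rule (the highest score wins) are assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Model:
    name: str
    validation_score: float
    status: str = "challenger"  # one of: challenger, champion, retired


@dataclass
class ModelProject:
    models: list = field(default_factory=list)

    def register(self, model):
        """Add a newly developed model; it starts as a challenger."""
        self.models.append(model)

    def champion(self):
        """Return the current champion, or None if none has been selected."""
        return next((m for m in self.models if m.status == "champion"), None)

    def promote_best(self):
        """Promote the best-scoring non-retired model; retire the old champion."""
        candidates = [m for m in self.models if m.status != "retired"]
        if not candidates:
            return None
        best = max(candidates, key=lambda m: m.validation_score)
        current = self.champion()
        if current is not None and current is not best:
            current.status = "retired"
        best.status = "champion"
        return best
```

In practice the promotion decision would rest on model testing and tracking in the production environment rather than on a single validation score.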

Model Lifecycle Management
[Diagram labels: Development Environment (SAS Enterprise Miner, SAS Credit Scoring, SAS/STAT, Base SAS); Score Code; Model Registration; Champion Model Selection; Model Testing; Model Deployment; Model Tracking; Model Retirement; Production Environment (Interactive, Batch, Real Time)]

SAS Model Management Studio
- Client Interface
- Customizable Project Hierarchy
- Champion and Challenger Models
- Model Scoring Code and Metadata

Timeline - SAS Model Deployment Studio
- Summer 2005: MDS 1.1 for Development Partners
- Winter 2005: MDS 2.1 for Early Adopters
- Spring 2006: MDS 2.1 Production