Real data science, fast and simple.

1 RapidMiner Overview Real data science, fast and simple.

2 RapidMiner Highlights
By the numbers: #1 Data Science Platform; 200, Engaged Community Members; Global Clients; Channel Partners
Analysts: Leader 2014, 2015, 2016 & 2017, Gartner Magic Quadrant for Data Science Platforms; Leader 2017, Predictive Analytics & Machine Learning; #1 Open-Source Platform, last five years in a row, Data Mining & Analytics Software Poll; Innovation Winner 2015, Wisdom of Crowds for Advanced & Predictive Analytics, Big Data Analytics & End-User Data Prep
Accolades: CB Insights, The AI 100, Startups Using Artificial Intelligence to Transform Industries; Ventana Research 2016 Technology Innovation Awards, Winner, Predictive Analytics

3 Insight Without Action Has No Value
Analytics 1.0 (Descriptive): Business Intelligence, Database Sums & Counts, Historical Information
Analytics 2.0 (Diagnostic): Data Visualization, Analytic Data Marts, Drilldown, Current Insight
Analytics 3.0* (Predictive & Prescriptive): Data Science, Big Data, Machine Learning, Human / Automated Actions
Progression: Passive, Reactive, Proactive
* First referenced by Thomas H Davenport, HBR December

4 High Value Use Cases Need Real Data Science
Industries: Automotive, Life Sciences, Retail & Consumer Goods, Government, Banking, Manufacturing, Telco, e-health, Insurance, Oil & Gas, Utilities, Travel, Transport & Logistics
Customer Analytics (Drive Revenue): Customer Acquisition, Cross-sell/Upsell, Offer Optimization, Retention & Loyalty, Win back, Channel / Mix Optimization, Web Analytics, Pricing Optimization
Operational Analytics (Reduce Costs): Supply Chain Optimization, Manufacturing Operations, Asset Performance, Process Engineering, Capacity Planning, Call Center Operations, Retail Store Operations, Predictive Maintenance, IT Operations
Risk Analytics (Avoid Risks): Credit Scoring, Insurance Underwriting, Capital Planning, Stress Testing, Fraud Detection, Anti-Money Laundering, Rogue Trading, Cyber Security, Compliance
+50% New revenue opportunities*; -34% Realized cost savings*; +46% Increased profitability*
*Ventana Research Next Generation Predictive Analytics Benchmark Research,

5 Lightning-Fast Unified Platform
Incorporate all types of data
Data Prep: Speed & optimize ALL data exploration, blending & cleansing tasks (Data selection, Data cleaning, Data integration, Data formatting, Data exploration)
Model & Validate: Apply machine learning to rapidly prototype & confidently validate predictive models (Modeling, Cross validation, Model optimization, Model management, Model export)
Operationalize: Easily deploy & maintain models and embed analytic results (Model deployment, Scoring as web service, Model monitoring, Reporting and visualization, Maintenance)
Embed results in all types of business apps & data visualization tools
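The "Cross validation" step listed under Model & Validate can be illustrated with a minimal, library-free sketch. This is not RapidMiner's implementation: the toy labels, the majority-class baseline "model" and the function names are all assumptions made for illustration.

```python
from collections import Counter


def k_fold_indices(n, k):
    """Split the indices 0..n-1 into k contiguous folds of near-equal size."""
    base, extra = divmod(n, k)
    folds, start = [], 0
    for i in range(k):
        size = base + (1 if i < extra else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def majority_class(labels):
    """Hypothetical baseline model: always predict the most common training label."""
    return Counter(labels).most_common(1)[0][0]


def cross_validate(labels, k=5):
    """Return the held-out accuracy of the baseline model on each of the k folds."""
    scores = []
    for test_idx in k_fold_indices(len(labels), k):
        held_out = set(test_idx)
        train = [labels[i] for i in range(len(labels)) if i not in held_out]
        prediction = majority_class(train)
        correct = sum(1 for i in test_idx if labels[i] == prediction)
        scores.append(correct / len(test_idx))
    return scores


# Toy data (assumed): 8 customers who stayed, 2 who churned.
labels = ["stay"] * 8 + ["churn"] * 2
scores = cross_validate(labels, k=5)
mean_accuracy = sum(scores) / len(scores)
print(scores, mean_accuracy)
```

Each fold is scored only on records the model did not see during training, which is why the averaged score is a more honest performance estimate than accuracy measured on the training data.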

6 The RapidMiner Competitive Advantage
Unified Platform: Prototype, Substantiate, Operationalize; seamless, high performance orchestration
Lightning Fast Data Science: Powerful, visual & guided use of 1,500 data prep and machine learning functions & third party libraries
#1 Marketplace for Data Science Expertise: On-demand consultants, algorithms & extensions; global presence & domain expertise in every industry

7 RapidMiner Platform & Pricing (1 year subscription shown)
RapidMiner Studio: Visual Workflow Designer; Guided Analytics & Reusable Processes; Wealth of Predictive Algorithms & Functions
Studio Free; Studio Small $2,500 per user; Studio Medium $5,000 per user; Studio Large $10,000 per user
Data rows: 10, ,000 1,000,000 Unlimited. Performance: 10x+, 4x, 2x. Unlimited cores.
RapidMiner Radoop: Execute Data Science Workflows Seamlessly on Hadoop; Analysis upon the full breadth & variety of stored big data
Radoop Free: 70+ native Hadoop operators only
Radoop Enterprise: First user $15,000; each additional user $5,000; executes all RapidMiner functions plus 70+ native Hadoop operators
RapidMiner Server: Collaborate & Share; Compute; Integrate; Operationalize
Server Free; Server Small $15,000 per instance; Server Medium $30,000 per instance; Server Large $60,000 per instance
Data rows: 10, ,000 1,000,000 Unlimited. Unlimited cores.
Free product versions receive community support. Row limits in Studio apply when using Server or Radoop, which limits the data a user can use.
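As a worked example of the Radoop Enterprise pricing quoted on this slide ($15,000 for the first user, $5,000 for each additional user, 1 year subscription), the annual cost for a team can be sketched as below. The function name and the team sizes are illustrative assumptions; only the two prices come from the slide.

```python
FIRST_USER_USD = 15_000       # Radoop Enterprise, first user (from the slide)
ADDITIONAL_USER_USD = 5_000   # Radoop Enterprise, each additional user (from the slide)


def radoop_enterprise_annual_cost(users):
    """Annual subscription cost in USD for a given number of Radoop Enterprise users."""
    if users < 1:
        raise ValueError("at least one user is required")
    return FIRST_USER_USD + ADDITIONAL_USER_USD * (users - 1)


# Hypothetical team sizes.
for team_size in (1, 4, 10):
    print(team_size, radoop_enterprise_annual_cost(team_size))
```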

8 Get Successful with RapidMiner
Get Guidance: Attend product workshops and ask questions of product experts as you build your first machine learning workflows
Get Started: Jumpstart your enablement and get started fast with free self-service tutorials, videos and the daily demo
Get Educated & Certified: Develop the essential skills to be successful with the RapidMiner product suite. Live Online (virtual instructor-led); Self-Paced Online (learn when convenient); Classroom (face-to-face at our or your office)
Get Successful: Utilize the experience and expertise of the RapidMiner Customer Success Team. Customer orientation; Installation support & guidance; Implementation planning; Use case, architecture, best practices; Training, Certification & Services needs; Quarterly reviews
Get Connected & Contribute: Connect to the RapidMiner community to learn, share and contribute. 200,000+ members, 34,000+ posts; innumerable external blogs, articles, scientific papers & books
Resources: Community & Blogs; Books; Videos & In-Product Tutorials; Webinars; Demos & Documentation

9 RapidMiner Partner Network: Technology; Value Added Resellers; Systems Integrators; OEM; Global Partners

10 Real data science, fast and simple.
RapidMiner Inc., 10 Milk Street, 11th Floor, Boston, MA
Boston, Budapest, Dortmund, London

11 Additional Content

12 RapidMiner Data Science Impact
Bridge the Data Science Skills Gap (Chief Analytics Officer): Empower operational workers to consume data science in their routine decision making
Operationalize Competitive Advantage (Chief Executive Officer): Leverage prescriptive analytics in all your decisions to achieve better outcomes
Build Better Predictive Models Faster (Coding Data Scientist): Accelerate the creation of high-value data science while streamlining low-value tasks
Easily Use Predictive Analytics (Applied Data Scientist): Confidently extract the hidden value from your data using intuitive predictive analytics
Figures: 39% Improved customer service*; 50% Created new revenue opportunities*; 46% Increased profitability*; 95% faster; 5-10x data science capability
*Ventana Research Next-Generation Predictive Analytics Benchmark Research,

13 The RapidMiner Data Science Platform
RapidMiner Marketplaces: On-demand Innovation & Execution
Extensive Domain Expertise: Expert marketplace of certified RapidMiner skills
Plug-ins, Algorithms, Extensions: Product Marketplace to extend and innovate
RapidMiner Studio: Lightning Fast Real Data Science, Code Optional
Data Access: Connect to any data source, any format, at any scale
Data Exploration: Quickly discover patterns or data quality issues
Data Prep: Speed & optimize ALL data exploration, blending & cleansing tasks
Modeling: Efficiently build and deliver better models faster
Validation: Confidently & accurately estimate model performance
RapidMiner Server: Seamless Deployment, Management & Collaboration
Collaboration: Connect to any data source, any format, at any scale
Computation: Quickly discover patterns or data quality issues
Scheduling: Speed & optimize ALL data exploration, blending & cleansing tasks
Integration: Efficiently build and deliver better models faster
Management: Confidently & accurately estimate model performance
RapidMiner Radoop: Simplified, Intelligent Big Data Science & Machine Learning
Simplified Analytics: Reduces Hadoop complexity
Lightning Fast: Covers complete analytics lifecycle
Broad Data Access: Eliminate connectivity struggles
Integrated Security: Ensure security compliance
Optimized for Hadoop: Leverage Hadoop distributed power
Scalable Processing: Process in-Hadoop and in-memory
Spark Execution: Execute RapidMiner subprocesses in parallel

14 The RapidMiner Platform
RapidMiner Studio (Visual Workflow Designer): Workflow Builder; Process Execution Engine; RapidMiner Radoop (Compile + Execute in Hadoop)
RapidMiner Market Place: Industry, Application & ML Extensions
RapidMiner Server (Collaborate + Compute + Deploy + Maintain): Web Services; RapidMiner Web Applications; Data and Process Repository; User/Group Access Rights management; Web App Portal; Process Scheduler; Process Execution Engine; RapidMiner Radoop (Compile + Execute in Hadoop)
Integrate using Web Services, JSON, SQL: Server Application; Databases / Data warehouses; Java SE/EE Application; Application (BI, ERP, CRM) / Portal
Incorporate all types of data
Run in multiple Compute Engines: R / Python / SQL Scripting; In-Memory H2O / Weka; In-Hadoop & Spark

15 RapidMiner Studio: All-In-One Data Science Workflow Designer
Lightning Fast: Visual interface for rapidly building complete analytic workflows
Powerful: Rich library of algorithms and functions to build the strongest possible model for any use case
Open & Extensible: Open source innovation keeps pace with changing business needs

16 RapidMiner Server: Operationalization & Collaboration Management
Team Collaboration: Central repository facilitates sharing of data sources, analytic processes & best practices
Frictionless Operationalization: Flexible execution options streamline deployment, maintenance & embedding of analysis
Dynamic & Continuous Model Management: Individual and customizable processes to check for accuracy drifts or shifts
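The idea of checking a deployed model for accuracy drift can be sketched in a few lines. This is a generic illustration rather than RapidMiner Server's mechanism: the baseline accuracy, the 5-point tolerance, the sample predictions and the function names are all assumptions.

```python
def accuracy(predictions, actuals):
    """Share of predictions that match the observed outcomes."""
    matches = sum(1 for p, a in zip(predictions, actuals) if p == a)
    return matches / len(actuals)


def has_drifted(baseline_accuracy, recent_accuracy, tolerance=0.05):
    """Flag drift when recent accuracy falls more than `tolerance` below the baseline."""
    return (baseline_accuracy - recent_accuracy) > tolerance


# Hypothetical values: accuracy measured at validation time vs. a recent batch.
baseline = 0.90
recent_predictions = ["churn", "stay", "stay", "churn", "stay",
                      "stay", "stay", "churn", "stay", "stay"]
recent_actuals = ["churn", "stay", "churn", "churn", "stay",
                  "stay", "stay", "stay", "stay", "stay"]

recent = accuracy(recent_predictions, recent_actuals)
drifted = has_drifted(baseline, recent)
print(recent, drifted)
```

A scheduled process could run such a check on each new batch of scored records and trigger retraining or an alert when the flag is raised.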

17 RapidMiner Radoop: Extends RapidMiner's visual workflow to Hadoop
Hadoop made easy: Translates data science workflows into Hadoop so data scientists concentrate on analytics, not Hadoop programming
In-Hadoop Execution: Pushes analytic instructions into Hadoop for computation
Secure: Complies with Hadoop security standards

18 Sample Use Cases
Telco, Austria: Automated Customer Feedback Text Analysis for Automated Categorization & Routing
Payments, Worldwide: Customer feedback & voice of the customer, churn prevention, text mining, automated text categorization, and sentiment analysis for customer support and satisfaction, to prevent customer churn
Telco, Germany: Automated Online Market Research, Text Analytics, Sentiment Analysis, Customer Insight
Telco, Austria: Optimize customer support by automatically categorizing unstructured data by content, to prioritize and to reduce response time and cost, increasing customer satisfaction
Telco, Switzerland: Server & Equipment Load Forecasting, Predictive Maintenance, Predicting & Preventing Server & Component Failures
Telco, Europe: CRM applications including optimization of direct marketing campaigns, automated generation of product recommendations for cross-selling and up-selling, customer churn prevention, and fraud detection
Telco, Hungary: Customer Relationship Analytics, Churn Prediction & Prevention, Direct Marketing Campaign Optimization, Scheduling & Automated Execution of ETL Tasks
Marketing, Germany: Automated Online Market Research, Text & Sentiment Analysis, Customer Insight, Competitive Intelligence
Market Research, Worldwide: Prediction of sales volumes; CRM optimization; social media monitoring and sentiment analysis
Telco, Germany: Fraud Detection & Prevention
Payments, Worldwide: Sentiment Analysis of online text sources, including social media and other user generated content, for customer care triage
OEM, Europe: Fraud Detection & Prevention Solutions for Telecoms

19 Sample Customer Use Cases
Multiple Customers, Industries: Automated Customer Feedback Text Analysis for Automated / Social Media, Categorization, Triage & Routing
Payments, Worldwide: Sentiment Analysis of online text sources, including social media and other user generated content, for customer care triage
Partner, Europe: Smart meter installation optimization as a service; maximize first time visit success
Payments, Russia: Fraud detection in retail network; historical data on service usage, transaction history, customer profiles, usage logs, and known cases of fraudulent behavior
Market Research, Worldwide: Prediction of sales volumes; CRM optimization; social media monitoring and sentiment analysis
Org: Automated Customer Feedback Text Analysis for Automated Categorization & Routing
Telco, Europe: CRM applications including optimization of direct marketing campaigns, automated generation of product recommendations for cross-selling and up-selling, customer churn prevention, and fraud detection

20 Sample Customer Use Cases
Voice of the Customer: Automated Customer Feedback Text Analysis for Automated / Social Media, Categorization, Triage & Routing
Manufacturing Production Optimization: Optimization of Production Logistics & Flows, Quality, Yield, Product Mix, Process Mining
Manufacturing Predictive Maintenance: High Value Assets (Silicon, Cars, Trucks, Aircraft, Turbines, IT Infrastructure)
Fraud Detection: Fraud detection in retail network; historical data on service usage, transaction history, customer profiles, usage logs, and known cases of fraudulent behavior
Maximizing Customer Lifetime Value: CRM applications including optimization of direct marketing campaigns, automated generation of product recommendations for cross-selling and up-selling, customer churn prevention, and fraud detection

21 Safeguarding Electronic Payments: Anticipating the risk of fraud
Russia's largest electronic payment service
The Challenge: Protecting against fraud and anticipating risk 7x24; large and diverse set of partners (merchants), over 70,0000; how to classify and check merchant ecommerce sites for payment system compliance?
RapidMiner Solution: Analyze, classify and check merchants' ecommerce sites for compliance; utilize text mining with NLP to auto-categorize with high sentiment accuracy; mash up the widest data sets (historical data on service usage, transaction history, customer profiles, usage logs, and known cases of fraudulent behavior); detect anomalies, misuse and fraud through an operationalized classification model
Outcome: Only 8-10% of merchant sites now screened manually at an 80% confidence threshold; accurate automated analysis of high risk sites, with 92% correctly classified; elimination of false positives, with no normal sites classified as high risk; time and cost to resolve a fraud case radically reduced
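The outcome above hinges on a confidence threshold: sites the classifier is sure about are handled automatically, and the rest go to manual screening. A minimal sketch of that routing rule follows. The site names, labels and confidence scores are invented for illustration, and the function is a hypothetical helper, not part of the customer's solution; only the 80% threshold comes from the slide.

```python
def route_sites(scored_sites, threshold=0.80):
    """Split classifier output into automated decisions and a manual review queue.

    scored_sites: iterable of (site, predicted_label, confidence) tuples.
    """
    automated, manual = [], []
    for site, label, confidence in scored_sites:
        if confidence >= threshold:
            automated.append((site, label))   # accept the model's label as-is
        else:
            manual.append(site)               # too uncertain: send to a human reviewer
    return automated, manual


# Hypothetical classifier output.
scored_sites = [
    ("shop-a.example", "normal", 0.97),
    ("shop-b.example", "high_risk", 0.91),
    ("shop-c.example", "normal", 0.62),
    ("shop-d.example", "normal", 0.88),
    ("shop-e.example", "high_risk", 0.80),
]

automated, manual = route_sites(scored_sites)
manual_share = len(manual) / len(scored_sites)
print(manual, manual_share)
```

Raising the threshold sends more sites to manual review and lowers the risk of acting on a wrong label; lowering it does the opposite.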

22 Repeat Business through Marketing Efficacy: Identify upsell offers through deep customer analytics
Large North American restaurant delivery chain
The Challenge: Industry with tight margins & intense competition; broad array of online & mobile channels for customers to place orders; goal to improve marketing offers and create more repeat business
RapidMiner Solution: Capture a vast array of customer ordering data from multiple online & mobile phone channels; use RapidMiner to join & enrich data with 3rd-party demographics & competitive data; use data science to assess performance and growth drivers at individual stores & franchise groups; results used to tailor coupons & upsell offers to customers
Outcome: Greater flow of repeat customers, driving growth at individual stores and franchise groups; far outpaced the industry, posting the best Q2 & Q3 domestic same-store sales growth of the 25 largest restaurant chains in the U.S.
Next steps: RapidMiner Radoop

23 Customer Satisfaction through Quality of Service: Customer experience begins with network quality
Leading European Telecoms Provider
The Challenge: Backend infrastructure footprint & costs increasing yearly; customer satisfaction driven by service quality in areas such as video streaming latency; network operation teams must accelerate root cause analysis and reduce time to repair; data visualization with big data alone cannot provide the operationalized insight needed
RapidMiner Solution: Secure, large scale Hortonworks Hadoop Big Data Hub architecture to leverage data lakes; correlation of log events with historical log data to preempt service quality degradation; machine learning to rapidly predict demand as consumer usage patterns change; text mining to optimize help desk ticket triage and processing
Outcome: Reduced infrastructure requirements (-10%); improved customer retention (2%+); IT Operations costs reduced (-30%)

24 Drive Data Science Agility & Cut Costs: Faster development & deployment of customer analytics models
Leading North American Financial Services Institution
The Challenge: Existing data science teams looking to replace SAS; strong dislike of the unwieldy SAS platform, with the coding & complexity of its multiple applications & user interfaces; cost of SAS too high
RapidMiner Solution: Pull together customer data from across a number of internal databases & third-party sources; easily incorporate a large library of legacy predictive models written in R & Python; small team of 4 data scientists using collaboration features in RapidMiner Server to share data prep and machine learning processes
Outcome: Improved upsell opportunities and customer retention; faster data prep, rapid prototyping & validation of models compared with SAS methods and coding-only methods; expansion into the Risk department, where the data science team doesn't code in SAS, R or Python

25 Gartner & Forrester: RapidMiner a Clear Leader
2017 Magic Quadrant for Data Science Platforms:
"a Leader, owing to its market presence, the volume of client inquiries that Gartner receives about it, its user community, and its well-rounded product that addresses most data science use cases well."
"Reference customers praised many facets of the platform: its large selection of algorithms, flexible modeling capabilities, data source integration and consequent data preparation."
"The platform's strength lies not just in particular areas, but also in its all-around consistency."
PAML Wave:
"RapidMiner wraps breadth and depth in a beautiful package."
"RapidMiner invested heavily to revamp visual interface to make it the most concise and fluid that we have seen during this evaluation. Add to that, RapidMiner's comprehensive set of operators that encapsulate a wide range of data prep, analytical, and modeling functionality to increase productivity of data scientists."

26 Peer Insights: True Expert Validation
Business Software and Services Reviews: Top Predictive Analytics Products by Enterprise reviewers
Verified software ratings and reviews from your enterprise IT peers: Reviews for Advanced Analytics Platforms