Copyright 2015, Oracle and/or its affiliates. All rights reserved.

Similar documents
Oracle Big Data Discovery The Visual Face of Big Data

Oracle Big Data Discovery Cloud Service

Brian Macdonald Big Data & Analytics Specialist - Oracle

Hadoop Course Content


Analytics in Action transforming the way we use and consume information

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden

Bringing the Power of SAS to Hadoop Title

Safe Harbor Statement

BIG DATA AND HADOOP DEVELOPER

Big Data The Big Story

GET MORE VALUE OUT OF BIG DATA

Transforming Analytics with Cloudera Data Science WorkBench

Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

20775A: Performing Data Engineering on Microsoft HD Insight

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

SAP Predictive Analytics Suite

20775A: Performing Data Engineering on Microsoft HD Insight

Garanti Bank s Journey to Big Data Ayşen Büyükakın Business Intelligence & Analytics Unit Manager

Cask Data Application Platform (CDAP) Extensions

Adobe and Hadoop Integration

20775 Performing Data Engineering on Microsoft HD Insight

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Business is being transformed by three trends

Adobe and Hadoop Integration

Oracle 全数据平台解决方案 : 打破技术壁垒, 释放数据能量. Sally Piao 甲骨文公司全球研发副总裁

Big Data & Hadoop Advance

WELCOME TO. Cloud Data Services: The Art of the Possible

20775: Performing Data Engineering on Microsoft HD Insight

Microsoft Azure Essentials

Architecting an Open Data Lake for the Enterprise

Common Customer Use Cases in FSI

Confidential

Sr. Sergio Rodríguez de Guzmán CTO PUE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA.

Oracle Retail Data Model (ORDM) Overview

5th Annual. Cloudera, Inc. All rights reserved.

Real-time Streaming Insight & Time Series Data Analytic For Smart Retail

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration

Oracle Big Data Cloud Service

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

Meta-Managed Data Exploration Framework and Architecture

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper

Data Ingestion in. Adobe Experience Platform

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation

DataAdapt Active Insight

Pentaho Technical Overview. Max Felber Solution Engineer September 22, 2016

Azure ML Data Camp. Ivan Kosyakov MTC Architect, Ph.D. Microsoft Technology Centers Microsoft Technology Centers. Experience the Microsoft Cloud

Architecture Optimization for the new Data Warehouse. Cloudera, Inc. All rights reserved.

The Industry Leader in Data Warehousing, Big Data Analytics, and Marketing Solutions

AZURE HDINSIGHT. Azure Machine Learning Track Marek Chmel

Copyright 2013 Oracle and/or its affiliates. All rights reserved.

Big Data Analytics met Hadoop

Datameer for Data Preparation: Empowering Your Business Analysts

COMP9321 Web Application Engineering

Practices of Business Intelligence. (Business Intelligence, Analytics, and Data Science)

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW

Hortonworks Data Platform

Analytics for All Your Data: Cloud Essentials. Pervasive Insight in the World of Cloud

Apache Hadoop in the Datacenter and Cloud

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services

Two offerings which interoperate really well

Analytics in the Digital Economy data, experience, ideas & people. Juergen Hagedorn, Viktor Kehayov Product Management, SAP Analytics March 2017

Hadoop Integration Deep Dive

NICE Customer Engagement Analytics - Architecture Whitepaper

Big Data Platform Implementation

Hortonworks Connected Data Platforms

TDWI Analytics Fundamentals. Course Outline. Module One: Concepts of Analytics

MicroStrategy 10. Adam Leno Technical Architect NDM Technologies

Safe Harbor Statement

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud

Copyright 2014, Oracle and/or its affiliates. All rights reserved

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

Modernizing Your Data Warehouse with Azure

DATASHEET. Tarams Business Intelligence. Services Data sheet

Insights-Driven Operations with SAP HANA and Cloudera Enterprise

Nouvelle Génération de l infrastructure Data Warehouse et d Analyses

Drive Better Insights with Oracle Analytics Cloud

Hybrid Data Management

Big data is hard. Top 3 Challenges To Adopting Big Data

Enterprise Command Center

Building Your Big Data Team

Your Top 5 Reasons Why You Should Choose SAP Data Hub INTERNAL

Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform

Predictive Analytics Reimagined for the Digital Enterprise

NFLABS SIMPLIFYING BIG DATA. Real &me, interac&ve data analy&cs pla4orm for Hadoop

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Improving Healthcare Payer Performance with Big Data

Advancing Information Management and Analysis with Entity Resolution. Whitepaper ADVANCING INFORMATION MANAGEMENT AND ANALYSIS WITH ENTITY RESOLUTION

Spark and Hadoop Perfect Together

Angat Pinoy. Angat Negosyo. Angat Pilipinas.

The Alpine Data Platform

Building a Single Source of Truth across the Enterprise An Integrated Solution

Governing Big Data and Hadoop

Analytics for All Data

Transcription:

Copyright 2015, Oracle and/or its affiliates. All rights reserved.

Finding new business potential with Big Data Analytics Carsten Frisch Oracle Business Analytics DOAG 2015 Business Solutions Conference Darmstadt, 10. Juni 2015 Copyright 2015, Oracle and/or its its affiliates. All All rights reserved.

Referent» Carsten Frisch» Senior Sales Consultant» Business Analytics Big Data Discovery Lead - DE/CH Cluster» Kontakt +49 (0)6103 397-380» carsten.frisch@oracle.com Copyright 2015, Oracle and/or its affiliates. All rights reserved.

Safe Harbor Statement Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information The following purposes is intended only, to and outline may our not general be incorporated product direction. into any contract. It is intended It is not for a commitment information purposes to deliver only, any material, and may not code, be or incorporated functionality, into and any should contract. not be It is relied not a upon in commitment making purchasing to deliver decisions. any material, The development, code, or functionality, release, and and timing should of not any be features relied upon or functionality in making purchasing described decisions. for Oracle s The products development, remains release, at the and sole timing discretion of any of Oracle. features or functionality described for Oracle s products remains at the sole discretion of Oracle. Copyright 2015, Oracle and/or its affiliates. All rights reserved. 4

Copyright 2015, Oracle and/or its affiliates. All rights reserved. 5

Monetizing New Insights Business Cases for Big Data and the Discovery Lab Copyright 2015, Oracle and/or its affiliates. All rights reserved. 6

Financial Services Copyright 2015, Oracle and/or its affiliates. All rights reserved. 7

Enabling Rich Customer Experience Across Channels Is A Key Focus For Banks Customers have become more Email Mail demanding and their loyalties Sales are diffused with low-switching Branch 360 degree view of customer Phone costs. The customer experience expectations for banking services (across channels) are being reset by the experiences Online Mobile being provided by retailers and ATM online providers elsewhere Source: Redefining Customer Experience, Infosys Whitepaper; PWC Report 2012 Copyright 2015, Oracle and/or its affiliates. All rights reserved. 8

Banks Need To Move Towards Personalization And Targeted Marketing To Enhance Customer Experience Top 3 Emerging Changes in Customer Behavior That Impact Banking (% of respondents) Using Direct and Self-Service Channels Seeking Better, More Personal Advice Price Sensitivity, Discount Seeking 49% 44% 63% Customer Demand. More personalized services, offers and enhanced customer experience. More relevant services and transparent access to information across all channels consistently. Increase simplicity, self-control, mobility of banking services Customers are making web / mobile as their primary channel of interaction with their banks. These channels are already heavily personalized and there is a rising demand for more personalized services and offers from customers Source: Enhancing The Banking Customer Value Proposition Through Technology-led Innovation, Accenture Copyright 2015, Oracle and/or its affiliates. All rights reserved. 9

Market Challenges Are Compelling Banks To Focus On Customer Insight And Real-Time Offers INDUSTRY CHALLENGES ENHANCE CUSTOMER EXPERIENCE REAL-TIME OFFERS KEY BIG DATA CAPABILITIES Develop deep client relationships by offering superior service Analyze internal customer logs and social media activity to generate indications of customer dissatisfaction allowing time to act Analyze behavior profiles, spending habits, and segmentation to gain view on customer risk and enhance risk management capabilities Generate real-time, context sensitive, targeted offers based on analytical insights on spending patterns Rapid time to market and improved customer value Leverage insights from social media during various stages of product and service development OPTIMIZE OPERATIONS Source: Oracle Financial Services Industry Solutions Overview; Oracle Insight; PWC Report Provide more visibility into performance in order to facilitate timely and cost effective management of operations Discover opportunities to achieve greater efficiency across global operations Understand and forecast performance and drive strategies that improve operations and financial results Copyright 2015, Oracle and/or its affiliates. All rights reserved. 10

Leveraging Big Data for Competitive Advantage in FS Customer Insight Data Monetisation Optimise Operations Customer Insight Social Media Sentiment & Engagement Big Data Augmentation Real Time Offers Context Sensitive Offers / Ads Location Based Offers / Ads Compliance Processes Personalised Services New Product Launch New Revenue Streams Information as a Service Fraud Detection Risk Management Fast Data Quality of Models Financial risk Security risk Copyright 2015, Oracle and/or its affiliates. All rights reserved.

Digital Business, Data-driven Decisioning Copyright 2015, Oracle and/or its affiliates. All rights reserved. 12

Characteristics of Digital Business Leaders They Reframe Challenges Looking at them from new perspectives and multiple angles They Sprint They work at pace - researching, testing and evaluating current ideas while generating new ones They Appreciate That Failure Can Be Good and are not afraid of new ideas They Convert Data Into Value They invest heavily in analyzing their own data and data from external sources to establish patterns and un-noticed opportunities Copyright 2015, Oracle and/or its affiliates. All rights reserved. 13

Identify (business) question Verify earlier findings Design of a solution model Gather all necessary data Analyse the data Present & implement results Data-driven Decisions Become clear about all aspects of the decision to be taken or the problem to be solved. Try to identify alternatives to your perception Non-Analysts & Executives: should take a closer look on steps 1 and 6 of the analysis process if they plan to make use of statistical analysis. Find out who has investigated such or a similar problem in the past and the approach that has been taken Formulate a detailled hypothesis how specific variables might influence the result of the chosen model Gather all available information about the variables of your hypothesis. The relevance of a dataset might address your business question directly or needs to be derived Apply a statistical model and evaluate the correctness of the approach. Repeat this procedure until the right method has been identified. Data Science + Knowledge Discovery Frame the results obtained in a comprehensible story. This kind of presentation intends to motivate decision makers and relevant stake-holders to take action Adopted from Thomas H. Davenport, Harvard Business Manager 2013 Copyright 2015, Oracle and/or its affiliates. All rights reserved. 14

Vertical and Horizontal Data Science Skills Data Vertical Warehouse Deep technical skills Eigenvalues, Lasso-related regressions Experts in Bayesian networks, R Support Vector Machine Hadoop, NoSQL, Data Modeling, DW Machine Learning & Statistics Storytelling experts Horizontal Cross-discipline knowledge Visualization skills Programming experience Domain expertise Aware of pitfalls & rules of thumb Look for the individual Unicorn or build a Data Science Team? The Specialist The Unicorn Copyright 2015, Oracle and/or its affiliates. All rights reserved. 15

Enabling Data-driven Innovations in Organizations Business Analysts: Day-to-Day performance of a business unit Information Consumer: Reporting on individual transactions Automated Process: Decisions effecting execution of an indiv. transactions Executive: Decisions effecting strategy and direction Data Scientists: Information analysis to meet strategic goals Perf. Mgmt. Knowledge Discovery Dynamic Dashboards and Reports Volume and Fixed Reporting BICC ACC Knowledge Driven Business Process Insight Analytical Competence Center (ACC)» Separate group reporting to CxO. not part of a Business Intelligence Competence Center (BICC)» Mission: broadening the adoption of Analytics across the organization» Skilled resource pool of Data Scientists, Statisticians and Business Experts» Data-driven approach (not development-driven) with privileged access to enterprise data sources» Group will be assigned to projects for a limited time Copyright 2015, Oracle and/or its affiliates. All rights reserved. 16

Discovery Lab Copyright 2015, Oracle and/or its affiliates. All rights reserved. 17

Information Management Conceptual View Actionable Events Actionable Insights Actionable Information BICC Structured Enterprise Data Data Streams Event Engine Data Reservoir Data Factory Enterprise Information Store Business Intelligence Other Data Execution Innovation Line of governance ACC Events & Data Discovery Lab Discovery Output Source: Oracle White Paper Information Management and Big Data A Reference Architecture Copyright 2015, Oracle and/or its affiliates. All rights reserved. 18

Discovery Lab: Design Pattern ACC» Iterative development approach data oriented NOT development oriented» Small group of highly skilled individuals (aka Data Scientists or a team organized as an Analytical Competence Center, ACC) with privileged access to enterprise data sources» Specific focus on identifying commercial value for exploitation» Wide range of tools and techniques applied» Typically separate infrastructure but could also be unified Reservoir if resource managed effectively» Data provisioned through Data Factory or own ETL processes Copyright 2015, Oracle and/or its affiliates. All rights reserved. 20

Discovery Lab: Activity Cycles Copyright 2015, Oracle and/or its affiliates. All rights reserved. 21

Virtualisation & Information Services Discovery Lab: Data Provisioning Analysis Processing & Delivery Data Factory flow General BI flow The majority of BI development activity will be from existing sources performed by the BICC developing new reports to existing or new channels 1 Pre-Built Intelligence Assets Intelligence Analysis Tools Dashboards & Reports Scorecards Charts & Graphs Ad Hoc Query & Analysis Tools OLAP Tools Forecasting & Simulation Tools BICC Reporting Tools 2 Discovery Lab & Development Environment Sandbox Project 3 Query & Search Tools Raw Data Sandbox Project 2 Statistics Tools Sandbox Project 1 Data store Analytical Processing Data Science (Primary Toolset) Data Modelling Tools Programming & Scripting Data & Text Mining Tools Faceted Query Tools ACC ACC may quickly develop new reporting through mashups from any available internal and external sources and may used advanced analytical tools for innovative analysis Data Quality & Profiling Graphical rendering tools Copyright 2015, Oracle and/or its affiliates. All rights reserved. 22

Polystructured Data Structured Data Unified: Big Data Management and Analytics Experiment, Prototype, Collaborate Productize, Secure & Govern Exalytics Exadata Oracle BI Foundation Suite (ROLAP/MOLAP, Mobile, ) In-Memory Appliance Oracle Advanced Analytics (Data Mining, Oracle R Enterprise) Oracle Database Oracle SQL Queries Oracle Big Data SQL Tables in DB» Quickly find, explore, transform, analyze and share discoveries in Big Data Discovery» Publish results to the Hadoop Distributed File System (HDFS)» Use to build predictive models with Oracle R for Hadoop Experiment, Prototype & Collaborate BDA Data Warehouse Oracle Big Data Discovery Hadoop (HDFS) Data Reservoir Oracle R for Hadoop SQL join Tables in Hadoop Productize, Secure, Govern» Connect published HDFS files to secure Oracle DB using Oracle Big Data SQL» No data movement required» Seamlessly extends existing DWH and BI investments with non-traditional data in Hadoop Copyright 2015, Oracle and/or its affiliates. All rights reserved. 23

Need To Get Analytic Value Fast Data Uncertainty» Not familiar and overwhelming» Potential value not obvious» Requires significant manipulation Tool Complexity» Early Hadoop tools only for experts» Existing BI tools not designed for Hadoop» Emerging solutions lack broad capabilities 80% effort typically spent on evaluating and preparing data Overly dependent on scarce and highly skilled resources Copyright 2015, Oracle and/or its affiliates. All rights reserved. 24

Oracle Big Data Discovery Copyright 2015, Oracle and/or its affiliates. All rights reserved. 25

Oracle Big Data Discovery: The Visual Face of Hadoop find explore transform discover share Copyright 2015, Oracle and/or its affiliates. All rights reserved. 26

Oracle Big Data Discovery: Components Hadoop Cluster (Oracle Big Data Appliance or Commodity Hardware with Cloudera CDH 5.) BDD node name node data node data node data node data node Hadoop 2.x Metadata (HCatalog) Workload Mgmt (YARN) Filesystem (HDFS) Studio Oracle Big Data Discovery Workloads Web UI: Find, Explore, Transform, Discover, Share In-Memory Discovery Indexes DGraph: Search, Guided Navigation, Analytics Data Processing, Workflow & Monitoring Profiling: catalog entry creation, data type & language detection, schema configuration Sampling: dgraph (index) file creation Transforms: >100 functions Enrichments: location (geo), text (cleanup, sentiment, entity, key-phrase, whitelist tagging) Self-Service Provisioning & Data Transfer Personal Data: Upload CSV and XLS to HDFS Other Hadoop Workloads MapReduce Spark Hive Pig Oracle Big Data SQL (Oracle Big Data Appliance only) Copyright 2015, Oracle and/or its affiliates. All rights reserved. 27

Oracle Big Data Discovery: Preparation of Data Sources Have to be created as Hive Tables and registered in the Hive Metastore Hive Table with a standard Regex SerDe ( Serializer-Deserializer ) to map more complex file structures by using Regular Expressions into regular table columns Hive Table definition for fixed-width or delimited files Hive Table using a custom developed SerDe to map nested file structures of a JSON file into regular table columns Copyright 2015, Oracle and/or its affiliates. All rights reserved. 29

Oracle Big Data Discovery: Preparation of Data Sources There are multiple ways to get new Data Sets loaded Big Data Discovery Upload of XLS und CSV files and automatic Hive Table creation HUE (Hadoop User Experience) Upload of various file formats, table creation wizzards, web-based Hive Query Client Hive Command Line Interface is similar to the MySQL command line Copyright 2015, Oracle and/or its affiliates. All rights reserved. 30

Oracle Big Data Discovery: Preparation of Data Sources or by using your favorite Data Integration / ETL Tool File (FS/HDFS) IKM File To Hive (Load Data) IKM Hive Transform IKM Hive Control Append Hive LKM HBase to Hive Hive IKM File-Hive To Oracle (OLH, OSCH) Oracle DB IKM SQL to Hive- HBase-File (SQOOP) HBase IKM Hive to HBase Hive IKM File-Hive to SQL (SQOOP) Any RDBMS Hive HBase Any RDBMS Oracle Data Integrator 12.1.3 with Advanced Big Data Option (Supporting HDFS, Hive, HBase, Scoop, Pig, Spark) Copyright 2015, Oracle and/or its affiliates. All rights reserved. 31

Oracle Big Data Discovery: Data Ingestion Data Processing Workflow including Profiling and Enrichment access_logs 100m rows Hive / HCatalog access_logs 1 m rows Profiling and Enrichment Process access_logs 1 m rows BDD access_logs 1 m rows 1M of 100M Copyright 2015, Oracle and/or its affiliates. All rights reserved. 32

Demonstration Oracle Big Data Discovery Oracle Big Data Discovery Demonstration Copyright 2015, Oracle and/or its affiliates. All rights reserved. 35

Catalog» Access a rich, interactive catalog of all data in Hadoop» Familiar search and guided navigation for ease of use» See data set summaries, user annotation and recommendations» Provision personal and enterprise data to Hadoop via selfservice Copyright 2015, Oracle and/or its affiliates. All rights reserved. 36

Explore» Visualize all attributes by type» Sort attributes by information potential» Assess attribute statistics, data quality and outliers» Use scratch pad to uncover correlations between attributes Copyright 2015, Oracle and/or its affiliates. All rights reserved. 37

Transform» Intuitive, user driven data wrangling» Extensive library of powerful data transformations and enrichments» Preview results, undo, commit and replay transforms» Test on sample data then apply to full data set in Hadoop Copyright 2015, Oracle and/or its affiliates. All rights reserved. 38

Transform User friendly Preferred method for the Business Analyst Copyright 2015, Oracle and/or its affiliates. All rights reserved. 39

Transform but flexible (based on Groovy Programming Language / Library) Preferred Method for IT / Data Engineer / Data Scientist Copyright 2015, Oracle and/or its affiliates. All rights reserved. 40

Discover» Join and blend data for deeper perspectives» Easy usage - compose project pages via drag and drop» Use powerful search and guided navigation to ask questions» See new patterns in rich, interactive data visualizations Copyright 2015, Oracle and/or its affiliates. All rights reserved. 41

Share» Share projects, bookmarks and snapshots with others» Build galleries and tell big data stories» Collaborate and iterate as a team» Publish blended data to HDFS for leverage in other tools Copyright 2015, Oracle and/or its affiliates. All rights reserved. 42

Data Discovery & Analytics Copyright 2015, Oracle and/or its affiliates. All rights reserved. 43

Data Discovery & Analytics Lifecycle Typical Effort Copyright 2015, Oracle and/or its affiliates. All rights reserved. 44

Data Discovery & Analytics Lifecycle More Time left for Analysis and Interpretation of Results Copyright 2015, Oracle and/or its affiliates. All rights reserved. 45

% of Positive Responders Analytics: More Data Variety available Better Results Example: Marketing Campaigns Getting lift on responders Data Mining-based prediction results with Response Modelling including hundreds of input variables like:» Demographic data» Purchase POS transactional data» Polystructured data, text & comments» Spatial location data» Long term vs. recent historical behaviour» Web visits» Sensor data» 100 0 Population Size (% of Total Cases) Naïve Guess or Random Model with 20 variables Model with 75 variables Model with 250 variables 100 Copyright 2015, Oracle and/or its affiliates. All rights reserved. 46

Oracle Advanced Analytics Native SQL Data Mining/Analytic Functions + High-performance R Integration Oracle R Enterprise (ORE)» Allows distributed processing of huge data volumes» Benefits from DB features, e.g. Security and SQL access» R Studio = GUI for Data Analysts Oracle Data Mining (ODM)» Implemented in the Oracle Database kernel» Direct access via PL/SQL API and SQL operators» Oracle Data Miner GUI embedded in SQL Developer Copyright 2015, Oracle and/or its affiliates. All rights reserved. 47

Copyright 2015, Oracle and/or its affiliates. All rights reserved. 48