The Intersection of Big Data and DB2

Size: px
Start display at page:

Download "The Intersection of Big Data and DB2"

Transcription

1 The Intersection of Big Data and DB2 May 20, 2014 Mike McCarthy, IBM Big Data Channels Development

2 Agenda What is Big Data? Concepts Characteristics What is Hadoop Relational vs Hadoop Traditional vs Big Data Complementary Solutions and Big SQL Data Warehouse Augmentation Summary IBM Corporation

3 What is Big Data? All kinds of data Large volumes Valuable insight, but difficult to extract May be extremely time sensitive Big Data is a Hot Topic Because Technology Makes it Possible to Analyze ALL Available Data Big data technologies describe a new generation of technologies and architectures, designed to economically extract value from very large volumes of a wide variety of data, by enabling high velocity capture, discovery and/or analysis. 3 Source: Matt Eastwood, IDC 2013 IBM Corporation

4 Characteristics of Big Data V 4 = Volume Velocity Variety Veracity Cost efficiently processing the growing Volume 50x 35 ZB Responding to the increasing Velocity 30 Billion RFID sensors and counting Collectively analyzing the broadening Variety 80% of the worlds data is unstructured Establishing the Veracity of big data sources 1 in 3 business leaders don t trust the information they use to make decisions IBM Corporation

5 Big Data Sources Transactional & Application Data Machine Data Social Data Enterprise Content Volume Velocity Variety Variety Structured Semi-structured Highly unstructured Highly unstructured Throughput Ingestion Veracity Volume

6 Where Is This Data Coming From? 12+ TBs of tweet data every day 30 billion RFID tags today (1.3B in 2005) 4.6 billion camera phones world wide? TBs of data every day 100s of millions of GPS enabled devices sold annually 500+ TBs of log data every day 76 million smart meters in M by billion people on the Web by end IBM Corporation

7 Sources of Big Data Big data sources Respondents were asked which data sources are currently being collected and analyzed as part of active big data efforts within their organization.

8 What is Hadoop? Open source project Written in Java Optimized to handle Massive amounts of data through parallelism A variety of data (structured, unstructured, semi-structured) Using inexpensive commodity hardware Great performance Reliability provided through replication Not for OLTP, not for OLAP/DSS, good for Big Data Current version: 2.2.0

9 Hadoop / MapReduce timeline

10 Two Key Aspects of Hadoop MapReduce framework How Hadoop understands and assigns work to the nodes (machines) Hadoop Distributed File System = HDFS Where Hadoop stores data A file system that spans all the nodes in a Hadoop cluster It links together the file systems on many local nodes to make them into one big file system

11 Hadoop is not for all types of work Not to process transactions (random access) Not good when work cannot be parallelized Not good for low latency data access Not good for processing lots of small files Not good for intensive calculations with little data But, the technology is evolving IBM Corporation

12 Big Data Core Use Cases Resulting from High Value Initiatives Big Data Exploration - Find, visualize, understand ALL big data to improve business knowledge Complete View of the Customer - Achieve a true unified view, incorporating internal and external sources, to drive positive interactions Security/ Intelligence - Lower risk, detect fraud and monitor cyber security in realtime 12 IT Operations Analysis - Analyze a variety of machine data for improved business results Data Warehouse Augmentation -Integrate big data and data warehouse capabilities to increase operational efficiency

13 The IBM Big Data Platform Process any type of data Structured, unstructured, inmotion, at-rest Built-for-purpose engines Designed to handle different requirements Analyze data in motion Manage and govern data in the ecosystem Enterprise data integration Grow and evolve on current infrastructure Solutions Analytics and Decision Management Visualization & Discovery Hadoop System IBM Big Data Platform Application Development Accelerators Stream Computing Information Integration & Governance Big Data Infrastructure Systems Management Data Warehouse

14 RDBMS vs Hadoop RDBMS Hadoop Data sources Structured data with known schemas Data type Records, long fields, objects, XML Files Data Updates Updates allowed Unstructured and structured Only inserts and deletes Language SQL & XQuery Pig (Pig Latin), Hive (HiveQL), Jaql Processing type Data integrity Quick response, random access Data loss is not acceptable Security Security and auditing Partial Batch processing Data loss can happen sometimes Compress Sophisticated data compression Simple file compression Hardware Enterprise hardware Commodity hardware Data access Random access (indexing) History ~40 years of innovation < 5 years old Access files only (streaming) Community Widely used, abundant resources Not widely adopted yet IBM Corporation

15 Warehouse vs Hadoop Data Warehouse Hadoop Data sources Structured, high value data. Pre - Processed Data type Records, long fields, objects, XML Files Data Updates Updates allowed Unstructured and structured Only inserts and deletes Language Vendor specific Pig (Pig Latin), Hive (HiveQL), Jaql Processing type Data integrity Batch Processing Data loss is not acceptable Security Security and auditing Partial Batch processing Data loss can happen sometimes Compress Sophisticated data compression Simple file compression Hardware Enterprise hardware Commodity hardware Data access Random access (indexing) History ~20 years of innovation < 5 years old Access files only (streaming) Community Widely used, abundant resources Not widely adopted yet IBM Corporation

16 Merging the Traditional and Big Data Approaches Traditional Approach Structured & Repeatable Analysis Big Data Approach Iterative & Exploratory Analysis Business Users Determine what question to ask IT Delivers a platform to enable creative discovery IT Structures the data to answer that question Business Explores what questions could be asked Monthly sales reports Profitability analysis Customer surveys Brand sentiment Product strategy Maximum asset utilization IBM IBM Corporation

17 Big Difference: Schema on Run Regular database Schema on load Big Data (Hadoop) Schema on run Raw data Raw data Schema to filter Storage (unfiltered, raw data) Schema to filter Storage (pre-filtered data) Output IBM Corporation

18 Complementary Analytics Traditional Approach Structured, analytical, logical New Approach Creative, holistic thought, intuition Transaction Data Data Warehouse Hadoop Streams Web Logs Internal App Data Structured Repeatable Mainframe Data Linear Monthly sales reports Profitability analysis Customer surveys OLTP System Data Structured Repeatable Linear Enterprise Integration Unstructured Exploratory Iterative Social Data Unstructured Exploratory Iterative Text Data: s Brand sentiment Product strategy Maximum asset utilization Sensor data: images ERP data Traditional Sources New Sources RFID

19 Big SQL interface.... Rich SQL query capabilities SQL '92 and 2011 features Correlated subqueries Windowed aggregates Application SQL Language JDBC / ODBC Driver SQL access to all data stored in InfoSphere BigInsights JDBC / ODBC Server Robust JDBC/ODBC support SQL interface Engine Take advantage of key features of each data source Leverage MapReduce parallelism OR achieving low-latency Data Sources HiveTables HBase tables CSV Files InfoSphere BigInsights

20 The challenge: spreading data transformation and analytic components across multiple platforms can increase data latency, cost, complexity and governance risk Customer Interaction Data In Transactional Data DB2 for z/os IMS VSAM Non IBM Data Movement, Cleansing & Management Replicate, Integrate, Cleanse, Manage Data Warehousing Data Warehouse, Operational Data Store, Data Mart Data Analysis Business Intelligence, Predictive Analytics Business Insight Out zenterprise Off z platform

21 The Solution: DB2 Analytics Accelerator Hybrid Approach Traditional Approach to Analytic Systems Operational Applications Analytic Applications Combined Workloads Transaction Processing Data Store, Business Intelligence, Predictive Analytics Transactional Processing, Traditional Analytics & Business Critical Analytics Data transfer Shared Everything DB Latency? Security? Data Governance? Complexity? Shared Nothing DB Hybrid DB High volume business transactions and batch processing running concurrently Low volume complex queries and batch reporting Reduced Latency. Greater Security. Improved Data Governance. Reduced Complexity. High volume business transactions and batch reporting running concurrently with complex queries Delivering business critical analytics

22 Drivers for Enterprise Data Warehouse Augmentation Need to Leverage Variety of Data Impractical to Store all Data in the EDW Enterprise Data Warehouse Not Optimized Structured, semi-structured, unstructured, and streaming Low latency requirements (hours not weeks or months) Requires query access to data Improved Business Insights Cannot afford to store Big Data in the EDW Potential impact to normal OLAP EDW data volumes reaching Big Data levels A lot of low-touch, cold data Large portion of data in EDW not accessed frequently

23 New Architecture to Leverage All Data and Analytics Data in Motion Data at Rest Data in Many Forms Streams Information Ingestion and Operational Information Real-time Analytics Stream Processing Data Integration Master Data Video/Audio Network/Sensor Entity Analytics Predictive Landing Area, Analytics Zone and Archive Raw Data Structured Data Text Analytics Data Mining Entity Analytics Machine Learning Exploration, Integrated Warehouse, and Mart Zones Discovery Deep Reflection Operational Predictive Information Governance, Security and Business Continuity Intelligence Analysis Decision Management BI and Predictive Analytics Navigation and Discovery 23

24 Filter and Summarize Big Data for the Warehouse BigInsights can manage all enterprise data upon arrival Organizations can manipulate, analyze, and summarize incoming data BigInsights can be utilized as a source for a data warehouse Sift through large volumes of data Broaden analytic coverage without undue burden on systems Augment existing corporate data within warehouses Big data analytic applications BigInsights Traditional analytic tools Data warehouse Filter Transform Aggregate IBM IBM Corporation

25 BigInsights as a Query-ready Archive for a Data Warehouse Allow firms to manage the size of their existing data management platforms Use BigInsights as a query-ready archive With frequently accessed data maintained in the warehouse and cold or outdated information offloaded to BigInsights Better manage the size and usability of data within the enterprise Traditional analytic tools Big Data analytic applications BigInsights IBM IBM Corporation

26 A new architecture to leverage all Data has emerged. All Data Information ingestion and operational information zone Real-time analytics zone Exploration, landing and archive zone Enterprise warehouse data mart and analytic appliances zone Harness All Data & All Paradigms Information governance zone

27 The Big Data Ecosystem: Interoperability is Key Streaming Data Internet- Scale Data Sets Non-Traditional / Non-Relational Data Feeds Non-Traditional / Non-Relational Data Sources Traditional / Relational Data Sources Streams RTAP: Analytics on Data in Motion BigInsights Analytics on Data at Rest Data Explorer Platform Traditional Warehouse Traditional / Relational Data Sources Data Warehouse Analytics on Structured Data

28 Every Industry can Leverage Big Data and Analytics Banking Insurance Telco Energy & Utilities Media & Entertainment Optimizing Offers and Cross-sell Customer Service and Call Center Efficiency 360 View of Domain or Subject Catastrophe Modeling Fraud & Abuse Pro-active Call Center Network Analytics Location Based Services Smart Meter Analytics Distribution Load Forecasting/Scheduling Condition Based Maintenance Business process transformation Audience & Marketing Optimization Retail Travel & Transport Consumer Products Government Healthcare Actionable Customer Insight Merchandise Optimization Dynamic Pricing Customer Analytics & Loyalty Marketing Predictive Maintenance Analytics Shelf Availability Promotional Spend Optimization Merchandising Compliance Civilian Services Defense & Intelligence Tax & Treasury Services Measure & Act on Population Health Outcomes Engage Consumers in their Healthcare Automotive Chemical & Petroleum Aerospace & Defense Electronics Life Sciences Advanced Condition Monitoring Data Warehouse Optimization Operational Surveillance, Analysis & Optimization Data Warehouse Consolidation, Integration & Augmentation Uniform Information Access Platform Data Warehouse Optimization Customer/ Channel Analytics Advanced Condition Monitoring Increase visibility into drug safety and effectiveness

29 Resources Big Data community Harnessing the Power of Big Data ebook Authors: Paul Zikopoulos, Thomas Deutsch, Dirk deroos, Krishnan Parasuraman, David Corrigan, James Giles Big Data Big Agriculture with IBM DB2 for z/os Replay of John Deere webcast url TBA Future of farming videos -

30 Questions? 30

31 InfoSphere BigInsights v2.1 A Closer Look User Interfaces Integration More Than Hadoop Visualization Accelerators Text Analytics Dev Tools BigInsights Engine Admin Console Application Accelerators Databases Content Management Performance & workload optimizations Unique text analytic engines Spreadsheet-style visualization for data discovery & exploration Map Reduce + Indexing Built-in IDE & admin consoles Workload Mgmt Security Information Governance Enterprise-class security Apache Hadoop High-speed connectors to integration with other systems Analytical accelerators

32 IBM InfoSphere Streams v3.1 A platform for real-time analytics on BIG data Volume Terabytes per second Petabytes per day Variety All kinds of data All kinds of analytics Velocity Insights in microseconds Agility Dynamically responsive Rapid application development Millions of events per second Just-in-time decisions Powerful Analytics Sensor, video, audio, text, and relational data sources Microsecond Latency

33 Big Data in Real Time with InfoSphere Streams Filter / Sample Modify Annotate Analyze Fuse Classify Score Windowed Aggregates 33

34 PureData System for Analytics The Simple Appliance for Serious Analytics Built-in Expertise No indexes or tuning Data model agnostic Fully parallel, optimized In Database Analytics Integration by Design Server, Storage, Database in one easy to use package Automatic parallelization and resource optimization to scale economically Enterprise-class security and platform management Simplified Experience Up and running in hours Minimal ongoing administration Standard interfaces to best of breed Analytics, BI, and data integration tools Built-in analytics capabilities allow users to derive insight from data quickly Easy connectivity to other Big Data Platform components

35 IBM Netezza Analytics Ecosystem Tanay GPU Appliance by Fuzzy Logix IBM InfoSphere BigInsights Cloudera IBM InfoSphere Streams Software Development Kit User-Defined Extensions (UDF,UDA, UDTF,UDAP) Language Support (Map/Reduce, Java, Python, Lua, Perl, C, C++, Fortran, PMML) 3 rd rd Party In-Database Analytics Revolution Analytics R Fuzzy Logix SAS Zementis IBM SPSS Mathworks Netezza In-Database Analytics Transformations Mathematical Geospatial [Esri / nzspatial] Predictive Statistics Time Series Data Mining IBM SPSS SAS Revolution Analytics Eclipse BI Tools Esri Apache Hadoop PureData for Analytics AMPP Platform Visualization Tools

36 The proper foundation can optimize these new capabilities All Data IBM Watson Foundations New/Enhanced Applications Information ingestion and operational information zone Real-time analytics zone Exploration, landing and archive zone Enterprise warehouse data mart and analytic appliances zone What action should I take? Decision management What is happening? Discovery and exploration Cognitive Fabric Why did it happen? Reporting, analysis, content analytics Information governance zone What could happen? Predictive analytics and modeling Systems Security Storage On premise, Cloud, As a service IBM Big Data & Analytics Infrastructure

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN From Information to Insight: The Big Value of Big Data Faire Ann Co Marketing Manager, Information Management Software, ASEAN The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT

More information

Big Data Live selbst analysieren

Big Data Live selbst analysieren Big Data Live selbst analysieren Hands on Workshop zu IBM InfoSphere Big Insights Harald Gröger Wilfried Hoge Gerhard Wenzel IBM 2013 IBM Corporation Agenda 15:00-15:10 Einführung IBM Big Data Plattform

More information

Big Data at the Speed of Business IBM Innovations for a new era! Rob Thomas Vice President, Big Data Sales IBM Software Group, Information Management

Big Data at the Speed of Business IBM Innovations for a new era! Rob Thomas Vice President, Big Data Sales IBM Software Group, Information Management Big Data at the Speed of Business IBM Innovations for a new era! Rob Thomas Vice President, Big Data Sales IBM Software Group, Information Management Agenda for today 1 IBM s viewpoint on on big big data

More information

IBM Big Data Summit 2012

IBM Big Data Summit 2012 IBM Big Data Summit 2012 12.10.2012 InfoSphere BigInsights Introduction Wilfried Hoge Leading Technical Sales Professional hoge@de.ibm.com twitter.com/wilfriedhoge 12.10.1012 IBM Big Data Strategy: Move

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

ActualTests.C Q&A C Foundations of IBM Big Data & Analytics Architecture V1

ActualTests.C Q&A C Foundations of IBM Big Data & Analytics Architecture V1 ActualTests.C2030-136.40Q&A Number: C2030-136 Passing Score: 800 Time Limit: 120 min File Version: 4.8 http://www.gratisexam.com/ C2030-136 Foundations of IBM Big Data & Analytics Architecture V1 Hello,

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

E-guide Hadoop Big Data Platforms Buyer s Guide part 1 Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors

More information

Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM

Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM May, 2012 Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM 12+ TBs of tweet data every day 30 billion RFID tags today (1.3B in 2005)

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

Big Data The Big Story

Big Data The Big Story Big Data The Big Story Jean-Pierre Dijcks Big Data Product Mangement 1 Agenda What is Big Data? Architecting Big Data Building Big Data Solutions Oracle Big Data Appliance and Big Data Connectors Customer

More information

Operational Hadoop and the Lambda Architecture for Streaming Data

Operational Hadoop and the Lambda Architecture for Streaming Data Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda

More information

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11 Top 5 Challenges for Hadoop MapReduce in the Enterprise Whitepaper - May 2011 http://platform.com/mapreduce 2 5/9/11 Table of Contents Introduction... 2 Current Market Conditions and Drivers. Customer

More information

InfoSphere Warehousing 9.5

InfoSphere Warehousing 9.5 IBM Software Group Optimised InfoSphere Warehousing 9.5 Flexible Simple Phil Downey InfoSphere Warehouse Technical Marketing 2007 IBM Corporation Information On Demand End-to-End Capabilities Optimization

More information

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ORACLE DATA INTEGRATOR ENTERPRISE EDITION ORACLE DATA INTEGRATOR ENTERPRISE EDITION Oracle Data Integrator Enterprise Edition delivers high-performance data movement and transformation among enterprise platforms with its open and integrated E-LT

More information

Business Analytics and Optimization An IBM Growth Priority

Business Analytics and Optimization An IBM Growth Priority Business Analytics and Optimization An IBM Growth Priority Paul Fitzpatrick Director, WW Industry ISV Partners IBM ISV & Developer Relations Best Student Recognition Event July 6-8, 2011 EMEA IBM Innovation

More information

InfoSphere Warehouse. Flexible. Reliable. Simple. IBM Software Group

InfoSphere Warehouse. Flexible. Reliable. Simple. IBM Software Group IBM Software Group Flexible Reliable InfoSphere Warehouse Simple Ser Yean Tan Regional Technical Sales Manager Information Management Software IBM Software Group ASEAN 2007 IBM Corporation Business Intelligence

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution

More information

Real-time Streaming Insight & Time Series Data Analytic For Smart Retail

Real-time Streaming Insight & Time Series Data Analytic For Smart Retail Real-time Streaming Insight & Time Series Data Analytic For Smart Retail Sudip Majumder Senior Director Development Industry IoT & Big Data 10/5/2016 Economic Characteristics of Data Data is the New Oil..then

More information

Hadoop Integration Deep Dive

Hadoop Integration Deep Dive Hadoop Integration Deep Dive Piyush Chaudhary Spectrum Scale BD&A Architect 1 Agenda Analytics Market overview Spectrum Scale Analytics strategy Spectrum Scale Hadoop Integration A tale of two connectors

More information

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Russell Hull - SAP

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Russell Hull - SAP Analyze Big Data Faster and Store it Cheaper Dominick Huang CenterPoint Energy Russell Hull - SAP ABOUT CENTERPOINT ENERGY, INC. Publicly traded on New York Stock Exchange Headquartered in Houston, Texas

More information

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved. Apache Spark 2.0 GA The General Engine for Modern Analytic Use Cases 1 Apache Spark Drives Business Innovation Apache Spark is driving new business value that is being harnessed by technology forward organizations.

More information

Exploring the Benefits of the Modernized Data Warehouse Philip Russom

Exploring the Benefits of the Modernized Data Warehouse Philip Russom Exploring the Benefits of the Modernized Data Warehouse Philip Russom TDWI Research Director for Data Management October 8, 2014 Sponsor 2 Speakers Philip Russom TDWI Research Director, Data Management

More information

IBM Software IBM InfoSphere BigInsights

IBM Software IBM InfoSphere BigInsights IBM Software IBM InfoSphere BigInsights Enabling new, cost-effective solutions to turn complex information into business insight 2 IBM InfoSphere BigInsights Executive summary Companies are hyper-connected

More information

ETL on Hadoop What is Required

ETL on Hadoop What is Required ETL on Hadoop What is Required Keith Kohl Director, Product Management October 2012 Syncsort Copyright 2012, Syncsort Incorporated Agenda Who is Syncsort Extract, Transform, Load (ETL) Overview and conventional

More information

DLT AnalyticsStack. Powering big data, analytics and data science strategies for government agencies

DLT AnalyticsStack. Powering big data, analytics and data science strategies for government agencies DLT Stack Powering big data, analytics and data science strategies for government agencies Now, government agencies can have a scalable reference model for success with Big Data, Advanced and Data Science

More information

Evolution to Revolution: Big Data 2.0

Evolution to Revolution: Big Data 2.0 Evolution to Revolution: Big Data 2.0 An ENTERPRISE MANAGEMENT ASSOCIATES (EMA ) White Paper Prepared for Actian March 2014 IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Table of Contents

More information

HP SummerSchool TechTalks Kenneth Donau Presale Technical Consulting, HP SW

HP SummerSchool TechTalks Kenneth Donau Presale Technical Consulting, HP SW HP SummerSchool TechTalks 2013 Kenneth Donau Presale Technical Consulting, HP SW Copyright Copyright 2013 2013 Hewlett-Packard Development Development Company, Company, L.P. The L.P. information The information

More information

Big Data: Essential Elements to a Successful Modernization Strategy

Big Data: Essential Elements to a Successful Modernization Strategy Big Data: Essential Elements to a Successful Modernization Strategy Ashish Verma Director, Deloitte Consulting Technology Information Management Deloitte Consulting Presented by #pbls14 #pbls14 Presented

More information

SAS & HADOOP ANALYTICS ON BIG DATA

SAS & HADOOP ANALYTICS ON BIG DATA SAS & HADOOP ANALYTICS ON BIG DATA WHY HADOOP? OPEN SOURCE MASSIVE SCALE FAST PROCESSING COMMODITY COMPUTING DATA REDUNDANCY DISTRIBUTED WHY HADOOP? Hadoop will soon become a replacement complement to:

More information

Hortonworks Powering the Future of Data

Hortonworks Powering the Future of Data Hortonworks Powering the Future of Simon Gregory Vice President Eastern Europe, Middle East & Africa 1 Hortonworks Inc. 2011 2016. All Rights Reserved MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA

More information

Hybrid Data Management

Hybrid Data Management Kelly Schlamb Executive IT Specialist, Worldwide Analytics Platform Enablement and Technical Sales (kschlamb@ca.ibm.com, @KSchlamb) Hybrid Data Management IBM Analytics Summit 2017 November 8, 2017 5 Essential

More information

Modernizing Data Integration

Modernizing Data Integration Modernizing Data Integration To Accommodate New Big Data and New Business Requirements Philip Russom Research Director for Data Management, TDWI December 16, 2015 Sponsor Speakers Philip Russom TDWI Research

More information

Cloud Based Analytics for SAP

Cloud Based Analytics for SAP Cloud Based Analytics for SAP Gary Patterson, Global Lead for Big Data About Virtustream A Dell Technologies Business 2,300+ employees 20+ data centers Major operations in 10 countries One of the fastest

More information

ETL challenges on IOT projects. Pedro Martins Head of Implementation

ETL challenges on IOT projects. Pedro Martins Head of Implementation ETL challenges on IOT projects Pedro Martins Head of Implementation Outline What is Pentaho Pentaho Data Integration (PDI) Smartcity Copenhagen Example of Data structure without an OLAP schema Telematics

More information

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward Deloitte School of Analytics Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward February 2018 Agenda 7 February 2018 8 February 2018 9 February 2018 8:00 9:00 Networking

More information

Oracle Big Data Discovery The Visual Face of Big Data

Oracle Big Data Discovery The Visual Face of Big Data Oracle Big Data Discovery The Visual Face of Big Data Today's Big Data challenge is not how to store it, but how to make sense of it. Oracle Big Data Discovery is a fundamentally new approach to making

More information

Real-Time Streaming: IMS to Apache Kafka and Hadoop

Real-Time Streaming: IMS to Apache Kafka and Hadoop Real-Time Streaming: IMS to Apache Kafka and Hadoop - 2017 Scott Quillicy SQData Outline methods of streaming mainframe data to big data platforms Set throughput / latency expectations for popular big

More information

The Mainframe s Relevance in the Digital World

The Mainframe s Relevance in the Digital World The Mainframe s Relevance in the Digital World You Don t Have to Own IT to Control IT SM Executive Summary According to Robert Thompson of IBM, 68 percent of the world s production workloads run on mainframes,

More information

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services SAP Big Data Markus Tempel SAP Big Data and Cloud Analytics Services Is that Big Data? 2015 SAP AG or an SAP affiliate company. All rights reserved. 2 What if you could turn new signals from Big Data into

More information

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data?

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data? Glenn Anderson, IBM Lab Services and Training What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data? Winter SHARE March 2014 Session 15126 Today s mainframe is a hybrid system InfoSphere Streams

More information

Jason Virtue Business Intelligence Technical Professional

Jason Virtue Business Intelligence Technical Professional Jason Virtue Business Intelligence Technical Professional jvirtue@microsoft.com Agenda Microsoft Azure Data Services Azure Cloud Services Azure Machine Learning Azure Service Bus Azure Stream Analytics

More information

E-Guide THE EVOLUTION OF IOT ANALYTICS AND BIG DATA

E-Guide THE EVOLUTION OF IOT ANALYTICS AND BIG DATA E-Guide THE EVOLUTION OF IOT ANALYTICS AND BIG DATA E nterprises are already recognizing the value that lies in IoT data, but IoT analytics is still evolving and businesses have yet to see the full potential

More information

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

Cloud Integration and the Big Data Journey - Common Use-Case Patterns Cloud Integration and the Big Data Journey - Common Use-Case Patterns A White Paper August, 2014 Corporate Technologies Business Intelligence Group OVERVIEW The advent of cloud and hybrid architectures

More information

S/4 HANA Introduction & Roadmap. Dr. Bjoern Ganzhorn Enterprise Architect - SAP Americas Inc.

S/4 HANA Introduction & Roadmap. Dr. Bjoern Ganzhorn Enterprise Architect - SAP Americas Inc. S/4 HANA Introduction & Roadmap Dr. Bjoern Ganzhorn Enterprise Architect - SAP Americas Inc. Agenda SAP S/4 HANA Introduction & Roadmap Why S/4 HANA? Business and IT Challenge What is S/4 HANA? Solution

More information

The disruptive power of big data

The disruptive power of big data Business white paper How big data analytics is transforming business The disruptive power of big data Table of contents 3 Executive overview: The big data revolution 4 The big data imperative 4 Why big

More information

Architecture Overview for Data Analytics Deployments

Architecture Overview for Data Analytics Deployments Architecture Overview for Data Analytics Deployments Mahmoud Ghanem Sr. Systems Engineer GLOBAL SPONSORS Agenda The Big Picture Top Use Cases for Data Analytics Modern Architecture Concepts for Data Analytics

More information

Bringing Big Data to Life: Overcoming The Challenges of Legacy Data in Hadoop

Bringing Big Data to Life: Overcoming The Challenges of Legacy Data in Hadoop 0101 001001010110100 010101000101010110100 1000101010001000101011010 00101010001010110100100010101 0001001010010101001000101010001 010101101001000101010001001010010 010101101 000101010001010 1011010 0100010101000

More information

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2 Page 2 Page 3 Page 4 Page 5 Humanizing Analytics Analytic Solutions that Provide Powerful Insights about Today s Healthcare Consumer to Manage Risk and Enable Engagement and Activation Industry Alignment

More information

Simplifying Your Modern Data Architecture Footprint

Simplifying Your Modern Data Architecture Footprint MIKE COCHRANE VP Analytics & Information Management Simplifying Your Modern Data Architecture Footprint Or Ways to Accelerate Your Success While Maintaining Your Sanity June 2017 mycervello.com Businesses

More information

Architected Blended Big Data With Pentaho. A Solution Brief

Architected Blended Big Data With Pentaho. A Solution Brief Architected Blended Big Data With Pentaho A Solution Brief Introduction The value of big data is well recognized, with implementations across every size and type of business today. However, the most powerful

More information

Data Strategy: How to Handle the New Data Integration Challenges. Edgar de Groot

Data Strategy: How to Handle the New Data Integration Challenges. Edgar de Groot Data Strategy: How to Handle the New Data Integration Challenges Edgar de Groot New Business Models Lead to New Data Integration Challenges Organisations are generating insight Insight is capital 3 Retailers

More information

Hadoop and Analytics at CERN IT CERN IT-DB

Hadoop and Analytics at CERN IT CERN IT-DB Hadoop and Analytics at CERN IT CERN IT-DB 1 Hadoop Use cases Parallel processing of large amounts of data Perform analytics on a large scale Dealing with complex data: structured, semi-structured, unstructured

More information

Building Your Big Data Team

Building Your Big Data Team Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.

More information

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB Data Analytics and CERN IT Hadoop Service CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB 1 Data Analytics at Scale The Challenge When you cannot fit your workload in a desktop Data

More information

Brian Macdonald Big Data & Analytics Specialist - Oracle

Brian Macdonald Big Data & Analytics Specialist - Oracle Brian Macdonald Big Data & Analytics Specialist - Oracle Improving Predictive Model Development Time with R and Oracle Big Data Discovery brian.macdonald@oracle.com Copyright 2015, Oracle and/or its affiliates.

More information

Information Server 11.3 Overview. Kevin D Silva Client Technical Professional, InfoSphere Information Server

Information Server 11.3 Overview. Kevin D Silva Client Technical Professional, InfoSphere Information Server Information Server 11.3 Overview Kevin D Silva Client Technical Professional, InfoSphere Information Server Governance Concerns for Big Customers Integrate & Link Big Big as a Source Big as a Target Transformations

More information

The Evolution of Big Data

The Evolution of Big Data The Evolution of Big Data Andrew Fast, Ph.D. Chief Scientist fast@elderresearch.com Headquarters 300 W. Main Street, Suite 301 Charlottesville, VA 22903 434.973.7673 fax 434.973.7875 www.elderresearch.com

More information

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD

More information

Can Advanced Analytics Improve Manufacturing Quality?

Can Advanced Analytics Improve Manufacturing Quality? Can Advanced Analytics Improve Manufacturing Quality? Erica Pettigrew BA Practice Director (513) 662-6888 Ext. 210 Erica.Pettigrew@vertexcs.com Jeffrey Anderson Sr. Solution Strategist (513) 662-6888 Ext.

More information

How Data Science is Changing the Way Companies Do Business Colin White

How Data Science is Changing the Way Companies Do Business Colin White How Data Science is Changing the Way Companies Do Business Colin White BI Research July 17, 2014 Sponsor 2 Speakers Colin White President, BI Research Bill Franks Chief Analytics Officer, Teradata 3 How

More information

Your Big Data to Big Data tools using the family of PI Integrators

Your Big Data to Big Data tools using the family of PI Integrators 1 Your Big Data to Big Data tools using the family of PI Integrators Presented by Martin Bryant Field Service Engineer @osisoft PI Integrators PI Integrator for Business Analytics PI Integrator for Business

More information

Session 30 Powerful Ways to Use Hadoop in your Healthcare Big Data Strategy

Session 30 Powerful Ways to Use Hadoop in your Healthcare Big Data Strategy Session 30 Powerful Ways to Use Hadoop in your Healthcare Big Data Strategy Bryan Hinton Senior Vice President, Platform Engineering Health Catalyst Sean Stohl Senior Vice President, Product Development

More information

INTRODUCING BIRST INFOR S GO-FORWARD BUSINESS INTELLIGENCE SOLUTION

INTRODUCING BIRST INFOR S GO-FORWARD BUSINESS INTELLIGENCE SOLUTION INTRODUCING BIRST INFOR S GO-FORWARD BUSINESS INTELLIGENCE SOLUTION BIRST BI & ANALYTICS Connecting organizations through analytics Nick Cicero Regional Sales Director Rob Krause Sr. Solution consultant

More information

Sr. Sergio Rodríguez de Guzmán CTO PUE

Sr. Sergio Rodríguez de Guzmán CTO PUE PRODUCT LATEST NEWS Sr. Sergio Rodríguez de Guzmán CTO PUE www.pue.es Hadoop & Why Cloudera Sergio Rodríguez Systems Engineer sergio@pue.es 3 Industry-Leading Consulting and Training PUE is the first Spanish

More information

Big Business Value from Big Data and Hadoop

Big Business Value from Big Data and Hadoop Big Business Value from Big Data and Hadoop Page 1 Topics The Big Data Explosion: Hype or Reality Introduction to Apache Hadoop The Business Case for Big Data Hortonworks Overview & Product Demo Page 2

More information

Reduce Money Laundering Risks with Rapid, Predictive Insights

Reduce Money Laundering Risks with Rapid, Predictive Insights SOLUTION brief Digital Bank of the Future Financial Services Reduce Money Laundering Risks with Rapid, Predictive Insights Executive Summary Money laundering is the process by which the illegal origin

More information

Big Data Analytics for Retail with Apache Hadoop. A Hortonworks and Microsoft White Paper

Big Data Analytics for Retail with Apache Hadoop. A Hortonworks and Microsoft White Paper Big Data Analytics for Retail with Apache Hadoop A Hortonworks and Microsoft White Paper 2 Contents The Big Data Opportunity for Retail 3 The Data Deluge, and Other Barriers 4 Hadoop in Retail 5 Omni-Channel

More information

NEW VALUE FOR THE FUTURE

NEW VALUE FOR THE FUTURE NEW VALUE FOR THE FUTURE THROUGH DISRUPTION OF PARADIGMS SEAMLESS COMBINATION OF PEOPLE AND THINGS AND CO-INNOVATION Francesco Maselli Innovation and Solutions Director #CWIN17, Rome 28 September 2017

More information

Big Data & Hadoop Advance

Big Data & Hadoop Advance Course Durations: 30 Hours About Company: Course Mode: Online/Offline EduNextgen extended arm of Product Innovation Academy is a growing entity in education and career transformation, specializing in today

More information

Oracle Big Data Cloud Service

Oracle Big Data Cloud Service Oracle Big Data Cloud Service Delivering Hadoop, Spark and Data Science with Oracle Security and Cloud Simplicity Oracle Big Data Cloud Service is an automated service that provides a highpowered environment

More information

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics Key takeaways Analytic Insights Module for self-service analytics Automate data ingestion into Isilon Data Lake Three methods

More information

Analytics empowering clients to see farther & go faster

Analytics empowering clients to see farther & go faster Analytics empowering clients to see farther & go faster Vendor Agnostic Data & analytics focus with leading technology expertise Business Value Improve business processes via analytic solutions Partner

More information

More information for FREE VS ENTERPRISE LICENCE :

More information for FREE VS ENTERPRISE LICENCE : Source : http://www.splunk.com/ Splunk Enterprise is a fully featured, powerful platform for collecting, searching, monitoring and analyzing machine data. Splunk Enterprise is easy to deploy and use. It

More information

Big Data Anwendungsfälle aus dem Bereich der digitalen Medien

Big Data Anwendungsfälle aus dem Bereich der digitalen Medien Presented by Kate Tickner Date 12 th October 2012 Big Data Anwendungsfälle aus dem Bereich der digitalen Medien Using Big Data and Smarter Analytics to Increase Consumer Engagement Dramatic forces affecting

More information

Mike Strickland, Director, Data Center Solution Architect Intel Programmable Solutions Group July 2017

Mike Strickland, Director, Data Center Solution Architect Intel Programmable Solutions Group July 2017 Mike Strickland, Director, Data Center Solution Architect Intel Programmable Solutions Group July 2017 Accelerate Big Data Analytics with Intel Frameworks and Libraries with FPGA s 1. Intel Big Data Analytics

More information

IBM i2 Enterprise Insight Analysis

IBM i2 Enterprise Insight Analysis IBM i2 Enterprise Insight Analysis Accelerate the data-to-decision process by rapidly transforming overwhelming data into actionable insight Highlights Uncover hidden connections and insights across massive

More information

Microsoft Dynamics ERP. Success for your business. Success for you.

Microsoft Dynamics ERP. Success for your business. Success for you. Microsoft Dynamics ERP Success for your business. Success for you. Achieve success on your terms Today s organizations compete in an environment dramatically shaped by current economic conditions as well

More information

Enterprise Architecture for Digital Business

Enterprise Architecture for Digital Business Enterprise Architecture for Digital Business Dave Chappelle Enterprise Architect Global EA Program October 26, 2015. Safe Harbor Statement The following is intended to outline our general product direction.

More information

An Effective Convergence of Analytics and Geography

An Effective Convergence of Analytics and Geography An Effective Convergence of Analytics and Geography Gain Competitive Advantage Using Smarter Analytics Tony Boobier BEng CEng FICE FCILA FCIM MICPS Insurance Leader IBM Business Analytics EMEA Agenda 1

More information

IBM Corporation

IBM Corporation 1 Daniel Sánchez Fernández Big Client Technical Professional 16 de Junio de 2014 Big Arquitectura de Referencia Cambios de paradigma habilitados por Big data y Analytics BIG DATA TRADITIONAL & ANALYTICS

More information

Transforming IIoT Data into Opportunity with Data Torrent using Apache Apex

Transforming IIoT Data into Opportunity with Data Torrent using Apache Apex CASE STUDY Transforming IIoT Data into Opportunity with Data Torrent using Apache Apex DataTorrent delivers better business outcomes for customers using industrial of things (IIoT) data Challenge The industrial

More information

Transforming Big Data to Business Benefits

Transforming Big Data to Business Benefits Transforming Big Data to Business Benefits Automagical EDW to Big Data Migration BI at the Speed of Thought Stream Processing + Machine Learning Platform Table of Contents Introduction... 3 Case Study:

More information

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Leveraging Oracle Big Data Discovery to Master CERN s Data Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Manuel Martin Marquez Intel IoT Ignition Lab Cloud and

More information

THE DATA WAREHOUSE EVOLVED: A FOUNDATION FOR ANALYTICAL EXCELLENCE

THE DATA WAREHOUSE EVOLVED: A FOUNDATION FOR ANALYTICAL EXCELLENCE THE DATA WAREHOUSE EVOLVED: A FOUNDATION FOR ANALYTICAL EXCELLENCE May 2017 Author: Michael Lock Vice President & Principal Analyst, Analytics & Business Intelligence Report Highlights p2 p3 p6 p8 More

More information

In-Memory Analytics: Get Faster, Better Insights from Big Data

In-Memory Analytics: Get Faster, Better Insights from Big Data Discussion Summary In-Memory Analytics: Get Faster, Better Insights from Big Data January 2015 Interview Featuring: Tapan Patel, SAS Institute, Inc. Introduction A successful analytics program should translate

More information

White Paper. Five industries where big data is making a difference

White Paper. Five industries where big data is making a difference Five industries where big data is making a difference To understand how Big Data can transform businesses, we have to understand its nature. Although there are numerous definitions of Big Data, many will

More information

2018 Big Data Trends: Liberate, Integrate & Trust

2018 Big Data Trends: Liberate, Integrate & Trust 2018 Big Data Trends: Liberate, Integrate & Trust Executive Summary Syncsort conducted its fourth annual survey of IT professionals working with Big Data to get a real-world perspective on the opportunities

More information

SAS ANALYTICS AND OPEN SOURCE

SAS ANALYTICS AND OPEN SOURCE GUIDEBOOK SAS ANALYTICS AND OPEN SOURCE April 2014 2014 Nucleus Research, Inc. Reproduction in whole or in part without written permission is prohibited. THE BOTTOM LINE Many organizations balance open

More information

BullSequana S series. Powering Enterprise Artificial Intelligence

BullSequana S series. Powering Enterprise Artificial Intelligence BullSequana S series Powering Enterprise Artificial Intelligence Every enterprise faces digital transformation Customer contact is increasingly managed by intelligent automated routines. The Internet-of-Things

More information

Cloud Object Storage And The Use Of Gateways

Cloud Object Storage And The Use Of Gateways Cloud Object Storage And The Use Of Gateways Live Webcast September 26, 2017 10:00 am PT SNIA Legal Notice The material contained in this presentation is copyrighted by the SNIA unless otherwise noted.

More information

THE MAGIC OF DATA INTEGRATION IN THE ENTERPRISE WITH TIPS AND TRICKS

THE MAGIC OF DATA INTEGRATION IN THE ENTERPRISE WITH TIPS AND TRICKS THE MAGIC OF DATA INTEGRATION IN THE ENTERPRISE WITH TIPS AND TRICKS DATA HOLDS ALL THE POTENTIAL TO HELP BUSINESSES WIN CUSTOMERS INCREASE REVENUE GAIN COMPETITIVE ADVANTAGE STREAMLINE OPERATIONS BUT

More information

Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services

Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services Matthias Kupczak, Michael Probst; SAP June, 2017 Agenda 09:30-09:55 Coffee 09:55-10:00 Welcome Message T-Systems

More information

White Paper: SAS and Apache Hadoop For Government. Inside: Unlocking Higher Value From Business Analytics to Further the Mission

White Paper: SAS and Apache Hadoop For Government. Inside: Unlocking Higher Value From Business Analytics to Further the Mission White Paper: SAS and Apache Hadoop For Government Unlocking Higher Value From Business Analytics to Further the Mission Inside: Using SAS and Hadoop Together Design Considerations for Your SAS and Hadoop

More information

WHITE PAPER SPLUNK SOFTWARE AS A SIEM

WHITE PAPER SPLUNK SOFTWARE AS A SIEM SPLUNK SOFTWARE AS A SIEM Improve your security posture by using Splunk as your SIEM HIGHLIGHTS Splunk software can be used to build and operate security operations centers (SOC) of any size (large, med,

More information

1. Intoduction to Hadoop

1. Intoduction to Hadoop 1. Intoduction to Hadoop Hadoop is a rapidly evolving ecosystem of components for implementing the Google MapReduce algorithms in a scalable fashion on commodity hardware. Hadoop enables users to store

More information

Microsoft BI Product Suite

Microsoft BI Product Suite Microsoft BI Product Suite On Premises and In the Cloud What is Business Intelligence? How is the BI industry evolving? What are the typical components of a BI solution? How can BI be deployed within your

More information

DELL EMC HADOOP SOLUTIONS

DELL EMC HADOOP SOLUTIONS Big Data and Analytics DELL EMC HADOOP SOLUTIONS Helping Organizations Capitalize on the Digital Transformation The digital transformation: a disruptive opportunity Across virtually all industries, the

More information

Internet of Things. Point of View. Turn your data into accessible, actionable insights for maximum business value.

Internet of Things. Point of View. Turn your data into accessible, actionable insights for maximum business value. Point of View Internet of Things Turn your data into accessible, actionable insights for maximum business value Executive Summary Use a connected ecosystem to create new levels of business value The Internet

More information

Big Data Trends to Watch

Big Data Trends to Watch Big Data Trends to Watch Bill Peterson NetApp September, 2012 1 Bill Peterson @thebillp What I hope to accomplish today... ...and avoid this. What is Big Data? Big Data refers to datasets whose volume,

More information