Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer

Size: px
Start display at page:

Download "Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer"

Transcription

1 Redefine Big Data: EMC Data Lake in Action Andrea Prosperi Systems Engineer 1

2 Agenda Data Analytics Today Big data Hadoop & HDFS Different types of analytics Data lakes EMC Solutions for Data Lakes 2

3 The world before big data Data warehousing. Research and the definition of dimensions and facts started in the 1960 s. Things really got going in the 1980s. 3

4 So what changed? Big data rocked up to the party. 4

5 Traditional solutions struggled Too much data No Real Time analysis No Data Exploration More expensive hardware to go faster and deeper Overnight batch not good enough Not just structured data in a star schema 5

6 Thankfully we had Google Cue Doug Cutting s son and his elephant, Hadoop Computation Tier uses a framework called MapReduce Storage is provided via a distributed filesystem called HDFS Hadoop runs on commodity hardware 6

7 Competitive Advantage All analytics aren t equal Descriptive, Predictive and Prescriptive. There is also Diagnostic. How can we achieve the best outcome including the effects of variability? How can we achieve the best outcome? What will happen next if? What if these trends continue? What could happen? What actions are needed? What exactly is the problem? How many, how often, where? What happened? Prescriptive Predictive Descriptive Degree of Complexity Source: Based on "Competing on Analytics," Davenport and Harris 7

8 Descriptive Analytics Prescriptive Analytics Predictive Analytics 8

9 Data lakes Today, think of it in terms of co-existence with Enterprise DWH. Both environments are valid. Semi-structured & Unstructured Data Hadoop Based Data Lake Client/Portal Devices Analyze & Report Structured Data Data Transformation ETL/ELT Enterprise DWH Analyze & Report Client/Portal Device CRM ERP OLTP DB Data Security, Backup 9

10 What is a Data Lake? If you think of a datamart as a store of bottled water cleansed and packaged and structured for easy consumption the data lake is a large body of water in a more natural state. *James Dixon, coiner of Data Lake term 10

11 Pragmatic approach to Data Lake Identify Domain Be Pragmatic/Start Small Build Lake infrastructure Fill Lake Build Fishing Poles, exploration, extract value, then expand 11

12 Data Lake Interaction 3 Main Levels of interaction: Real Time: for fast analysis and correlation Interactive: for transactional processing Batch: for large dataset analysis 12

13 Lake Infrastructure EMC Solutions for Data Lake Infrastructure VIPR Controller EMC Big Data Storage DSSD ISILON VNX REAL-TIME INTERACTIVE VIPR Services Commodity ECS BATCH 13

14 Build Lake Infrastructure Use General Purpose Arrays/Commodity Disks As Data Lake Store ViPR Data Services 3 rd Party VNX Commodity Be Fast Reuse your current infrastructure to build an HDFS repository Reduce risk Reduce CAPEX investment required to perform analytics Maintain data protection, compliance at array level Reduce cost and complexity of dedicated clusters Reduce need for new vendor nodes and storage capacity 14

15 Build Lake Infrastructure Object, File And HDFS Operations On The Same Data Object Object & HDFS HDFS VIRTUAL ARRAY ViPR Object & ViPR HDFS access on the same data S3, Swift, Atmos API via the Object head File protocols in development Use your preferred Hadoop distribution Commodity 15

16 Build Lake Infrastructure Use Specialized Arrays As Data Lake Store ECS Appliance Hyper-scale: ECS supports unlimited applications and users on a single, scale- out architecture start at 360 TB and scale to multiple petabytes or even exabytes 3 rd platform applications Pre-Engineered and Pre-Built Commodity Hardware Structured and Unstructured Content 16

17 Build Lake Infrastructure Use Specialized Arrays As Data Lake Store Accelerate the benefits of Hadoop for the enterprise Proven Hadoop solution, faster implementation Greater interoperability with enterprise applications and Hadoop analytics through multi-protocol parallel access from any client Enterprise data protection Fast snapshots, backup, and recovery Simple, reliable data replication for disaster recovery Ultimate flexibility Scale compute and storage resources separately Supports physical and virtualized server environments 17

18 Lake Software EMC/Pivotal Solutions for Data Lake Software REAL-TIME INTERACTIVE Greenplum DB GemFire XD HAWQ REAL-TIME INTERACTIVE BATCH Unlimited Pivotal HD BATCH 18

19 Pivotal HD Architecture - Apache Resource Management & Workflow Yarn Zookeeper HBas e HDFS Pig, Hive, Mahout Map Reduce Sqoop Flume Apache 19

20 HAWQ - Full ANSI SQL Engine on Hadoop HAWQ Advanced Database Services Resource Managemen t & Workflow Yarn HBas e Xtension Framework ANSI SQL + Analytics MADlib Algorithms Catalog Services Dynamic Pipelining Query Optimizer Spring Pig, Hive, Mahout Map Reduce Comman d Center Configure, Deploy, Monitor, Zookeeper Hadoop Virtualization Extension HDFS Unified Storage Service Manage Sqoop Data Loader Flume Apache Pivotal 20

21 GemFire - Real-Time Data Service HAWQ Advanced Database Services GemFire XD Real-Time Database Services Resource Managemen t & Workflow Yarn HBas e Xtension Framework ANSI SQL + Analytics MADlib Algorithms Catalog Services Dynamic Pipelining Query Optimizer Distrubuted In-memory Store ANSI SQL + In-Memory Query Transactions Ingestion Processing Hadoop Driver Parallel with Compaction Spring Pig, Hive, Mahout Map Reduce Comman d Center Configure, Deploy, Monitor, Zookeeper Hadoop Virtualization Extension HDFS Unified Storage Service Manage Sqoop Data Loader Flume Apache Pivotal 21

22 A Reference Architecture Standardized, on-demand services are layered around shared data repositories & processing capabilities to form the data lake. Ingest and data capture Scheduled, Batch data ingest to capture bulk data sources. Micro-batch ingest capturing small quantities of data. Low-latency and real-time ingest of data. Real-time routing of data to complex event processing and persistent storage. Data Sources Existing structured data. Unstructured or semistructured data sources Machine generated data such as logs and sensor data. External data sources. Applications and integration CloudFoundry on vsphere. Build interactive, data-driven applications using modern frameworks and approaches. Data Analytics In-memory performance (GemFire) MPP Processing (Pivotal HD) High performance SQL access to HDFS data (HAWQ). Shared storage and re-use Isilon and ViPR provide shared access to new and existing data sources through HDFS. Minimize data copies. Smart De-dupe for Hadoop. Kerberos Authentication. 22

23 What about services? + Data Science Data Engineering 23

24

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS)

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how Dell EMC Elastic Cloud Storage (ECS ) can be used to streamline

More information

Emerging Business Applications of High Performance Analytics

Emerging Business Applications of High Performance Analytics Emerging Business Applications of High Performance Analytics August 2014 Tan Yaw, Sr. Data Scientist 1 Table of Contents Introduction Data Lake Analytics Labs 2 Pivotal At-a-Glance New Independent Venture:

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

E-guide Hadoop Big Data Platforms Buyer s Guide part 1 Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors

More information

Architecture Overview for Data Analytics Deployments

Architecture Overview for Data Analytics Deployments Architecture Overview for Data Analytics Deployments Mahmoud Ghanem Sr. Systems Engineer GLOBAL SPONSORS Agenda The Big Picture Top Use Cases for Data Analytics Modern Architecture Concepts for Data Analytics

More information

MapR: Solution for Customer Production Success

MapR: Solution for Customer Production Success 2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice

More information

5th Annual. Cloudera, Inc. All rights reserved.

5th Annual. Cloudera, Inc. All rights reserved. 5th Annual 1 The Essentials of Apache Hadoop The What, Why and How to Meet Agency Objectives Sarah Sproehnle, Vice President, Customer Success 2 Introduction 3 What is Apache Hadoop? Hadoop is a software

More information

EMC IT Big Data Analytics Journey. Mahmoud Ghanem Sr. Systems Engineer

EMC IT Big Data Analytics Journey. Mahmoud Ghanem Sr. Systems Engineer EMC IT Big Data Analytics Journey Mahmoud Ghanem Sr. Systems Engineer Agenda 1 2 3 4 5 Introduction To Big Data EMC IT Big Data Journey Marketing Science Lab Use Case Technical Benefits Lessons Learned

More information

SAS and Hadoop Technology: Overview

SAS and Hadoop Technology: Overview SAS and Hadoop Technology: Overview SAS Documentation September 19, 2017 The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2015. SAS and Hadoop Technology: Overview.

More information

SAS & HADOOP ANALYTICS ON BIG DATA

SAS & HADOOP ANALYTICS ON BIG DATA SAS & HADOOP ANALYTICS ON BIG DATA WHY HADOOP? OPEN SOURCE MASSIVE SCALE FAST PROCESSING COMMODITY COMPUTING DATA REDUNDANCY DISTRIBUTED WHY HADOOP? Hadoop will soon become a replacement complement to:

More information

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Data Analytics Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Last 15 years IT-centric Traditional Analytics Traditional Applications Rigid Infrastructure Internet Next

More information

SAP HANA MADE SIMPLE WITH VALIDATED SOLUTIONS & CONVERGED SYSTEMS. Joakim Zetterblad, Director SAP Practice, EMEA

SAP HANA MADE SIMPLE WITH VALIDATED SOLUTIONS & CONVERGED SYSTEMS. Joakim Zetterblad, Director SAP Practice, EMEA SAP HANA MADE SIMPLE WITH VALIDATED SOLUTIONS & CONVERGED SYSTEMS Joakim Zetterblad, Director SAP Practice, EMEA The NEW SAP fromthings IoT Applications IoT Analytics Connected Devices SAP HANA Cloud Platform

More information

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution

More information

Hadoop Integration Deep Dive

Hadoop Integration Deep Dive Hadoop Integration Deep Dive Piyush Chaudhary Spectrum Scale BD&A Architect 1 Agenda Analytics Market overview Spectrum Scale Analytics strategy Spectrum Scale Hadoop Integration A tale of two connectors

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

Cloud Based Analytics for SAP

Cloud Based Analytics for SAP Cloud Based Analytics for SAP Gary Patterson, Global Lead for Big Data About Virtustream A Dell Technologies Business 2,300+ employees 20+ data centers Major operations in 10 countries One of the fastest

More information

1. Intoduction to Hadoop

1. Intoduction to Hadoop 1. Intoduction to Hadoop Hadoop is a rapidly evolving ecosystem of components for implementing the Google MapReduce algorithms in a scalable fashion on commodity hardware. Hadoop enables users to store

More information

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW TOPICS COVERED 1 2 Fundamentals of Big Data Platforms Major Big Data Tools Scaling Up vs. Out SCALE UP (SMP) SCALE OUT (MPP) + (n) Upgrade

More information

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved. Apache Spark 2.0 GA The General Engine for Modern Analytic Use Cases 1 Apache Spark Drives Business Innovation Apache Spark is driving new business value that is being harnessed by technology forward organizations.

More information

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia MapR: Converged Data Pla3orm and Quick Start Solu;ons Robin Fong Regional Director South East Asia Who is MapR? MapR is the creator of the top ranked Hadoop NoSQL SQL-on-Hadoop Real Database time streaming

More information

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB Data Analytics and CERN IT Hadoop Service CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB 1 Data Analytics at Scale The Challenge When you cannot fit your workload in a desktop Data

More information

Data: Foundation Of Digital Transformation

Data: Foundation Of Digital Transformation Data: Foundation Of Digital Transformation DellEMC Forum Madrid - 2017 Dave Kloc - Head of Data Sales EMEA Franck Sidi - Head of Pivotal Data Engineering EMEA Copyright 2017 Pivotal Software, Inc. All

More information

Adobe Deploys Hadoop as a Service on VMware vsphere

Adobe Deploys Hadoop as a Service on VMware vsphere Adobe Deploys Hadoop as a Service A TECHNICAL CASE STUDY APRIL 2015 Table of Contents A Technical Case Study.... 3 Background... 3 Why Virtualize Hadoop on vsphere?.... 3 The Adobe Marketing Cloud and

More information

20775: Performing Data Engineering on Microsoft HD Insight

20775: Performing Data Engineering on Microsoft HD Insight Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect 2005 Concert de Coldplay 2014 Concert de Coldplay 90% of the world s data has been created over the last two years alone 1 1. Source

More information

Business is being transformed by three trends

Business is being transformed by three trends Business is being transformed by three trends Big Cloud Intelligence Stay ahead of the curve with Cortana Intelligence Suite Business apps People Custom apps Apps Sensors and devices Cortana Intelligence

More information

Operational Hadoop and the Lambda Architecture for Streaming Data

Operational Hadoop and the Lambda Architecture for Streaming Data Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda

More information

Welcome to. enterprise-class big data and financial a. Putting big data and advanced analytics to work in financial services.

Welcome to. enterprise-class big data and financial a. Putting big data and advanced analytics to work in financial services. Welcome to enterprise-class big data and financial a Putting big data and advanced analytics to work in financial services. MapR-FSI Martin Darling We reinvented the data platform for next-gen intelligent

More information

A NEW PLATFORM FOR A NEW ERA. Russell Acton, VP &GM EMEA,

A NEW PLATFORM FOR A NEW ERA. Russell Acton, VP &GM EMEA, A NEW PLATFORM FOR A NEW ERA Russell Acton, VP &GM EMEA, Pivotal racton@pivotal.io @russellacton Three Examples of Data Driven Companies 2 Don t we do all these today? Retail CRM Customer Scoring Store

More information

Hadoop and Analytics at CERN IT CERN IT-DB

Hadoop and Analytics at CERN IT CERN IT-DB Hadoop and Analytics at CERN IT CERN IT-DB 1 Hadoop Use cases Parallel processing of large amounts of data Perform analytics on a large scale Dealing with complex data: structured, semi-structured, unstructured

More information

HP SummerSchool TechTalks Kenneth Donau Presale Technical Consulting, HP SW

HP SummerSchool TechTalks Kenneth Donau Presale Technical Consulting, HP SW HP SummerSchool TechTalks 2013 Kenneth Donau Presale Technical Consulting, HP SW Copyright Copyright 2013 2013 Hewlett-Packard Development Development Company, Company, L.P. The L.P. information The information

More information

Insights to HDInsight

Insights to HDInsight Insights to HDInsight Why Hadoop in the Cloud? No hardware costs Unlimited Scale Pay for What You Need Deployed in minutes Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

Cask Data Application Platform (CDAP)

Cask Data Application Platform (CDAP) Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop

More information

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved.

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. 1 SAP Big Data Webinar Series Big Data - Introduction to SAP Big Data Technologies Big Data - Streaming Analytics Big Data - Smarter

More information

Big and Fast Data: The Path To New Business Value

Big and Fast Data: The Path To New Business Value Big and Fast Data: The Path To New Business Value A Pivotal Overview Umair Riaz vspecialist 2 Gain Business Value with Big and Fast Data Pivotal Provides Agile Platform for Data-Driven Applications Ingest

More information

MapR Pentaho Business Solutions

MapR Pentaho Business Solutions MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business

More information

COPYRIGHTED MATERIAL. 1Big Data and the Hadoop Ecosystem

COPYRIGHTED MATERIAL. 1Big Data and the Hadoop Ecosystem 1Big Data and the Hadoop Ecosystem WHAT S IN THIS CHAPTER? Understanding the challenges of Big Data Getting to know the Hadoop ecosystem Getting familiar with Hadoop distributions Using Hadoop-based enterprise

More information

Common Customer Use Cases in FSI

Common Customer Use Cases in FSI Common Customer Use Cases in FSI 1 Marketing Optimization 2014 2014 MapR MapR Technologies Technologies 2 Fortune 100 Financial Services Company 104M CARD MEMBERS 3 Financial Services: Recommendation Engine

More information

ETL challenges on IOT projects. Pedro Martins Head of Implementation

ETL challenges on IOT projects. Pedro Martins Head of Implementation ETL challenges on IOT projects Pedro Martins Head of Implementation Outline What is Pentaho Pentaho Data Integration (PDI) Smartcity Copenhagen Example of Data structure without an OLAP schema Telematics

More information

GET MORE VALUE OUT OF BIG DATA

GET MORE VALUE OUT OF BIG DATA GET MORE VALUE OUT OF BIG DATA Enterprise data is increasing at an alarming rate. An International Data Corporation (IDC) study estimates that data is growing at 50 percent a year and will grow by 50 times

More information

Big Data. By Michael Covert. April 2012

Big Data. By Michael Covert. April 2012 Big By Michael Covert April 2012 April 18, 2012 Proprietary and Confidential 2 What is Big why are we discussing it? A brief history of High Performance Computing Parallel processing Algorithms The No

More information

Why Big Data Matters? Speaker: Paras Doshi

Why Big Data Matters? Speaker: Paras Doshi Why Big Data Matters? Speaker: Paras Doshi If you re wondering about what is Big Data and why does it matter to you and your organization, then come to this talk and get introduced to Big Data and learn

More information

Datametica DAMA. The Modern Data Platform Enterprise Data Hub Implementations. What is happening with Hadoop Why is workload moving to Cloud

Datametica DAMA. The Modern Data Platform Enterprise Data Hub Implementations. What is happening with Hadoop Why is workload moving to Cloud DAMA Datametica The Modern Data Platform Enterprise Data Hub Implementations What is happening with Hadoop Why is workload moving to Cloud 1 The Modern Data Platform The Enterprise Data Hub What do we

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

The Alpine Data Platform

The Alpine Data Platform The Alpine Data Platform TABLE OF CONTENTS ABOUT ALPINE.... 2 ALPINE PRODUCT OVERVIEW... 3 PRODUCT ARCHITECTURE.... 5 SYSTEM REQUIREMENTS.... 6 ABOUT ALPINE DATA ADVANCED ANALYTICS FOR THE ENTERPRISE Alpine

More information

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme VIRT1400BU Real-World Customer Architecture for Big Data on VMware vsphere Joe Bruneau, General Mills Justin Murray, Technical Marketing, VMware #VMworld #VIRT1400BU Disclaimer This presentation may contain

More information

Nouvelle Génération de l infrastructure Data Warehouse et d Analyses

Nouvelle Génération de l infrastructure Data Warehouse et d Analyses Nouvelle Génération de l infrastructure Data Warehouse et d Analyses November 2011 André Münger andre.muenger@emc.com +41 79 708 85 99 1 Agenda BIG Data Challenges Greenplum Overview Use Cases Summary

More information

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Russell Hull - SAP

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Russell Hull - SAP Analyze Big Data Faster and Store it Cheaper Dominick Huang CenterPoint Energy Russell Hull - SAP ABOUT CENTERPOINT ENERGY, INC. Publicly traded on New York Stock Exchange Headquartered in Houston, Texas

More information

巨量資料商機如何現代化您的產品及服務, 創造客戶最大的價值

巨量資料商機如何現代化您的產品及服務, 創造客戶最大的價值 巨量資料商機如何現代化您的產品及服務, 創造客戶最大的價值 EMC TAIWAN 客戶服務協理徐再盈 BIG DATA, BIG DEAL MODERNIZE PRODUCT AND SERVICE EXPERIENCES TO UNLOCK COSTUMER VALUE 1 Customer experience matters in today s digital world By 2020,

More information

The Intersection of Big Data and DB2

The Intersection of Big Data and DB2 The Intersection of Big Data and DB2 May 20, 2014 Mike McCarthy, IBM Big Data Channels Development mmccart1@us.ibm.com Agenda What is Big Data? Concepts Characteristics What is Hadoop Relational vs Hadoop

More information

Digging into Hadoop-based Big Data Architectures

Digging into Hadoop-based Big Data Architectures 52 Digging into Hadoop-based Big Data Architectures Allae Erraissi 1, Abdessamad Belangour 2 and Abderrahim Tragha 3 1,2,3 Laboratory of Information Technology and Modeling LTIM, Hassan II University,

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

Sr. Sergio Rodríguez de Guzmán CTO PUE

Sr. Sergio Rodríguez de Guzmán CTO PUE PRODUCT LATEST NEWS Sr. Sergio Rodríguez de Guzmán CTO PUE www.pue.es Hadoop & Why Cloudera Sergio Rodríguez Systems Engineer sergio@pue.es 3 Industry-Leading Consulting and Training PUE is the first Spanish

More information

LEVERAGING DATA ANALYTICS TO GAIN COMPETITIVE ADVANTAGE IN YOUR INDUSTRY

LEVERAGING DATA ANALYTICS TO GAIN COMPETITIVE ADVANTAGE IN YOUR INDUSTRY LEVERAGING DATA ANALYTICS TO GAIN COMPETITIVE ADVANTAGE IN YOUR INDUSTRY Unlock the value of your data with analytics solutions from Dell EMC ABSTRACT To unlock the value of their data, organizations around

More information

Building Your Big Data Team

Building Your Big Data Team Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.

More information

Dell EMC IT Big Data Analytics Journey. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC

Dell EMC IT Big Data Analytics Journey. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Dell EMC IT Big Data Analytics Journey Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Agenda 1 2 3 4 5 6 Dell EMC IT Big Data Journey Building the Data Lake Marketing Science

More information

Azure ML Data Camp. Ivan Kosyakov MTC Architect, Ph.D. Microsoft Technology Centers Microsoft Technology Centers. Experience the Microsoft Cloud

Azure ML Data Camp. Ivan Kosyakov MTC Architect, Ph.D. Microsoft Technology Centers Microsoft Technology Centers. Experience the Microsoft Cloud Microsoft Technology Centers Microsoft Technology Centers Experience the Microsoft Cloud Experience the Microsoft Cloud ML Data Camp Ivan Kosyakov MTC Architect, Ph.D. Top Manager IT Analyst Big Data Strategic

More information

Ray M Sugiarto MAPR Champion Indonesia

Ray M Sugiarto MAPR Champion Indonesia Ray M Sugiarto MAPR Champion Indonesia 0815 167 2882 2015 MapR Technologies 2015 MapR Technologies 1 Why Big Data? University of Texas: The median Fortune 1000 company could increase its revenue by more

More information

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics Key takeaways Analytic Insights Module for self-service analytics Automate data ingestion into Isilon Data Lake Three methods

More information

Big Data The Big Story

Big Data The Big Story Big Data The Big Story Jean-Pierre Dijcks Big Data Product Mangement 1 Agenda What is Big Data? Architecting Big Data Building Big Data Solutions Oracle Big Data Appliance and Big Data Connectors Customer

More information

Building a Data Lake on AWS

Building a Data Lake on AWS Partner Network EBOOK: Building a Data Lake on AWS Contents What is a Data Lake? Benefits of a Data Lake on AWS Building a Data Lake On AWS Featured Data Lake Partner Bronze Drum Consulting Case Study:Rosetta

More information

DELL EMC HADOOP SOLUTIONS

DELL EMC HADOOP SOLUTIONS Big Data and Analytics DELL EMC HADOOP SOLUTIONS Helping Organizations Capitalize on the Digital Transformation The digital transformation: a disruptive opportunity Across virtually all industries, the

More information

Evolution to Revolution: Big Data 2.0

Evolution to Revolution: Big Data 2.0 Evolution to Revolution: Big Data 2.0 An ENTERPRISE MANAGEMENT ASSOCIATES (EMA ) White Paper Prepared for Actian March 2014 IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Table of Contents

More information

ETL on Hadoop What is Required

ETL on Hadoop What is Required ETL on Hadoop What is Required Keith Kohl Director, Product Management October 2012 Syncsort Copyright 2012, Syncsort Incorporated Agenda Who is Syncsort Extract, Transform, Load (ETL) Overview and conventional

More information

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ORACLE DATA INTEGRATOR ENTERPRISE EDITION ORACLE DATA INTEGRATOR ENTERPRISE EDITION Oracle Data Integrator Enterprise Edition delivers high-performance data movement and transformation among enterprise platforms with its open and integrated E-LT

More information

Session 30 Powerful Ways to Use Hadoop in your Healthcare Big Data Strategy

Session 30 Powerful Ways to Use Hadoop in your Healthcare Big Data Strategy Session 30 Powerful Ways to Use Hadoop in your Healthcare Big Data Strategy Bryan Hinton Senior Vice President, Platform Engineering Health Catalyst Sean Stohl Senior Vice President, Product Development

More information

Big Data & Hadoop Advance

Big Data & Hadoop Advance Course Durations: 30 Hours About Company: Course Mode: Online/Offline EduNextgen extended arm of Product Innovation Academy is a growing entity in education and career transformation, specializing in today

More information

Berkeley Data Analytics Stack (BDAS) Overview

Berkeley Data Analytics Stack (BDAS) Overview Berkeley Analytics Stack (BDAS) Overview Ion Stoica UC Berkeley UC BERKELEY What is Big used For? Reports, e.g., - Track business processes, transactions Diagnosis, e.g., - Why is user engagement dropping?

More information

Building a Data Lake with Spark and Cassandra Brendon Smith & Mayur Ladwa

Building a Data Lake with Spark and Cassandra Brendon Smith & Mayur Ladwa Building a Data Lake with Spark and Cassandra Brendon Smith & Mayur Ladwa July 2015 BlackRock: Who We Are BLK data as of 31 st March 2015 is the world s largest investment manager Manages over $4.7 trillion

More information

Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications

Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications Copyright 2015 Cask Data, Inc. All Rights Reserved. February

More information

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11 Top 5 Challenges for Hadoop MapReduce in the Enterprise Whitepaper - May 2011 http://platform.com/mapreduce 2 5/9/11 Table of Contents Introduction... 2 Current Market Conditions and Drivers. Customer

More information

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information

Big Business Value from Big Data and Hadoop

Big Business Value from Big Data and Hadoop Big Business Value from Big Data and Hadoop Page 1 Topics The Big Data Explosion: Hype or Reality Introduction to Apache Hadoop The Business Case for Big Data Hortonworks Overview & Product Demo Page 2

More information

Hadoop in Production. Charles Zedlewski, VP, Product

Hadoop in Production. Charles Zedlewski, VP, Product Hadoop in Production Charles Zedlewski, VP, Product Cloudera In One Slide Hadoop meets enterprise Investors Product category Business model Jeff Hammerbacher Amr Awadallah Doug Cutting Mike Olson - CEO

More information

Universal Storage for Data Lakes: Dell EMC Isilon

Universal Storage for Data Lakes: Dell EMC Isilon Enterprise Strategy Group Getting to the bigger truth. White Paper Universal Storage for Data Lakes: Dell EMC Isilon By Nik Rouda, ESG Senior Analyst; and Terri McClure, ESG Senior Analyst November 2016

More information

Machine-generated data: creating new opportunities for utilities, mobile and broadcast networks

Machine-generated data: creating new opportunities for utilities, mobile and broadcast networks APPLICATION BRIEF Machine-generated data: creating new opportunities for utilities, mobile and broadcast networks Electronic devices generate data every millisecond they are in operation. This data is

More information

TechValidate Survey Report. Converged Data Platform Key to Competitive Advantage

TechValidate Survey Report. Converged Data Platform Key to Competitive Advantage TechValidate Survey Report Converged Data Platform Key to Competitive Advantage TechValidate Survey Report Converged Data Platform Key to Competitive Advantage Executive Summary What Industry Analysts

More information

Real-time Streaming Insight & Time Series Data Analytic For Smart Retail

Real-time Streaming Insight & Time Series Data Analytic For Smart Retail Real-time Streaming Insight & Time Series Data Analytic For Smart Retail Sudip Majumder Senior Director Development Industry IoT & Big Data 10/5/2016 Economic Characteristics of Data Data is the New Oil..then

More information

Nimble Storage vs Dell EMC: A Comparison Snapshot

Nimble Storage vs Dell EMC: A Comparison Snapshot Nimble Storage vs Dell EMC: 1056 Baker Road Dexter, MI 48130 t. 734.408.1993 Nimble Storage vs Dell EMC: INTRODUCTION: Founders incorporated Nimble Storage in 2008 with a mission to provide customers with

More information

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data?

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data? Glenn Anderson, IBM Lab Services and Training What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data? Winter SHARE March 2014 Session 15126 Today s mainframe is a hybrid system InfoSphere Streams

More information

vsphere with Operations Management and vcenter Operations VMware vforum, 2014 Mehmet Çolakoğlu 2014 VMware Inc. All rights reserved.

vsphere with Operations Management and vcenter Operations VMware vforum, 2014 Mehmet Çolakoğlu 2014 VMware Inc. All rights reserved. vsphere with Operations Management and vcenter Operations VMware vforum, 2014 Mehmet Çolakoğlu 2014 VMware Inc. All rights reserved. What s on the agenda? vsphere with Operations Management Overview What

More information

Reduce Money Laundering Risks with Rapid, Predictive Insights

Reduce Money Laundering Risks with Rapid, Predictive Insights SOLUTION brief Digital Bank of the Future Financial Services Reduce Money Laundering Risks with Rapid, Predictive Insights Executive Summary Money laundering is the process by which the illegal origin

More information

Creating an Enterprise-class Hadoop Platform Joey Jablonski Practice Director, Analytic Services DataDirect Networks, Inc. (DDN)

Creating an Enterprise-class Hadoop Platform Joey Jablonski Practice Director, Analytic Services DataDirect Networks, Inc. (DDN) Creating an Enterprise-class Hadoop Platform Joey Jablonski Practice Director, Analytic Services DataDirect Networks, Inc. (DDN) Who am I? Practice Director, Analytic Services at DataDirect Networks, Inc.

More information

Strategies for Taming Data Growth through Archiving

Strategies for Taming Data Growth through Archiving 1 Strategies for Taming Data Growth through Archiving Why Does Information Overload Matter? Copyright 2014 EMC Corporation. All rights reserved. 2 STORAGE LICENSES MANAGEMENT BACKUPS POLICIES ediscovery

More information

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Pentaho 8.0 and Beyond Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our

More information

Hortonworks Powering the Future of Data

Hortonworks Powering the Future of Data Hortonworks Powering the Future of Simon Gregory Vice President Eastern Europe, Middle East & Africa 1 Hortonworks Inc. 2011 2016. All Rights Reserved MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA

More information

Exelon Utilities Data Analytics Journey

Exelon Utilities Data Analytics Journey Exelon Utilities Data Analytics Journey Presented by Dean M Hengst PI System uses with-in Exelon Utilities Intelligent Substation Substation Security Historical Playback / Capacity Planning ComEd as implemented

More information

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand Paper 2698-2018 Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand ABSTRACT Digital analytics is no longer just about tracking the number

More information

IBM Big Data Summit 2012

IBM Big Data Summit 2012 IBM Big Data Summit 2012 12.10.2012 InfoSphere BigInsights Introduction Wilfried Hoge Leading Technical Sales Professional hoge@de.ibm.com twitter.com/wilfriedhoge 12.10.1012 IBM Big Data Strategy: Move

More information

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services SAP Big Data Markus Tempel SAP Big Data and Cloud Analytics Services Is that Big Data? 2015 SAP AG or an SAP affiliate company. All rights reserved. 2 What if you could turn new signals from Big Data into

More information

Copyright 2015 EMC Corporation. All rights reserved. STRATEGIC FORUM 2015 PAUL MARITZ CEO, PIVOTAL SOFTWARE

Copyright 2015 EMC Corporation. All rights reserved. STRATEGIC FORUM 2015 PAUL MARITZ CEO, PIVOTAL SOFTWARE STRATEGIC FORUM 2015 PAUL MARITZ CEO, PIVOTAL SOFTWARE BACK IN MARCH 2013, WE TOLD YOU PIVOTAL IS BEING CREATED TO: Respond to business needs to do new things to generate business value By creating a modern

More information

Modernizing Data Integration

Modernizing Data Integration Modernizing Data Integration To Accommodate New Big Data and New Business Requirements Philip Russom Research Director for Data Management, TDWI December 16, 2015 Sponsor Speakers Philip Russom TDWI Research

More information

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward Deloitte School of Analytics Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward February 2018 Agenda 7 February 2018 8 February 2018 9 February 2018 8:00 9:00 Networking

More information

EBOOK: Cloudwick Powering the Digital Enterprise

EBOOK: Cloudwick Powering the Digital Enterprise EBOOK: Cloudwick Powering the Digital Enterprise Contents What is a Data Lake?... Benefits of a Data Lake on AWS... Building a Data Lake on AWS... Cloudwick Case Study... About Cloudwick... Getting Started...

More information

Hybrid Data Management

Hybrid Data Management Kelly Schlamb Executive IT Specialist, Worldwide Analytics Platform Enablement and Technical Sales (kschlamb@ca.ibm.com, @KSchlamb) Hybrid Data Management IBM Analytics Summit 2017 November 8, 2017 5 Essential

More information

In-Memory Analytics: Get Faster, Better Insights from Big Data

In-Memory Analytics: Get Faster, Better Insights from Big Data Discussion Summary In-Memory Analytics: Get Faster, Better Insights from Big Data January 2015 Interview Featuring: Tapan Patel, SAS Institute, Inc. Introduction A successful analytics program should translate

More information

Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform

Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform Analytics R&D and Product Management Document Version 1 WP-Urika-GX-Big-Data-Analytics-0217 www.cray.com

More information

Five Questions to Ask Before Choosing a Hadoop Distribution

Five Questions to Ask Before Choosing a Hadoop Distribution Five Questions to Ask Before Choosing a Hadoop Distribution SPONSORED BY CONTENTS Introduction 1 1. What does it take to make Hadoop enterprise-ready? 1 2. Does the distribution offer scalability, reliability,

More information

Big Data Management Best Practices for Data Lakes Philip Russom, Ph.D.

Big Data Management Best Practices for Data Lakes Philip Russom, Ph.D. Big Data Management Best Practices for Data Lakes Philip Russom, Ph.D. Senior Research Director, TDWI October 27, 2016 Sponsor 2 Speakers Philip Russom Senior Research Director for Data Management, TDWI

More information

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure The Internet of Things Wind Turbine Predictive Analytics Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure Big Data and Tribo-Analytics Today we will see how Fluitec solved real-world challenges

More information