Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer
- Jonathan Harris
- 6 years ago
Transcription
1 Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi, Systems Engineer
2 Agenda: Data Analytics Today; Big data; Hadoop & HDFS; Different types of analytics; Data lakes; EMC Solutions for Data Lakes
3 The world before big data: Data warehousing. Research and the definition of dimensions and facts started in the 1960s. Things really got going in the 1980s.
4 So what changed? Big data rocked up to the party.
5 Traditional solutions struggled: Too much data. No real-time analysis. No data exploration. More expensive hardware to go faster and deeper. Overnight batch not good enough. Not just structured data in a star schema.
6 Thankfully we had Google. Cue Doug Cutting's son and his elephant, Hadoop. The computation tier uses a framework called MapReduce. Storage is provided via a distributed filesystem called HDFS. Hadoop runs on commodity hardware.
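The MapReduce model named on this slide can be illustrated with a minimal, single-process sketch. It is not Hadoop code: the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are made up for illustration, and a real cluster would run the map and reduce tasks in parallel on the nodes that hold the HDFS blocks.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(pairs):
    """Shuffle step: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce step: collapse the list of counts for each word into a sum."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(documents):
    """Run map, shuffle and reduce in sequence over a list of text records."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    return reduce_phase(shuffle_phase(pairs))


if __name__ == "__main__":
    print(word_count(["big data", "data lake", "big data lake"]))
```

The point of the split is that the map and reduce functions are independent per record and per key, which is what lets a framework spread them across many commodity machines.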
7 All analytics aren't equal: Descriptive, Predictive and Prescriptive. There is also Diagnostic. [Figure: Competitive Advantage plotted against Degree of Complexity. Prescriptive: How can we achieve the best outcome including the effects of variability? How can we achieve the best outcome? Predictive: What will happen next if? What if these trends continue? What could happen? Descriptive: What actions are needed? What exactly is the problem? How many, how often, where? What happened?] Source: Based on "Competing on Analytics," Davenport and Harris
8 Descriptive Analytics, Prescriptive Analytics, Predictive Analytics
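To make the difference between the first two levels concrete, here is a small sketch, assuming a plain list of past values (for example, monthly order counts). `describe` answers "What happened?" and `predict_next` answers "What if these trends continue?" by extrapolating a least-squares straight line. The function names and the choice of a linear trend are illustrative assumptions, not something the slides specify; prescriptive analytics (optimization) is not covered here.

```python
from statistics import mean


def describe(history):
    """Descriptive: summarize what already happened."""
    return {"count": len(history), "total": sum(history), "average": mean(history)}


def fit_trend(history):
    """Fit y = intercept + slope * t by ordinary least squares, t = 0..n-1."""
    n = len(history)
    if n < 2:
        raise ValueError("need at least two observations to fit a trend")
    t_mean = (n - 1) / 2
    y_mean = mean(history)
    numerator = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(history))
    denominator = sum((t - t_mean) ** 2 for t in range(n))
    slope = numerator / denominator
    intercept = y_mean - slope * t_mean
    return intercept, slope


def predict_next(history, steps=1):
    """Predictive: extrapolate the fitted trend `steps` periods past the last one."""
    intercept, slope = fit_trend(history)
    return intercept + slope * (len(history) - 1 + steps)


if __name__ == "__main__":
    orders = [10, 12, 14, 16]
    print(describe(orders))
    print(predict_next(orders))
```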
9 Data lakes: Today, think of it in terms of co-existence with Enterprise DWH. Both environments are valid. [Diagram labels: Semi-structured & Unstructured Data; Hadoop Based Data Lake; Analyze & Report; Client/Portal Devices; Structured Data (CRM, ERP, OLTP DB); Data Transformation ETL/ELT; Enterprise DWH; Data Security, Backup]
10 What is a Data Lake? "If you think of a datamart as a store of bottled water, cleansed and packaged and structured for easy consumption, the data lake is a large body of water in a more natural state." (James Dixon, coiner of the "data lake" term)
11 Pragmatic approach to Data Lake: Identify domain. Be pragmatic/start small. Build lake infrastructure. Fill lake. Build fishing poles: exploration, extract value, then expand.
12 Data Lake Interaction: 3 main levels of interaction. Real Time: for fast analysis and correlation. Interactive: for transactional processing. Batch: for large dataset analysis.
13 Lake Infrastructure: EMC Solutions for Data Lake Infrastructure. [Diagram labels: ViPR Controller; EMC Big Data Storage; DSSD; Isilon; VNX; ViPR Services; Commodity; ECS; Real-Time; Interactive; Batch]
14 Build Lake Infrastructure: Use General Purpose Arrays/Commodity Disks As Data Lake Store. [Diagram labels: ViPR Data Services; 3rd Party; VNX; Commodity] Be fast. Reuse your current infrastructure to build an HDFS repository. Reduce risk. Reduce CAPEX investment required to perform analytics. Maintain data protection, compliance at array level. Reduce cost and complexity of dedicated clusters. Reduce need for new vendor nodes and storage capacity.
15 Build Lake Infrastructure: Object, File And HDFS Operations On The Same Data. [Diagram labels: Object; Object & HDFS; HDFS; Virtual Array; Commodity] ViPR Object & ViPR HDFS access on the same data. S3, Swift, Atmos API via the Object head. File protocols in development. Use your preferred Hadoop distribution.
16 Build Lake Infrastructure: Use Specialized Arrays As Data Lake Store. ECS Appliance. Hyper-scale: ECS supports unlimited applications and users on a single, scale-out architecture; start at 360 TB and scale to multiple petabytes or even exabytes. 3rd platform applications. Pre-Engineered and Pre-Built Commodity Hardware. Structured and Unstructured Content.
17 Build Lake Infrastructure: Use Specialized Arrays As Data Lake Store. Accelerate the benefits of Hadoop for the enterprise: proven Hadoop solution, faster implementation; greater interoperability with enterprise applications and Hadoop analytics through multi-protocol parallel access from any client. Enterprise data protection: fast snapshots, backup, and recovery; simple, reliable data replication for disaster recovery. Ultimate flexibility: scale compute and storage resources separately; supports physical and virtualized server environments.
18 Lake Software: EMC/Pivotal Solutions for Data Lake Software. [Diagram labels: Real-Time; Interactive; Batch; Greenplum DB; GemFire XD; HAWQ; Pivotal HD; Unlimited]
19 Pivotal HD Architecture - Apache. [Diagram labels: Resource Management & Workflow; Yarn; Zookeeper; HBase; HDFS; Pig, Hive, Mahout; MapReduce; Sqoop; Flume; legend: Apache]
20 HAWQ - Full ANSI SQL Engine on Hadoop. [Diagram labels: HAWQ Advanced Database Services (Xtension Framework; ANSI SQL + Analytics; MADlib Algorithms; Catalog Services; Dynamic Pipelining; Query Optimizer); Resource Management & Workflow; Yarn; Zookeeper; HBase; Spring; Pig, Hive, Mahout; MapReduce; Command Center (Configure, Deploy, Monitor, Manage); Hadoop Virtualization Extension; HDFS; Unified Storage Service; Sqoop; Data Loader; Flume; legend: Apache, Pivotal]
21 GemFire - Real-Time Data Service. [Diagram labels: HAWQ Advanced Database Services (Xtension Framework; ANSI SQL + Analytics; MADlib Algorithms; Catalog Services; Dynamic Pipelining; Query Optimizer); GemFire XD Real-Time Database Services (Distributed In-memory Store; ANSI SQL + In-Memory; Query Transactions; Ingestion Processing; Hadoop Driver Parallel with Compaction); Resource Management & Workflow; Yarn; Zookeeper; HBase; Spring; Pig, Hive, Mahout; MapReduce; Command Center (Configure, Deploy, Monitor, Manage); Hadoop Virtualization Extension; HDFS; Unified Storage Service; Sqoop; Data Loader; Flume; legend: Apache, Pivotal]
22 A Reference Architecture: Standardized, on-demand services are layered around shared data repositories & processing capabilities to form the data lake. Ingest and data capture: Scheduled, batch data ingest to capture bulk data sources. Micro-batch ingest capturing small quantities of data. Low-latency and real-time ingest of data. Real-time routing of data to complex event processing and persistent storage. Data sources: Existing structured data. Unstructured or semi-structured data sources. Machine-generated data such as logs and sensor data. External data sources. Applications and integration: CloudFoundry on vSphere. Build interactive, data-driven applications using modern frameworks and approaches. Data analytics: In-memory performance (GemFire). MPP processing (Pivotal HD). High-performance SQL access to HDFS data (HAWQ). Shared storage and re-use: Isilon and ViPR provide shared access to new and existing data sources through HDFS. Minimize data copies. Smart de-dupe for Hadoop. Kerberos authentication.
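The micro-batch ingest pattern mentioned on this slide can be sketched in a few lines: records arrive as a stream and are written to storage in small groups instead of one at a time or in one overnight bulk load. This is a generic illustration only, not an EMC or Pivotal API; `micro_batches`, `ingest` and the `sink` callable (a stand-in for persistent storage) are hypothetical names.

```python
from itertools import islice


def micro_batches(records, batch_size):
    """Yield successive lists of at most `batch_size` records from any iterable."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    iterator = iter(records)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def ingest(records, batch_size, sink):
    """Pass each micro-batch to `sink` and return how many batches were written."""
    written = 0
    for batch in micro_batches(records, batch_size):
        sink(batch)  # hypothetical stand-in for a write to persistent storage
        written += 1
    return written


if __name__ == "__main__":
    stored = []
    print(ingest(range(7), 3, stored.append), stored)
```

Because `micro_batches` consumes any iterable lazily, the same loop works for a finite file of log lines or for an unbounded generator of sensor readings.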
23 What about services? Data Science + Data Engineering