Hadoop Stories. Tim Marston. Director, Regional Alliances Page 1. Hortonworks Inc All Rights Reserved

Similar documents
Hortonworks Powering the Future of Data

Hortonworks Connected Data Platforms

Powering the Future of Data

Future of Data Hortonworks Data Platform and Hortonworks Data Flow. Eric Thorsen, VP Industry Solutions

Apache Hadoop in the Datacenter and Cloud

Table of Contents. Are You Ready for Digital Transformation? page 04. Take Advantage of This Big Data Opportunity with Cisco and Hortonworks page 06


5th Annual. Cloudera, Inc. All rights reserved.

Investor Presentation. Second Quarter 2016

Investor Presentation. Fourth Quarter 2015

Hortonworks Data Platform

Azure IoT Suite. Secure device connectivity and management. Data ingestion and command + control. Rich dashboards and visualizations

Operational Hadoop and the Lambda Architecture for Streaming Data

Big Data Analytics for Retail with Apache Hadoop. A Hortonworks and Microsoft White Paper

Insights-Driven Operations with SAP HANA and Cloudera Enterprise

SOLUTION SHEET End to End Data Flow Management and Streaming Analytics Platform

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC

Managing Data in Motion with the Connected Data Architecture

Architecture Overview for Data Analytics Deployments

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Your Top 5 Reasons Why You Should Choose SAP Data Hub INTERNAL

Confidential

EMC IT Big Data Analytics Journey. Mahmoud Ghanem Sr. Systems Engineer

Spark and Hadoop Perfect Together

Managing explosion of data. Cloudera, Inc. All rights reserved.

SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform

Adobe and Hadoop Integration

Spotlight Sessions. Nik Rouda. Director of Product Marketing Cloudera, Inc. All rights reserved. 1

Realising Value from Data

LEVERAGING DATA ANALYTICS TO GAIN COMPETITIVE ADVANTAGE IN YOUR INDUSTRY

Adobe and Hadoop Integration

DLT AnalyticsStack. Powering big data, analytics and data science strategies for government agencies

Common Customer Use Cases in FSI

2013 PARTNER CONNECT

TechValidate Survey Report. Converged Data Platform Key to Competitive Advantage

Microsoft Big Data. Solution Brief

The Importance of good data management and Power BI

THE CIO GUIDE TO BIG DATA ARCHIVING. How to pick the right product?

Datametica DAMA. The Modern Data Platform Enterprise Data Hub Implementations. What is happening with Hadoop Why is workload moving to Cloud

Architecting an Open Data Lake for the Enterprise

From Data Deluge to Intelligent Data

Architecture Optimization for the new Data Warehouse. Cloudera, Inc. All rights reserved.

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

How Data Science is Changing the Way Companies Do Business Colin White

Hadoop on Shared, Software-defined Storage

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Analytics Platform System

Taking Advantage of Cloud Elasticity and Flexibility

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

2014 Nordic Partner Day. Big Data

Big Data Introduction

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud

ADVANCED ANALYTICS & IOT ARCHITECTURES

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS)

TechArch Day Digital Decoupling. Oscar Renalias. Accenture

GET MORE VALUE OUT OF BIG DATA

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2

AZURE HDINSIGHT. Azure Machine Learning Track Marek Chmel

Business Intelligence, 4e (Sharda/Delen/Turban) Chapter 1 An Overview of Business Intelligence, Analytics, and Data Science

By: Shrikant Gawande (Cloudera Certified )

Data. Does it Matter?

How In-Memory Computing can Maximize the Performance of Modern Payments

Cognizant BigFrame Fast, Secure Legacy Migration

Modernizing Your Data Warehouse with Azure

CREATING A FOUNDATION FOR BUSINESS VALUE

Modern Analytics Architecture

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer

Hortonworks HDP with IBM Spectrum Scale

Executive Brief. 3 Keys to Self-Service Data Preparation

MapR Pentaho Business Solutions

Responsive enterprise the future of the enterprise PERSPECTIVE

Welcome to. enterprise-class big data and financial a. Putting big data and advanced analytics to work in financial services.

MapR: Solution for Customer Production Success

Digital Services. How can InfoCentric help you make the most of The Digital Revolution? InfoCentric 2016

Blueprints for Big Data Success. Succeeding with four common scenarios

A complete service guide for MICROSOFT DATA ANALYTICS ENABLEMENT

Analytics in Action transforming the way we use and consume information

Blueprints for Big Data Success

Splunk Discovery Day Moscow

Building data-driven applications with SAP Data Hub and Amazon Web Services

IoT for Lunch (and other critical workplace activities)

CONNECTING THE DOTS FOR BETTER INSIGHT.

Evolution to Revolution: Big Data 2.0

Five Advances in Analytics

BIG DATA TRANSFORMS BUSINESS. Copyright 2013 EMC Corporation. All rights reserved.

20775A: Performing Data Engineering on Microsoft HD Insight

20775 Performing Data Engineering on Microsoft HD Insight

Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Russell Hull - SAP

Hadoop and Analytics at CERN IT CERN IT-DB

Big and Fast Data: The Path To New Business Value

Analytics for All Your Data: Cloud Essentials. Pervasive Insight in the World of Cloud

Integrating MATLAB Analytics into Enterprise Applications

Bringing Big Data to Life: Overcoming The Challenges of Legacy Data in Hadoop

INSIGHTS & BIG DATA. Data Science as a Service BIG DATA ANALYTICS

20775: Performing Data Engineering on Microsoft HD Insight

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

POWER NEW POSSIBILITIES

Data: Foundation Of Digital Transformation

Transcription:

Hadoop Stories Tim Marston Director, Regional Alliances EMEA Page 1 @timmarston

Page 2

Plans for Hadoop Adoption (Gartner, May 2015) Start within 1 year 11% Start within 2 years 7% Already doing 27% No Plans 55% Page 3

Page 4

Plans for Hadoop Adoption (Gartner, May 2015) Start within 1 year 11% Start within 2 years 7% Already doing 27% No Plans 55% Page 5

Hortonworks: Hadoop for the Enterprise ONLY Apache 100 open source TM % Hadoop data platform Founded in 2011 1 ST HADOOP distribution to go public IPO Fall 2014 (NASDAQ: HDP) 700+ subscription customers employees 800+ across 1350+ 17 technology partners countries Page 6

The Modern Data Architecture ANALYTICS Data Applications Marts Business Analytics Visualization & Dashboards Applications can be given access to all data through a single platform Batch MP P Batch EDW Batch Interactive Real-Time Partner ISV YARN: Data Operating System HDFS (Hadoop Distributed File System) Simpler governance, security and operations SOURCES ERP CRM SCM Existing Systems Clickstream Web & Social Geolocation Sensor & Machine Server Logs Unstructured Able to handle all sizes and types of applications and datasets Page 7

The Hadoop Journey Page 8

IT executives are delivering substantial reductions in operating costs by modernizing their data architectures with Open Enterprise Hadoop. These cost saving innovations include active archive of cold data, offloading ETL processes and enriching existing data. OPEX Reduction Device Data Ingest Data as a Service Historical Records Fraud Prevention Mainframe Offloads Rapid Reporting Digital Protection Public Data Capture Page 9

Payment Tracking Due Diligence Sentiment Analysis Social Mapping Customer Support Optimize Inventories Next Product Recs Store Design Call Analysis Machine Data Product Design M & A Ad Placement Basket Analysis Segments Proactive Repair Disaster Mitigation Investment Planning Factory Yields Defect Detection Cross- Sell Supply Chain Customer Retention Vendor Scorecards Inventory Predictions Risk Modeling Ad Placement Business executives are driving transformational outcomes with next-generation applications that empower new uses of Big Data including: data discovery, a single view of the customer and predictive analytics. Page 10

The Vision: Enabling the Data Lake SCALE Journey to the Data Lake with Hadoop Systems of Insight DATA LAKE Goal: Centralized Architecture Data-driven Business Data Lake Definition Centralized Architecture Multiple applications on a shared data set with consistent levels of service Any App, Any Data Multiple applications accessing all data affording new insights and opportunities. Unlocks Systems of Insight Advanced algorithms and applications used to derive new value and optimize existing value. Drivers: 1. Cost Optimization 2. Advanced Analytic Apps Page 11 SCOPE

Data Archive for legal cases Project cash positive after 12 months, with order of magnitude Opex savings once implemented. Page 12 Source: http://www.slideshare.net/hadoop_summit/making-the-case-for-hadoop-in-a-large-enterprisebritish-airways

Self-learning cars...if British manufacturing is to survive it needs to be competitive. And it cannot be competitive without data. Page 13 Source: https://diginomica.com/2015/09/11/using-hadoop-inside-jaguar-land-rover-zurich-insurance-and-the-home-office/

Modern Data Architecture The aim is to use Hadoop, a mix of internal and external data, to take Zurich Insurance to the next level of maturity in terms of using data to drive business decisions. Page 14 Source: https://diginomica.com/2015/09/11/using-hadoop-inside-jaguar-land-rover-zurich-insurance-and-the-home-office/

Hadoop at Scale 1300 Hadoop nodes, 42PB stored 20TB ingested via Kafka per day 200TB generated by Hadoop per day Page 15 Source: http://cdn.oreillystatic.com/en/assets/1/event/118/the%20evolution%20of%20hadoop%20at%20spotify-%20through%20failures%20and%20pain%20presentati

The Future? Page 16

The 100% open source Hadoop distribution Page 17

Hortonworks DataFlow Hortonworks DataFlow powered by Apache NiFi Perishable Insights Store Data and Metadata Enrich Context Internet of Anything Hortonworks Data Platform powered by Apache Hadoop Hortonworks Data Platform powered by Apache Hadoop Historical Insights Page 18

Appendix Page 19

Hortonworks Influences the Apache Community We Employ the Committers --one third of all committers to the Apache Hadoop project, and a majority in other important projects Our Committers Innovate and expand Open Enterprise Hadoop We Influence the Hadoop Roadmap by communicating important requirements to the community through our leaders APACHE HADOOP COMMITTERS Page 20

OPEN COMMUNITY THE INNOVATION ADVANTAGE PROPRIETARY HADOOP TIME INNOVATION Hortonworks Data Platform Is Genuinely Open Eliminates Risk of vendor lock-in by delivering 100% Apache open source technology Maximizes Community Innovation with hundreds of developers across hundreds of companies Integrates Seamlessly through committed co-engineering partnerships with other leading technologies MAXIMUM COMMUNITY INNOVATION Page 21

Hortonworks Delivers Proactive Support Integrated Customer Portal Knowledge Base On-Demand Training Hortonworks SmartSense Customer Environment Any cloud Hybrid Environment Multi-tenant Hortonworks SmartSense with machine learning and predictive analytics on your cluster Integrated Customer Portal with knowledge base and on-demand training Page 22

The Open Data Platform April 14 HDP Open Platform with Apache Hadoop Pivotal HD Common ODP Core Apache Hadoop and Apache Ambari Page 23

The Data Governance Initiative Hadoop must snap in to the existing frameworks and openly exchange metadata. Hadoop must address governance within its own stack of technologies Page 24

Cautionary Statement Regarding Forward-Looking Statements This presentation contains forward-looking statements involving risks and uncertainties. Such forward-looking statements in this presentation generally relate to future events, our ability to increase the number of support subscription customers, the growth in usage of the Hadoop framework, our ability to innovate and develop the various open source projects that will enhance the capabilities of the Hortonworks Data Platform, anticipated customer benefits and general business outlook. In some cases, you can identify forward-looking statements because they contain words such as may, will, should, expects, plans, anticipates, could, intends, target, projects, contemplates, believes, estimates, predicts, potential or continue or similar terms or expressions that concern our expectations, strategy, plans or intentions. You should not rely upon forward-looking statements as predictions of future events. We have based the forward-looking statements contained in this presentation primarily on our current expectations and projections about future events and trends that we believe may affect our business, financial condition and prospects. We cannot assure you that the results, events and circumstances reflected in the forward-looking statements will be achieved or occur, and actual results, events, or circumstances could differ materially from those described in the forward-looking statements. The forward-looking statements made in this prospectus relate only to events as of the date on which the statements are made and we undertake no obligation to update any of the information in this presentation. Trademarks Page 25 Hortonworks is a trademark of Hortonworks, Inc. in the United States and other