Sr. Sergio Rodríguez de Guzmán CTO PUE

Similar documents
Architecture Optimization for the new Data Warehouse. Cloudera, Inc. All rights reserved.

Taking Advantage of Cloud Elasticity and Flexibility

5th Annual. Cloudera, Inc. All rights reserved.

Hortonworks Connected Data Platforms

Transforming Analytics with Cloudera Data Science WorkBench

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

Make Business Intelligence Work on Big Data

Oracle Big Data Cloud Service

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation

Hortonworks Data Platform

Cask Data Application Platform (CDAP) Extensions

Insights to HDInsight

Leveraging Predictive Tools to Decrease Resolution Time

Apache Hadoop in the Datacenter and Cloud

Hadoop in the Cloud. Ryan Lippert, Cloudera Product Cloudera, Inc. All rights reserved.

AZURE HDINSIGHT. Azure Machine Learning Track Marek Chmel

Building a Single Source of Truth across the Enterprise An Integrated Solution

MapR: Solution for Customer Production Success

Spark and Hadoop Perfect Together

Microsoft Azure Essentials

Optimal Infrastructure for Big Data

DataAdapt Active Insight

Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

Cloudera Hadoop & Industrie 4.0 wohin mit dem Datenstrom?

TechValidate Survey Report. Converged Data Platform Key to Competitive Advantage

Rapid Start with Big Data Appliance X6-2 Technical & Operational Overview

SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform


Big Data Hadoop Administrator.

Legacy Application Retirement Guide

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

Governing Big Data and Hadoop

AMD and Cloudera : Big Data Analytics for On-Premise, Cloud and Hybrid Deployments

Cloud Based Analytics for SAP

Cask Data Application Platform (CDAP)

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Big Data Cloud. Simple, Secure, Integrated and Performant Big Data Platform for the Cloud

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud

Insights-Driven Operations with SAP HANA and Cloudera Enterprise

TECHNICAL WHITE PAPER. Rubrik and Microsoft Azure Technology Overview and How It Works

Cloudera, Inc. All rights reserved.

Spotlight Sessions. Nik Rouda. Director of Product Marketing Cloudera, Inc. All rights reserved. 1

Hybrid Data Management

Datametica DAMA. The Modern Data Platform Enterprise Data Hub Implementations. What is happening with Hadoop Why is workload moving to Cloud

Business is being transformed by three trends

Hadoop and Analytics at CERN IT CERN IT-DB

How In-Memory Computing can Maximize the Performance of Modern Payments

Ed Turkel HPC Strategist

Outline of Hadoop. Background, Core Services, and Components. David Schwab Synchronic Analytics Nov.

Managing explosion of data. Cloudera, Inc. All rights reserved.

1 Hortonworks Inc All Rights Reserved

Common Customer Use Cases in FSI

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Amsterdam. (technical) Updates & demonstration. Robert Voermans Governance architect

E-guide Hadoop Big Data Platforms Buyer s Guide part 3

Analytics With Hadoop. SAS and Cloudera Starter Services: Visual Analytics and Visual Statistics

INDUSTRY BRIEF THE ENTERPRISE DATA HUB IN FINANCIAL SERVICES: THREE CUSTOMER CASE STUDIES

Building the Enterprise Data Lake with Cloudera & Cisco

Delivering Data Warehousing as a Cloud Service

Bringing Big Data to Life: Overcoming The Challenges of Legacy Data in Hadoop

Evolving Your Infrastructure to Cloud

Cloudera Data Science and Machine Learning. Robin Harrison, Account Executive David Kemp, Systems Engineer. Cloudera, Inc. All rights reserved.

Building a Data Lake on AWS EBOOK: BUILDING A DATA LAKE ON AWS 1

Big Data Management Best Practices for Data Lakes Philip Russom, Ph.D.

DOAG Big Data Days 2018 DWH Modernization

MapR Pentaho Business Solutions

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Adobe and Hadoop Integration

OSIsoft Super Regional Transform Your World

SOLUTION SHEET End to End Data Flow Management and Streaming Analytics Platform

#mstrworld. A Deep Dive Into Self-Service Data Discovery In MicroStrategy. Vijay Anand Gianthomas Tewksbury Volpe. #mstrworld

Databricks Cloud. A Primer

Safe Harbor Statement

Microsoft Big Data. Solution Brief

Oracle's Big Data analytics portfolio gains critical mass

Cognitive Data Warehouse and Analytics

Realising Value from Data

Adobe and Hadoop Integration

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand

When the Status Quo Means Getting Left Behind. Accelerating Analytics Platform Adoption through Evolving Technology

Two offerings which interoperate really well

Reduce Money Laundering Risks with Rapid, Predictive Insights

GET MORE VALUE OUT OF BIG DATA

EBOOK: Cloudwick Powering the Digital Enterprise

BIG DATA AND HADOOP DEVELOPER

Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications

Architecture Overview for Data Analytics Deployments

Analytics Platform System

Azure Data Analytics & Machine Learning Seminar. Daire Cunningham: BI Practice Area Manager

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Building Your Big Data Team

"Charting the Course... MOC A: Architecting Microsoft Azure Solutions. Course Summary

IBM Spectrum Scale. Advanced storage management of unstructured data for cloud, big data, analytics, objects and more. Highlights

New Big Data Solutions and Opportunities for DB Workloads

Meta-Managed Data Exploration Framework and Architecture

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

20775: Performing Data Engineering on Microsoft HD Insight

Transcription:

PRODUCT LATEST NEWS

Sr. Sergio Rodríguez de Guzmán CTO PUE www.pue.es

Hadoop & Why Cloudera Sergio Rodríguez Systems Engineer sergio@pue.es 3

Industry-Leading Consulting and Training PUE is the first Spanish Cloudera Silver Integrator PUE is the only Training partner delivering classes across all Europe Cloudera has trained over 40,000 people on Hadoop since 2009 Source: Fortune, Fortune 500 and Global 500, May 2012. 4

Common BigData Early Problems in a Project Infrastructure investment Security and Compliance concerns Architecture sizing Wide and heterogeneous Hadoop ecosystem Support Ease of management 5

Best-In-Class Support 8.9 95% #1 Overall satisfaction makes Cloudera the industry benchmark for support Customers agree they benefit from Cloudera technical support outreach Ability to solve technical issues is the top reason to recommend Cloudera for Hadoop 6

Cloudera Platform 7

Cloudera Enterprise Making Hadoop Fast, Easy, and Secure Process Discover Model Serve Batch, Stream SQL, Search Analytics, ML NoSQL Security, Governance, Administration Deployment Flexibility Unlimited Storage On-Premises Appliances Engineered Systems Public Cloud Hybrid Cloud Private Cloud Hadoop delivers: One place for unlimited data Unified, multi-framework data access Cloudera delivers: Leading performance Easy system management Compliance-ready security 8

From Hadoop to an Enterprise Data Hub Open Source Scalable Flexible Cost-Effective Managed CLOUDERA S ENTERPRISE DATA HUB BATCH PROCESSING MAPREDUCE ANALYTIC SQL IMPALA SEARCH ENGINE SOLR MACHINE LEARNING SPARK WORKLOAD MANAGEMENT STREAM PROCESSING SPARK STREAMING YARN 3 RD PARTY APPS DATA MANAGEMENT CLOUDERA NAVIGATOR Open Architecture Secure and Governed STORAGE FOR ANY TYPE OF DATA UNIFIED, ELASTIC, RESILIENT,, SECURE SENTRY FILESYSTEM ONLINE NOSQL HDFS HBASE SYSTEM MANAGEMENT CLOUDERA MANAGER 9

The Only Complete Hadoop Management Suite Deliver optimum system utilization and meet SLA commitments. Cloudera Manager Focus on the solution, not the cluster, with the only complete, zero-downtime administration tool for Apache Hadoop. Unique Capabilities: Unified configuration, management and monitoring across all services Online installation and upgrades Direct connection to Cloudera Support 3 rd Party Extensibility 10

The Only Portable Cloud Experience for Hadoop Maximize flexibility in Hadoop deployment architectures. Cloudera Director The first portable, self-service solution for deploying and managing enterprise-grade Hadoop in the Cloud. Unique Capabilities: Dynamic cluster lifecycle management Cloud blueprints Multi-cluster health visibility Usage reporting for billing models 11

Why Cloudera is the Leader in Spark Support Integrated with other Cloudera Components Cloudera Manager, Sentry, Navigator, etc. Cloudera more customers running Spark today than all our competitors combined. Installations range from a few nodes to 1000 node installs. Cloudera has been supporting Spark since early 2014 and first Hadoop vendor Between Cloudera and Intel, have over 20 developers working on Spark and 4 Committers The first and only Spark Training Class 12

Apache Kudu Completes Hadoop's storage layer to enable fast analytics on fast data. Data Model Low-latency Random Access Built by and for Operators Stores tables like relational databases Live storage system which supports low-latency millisecond-scale access to individual rows Advanced in-process tracing capabilities, extensive metrics support, and even watchdog threads 13

The Only Hadoop Data Governance Solution Enable compliance and maximize analyst productivity. Cloudera Navigator Minimize risk and maintain compliance with the only native end-to-end data governance solution for Apache Hadoop. Unique Capabilities: Auditing Lineage Metadata Tagging and Discovery Lifecycle Management 14

Adaptive Data Model Management Improve DBA productivity through continuous optimization. Navigator Optimizer Instantly understand data warehouse and Hadoop cluster usage, and drive optimizations to reduce cost and improve performance. Unique Capabilities: Schema and workload profiling Data model discovery Optimization guidance Optimization automation (future) 15

The Only Comprehensively Secure Hadoop Platform Meet compliance requirements and reduce risk exposure from storing sensitive data. 1. Perimeter Standards-based Authentication Process Discover Model Serve 2. Access Unified Role-based Authorization Security and Administration 3. Visibility Auditing & Governance Unlimited Storage 4. Data Encryption & Key Management Cloudera is the leader in Hadoop security. Unique Capabilities: Comprehensive and Unified Secure at the core No Performance Impact Jointly engineered with Intel Compliance-Ready Only distribution to pass PCI audit 16

MasterCard Cloudera: The first PCI-Certified Hadoop Platform Challenge: All applications, databases, or file systems that have the potential to handle personal account-related data must undergo full PCI certification Solution: MasterCard s Cloudera environment fully conforms to the PCI-DSS V 2.0 security standards so it can host PCI datasets and potentially integrate with other internal systems Data privacy and protection is a top priority for MasterCard. As we maximize the most advanced technologies from partners and vendors, they must meet the rigorous security standards we ve set. With Cloudera s commitment to the same standards, we now have additional options in how we manage our data center. Gary VonderHaar Chief Technology Officer, Architecture MasterCard 17

Security and Governance Perimeter Protecting access to the cluster Access Securing access to data Visibility Reporting on data access and lineage Data Protecting data at rest or in transmission Cloudera Unified, Compliance-Ready, Transparent Kerberos with Cloudera Manager Automated, industry-standard authentication integrated with existing systems Apache Sentry Working within the community to deliver centralized, granular RBAC across frameworks Cloudera Navigator Transparent end-to-end data and metadata visibility, including column-level visibility in lineage and audit Cloudera Navigator Transparent, comprehensive, highperformance, compliance-ready encryption and key management Competitors Fragmented, Incomplete, Complex Kerberos Manual configuration and integration Hive ATZ-NG, Ranger RBAC configuration silos, GUI Band-Aid Falcon, Knox, Ranger, Atlas? Manual and limited auditing through multiple half-baked tools, with more added each release to fill gaps N/A 18

Why Cloudera? Your trusted partner for getting results with enterprise Hadoop. Open Source Innovation No one knows Hadoop better than Cloudera. Cloudera leads development of enterprise Hadoop and offers the best support, training, and services. Powerful Enterprise Tools Cloudera extends open source Hadoop with capabilities required by the largest enterprises. Ecosystem Cloudera partners with industry leaders to ensure Hadoop works with the platforms, tools, and integrators our customers rely on. Enterprise Security Meet compliance requirements and reduce risk exposure from storing sensitive data. Data Governance Enable compliance and maximize analyst productivity. Complete Management Deliver optimum system utilization and meet SLA commitments, on-premises or in the cloud, with minimum effort. 19

Thank You! Sergio Rodríguez sergio@pue.es 20