Insights to HDInsight

Similar documents
Digitalisieren Sie Ihr Unternehmen mit dem Internet der Dinge Michael Epprecht Microsoft GBB IoT

Big Data & Advanced Analytics - "managed Services on Azure

20775: Performing Data Engineering on Microsoft HD Insight

Microsoft Azure Essentials

Cloud Based Analytics for SAP

Microsoft Big Data. Solution Brief

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Sr. Sergio Rodríguez de Guzmán CTO PUE

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

Hybrid Data Management

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

E-guide Hadoop Big Data Platforms Buyer s Guide part 3

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS)

Pentaho 8.0 Overview. Pedro Alves

Embracing Disruption: Financial Services and the Microsoft Cloud

Oracle Big Data Cloud Service

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

Jason Virtue Business Intelligence Technical Professional

Cask Data Application Platform (CDAP)

Azure Offerings for Big data. In Kee Paek Cloud Data Solution Architect Microsoft Korea October. 2016

Common Customer Use Cases in FSI

Cloud service models

Integrating MATLAB Analytics into Enterprise Applications

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11

MapR: Solution for Customer Production Success

House Keeping. You are in Listen Only Mode. Azure 101: Azure Overview. Azure 201: How to do a Cost Estimate for Virtual Machines

MapR Pentaho Business Solutions

Make the most of the cloud with Microsoft System Center and Azure

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

How to sell Azure to SMB customers. Paul Bowkett Microsoft NZ

Microsoft Dynamics 365 and Columbus

Hortonworks Data Platform. Buyer s Guide

Microsoft FastTrack For Azure Service Level Description

Hadoop and Analytics at CERN IT CERN IT-DB

Hadoop in the Cloud. Ryan Lippert, Cloudera Product Cloudera, Inc. All rights reserved.

Quick Reference Guide

Deploying Microservices and Containers with Azure Container Service and DC/OS

Designing Business Intelligence Solutions with Microsoft SQL Server 2014

HPE Flexible Capacity with Microsoft Azure & Azure Stack

Why Big Data Matters? Speaker: Paras Doshi

More information for FREE VS ENTERPRISE LICENCE :

SAP Predictive Analytics Suite

Architecture Overview for Data Analytics Deployments

How to Build Your Data Ecosystem with Tableau on AWS

MIGRATING AND MANAGING MICROSOFT WORKLOADS ON AWS WITH DATAPIPE DATAPIPE.COM

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward

EBOOK: Cloudwick Powering the Digital Enterprise

Hortonworks Apache Hadoop subscriptions ( Subsciptions ) can be purchased directly through HP and together with HP Big Data software products.

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Bringing the Power of SAS to Hadoop Title

1. Intoduction to Hadoop

Oracle Autonomous Data Warehouse Cloud

Secure information access is critical & more complex than ever

BIG DATA and DATA SCIENCE

Five Questions to Ask Before Choosing a Hadoop Distribution

Building a Data Lake with Spark and Cassandra Brendon Smith & Mayur Ladwa

COPYRIGHTED MATERIAL. 1Big Data and the Hadoop Ecosystem

Cloud Failover Appliance

Hadoop in Production. Charles Zedlewski, VP, Product

Six Keys to Microsoft Azure Migration Success

Building Your Big Data Team

DELL EMC HADOOP SOLUTIONS

Oracle's Cloud Strategie für den Geschäftserfolg Alles Neue von der OOW

Operational Hadoop and the Lambda Architecture for Streaming Data

MICROSOFT AI PLATFORM

Innovate with Oracle Public Cloud Platform & Infrastructure Services

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics

Product Brief SysTrack VMP

Big Data Job Descriptions. Software Engineer - Algorithms

Limitless Creativity in the Cloud

Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions

Oracle Autonomous Data Warehouse Cloud

Delivering Data Warehousing as a Cloud Service

Analyzing Data with Power BI

IBM Big Data Summit 2012

Azure Infrastructure for SAP

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Russell Hull - SAP

ETL challenges on IOT projects. Pedro Martins Head of Implementation

Reduce Money Laundering Risks with Rapid, Predictive Insights

Microsoft BI Product Suite

POWER BI OVERVIEW & FEATURES JANUARY 2017, SINGAPORE. Khilitchandra Prajapati

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2

Sanoma Big Data Migration Sander Kieft

Setup HSP Ambari Cluster

Berkeley Data Analytics Stack (BDAS) Overview

Hadoop Integration Deep Dive

Free On-Line Microsoft PDF

Azure IoT Suite. Secure device connectivity and management. Data ingestion and command + control. Rich dashboards and visualizations

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services

SAS & HADOOP ANALYTICS ON BIG DATA

RICHARD BEESON. OSIsoft

WHITE PAPER. Top 10 Reasons Why OEMs Choose MicroStrategy for Analytics

COURSE OUTLINE: Course 20533C- Implementing Microsoft Azure Infrastructure Solutions

Modernize Application Development to Succeed as a Digital Business

Bringing Big Data to Life: Overcoming The Challenges of Legacy Data in Hadoop

Transcription:

Insights to HDInsight

Why Hadoop in the Cloud?

No hardware costs

Unlimited Scale

Pay for What You Need

Deployed in minutes

Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive for all users Hybrid

Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive for all users Hybrid

Highly Available Designed for the cloud ground up HDInsight provides primary and secondary headnodes allowing for better reliability Have invested in making entire stack including Resource Manager, HiverServer2 HA ready HDInsight stack includes Zookeeper nodes at no extra charge to customer

Highest availability guarantee in the industry for peace of mind *Applies to HDInsight only 99.9% SLA Managed, monitored and supported by Microsoft Enterprise-leading SLA 99.9% uptime for both VM connectivity and Hadoop running in VMs No IT resources needed for upgrades and patching Microsoft monitors your deployment so you don t have to

Always encrypted, Role-based security & Auditing Always encrypted; in motion using SSL, and at rest using keys in Azure Key Vault Single sign-on, multi-factor authentication and integration of on-premises identities w/active Directory integration Fine-grained ACLs for rolebased access controls with Apache Ranger Auditing every access / configuration change with Apache Ranger

Alerting, monitoring, and pre-emptive actions Enhanced workload protection through integration with Microsoft Operations Management Suite (OMS) Threat detection, monitoring, and management

Petabyte size files and Trillions of objects TBs Store EBs Store data in it s native format PB sized files, 200x larger than anyone else Scalable throughput for massively parallel analytics No need to redesign application or reparation data at higher scale

Backed by Microsoft and Hortonworks Microsoft + Hortonworks has 37 committers for Hadoop Core; more than all managed cloud Hadoop vendors combined Uniquely ready to support your deployment Can fix and commit code back to Hadoop

Runs in the most datacenters worldwide Central US Iowa North Central US Illinois West Europe Netherlands China North* Beijing West US California South Central US Texas East US Virginia East US 2 Virginia North Europe Ireland India Central Pune China South* Shanghai Japan East Tokyo, Saitama Japan West Osaka East Asia Hong Kong SE Asia Singapore Azure doubling compute and storage every 6 months Brazil South Sao Paulo State Australia South East Victoria Australia East New South Wales

Lower total cost of ownership No hardware Hadoop support included with Azure support Pay only for what you use Independently scale storage and compute No need to hire specialized operations team 63% lower total cost of ownership than on-premises* *IDC study The Business Value and TCO Advantage of Apache Hadoop in the Cloud with Microsoft Azure HDInsight

Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive for all users Hybrid

Easy for administrators to spin up quickly Deploy big data projects in minutes No hardware to install, tune, configure or deploy No infrastructure or software to manage Scale to tens to thousands of machines instantly

Debug and Optimize your Big Data programs with ease Deep integration with IDEs for developer productivity: Visual Studio, Eclipse, & IntelliJ Integrated with Hive, Pig, Storm, and Spark Visually see execution of Hive jobs ran by the Tez execution engine Full Intellisense

Easy notebook experience for data engineers Most popular notebooks, Jupyter and Zeppelin out-ofthe-box Combine code, statistical equations and visualizations Worked w/ Jupyter community to enhance kernel to allow Spark execution through REST endpoint

Easy for data scientists with familiar R language R Server for HDInsight Largest portable R parallel analytics library Terabyte-scale machine learning 1,000x larger than in open source R Up to 100x faster performance using Spark and optimized vector/math libraries Enterprise-grade security and support *Applies to HDInsight only

Easy for business analysts with interactive reports over big data Interactive BI with big data Spark 2.0 integration Interactive Hive with LLAPkeeps data compressed running in-memory 25x faster ODBC driver to use Power BI or third party tools (Tableau, SAP, Qlik, etc.)

Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive for all users Hybrid

On-premises and cloud Uses Hortonworks Data Platform (HDP) Move projects from onpremises to cloud without code rewrite Hybrid scenarios supported like Dev/Test, burst, back up, disaster recovery

Recognized by top analysts Forrester Wave for Big Data Hadoop Cloud Named industry leader by Forrester with the most comprehensive, scalable, and integrated platforms* Recognized for its cloud-first strategy that is paying off* *The Forrester WaveTM: Big Data Hadoop Cloud Solutions, Q2 2016.

Hadoop is a platform with portfolio of projects

HDFS

MapReduce

Hive

HBase

Storm

Kafka Event producers Collection Stream ingestion Transformation & analytics Long-term storage Presentation and action Live Dashboards Applications Azure ML Web/thick client dashboards HDInsight Devices Sensors Cloud gateways (web APIs) Kafka for HDInsight Storm for HDInsight Spark for HDInsight Storage adapters HBase for HDInsight Data Lake Store DocumentDB Solr Azure Search MongoDB SQL Search and query Data analytics (Excel) Web and social Field gateways Stream processing Event hub Devices to take action

Mahout

Spark Massive data processing framework built on in-memory

Predictive analytics, machine learning, and statistical modeling for big data

Integration with leading productivity applications Spin up Hadoop and Spark clusters pre-integrated and pre-tuned with ISV applications out-of-the-box Runs on the HDInsight clusters; does not require separate VMs Fast and easy way to spin up applications