Apache Mesos. Delivering mixed batch & real-time data infrastructure , Galway, Ireland

Size: px
Start display at page:

Download "Apache Mesos. Delivering mixed batch & real-time data infrastructure , Galway, Ireland"

Transcription

1 Apache Mesos Delivering mixed batch & real-time data infrastructure , Galway, Ireland Naoise Dunne, Insight Centre for Data Analytics Michael Hausenblas, Mesosphere Inc.

2 Types of Workloads batch streaming PaaS MapReduce 2

3

4 Mesos Intro

5 Apache Mesos A top-level ASF project A cluster resource negotiator Scalable to 10,000s of nodes but also useful for a handful of nodes Fault-tolerant, battle-tested An SDK for distributed apps Native Docker support 5

6 What is a Data Center Scheduler? Schedulers run your Distributed Apps An operating system kernel for the cloud Schedulers coordinate execution of work on cluster

7 A Quick History of Schedulers

8 History of Datacenter Schedulers 2010 Nexus (Mesos) 2003 Slurm 2008 Hadoop released 2004 Google Borg Google filesystem 2014 Google Omega paper 2010 Spark Paper Hadoop started 2004 mapreduce paper Quick history of distributed schedulers Mesos Released 2011 Mesos Paper Hadoop Yarn Kubernetes

9 Hadoop - Original O/S Scheduler Linked datat m/r job Monolithic scheduler: Original open source datacenter scheduler jobs are batched and executed Designed only to run Mapreduce jobs No concurrency between apps Evolving into yarn Linked datat app Linked datat m/r job Hadoop hadoop- resource management mesos slave mesos slave Linux Server Linux Server

10 Mesos - a Great Leap Forward 2 level scheduler : More flexible Can Schedule many kinds of applications Frameworks (such as spark) are delegated the per application scheduling Mesos responsible for resource distribution between applications and enforcing overall fairness Very modular, due to 2 level scheduling. frameworks manage apps as they like Linked datat app Linked data job Hadoop M/R job framework marathon framework spark framework chronos Mesos - scheduler jobs Mesos Mesos - resource management mesos slave mesos slave Linux Server Linux Server

11 How Mesos Works

12 Mesos Architecture 12

13 Mesos Resources resource == anything a task/executor consumes in order to do their work standard resources: cpu, mem, disk, ports DRF

14 2015 Mesosphere, Inc. 1 4

15 2015 Mesosphere, Inc. 1 5

16 2015 Mesosphere, Inc. 1 6

17 2015 Mesosphere, Inc. 1 7

18 2015 Mesosphere, Inc. 1 8

19 2015 Mesosphere, Inc. 19

20 2015 Mesosphere, Inc. 20

21 2015 Mesosphere, Inc. 21

22 2015 Mesosphere, Inc. 22

23 2015 Mesosphere, Inc. 23

24 2015 Mesosphere, Inc. 24

25 2015 Mesosphere, Inc. 25

26 2015 Mesosphere, Inc. 26

27 2015 Mesosphere, Inc. 27

28 2015 Mesosphere, Inc. 28

29 2015 Mesosphere, Inc. 29

30 Benefits of using a Scheduler Efficiency - best use of computing resources Agility - change your application mix with no turnaround Scalability - grow to the current demand of your app Modularity - 2 level schedulers have plugin frameworks that allow quick repurposing of core and no reliance on one vendor (more later)

31 Mesos Ecosystem

32 Applications work with frameworks to get resources they need Mesos Ecosystem Mesos Resources Monitor cpu mem disk Managed by Mesos OS Monitor graphx Graph Jobs Spark Fwk Chronos Fwk Mesos - scheduler short jobs Datastores HDT, Neo4J Granatum Revealed Frameworks Negotiate with mesos to run their jobs Marathon Framework Mesos - scheduler long run jobs Mesos Mesos - resource management mesos client Docker Linux Server mesos client Docker Linux Server mesos client Docker Linux Server Docker manages isolation on Linux servers

33 Mesos Ecosystem We need HDFS for large storage on Spark Jobs Need Mesos DNS for service discovery Marathon can now use HDFS to store large Dependencies HDFS Chronos Fwk Spark Fwk Mesos - scheduler short jobs Mesos DNS Marathon fwk Mesos - scheduler long run jobs Docker Registry Mesos Zookeeper you will need docker reg for marathon Mesos - resource management mesos client Mesos & frameworks needs zookeeper Docker mesos client Docker mesos client Linux Server Linux Server Linux Server DCOS DCOS DCOS Docker Universe/Multi verse To run mesos you will need dcos or glue

34 Datacenter schedulers: Why? Schedulers help you focus on your own work and not the infrastructure. its great to be able to focus on what it is you want to be doing rather than worrying about how do you get what it is you need in order to be able to get stuff done - John Wilkes (Google)

35 Mesos Best Practices

36 Mesos Best Practices Discovery Orchestration Composition

37 Discovery

38 Orchestration

39 Orchestration

40 Orchestration

41 Composition Marathon: apps and groups Kubernetes: pods and services Reusability, affinity and loose coupling

42 Monitoring

43 Monitoring

44 Enter DCOS

45 Local OS vs. Distributed OS 45

46 DCOS, A Distributed Operating System kernel (Apache Mesos, written in C++) scales to 10,000 of nodes fault-tolerant in all components, rolling upgrades throughout containers first class citizens (LXC, Docker) local OS per node (+container enabled) scheduling (long-lived, batch) service discovery, monitoring, logging, debugging 46

47 DCOS High Level Overview Any Service or Container Your favorite services, container formats, and those yet to come Mesosphere DCOS Runs distributed apps anywhere as simply as running apps on your laptop Any Infrastructure Build apps once on DCOS, and run it anywhere 47

48 DCOS Benefits Run stateless services such as Web servers, app servers (via Marathon) and stateful services like Spark, Kafka, HDFS, Cassandra, ArangoDB etc. together on one cluster Dynamic partitioning of your cluster, depending on your needs (business requirements) Increased utilization (10% 80% and more) 48

49 DCOS Architecture 49

50 It s demo time

51

52 See Also

53 See Also 53

54 Q&A

DOWNTIME IS NOT AN OPTION

DOWNTIME IS NOT AN OPTION DOWNTIME IS NOT AN OPTION HOW APACHE MESOS AND DC/OS KEEPS APPS RUNNING DESPITE FAILURES AND UPDATES 2017 Mesosphere, Inc. All Rights Reserved. 1 WAIT, WHO ARE YOU? Engineer at Mesosphere DC/OS Contributor

More information

Flink meet DC/OS. Deploying Apache Flink at Scale. Elizabeth K. Ravi FlinkForward San Francisco

Flink meet DC/OS. Deploying Apache Flink at Scale. Elizabeth K. Ravi FlinkForward San Francisco FlinkForward 2017 - San Francisco Flink meet DC/OS Deploying Apache Flink at Scale Elizabeth K. Joseph, @pleia2 Ravi Yadav, @RaaveYadav 1 Talk Outline Part 1 Part 2 Introduction to Apache Mesos, Marathon,

More information

Stateful Services on DC/OS. Santa Clara, California April 23th 25th, 2018

Stateful Services on DC/OS. Santa Clara, California April 23th 25th, 2018 Stateful Services on DC/OS Santa Clara, California April 23th 25th, 2018 Who Am I? Shafique Hassan Solutions Architect @ Mesosphere Operator 2 Agenda DC/OS Introduction and Recap Why Stateful Services

More information

Data Center Operating System (DCOS) IBM Platform Solutions

Data Center Operating System (DCOS) IBM Platform Solutions April 2015 Data Center Operating System (DCOS) IBM Platform Solutions Agenda Market Context DCOS Definitions IBM Platform Overview DCOS Adoption in IBM Spark on EGO EGO-Mesos Integration 2 Market Context

More information

Deploying Microservices and Containers with Azure Container Service and DC/OS

Deploying Microservices and Containers with Azure Container Service and DC/OS Deploying Microservices and Containers with Azure Container Service and DC/OS Intro The explosion of mobile devices, data, and sensors everywhere has enabled the potential for realtime apps for just about

More information

A SINGLE PLATFORM FOR CONTAINER ORCHESTRATION AND DATA SERVICES

A SINGLE PLATFORM FOR CONTAINER ORCHESTRATION AND DATA SERVICES A SINGLE PLATFORM FOR CONTAINER ORCHESTRATION AND DATA SERVICES MESOSPHERE DC/OS WITH KUBERNETES EASES ENTERPRISE ADOPTION OF NEW TECHNOLOGIES FOR DIGITAL TRANSFORMATION EXECUTIVE SUMMARY Digital disruption

More information

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Roger Ding Cloudera February 3rd, 2018 1 Agenda Hadoop History Introduction to Apache Hadoop

More information

Mastering the Microservices, Fast Data & Hybrid Cloud Trifecta

Mastering the Microservices, Fast Data & Hybrid Cloud Trifecta Mastering the Microservices, Fast Data & Hybrid Cloud Trifecta Edward Hsu, VP Product 2018.10.23 2018 Mesosphere, Inc. All Rights Reserved. 2 Cloud On Who s Terms? 2018 Mesosphere, Inc. All Rights Reserved.

More information

GUIDE The Enterprise Buyer s Guide to Public Cloud Computing

GUIDE The Enterprise Buyer s Guide to Public Cloud Computing GUIDE The Enterprise Buyer s Guide to Public Cloud Computing cloudcheckr.com Enterprise Buyer s Guide 1 When assessing enterprise compute options on Amazon and Azure, it pays dividends to research the

More information

Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform

Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform Analytics R&D and Product Management Document Version 1 WP-Urika-GX-Big-Data-Analytics-0217 www.cray.com

More information

Combine Microservices Framework for Flexible, Scalable, High Availability Big Data Analytics

Combine Microservices Framework for Flexible, Scalable, High Availability Big Data Analytics Combine Microservices Framework for Flexible, Scalable, High Availability Big Data Analytics Dan Widdis, Principal Operations Research Analyst May 10, 2016 Approved for public release; distribution is

More information

MapR: Solution for Customer Production Success

MapR: Solution for Customer Production Success 2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice

More information

Intro to Big Data and Hadoop

Intro to Big Data and Hadoop Intro to Big and Hadoop Portions copyright 2001 SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Reproduced with permission of SAS Institute Inc., Cary, NC, USA. SAS Institute Inc. makes no warranties

More information

BIG DATA AND HADOOP DEVELOPER

BIG DATA AND HADOOP DEVELOPER BIG DATA AND HADOOP DEVELOPER Approximate Duration - 60 Hrs Classes + 30 hrs Lab work + 20 hrs Assessment = 110 Hrs + 50 hrs Project Total duration of course = 160 hrs Lesson 00 - Course Introduction 0.1

More information

5th Annual. Cloudera, Inc. All rights reserved.

5th Annual. Cloudera, Inc. All rights reserved. 5th Annual 1 The Essentials of Apache Hadoop The What, Why and How to Meet Agency Objectives Sarah Sproehnle, Vice President, Customer Success 2 Introduction 3 What is Apache Hadoop? Hadoop is a software

More information

Applicazioni Cloud native

Applicazioni Cloud native Applicazioni Cloud native Marco Dragoni IBM Cloud - Italy Roberto Pozzi IBM Cloud - Italy 2017 IBM Corporation 1 IBM Bluemix is our Integrated Cloud Platform Industry IoT Block Chain Health Financial Services

More information

Resource Scheduling Architectural Evolution at Scale and Distributed Scheduler Load Simulator

Resource Scheduling Architectural Evolution at Scale and Distributed Scheduler Load Simulator Resource Scheduling Architectural Evolution at Scale and Distributed Scheduler Load Simulator Renyu Yang Supported by Collaborated 863 and 973 Program Resource Scheduling Problems 2 Challenges at Scale

More information

Understanding The Value of Containers in a World of DevOps. Advice that empowers. Technology that enables.

Understanding The Value of Containers in a World of DevOps. Advice that empowers. Technology that enables. Understanding The Value of Containers in a World of DevOps Advice that empowers. Technology that enables. Bradley Brodkin - Some Background Founder & CEO of HighVail Systems, Toronto CANADA 31+ year industry

More information

Cloudera, Inc. All rights reserved.

Cloudera, Inc. All rights reserved. 1 Data Analytics 2018 CDSW Teamplay und Governance in der Data Science Entwicklung Thomas Friebel Partner Sales Engineer tfriebel@cloudera.com 2 We believe data can make what is impossible today, possible

More information

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW TOPICS COVERED 1 2 Fundamentals of Big Data Platforms Major Big Data Tools Scaling Up vs. Out SCALE UP (SMP) SCALE OUT (MPP) + (n) Upgrade

More information

Virtualizing Big Data/Hadoop Workloads. Update for vsphere 6. Justin Murray VMware VMware Inc. All rights reserved.

Virtualizing Big Data/Hadoop Workloads. Update for vsphere 6. Justin Murray VMware VMware Inc. All rights reserved. Virtualizing Big Data/Hadoop Workloads Update for vsphere 6 Justin Murray VMware 2014 VMware Inc. All rights reserved. Agenda The Hadoop Customer Journey Why Virtualize Hadoop? vsphere Big Data Extensions

More information

INTRODUCTION AUX APPLICATIONS CLOUD NATIVE AVEC PIVOTAL READY SYSTEM

INTRODUCTION AUX APPLICATIONS CLOUD NATIVE AVEC PIVOTAL READY SYSTEM INTRODUCTION AUX APPLICATIONS CLOUD NATIVE AVEC PIVOTAL READY SYSTEM EMMANUEL BERNARD PRINCIPAL SYSTEM ENGINEER, CLOUD PLATFORM SPECIALIST DELL EMC @_ebernard GLOBAL SPONSORS Every Business is Becoming

More information

Kubernetes User Experiences

Kubernetes User Experiences Kubernetes User Experiences * 1. What is the status of container usage at your enterprise or organization? Not using containers. May or may not have plans to use them Using containers but not in production

More information

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA.

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA. ABOUT THIS TRAINING: The world of Hadoop and Big Data" can be intimidating - hundreds of different technologies with cryptic names form the Hadoop ecosystem. This comprehensive training has been designed

More information

Enterprise Development Trends Cloud, Container and Microservices Insights from 2,100 JVM Developers

Enterprise Development Trends Cloud, Container and Microservices Insights from 2,100 JVM Developers Enterprise Development Trends 2016 Cloud, Container and Microservices Insights from 2,100 JVM Developers 1 About This Report Lightbend surveyed 2,151 global Java Virtual Machine (JVM) developers to discover:

More information

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB Data Analytics and CERN IT Hadoop Service CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB 1 Data Analytics at Scale The Challenge When you cannot fit your workload in a desktop Data

More information

Big Data Hadoop Administrator.

Big Data Hadoop Administrator. Big Data Hadoop Administrator www.austech.edu.au WHAT IS BIG DATA HADOOP ADMINISTRATOR?? Hadoop is a distributed framework that makes it easier to process large data sets that reside in clusters of computers.

More information

MQ on Cloud (AWS) Suganya Rane Digital Automation, Integration & Cloud Solutions. MQ Technical Conference v

MQ on Cloud (AWS) Suganya Rane Digital Automation, Integration & Cloud Solutions. MQ Technical Conference v MQ on Cloud (AWS) Suganya Rane Digital Automation, Integration & Cloud Solutions Agenda CLOUD Providers Types of CLOUD Environments Cloud Deployments MQ on CLOUD MQ on AWS MQ Monitoring on Cloud What is

More information

How In-Memory Computing can Maximize the Performance of Modern Payments

How In-Memory Computing can Maximize the Performance of Modern Payments How In-Memory Computing can Maximize the Performance of Modern Payments 2018 The mobile payments market is expected to grow to over a trillion dollars by 2019 How can in-memory computing maximize the performance

More information

IBM Research Report. Megos: Enterprise Resource Management in Mesos Clusters

IBM Research Report. Megos: Enterprise Resource Management in Mesos Clusters H-0324 (HAI1606-001) 1 June 2016 Computer Sciences IBM Research Report Megos: Enterprise Resource Management in Mesos Clusters Abed Abu-Dbai Khalid Ahmed David Breitgand IBM Platform Computing, Toronto,

More information

Containers and

Containers and Containers and Docker @NetApp 09 Mars 2017 Christophe Danjou & Thibaud Lenik C 1. What are Containers? 2. Why Containers and Docker? Agenda 3. Using Docker with NetApp 2 2016 NetApp, Inc. All rights reserved.

More information

Hadoop Integration Deep Dive

Hadoop Integration Deep Dive Hadoop Integration Deep Dive Piyush Chaudhary Spectrum Scale BD&A Architect 1 Agenda Analytics Market overview Spectrum Scale Analytics strategy Spectrum Scale Hadoop Integration A tale of two connectors

More information

The Case for Designing Data-Intensive Cloud-Based Healthcare Applications

The Case for Designing Data-Intensive Cloud-Based Healthcare Applications The Case for Designing Data-Intensive Cloud-Based Healthcare Applications Position Paper Srini Bhagavan 1,2, Khulud Alsultan 2, and Praveen Rao 2 1 IBM, Leawood, KS 66219 srinib@us.ibm.com, 2 Univ. of

More information

Containers in Linux on z Systems: Docker. Utz Bacher STSM Linux and Containers on z Systems

Containers in Linux on z Systems: Docker. Utz Bacher STSM Linux and Containers on z Systems Containers in Linux on z Systems: Docker Utz Bacher STSM Linux and Containers on z Systems A Message Brought To You By Our Lawyers Trademarks of International Business Machines

More information

Sr. Sergio Rodríguez de Guzmán CTO PUE

Sr. Sergio Rodríguez de Guzmán CTO PUE PRODUCT LATEST NEWS Sr. Sergio Rodríguez de Guzmán CTO PUE www.pue.es Hadoop & Why Cloudera Sergio Rodríguez Systems Engineer sergio@pue.es 3 Industry-Leading Consulting and Training PUE is the first Spanish

More information

FROM SHORE TO SHIP: USING MESOSPHERE ENTERPRISE DC/OS TO DELIVER REAL TIME MICROSERVICES TO A GLOBAL FLEET OF SHIPS

FROM SHORE TO SHIP: USING MESOSPHERE ENTERPRISE DC/OS TO DELIVER REAL TIME MICROSERVICES TO A GLOBAL FLEET OF SHIPS FROM SHORE TO SHIP: USING MESOSPHERE ENTERPRISE DC/OS TO DELIVER REAL TIME MICROSERVICES TO A GLOBAL FLEET OF SHIPS & WELCOME TO DIGITAL TRANSFORMATION Today we will be taking you through the moments that

More information

Analytics for the NFV World with PNDA.io

Analytics for the NFV World with PNDA.io for the NFV World with.io Speaker Donald Hunter Principal Engineer in the Chief Technology and Architecture Office at Cisco. Lead the MEF OpenLSO project which uses.io as a reference implementation for

More information

Central Role of Messaging Middleware in Cloud and Digital Transformation Initiatives

Central Role of Messaging Middleware in Cloud and Digital Transformation Initiatives White Paper Central Role of Messaging Middleware in Cloud and Digital Transformation Initiatives Sponsored by: IBM Maureen Fleming April 2018 EXECUTIVE SUMMARY Highly decentralized computing is the new

More information

Big Data Application Engineer/ Developer. Specialization in Apache Spark, Kafka, Airflow, HBase

Big Data Application Engineer/ Developer. Specialization in Apache Spark, Kafka, Airflow, HBase BIG DATA COURSE Big Data Application Engineer/ Developer Specialization in Apache Spark, Kafka, Airflow, HBase In Exclusive Association with 21,347+ Participants 10,000+ Brands 1200+ Trainings 45+ Countries

More information

Hadoop in the Cloud. Ryan Lippert, Cloudera Product Cloudera, Inc. All rights reserved.

Hadoop in the Cloud. Ryan Lippert, Cloudera Product Cloudera, Inc. All rights reserved. Hadoop in the Cloud Ryan Lippert, Cloudera Product Marketing @lippertryan 1 2 Cloudera Confidential 3 Drive Customer Insights Improve Product & Services Efficiency Lower Business Risk 4 The world s largest

More information

Beyond Virtualization. Derek Collison - Apcera, June 12, QCon New York

Beyond Virtualization. Derek Collison - Apcera, June 12, QCon New York Beyond Virtualization Derek Collison - Apcera, Inc.!!! June 12, 2014 - QCon New York About!! Derek Collison Architected and built TIBCO Rendezvous and EMS Messaging Systems! Co-founded AJAX APIs group

More information

OPENSHIFT CONTAINER PLATFORM

OPENSHIFT CONTAINER PLATFORM OPENSHIFT CONTAINER PLATFORM FUNDAMENTAL OVERVIEW Mike Surbey Emerging Technology Specialist http://msurbey.com AGENDA 2 1. INTRODUCTION Today s Business Challenge 2. KEY CONCEPTS s, DevOps, etc. 3. HOLISTIC

More information

Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications

Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications Copyright 2015 Cask Data, Inc. All Rights Reserved. February

More information

On Cloud Computational Models and the Heterogeneity Challenge

On Cloud Computational Models and the Heterogeneity Challenge On Cloud Computational Models and the Heterogeneity Challenge Raouf Boutaba D. Cheriton School of Computer Science University of Waterloo WCU IT Convergence Engineering Division POSTECH FOME, December

More information

Hadoop and Analytics at CERN IT CERN IT-DB

Hadoop and Analytics at CERN IT CERN IT-DB Hadoop and Analytics at CERN IT CERN IT-DB 1 Hadoop Use cases Parallel processing of large amounts of data Perform analytics on a large scale Dealing with complex data: structured, semi-structured, unstructured

More information

Multi-Containers Orchestration with Live Migration and High-Availability for Microservices

Multi-Containers Orchestration with Live Migration and High-Availability for Microservices Multi-Containers Orchestration with Live Migration and High-Availability for Microservices Meet Our Presenters Jay Lyman Research Manager, Cloud Platforms, 451 Research Ruslan Synytsky CEO and Co-founder,

More information

Architecture Optimization for the new Data Warehouse. Cloudera, Inc. All rights reserved.

Architecture Optimization for the new Data Warehouse. Cloudera, Inc. All rights reserved. Architecture Optimization for the new Data Warehouse Guido Oswald - @GuidoOswald 1 Use Cases This image cannot currently be displayed. This image cannot currently be displayed. This image cannot currently

More information

Dynamic App Services in Containers PRESENTED BY:

Dynamic App Services in Containers PRESENTED BY: Dynamic App Services in Containers PRESENTED BY: Apps and container market overview Container description and benefits Container platforms and orchestration tools Container integrations: F5 Container Connector

More information

Just Enough Operating System to kick start creativity. Simona Arsene

Just Enough Operating System to kick start creativity. Simona Arsene Just Enough Operating System to kick start creativity Simona Arsene SUSE Linux Enterprise Server JeOS speeds up virtual image deployment Just enough Operating System No need to re-certify Same SUSE Linux

More information

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. Course Content Course Description: The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. At Course Completion: After competing this course,

More information

The Sysprog s Guide to the Customer Facing Mainframe: Cloud / Mobile / Social / Big Data

The Sysprog s Guide to the Customer Facing Mainframe: Cloud / Mobile / Social / Big Data Glenn Anderson, IBM Lab Services and Training The Sysprog s Guide to the Customer Facing Mainframe: Cloud / Mobile / Social / Big Data Summer SHARE August 2015 Session 17794 2 (c) Copyright 2015 IBM Corporation

More information

Realising Value from Data

Realising Value from Data Realising Value from Data Togetherwith Open Source Drives Innovation & Adoption in Big Data BCS Open Source SIG London 1 May 2013 Timings 6:00-6:30pm. Register / Refreshments 6:30-8:00pm, Presentation

More information

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11 Top 5 Challenges for Hadoop MapReduce in the Enterprise Whitepaper - May 2011 http://platform.com/mapreduce 2 5/9/11 Table of Contents Introduction... 2 Current Market Conditions and Drivers. Customer

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Duration: 5 days; Instructor-led Implement Spark Streaming Using the DStream API. Develop Big Data Real-Time Processing Solutions with Apache

More information

Hortonworks Connected Data Platforms

Hortonworks Connected Data Platforms Hortonworks Connected Data Platforms MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA BUSINESS EMBRACE AN OPEN APPROACH 2 Hortonworks Inc. 2011 2016. All Rights Reserved Data Drives the Connected Car

More information

Using Mesos Schedulers with Amazon EC2 Container Service

Using Mesos Schedulers with Amazon EC2 Container Service Using Mesos Schedulers with Amazon EC2 Container Service Ryosuke Iwanaga Solutions Architect, Amazon Web Services Japan July 2016, LinuxCon+ContainerCon Japan 2016, Amazon Web Services, Inc. or its Affiliates.

More information

From Data Deluge to Intelligent Data

From Data Deluge to Intelligent Data SAP Data Hub From Data Deluge to Intelligent Data Orchestrate Your Data for an Intelligent Enterprise Data for Intelligence, Speed, and With Today, corporate data landscapes are growing increasingly diverse

More information

20775 Performing Data Engineering on Microsoft HD Insight

20775 Performing Data Engineering on Microsoft HD Insight Duración del curso: 5 Días Acerca de este curso The main purpose of the course is to give students the ability plan and implement big data workflows on HD. Perfil de público The primary audience for this

More information

Red Hat Container Technology Strategy

Red Hat Container Technology Strategy Red Hat Container Technology Strategy Containers are so 2014 Clayton Coleman Daniel Riek April 2017 What we told you earlier: The future of the Linux OS is a scale-out cluster-as-computer platform for

More information

THE AGAVE PLATFORM SCIENCE AS A SERVICE FOR THE OPEN SCIENCE COMMUNITY

THE AGAVE PLATFORM SCIENCE AS A SERVICE FOR THE OPEN SCIENCE COMMUNITY THE AGAVE PLATFORM SCIENCE AS A SERVICE FOR THE OPEN SCIENCE COMMUNITY Rion Dooley @deardooley deardooley@gmail.com 12/15/20 1 THE EVOLUTION OF A CYBERINFRASTRUCTURE HPC systems have grown up since then

More information

Migrating to Cloud - Native Architectures Using Microservices: An Experience Report

Migrating to Cloud - Native Architectures Using Microservices: An Experience Report Migrating to Cloud - Native Architectures Using Microservices: An Experience Report Armin Balalaie, Abbas Heydarnoori, and Pooyan Jamshidi Sharif University of Technology, Tehran, Iran - 2015 Sonam Gupta

More information

Cask Data Application Platform (CDAP)

Cask Data Application Platform (CDAP) Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop

More information

20775: Performing Data Engineering on Microsoft HD Insight

20775: Performing Data Engineering on Microsoft HD Insight Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

Understanding Cloud. #IBMDurbanHackathon. Presented by: Britni Lonesome IBM Cloud Advisor

Understanding Cloud. #IBMDurbanHackathon. Presented by: Britni Lonesome IBM Cloud Advisor Understanding Cloud #IBMDurbanHackathon Presented by: Britni Lonesome IBM Cloud Advisor What is this thing called cloud? Cloud computing is a new consumption and delivery model inspired by consumer internet

More information

Lightbend Fast Data Platform. A Technical Overview For Decision Makers

Lightbend Fast Data Platform. A Technical Overview For Decision Makers Lightbend Fast Data Platform A Technical Overview For Decision Makers Mobile and IoT use cases are driving enterprises to modernize how they process large volumes of data. Lightbend provides the fundamental

More information

Container Native Application Development

Container Native Application Development Container Native Application Development Wolfgang Weigend Disclaimer The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated

More information

DRIVING DIGITAL TRANSFORMATION WITH CONTAINERS AND KUBERNETES. How Kubernetes Manages Containerized Applications to Deliver Business Value

DRIVING DIGITAL TRANSFORMATION WITH CONTAINERS AND KUBERNETES. How Kubernetes Manages Containerized Applications to Deliver Business Value WHITE PAPER AUGUST 2017 DRIVING DIGITAL TRANSFORMATION WITH How Kubernetes Manages Containerized Applications to Deliver Business Value Table of Contents Introduction...3 The Digital Transformation and

More information

SAP Machine Learning for Hadoop. Customer

SAP Machine Learning for Hadoop. Customer SAP Machine Learning for Hadoop Customer SAP BusinessObjects Predictive Analytics and Big Data 1. Support for end-to-end operational predictive lifecycle on Hadoop 2. Business Analyst Friendly No coding

More information

Spark, Hadoop, and Friends

Spark, Hadoop, and Friends Spark, Hadoop, and Friends (and the Zeppelin Notebook) Douglas Eadline Jan 4, 2017 NJIT Presenter Douglas Eadline deadline@basement-supercomputing.com @thedeadline HPC/Hadoop Consultant/Writer http://www.basement-supercomputing.com

More information

Business is being transformed by three trends

Business is being transformed by three trends Business is being transformed by three trends Big Cloud Intelligence Stay ahead of the curve with Cortana Intelligence Suite Business apps People Custom apps Apps Sensors and devices Cortana Intelligence

More information

MICROSOFT AZURE CLOUD CAPABILITIES, COSTS, AND UPDATES

MICROSOFT AZURE CLOUD CAPABILITIES, COSTS, AND UPDATES E-Guide MICROSOFT AZURE CLOUD CAPABILITIES, COSTS, AND UPDATES SearchCloud Computing A s offerings continue to evolve, it becomes imperative to continually assess how various vendors stack up. In this

More information

Omega: flexible, scalable schedulers for large compute clusters. Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, John Wilkes

Omega: flexible, scalable schedulers for large compute clusters. Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, John Wilkes Omega: flexible, scalable schedulers for large compute clusters Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, John Wilkes Cluster Scheduling Shared hardware resources in a cluster Run a mix

More information

MapR Pentaho Business Solutions

MapR Pentaho Business Solutions MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business

More information

New Big Data Solutions and Opportunities for DB Workloads

New Big Data Solutions and Opportunities for DB Workloads New Big Data Solutions and Opportunities for DB Workloads Hadoop and Spark Ecosystem for Data Analytics, Experience and Outlook Luca Canali, IT-DB Hadoop and Spark Service WLCG, GDB meeting CERN, September

More information

Hawk: Hybrid Datacenter Scheduling

Hawk: Hybrid Datacenter Scheduling Hawk: Hybrid Datacenter Scheduling Pamela Delgado, Florin Dinu, Anne-Marie Kermarrec, Willy Zwaenepoel July 10th, 2015 USENIX ATC 2015 1 Introduction: datacenter scheduling Job 1 task task scheduler cluster

More information

Insights to HDInsight

Insights to HDInsight Insights to HDInsight Why Hadoop in the Cloud? No hardware costs Unlimited Scale Pay for What You Need Deployed in minutes Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Course Details Course Code: Duration: Notes: 20775A 5 days This course syllabus should be used to determine whether the course is appropriate

More information

Apache Hadoop in the Datacenter and Cloud

Apache Hadoop in the Datacenter and Cloud Apache Hadoop in the Datacenter and Cloud The Shift to the Connected Data Architecture Digital Transformation fueled by Big Data Analytics and IoT ACTIONABLE INTELLIGENCE Cloud and Data Center IDMS Relational

More information

How Container Schedulers and Software-Defined Storage will Change the Cloud

How Container Schedulers and Software-Defined Storage will Change the Cloud How Container Schedulers and Software-Defined Storage will Change the Cloud David vonthenen {code} by Dell EMC @dvonthenen http://dvonthenen.com github.com/dvonthenen Agenda Review of Software-Defined

More information

Data Analytics Use Cases, Platforms, Services. ITMM, March 5 th, 2018 Luca Canali, IT-DB

Data Analytics Use Cases, Platforms, Services. ITMM, March 5 th, 2018 Luca Canali, IT-DB Data Analytics Use Cases, Platforms, Services ITMM, March 5 th, 2018 Luca Canali, IT-DB 1 Analytics and Big Data Pipelines Use Cases Many use cases at CERN for analytics Data analysis, dashboards, plots,

More information

Special thanks to Chad Diaz II, Jason Montgomery & Micah Torres

Special thanks to Chad Diaz II, Jason Montgomery & Micah Torres Special thanks to Chad Diaz II, Jason Montgomery & Micah Torres Outline: What cloud computing is The history of cloud computing Cloud Services (Iaas, Paas, Saas) Cloud Computing Service Providers Technical

More information

Spark and Hadoop Perfect Together

Spark and Hadoop Perfect Together Spark and Hadoop Perfect Together Arun Murthy Hortonworks Co-Founder @acmurthy Data Operating System Enable all data and applications TO BE accessible and shared BY any end-users Data Operating System

More information

Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN

Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN Jagadish Venkatraman

More information

Simplify Private Cloud Deployments PRESENTED BY:

Simplify Private Cloud Deployments PRESENTED BY: Simplify Private Cloud Deployments PRESENTED BY: What CIOs are ultimately looking for is the ability to than their competitors, while adhering to regulatory requirements, and. RedMonk Analyst Strong Security

More information

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Pentaho 8.0 and Beyond Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our

More information

IBM Message Hub. James Bennett Offering Manager, IBM Cloud Integration IBM Corporation

IBM Message Hub. James Bennett Offering Manager, IBM Cloud Integration IBM Corporation IBM Message Hub James Bennett Offering Manager, IBM Cloud Integration 2016 IBM Corporation The continued growth of PaaS 2 Use cases 1 Hub for asynchronously connecting services inside Bluemix or beyond

More information

<Insert Picture Here> Oracle Exalogic Elastic Cloud: Revolutionizing the Datacenter

<Insert Picture Here> Oracle Exalogic Elastic Cloud: Revolutionizing the Datacenter Oracle Exalogic Elastic Cloud: Revolutionizing the Datacenter Mike Piech Senior Director, Product Marketing The following is intended to outline our general product direction. It

More information

Oracle Big Data Cloud Service

Oracle Big Data Cloud Service Oracle Big Data Cloud Service Delivering Hadoop, Spark and Data Science with Oracle Security and Cloud Simplicity Oracle Big Data Cloud Service is an automated service that provides a highpowered environment

More information

Building Your Big Data Team

Building Your Big Data Team Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.

More information

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information

St Louis CMG Boris Zibitsker, PhD

St Louis CMG Boris Zibitsker, PhD ENTERPRISE PERFORMANCE ASSURANCE BASED ON BIG DATA ANALYTICS St Louis CMG Boris Zibitsker, PhD www.beznext.com bzibitsker@beznext.com Abstract Today s fast-paced businesses have to make business decisions

More information

DEVOPS AUTOMATION USING DOCKER, KUBERNETES AND OPENSHIFT. Siamak Sadeghianfar Sr Technical Marketing Manager, OpenShift June 2016

DEVOPS AUTOMATION USING DOCKER, KUBERNETES AND OPENSHIFT. Siamak Sadeghianfar Sr Technical Marketing Manager, OpenShift June 2016 DEVOPS AUTOMATION USING DOCKER, KUBERNETES AND Siamak Sadeghianfar Sr Technical Marketing Manager, OpenShift June 2016 DEFINE DEVOPS Everything as code Application monitoring Automate everything Rapid

More information

Towards The Real-Time Enterprise

Towards The Real-Time Enterprise EXECUTIVE BRIEFING PAPER Towards The Real-Time Enterprise How A Cloud-Native Application Architecture Provides An Essential Foundation For Digital Transformation January 2019 Table Of Contents About This

More information

Big Data in Cloud. 堵俊平 Apache Hadoop Committer Staff Engineer, VMware

Big Data in Cloud. 堵俊平 Apache Hadoop Committer Staff Engineer, VMware Big Data in Cloud 堵俊平 Apache Hadoop Committer Staff Engineer, VMware Bio 堵俊平 (Junping Du) - Join VMware in 2008 for cloud product first - Initiate earliest effort on big data within VMware since 2010 -

More information

Red Hat Open Shift Container Platform

Red Hat Open Shift Container Platform Red Hat Open Shift Container Platform Daniel.Froehlich@RedHat.com IT Must Evolve to Stay Ahead of Demands Containers package applications with dependencies and isolate the runtime Easy to deploy and portable

More information

Visual Studio Everywhere. Build Great Cloud Apps

Visual Studio Everywhere. Build Great Cloud Apps Visual Studio Everywhere Build Great Cloud Apps Agenda Why use the cloud to build apps? An overview of Microsoft Azure Virtual machines for lift-shift scenarios Microservices and Azure Service Fabric Data

More information

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect 2005 Concert de Coldplay 2014 Concert de Coldplay 90% of the world s data has been created over the last two years alone 1 1. Source

More information

Pentaho 8.0 Overview. Pedro Alves

Pentaho 8.0 Overview. Pedro Alves Pentaho 8.0 Overview Pedro Alves Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our current intended product direction. It is provided for information

More information

UForge AppCenter 3.8. Introduction March Copyright 2018 FUJITSU LIMITED

UForge AppCenter 3.8. Introduction March Copyright 2018 FUJITSU LIMITED UForge AppCenter 3.8 Introduction March 2018 Copyright 2018 FUJITSU LIMITED Enterprise Cloud Application Journey 3 stages in transitioning legacy enterprise applications to cloud: Cloud-hosted applications:

More information