Apache Mesos. Delivering mixed batch & real-time data infrastructure , Galway, Ireland
|
|
- Bethany Horton
- 6 years ago
- Views:
Transcription
1 Apache Mesos Delivering mixed batch & real-time data infrastructure , Galway, Ireland Naoise Dunne, Insight Centre for Data Analytics Michael Hausenblas, Mesosphere Inc.
2 Types of Workloads batch streaming PaaS MapReduce 2
3
4 Mesos Intro
5 Apache Mesos A top-level ASF project A cluster resource negotiator Scalable to 10,000s of nodes but also useful for a handful of nodes Fault-tolerant, battle-tested An SDK for distributed apps Native Docker support 5
6 What is a Data Center Scheduler? Schedulers run your Distributed Apps An operating system kernel for the cloud Schedulers coordinate execution of work on cluster
7 A Quick History of Schedulers
8 History of Datacenter Schedulers 2010 Nexus (Mesos) 2003 Slurm 2008 Hadoop released 2004 Google Borg Google filesystem 2014 Google Omega paper 2010 Spark Paper Hadoop started 2004 mapreduce paper Quick history of distributed schedulers Mesos Released 2011 Mesos Paper Hadoop Yarn Kubernetes
9 Hadoop - Original O/S Scheduler Linked datat m/r job Monolithic scheduler: Original open source datacenter scheduler jobs are batched and executed Designed only to run Mapreduce jobs No concurrency between apps Evolving into yarn Linked datat app Linked datat m/r job Hadoop hadoop- resource management mesos slave mesos slave Linux Server Linux Server
10 Mesos - a Great Leap Forward 2 level scheduler : More flexible Can Schedule many kinds of applications Frameworks (such as spark) are delegated the per application scheduling Mesos responsible for resource distribution between applications and enforcing overall fairness Very modular, due to 2 level scheduling. frameworks manage apps as they like Linked datat app Linked data job Hadoop M/R job framework marathon framework spark framework chronos Mesos - scheduler jobs Mesos Mesos - resource management mesos slave mesos slave Linux Server Linux Server
11 How Mesos Works
12 Mesos Architecture 12
13 Mesos Resources resource == anything a task/executor consumes in order to do their work standard resources: cpu, mem, disk, ports DRF
14 2015 Mesosphere, Inc. 1 4
15 2015 Mesosphere, Inc. 1 5
16 2015 Mesosphere, Inc. 1 6
17 2015 Mesosphere, Inc. 1 7
18 2015 Mesosphere, Inc. 1 8
19 2015 Mesosphere, Inc. 19
20 2015 Mesosphere, Inc. 20
21 2015 Mesosphere, Inc. 21
22 2015 Mesosphere, Inc. 22
23 2015 Mesosphere, Inc. 23
24 2015 Mesosphere, Inc. 24
25 2015 Mesosphere, Inc. 25
26 2015 Mesosphere, Inc. 26
27 2015 Mesosphere, Inc. 27
28 2015 Mesosphere, Inc. 28
29 2015 Mesosphere, Inc. 29
30 Benefits of using a Scheduler Efficiency - best use of computing resources Agility - change your application mix with no turnaround Scalability - grow to the current demand of your app Modularity - 2 level schedulers have plugin frameworks that allow quick repurposing of core and no reliance on one vendor (more later)
31 Mesos Ecosystem
32 Applications work with frameworks to get resources they need Mesos Ecosystem Mesos Resources Monitor cpu mem disk Managed by Mesos OS Monitor graphx Graph Jobs Spark Fwk Chronos Fwk Mesos - scheduler short jobs Datastores HDT, Neo4J Granatum Revealed Frameworks Negotiate with mesos to run their jobs Marathon Framework Mesos - scheduler long run jobs Mesos Mesos - resource management mesos client Docker Linux Server mesos client Docker Linux Server mesos client Docker Linux Server Docker manages isolation on Linux servers
33 Mesos Ecosystem We need HDFS for large storage on Spark Jobs Need Mesos DNS for service discovery Marathon can now use HDFS to store large Dependencies HDFS Chronos Fwk Spark Fwk Mesos - scheduler short jobs Mesos DNS Marathon fwk Mesos - scheduler long run jobs Docker Registry Mesos Zookeeper you will need docker reg for marathon Mesos - resource management mesos client Mesos & frameworks needs zookeeper Docker mesos client Docker mesos client Linux Server Linux Server Linux Server DCOS DCOS DCOS Docker Universe/Multi verse To run mesos you will need dcos or glue
34 Datacenter schedulers: Why? Schedulers help you focus on your own work and not the infrastructure. its great to be able to focus on what it is you want to be doing rather than worrying about how do you get what it is you need in order to be able to get stuff done - John Wilkes (Google)
35 Mesos Best Practices
36 Mesos Best Practices Discovery Orchestration Composition
37 Discovery
38 Orchestration
39 Orchestration
40 Orchestration
41 Composition Marathon: apps and groups Kubernetes: pods and services Reusability, affinity and loose coupling
42 Monitoring
43 Monitoring
44 Enter DCOS
45 Local OS vs. Distributed OS 45
46 DCOS, A Distributed Operating System kernel (Apache Mesos, written in C++) scales to 10,000 of nodes fault-tolerant in all components, rolling upgrades throughout containers first class citizens (LXC, Docker) local OS per node (+container enabled) scheduling (long-lived, batch) service discovery, monitoring, logging, debugging 46
47 DCOS High Level Overview Any Service or Container Your favorite services, container formats, and those yet to come Mesosphere DCOS Runs distributed apps anywhere as simply as running apps on your laptop Any Infrastructure Build apps once on DCOS, and run it anywhere 47
48 DCOS Benefits Run stateless services such as Web servers, app servers (via Marathon) and stateful services like Spark, Kafka, HDFS, Cassandra, ArangoDB etc. together on one cluster Dynamic partitioning of your cluster, depending on your needs (business requirements) Increased utilization (10% 80% and more) 48
49 DCOS Architecture 49
50 It s demo time
51
52 See Also
53 See Also 53
54 Q&A
DOWNTIME IS NOT AN OPTION
DOWNTIME IS NOT AN OPTION HOW APACHE MESOS AND DC/OS KEEPS APPS RUNNING DESPITE FAILURES AND UPDATES 2017 Mesosphere, Inc. All Rights Reserved. 1 WAIT, WHO ARE YOU? Engineer at Mesosphere DC/OS Contributor
More informationFlink meet DC/OS. Deploying Apache Flink at Scale. Elizabeth K. Ravi FlinkForward San Francisco
FlinkForward 2017 - San Francisco Flink meet DC/OS Deploying Apache Flink at Scale Elizabeth K. Joseph, @pleia2 Ravi Yadav, @RaaveYadav 1 Talk Outline Part 1 Part 2 Introduction to Apache Mesos, Marathon,
More informationStateful Services on DC/OS. Santa Clara, California April 23th 25th, 2018
Stateful Services on DC/OS Santa Clara, California April 23th 25th, 2018 Who Am I? Shafique Hassan Solutions Architect @ Mesosphere Operator 2 Agenda DC/OS Introduction and Recap Why Stateful Services
More informationData Center Operating System (DCOS) IBM Platform Solutions
April 2015 Data Center Operating System (DCOS) IBM Platform Solutions Agenda Market Context DCOS Definitions IBM Platform Overview DCOS Adoption in IBM Spark on EGO EGO-Mesos Integration 2 Market Context
More informationDeploying Microservices and Containers with Azure Container Service and DC/OS
Deploying Microservices and Containers with Azure Container Service and DC/OS Intro The explosion of mobile devices, data, and sensors everywhere has enabled the potential for realtime apps for just about
More informationA SINGLE PLATFORM FOR CONTAINER ORCHESTRATION AND DATA SERVICES
A SINGLE PLATFORM FOR CONTAINER ORCHESTRATION AND DATA SERVICES MESOSPHERE DC/OS WITH KUBERNETES EASES ENTERPRISE ADOPTION OF NEW TECHNOLOGIES FOR DIGITAL TRANSFORMATION EXECUTIVE SUMMARY Digital disruption
More informationIntroduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation
Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Roger Ding Cloudera February 3rd, 2018 1 Agenda Hadoop History Introduction to Apache Hadoop
More informationMastering the Microservices, Fast Data & Hybrid Cloud Trifecta
Mastering the Microservices, Fast Data & Hybrid Cloud Trifecta Edward Hsu, VP Product 2018.10.23 2018 Mesosphere, Inc. All Rights Reserved. 2 Cloud On Who s Terms? 2018 Mesosphere, Inc. All Rights Reserved.
More informationGUIDE The Enterprise Buyer s Guide to Public Cloud Computing
GUIDE The Enterprise Buyer s Guide to Public Cloud Computing cloudcheckr.com Enterprise Buyer s Guide 1 When assessing enterprise compute options on Amazon and Azure, it pays dividends to research the
More informationAchieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform
Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform Analytics R&D and Product Management Document Version 1 WP-Urika-GX-Big-Data-Analytics-0217 www.cray.com
More informationCombine Microservices Framework for Flexible, Scalable, High Availability Big Data Analytics
Combine Microservices Framework for Flexible, Scalable, High Availability Big Data Analytics Dan Widdis, Principal Operations Research Analyst May 10, 2016 Approved for public release; distribution is
More informationMapR: Solution for Customer Production Success
2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice
More informationIntro to Big Data and Hadoop
Intro to Big and Hadoop Portions copyright 2001 SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Reproduced with permission of SAS Institute Inc., Cary, NC, USA. SAS Institute Inc. makes no warranties
More informationBIG DATA AND HADOOP DEVELOPER
BIG DATA AND HADOOP DEVELOPER Approximate Duration - 60 Hrs Classes + 30 hrs Lab work + 20 hrs Assessment = 110 Hrs + 50 hrs Project Total duration of course = 160 hrs Lesson 00 - Course Introduction 0.1
More information5th Annual. Cloudera, Inc. All rights reserved.
5th Annual 1 The Essentials of Apache Hadoop The What, Why and How to Meet Agency Objectives Sarah Sproehnle, Vice President, Customer Success 2 Introduction 3 What is Apache Hadoop? Hadoop is a software
More informationApplicazioni Cloud native
Applicazioni Cloud native Marco Dragoni IBM Cloud - Italy Roberto Pozzi IBM Cloud - Italy 2017 IBM Corporation 1 IBM Bluemix is our Integrated Cloud Platform Industry IoT Block Chain Health Financial Services
More informationResource Scheduling Architectural Evolution at Scale and Distributed Scheduler Load Simulator
Resource Scheduling Architectural Evolution at Scale and Distributed Scheduler Load Simulator Renyu Yang Supported by Collaborated 863 and 973 Program Resource Scheduling Problems 2 Challenges at Scale
More informationUnderstanding The Value of Containers in a World of DevOps. Advice that empowers. Technology that enables.
Understanding The Value of Containers in a World of DevOps Advice that empowers. Technology that enables. Bradley Brodkin - Some Background Founder & CEO of HighVail Systems, Toronto CANADA 31+ year industry
More informationCloudera, Inc. All rights reserved.
1 Data Analytics 2018 CDSW Teamplay und Governance in der Data Science Entwicklung Thomas Friebel Partner Sales Engineer tfriebel@cloudera.com 2 We believe data can make what is impossible today, possible
More informationBIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW
BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW TOPICS COVERED 1 2 Fundamentals of Big Data Platforms Major Big Data Tools Scaling Up vs. Out SCALE UP (SMP) SCALE OUT (MPP) + (n) Upgrade
More informationVirtualizing Big Data/Hadoop Workloads. Update for vsphere 6. Justin Murray VMware VMware Inc. All rights reserved.
Virtualizing Big Data/Hadoop Workloads Update for vsphere 6 Justin Murray VMware 2014 VMware Inc. All rights reserved. Agenda The Hadoop Customer Journey Why Virtualize Hadoop? vsphere Big Data Extensions
More informationINTRODUCTION AUX APPLICATIONS CLOUD NATIVE AVEC PIVOTAL READY SYSTEM
INTRODUCTION AUX APPLICATIONS CLOUD NATIVE AVEC PIVOTAL READY SYSTEM EMMANUEL BERNARD PRINCIPAL SYSTEM ENGINEER, CLOUD PLATFORM SPECIALIST DELL EMC @_ebernard GLOBAL SPONSORS Every Business is Becoming
More informationKubernetes User Experiences
Kubernetes User Experiences * 1. What is the status of container usage at your enterprise or organization? Not using containers. May or may not have plans to use them Using containers but not in production
More informationABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA.
ABOUT THIS TRAINING: The world of Hadoop and Big Data" can be intimidating - hundreds of different technologies with cryptic names form the Hadoop ecosystem. This comprehensive training has been designed
More informationEnterprise Development Trends Cloud, Container and Microservices Insights from 2,100 JVM Developers
Enterprise Development Trends 2016 Cloud, Container and Microservices Insights from 2,100 JVM Developers 1 About This Report Lightbend surveyed 2,151 global Java Virtual Machine (JVM) developers to discover:
More informationData Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB
Data Analytics and CERN IT Hadoop Service CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB 1 Data Analytics at Scale The Challenge When you cannot fit your workload in a desktop Data
More informationBig Data Hadoop Administrator.
Big Data Hadoop Administrator www.austech.edu.au WHAT IS BIG DATA HADOOP ADMINISTRATOR?? Hadoop is a distributed framework that makes it easier to process large data sets that reside in clusters of computers.
More informationMQ on Cloud (AWS) Suganya Rane Digital Automation, Integration & Cloud Solutions. MQ Technical Conference v
MQ on Cloud (AWS) Suganya Rane Digital Automation, Integration & Cloud Solutions Agenda CLOUD Providers Types of CLOUD Environments Cloud Deployments MQ on CLOUD MQ on AWS MQ Monitoring on Cloud What is
More informationHow In-Memory Computing can Maximize the Performance of Modern Payments
How In-Memory Computing can Maximize the Performance of Modern Payments 2018 The mobile payments market is expected to grow to over a trillion dollars by 2019 How can in-memory computing maximize the performance
More informationIBM Research Report. Megos: Enterprise Resource Management in Mesos Clusters
H-0324 (HAI1606-001) 1 June 2016 Computer Sciences IBM Research Report Megos: Enterprise Resource Management in Mesos Clusters Abed Abu-Dbai Khalid Ahmed David Breitgand IBM Platform Computing, Toronto,
More informationContainers and
Containers and Docker @NetApp 09 Mars 2017 Christophe Danjou & Thibaud Lenik C 1. What are Containers? 2. Why Containers and Docker? Agenda 3. Using Docker with NetApp 2 2016 NetApp, Inc. All rights reserved.
More informationHadoop Integration Deep Dive
Hadoop Integration Deep Dive Piyush Chaudhary Spectrum Scale BD&A Architect 1 Agenda Analytics Market overview Spectrum Scale Analytics strategy Spectrum Scale Hadoop Integration A tale of two connectors
More informationThe Case for Designing Data-Intensive Cloud-Based Healthcare Applications
The Case for Designing Data-Intensive Cloud-Based Healthcare Applications Position Paper Srini Bhagavan 1,2, Khulud Alsultan 2, and Praveen Rao 2 1 IBM, Leawood, KS 66219 srinib@us.ibm.com, 2 Univ. of
More informationContainers in Linux on z Systems: Docker. Utz Bacher STSM Linux and Containers on z Systems
Containers in Linux on z Systems: Docker Utz Bacher STSM Linux and Containers on z Systems A Message Brought To You By Our Lawyers Trademarks of International Business Machines
More informationSr. Sergio Rodríguez de Guzmán CTO PUE
PRODUCT LATEST NEWS Sr. Sergio Rodríguez de Guzmán CTO PUE www.pue.es Hadoop & Why Cloudera Sergio Rodríguez Systems Engineer sergio@pue.es 3 Industry-Leading Consulting and Training PUE is the first Spanish
More informationFROM SHORE TO SHIP: USING MESOSPHERE ENTERPRISE DC/OS TO DELIVER REAL TIME MICROSERVICES TO A GLOBAL FLEET OF SHIPS
FROM SHORE TO SHIP: USING MESOSPHERE ENTERPRISE DC/OS TO DELIVER REAL TIME MICROSERVICES TO A GLOBAL FLEET OF SHIPS & WELCOME TO DIGITAL TRANSFORMATION Today we will be taking you through the moments that
More informationAnalytics for the NFV World with PNDA.io
for the NFV World with.io Speaker Donald Hunter Principal Engineer in the Chief Technology and Architecture Office at Cisco. Lead the MEF OpenLSO project which uses.io as a reference implementation for
More informationCentral Role of Messaging Middleware in Cloud and Digital Transformation Initiatives
White Paper Central Role of Messaging Middleware in Cloud and Digital Transformation Initiatives Sponsored by: IBM Maureen Fleming April 2018 EXECUTIVE SUMMARY Highly decentralized computing is the new
More informationBig Data Application Engineer/ Developer. Specialization in Apache Spark, Kafka, Airflow, HBase
BIG DATA COURSE Big Data Application Engineer/ Developer Specialization in Apache Spark, Kafka, Airflow, HBase In Exclusive Association with 21,347+ Participants 10,000+ Brands 1200+ Trainings 45+ Countries
More informationHadoop in the Cloud. Ryan Lippert, Cloudera Product Cloudera, Inc. All rights reserved.
Hadoop in the Cloud Ryan Lippert, Cloudera Product Marketing @lippertryan 1 2 Cloudera Confidential 3 Drive Customer Insights Improve Product & Services Efficiency Lower Business Risk 4 The world s largest
More informationBeyond Virtualization. Derek Collison - Apcera, June 12, QCon New York
Beyond Virtualization Derek Collison - Apcera, Inc.!!! June 12, 2014 - QCon New York About!! Derek Collison Architected and built TIBCO Rendezvous and EMS Messaging Systems! Co-founded AJAX APIs group
More informationOPENSHIFT CONTAINER PLATFORM
OPENSHIFT CONTAINER PLATFORM FUNDAMENTAL OVERVIEW Mike Surbey Emerging Technology Specialist http://msurbey.com AGENDA 2 1. INTRODUCTION Today s Business Challenge 2. KEY CONCEPTS s, DevOps, etc. 3. HOLISTIC
More informationCask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications
Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications Copyright 2015 Cask Data, Inc. All Rights Reserved. February
More informationOn Cloud Computational Models and the Heterogeneity Challenge
On Cloud Computational Models and the Heterogeneity Challenge Raouf Boutaba D. Cheriton School of Computer Science University of Waterloo WCU IT Convergence Engineering Division POSTECH FOME, December
More informationHadoop and Analytics at CERN IT CERN IT-DB
Hadoop and Analytics at CERN IT CERN IT-DB 1 Hadoop Use cases Parallel processing of large amounts of data Perform analytics on a large scale Dealing with complex data: structured, semi-structured, unstructured
More informationMulti-Containers Orchestration with Live Migration and High-Availability for Microservices
Multi-Containers Orchestration with Live Migration and High-Availability for Microservices Meet Our Presenters Jay Lyman Research Manager, Cloud Platforms, 451 Research Ruslan Synytsky CEO and Co-founder,
More informationArchitecture Optimization for the new Data Warehouse. Cloudera, Inc. All rights reserved.
Architecture Optimization for the new Data Warehouse Guido Oswald - @GuidoOswald 1 Use Cases This image cannot currently be displayed. This image cannot currently be displayed. This image cannot currently
More informationDynamic App Services in Containers PRESENTED BY:
Dynamic App Services in Containers PRESENTED BY: Apps and container market overview Container description and benefits Container platforms and orchestration tools Container integrations: F5 Container Connector
More informationJust Enough Operating System to kick start creativity. Simona Arsene
Just Enough Operating System to kick start creativity Simona Arsene SUSE Linux Enterprise Server JeOS speeds up virtual image deployment Just enough Operating System No need to re-certify Same SUSE Linux
More informationCourse Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.
Course Content Course Description: The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. At Course Completion: After competing this course,
More informationThe Sysprog s Guide to the Customer Facing Mainframe: Cloud / Mobile / Social / Big Data
Glenn Anderson, IBM Lab Services and Training The Sysprog s Guide to the Customer Facing Mainframe: Cloud / Mobile / Social / Big Data Summer SHARE August 2015 Session 17794 2 (c) Copyright 2015 IBM Corporation
More informationRealising Value from Data
Realising Value from Data Togetherwith Open Source Drives Innovation & Adoption in Big Data BCS Open Source SIG London 1 May 2013 Timings 6:00-6:30pm. Register / Refreshments 6:30-8:00pm, Presentation
More informationTop 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11
Top 5 Challenges for Hadoop MapReduce in the Enterprise Whitepaper - May 2011 http://platform.com/mapreduce 2 5/9/11 Table of Contents Introduction... 2 Current Market Conditions and Drivers. Customer
More information20775A: Performing Data Engineering on Microsoft HD Insight
20775A: Performing Data Engineering on Microsoft HD Insight Duration: 5 days; Instructor-led Implement Spark Streaming Using the DStream API. Develop Big Data Real-Time Processing Solutions with Apache
More informationHortonworks Connected Data Platforms
Hortonworks Connected Data Platforms MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA BUSINESS EMBRACE AN OPEN APPROACH 2 Hortonworks Inc. 2011 2016. All Rights Reserved Data Drives the Connected Car
More informationUsing Mesos Schedulers with Amazon EC2 Container Service
Using Mesos Schedulers with Amazon EC2 Container Service Ryosuke Iwanaga Solutions Architect, Amazon Web Services Japan July 2016, LinuxCon+ContainerCon Japan 2016, Amazon Web Services, Inc. or its Affiliates.
More informationFrom Data Deluge to Intelligent Data
SAP Data Hub From Data Deluge to Intelligent Data Orchestrate Your Data for an Intelligent Enterprise Data for Intelligence, Speed, and With Today, corporate data landscapes are growing increasingly diverse
More information20775 Performing Data Engineering on Microsoft HD Insight
Duración del curso: 5 Días Acerca de este curso The main purpose of the course is to give students the ability plan and implement big data workflows on HD. Perfil de público The primary audience for this
More informationRed Hat Container Technology Strategy
Red Hat Container Technology Strategy Containers are so 2014 Clayton Coleman Daniel Riek April 2017 What we told you earlier: The future of the Linux OS is a scale-out cluster-as-computer platform for
More informationTHE AGAVE PLATFORM SCIENCE AS A SERVICE FOR THE OPEN SCIENCE COMMUNITY
THE AGAVE PLATFORM SCIENCE AS A SERVICE FOR THE OPEN SCIENCE COMMUNITY Rion Dooley @deardooley deardooley@gmail.com 12/15/20 1 THE EVOLUTION OF A CYBERINFRASTRUCTURE HPC systems have grown up since then
More informationMigrating to Cloud - Native Architectures Using Microservices: An Experience Report
Migrating to Cloud - Native Architectures Using Microservices: An Experience Report Armin Balalaie, Abbas Heydarnoori, and Pooyan Jamshidi Sharif University of Technology, Tehran, Iran - 2015 Sonam Gupta
More informationCask Data Application Platform (CDAP)
Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop
More information20775: Performing Data Engineering on Microsoft HD Insight
Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com
More informationUnderstanding Cloud. #IBMDurbanHackathon. Presented by: Britni Lonesome IBM Cloud Advisor
Understanding Cloud #IBMDurbanHackathon Presented by: Britni Lonesome IBM Cloud Advisor What is this thing called cloud? Cloud computing is a new consumption and delivery model inspired by consumer internet
More informationLightbend Fast Data Platform. A Technical Overview For Decision Makers
Lightbend Fast Data Platform A Technical Overview For Decision Makers Mobile and IoT use cases are driving enterprises to modernize how they process large volumes of data. Lightbend provides the fundamental
More informationContainer Native Application Development
Container Native Application Development Wolfgang Weigend Disclaimer The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated
More informationDRIVING DIGITAL TRANSFORMATION WITH CONTAINERS AND KUBERNETES. How Kubernetes Manages Containerized Applications to Deliver Business Value
WHITE PAPER AUGUST 2017 DRIVING DIGITAL TRANSFORMATION WITH How Kubernetes Manages Containerized Applications to Deliver Business Value Table of Contents Introduction...3 The Digital Transformation and
More informationSAP Machine Learning for Hadoop. Customer
SAP Machine Learning for Hadoop Customer SAP BusinessObjects Predictive Analytics and Big Data 1. Support for end-to-end operational predictive lifecycle on Hadoop 2. Business Analyst Friendly No coding
More informationSpark, Hadoop, and Friends
Spark, Hadoop, and Friends (and the Zeppelin Notebook) Douglas Eadline Jan 4, 2017 NJIT Presenter Douglas Eadline deadline@basement-supercomputing.com @thedeadline HPC/Hadoop Consultant/Writer http://www.basement-supercomputing.com
More informationBusiness is being transformed by three trends
Business is being transformed by three trends Big Cloud Intelligence Stay ahead of the curve with Cortana Intelligence Suite Business apps People Custom apps Apps Sensors and devices Cortana Intelligence
More informationMICROSOFT AZURE CLOUD CAPABILITIES, COSTS, AND UPDATES
E-Guide MICROSOFT AZURE CLOUD CAPABILITIES, COSTS, AND UPDATES SearchCloud Computing A s offerings continue to evolve, it becomes imperative to continually assess how various vendors stack up. In this
More informationOmega: flexible, scalable schedulers for large compute clusters. Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, John Wilkes
Omega: flexible, scalable schedulers for large compute clusters Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, John Wilkes Cluster Scheduling Shared hardware resources in a cluster Run a mix
More informationMapR Pentaho Business Solutions
MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business
More informationNew Big Data Solutions and Opportunities for DB Workloads
New Big Data Solutions and Opportunities for DB Workloads Hadoop and Spark Ecosystem for Data Analytics, Experience and Outlook Luca Canali, IT-DB Hadoop and Spark Service WLCG, GDB meeting CERN, September
More informationHawk: Hybrid Datacenter Scheduling
Hawk: Hybrid Datacenter Scheduling Pamela Delgado, Florin Dinu, Anne-Marie Kermarrec, Willy Zwaenepoel July 10th, 2015 USENIX ATC 2015 1 Introduction: datacenter scheduling Job 1 task task scheduler cluster
More informationInsights to HDInsight
Insights to HDInsight Why Hadoop in the Cloud? No hardware costs Unlimited Scale Pay for What You Need Deployed in minutes Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive
More informationMicrosoft Azure Essentials
Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,
More information20775A: Performing Data Engineering on Microsoft HD Insight
20775A: Performing Data Engineering on Microsoft HD Insight Course Details Course Code: Duration: Notes: 20775A 5 days This course syllabus should be used to determine whether the course is appropriate
More informationApache Hadoop in the Datacenter and Cloud
Apache Hadoop in the Datacenter and Cloud The Shift to the Connected Data Architecture Digital Transformation fueled by Big Data Analytics and IoT ACTIONABLE INTELLIGENCE Cloud and Data Center IDMS Relational
More informationHow Container Schedulers and Software-Defined Storage will Change the Cloud
How Container Schedulers and Software-Defined Storage will Change the Cloud David vonthenen {code} by Dell EMC @dvonthenen http://dvonthenen.com github.com/dvonthenen Agenda Review of Software-Defined
More informationData Analytics Use Cases, Platforms, Services. ITMM, March 5 th, 2018 Luca Canali, IT-DB
Data Analytics Use Cases, Platforms, Services ITMM, March 5 th, 2018 Luca Canali, IT-DB 1 Analytics and Big Data Pipelines Use Cases Many use cases at CERN for analytics Data analysis, dashboards, plots,
More informationSpecial thanks to Chad Diaz II, Jason Montgomery & Micah Torres
Special thanks to Chad Diaz II, Jason Montgomery & Micah Torres Outline: What cloud computing is The history of cloud computing Cloud Services (Iaas, Paas, Saas) Cloud Computing Service Providers Technical
More informationSpark and Hadoop Perfect Together
Spark and Hadoop Perfect Together Arun Murthy Hortonworks Co-Founder @acmurthy Data Operating System Enable all data and applications TO BE accessible and shared BY any end-users Data Operating System
More informationProcessing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN
Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN Jagadish Venkatraman
More informationSimplify Private Cloud Deployments PRESENTED BY:
Simplify Private Cloud Deployments PRESENTED BY: What CIOs are ultimately looking for is the ability to than their competitors, while adhering to regulatory requirements, and. RedMonk Analyst Strong Security
More informationPentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara
Pentaho 8.0 and Beyond Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our
More informationIBM Message Hub. James Bennett Offering Manager, IBM Cloud Integration IBM Corporation
IBM Message Hub James Bennett Offering Manager, IBM Cloud Integration 2016 IBM Corporation The continued growth of PaaS 2 Use cases 1 Hub for asynchronously connecting services inside Bluemix or beyond
More information<Insert Picture Here> Oracle Exalogic Elastic Cloud: Revolutionizing the Datacenter
Oracle Exalogic Elastic Cloud: Revolutionizing the Datacenter Mike Piech Senior Director, Product Marketing The following is intended to outline our general product direction. It
More informationOracle Big Data Cloud Service
Oracle Big Data Cloud Service Delivering Hadoop, Spark and Data Science with Oracle Security and Cloud Simplicity Oracle Big Data Cloud Service is an automated service that provides a highpowered environment
More informationBuilding Your Big Data Team
Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.
More informationAccelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica
Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud
More informationSt Louis CMG Boris Zibitsker, PhD
ENTERPRISE PERFORMANCE ASSURANCE BASED ON BIG DATA ANALYTICS St Louis CMG Boris Zibitsker, PhD www.beznext.com bzibitsker@beznext.com Abstract Today s fast-paced businesses have to make business decisions
More informationDEVOPS AUTOMATION USING DOCKER, KUBERNETES AND OPENSHIFT. Siamak Sadeghianfar Sr Technical Marketing Manager, OpenShift June 2016
DEVOPS AUTOMATION USING DOCKER, KUBERNETES AND Siamak Sadeghianfar Sr Technical Marketing Manager, OpenShift June 2016 DEFINE DEVOPS Everything as code Application monitoring Automate everything Rapid
More informationTowards The Real-Time Enterprise
EXECUTIVE BRIEFING PAPER Towards The Real-Time Enterprise How A Cloud-Native Application Architecture Provides An Essential Foundation For Digital Transformation January 2019 Table Of Contents About This
More informationBig Data in Cloud. 堵俊平 Apache Hadoop Committer Staff Engineer, VMware
Big Data in Cloud 堵俊平 Apache Hadoop Committer Staff Engineer, VMware Bio 堵俊平 (Junping Du) - Join VMware in 2008 for cloud product first - Initiate earliest effort on big data within VMware since 2010 -
More informationRed Hat Open Shift Container Platform
Red Hat Open Shift Container Platform Daniel.Froehlich@RedHat.com IT Must Evolve to Stay Ahead of Demands Containers package applications with dependencies and isolate the runtime Easy to deploy and portable
More informationVisual Studio Everywhere. Build Great Cloud Apps
Visual Studio Everywhere Build Great Cloud Apps Agenda Why use the cloud to build apps? An overview of Microsoft Azure Virtual machines for lift-shift scenarios Microservices and Azure Service Fabric Data
More informationAurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect
Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect 2005 Concert de Coldplay 2014 Concert de Coldplay 90% of the world s data has been created over the last two years alone 1 1. Source
More informationPentaho 8.0 Overview. Pedro Alves
Pentaho 8.0 Overview Pedro Alves Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our current intended product direction. It is provided for information
More informationUForge AppCenter 3.8. Introduction March Copyright 2018 FUJITSU LIMITED
UForge AppCenter 3.8 Introduction March 2018 Copyright 2018 FUJITSU LIMITED Enterprise Cloud Application Journey 3 stages in transitioning legacy enterprise applications to cloud: Cloud-hosted applications:
More information