Building a Multi-Tenant Infrastructure for Diverse Application Workloads

Similar documents
Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11

IBM Grid Offering for Analytics Acceleration: Customer Insight in Banking

Introduction to IBM Technical Computing Clouds IBM Redbooks Solution Guide

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Welcome to. enterprise-class big data and financial a. Putting big data and advanced analytics to work in financial services.

IBM Platform LSF & PCM-AE Dynamische Anpassung von HPC Clustern für Sondernutzung und Forschungskollaborationen

MapR Pentaho Business Solutions

InfoSphere DataStage Grid Solution

Learn How To Implement Cloud on System z. Delivering and optimizing private cloud on System z with Integrated Service Management

IBM BPM on zenterprise

Data Center Operating System (DCOS) IBM Platform Solutions

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN

Common Customer Use Cases in FSI

Cask Data Application Platform (CDAP)

Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform

WHITEPAPER. Unlocking Your ATM Big Data : Understanding the power of real-time transaction monitoring and analytics.

Going beyond today: Extending the platform for cloud, mobile and analytics

Microsoft Azure Essentials

Product Brief SysTrack VMP

Sr. Sergio Rodríguez de Guzmán CTO PUE

ETL on Hadoop What is Required

InfoSphere Warehouse. Flexible. Reliable. Simple. IBM Software Group

Business is being transformed by three trends

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

InfoSphere Warehousing 9.5

Introduction to Cloud Computing

Andrew Macdonald ILOG Technical Professional 2010 IBM Corporation

Cloud Computing. University of Economics and Law. Duc.NHM Faculty of Information Systems

Partnering for Business Value

IBM Tivoli Monitoring

LEVERAGING DATA ANALYTICS TO GAIN COMPETITIVE ADVANTAGE IN YOUR INDUSTRY

Real-time Analytics Powered by GPU-Accelerated Databases. Chris Prendergast and Woody Christy GTC, May 8, 2017

Introduction to the IBM MessageSight appliance for Mobile Messaging and M2M

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

Microsoft Big Data. Solution Brief

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS)

IBM ICE (Innovation Centre for Education) Welcome to: Unit 1 Overview of delivery models in Cloud Computing. Copyright IBM Corporation

Bringing the Power of SAS to Hadoop Title

ETL challenges on IOT projects. Pedro Martins Head of Implementation

Oracle Engineered Systems and WeDo Technologies

Transforming Big Data to Business Benefits

Cloud Computing An IBM Perspective

Digital Industrial Transformation Powered by the Industrial IoT

IBM Solutions for Enhancing Business Process Management (BPM)

Automation, Innovation and Consolidation SAS GRID

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

CAPACITY MANAGEMENT in the MODERN DATA CENTER. TURBONOMIC white paper

Integrated Service Management

Introduction to Enterprise Computing. Computing Infrastructure Matters

Cloudera Hadoop & Industrie 4.0 wohin mit dem Datenstrom?

Big Data The Big Story

Job Scheduling Challenges of Different Size Organizations

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Insights to HDInsight

Trusted by more than 150 CSPs worldwide.

Business Analytics and Optimization An IBM Growth Priority

MANAGEMENT METRICS FOR CRITICAL IT SERVICES

The ABCs of. CA Workload Automation

Tech Mahindra s Cloud Platform and PaaS Offering. Copyright 2015 Tech Mahindra. All rights reserved.

Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications

Network maintenance evolution and best practices for NFV assurance October 2016

Integrating MATLAB Analytics into Enterprise Applications

Three-Tier High-Performance Computing Environment

Data Analytics with MATLAB Adam Filion Application Engineer MathWorks

SAP Predictive Analytics Suite

DevOps Guide: How to Use APM to Enhance Performance Testing

Build a Future-Ready Enterprise With NTT DATA Modernization Services

rid Computing in he Industrial Sector

Trasformare il Business con Soluzioni Cloud

Datametica DAMA. The Modern Data Platform Enterprise Data Hub Implementations. What is happening with Hadoop Why is workload moving to Cloud

SAVE MAINFRAME COSTS ZIIP YOUR NATURAL APPS

IBM Tivoli Endpoint Manager for Lifecycle Management

IBM WebSphere Application Server, Version 6 Delivers Business Flexibility

Cognitive Solutions in the Context of IBM Systems Cognitive Analytics / Integration Scenarios / Use Cases

IBM _` iseries systems Retail

MapR Streams A global pub-sub event streaming system for big data and IoT

Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions

Transition to SOA. Oracle SOA Suite. Martin Jäkle Solution Architect TSBU Fusion Middleware Oracle Deutschland

Application Migration to Cloud Best Practices Guide

Business Agility for Smarter Banking

Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN

NFV Orchestrator powered by VMware

INFRASTRUCTURE MONITORING + APM + LOGS

Architected Blended Big Data With Pentaho. A Solution Brief

Creating an Enterprise-class Hadoop Platform Joey Jablonski Practice Director, Analytic Services DataDirect Networks, Inc. (DDN)

Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM

DLT AnalyticsStack. Powering big data, analytics and data science strategies for government agencies

What is Cloud Computing? Irving Wladawsky-Berger

Increase Value and Reduce Total Cost of Ownership and Complexity with Oracle PaaS

IBM WebSphere Service Registry and Repository, Version 6.0

EMC ATMOS. Managing big data in the cloud A PROVEN WAY TO INCORPORATE CLOUD BENEFITS INTO YOUR BUSINESS ATMOS FEATURES ESSENTIALS

The IBM Reference Architecture for Healthcare and Life Sciences

Service oriented architecture solutions White paper. IBM SOA Foundation: providing what you need to get started with SOA.

Application Performance Management for Cloud

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure

BullSequana S series. Powering Enterprise Artificial Intelligence

An Oracle White Paper June Maximizing Performance and Scalability of a Policy Automation Solution

Business Process Management for Innovation and Optimisation. David Bate SOA Software Sales Executive IBM Asia Pacific

Make strategic decisions on future income, market value and risk

Transcription:

Building a Multi-Tenant Infrastructure for Diverse Application Workloads Rick Janowski Marketing Manager IBM Platform Computing 1

The Why and What of Multi-Tenancy 2

Parallelizable problems demand fresh approaches Financial Market and Credit Risk, Insurance Credit-scoring, Fraud Detection Calculate this now! over 500,000 scenarios, 500 instruments, 200 time steps. Life Sciences Genome Mapping Mine 24 months of credit card purchases for 30,000,000 cardholders to identify credit-worthy customers by geography CAE Parametric sweeps, DOE Contrails perform assembly and mapping of large genomes in hours rather than weeks using MapReduce programming model. Perform designs of experiment and parametric sweeps for a variety of computer-aided design applications to find optimal designs without physical prototyping. 3

Business drivers Ever-increasing expectations Insatiable appetite for deeper, more thorough analysis Results increasingly time critical Better insights into exploding volumes of big data Control of administrative and infrastructure costs to grow computing capability within IT budget constrains 4

Technical needs Ever-increasing expectations: Increased performance to support business demands Increased scalability to address huge and growing volumes of data Optimized use of existing resources for scaled performance Efficient data management to remove data bottlenecks Support for new, cloud-native application workload patterns Effective operational management monitoring, alerting, diagnostics and security 5

Scale up versus scale out App App 6

Big Data and Analytics infrastructure silos are inefficient Many new solution workloads in addition to existing apps Leads to costly, siloed, under-utilized infrastructure and replicated data Batch Overnight Financial Reporting Counterparty Credit Risk Modeling Distributed ETL, Sensitivity Analysis Hadoop Sentiment Analysis Low Utilization = Higher cost 7

Scale-out challenges Silos Underutilization of resources Management and reporting challenges Different clusters for different workload types Arduous and time-consuming cluster reconfiguration between workloads types Separate clusters for different versions of Hadoop (or other key applications) 8

The role of multi-tenancy New cost- and space-efficient workload and resource management approaches offer solutions for multiple workload types, applications and Hadoop versions on the same cluster Inconsistent terminology =/ 9

Multi-tenancy: the narrow view A single instance of an application serves multiple client organizations (tenants) In a multitenancy environment, multiple customers share the same application, running on the same operating system, on the same hardware, with the same data-storage mechanism. (Wikipedia: http://en.wikipedia.org/wiki/multitenancy) 10

Dimensions of multi-tenancy (shared services) App Multiple users, groups or departments Multiple workload patterns Multiple sessions Multiple versions Multiple instances Multiple operating environments / platforms App App App App 11

Implementing Multi-Tenancy 12 12

About Platform Symphony Heterogeneous grid management platform Supports multiple users, applications and lines of business on a shared grid A B C D Data Grid / Data Analytics Application A-Algorithmics B Commercial Software C-Proprietary Models D-Other Analytic Apps Workload Manager C C C C C C B B A A A A A A A A C C C C C C B B A A A A A A A A C C C C C C B D D D D D D B B B B B B B B B B D D D D D D B B B Resource Orchestration 13 13

Platform Symphony Reducing Cost Avoid expensive application and departmental silos Share infrastructure while protecting SLAs Avoid to infrastructure spending Improve utililization Department A Department B Department C Application A Application B Application C 14

Platform Symphony Improve Performance and Predictability Sub-millisecond latency Massive scale Flex instantly to reflect business priorities Better quality results faster Department A Department B Department C Application A Application B Application C Scenario: Urgent pretrade analysis to drive critical hedging decisions Result: Resources reallocated instantly according to policy resulting in faster more thorough simulation and a time-to-market advantage. 15

Platform Symphony Sophisticated Resource Sharing Enables sharing while preserving ownership Near 100% sustained resource utilization Allocations flex quickly to reflect business priorities Support new applications with existing infrastructure Platform Symphony improves on application SLAs while using resources more efficiently than competing grid managers 16

Platform Symphony MultiCluster Scale beyond a single cluster Unified management of distributed clusters Full visibility to resources, users and applications Select appropriate cluster at runtime Maximize resource usage Simplify reporting and capacity planning Cluster A Cluster B Platform Symphony Grid Cluster C 17 17

How is Platform Symphony unique? Low Latency / Hi-throughput Sub-millisecond response >17,000 tasks per second throughput Large Scale 10,000 cores per application 280,000 cores per grid Cost-Efficient, Shared Services Multi-tenant grid solution Guarantees SLAs while encouraging resource sharing Easy to on-board new grid applications Maximizes use of Grid Resources Heterogeneous and Open Linux, Windows / Windows HPC, AIX, Solaris C/C++, C#, Java, Excel, Python, R Smart data handling, Data Affinity Native, optimized Map Reduce implementation 18 18

Use Case for Grid: Liquidity Risk Analysis The Client A publicly traded bank with both retail and wholesale operations in over 120 countries and with over $500 billion in assets under management High Growth Impacting its Risk IT The bank uses IBM Algorithmics for its Liquidity Risk Analysis (LRA), running on dedicated servers Time-to-completion increased to 100 hours for analyzing now 150,000-plus records Added Platform Symphony Grid Increased # of available cores by 6 times by borrowing idle cores Decreased time-to-completion for LRA to 10 hours 19 19

Platform Symphony and Algorithmics 20 20

IBM Algorithmics Market risk Credit risk Liquidity risk Collateral and capital management 21

Algorithmics presents multiple opportunities for parallelism Algo One Software Services Customer Data Risk Mapper Algo Data Server Scenario Generation Simulation Aggregation Reporting ASE RW ASE ARDB CSV CSV CSV RM RM ADS ADS ASE RW RW ASE RPM Scenarios RM ADS RW Algo Risk Engine ADB RW RW ARE ARE ACE Algo Cube Explorer Platform Symphony Shared Services Grid Infrastructure for Compute and Data Intensive Applications 22 22

Algorithmics presents multiple opportunities for parallelism Algo One Software Services Customer Data Risk Mapper Algo Data Server Scenario Generation Simulation Aggregation Reporting 2 CSV ASE RM ADS ASE CSV Algo database RM performance ADS can benefit from CSV parallel database technologies RM ADS Scenarios RW RW RW RW 4 ASE Algo Risk Engine ASE is already parallelized, and multiple aggregation activities RPMcan run concurrently across various job streams controlled by Algo Batch Algo Risk Engine ARDB 1 ADB Opportunity for multiple concurrent risk mappers to reformat customer data into a schema loadable by the Algo Data Server RW RW 3 ACE ARE RiskWatch can run multiple parallel Algo Cube simulation scenarios on subsets Explorer of instruments generating ARE cubelets representing simulation results Platform Symphony Shared Services Grid Infrastructure for Compute & Data Intensive Applications 23 23

Why use a Grid Manager with Algorithmics? Best host dynamically selected at run-time Avoid hard-coding hosts Easier to manage as environment grows Grid guarantees task execution (avoiding the need for elaborate exception handling and scripting) Dramatic reduction in process run-times SLAs guaranteed, task completion within batch windows assured Improved administrator productivity Enhanced quality of service to analysts and business users Use assets more efficiently reducing total infrastructure cost 24 24

Platform Symphony Integration from an Algorithmics user s perspective Analytic processes in Algorithmics run under the control of Algo Batch Job Boxes represent discrete units of work in Algo work flows High level tasks can be parallelized within ABE Stream execution can be dramatically accelerated by using a grid manager to accelerate speed of individual job boxes 25 25

Platform Symphony Integration from an Algorithmics user s perspective Platform Symphony can accelerate analytic processes such as simulation and cube generation by enabling tasks to execute reliably in parallel at large scale 26 26

Platform Symphony Integration from an Algorithmics user s perspective Platform Symphony reflects the name of Algorithmics Batch job box names as session names and tags for ease of management 27 27

Unique advantages of the Platform Symphony integration AlgoEngine Application RW RW RW RW Risk watch instances started once and re-used dynamically depending on relative priorities of Algo job streams, avoiding the need to start and stop instances for each MtF cubelet. stop all RW Algo lookup Session 1 Priority X START Session 2 Session N Priority RiskWatch Y instances are dynamically assigned according to Symphony loaning and borrowing rules and the relative priority of job streams. Priority ZResource allocations can flex dynamically based on timebased ownership rules, sharing policies and priorities STOP 28 28

The case for a grid / workload manager Platform Symphony enables near 100% resource utilization by enforcing the notion of ownership while still enabling departments to share resources based on flexible policies. 29 29

Conclusion 30

Big Data and Analytics infrastructure silos are inefficient Many new solution workloads in addition to existing apps Leads to costly, siloed, under-utilized infrastructure and replicated data Batch Overnight Financial Reporting Counterparty Credit Risk Modeling Distributed ETL, Sensitivity Analysis Hadoop Sentiment Analysis Low Utilization = Higher cost 31

IBM Platform Symphony A multi-tenant shared services platform with sophisticated resource-sharing capabilities Manages diverse system and application services on a shared infrastructure ISV applications, in-house developed applications Optimized low-latency Hadoop compatible run-time Can be used to launch, persist and manage non-grid-aware application services Supports long-running cloud-native frameworks such as MongoDB and Cassandra* Service controller guarantees reliable execution, understands dependencies * * Application Service Controller announced October 2014 32

The state of the art Organizations need ways to add new workloads, implement new applications and deploy new versions of Hadoop without having to deploy new clusters While there are emerging approaches aimed at reducing the need for multiple, disparate clusters, each of the approaches has significant limitations IBM Platform Symphony combines multi-tenancy / shared services and multi-modality so organizations can run SOA, batch and long-running service workloads on a single cluster Built on an architecture designed for low-latency scheduling, Platform Symphony enables organizations to gain the performance for running time-critical jobs while maximizing the value of their hardware investments 33

Rick Janowski rjanowski@us.ibm.com +1 720-395-7852 34