Joe Butler, Sharon Ruane Intel Labs Europe. May 11, 2018.

Similar documents
Virtualized network function on-boarding

St Louis CMG Boris Zibitsker, PhD

Workload Engineering: Optimising WAN and DC Resources Through RL-based Workload Placement

Service Assurance for the Virtualizing and Software-Defined Networks

NFV Orchestrator powered by VMware

SAP HANA MADE SIMPLE WITH VALIDATED SOLUTIONS & CONVERGED SYSTEMS. Joakim Zetterblad, Director SAP Practice, EMEA

Recover First, Resolve Next Towards Closed Loop Control for Managing Hybrid Networks

Telco Cloud Operations Transformation: Driving Agility and Customer Centricity

Intel Public Sector 3

Streamlining VNF on-boarding process

Cisco Connected Asset Manager for IoT Intelligence

Telco Cloud and Using Big Data to Improve Customer Experience and to Drive new Revenue Streams

ORACLE CLOUD MANAGEMENT PACK FOR MIDDLEWARE

Sr. Sergio Rodríguez de Guzmán CTO PUE

Service Management for the Mobile Mainframe Delivered via Cloud Lunch and Learn

Target Audience. Executive Summary. Note on TM Forum ZOOM and ETSI MANO

Oracle Enterprise Manager 13c Cloud Control

D5.1 Inter-Layer Cloud Stack Adaptation Summary

Optimize the Performance of Your Cloud Infrastructure

RODOD Performance Test on Exalogic and Exadata Engineered Systems

Steve Bryant-Brown Technology Mayank Nayar Program Manager, Azure Site Recovery. Will Rowley Cloud

Kubernetes for the enterprise

Goodbye to Fixed Bandwidth Reservation: Job Scheduling with Elastic Bandwidth Reservation in Clouds

Next Phase of Evolution in Storage Industry: Impact of Machine Learning

Network maintenance evolution and best practices for NFV assurance October 2016

Microsoft Azure Essentials

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

Cognitive management of multi-service multi-tenant 5G mobile networks

DEPLOYING TELCO-GRADE CLOUD SOLUTIONS AND NFV

NSO in an ETSI NFV Context Carl Moberg Technical Director, Tail-f Engineering January 7, 2015

Industrial Connected Product Solutions

Network Cloud Service Orchestrator

Executive Summary. Providing the AGILITY to support digital operations transformation of hybrid networks

DIGITAL BSS CORE Solution Overview

Resources and Services Virtualization without Boundaries (ReSerVoir)

Data Center Operating System (DCOS) IBM Platform Solutions

ONAP Architecture Overview

Workshop on Grids, Clouds & Service Infrastructures December IRMOS: The first step in real-time technologies for distributed systems

Relationships of ONAP and OSS/BSS

ericsson White paper GFMC-17: Uen October 2017 TELECOM IT FOR THE DIGITAL ECONOMY

VMware Cloud Automation Design and Deploy IaaS Service

Microsoft Monitoring and Operating a Private Cloud

BOOST YOUR SUPPLY CHAIN EXECUTION. THE COMPREHENSIVE PLATFORM FOR LOGISTICS 4.0 OPERATIONS

What s New in TrueSight Capacity Optimization February 2016

AWS for Finance Institutes

MCSE: Private Cloud Training Course (System Center 2012)

Dell EMC Consulting Ingo Strutz

Analytics & Business Intelligence (BI) Enablement (Value Proposition)

SOLVING THE MYSTERY OF SDN & NFV SOLUTIONS EVALUATION SYTEL REPLY S VADVISOR TOOL WILL HELP YOU.

Copyright 2014, Oracle and/or its affiliates. All rights reserved. 2

DRIVING EFFICIENCY AND SIMPLIFICATION IN TELENOR

Koen van den Biggelaar Senior Manager, Solutions Architecture Amazon Web Services

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

COMPUTE CLOUD SERVICE. Move to Your Private Data Center in the Cloud Zero CapEx. Predictable OpEx. Full Control.

Oracle Communications Unified Inventory Management

NetScaler Management and Analytics System (MAS)

Microsoft FastTrack For Azure Service Level Description

Zero-touch Network and Service Management

Architecture Overview for Data Analytics Deployments

Insights. Automation

Oracle Financial Services Revenue Management and Billing V2.3 Performance Stress Test on Exalogic X3-2 & Exadata X3-2

Connect. Challenge. Inspire.

PRODUCT UPDATES APJ PARTNER SUMMIT - BALI. February Software AG. All rights reserved. For internal use only

HW M o n i t o r i n g a n d M a n a g e m e n t S y s t e m f o r T e l c o D a t a C e n t e r. Jungsoo Kim/R&D Manager/SK Telecom

KRnet. IoT Cloud Ecosystem. KangYoon Lee, Korea Lab Director IoT Korea Lab 2014 IBM Corporation

StableNet Enterprise. Automated IT Management & Business Service Assurance

DevOps architecture overview

Innovation Ecosystem About Altice Labs www s.com elabs.com

Industrial. Grundfos use case om Industry 4.0 Big Data cloud arkitektur for IIoT, predictive analytics og Rigtig Industry 4.0

Oracle Integrates Virtual Tape Storage with Public Cloud Economics

Adaptive Power Profiling for Many-Core HPC Architectures

Automation that accelerates transformation. The digital enterprise requires smarter infrastructure automation.

IOT Analytics and business assurance. Ericsson-wedo perspectives October 2017

Oracle Banking Enterprise Collections

Jack Weast. Principal Engineer, Chief Systems Engineer. Automated Driving Group, Intel

THE IOT CORE INDUSTRIAL EDGE INTELLIGENCE PLATFORM

Azure IoT Suite. Secure device connectivity and management. Data ingestion and command + control. Rich dashboards and visualizations

Application Performance Management for Cloud

How to build and deliver an Intelligent Orchestration Platform

Cloud Cruiser for Cisco Intelligent Automation for Cloud

A Development and Execution Environment for Early Warning Systems for Natural Disasters

HKT s Digital Transformation Journey

SERVICE FULFILMENT SYSTEMS: WORLDWIDE FORECAST

Learn How To Implement Cloud on System z. Delivering and optimizing private cloud on System z with Integrated Service Management

FlashStack For Oracle RAC

TestCraft. Testing as a Service to Accelerate SDN/NFV Service Deployment

NVMe: The Key to Unlocking Next-Generation Tier 0 Storage

Engineering Services Outsourcing

NFV & SDN Migration Challenges. Interoperability, co-existence and specific services as the drivers and opportunities for NFV & SDN migration

CHARTING THE FUTURE OF INNOVATION # ERICSSON TECHNOLOGY EVOLVING OPERATIONS SUPPORT SYSTEMS

Translate Integration Imperative into a solution Framework. A Solution Framework. August 1 st, Mumbai By Dharanibalan Gurunathan

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

AI in telecoms: a game changer?

An Enterprise Architect s Guide to API Integration for ESB and SOA

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

November RADCOM Ltd (RDCM) Corporate Overview

Authorised Technology Partner

Integrated Service Management

White paper A Reference Model for High Performance Data Analytics(HPDA) using an HPC infrastructure

Oracle Autonomous Data Warehouse Cloud

Transcription:

Joe Butler, Sharon Ruane Intel Labs Europe. May 11, 2018.

Orchestrating apps (content) and network. Application And Content Complexity & demand for network performance. Immersive Media, V2X, IoT. Streaming, Gaming High precision orchestration requires reliable insights into application sensitivities. Web 3G 4G 5G & M-RAT, Cloud-RAN, ICN, NFV, SDN Network diversity, complexity, & need for efficiency, flexibility and automation. 2

Orchestration for E2E & Network Edge Stringent SLAs. Sensitive workloads. Complex chaining. Dense packing. Distribution. Heterogeneity. Scale. Service Chain Placement Infrastructure Orchestration & Management Insights & Heuristics APEX LAKE Closed Loop Hi-Res Landscapes Advanced Telemetry Automating generation of usable insights for intelligent, high-precision orchestration. Apex Lake: Intel Labs Orchestration Research Platform. 3

Virtualised network & edge. Fine grained view of resources and services. Resource constraints at edge. Virtualisation & Cloudification brings network as-a-service. FP7 Mobile Cloud Networking (2013-2016) was a collaborative research project focused cloud-style delivery of Telco network services. Heterogeneity. Densification. Dynamic consumption patterns. Small cell + cloud enablement extends on-demand to edge. H2020 5G Essence is a collaborative research project focused on optimised deployment of 5G services across multi-tenant, cloud-enabled, Small Cell edge infrastructure.. 4

Service Function Chaining. Workflow 1: DPI Workflow 2: Workflow 3: QoS Fair Use Transformation Billing QoS Fair Use Streaming Billing QoS Caching & Delivery Billing Web and Content services. H2020 RECAP is a collaborative research project focused on capacity planning of heterogeneous Cloud to the Edge via infrastructure optimization, modelling, simulation and automated self-adaption Service chain components and distributed resource placement options as a directed acyclic graph.. 5

Toward Network Functions -aas. Appliance Era VM/Hypervisor Container FaaS Service Lifetime Fixed-Function. 100% manual deployment and operation Virtualisation. consolidation. Years Months Next stage of progress. Micro-services with millisecond instantiation times, sub-second billing. Tight packing. Self-service, auto-scaling Minutes Fully Automated.! Seconds Service Instantiation weeks days minutes seconds ms 6

Emerging edge use cases. Industry Transport / V2X -> dynamic resource topologies. -> hard constraints on data processing. -> additional constraints: data timeliness and provenance, security and privacy. 7

Resource Allocation: Prescription. Descriptor metadata Goals: - Service Performance - Manageability. VNFD high level object model, source. https://osm.etsi.org/wikipub/images/2/26/osm_r2_information_model.pdf NFV reference architecture diagram. Source: ETSI specifications documents circa 2013.. 8

Resource Allocation: Learning. Monitoring + Analytics + Automation Goals: - Tight Packing, - Platform Awareness, - Resource Affinities, - KPI Mapping / SLA Compliance. Start Vnic-4 = OvS Vnic-4 = SR-IOV Vnic-3 = OvS Vnic-3 = SR-IOV Vnic-5 = SR-IOV Vnic-5 = OvS Vnic-1 = OvS Vnic-1 = SR-IOV Less 3 Gbp s Vnic-3 = OvS Vnic-3 = SR-IOV Less 400 Mbp s Vnic-1 = OvS Vnic-1 = SR-IOV less 800 Mbp s less Decision Tree automatically guides Orchestrator to select SR-IOV enabled NIC based on service characterization.. H2020 Superfluidity is a collaborative research project targeting a super-fluid, cloud-native, converged edge system. 9

Machine Learning in context. Goal automating precise and efficient placement & adjustment, driven by service objectives. - What are the usable key metrics, attributes, and expressions of constraints and objectives? - How can we automatically discover these in context? Approach: Full-stack, adaptive telemetry, Background and foreground analytics loops, Workload fingerprinting, Automation. Techniques: Utility Theory, Cost Functions, Machine Learning, Evolutionary Algorithms, Hybrid Algorithms. Metrics: Service performance, Resource allocation rightsizing, Resource utilization, Accuracy of predictions, Confidence of maintaining SLA. 10

Example: dynamic placement. Test bed emulation of mobile urban environment. A: Baseline B: 100k endpoints C: 140k endpoints 20 gateways with overlapping. coverage, >100k mobile endpoints. Nearest fit / current utilisation as baseline. Max scale16k endpoints, 6/20 gateways oversubscribed, Headroom and utilization balanced, 7ms compute placement. Gateway saturation, 7ms compute placement. Hybrid multi-attribute utility function + evolutionary algorithm formulates capacity and endpoint trajectories. 14ms to compute placement. 11

Sharon Ruane, Joe Butler

Prediction of Heterogeneous Workload behavior: Object store Video transcode Wordpress ERP 0 Virtual Storage Virtual Network Virtual Machine NVM 10Gb SSD Xeon Phi AES-NI Atom Xeon E5 13

Prediction of Heterogeneous Workload behavior: Object store Video transcode Wordpress ERP 0 Virtual Storage Virtual Network Virtual Machine NVM 10Gb SSD Xeon Phi AES-NI Atom Xeon E5 14

Prediction of Heterogeneous Workload behavior: Research Questions: Can we predict the behavior of an incoming workload if it s placed on a resource which is already in use? Using this, can we pack the workloads to maximize utilization of all resources, while avoiding overload? 0 15

Prediction of Heterogeneous Workload behavior: Research Questions: Can we predict the behavior of an incoming workload if it s placed on a resource which is already in use? Using this, can we pack the workloads to maximize utilization of all resources, while avoiding overload? 0 Experimentation: CPU Utilization per workload instance Subsystem Interference Different systems exhibit varied saturation patterns 16

Prediction of Heterogeneous Workload behavior: Research Questions: Can we predict the behavior of an incoming workload if it s placed on a resource which is already in use? Using this, can we pack the workloads to maximize utilization of all resources, while avoiding overload? 0 Experimentation: 1_iozone_5stress 2_iozone_5stress 3_iozone_10stress + = 17

Prediction of Heterogeneous Workload behavior: Research Questions: Can we predict the behavior of an incoming workload if it s placed on a resource which is already in use? Using this, can we pack the workloads to maximize utilization of all resources, while avoiding overload? 0 Experimentation: 1_iozone_5stress 2_iozone_5stress 3_iozone_10stress + = 18

Prediction of Heterogeneous Workload behavior: Research Questions: Can we predict the behavior of an incoming workload if it s placed on a resource which is already in use? Using this, can we pack the workloads to maximize utilization of all resources, while avoiding overload? 0 Experimentation: Model 99% CPU 99% NIC 93% DSK 1_iozone_5stress 2_iozone_5stress 3_iozone_10stress + = 19

Prediction of Heterogeneous Workload behavior: Research Questions: Can we predict the behavior of an incoming workload if it s placed on a resource which is already in use? Using this, can we pack the workloads to maximize utilization of all resources, while avoiding overload? 0 Experimentation: Model 99% CPU 99% NIC 93% DSK 1_iozone_5stress 2_iozone_5stress 3_iozone_10stress + = 20

Integrated ML approach to Orchestration Efficient resource allocation KPI identification Network net_interfaces_network_utilization net_interfaces_receive_bytes net_interfaces_transmit_packets net_receive_bytes proc_stat_meminfo_slab proc_stat_meminfo_sunreclaim net_interfaces_receive_packets net_tx_mb net_transmit_bytes net_tx Energy management Selected cores active Rest powered down proc_stat_meminfo_active_file proc_stat_meminfo_sunreclaim proc_stat_meminfo_cached proc_stat_meminfo_active proc_stat_meminfo_active_anon proc_stat_meminfo_anonpages proc_stat_meminfo_memavailable proc_stat_meminfo_anonhugepages proc_stat_meminfo_sreclaimable proc_stat_meminfo_free Optimal placement of chained workloads service chain 0 SLA assurance Optimal placement infrastructure 21

Integrated ML approach to Orchestration 22

Integrated ML approach to Orchestration? 23

Integrated ML approach to Orchestration? workload 24

Integrated ML approach to Orchestration? workload Core 1 Core 2 25

Integrated ML approach to Orchestration? workload Core 1 Core 2 26

Integrated ML approach to Orchestration? workload Core 1 Core 2 NEW class!! UNEXPECTED BEHAVIOUR Monitoring 27

Integrated ML approach to Orchestration? workload Core 1 Core 2 NEW class!! UNEXPECTED BEHAVIOUR Monitoring 28

Integrated ML approach to Orchestration Efficient resource allocation Power minimization? workload Core 1 Core 2 SLA awareness Continuous learning of shared behavior NEW class!! Monitoring UNEXPECTED BEHAVIOUR Continuous learning of new workloads Ability to deal with changes in environment Continuous learning of use for provisioning decisions Experience of common anomalies Ability to share insights with other machines 29