Mike Strickland, Director, Data Center Solution Architect Intel Programmable Solutions Group July 2017


Accelerate Big Data Analytics with Intel Frameworks and Libraries with FPGAs
1. Intel Big Data Analytics Frameworks & API Libraries: Accelerate innovation in big data analytics with frameworks built on Software Defined Infrastructure with open-standard building blocks.
2. Intel Frameworks & Libraries integrated with FPGAs: Run unmodified customer applications, use runtime orchestration with both Xeon and FPGA support, and leverage end-to-end virtualization and security.
3. Accelerate Relational, NoSQL, and Unstructured: FPGA data access, networking, and algorithm acceleration options with a single FPGA for highly structured, semi-structured, and unstructured data, for better TCO, flexibility, and future proofing.
Analytics Landscape and Scaling

FPGAs Offer Unique Value for Analytics/Streaming
Single multi-function accelerator (diagram labels: Network, Streaming Data, Data)
Integrate to Intel Frameworks & APIs:
- Run unmodified customer applications
- Orchestration runtime advantage: Intel Xeon processors or Intel FPGAs
- End-to-end security and virtualization framework
Significant acceleration:
- PCIe lookaside acceleration
- Networking + streaming + data access
- Inline acceleration and protocol acceleration
- Compression, filtering, encryption
- Fast lookups/hashing

Multi-Function FPGA: Algorithm + Networking + Data Access Acceleration
Microsoft Scale Out FPGA Multi-Function Accelerator
- Diversity of cloud workloads and rapid change (weekly or monthly)
- Search, SmartNIC, Machine Learning, Encrypt, Compress, Big Data Analytics
- Lower and predictable latency for ranking vs. software
- FPGA CNN performance to peak TFLOPs ratio similar to GPU (MSFT Hot Chips presentation, Sept 2015)
Source: Microsoft

The power of deep learning on FPGA (from Microsoft Build 2017 Conference)
Performance:
- Tens to hundreds of TOPS of effective inference throughput at low batch sizes
- Ultra-low latency serving on modern DNNs
- >10X better than CPUs and GPUs
- Scale to many FPGAs in single DNN service
Flexibility:
- FPGAs ideal for adapting to rapidly evolving ML
- CNNs, LSTMs, MLPs, reinforcement learning, feature extraction, decision trees, etc.
- Inference-optimized numerical precision
- Custom binarized, ternarized, tiny precision nets
- Sparsity, deep compression for larger, faster models
Scale:
- Microsoft has the world's largest cloud investment in FPGAs
- Multiple Exa-Ops of aggregate AI capacity
- We have built powerful DNN serving platform on our FPGA fabric
Source: Microsoft

Ecosystem Partner Solutions & POCs using Intel FPGAs
SQL over relational open databases:
- Traditional data warehousing acceleration
- Real-time analytics acceleration
NoSQL:
- Memcached/KVS, Cassandra
Hadoop, SPARK:
- Accelerate aggregation or shuffle phase with better compression
- Data ingest: FPGA does inline offload of Extract, Transform, and Load (ETL)
- SPARK MLlib unstructured machine learning APIs, e.g. K-means, SVM
- SQL over SPARK

End User Programming Interfaces
[Diagram: host and FPGA stacks; labels as extracted]
Host (Software Framework):
- End User Application (end-user developed, application-specific functionality)
- End User SW interface (PCIe Driver + Runtime)
- AAL (FPGA Runtime)
- FPGA Driver (Intel-provided infrastructure)
- CPU
FPGA (Hardware Framework):
- End User Acceleration Function Unit (AFU) (end-user developed, application-specific functionality)
- End User RTL Interface (CCI-P Specification)
- Intel-provided infrastructure: QPI/PCIe Link, Protocol, PHY, Security, RAS, Pwr Mgmt, HSSI

FPGA Accelerator Card and Integrated FPGA Platforms
[Diagram comparing two platforms; labels as extracted]
- Intel Xeon + FPGA Accelerator Card Platform: Intel Xeon CPU (Software Framework, DDR) attached over PCIe to an FPGA accelerator card (Hardware Framework, accelerators, HSSI). Available today: Intel Arria 10 PCIe.
- Intel Xeon processors + Intel FPGA Integrated Platform: Intel Xeon CPU (Software Framework, DDR) attached over PCIe and QPI to an integrated FPGA (Hardware Framework, accelerators, HSSI) in a multi-chip package. Today: BDX+FPGA MCP pilot.
- Memory labels in the diagram: DDR, Private Memory Subsystem
PCI Express* (PCIe*)

New Cloud Scale Apps and Services
[Diagram: FPGA-enabled orchestration flow; labels as extracted]
- Cloud users launch a workload
- Workload IP library (workload accelerators): end-user developed IP, 3rd-party developed IP, Intel developed IP
- Orchestration software (FPGA enabled) places the workload
- Software Defined Infrastructure resource pool: storage, network, compute (Intel Xeon processor, Intel FPGA)
- Static / dynamic FPGA programming
- OpenStack*-based sample solution available from Intel
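The placement step on this slide (orchestration software choosing a node from the resource pool, with static or dynamic FPGA programming) could be sketched roughly as follows. This is a minimal illustration only, not the OpenStack-based sample solution: the `Node` fields, the `place_workload` function, and the action names are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Node:
    """Hypothetical resource-pool entry; field names are illustrative."""
    name: str
    has_fpga: bool = False
    loaded_ip: Optional[str] = None  # accelerator IP currently programmed, if any
    busy: bool = False


def place_workload(pool: List[Node], required_ip: Optional[str]) -> Tuple[Optional[Node], str]:
    """Pick a node for a workload and report how it was placed.

    Returns (node, action) where action is 'reuse', 'reprogram', 'cpu' or 'none'.
    """
    free = [n for n in pool if not n.busy]
    if required_ip is not None:
        # Static case: a free FPGA already holds the requested IP.
        for n in free:
            if n.has_fpga and n.loaded_ip == required_ip:
                n.busy = True
                return n, "reuse"
        # Dynamic case: reprogram any free FPGA with the requested IP.
        for n in free:
            if n.has_fpga:
                n.loaded_ip = required_ip
                n.busy = True
                return n, "reprogram"
    # Fallback: run on a CPU, preferring nodes without an FPGA so that
    # FPGA-equipped nodes stay available for accelerated workloads.
    for n in sorted(free, key=lambda n: n.has_fpga):
        n.busy = True
        return n, "cpu"
    return None, "none"
```

A real scheduler would also weigh reprogramming time, data locality, and tenancy; the sketch only shows the order of preference implied by "static / dynamic FPGA programming".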

Intel Xeon processor + Intel FPGA Software Stack for Compression Acceleration: SPARK/Hadoop (Integrated & Accelerator Card Reference Design)
[Layered stack diagram; labels as extracted, top to bottom]
- SPARK & Hadoop framework
- Hadoop compression codec (frameworks & shim)
- Compression accelerator API, with CPU and FPGA paths
- Pre/post processing (accelerator support SW)
- FPGA Accelerator Abstraction Layer (driver, runtime, etc.) (Intel SW infrastructure)
- FPGA infrastructure (BlueStream) (Intel HW infrastructure)
- Compression IP #1 through Compression IP #N (accelerator implementation in RTL)
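The idea of a codec shim with CPU and FPGA paths can be illustrated with a small sketch. Everything here is an assumption made for illustration: `ShimCodec` and `AcceleratorFn` are hypothetical names, the "accelerator" is any callable that returns zlib-compatible bytes, and the CPU path uses Python's standard `zlib`. It is not the Hadoop codec interface or an Intel API.

```python
import zlib
from typing import Callable, Optional

# Hypothetical accelerator entry point: takes raw bytes and returns
# zlib-compatible compressed bytes. Stands in for the FPGA path.
AcceleratorFn = Callable[[bytes], bytes]


class ShimCodec:
    """Codec shim that prefers an accelerator and falls back to the CPU."""

    def __init__(self, accelerator: Optional[AcceleratorFn] = None, level: int = 6):
        self._accelerator = accelerator
        self._level = level
        self.last_backend: Optional[str] = None  # 'fpga' or 'cpu' after a call

    def compress(self, data: bytes) -> bytes:
        if self._accelerator is not None:
            try:
                out = self._accelerator(data)
                self.last_backend = "fpga"
                return out
            except RuntimeError:
                # Accelerator unavailable or failed: fall through to the CPU path.
                pass
        self.last_backend = "cpu"
        return zlib.compress(data, self._level)

    def decompress(self, data: bytes) -> bytes:
        # Both paths emit zlib-compatible output, so one decoder is enough.
        return zlib.decompress(data)
```

Because the caller only sees `compress` and `decompress`, the application above the shim does not change when an accelerator is added or removed, which is the property the stack on this slide is meant to provide.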

Different Data Store Approaches
Structured Data / Relational:
- Traditional relational database for OLTP, Business Intelligence, and OLAP
Semi-structured Data / NoSQL:
- NoSQL databases run on multiple servers with no single point of failure
- Key value store, column-oriented, field, range queries, regex, hybrid
Unstructured Data:
- Hadoop designed to work on inexpensive hardware with redundancy
- Hadoop usually w/unstructured HDFS flat files; SPARK RDD in memory is faster

Swarm64 Relational Database Acceleration
Two workloads: Traditional Data Warehousing, Real Time Data Analytics
Database accelerated with a plugin
Acceleration overview:
- 5X+ updates/queries for real-time data analytics (high-velocity data, 1M updates/queries/s)
- 2X+ queries for data warehousing (using industry-standard TPC-DS benchmark)
- 3X+ storage compression
- Data & tables managed by Swarm64
Note: this is SQL to relational database, not SQL to semi/unstructured data.
Source: Swarm64

Swarm64 Relational Database Acceleration: Scale Up
Data Warehousing, Real Time Data Analytics, and storage compression
Database accelerated with a plugin
Overview:
- No customer application change
- Storage engine plugin: PostgreSQL, MySQL
- Query engine: accelerate INSERT, SELECT
- Optimized indexing
- More I/O bandwidth and memory depth from compression
Significant acceleration:
- Data access acceleration
- Compression, filtering, replication
- Memory-mapped acceleration, data cache
- Optimized column indexing

Swarm64 Real-Time Analytics Use Cases
Banking & Finance:
- Fraud detection
- Trading & risk management
Business Intelligence:
- Real-time process monitoring
Government:
- Network monitoring and threat analysis
Healthcare:
- Remote patient monitoring
Utilities:
- Real-time asset monitoring
- Smart metering
- Service quality optimization
IT & Telecommunications:
- Network optimization & predictive maintenance
- Application log analysis (performance, optimization, data security)
Retail and e-commerce:
- Multi-channel sales optimization
- Customer behaviour modelling
- Real-time recommendation engines
Transportation:
- Assets & fleet monitoring
- Fuel consumption optimization
- Traffic management

10x Faster on Ingest and Query Processing
Example: live network monitoring use case with MariaDB
- 1M packets/sec
- Real-time ingest
- Real-time processing
Source: Swarm64

NoSQL: Key Value Store w/Algo-Logic Systems (Intel Partner)
Networking + Data Access Acceleration
Compelling KVS results:
- 150M searches/s
- Sub-microsecond latency with low jitter
- Better energy efficiency
Comprehensive KVS white paper, with comparison to:
- CPU with sockets
- CPU with DPDK
Source: Algo-Logic. Test setup: Intel Core i7-4770K 3.4 GHz CPU and an Intel 82598 [Intel82598] 10 GE NIC running the CentOS (Red Hat-based) operating system, with the OCSM-based KVS implemented in C. The software socket implementation was used as the software baseline and compared to a Nallatech P385 board [P385] with an Intel Altera Stratix V A7 FPGA.
http://algo-logic.com/sites/default/files/key_value_search.pdf

SPARK: Five Acceleration Areas
- Shuffle phase: with high compression ratio (Intel POC)
- Ingest/Kafka: Extract, Transform, Load (ETL) and filtering (partner)
- BigDL: deep learning acceleration (under investigation)
- Machine learning MLlib: ALS, other (Intel POC, partner)
- SQL over SPARK (partner)

Bigstream (startup partner): Hyper-acceleration
The only true in-line acceleration of Big Data/ML using Intel FPGAs
Frictionless acceleration: up to 10x using Intel Arria 10 and Intel Stratix 10 (Source: Bigstream)
- Zero code changes
- Cross platform: Spark, Kafka, TensorFlow
- Cloud or on-prem
Intelligent and adaptive:
- Automatic partitioning of computation between CPU and FPGA
- Overlay dataflow execution on FPGA
Spark SQL TPC-DS results:
- Up to 2.5X with Intel Xeon processor (Source: Bigstream)
- Up to 10X with Intel Arria 10 and Intel Stratix 10 (Source: Bigstream)
Industry targets: FinServ/FinTech, AdTech, Healthcare
Use cases: Spark SQL analytics, ingest/ETL, EDW
Customer POC with Arria 10 late Summer 2017
http://bigstream.co/resources/whitepaper
http://bigstream.co/resources/video-strata
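"Automatic partitioning of computation between CPU and FPGA" can be pictured as splitting an operator pipeline into runs that go to one device or the other. The sketch below is only an illustration of that idea under stated assumptions: the `FPGA_CAPABLE` set and the operator names are invented, and nothing here reflects how Bigstream's product actually decides placement.

```python
from typing import List, Tuple

# Hypothetical set of operator kinds that an FPGA overlay could execute.
FPGA_CAPABLE = {"scan", "filter", "project", "decompress"}


def partition_plan(ops: List[str]) -> List[Tuple[str, List[str]]]:
    """Group consecutive operators into segments tagged 'fpga' or 'cpu'.

    Keeping consecutive operators for the same device together limits the
    number of hand-offs between CPU and FPGA.
    """
    segments: List[Tuple[str, List[str]]] = []
    for op in ops:
        target = "fpga" if op in FPGA_CAPABLE else "cpu"
        if segments and segments[-1][0] == target:
            # Same device as the previous operator: extend the current segment.
            segments[-1][1].append(op)
        else:
            segments.append((target, [op]))
    return segments
```

For a pipeline such as scan, filter, join, aggregate, project, this yields an FPGA segment, a CPU segment, and a final FPGA segment. An adaptive system would presumably also consider data sizes and transfer costs, which this sketch ignores.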

Summary
- Intel has comprehensive standards-based frameworks and APIs for data analytics
- Customers can run analytics workloads without change on these frameworks and Intel APIs with FPGAs underneath
- A single FPGA per server can deliver significant acceleration for multiple workloads
- Broad and developing data analytics ecosystem: partner solutions & POCs

Legal Notices and Disclaimers
Intel technologies' features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. Check with your system manufacturer or retailer or learn more at www.intel.com.
Tests measure performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit www.intel.com/benchmarks.
*Other names and brands may be claimed as the property of others. Intel, the Intel logo, Intel Inside, the Intel Inside logo, Xeon, and Arria are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other countries.
Intel Corporation

Swarm64 Resources
- Public landing page (product sheet, solution sheet, videos): https://www.altera.com/solutions/industry/computer-and-storage/applications/data-analytics/solutions.html
- Overview article: https://www.nextplatform.com/2017/04/18/fpgas-shake-stodgy-relational-databases/
- 451 Research has a report on Swarm64 for subscribers: https://451research.com/report-short?entityid=92588