HPC Clusters as code in the [almost]* Infinite cloud

Similar documents
AWS for Finance Institutes

Koen van den Biggelaar Senior Manager, Solutions Architecture Amazon Web Services

MIGRATING MEDIA WORKFLOWS TO THE CLOUD. Scott Malkie, Systems Engineer

Wandel der IT im Kontext Digitalisierung

Oracle Autonomous Data Warehouse Cloud

Realize Your Product Promise

"Charting the Course... MOC A: Architecting Microsoft Azure Solutions. Course Summary

Building a Data Lake on AWS

Oracle Autonomous Data Warehouse Cloud

Azure. Bruno Kovačić Axilis, Microsoft MVP

Oracle's Cloud Strategie für den Geschäftserfolg Alles Neue von der OOW

Insights to HDInsight

Oracle Autonomous Data Warehouse Cloud

Sr. Sergio Rodríguez de Guzmán CTO PUE

Microsoft Azure Essentials

Cost Optimization for Cloud-Based Engineering Simulation Using ANSYS Enterprise Cloud

Zero to Federated at the Speed of Jenkins. A Case Study of Success in DevOps

Microsoft FastTrack For Azure Service Level Description

BARCELONA. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved

Innovate with Oracle Public Cloud Platform & Infrastructure Services

House Keeping. You are in Listen Only Mode. Azure 101: Azure Overview. Azure 201: How to do a Cost Estimate for Virtual Machines

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2

EBOOK: Cloudwick Powering the Digital Enterprise

MIGRATING AND MANAGING MICROSOFT WORKLOADS ON AWS WITH DATAPIPE DATAPIPE.COM

AWS Architecture Case Study: Real-Time Bidding. Tom Maddox, Solutions Architect

Gartner's Top 10 Strategic Technology Trends for 2014

Implementing Microsoft Azure Infrastructure Solutions

Digitalisieren Sie Ihr Unternehmen mit dem Internet der Dinge Michael Epprecht Microsoft GBB IoT

Please note. The development, release, and timing of any future features or functionality described for our products remains at our sole discretion.

Architecture Overview for Data Analytics Deployments

Azure vs. AWS. How to Decide Between Microsoft Azure and Amazon Web Services

Oracle Big Data Cloud Service

Introduction. Highlights. Prepare Library Sequence Analyze Data

Oracle Platform as a Service and Infrastructure as a Service Public Cloud Service Descriptions-Metered & Non-Metered.

Scalability and High Performance with MicroStrategy 10

Knauf builds high-speed business insight with SAP and IBM

The Benefits of Using an Appliance to Simplify and Accelerate Desktop Virtualization Initiatives

Big Data in Cloud. 堵俊平 Apache Hadoop Committer Staff Engineer, VMware

COURSE OUTLINE: Course 20533C- Implementing Microsoft Azure Infrastructure Solutions

HyperCloud. IT s Cloud Dilemma

Increased Informix Awareness Discover Informix microsite launched

POWER BI OVERVIEW & FEATURES JANUARY 2017, SINGAPORE. Khilitchandra Prajapati

Microsoft Monitoring and Operating a Private Cloud

Research Report. The Major Difference Between IBM s LinuxONE and x86 Linux Servers

Business is being transformed by three trends

Cloud service models

A crash course in Microsoft 365 Business. Achieve more in your business with an integrated security, management and productivity solution all in one.

Building Real-time and Responsive Applications on Azure. Girish Phadke & Maneesha Marathe Tata Consultancy Services Ltd.

Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

Advancing the adoption and implementation of HPC solutions. David Detweiler

UForge AppCenter 3.8. Introduction March Copyright 2018 FUJITSU LIMITED

EMC Navisphere Management Suite

Mike Strickland, Director, Data Center Solution Architect Intel Programmable Solutions Group July 2017

BOARD 10: the future of decision making

Building Your Big Data Team

Take a Tour of Native Hybrid Cloud & Neutrino. Modern, cloud native platforms

Amazon WorkSpaces Cost Optimizer

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW

Design Virtualization and Its Impact on SoC Design

Hadoop in the Cloud. Ryan Lippert, Cloudera Product Cloudera, Inc. All rights reserved.

#mstrworld. A Deep Dive Into Self-Service Data Discovery In MicroStrategy. Vijay Anand Gianthomas Tewksbury Volpe. #mstrworld

IIoT Data Access with the PI System

Future City Challenge Coaching & more

What s New for Oracle Big Data Cloud Service. Topics: Oracle Cloud. What's New for Oracle Big Data Cloud Service Version

When is HPC cloud computing right for you?

100 Million Subscriber Performance Test Whitepaper:

Deep Dive. Into MS Operations Management Suite. Pete

Real World Use Cases: Hadoop & NoSQL in Production. Big Data Everywhere London 4 June 2015

A Cloud Migration Checklist

BOARD 10: the future of decision making

SOA, Web 2.0, and Web Services

Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform

How to Build Your Data Ecosystem with Tableau on AWS

MEETS MOBILE MAINFRAME. Access information anytime, anywhere

FUJITSU Technical Computing Solution UNCAI

Digital business processes

Service Provider Integrates Mainframe Operations with Microsoft SQL Server 2005

Plan, Deploy and Configure Microsoft InTune

DELL EMC POWEREDGE 14G SERVER PORTFOLIO

NetScaler Management and Analytics System (MAS)

HPE Flexible Capacity with Microsoft Azure & Azure Stack

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

May 2012 Oracle Spatial User Conference

AVANTUS TRAINING PTE LTD

FlashStack For Oracle RAC

RETAIL ANALYTICS KX FOR RETAIL SOLUTIONS

Highlights. Global Information Technology Provider. The Challenges

Migrating to. Microsoft Azure

How to sell Azure to SMB customers. Paul Bowkett Microsoft NZ

the cloud platform for sap business one

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

MCSE: Private Cloud Training Course (System Center 2012)

Sizing SAP Central Process Scheduling 8.0 by Redwood

Multi-Containers Orchestration with Live Migration and High-Availability for Microservices

OS A superior platform for business. Operate seamlessly, automatically, and intelligently anywhere in the world

DevOps architecture overview

Implementing Microsoft Azure Infrastructure Solutions

Challenges of deploying your HPC application to the cloud

Transcription:

HPC Clusters as code in the [almost]* Infinite cloud Brendan Bouffler, Global Scientific Computing @boofla #scico 2016-06-30 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Science is one of the greatest areas of computation Amazon huge, really disruptive, impact what we are about

Research means Collaboration Map of scientific collaboration between researchers - Olivier H. Beauchesne - http://bit.ly/e9ekp2

Existing 1. Oregon 2. California 3. Virginia 4. Dublin 5. Frankfurt 6. Singapore 7. Sydney 8. Seoul 9. Tokyo 10. Sao Paulo 11. Beijng 12. US GovCloud 1. Ohio 2. India 3. UK 4. Canada 5. China+1 AWS Region Availability Zone regions are sovereign leaves your data never

More time spent computing the data than moving the data.

Peak: 58K cores Valley: 12K cores

In the first year: Over 400,000 scenes available Over 1 billion hits globally Used for new product development by: Colin Reilly Senior Director GIS NYC Department of IT & Telecom

ESA, Planet Labs, Copernicus data from ESA s Sentinels & Zooniverse The first live public test for an effort dubbed the Planetary Response Network (PRN) Within 2 hours of the Ecuador test project going live with a first set of 1,300 images, each photo had been checked at least 20 times. It was one of the fastest responses I ve seen, says Brooke Simmons, an astronomer at the University of California, San Diego, who leads the image processing. Steven Reece, who heads the Oxford team s machine-learning effort, says that results a heat map of damage with possible road blockages were ready in another two hours.

Cray Supercomputer

A top500 supercomputer 2013-style

Introducing Alces Flight - self-scaling HPC clusters instantly ready to compute, billed by the hour and using the AWS Spot market by default to achieve supercomputing for ~1c per core per hour. 750+ popular scientific applications AWS Marketplace immediately http://boofla.io/u/alcesflight

Virtual Private Cloud Auto-scaling group Head node Instance /shared Compute node Instance Compute node Instance Compute node Instance Compute node Instance 10G Network Head Instance 2 or more cores (as needed) CentOS 7.x OpenMPI, gcc etc Choice of scheduler: SGE (now) PBSpro & SLURM (coming soon) Compute Instances 2 or more cores (as needed) CentOS 7.x, Amazon Linux Auto Scaling group driven by scheduler queue length. Can start with 0 (zero) nodes and only scale when there are jobs.

Wall clock time: ~1 hour Wall clock time: ~1 week Cost: the same

TECHNICAL & BUSINESS SUPPORT AWS MARKETPLACE MANAGEMENT TOOLS PLATFORM SERVICES ENTERPRISE APPS HYBRID CLOUD MANAGEMENT Support Big Data & HPC Queuing Analytics Data Warehousing Mobile Sync Development Containers App Mobile & Web Front-end Virtual Desktops Direct Connect Professional Services Business Apps Notifications Hadoop Identity Source Code Functions Sharing & Collaboration Identity Federation Partner Ecosystem Security Search Streaming Push Notifications Build Tools Identity Email & Calendaring Deployment Training & Certification Development Orchestration Data Pipelines Mobile Analytics Deploymen t Data Store Directories Backups Email Machine Learning Mobile Backend DevOps Real-time Storage Gateway Integrated Management Account Management Backup SECURITY & MANAGEMENT Solutions Architects Databases Virtual Private Networks Identity & Access Encryption Keys Configuration Monitoring Dedicated Security & Pricing Reports Industry Solutions Regions Availability Zones Compute INFRASTRUCTURE SERVICES Storage Objects, Blocks, Files Databases SQL, NoSQL, Caching Networking CDN AWS Building blocks

C4 Intel Xeon E5-2666 v3, custom built for AWS. Intel Haswell, 16 FLOPS/tick 2.9 GHz, turbo to 3.5 GHz Feature Processor Number Intel Smart Cache Instruction Set Specification E5-2666 v3 25 MiB 64-bit Instruction Set Extensions AVX 2.0 Lithography Processor Base Frequency Max All Core Turbo Frequency Max Turbo Frequency Intel Turbo Boost Technology 2.0 Intel vpro Technology Intel Hyper-Threading Technology Intel Virtualization Technology (VT-x) Intel Virtualization Technology for Directed I/O (VT-d) Intel VT-x with Extended Page Tables (EPT) 22 nm 2.9 GHz 3.2 GHz 3.5 GHz (available on c4.2xlarge) Yes Yes Yes Yes Yes http://docs.aws.amazon.com/awsec2/latest/userguide/c4-instances.html Intel 64 Yes

All the traditional command-line tools will be familiar, but you can also create an Alces session and immediately launch a desktop view of your cluster to run graphical apps. Command Line (ssh) Graphical Console

graphical console simultaneously with your collaborators work together with the visual console in realtime Make more discoveries faster

There are cluster filesystem options, too for when you need extreme I/O scaling.

Disrupting research, wherever it s happening.

39 years of computational chemistry in 9 hours Novartis ran a project that involved virtually screening 10 million compounds against a common cancer target in less than a week. They calculated that it would take 50,000 cores and close to a $40 million investment if they wanted to run the experiment internally. Partnering with Cycle Computing and Amazon Web Services (AWS), Novartis built a platform thst ran across 10,600 Spot Instances (~87,000 cores) and allowed Novartis to conduct 39 years of computational chemistry in 9 hours for a cost of $4,232. Out of the 10 million compounds screened, three were successfully identified.

CHILES will produce the first HI deep field, to be carried out with the VLA in B array and covering a redshift range from z=0 to z=0.45. The field is centered at the COSMOS field. It will produce neutral hydrogen images of at least 300 galaxies spread over the entire redshift range. The team at ICRAR in Australia have been able to implement the entire processing pipeline in the cloud for around $2,000 per month by exploiting the SPOT market, which means the $1.75M they otherwise needed to spend on an HPC cluster can be spent on way cooler things that impact their research like astronomers.

1. AWS Account 3. A problem to solve