OpenWLC: A Workload Characterization System. Hong Ong, Rajagopal Subramaniyan, R. Scott Studham ORNL July 19, 2005

Size: px
Start display at page:

Download "OpenWLC: A Workload Characterization System. Hong Ong, Rajagopal Subramaniyan, R. Scott Studham ORNL July 19, 2005"

Transcription

1 OpenWLC: A Workload Characterization System Hong Ong, Rajagopal Subramaniyan, R. Scott Studham ORNL July 19, 2005

2 Outline Background and Motivation Existing Tools and Services NWPerf OpenWLC Overview Goals Preliminary Design Timeline

3 Typical Machine Evaluation Determine a machine s applicability to a range of applications. Use synthetic codes and application kernels to capture performance properties of: Hardware & architecture (e.g., NAS, SPEC, SPLASH). Operating System components (e.g., lmbench). Software systems and applications (e.g., NPB, HPCtoolkits, Genesis, STSWM). Various kinds of middleware. Work with vendors and system architects to improve the systems. Focus: Evaluation of emerging platforms to understand their benefits.

4 What we want to know? Methods to answer the following: What percentage of jobs use all that memory/disk? What is the average job size? How does job size impact efficiency of the calculation? Where is the bottleneck for the average run of a job? What is the sustained performance for the average job? What are the impacts of slow memory to the average user? Could we have less storage on the next system if we had a shared memory system or a shared disk pool? and more. What is the average user doing vs. what the box was designed for?

5 Workload Characterization Focus: Evaluation of deployed platforms to understand how they are used by average users. How effectively is the box used as opposed to what the box was designed for? Determine how existing platforms are utilized by applications and general users Efficiency of applications, System utilization, Average job sizes. Work with application developers, users and manufacturers to decrease application runtime and increase resource utilization.

6 Percent of jobs Most users do not utilize the full capability of the system. Memory footprint per node during FY04 on 11.4TF HPCS2 system at PNNL Most jobs use <25% of available memory (max avail is 6-8G) Large jobs use more memory. I need 6GB of memory per CPU CPU s Used

7 Aggregate Results 10% of the >256CPU jobs have the CPU scheduled for idle >50% of the time. Sustained Performance as a function of CPU count Busy Cycles as a function of job size The median sustained performance for jobs over 256CPU s is 3%.

8 Aggregate Results Stalled cycles as a function of job size Sum of all stalls due to: BE_FLUSH_BUBBLE_ALL Branch misprediction flush or exception BE_EXE_BUBBLE_ALL Execution unit stalls BE_L1D_FPU_BUBBLE_ALL Stalls due to L1D (L1 data cache) micropipeline or FPU (floating-point unit) micropipeline BE_RSE_BUBBLE_ALL Register stack engine (RSE) stalls BACK_END_BUBBLE_FE Frontend stalls

9 Job size - HPCS2 is primarily focused on small capacity jobs. Percent of cycles used by job size 50% <12% of the computer <33% of the computer <50% of the computer >50% of the computer Oct Nov Dec Jan Feb Mar Apr May Jun Jul

10 Some Observation After analyzing 19,883 jobs Median sustained FLOP performance for jobs over 256CPU s is 3% Less than 20% of the memory is in use at any given point in time. Less than 5% of the jobs use heavy IO (>25MB/s per GF DGEMM) Over 60% of the cycles are for jobs that use less than 1/8 of the system. 10% of the >256CPU jobs have the CPUs scheduled for idle >50% of the time. High User expertise Most Utilization Platform Evaluation Computer Design Low Scalability of application on platform we have determined that most users do not use the system as designed. High

11 Existing Tools System monitoring Provide a continuous collection and aggregation of system performance data. Examples PerfMiner Ganglia NWPerf SuperMon CluMon Nagios PCP MonALISA Application monitoring Measure actual application performance via a batch system. Examples PerfMiner NWPerf Our main focus Profiling Provide a static, instrumentation tool, which focuses on source code that users have direct control. Examples PerfSuite HPCToolkits SvPablo TAU Kojak Vampir IPM

12 Existing Tools: Limitations Ganglia: Great tool, easy to use. Difficult to extend in a low system impact fashion. Unpredictable system performance due to unsynchronized collection (ref: Petrini). RRD storage works well for fixed time series graphs, but not suitable for ad-hoc queries. Supermon: Good per-node performance. Polling-based systems are almost implicitly unsynchronized. Yet to provide a storage solution. Node management seemed excessively cumbersome and manual. Kernel module reliance reduces portability and increases administrative cost of deployment (similar to dproc).

13 Existing Tools: Limitations Other solutions such as dproc, CARD, PARAMON have not proved to be able to scale to large clusters. Clumon/PCP oriented towards point-in-time data, not long term systematic analysis (although potential exists for extension).

14 NWPerf 27 metrics are collected on all nodes once per minute Hardware performance counters including: flops, memory bytes/cycle, total stalls Local scratch usage (obtained via fstat() ) Memory swapped out (total), swap blocks in and out Memory free, used, and used as system buffers Block I/O in, and out Kernel scheduler CPU allocation to user, kernel, and idle time Processes running, and blocked Interrupts, and context switches per second. Lustre I/O (Shared global filesystem) All-to-all 256 CPU (12K runs llcbench) execution time [s] None 1X 10X No LSF 100 The 3 graphs are from the same 3 day 600CPU run 0 Min Mean NWPerf: Ryan Mooney, Scott Studham, Ken Schmidt

15 NWPerf: Unresolved Issues Deployment and configuration: Difficult to deploy and tune. User interface: Requires work to standardize and extend existing APIs. Software architecture: Centralized scheduler and collection server constitute single point of failure. Unable to scale to O(1000) nodes Data management: Collection and storage may not be able to scale. Visualization: Simple. Portability: Specific to Linux cluster. PROJECT IS HALTED!

16 What is next? Version1 CASC 2003 Version2 NWPerf Version3 OpenWLC Ease of use Requires User Interaction No User Interaction No User Interaction Portability Hard coded for one computer Designed for a Linux cluster Runable on all major architectures Users O(10) O(100) O(1000) Jobs O(10^2) O(10^4) O(10^5) Metrics collected O(10) O(20) O(40) Deployed sites 1 1 O(10-100)

17 OpenWLC Goals Resolve issues in NWPerf, i.e., Cross-system portability. Standardized system interface. Better visualization. Improved scalability. Perform cross node and cross job event correlation. For example, we want to be able to say job one is using the shared file system so jobs ten and twenty were slowed down. Perform validation tests at different sampling rates to determine preferred sample rates for different data points. Detect and describe event anomaly based on gathered profiles. Perform (potentially) hardware fault analysis. More importantly, we want to characterize more accurately what parts of the system are important for efficient job performance from an average user s perspective. Deployed at DOD MOD and DOE test locations.

18 OpenWLC Design constraints Load > O(40) metrics (e.g., FLOPs, Int Ops, Mem BW, Network BW, Disk BW.) for all jobs run on the system in a (central) database. Cannot impact jobs performance by greater than 1% Fine data granularity to see the needs to use different algorithms. Efficency (FLOPS) 24% average efficiency over 128 nodes (~1/3TF) 80% 70% 60% 50% 40% 30% 20% 10% 0% Time Keep all data to develop mathematical center profiles. ccsd Cannot be architecture specific for portability ccsd(t)

19 Foreseen Critical Challenges Scalability: Single node: Software scalability and features. Doing more without breaking or rewriting code. # of metrics. System-wide (due to increasing number of nodes): Communication management. Network load. Data volume at the collection node. Data management. Visualization. Storage. A sensible workload model to characterize what parts of the system are important for efficient job performance from an average user s perspective.

20 Preliminary OpenWLC Framework

21 Solutions to Scalability Solving single node scalability: Variable sampling frequency e.g., collect set of 10 parameter. Modules/driver may run as a kernel thread. Encode collected data for transmission to collection node to reduce transmission overhead and network traffic. Solving system-wide scalability: Hierarchically layered architecture. Scheduled communication with sub-collectors. Identify a medium term data storage for highly structured time series data. Existing solutions, MySQL and Postgresql, are less than ideal. Better data aggregation methods

22 Data Aggregation and Management Want a high throughput, low overhead, highly synchronized, massively scalable data collection subsystem for collecting arbitrary data. NWPerf does a pretty good job of most of this except the arbitrary data part and ease of use. Supermon has some interesting capabilities in its data encapsulation, but their collection infrastructure has some weaknesses. Maybe a hybrid approach.

23 Analytical/Modeling subsystem. Could use variations of clustering algorithms. Need to identify workload model for different platforms. Problem: This is a huge project in itself. Potential future work. Do not have to re-engineer our current (preliminary) design.

24 Summary High User expertise Computer Platform Design Evaluation Most Utilization High User expertise Platform Computer EvaluatioDesign n Users Capacity Center Utilization Low High Scalability of application on platform Low High Scalability of application on platform Work with the application community, to raise the users awareness on how to efficiently use the systems and ensure the users applications scale on the given NLCF platforms.

25 Benefits Benefit system designers: What is really important (memory, flops, something else, all of the above?) Benefit users: Is my application sized properly? Is it having obvious problems? Benefit code developers: What resources is the application using, does it have good performance? Stop the hype: What is performance, we needed a simple way to compare performance characteristics for a large set of real world applications.

26 Status Collaborators: ANL, Louisiana Tech. Kick-off meeting in June Presently focusing on OpenWLC system design and APIs. Timeline: October 2005 Finalized design and APIs. Early 2006 Beta version.

27 Acknowledgements Funding for OpenWLC is from the Department of Defense High performance Computing Modernization Office. This NWPerf research described in this presentation was performed using the Molecular Science Computing Facility (MSCF) in the William R. Wiley Environmental Molecular Sciences Laboratory, a national scientific user facility sponsored by the U.S. Department of Energy's Office of Biological and Environmental Research and located at the Pacific Northwest National Laboratory. PNNL is operated for the Department of Energy by Battelle. Ryan Mooney and Ken Schmidt for their tireless work to develop NWPerf. Jarek Nieplocha for his guidance on how to quantify system impacts. Experiments and data collection were performed on the Pacific Northwest National Laboratory (PNNL) 977-node Linux 11.8 TFLOPs cluster (HPCS2) with 1954 Itanium-2 processors. The data collection server is a Dual Xeon Dell system with a 1TB ACNC IDE to SCSI Raid Array. This research is sponsored by the Office of Advanced Scientific Computing Research; U.S. Department of Energy. The work was performed at the Oak Ridge National Laboratory, which is managed by UT-Battelle, LLC under Contract No. De-AC05-00OR22725.

Performance monitors for multiprocessing systems

Performance monitors for multiprocessing systems Performance monitors for multiprocessing systems School of Information Technology University of Ottawa ELG7187, Fall 2010 Outline Introduction 1 Introduction 2 3 4 Performance monitors Performance monitors

More information

STAND: New Tool for Performance Estimation of the Block Data Processing Algorithms in High-load Systems

STAND: New Tool for Performance Estimation of the Block Data Processing Algorithms in High-load Systems STAND: New Tool for Performance Estimation of the Block Data Processing Algorithms in High-load Systems Victor Minchenkov, Vladimir Bashun St-Petersburg State University of Aerospace Instrumentation {victor,

More information

ECLIPSE 2012 Performance Benchmark and Profiling. August 2012

ECLIPSE 2012 Performance Benchmark and Profiling. August 2012 ECLIPSE 2012 Performance Benchmark and Profiling August 2012 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource

More information

Simplifying Hadoop. Sponsored by. July >> Computing View Point

Simplifying Hadoop. Sponsored by. July >> Computing View Point Sponsored by >> Computing View Point Simplifying Hadoop July 2013 The gap between the potential power of Hadoop and the technical difficulties in its implementation are narrowing and about time too Contents

More information

Vampir MPI-Performance Analysis

Vampir MPI-Performance Analysis Vampir MPI-Performance Analysis 05.08.2013 FB. Computer Science Scientific Computing Christian Iwainsky 1 How does one gain insight about an HPC applciation? Data has to be presented to the developer in

More information

Vampir. (MPI-) Performance Analysis. Dipl. Inf. Christian Iwainsky Fachgebiet Scientific Computing

Vampir. (MPI-) Performance Analysis. Dipl. Inf. Christian Iwainsky Fachgebiet Scientific Computing Vampir (MPI-) Performance Analysis Dipl. Inf. Christian Iwainsky Fachgebiet Scientific Computing 11.11.2013 FB. Computer Science Scientific Computing Christian Iwainsky 1 How does one gain insight about

More information

An Oracle White Paper January Upgrade to Oracle Netra T4 Systems to Improve Service Delivery and Reduce Costs

An Oracle White Paper January Upgrade to Oracle Netra T4 Systems to Improve Service Delivery and Reduce Costs An Oracle White Paper January 2013 Upgrade to Oracle Netra T4 Systems to Improve Service Delivery and Reduce Costs Executive Summary... 2 Deploy Services Faster and More Efficiently... 3 Greater Compute

More information

IBM xseries 430. Versatile, scalable workload management. Provides unmatched flexibility with an Intel architecture and open systems foundation

IBM xseries 430. Versatile, scalable workload management. Provides unmatched flexibility with an Intel architecture and open systems foundation Versatile, scalable workload management IBM xseries 430 With Intel technology at its core and support for multiple applications across multiple operating systems, the xseries 430 enables customers to run

More information

Moreno Baricevic Stefano Cozzini. CNR-IOM DEMOCRITOS Trieste, ITALY. Resource Management

Moreno Baricevic Stefano Cozzini. CNR-IOM DEMOCRITOS Trieste, ITALY. Resource Management Moreno Baricevic Stefano Cozzini CNR-IOM DEMOCRITOS Trieste, ITALY Resource Management RESOURCE MANAGEMENT We have a pool of users and a pool of resources, then what? some software that controls available

More information

Delivering High Performance for Financial Models and Risk Analytics

Delivering High Performance for Financial Models and Risk Analytics QuantCatalyst Delivering High Performance for Financial Models and Risk Analytics September 2008 Risk Breakfast London Dr D. Egloff daniel.egloff@quantcatalyst.com QuantCatalyst Inc. Technology and software

More information

Accelerate HPC Development with Allinea Performance Tools. Olly Perks & Florent Lebeau

Accelerate HPC Development with Allinea Performance Tools. Olly Perks & Florent Lebeau Accelerate HPC Development with Allinea Performance Tools Olly Perks & Florent Lebeau Olly.Perks@arm.com Florent.Lebeau@arm.com Agenda 09:00 09:15 09:15 09:30 09:30 09:45 09:45 10:15 10:15 10:30 Introduction

More information

<Insert Picture Here> Oracle Exalogic Elastic Cloud: Revolutionizing the Datacenter

<Insert Picture Here> Oracle Exalogic Elastic Cloud: Revolutionizing the Datacenter Oracle Exalogic Elastic Cloud: Revolutionizing the Datacenter Mike Piech Senior Director, Product Marketing The following is intended to outline our general product direction. It

More information

Challenges in Application Scaling In an Exascale Environment

Challenges in Application Scaling In an Exascale Environment Challenges in Application Scaling In an Exascale Environment 14 th Workshop on the Use of High Performance Computing In Meteorology November 2, 2010 Dr Don Grice IBM Page 1 Growth of the Largest Computers

More information

NVIDIA QUADRO VIRTUAL DATA CENTER WORKSTATION APPLICATION SIZING GUIDE FOR SIEMENS NX APPLICATION GUIDE. Ver 1.0

NVIDIA QUADRO VIRTUAL DATA CENTER WORKSTATION APPLICATION SIZING GUIDE FOR SIEMENS NX APPLICATION GUIDE. Ver 1.0 NVIDIA QUADRO VIRTUAL DATA CENTER WORKSTATION APPLICATION SIZING GUIDE FOR SIEMENS NX APPLICATION GUIDE Ver 1.0 EXECUTIVE SUMMARY This document provides insights into how to deploy NVIDIA Quadro Virtual

More information

Virtualizing Enterprise SAP Software Deployments

Virtualizing Enterprise SAP Software Deployments Virtualizing SAP Software Deployments A Proof of Concept by HP, Intel, SAP, SUSE, and VMware Solution provided by: The Foundation of V Virtualizing SAP Software Deployments A Proof of Concept by HP, Intel,

More information

Challenges for Performance Analysis in High-Performance RC

Challenges for Performance Analysis in High-Performance RC Challenges for Performance Analysis in High-Performance RC July 20, 2007 Seth Koehler Ph.D. Student, University of Florida John Curreri Ph.D. Student, University of Florida Dr. Alan D. George Professor

More information

Performance Monitoring and In Situ Analytics for Scientific Workflows

Performance Monitoring and In Situ Analytics for Scientific Workflows Performance Monitoring and In Situ Analytics for Scientific Workflows Allen D. Malony, Xuechen Zhang, Chad Wood, Kevin Huck University of Oregon 9 th Scalable Tools Workshop August 3-6, 2015 Talk Outline

More information

DataStager: Scalable Data Staging Services for Petascale Applications

DataStager: Scalable Data Staging Services for Petascale Applications DataStager: Scalable Data Staging Services for Petascale Applications Hasan Abbasi Matthew Wolf Karsten Schwan Fang Zheng College of Computing, Georgia Institute of Technology Scott Klasky Oak Ridge National

More information

HTCaaS: Leveraging Distributed Supercomputing Infrastructures for Large- Scale Scientific Computing

HTCaaS: Leveraging Distributed Supercomputing Infrastructures for Large- Scale Scientific Computing HTCaaS: Leveraging Distributed Supercomputing Infrastructures for Large- Scale Scientific Computing Jik-Soo Kim, Ph.D National Institute of Supercomputing and Networking(NISN) at KISTI Table of Contents

More information

Addressing the I/O bottleneck of HPC workloads. Professor Mark Parsons NEXTGenIO Project Chairman Director, EPCC

Addressing the I/O bottleneck of HPC workloads. Professor Mark Parsons NEXTGenIO Project Chairman Director, EPCC Addressing the I/O bottleneck of HPC workloads Professor Mark Parsons NEXTGenIO Project Chairman Director, EPCC I/O is key Exascale challenge Parallelism beyond 100 million threads demands a new approach

More information

White paper A Reference Model for High Performance Data Analytics(HPDA) using an HPC infrastructure

White paper A Reference Model for High Performance Data Analytics(HPDA) using an HPC infrastructure White paper A Reference Model for High Performance Data Analytics(HPDA) using an HPC infrastructure Discover how to reshape an existing HPC infrastructure to run High Performance Data Analytics (HPDA)

More information

Accenture* Integrates a Platform Telemetry Solution for OpenStack*

Accenture* Integrates a Platform Telemetry Solution for OpenStack* white paper Communications Service Providers Service Assurance Accenture* Integrates a Platform Telemetry Solution for OpenStack* Using open source software and Intel Xeon processor-based servers, Accenture

More information

Apache Kafka. A distributed publish-subscribe messaging system. Neha Narkhede, 11/11/11

Apache Kafka. A distributed publish-subscribe messaging system. Neha Narkhede, 11/11/11 Apache Kafka A distributed publish-subscribe messaging system Neha Narkhede, LinkedIn @nehanarkhede, 11/11/11 Outline Introduction to pub-sub Kafka at LinkedIn Hadoop and Kafka Design Performance What

More information

Metrics & Benchmarks

Metrics & Benchmarks Metrics & Benchmarks Task 2.3 Marios Dikaiakos George Tsouloupas Dept. of Computer Science University of Cyprus Nicosia, CYPRUS Benchmarks 101 X# Synthetic Codes and Application Kernels with well-defined

More information

Load DynamiX Enterprise 5.2

Load DynamiX Enterprise 5.2 ENTERPRISE DATASHEET TECHNOLOGY VENDORS Load DynamiX Enterprise 5.2 The industry s only collaborative workload acquisition, modeling and performance validation solution for storage technology vendors Key

More information

Hamburg 20 September, 2018

Hamburg 20 September, 2018 Hamburg 20 September, 2018 Data Warehouse Modernization Pivotal Greenplum Dell Greenplum Building Blocks Les Klein EMEA Field CTO, Pivotal @LesKlein Great organizations leverage software, analytics, and

More information

Exalogic Elastic Cloud

Exalogic Elastic Cloud Exalogic Elastic Cloud Mike Piech Oracle San Francisco Keywords: Exalogic Cloud WebLogic Coherence JRockit HotSpot Solaris Linux InfiniBand Introduction For most enterprise IT organizations, years of innovation,

More information

BI CONCEPTS IN MANAGING THE SAS ENVIRONMENT

BI CONCEPTS IN MANAGING THE SAS ENVIRONMENT Paper BI-002 Interrogate the Interrogator: Presenting SAS Usage Information Using BI Tools Emilio Power, ThotWave Technologies, Chapel Hill, NC Shawn Edney, ThotWave Technologies, Chapel Hill, NC ABSTRACT

More information

Leveraging expertise for more economic value from HPC

Leveraging expertise for more economic value from HPC Leveraging expertise for more economic value from HPC November 2014 0 FUJITSU LIMITED 2014 Trends in volume HPC market HPC Revenue $billion Volume HPC business is over 50% of market. Now the fastest growing

More information

GUIDE The Enterprise Buyer s Guide to Public Cloud Computing

GUIDE The Enterprise Buyer s Guide to Public Cloud Computing GUIDE The Enterprise Buyer s Guide to Public Cloud Computing cloudcheckr.com Enterprise Buyer s Guide 1 When assessing enterprise compute options on Amazon and Azure, it pays dividends to research the

More information

HPC USAGE ANALYTICS. Supercomputer Education & Research Centre Akhila Prabhakaran

HPC USAGE ANALYTICS. Supercomputer Education & Research Centre Akhila Prabhakaran HPC USAGE ANALYTICS Supercomputer Education & Research Centre Akhila Prabhakaran OVERVIEW: BATCH COMPUTE SERVERS Dell Cluster : Batch cluster consists of 3 Nodes of Two Intel Quad Core X5570 Xeon CPUs.

More information

Cluster Workload Management

Cluster Workload Management Cluster Workload Management Goal: maximising the delivery of resources to jobs, given job requirements and local policy restrictions Three parties Users: supplying the job requirements Administrators:

More information

A Examcollection.Premium.Exam.35q

A Examcollection.Premium.Exam.35q A2030-280.Examcollection.Premium.Exam.35q Number: A2030-280 Passing Score: 800 Time Limit: 120 min File Version: 32.2 http://www.gratisexam.com/ Exam Code: A2030-280 Exam Name: IBM Cloud Computing Infrastructure

More information

Dell EMC Ready Solutions for HPC Lustre Storage. Forrest Ling HPC Enterprise Technolgist at Dell EMC Greater China

Dell EMC Ready Solutions for HPC Lustre Storage. Forrest Ling HPC Enterprise Technolgist at Dell EMC Greater China Dell EMC Ready Solutions for HPC Lustre Storage Forrest Ling HPC Enterprise Technolgist at Dell EMC Greater China 2018.10.23 Dell EMC Supports HPC Open Source Software Support Open Source Software projects

More information

Increased Informix Awareness Discover Informix microsite launched

Increased Informix Awareness Discover Informix microsite launched Information Management Increased Informix Awareness Discover Informix microsite launched www.ibm.com/discoverinformix 2010 IBM Corporation Informix Panther Early Program Want to be on the cutting-edge

More information

Oracle Financial Services Revenue Management and Billing V2.3 Performance Stress Test on Exalogic X3-2 & Exadata X3-2

Oracle Financial Services Revenue Management and Billing V2.3 Performance Stress Test on Exalogic X3-2 & Exadata X3-2 Oracle Financial Services Revenue Management and Billing V2.3 Performance Stress Test on Exalogic X3-2 & Exadata X3-2 O R A C L E W H I T E P A P E R J A N U A R Y 2 0 1 5 Table of Contents Disclaimer

More information

IBM HPC DIRECTIONS. Dr Don Grice. ECMWF Workshop October 31, IBM Corporation

IBM HPC DIRECTIONS. Dr Don Grice. ECMWF Workshop October 31, IBM Corporation IBM HPC DIRECTIONS Dr Don Grice ECMWF Workshop October 31, 2006 2006 IBM Corporation IBM HPC Directions What s Changing? The Rate of Frequency Improvement is Slowing Moore s Law (Frequency improvement)

More information

A FRAMEWORK FOR CAPACITY ANALYSIS D E B B I E S H E E T Z P R I N C I P A L C O N S U L T A N T M B I S O L U T I O N S

A FRAMEWORK FOR CAPACITY ANALYSIS D E B B I E S H E E T Z P R I N C I P A L C O N S U L T A N T M B I S O L U T I O N S A FRAMEWORK FOR CAPACITY ANALYSIS D E B B I E S H E E T Z P R I N C I P A L C O N S U L T A N T M B I S O L U T I O N S Presented at St. Louis CMG Regional Conference, 4 October 2016 (c) MBI Solutions

More information

Understanding the Behavior of In-Memory Computing Workloads. Rui Hou Institute of Computing Technology,CAS July 10, 2014

Understanding the Behavior of In-Memory Computing Workloads. Rui Hou Institute of Computing Technology,CAS July 10, 2014 Understanding the Behavior of In-Memory Computing Workloads Rui Hou Institute of Computing Technology,CAS July 10, 2014 Outline Background Methodology Results and Analysis Summary 2 Background The era

More information

Optimize code for Intel Xeon Phi. Discovering bottlenecks without pain

Optimize code for Intel Xeon Phi. Discovering bottlenecks without pain Optimize code for Intel Xeon Phi Discovering bottlenecks without pain Agenda Introduction Allinea MAP and optimizing for Xeon Phi Conclusion Allinea Unified environment A modern integrated environment

More information

DELL EMC ISILON INSIGHTIQ

DELL EMC ISILON INSIGHTIQ DATA SHEET DELL EMC ISILON INSIGHTIQ Customizable analytics platform to accelerate workflows and applications on Isilon clusters ESSENTIALS Powerful monitoring and reporting tools to optimize performance

More information

IBM Power Systems. Bringing Choice and Differentiation to Linux Infrastructure

IBM Power Systems. Bringing Choice and Differentiation to Linux Infrastructure IBM Power Systems Bringing Choice and Differentiation to Linux Infrastructure Stefanie Chiras, Ph.D. Vice President IBM Power Systems Offering Management Client initiatives Cognitive Cloud Economic value

More information

Getting Started with vrealize Operations First Published On: Last Updated On:

Getting Started with vrealize Operations First Published On: Last Updated On: Getting Started with vrealize Operations First Published On: 02-22-2017 Last Updated On: 07-30-2017 1 Table Of Contents 1. Installation and Configuration 1.1.vROps - Deploy vrealize Ops Manager 1.2.vROps

More information

Performance Optimization on the Next Generation of Supercomputers

Performance Optimization on the Next Generation of Supercomputers Center for Information Services and High Performance Computing (ZIH) Performance Optimization on the Next Generation of Supercomputers How to meet the Challenges? Zellescher Weg 12 Tel. +49 351-463 - 35450

More information

REQUEST FOR PROPOSAL FOR

REQUEST FOR PROPOSAL FOR REQUEST FOR PROPOSAL FOR HIGH PERFORMANCE COMPUTING (HPC) SOLUTION Ref. No. PHY/ALK/43 (27/11/2012) by DEPARTMENT OF PHYSICS UNIVERSITY OF PUNE PUNE - 411 007 INDIA NOVEMBER 27, 2012 1 Purpose of this

More information

Jack Weast. Principal Engineer, Chief Systems Engineer. Automated Driving Group, Intel

Jack Weast. Principal Engineer, Chief Systems Engineer. Automated Driving Group, Intel Jack Weast Principal Engineer, Chief Systems Engineer Automated Driving Group, Intel From the Intel Newsroom 2 Levels of Automated Driving Courtesy SAE International Ref: J3061 3 Simplified End-to-End

More information

ALCF SITE UPDATE IXPUG 2018 MIDDLE EAST MEETING. DAVID E. MARTIN Manager, Industry Partnerships and Outreach. MARK FAHEY Director of Operations

ALCF SITE UPDATE IXPUG 2018 MIDDLE EAST MEETING. DAVID E. MARTIN Manager, Industry Partnerships and Outreach. MARK FAHEY Director of Operations erhtjhtyhy IXPUG 2018 MIDDLE EAST MEETING ALCF SITE UPDATE DAVID E. MARTIN Manager, Industry Partnerships and Outreach Argonne Leadership Computing Facility MARK FAHEY Director of Operations Argonne Leadership

More information

Bias Scheduling in Heterogeneous Multicore Architectures. David Koufaty Dheeraj Reddy Scott Hahn

Bias Scheduling in Heterogeneous Multicore Architectures. David Koufaty Dheeraj Reddy Scott Hahn Bias Scheduling in Heterogeneous Multicore Architectures David Koufaty Dheeraj Reddy Scott Hahn Motivation Mainstream multicore processors consist of identical cores Complexity dictated by product goals,

More information

WorkloadWisdom Storage performance analytics for comprehensive workload insight

WorkloadWisdom Storage performance analytics for comprehensive workload insight DATASHEET Storage performance analytics for comprehensive workload insight software is the industry s only automated workload acquisition, workload analysis, workload modeling, and workload performance

More information

HPC Workload Management Tools: Tech Brief Update

HPC Workload Management Tools: Tech Brief Update 89 Fifth Avenue, 7th Floor New York, NY 10003 www.theedison.com @EdisonGroupInc 212.367.7400 HPC Workload Management Tools: Tech Brief Update IBM Platform LSF Meets Evolving High Performance Computing

More information

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11 Top 5 Challenges for Hadoop MapReduce in the Enterprise Whitepaper - May 2011 http://platform.com/mapreduce 2 5/9/11 Table of Contents Introduction... 2 Current Market Conditions and Drivers. Customer

More information

Architecture-Aware Cost Modelling for Parallel Performance Portability

Architecture-Aware Cost Modelling for Parallel Performance Portability Architecture-Aware Cost Modelling for Parallel Performance Portability Evgenij Belikov Diploma Thesis Defence August 31, 2011 E. Belikov (HU Berlin) Parallel Performance Portability August 31, 2011 1 /

More information

July, 10 th From exotics to vanillas with GPU Murex 2014

July, 10 th From exotics to vanillas with GPU Murex 2014 July, 10 th 2014 From exotics to vanillas with GPU Murex 2014 COMPANY Selected Industry Recognition and Rankings 2013-2014 OVERALL #1 TOP TECHNOLOGY VENDOR #1 Trading Systems #1 Pricing & Risk Analytics

More information

Dynamic Scheduling Strategy of HTCondor at IHEP

Dynamic Scheduling Strategy of HTCondor at IHEP Dynamic Scheduling Strategy of HTCondor at IHEP Shi, Jingyan (shijy@ihep.ac.cn) On behalf of scheduling group of Computing Center, IHEP 17-5-3 HTCondor Week 2017 1 Content 1 2 3 IHEP Cluster Introduction

More information

New Solution Deployment: Best Practices White Paper

New Solution Deployment: Best Practices White Paper New Solution Deployment: Best Practices White Paper Document ID: 15113 Contents Introduction High Level Process Flow for Deploying New Solutions Solution Requirements Required Features or Services Performance

More information

High Performance Computing(HPC) & Software Stack

High Performance Computing(HPC) & Software Stack IBM HPC Developer Education @ TIFR, Mumbai High Performance Computing(HPC) & Software Stack January 30-31, 2012 Pidad D'Souza (pidsouza@in.ibm.com) IBM, System & Technology Group 2002 IBM Corporation Agenda

More information

Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform

Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform Analytics R&D and Product Management Document Version 1 WP-Urika-GX-Big-Data-Analytics-0217 www.cray.com

More information

StackIQ Enterprise Data Reference Architecture

StackIQ Enterprise Data Reference Architecture WHITE PAPER StackIQ Enterprise Data Reference Architecture StackIQ and Hortonworks worked together to Bring You World-class Reference Configurations for Apache Hadoop Clusters. Abstract Contents The Need

More information

IT Architectural Principles for Designing a Dynamic IT Organization

IT Architectural Principles for Designing a Dynamic IT Organization IT Architectural Principles for Designing a Dynamic IT Organization Tim Durniak, IBM STG CTO for Public Sector Tuesday, August 3, 2010: 4:30 PM-5:30 PM Room 108 (Hynes Convention Center) Abstract and Audience

More information

Windows Server Capacity Management 101

Windows Server Capacity Management 101 Windows Server Capacity Management 101 What is Capacity Management? ITIL definition of Capacity Management is: Capacity Management is responsible for ensuring that adequate capacity is available at all

More information

Application Performance Management for Cloud

Application Performance Management for Cloud Application Performance Management for Cloud CMG By Priyanka Arora prarora803@gmail.com Cloud Adoption Trends 2 Spending on public cloud Infrastructure as a Service hardware and software is forecast to

More information

Micro-Virtualization. Maximize processing power use and improve system/energy efficiency

Micro-Virtualization. Maximize processing power use and improve system/energy efficiency Micro-Virtualization Maximize processing power use and improve system/energy efficiency Disclaimers We don t know everything But we know there is a problem and we re solving (at least part of) it And we

More information

Virtual Workspaces Dynamic Virtual Environments in the Grid

Virtual Workspaces Dynamic Virtual Environments in the Grid Virtual Workspaces Dynamic Virtual Environments in the Grid October 5, 2006 ESAC Grid Workshop '06 Borja Sotomayor University of Chicago Index Virtual Workspaces What is a workspace? Why are VM-based workspaces

More information

We benchmarked several Time Series Databases and quickly found that InfluxDB was the most powerful for our use case.

We benchmarked several Time Series Databases and quickly found that InfluxDB was the most powerful for our use case. Overview Factry wanted to help their customer A&S Energie achieve greater efficiencies with energy generation by using a modern process historian (historians are software used to collect process data from

More information

Operational Insight for MDM

Operational Insight for MDM Operational Insight for MDM Introducing the MDM Hub Analyzer Presented By: Hosted By: Jeff Klagenberg VP, Products & Solutions; GlobalSoft. Webinar Agenda Introduction to GlobalSoft Challenges for Operational

More information

Gartner RPE2 Methodology Overview

Gartner RPE2 Methodology Overview Gartner RPE2 Methodology Overview 1 Contents Introduction 2 RPE2 Background 3 RPE2 Definition 3 RPE2 Workload Extensions 4 Recommended Uses for RPE2 5 Server Consolidation Projects 6 Server Purchase Assessments

More information

RODOD Performance Test on Exalogic and Exadata Engineered Systems

RODOD Performance Test on Exalogic and Exadata Engineered Systems An Oracle White Paper March 2014 RODOD Performance Test on Exalogic and Exadata Engineered Systems Introduction Oracle Communications Rapid Offer Design and Order Delivery (RODOD) is an innovative, fully

More information

Deep Learning Acceleration with

Deep Learning Acceleration with Deep Learning Acceleration with powered by A Technical White Paper TABLE OF CONTENTS The Promise of AI The Challenges of the AI Lifecycle and Infrastructure MATRIX Powered by Bitfusion Flex Solution Architecture

More information

An Oracle White Paper September, Oracle Exalogic Elastic Cloud: A Brief Introduction

An Oracle White Paper September, Oracle Exalogic Elastic Cloud: A Brief Introduction An Oracle White Paper September, 2010 Oracle Exalogic Elastic Cloud: A Brief Introduction Introduction For most enterprise IT organizations, years of innovation, expansion, and acquisition have resulted

More information

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Copyright 2012, Oracle and/or its affiliates. All rights reserved. 1 Exadata Database Machine IOUG Exadata SIG Update February 6, 2013 Mathew Steinberg Exadata Product Management 2 The following is intended to outline our general product direction. It is intended for

More information

Deploying IBM Cognos 8 BI on VMware ESX. Barnaby Cole Practice Lead, Technical Services

Deploying IBM Cognos 8 BI on VMware ESX. Barnaby Cole Practice Lead, Technical Services Deploying IBM Cognos 8 BI on VMware ESX Barnaby Cole Practice Lead, Technical Services Agenda > Overview IBM Cognos 8 BI Architecture VMware ESX > Deployment Options > Our Testing > Optimization of VMware

More information

High-Performance Computing (HPC) Up-close

High-Performance Computing (HPC) Up-close High-Performance Computing (HPC) Up-close What It Can Do For You In this InfoBrief, we examine what High-Performance Computing is, how industry is benefiting, why it equips business for the future, what

More information

Accelerate Insights with Topology, High Throughput and Power Advancements

Accelerate Insights with Topology, High Throughput and Power Advancements Accelerate Insights with Topology, High Throughput and Power Advancements Michael A. Jackson, President Wil Wellington, EMEA Professional Services May 2014 1 Adaptive/Cray Example Joint Customers Cray

More information

BUILDING A PRIVATE CLOUD

BUILDING A PRIVATE CLOUD INNOVATION CORNER BUILDING A PRIVATE CLOUD How Platform Computing s Platform ISF* Can Help MARK BLACK, CLOUD ARCHITECT, PLATFORM COMPUTING JAY MUELHOEFER, VP OF CLOUD MARKETING, PLATFORM COMPUTING PARVIZ

More information

Product Brief SysTrack VMP

Product Brief SysTrack VMP Product Brief SysTrack VMP Benefits Optimize desktop and server virtualization and terminal server projects Anticipate and handle problems in the planning stage instead of postimplementation Use iteratively

More information

Course 20535A: Architecting Microsoft Azure Solutions

Course 20535A: Architecting Microsoft Azure Solutions Course 20535A: Architecting Microsoft Azure Solutions Module 1: Application Architecture Patterns in Azure This module introduces and reviews common Azure patterns and architectures as prescribed by the

More information

Introduction to IBM Insight for Oracle and SAP

Introduction to IBM Insight for Oracle and SAP Introduction to IBM Insight for Oracle and SAP Sazali Baharom (sazali@my.ibm.com) ISV Solutions Architect IBM ASEAN Technical Sales Target Audience Consultants, Managers, Sr. Managers Infrastructure Service

More information

Stateful Services on DC/OS. Santa Clara, California April 23th 25th, 2018

Stateful Services on DC/OS. Santa Clara, California April 23th 25th, 2018 Stateful Services on DC/OS Santa Clara, California April 23th 25th, 2018 Who Am I? Shafique Hassan Solutions Architect @ Mesosphere Operator 2 Agenda DC/OS Introduction and Recap Why Stateful Services

More information

Services Guide April The following is a description of the services offered by PriorIT Consulting, LLC.

Services Guide April The following is a description of the services offered by PriorIT Consulting, LLC. SERVICES OFFERED The following is a description of the services offered by PriorIT Consulting, LLC. Service Descriptions: Strategic Planning: An enterprise GIS implementation involves a considerable amount

More information

AMD and Cloudera : Big Data Analytics for On-Premise, Cloud and Hybrid Deployments

AMD and Cloudera : Big Data Analytics for On-Premise, Cloud and Hybrid Deployments August, 2018 AMD and Cloudera : Big Data Analytics for On-Premise, Cloud and Hybrid Deployments Standards Based AMD is committed to industry standards, offering you a choice in x86 architecture with design

More information

HPC in the Cloud Built for Atmospheric Modeling. Kevin Van Workum, PhD Sabalcore Computing Inc.

HPC in the Cloud Built for Atmospheric Modeling. Kevin Van Workum, PhD Sabalcore Computing Inc. HPC in the Cloud Built for Atmospheric Modeling Kevin Van Workum, PhD Sabalcore Computing Inc. kevin@sabalcore.com About Us HPC in the Cloud provider since 2000 Focused on Engineering and Scientific application

More information

The IBM and Oracle alliance. Power architecture

The IBM and Oracle alliance. Power architecture IBM Power Systems, IBM PowerVM and Oracle offerings: a winning combination The smart virtualization platform for IBM AIX, Linux and IBM i clients using Oracle solutions Fostering smart innovation through

More information

The Cost of IT. .

The Cost of IT.  . The Cost of IT Prepared By Astel Email Document info@astel.co.za White Paper No copies of this document or included information may be distributed to third parties without the expressed written permission

More information

Oracle Communications Billing and Revenue Management Elastic Charging Engine Performance. Oracle VM Server for SPARC

Oracle Communications Billing and Revenue Management Elastic Charging Engine Performance. Oracle VM Server for SPARC Oracle Communications Billing and Revenue Management Elastic Charging Engine Performance Oracle VM Server for SPARC Table of Contents Introduction 1 About Oracle Communications Billing and Revenue Management

More information

Increasing Enterprise Support Demand & Complexity

Increasing Enterprise Support Demand & Complexity PTC System Monitor Increasing Enterprise Support Demand & Complexity Diagnostics & Troubleshooting Tools based on Customer & TS Requirements Customer Challenges Visibility into System Health Time To Resolution

More information

ANALYSIS OF BENEFITS AND DRAWBACKS OF TRADITIONAL ERP VERSUS CLOUD BASED ERP SYSTEMS

ANALYSIS OF BENEFITS AND DRAWBACKS OF TRADITIONAL ERP VERSUS CLOUD BASED ERP SYSTEMS ANALYSIS OF BENEFITS AND DRAWBACKS OF TRADITIONAL ERP VERSUS CLOUD BASED ERP SYSTEMS Naveen Chandra 1 & Parag Rastogi 2 International Journal of Latest Trends in Engineering and Technology Vol.(9)Issue(4),

More information

E-BUSINESS SUITE APPLICATIONS R12 (RUP 4) LARGE/EXTRA-LARGE PAYROLL (BATCH) BENCHMARK -

E-BUSINESS SUITE APPLICATIONS R12 (RUP 4) LARGE/EXTRA-LARGE PAYROLL (BATCH) BENCHMARK - O RACLE E-BUSINESS B ENCHMARK R EV. 1.0 E-BUSINESS SUITE APPLICATIONS R12 (RUP 4) LARGE/EXTRA-LARGE PAYROLL (BATCH) BENCHMARK - USING ORACLE10g ON AN IBM SYSTEM X3550 M2 SERVER As a global leader in e-business

More information

CAPACITY MANAGEMENT GUIDELINES FOR VBLOCK INFRASTRUCTURE PLATFORMS

CAPACITY MANAGEMENT GUIDELINES FOR VBLOCK INFRASTRUCTURE PLATFORMS CAPACITY MANAGEMENT GUIDELINES FOR VBLOCK INFRASTRUCTURE PLATFORMS August 2011 WHITE PAPER 2011 VCE Company, LLC. All rights reserved. 1 Table of Contents Executive Summary... 3 Why is Capacity Management

More information

Deep Learning Insurgency

Deep Learning Insurgency Deep Learning Insurgency Data holds competitive value What your data would say if it could talk Amount of data worldwide by 2020 44 zettabytes Internet Of Things Mobile unstructured data structured data

More information

ORACLE COMMUNICATIONS SELFRELIANT

ORACLE COMMUNICATIONS SELFRELIANT SELFRELIANT SERVICE-AVAILABILITY MIDDLEWARE WITH CARRIER-GRADE PERFORMANCE KEY BENEFITS Complete, integrated availability management, distributed messaging, and embedded systems management facilitate 99.999

More information

DELL EMC POWEREDGE 14G SERVER PORTFOLIO

DELL EMC POWEREDGE 14G SERVER PORTFOLIO DELL EMC POWEREDGE 14G SERVER PORTFOLIO Transformation without compromise Seize your share of a huge market opportunity and accelerate your business by combining sales of the exciting new Dell EMC PowerEdge

More information

A TECHNICAL LOOK INSIDE ASE 15.7 S HYBRID THREADED KERNEL

A TECHNICAL LOOK INSIDE ASE 15.7 S HYBRID THREADED KERNEL A TECHNICAL LOOK INSIDE ASE 15.7 S HYBRID THREADED KERNEL K 21 ASE S KERNEL DESIGN FOR THE 21 ST CENTURY DIAL IN NUMBER: 866.803.2143 210.795.1098 PASSCODE: SYBASE A TECHNICAL LOOK INSIDE ASE 15.7 S HYBRID

More information

Cray and Allinea. Maximizing Developer Productivity and HPC Resource Utilization. SHPCP HPC Theater SEG 2016 C O M P U T E S T O R E A N A L Y Z E 1

Cray and Allinea. Maximizing Developer Productivity and HPC Resource Utilization. SHPCP HPC Theater SEG 2016 C O M P U T E S T O R E A N A L Y Z E 1 Cray and Allinea Maximizing Developer Productivity and HPC Resource Utilization SHPCP HPC Theater SEG 2016 10/27/2016 C O M P U T E S T O R E A N A L Y Z E 1 History Cluster Computing to Super Computing

More information

Server Configuration Monitor

Server Configuration Monitor EVALUATOR S GUIDE Server Configuration Monitor How SCM can help improve your efficiency and increase the performance and security of your servers. WHAT IS SOLARWINDS SERVER CONFIGURATION MONITOR SolarWinds

More information

GPU-Meta-Storms: Computing the similarities among massive microbial communities using GPU

GPU-Meta-Storms: Computing the similarities among massive microbial communities using GPU GPU-Meta-Storms: Computing the similarities among massive microbial communities using GPU Xiaoquan Su $, Xuetao Wang $, JianXu, Kang Ning* Shandong Key Laboratory of Energy Genetics, CAS Key Laboratory

More information

ORACLE DATABASE PERFORMANCE: VMWARE CLOUD ON AWS PERFORMANCE STUDY JULY 2018

ORACLE DATABASE PERFORMANCE: VMWARE CLOUD ON AWS PERFORMANCE STUDY JULY 2018 ORACLE DATABASE PERFORMANCE: VMWARE CLOUD ON AWS PERFORMANCE STUDY JULY 2018 Table of Contents Executive Summary...3 Introduction...3 Test Environment... 4 Test Workload... 6 Virtual Machine Configuration...

More information

ANSYS FLUENT Performance Benchmark and Profiling. October 2009

ANSYS FLUENT Performance Benchmark and Profiling. October 2009 ANSYS FLUENT Performance Benchmark and Profiling October 2009 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, ANSYS, Dell, Mellanox Compute

More information

Overview and Frequently Asked Questions

Overview and Frequently Asked Questions Overview and Frequently Asked Questions OVERVIEW On April 20, 2009, Oracle announced that it has agreed to acquire Sun Microsystems. The transaction is subject to regulatory approval and until such time

More information

How Jukin Media Leverages Metricly for Performance and Capacity Monitoring

How Jukin Media Leverages Metricly for Performance and Capacity Monitoring CASE STUDY How Jukin Media Leverages Metricly for Performance and Capacity Monitoring Jukin Media is a global entertainment company powered by user-generated video content. Jukin receives more than 2.5

More information

Aspects of Fair Share on a heterogeneous cluster with SLURM

Aspects of Fair Share on a heterogeneous cluster with SLURM Aspects of Fair Share on a heterogeneous cluster with SLURM Karsten Balzer Computing Center, CAU Kiel balzer@rz.uni-kiel.de 88-4659 ZKI Tagung Heidelberg 216 (13th Oct. 216, 16:3-17:) Background I HPC

More information