SHENGYUAN LIU, JUNGANG XU, ZONGZHENG LIU, XU LIU & RICE UNIVERSITY

Size: px
Start display at page:

Download "SHENGYUAN LIU, JUNGANG XU, ZONGZHENG LIU, XU LIU & RICE UNIVERSITY"

Transcription

1 EVALUATING TASK SCHEDULING IN HADOOP-BASED CLOUD SYSTEMS SHENGYUAN LIU, JUNGANG XU, ZONGZHENG LIU, XU LIU UNIVERSITY OF CHINESE ACADEMY OF SCIENCES & RICE UNIVERSITY

2 OUTLINE Background & Motivation Hadoop Task scheduler Benchmark & Methodology Evaluation CONCLUSIONS & Future work

3 PRIVATE CLOUD "The NIST Definition of Cloud Computing", National Institute of fstandards d and dtechnology. Retrieved 24 July 2011 The cloud infrastructure is provisioned for exclusive use by a single organization comprising multiple consumers (e.g., business units). It may be owned, managed, and operated by the organization, a third party, or some combination of them, and it may exist on or off premises.

4 MOTIVATION A private cloud serves multiple users. Different ee ttask priorities tes Different task types Different task data sizes Optimizing the performance of private cloud is necessary and urgent A challenge for task scheduling!

5 OUTLINE Background & Motivation Hadoop Task scheduler Benchmark & Methodology Evaluation CONCLUSIONS C O S & Future work

6 HADOOP OVERVIEW Hadoop An open-source software framework for processing a large volume of data on a cluster

7 HADOOP TASK SCHEDULER FIFO Naïve Fair sharing Fair Sharing with Delay Scheduling Capacity Scheduling HOD

8 OUTLINE Background & Motivation Hadoop Task scheduler Benchmark & Methodology Evaluation CONCLUSIONS C O S & Future work

9 CLOUDRANK-D A benchmark presented by ICT of CAS A benchmark suite for private cloud Help researchers to simulate various multi-user applications in industrial scenarios Benchmark provides a set of 13 representative data analysis tools Basic operations Data mining operations Data warehouse operations

10 DATA SOURCES OF EACH PROGRAM IN CLOUDRANK-D Application Sort Word count Grep Naive Bayes Support vector machine K-means Item based collaborative filtering Frequent pattern growth Hidden Markov model Grep select Ranking select User visits aggregation User visits-rankings join Data sources Automatically generated News and Wikipedia Scientist search Sougou corpus Ratings on movies Retail market basket data Click-stream data of an on-line news portal Traffic accident data Collection of web html document Scientist search Automatically generated table

11 CONTENT Background & Motivation Hadoop Task scheduler Benchmark & Methodology Evaluation CONCLUSIONS C O S & Future work

12 WORKLOAD DESIGN Image processing Log processing Data mining Reporting 2% Text indexing Web crawling Machine learning Data storage 17% 17% 11% 16% Web crawling Data mining i Machine learning 15% Image Processing Text Indexing Log Processing 15% Reporting 7% Data Storage Applications in CloudRank-D Percent private clouds Applications age Naive Bayes SVM HMM IBCF FPG Basic Operations 35% 31% Hive 34%

13 WORKLOAD DESIGN Category Application Jobs 100 Jobs Basic Operations Data Mining Operations Data Warehouse Operations Sort 9 Word count 11 Grep 11 Naïve Bayes 6 Support vector machine 6 K-means 7 Item based collaborative 3 Frequent pattern growth 7 Hidden Markov model 6 Grep select Ranking select user visits aggregation 34 user visits-rankings join

14 JOB SUBMITTING Follows the distribution of input data size in Taobao Follows an exponential distribution with a mean of 14 seconds(facebook) Job submitted in a random order Input Data size Percentage <25MB 40.57% 25MB-625MB 39.33% 1.2GB-5GB 12.03% >5GB 8.07%

15 TESTBED Hadoop cluster with 5 nodes (1 NameNode,4 DataNodes) CPU Type Intel Xeon E5645 Intel CPU Core 6 cores@2.40g L1 D/I Cache L2 Cache L3 Cache Memory Disk 6 32 KB KB 12MB 16GB 8TB OS Hadoop Mahout Hive CentOS

16 HADOOP CONFIGURATION Hadoop Parameter Value Description The maximum number of map tasks that mapred.tasktracker. dt k 12 will be executed simultaneously by a task map.tasks.maximum tracker. mapred.tasktracker.r The maximum number of reduce tasks that educe.tasks.maximu m 12 will be executed simultaneously by a task tracker. mapred.map.tasks 48 Maximum number of concurrent running reduce task. mapred.reduce.tasks 45 Maximum number of concurrent running map task. dfs.replication 2 The actual number of replications specified when the file is created. mapreduce.tasktrack er.outofband.heartbe tb TRUE Open the out of band heartbeat. t at

17 HADOOP SCHEDULER EVALUATION Data Processed per Second Turnaround time Running time Waiting Time Throughput

18 DATA PROCESSED PER SECOND Total runnin ng time (103 3s) DPS (MB/s s) Fair with DS Naïve Fair Capacity FIFO HOD Task Scheduler 0 Fair with DS Naïve Fair Capacity FIFO HOD Task Scheduler The total running time (10 3 sec) of running full workload by using five schedulers respectively The Data Processed per Second (Megabytes processed per second) of running full workload by using five schedulers respectively.

19 TURNAROUND TIME Turn around time (103s) Fair with DS Naïve Fair Capacity FIFO HOD Task Scheduler The average job turnaround time (10 3 sec) of running full workload by using five schedulers respectively.

20 AVERAGE JOB RUNNING TIME & WAITING TIME Running tim me (103s) sec.) Wa aiting time ( Task Scheduler Fair with DS Naïve Fair Capacity FIFO HOD Task Scheduler The average job running time (10 3 sec) of running full workload by using five schedulers respectively. Average job waiting time (second) of running full workload by using five schedulers respectively.

21 THROUGHPUT Th hroughput (j jobs/min) Fair with DS Naïve Fair Capacity FIFO HOD Task Scheduler The throughput (number of jobs processed in one minute) of running The throughput (number of jobs processed in one minute) of running full workload by using five schedulers respectively

22 EVALUATION RESULT ANALYSIS Fair with delay scheduling scheduler is the most efficient scheduler some jobs with large size will have longer time to finish than usual jobs Fair with delay scheduling, naïve fair, capacity, these three schedulers are all have the better performance than default FIFO scheduler HOD h d l f d t ll HOD scheduler preformed not very well, affected by the extra cost of virtualization

23 CONCLUSIONS & FUTURE WORK Optimizing i i the performance of Hadoop clusters is very necessary and significant The choice of task schedulers is very critical for system performance improvement of Hadoop cluster With fair sharing with delay scheduling, DPS is improved by 20% than that of FIFO scheduler Optimization and design of the scheduler need to refer to the characteristics of the workload In the future, we will use more complex workloads to study and evaluate more efficient task schedulers for Hadoop based cloud systems

24 Q & A THANKS! SOUNDER_LIU@163.COM, XUJG@UCAS.AC.CN

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information

Scalability and High Performance with MicroStrategy 10

Scalability and High Performance with MicroStrategy 10 Scalability and High Performance with MicroStrategy 10 Enterprise Analytics and Mobility at Scale. Copyright Information All Contents Copyright 2017 MicroStrategy Incorporated. All Rights Reserved. Trademark

More information

Infor LN Minimum hardware requirements. Sizing Documentation

Infor LN Minimum hardware requirements. Sizing Documentation Infor LN Minimum hardware requirements Sizing Documentation Copyright 2014 Infor Important Notices The material contained in this publication (including any supplementary information) constitutes and contains

More information

Brian Macdonald Big Data & Analytics Specialist - Oracle

Brian Macdonald Big Data & Analytics Specialist - Oracle Brian Macdonald Big Data & Analytics Specialist - Oracle Improving Predictive Model Development Time with R and Oracle Big Data Discovery brian.macdonald@oracle.com Copyright 2015, Oracle and/or its affiliates.

More information

ORACLE S PEOPLESOFT HRMS 9.1 FP2 SELF-SERVICE

ORACLE S PEOPLESOFT HRMS 9.1 FP2 SELF-SERVICE O RACLE E NTERPRISE B ENCHMARK R EV. 1.1 ORACLE S PEOPLESOFT HRMS 9.1 FP2 SELF-SERVICE USING ORACLE DB 11g FOR LINUX ON CISCO UCS B460 M4 AND B200 M3 Servers As a global leader in e-business applications,

More information

ETL on Hadoop What is Required

ETL on Hadoop What is Required ETL on Hadoop What is Required Keith Kohl Director, Product Management October 2012 Syncsort Copyright 2012, Syncsort Incorporated Agenda Who is Syncsort Extract, Transform, Load (ETL) Overview and conventional

More information

Big Data The Big Story

Big Data The Big Story Big Data The Big Story Jean-Pierre Dijcks Big Data Product Mangement 1 Agenda What is Big Data? Architecting Big Data Building Big Data Solutions Oracle Big Data Appliance and Big Data Connectors Customer

More information

SAP Public Budget Formulation 8.1

SAP Public Budget Formulation 8.1 Sizing Guide Document Version: 1.0 2013-09-30 CUSTOMER Typographic Conventions Type Style Example Example EXAMPLE Example Example EXAMPLE Description Words or characters quoted from the screen.

More information

Using the Blaze Engine to Run Profiles and Scorecards

Using the Blaze Engine to Run Profiles and Scorecards Using the Blaze Engine to Run Profiles and Scorecards 1993, 2016 Informatica LLC. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying, recording

More information

SAP Predictive Analytics Suite

SAP Predictive Analytics Suite SAP Predictive Analytics Suite Tania Pérez Asensio Where is the Evolution of Business Analytics Heading? Organizations Are Maturing Their Approaches to Solving Business Problems Reactive Wait until a problem

More information

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia MapR: Converged Data Pla3orm and Quick Start Solu;ons Robin Fong Regional Director South East Asia Who is MapR? MapR is the creator of the top ranked Hadoop NoSQL SQL-on-Hadoop Real Database time streaming

More information

COMPUTE CLOUD SERVICE. Move to Your Private Data Center in the Cloud Zero CapEx. Predictable OpEx. Full Control.

COMPUTE CLOUD SERVICE. Move to Your Private Data Center in the Cloud Zero CapEx. Predictable OpEx. Full Control. COMPUTE CLOUD SERVICE Move to Your Private Data Center in the Cloud Zero CapEx. Predictable OpEx. Full Control. The problem. You run multiple data centers with hundreds of servers hosting diverse workloads.

More information

Big Data & Hadoop Advance

Big Data & Hadoop Advance Course Durations: 30 Hours About Company: Course Mode: Online/Offline EduNextgen extended arm of Product Innovation Academy is a growing entity in education and career transformation, specializing in today

More information

Sizing SAP Central Process Scheduling 8.0 by Redwood

Sizing SAP Central Process Scheduling 8.0 by Redwood Sizing SAP Central Process Scheduling 8.0 by Redwood Released for SAP Customers and Partners January 2012 Copyright 2012 SAP AG. All rights reserved. No part of this publication may be reproduced or transmitted

More information

RESOURCE MANAGEMENT IN CLUSTER COMPUTING PLATFORMS FOR LARGE SCALE DATA PROCESSING

RESOURCE MANAGEMENT IN CLUSTER COMPUTING PLATFORMS FOR LARGE SCALE DATA PROCESSING RESOURCE MANAGEMENT IN CLUSTER COMPUTING PLATFORMS FOR LARGE SCALE DATA PROCESSING A Dissertation Presented By Yi Yao to The Department of Electrical and Computer Engineering in partial fulfillment of

More information

Hadoop Fair Scheduler Design Document

Hadoop Fair Scheduler Design Document Hadoop Fair Scheduler Design Document August 15, 2009 Contents 1 Introduction The Hadoop Fair Scheduler started as a simple means to share MapReduce clusters. Over time, it has grown in functionality to

More information

CASH: Context Aware Scheduler for Hadoop

CASH: Context Aware Scheduler for Hadoop CASH: Context Aware Scheduler for Hadoop Arun Kumar K, Vamshi Krishna Konishetty, Kaladhar Voruganti, G V Prabhakara Rao Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, India {arunk786,vamshi2105}@gmail.com,

More information

IBM xseries 430. Versatile, scalable workload management. Provides unmatched flexibility with an Intel architecture and open systems foundation

IBM xseries 430. Versatile, scalable workload management. Provides unmatched flexibility with an Intel architecture and open systems foundation Versatile, scalable workload management IBM xseries 430 With Intel technology at its core and support for multiple applications across multiple operating systems, the xseries 430 enables customers to run

More information

Adobe Deploys Hadoop as a Service on VMware vsphere

Adobe Deploys Hadoop as a Service on VMware vsphere Adobe Deploys Hadoop as a Service A TECHNICAL CASE STUDY APRIL 2015 Table of Contents A Technical Case Study.... 3 Background... 3 Why Virtualize Hadoop on vsphere?.... 3 The Adobe Marketing Cloud and

More information

ECLIPSE 2012 Performance Benchmark and Profiling. August 2012

ECLIPSE 2012 Performance Benchmark and Profiling. August 2012 ECLIPSE 2012 Performance Benchmark and Profiling August 2012 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource

More information

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN From Information to Insight: The Big Value of Big Data Faire Ann Co Marketing Manager, Information Management Software, ASEAN The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT

More information

HTCaaS: Leveraging Distributed Supercomputing Infrastructures for Large- Scale Scientific Computing

HTCaaS: Leveraging Distributed Supercomputing Infrastructures for Large- Scale Scientific Computing HTCaaS: Leveraging Distributed Supercomputing Infrastructures for Large- Scale Scientific Computing Jik-Soo Kim, Ph.D National Institute of Supercomputing and Networking(NISN) at KISTI Table of Contents

More information

Job Scheduling for Multi-User MapReduce Clusters

Job Scheduling for Multi-User MapReduce Clusters Job Scheduling for Multi-User MapReduce Clusters Matei Zaharia Dhruba Borthakur Joydeep Sen Sarma Khaled Elmeleegy Scott Shenker Ion Stoica Electrical Engineering and Computer Sciences University of California

More information

A New Hadoop Scheduler Framework

A New Hadoop Scheduler Framework A New Hadoop Scheduler Framework Geetha J 1, N. Uday Bhaskar 2 and P. Chenna Reddy 3 Research Scholar 1, Assistant Professor 2 & 3 Professor 2 Department of Computer Science, Government College (Autonomous),

More information

Towards Seamless Integration of Data Analytics into Existing HPC Infrastructures

Towards Seamless Integration of Data Analytics into Existing HPC Infrastructures Towards Seamless Integration of Data Analytics into Existing HPC Infrastructures Michael Gienger High Performance Computing Center Stuttgart (HLRS), Germany Redmond May 11, 2017 :: 1 Outline Introduction

More information

SAP Cloud Platform Pricing and Packages

SAP Cloud Platform Pricing and Packages Platform Pricing and Packages Get Started Packages Fast. Easy. Cost-effective. Get familiar and up-and-running with Platform in no time flat. Intended for non-production use. Designed to help users become

More information

ANSYS, Inc. March 12, ANSYS HPC Licensing Options - Release

ANSYS, Inc. March 12, ANSYS HPC Licensing Options - Release 1 2016 ANSYS, Inc. March 12, 2017 ANSYS HPC Licensing Options - Release 18.0 - 4 Main Products HPC (per-process) 10 instead of 8 in 1 st Pack at Release 18.0 and higher HPC Pack HPC product rewarding volume

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

Oracle Big Data Cloud Service

Oracle Big Data Cloud Service Oracle Big Data Cloud Service Delivering Hadoop, Spark and Data Science with Oracle Security and Cloud Simplicity Oracle Big Data Cloud Service is an automated service that provides a highpowered environment

More information

RODOD Performance Test on Exalogic and Exadata Engineered Systems

RODOD Performance Test on Exalogic and Exadata Engineered Systems An Oracle White Paper March 2014 RODOD Performance Test on Exalogic and Exadata Engineered Systems Introduction Oracle Communications Rapid Offer Design and Order Delivery (RODOD) is an innovative, fully

More information

Planning the Capacity of a Web Server: An Experience Report D. Menascé. All Rights Reserved.

Planning the Capacity of a Web Server: An Experience Report D. Menascé. All Rights Reserved. Planning the Capacity of a Web Server: An Experience Report Daniel A. Menascé George Mason University menasce@cs.gmu.edu Nikki Dinh SRA International, Inc. nikki_dinh@sra.com Robert Peraino George Mason

More information

Increased Informix Awareness Discover Informix microsite launched

Increased Informix Awareness Discover Informix microsite launched Information Management Increased Informix Awareness Discover Informix microsite launched www.ibm.com/discoverinformix 2010 IBM Corporation Informix Panther Early Program Want to be on the cutting-edge

More information

Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services

Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services Matthias Kupczak, Michael Probst; SAP June, 2017 Agenda 09:30-09:55 Coffee 09:55-10:00 Welcome Message T-Systems

More information

1. Intoduction to Hadoop

1. Intoduction to Hadoop 1. Intoduction to Hadoop Hadoop is a rapidly evolving ecosystem of components for implementing the Google MapReduce algorithms in a scalable fashion on commodity hardware. Hadoop enables users to store

More information

Recording. Solutions. Redefined. call recording

Recording. Solutions. Redefined. call recording Recording. Solutions. Redefined. call recording OrecX is the industry s most widely-used multi-tenant call recorder with over 250 Service Provider clients. OrecX will meet the your customers recording

More information

ORACLE BIG DATA APPLIANCE

ORACLE BIG DATA APPLIANCE ORACLE BIG DATA APPLIANCE BIG DATA FOR THE ENTERPRISE KEY FEATURES Massively scalable infrastructure to store and manage big data Big Data Connectors delivers unprecedented load rates between Big Data

More information

Performance Interference of Multi-tenant, Big Data Frameworks in Resource Constrained Private Clouds

Performance Interference of Multi-tenant, Big Data Frameworks in Resource Constrained Private Clouds Performance Interference of Multi-tenant, Big Data Frameworks in Resource Constrained Private Clouds Stratos Dimopoulos, Chandra Krintz, Rich Wolski Department of Computer Science University of California,

More information

Building Efficient Large-Scale Big Data Processing Platforms

Building Efficient Large-Scale Big Data Processing Platforms University of Massachusetts Boston ScholarWorks at UMass Boston Graduate Doctoral Dissertations Doctoral Dissertations and Masters Theses 5-31-2017 Building Efficient Large-Scale Big Data Processing Platforms

More information

Recording. Solutions. Redefined. CALL RECORDING

Recording. Solutions. Redefined. CALL RECORDING Recording. Solutions. Redefined. CALL RECORDING Oreka TR is compliance tested and approved on the Avaya Aura, IP Office & Communication Server platforms. Oreka TR integrates with AES, DMCC & TSAPI - providing

More information

Analytical Capability Security Compute Ease Data Scale Price Users Traditional Statistics vs. Machine Learning In-Memory vs. Shared Infrastructure CRAN vs. Parallelization Desktop vs. Remote Explicit vs.

More information

Oracle Financial Services Revenue Management and Billing V2.3 Performance Stress Test on Exalogic X3-2 & Exadata X3-2

Oracle Financial Services Revenue Management and Billing V2.3 Performance Stress Test on Exalogic X3-2 & Exadata X3-2 Oracle Financial Services Revenue Management and Billing V2.3 Performance Stress Test on Exalogic X3-2 & Exadata X3-2 O R A C L E W H I T E P A P E R J A N U A R Y 2 0 1 5 Table of Contents Disclaimer

More information

Sr. Sergio Rodríguez de Guzmán CTO PUE

Sr. Sergio Rodríguez de Guzmán CTO PUE PRODUCT LATEST NEWS Sr. Sergio Rodríguez de Guzmán CTO PUE www.pue.es Hadoop & Why Cloudera Sergio Rodríguez Systems Engineer sergio@pue.es 3 Industry-Leading Consulting and Training PUE is the first Spanish

More information

ARIA: Automatic Resource Inference and Allocation for MapReduce Environments

ARIA: Automatic Resource Inference and Allocation for MapReduce Environments ARIA: Automatic Resource Inference and Allocation for MapReduce Environments Abhishek Verma University of Illinois at Urbana-Champaign, IL, US verma7@illinois.edu Ludmila Cherkasova Hewlett-Packard Labs

More information

An Oracle White Paper April, Enterprise Manager 12c Cloud Control Metering and Chargeback

An Oracle White Paper April, Enterprise Manager 12c Cloud Control Metering and Chargeback An Oracle White Paper April, 2012 Enterprise Manager 12c Cloud Control Executive Overview... 2 Introduction... 2 I.T. Chargeback... 2 Oracle Enterprise Manager... 3 Installing and Configuring Enterprise

More information

Prediction of Personalized Rating by Combining Bandwagon Effect and Social Group Opinion: using Hadoop-Spark Framework

Prediction of Personalized Rating by Combining Bandwagon Effect and Social Group Opinion: using Hadoop-Spark Framework Prediction of Personalized Rating by Combining Bandwagon Effect and Social Group Opinion: using Hadoop-Spark Framework Lu Sun 1, Kiejin Park 2 and Limei Peng 1 1 Department of Industrial Engineering, Ajou

More information

Oracle Platform as a Service and Infrastructure as a Service Public Cloud Service Descriptions-Metered & Non-Metered.

Oracle Platform as a Service and Infrastructure as a Service Public Cloud Service Descriptions-Metered & Non-Metered. Oracle Platform as a Service and Infrastructure as a Service Public Cloud Service Descriptions-Metered & Non-Metered November 20, 2017 Contents GLOSSARY PUBLIC CLOUD SERVICES-NON-METERED... 9 API Call...

More information

In-Memory Analytics: Get Faster, Better Insights from Big Data

In-Memory Analytics: Get Faster, Better Insights from Big Data Discussion Summary In-Memory Analytics: Get Faster, Better Insights from Big Data January 2015 Interview Featuring: Tapan Patel, SAS Institute, Inc. Introduction A successful analytics program should translate

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

E-guide Hadoop Big Data Platforms Buyer s Guide part 1 Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors

More information

Oracle Autonomous Data Warehouse Cloud

Oracle Autonomous Data Warehouse Cloud Oracle Autonomous Data Warehouse Cloud 1 Lower Cost, Increase Reliability and Performance to Extract More Value from Your Data With Oracle Autonomous Database Cloud Service for Data Warehouse Today s leading-edge

More information

E-BUSINESS SUITE APPLICATIONS R12 (12.1.3) EXTRA- LARGE PAYROLL (BATCH) BENCHMARK - USING ORACLE11g ON AN IBM Power System S824

E-BUSINESS SUITE APPLICATIONS R12 (12.1.3) EXTRA- LARGE PAYROLL (BATCH) BENCHMARK - USING ORACLE11g ON AN IBM Power System S824 Employees per Hour O R A C L E E - B U S I N E S S B E N C H M A R K R EV. 1.1 E-BUSINESS SUITE APPLICATIONS R12 (12.1.3) EXTRA- LARGE PAYROLL (BATCH) BENCHMARK - USING ORACLE11g ON AN IBM Power System

More information

#23164 FASTEST GPU-BASED OLAP AND DATA MINING: BIG DATA ANALYTICS ON DGX. Speaker: Roman Raevsky, Co-Founder & CEO, Polymatica

#23164 FASTEST GPU-BASED OLAP AND DATA MINING: BIG DATA ANALYTICS ON DGX. Speaker: Roman Raevsky, Co-Founder & CEO, Polymatica #23164 FASTEST GPU-BASED OLAP AND DATA MINING: BIG DATA ANALYTICS ON DGX Speaker: Roman Raevsky, Co-Founder & CEO, Polymatica rar@polymatica.com ABOUT POLYMATICA OWN BI PLATFORM DEVELOPMENT SINCE 2010

More information

[Header]: Demystifying Oracle Bare Metal Cloud Services

[Header]: Demystifying Oracle Bare Metal Cloud Services [Header]: Demystifying Oracle Bare Metal Cloud Services [Deck]: The benefits and capabilities of Oracle s next-gen IaaS By Umair Mansoob Introduction As many organizations look to the cloud as a way to

More information

BMC CONTROL-M WORKLOAD OPTIMIZATION

BMC CONTROL-M WORKLOAD OPTIMIZATION BMC CONTROL-M WORKLOAD OPTIMIZATION TIPS & TRICKS FOR ADMINISTERING BMC CONTROL-M BMC Communities Site for South East User Group About Cetan Corp Cetan Corp is a leading independent provider of Workload

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

Apache Kafka. A distributed publish-subscribe messaging system. Neha Narkhede, 11/11/11

Apache Kafka. A distributed publish-subscribe messaging system. Neha Narkhede, 11/11/11 Apache Kafka A distributed publish-subscribe messaging system Neha Narkhede, LinkedIn @nehanarkhede, 11/11/11 Outline Introduction to pub-sub Kafka at LinkedIn Hadoop and Kafka Design Performance What

More information

Get The Best Out Of Oracle Scheduler

Get The Best Out Of Oracle Scheduler Get The Best Out Of Oracle Scheduler Vira Goorah Oracle America Redwood Shores CA Introduction Automating the business process is a key factor in reducing IT operating expenses. The need for an effective

More information

20775: Performing Data Engineering on Microsoft HD Insight

20775: Performing Data Engineering on Microsoft HD Insight Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

BI Portal User Guide

BI Portal User Guide Contents 1 Overview... 3 2 Accessing the BI Portal... 3 3 BI Portal Dashboard... 3 3.1 Adding a new widget... 4 3.2 Customizing an Existing Widget... 8 3.3 Additional Widget Operations... 9 4 Widget Gallery...

More information

Securing MapReduce Result Integrity via Verification-based Integrity Assurance Framework

Securing MapReduce Result Integrity via Verification-based Integrity Assurance Framework , pp.53-70 http://dx.doi.org/10.14257/ijgdc.2014.7.6.05 Securing MapReduce Result Integrity via Verification-based Integrity Assurance Framework Yongzhi Wang 1, Jinpeng Wei 2 and Yucong Duan 3 1,2 Floridia

More information

Oracle PaaS and IaaS Universal Credits Service Descriptions

Oracle PaaS and IaaS Universal Credits Service Descriptions Oracle PaaS and IaaS Universal Credits Service Descriptions December 1, 2017 Oracle PaaS_IaaS_Universal_CreditsV120117 1 Metrics... 3 Oracle PaaS and IaaS Universal Credit... 8 Oracle PaaS and IaaS Universal

More information

DynamicCloudSim: Simulating Heterogeneity in Computational Clouds

DynamicCloudSim: Simulating Heterogeneity in Computational Clouds DynamicCloudSim: Simulating Heterogeneity in Computational Clouds Marc Bux, Ulf Leser Department of Computer Science, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, Germany {buxmarcn,leser}@informatik.hu-berlin.de

More information

How to Build Your Data Ecosystem with Tableau on AWS

How to Build Your Data Ecosystem with Tableau on AWS How to Build Your Data Ecosystem with Tableau on AWS Moving Your BI to the Cloud Your BI is working, and it s probably working well. But, continuing to empower your colleagues with data is going to be

More information

InfoSphere DataStage Grid Solution

InfoSphere DataStage Grid Solution InfoSphere DataStage Grid Solution Julius Lerm IBM Information Management 1 2011 IBM Corporation What is Grid Computing? Grid Computing doesn t mean the same thing to all people. GRID Definitions include:

More information

A Contention-Aware Hybrid Evaluator for Schedulers of Big Data Applications in Computer Clusters

A Contention-Aware Hybrid Evaluator for Schedulers of Big Data Applications in Computer Clusters A Contention-Aware Hybrid Evaluator for Schedulers of Big Data Applications in Computer Clusters Shouvik Bardhan Department of Computer Science George Mason University Fairfax, VA 22030 Email: sbardhan@masonlive.gmu.edu

More information

SLA-Driven Planning and Optimization of Enterprise Applications

SLA-Driven Planning and Optimization of Enterprise Applications SLA-Driven Planning and Optimization of Enterprise Applications H. Li 1, G. Casale 2, T. Ellahi 2 1 SAP Research, Karlsruhe, Germany 2 SAP Research, Belfast, UK Presenter: Giuliano Casale WOSP/SIPEW Conference

More information

IBM SPSS & Apache Spark

IBM SPSS & Apache Spark IBM SPSS & Apache Spark Making Big Data analytics easier and more accessible ramiro.rego@es.ibm.com @foreswearer 1 2016 IBM Corporation Modeler y Spark. Integration Infrastructure overview Spark, Hadoop

More information

Sizing Component Extension 6.0 for SAP EHS Management

Sizing Component Extension 6.0 for SAP EHS Management Sizing Guide Document Version: 1.3 2016-05-18 Sizing Component Extension 6.0 for SAP EHS Management Disclaimer Some components of this product are based on Java. Any code change in these components may

More information

USING HPC CLASS INFRASTRUCTURE FOR HIGH THROUGHPUT COMPUTING IN GENOMICS

USING HPC CLASS INFRASTRUCTURE FOR HIGH THROUGHPUT COMPUTING IN GENOMICS USING HPC CLASS INFRASTRUCTURE FOR HIGH THROUGHPUT COMPUTING IN GENOMICS Claude SCARPELLI Claude.Scarpelli@cea.fr FUNDAMENTAL RESEARCH DIVISION GENOMIC INSTITUTE Intel DDN Life Science Field Day Heidelberg,

More information

Automation Test Introduction

Automation Test Introduction Product Introduction Automation Introduction Network Master Pro MT1000A Proposal Outline Product Introduction Work Flow Application Examples Automation Steps Step 1: Office Work Step 2: Registering Scenario

More information

SPECjbb2015 Benchmark Design Document

SPECjbb2015 Benchmark Design Document Standard Performance Evaluation Corporation (SPEC) SPECjbb2015 Benchmark Design Document 7001 Heritage Village Plaza, Suite 225 Gainesville, VA 20155, USA SPEC OSG JAVA Committee Table of Contents 1. Introduction

More information

Cloud Service Model. Selecting a cloud service model. Different cloud service models within the enterprise

Cloud Service Model. Selecting a cloud service model. Different cloud service models within the enterprise Cloud Service Model Selecting a cloud service model Different cloud service models within the enterprise Single cloud provider AWS for IaaS Azure for PaaS Force fit all solutions into the cloud service

More information

Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN

Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN Processing over a trillion events a day CASE STUDIES IN SCALING STREAM PROCESSING AT LINKEDIN Jagadish Venkatraman

More information

Operations Management Suite

Operations Management Suite PRICING AND LICENSING DATASHEET MARCH 2017 Delivered from Azure, Operations Management Suite (OMS) enables you to gain visibility and control with comprehensive operations management and security across

More information

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11 Top 5 Challenges for Hadoop MapReduce in the Enterprise Whitepaper - May 2011 http://platform.com/mapreduce 2 5/9/11 Table of Contents Introduction... 2 Current Market Conditions and Drivers. Customer

More information

Mike Strickland, Director, Data Center Solution Architect Intel Programmable Solutions Group July 2017

Mike Strickland, Director, Data Center Solution Architect Intel Programmable Solutions Group July 2017 Mike Strickland, Director, Data Center Solution Architect Intel Programmable Solutions Group July 2017 Accelerate Big Data Analytics with Intel Frameworks and Libraries with FPGA s 1. Intel Big Data Analytics

More information

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS)

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how Dell EMC Elastic Cloud Storage (ECS ) can be used to streamline

More information

Oracle Business Intelligence Suite Enterprise Edition 4,000 User Benchmark on an IBM System x3755 Server running Red Hat Enterprise Linux

Oracle Business Intelligence Suite Enterprise Edition 4,000 User Benchmark on an IBM System x3755 Server running Red Hat Enterprise Linux Oracle Business Intelligence Suite Enterprise Edition 4,000 User Benchmark on an IBM System x3755 Server running Red Hat Enterprise Linux An Oracle White Paper April 2008 Oracle Business Intelligence Suite

More information

OPERATING SYSTEMS. Systems and Models. CS 3502 Spring Chapter 03

OPERATING SYSTEMS. Systems and Models. CS 3502 Spring Chapter 03 OPERATING SYSTEMS CS 3502 Spring 2018 Systems and Models Chapter 03 Systems and Models A system is the part of the real world under study. It is composed of a set of entities interacting among themselves

More information

Research Report. The Major Difference Between IBM s LinuxONE and x86 Linux Servers

Research Report. The Major Difference Between IBM s LinuxONE and x86 Linux Servers Research Report The Major Difference Between IBM s LinuxONE and x86 Linux Servers Executive Summary The most important point in this Research Report is this: mainframes process certain Linux workloads

More information

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect 2005 Concert de Coldplay 2014 Concert de Coldplay 90% of the world s data has been created over the last two years alone 1 1. Source

More information

Comparative Analysis of Scheduling Algorithms of Cloudsim in Cloud Computing

Comparative Analysis of Scheduling Algorithms of Cloudsim in Cloud Computing International Journal of Computer Applications (975 8887) Comparative Analysis of Scheduling Algorithms of Cloudsim in Cloud Computing Himani Department of CSE Guru Nanak Dev University, India Harmanbir

More information

Building Your Big Data Team

Building Your Big Data Team Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.

More information

Oracle Autonomous Data Warehouse Cloud

Oracle Autonomous Data Warehouse Cloud Oracle Autonomous Data Warehouse Cloud 1 Lower Cost, Increase Reliability and Performance to Extract More Value from Your Data With Oracle Autonomous Data Warehouse Cloud Today s leading-edge organizations

More information

Lecture 11: CPU Scheduling

Lecture 11: CPU Scheduling CS 422/522 Design & Implementation of Operating Systems Lecture 11: CPU Scheduling Zhong Shao Dept. of Computer Science Yale University Acknowledgement: some slides are taken from previous versions of

More information

Konica Minolta Business Innovation Center

Konica Minolta Business Innovation Center Konica Minolta Business Innovation Center Advance Technology/Big Data Lab May 2016 2 2 3 4 4 Konica Minolta BIC Technology and Research Initiatives Data Science Program Technology Trials (Technology partner

More information

NetApp Flexgroup Volumes in ONTAP. August 2017 SL10312 Version 1.0

NetApp Flexgroup Volumes in ONTAP. August 2017 SL10312 Version 1.0 August 217 SL1312 Version 1. TABLE OF CONTENTS 1 Introduction... 3 1.1 What is a NetApp FlexGroup?... 3 1.2 The Value of NetApp FlexGroup... 3 1.3 Lab Objectives... 4 1.4 Prerequisites... 4 2 Lab Environment...

More information

Make the most of the cloud with Microsoft System Center and Azure

Make the most of the cloud with Microsoft System Center and Azure December 2015 Make the most of the cloud with Microsoft System Center and Azure Writer: Daniel Örneling Amsterdam - Dallas - Ottawa Table of Content 1.1 The Dilemma: too many customers 3 1.2 Is the cloud

More information

Energy-Efficient Scheduling of Interactive Services on Heterogeneous Multicore Processors

Energy-Efficient Scheduling of Interactive Services on Heterogeneous Multicore Processors Energy-Efficient Scheduling of Interactive Services on Heterogeneous Multicore Processors Shaolei Ren, Yuxiong He, Sameh Elnikety University of California, Los Angeles, CA Microsoft Research, Redmond,

More information

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Leveraging Oracle Big Data Discovery to Master CERN s Data Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Manuel Martin Marquez Intel IoT Ignition Lab Cloud and

More information

Cost Optimization for Cloud-Based Engineering Simulation Using ANSYS Enterprise Cloud

Cost Optimization for Cloud-Based Engineering Simulation Using ANSYS Enterprise Cloud Application Brief Cost Optimization for Cloud-Based Engineering Simulation Using ANSYS Enterprise Cloud Most users of engineering simulation are constrained by computing resources to some degree. They

More information

Certified Functions: WebDAV Storage interface, Server functionality WebDAV Storage Interface LOAD Test performed Solution Manager Ready functionality

Certified Functions: WebDAV Storage interface, Server functionality WebDAV Storage Interface LOAD Test performed Solution Manager Ready functionality SAP AG hereby confirms that the software component Interface for IBM Content Collector version V2.2 for the product IBM Content Collector version V2.2 of the company IBM Deutschland Research & Development

More information

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB Data Analytics and CERN IT Hadoop Service CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB 1 Data Analytics at Scale The Challenge When you cannot fit your workload in a desktop Data

More information

Your Big Data to Big Data tools using the family of PI Integrators

Your Big Data to Big Data tools using the family of PI Integrators 1 Your Big Data to Big Data tools using the family of PI Integrators Presented by Martin Bryant Field Service Engineer @osisoft PI Integrators PI Integrator for Business Analytics PI Integrator for Business

More information

Microsoft FastTrack For Azure Service Level Description

Microsoft FastTrack For Azure Service Level Description ef Microsoft FastTrack For Azure Service Level Description 2017 Microsoft. All rights reserved. 1 Contents Microsoft FastTrack for Azure... 3 Eligible Solutions... 3 FastTrack for Azure Process Overview...

More information

Data Analytics with MATLAB Adam Filion Application Engineer MathWorks

Data Analytics with MATLAB Adam Filion Application Engineer MathWorks Data Analytics with Adam Filion Application Engineer MathWorks 2015 The MathWorks, Inc. 1 Case Study: Day-Ahead Load Forecasting Goal: Implement a tool for easy and accurate computation of dayahead system

More information

Starting with Oracle Data Science in the Cloud

Starting with Oracle Data Science in the Cloud Starting with Oracle Data Science in the Cloud Kscope 17 Tim Vlamis Tuesday, June 27, 2017 @VlamisSoftware Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri Developed 200+

More information

Краеугольный камень ИТ-трансформации в Новую Экономическую Эру

Краеугольный камень ИТ-трансформации в Новую Экономическую Эру Краеугольный камень ИТ-трансформации в Новую Экономическую Эру Huawei-SAP Collaboration History FusionCube SAP HANA was launched. SAP HANA on FusionSphere was certified. Huawei and SAP delivered the world's

More information

Oracle Utilities Mobile Workforce Management Benchmark

Oracle Utilities Mobile Workforce Management Benchmark Oracle Utilities Mobile Workforce Management Benchmark Demonstrates Superior Scalability for Large Field Service Organizations O R A C L E W H I T E P A P E R F E B R U A R Y 2 0 1 6 Introduction Large

More information

Hadoop Integration Deep Dive

Hadoop Integration Deep Dive Hadoop Integration Deep Dive Piyush Chaudhary Spectrum Scale BD&A Architect 1 Agenda Analytics Market overview Spectrum Scale Analytics strategy Spectrum Scale Hadoop Integration A tale of two connectors

More information