Agent based parallel processing tool for Big Scientific Data. A. Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille

Size: px
Start display at page:

Download "Agent based parallel processing tool for Big Scientific Data. A. Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille"

Transcription

1 Agent based parallel processing tool for Big Scientific Data A. Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille IHEP, Beijing, 5 May 2015

2 Plan The problem of the HEP data processing DIRAC Project Agent based Workload Management System Accessible computing resources Data Management Interfaces DIRAC as a service Conclusions 2

3 Big Data Data that exceeds the boundaries and sizes of normal processing capabilities, forcing you to take a non-traditional approach for the treatment Google trends: Clouds Grids Big Data 3

4 Big Data LHC RAW data per year Already in 2010 the LHC experiments produced 13 PB of data That rate outstripped any other scientific effort going on 4

5 Big Data LHC RAW data per year LHC RAW data volumes are inflated by storing derived data products, replication for safety and efficient access, and by the need for storing even more simulated data than the RAW data: ~130 PB overall WLCG data on the Grid 5

6 Data flow to permanent storage: 4-6 GB/sec MB/sec 1-2 GB/sec ~ 4 GB/sec 1-2 GB/sec 6

7 Massive computations: HEP LHC experiments pioneered the massive use of computational grids as a solution to the High Energy Physics Big Data problem 10s of PBytes of data per year 100s of thousands CPUs in 100s of centers 10s of Gbyte/s data transfers 100s of users from 100s of institutions CERN Director General Rolf Heuer about the Higgs discovery: "It was a global effort and it is a global success. The results today are only possible because of the extraordinary performance of the accelerators, including the infrastructure, the experiments, and the Grid computing. Other domains are catching up quickly with the HEP experiments Life sciences, earth sciences, astrophysics, social sciences, etc 7

8 Worldwide LHC Computing Grid Collaboration (WLCG) Distributed infrastructure of 150 computing centers in 40 countries 300+ k CPU cores (~ 2M HEP-SPEC-06) The biggest site with ~50k CPU cores, 12 T2 with 2-30k CPU cores Distributed data, services and operation infrastructure 8

9 Computing grids Grids are providing common infrastructure and rules to integrate multiple computing centers Common computing element access protocols Common user payload scheduling system Common policies and security rules Users are signing up once and have access to multiple computing centers Common monitoring of the user jobs Common accounting of computing resources 9

10 Standard Grid Architecture Centralized Workload Management System Computing sites publish information on their availability Users submit jobs to the central repository The central Resource Broker (RB) pushes user jobs to sites after matching the job/resource compatibility 10

11 Standard grid problems Heavy reliance on the site status information flow Any delays result in suboptimal scheduling decisions Insufficient RB power for large infrastructures Necessity to deploy multiple central (!) RB s Heavy infrastructures required on the sites resources providers Providing timely status, accounting and monitoring information Ensuring per user authentication/authorization Managing community policies via local batch queues parameterization 11

12 Virtualized Computing Resources Since recently computing centers started to provide their resources using cloud technologies Virtualized computing infrastructures Very flexible provides resources adapted to the user needs Operating systems, memory, CPU power, etc. However, large scientific collaborations can have access to multiple computational clouds Dealing with independent clouds separately requires a huge management effort Federating multiple clouds into a single coherent system is necessary to provide a transparent access for users. 12

13 DIRAC Grid Solution DIRAC is developed originally for the LHCb experiment at with the goals: Integrate all the heterogeneous computing resources available to the community Provide solution for both WMS and DMS tasks Minimize human intervention at sites providers of resources Make the grid convenient for the users: Simpler intuitive interfaces Fault tolerance, quicker turnaround of user jobs Enabling Community policies DIRAC has introduced a notion of Pilot Jobs A concept of distributed agents bringing together various disjoint computing resources 13

14 14 DIRAC Workload Management

15 Physicist User Production Manager Matcher Service EGI/WLCG Grid NDG Grid Amazon EC2 Cloud CREAM CE 15 EGI Pilot Director NDG Pilot Director Amazon Pilot Director CREAM Pilot Director

16 WMS: using heterogeneous resources Including resources in different grids and standalone clusters is simple with Pilot Jobs Needs a specialized Pilot Director per resource type Users just see new sites appearing in the job monitoring 16

17 u In DIRAC both User and Production jobs are treated by the same WMS Same Task Queue WMS: applying VO policies u This allows to apply efficiently policies for the whole VO ª Assigning Job Priorities for different groups and activities ª Static group priorities are used currently ª More powerful scheduler can be plugged in demonstrated with MAUI scheduler Users perceive the DIRAC WMS as a single large batch system 17

18 18 DIRAC as a resource manager

19 19 Computing Grids DIRAC was initially developed with the focus on accessing conventional Grid computing resources WLCG grid resources for the LHCb Collaboration It fully supports glite middleware based grids European Grid Infrastructure (EGI), Latin America GISELA, etc Using glite/emi middleware Northern American Open Science Grid (OSG) Using VDT middleware Northern European Grid (NDGF) Using ARC middleware Other types of grids can be supported As long we have customers needing that

20 Clouds VM scheduler developed for Belle MC production system Dynamic VM spawning taking Amazon EC2 spot prices and Task Queue state into account Discarding VMs automatically when no more needed The DIRAC VM scheduler by means of dedicated VM Directors is interfaced to OCCI compliant clouds: OpenStack, OpenNebula CloudStack Amazon EC2 20

21 Standalone computing clusters Off-site Pilot Director Site delegates control to the central service Site must only define a dedicated local user account The payload submission through the SSH tunnel The site can be a single computer or a cluster with a batch system LSF, BQS, SGE, PBS/Torque, Condor, OAR More to come: SLURM, LoadLeveler. etc The user payload is executed with the owner credentials No security compromises with respect to external services 21

22 Examples: DIRAC.Yandex.ru >2000 cores Torque batch system, no grid middleware, access by SSH Second largest LHCb MC production site Standalone computing clusters LRZ Computing Center, Munich SLURM batch system, GRAM5 CE service Gateway access by GSISSH Considerable resources for biomed community (work in progress) Mesocentre Aix-Marseille University OAR batch system, no grid middleware, access by SSH Open to multiple communities (work in progress) 22

23 BOINC Desktop Grids On the client PC the third party components are installed: VirtualBox hypervisor Standard BOINC client A special BOINC application Starts a requested VM within the VirtualBox Passes the Pilot Job to the VM and starts it Once the Pilot Job starts in the VM, the user PC becomes a normal DIRAC Worker Node 23

24 Data Storage and Catalogs Managing data is more complicated than workload management Already existing data in various storage resources must be connected Contents of already existing File Catalogs must be either connected or copied to the DIRAC File Catalog Need for common interfaces to various types of storage and catalog services 24

25 Data Management components For DIRAC users the use of any Storage Element or File Catalog is transparent Up to a user community to choose components to use Different SE types can be mixed together Several File Catalogs can be used in parallel Complementary functionality Redundancy Users see depending on the DIRAC Configuration Logical Storage Elements e.g. DIRAC-USER, M3PEC-disk Logical File Catalog 25

26 Interware: DMS Data Management Multiple types of Storage Elements ( SRM, XRootD, irods, DAV, DIP, ) seen as logical entities for the users SE Proxy service for protocol independence Data replication: FTS3 File Transfer Service Third party Local pipe 26

27 Interware: DMS DIRAC File Catalog Replica Catalog User Metadata Catalog Support for data provenance information Efficient storage usage reports Allow for user quota policies Support for user defined datasets LCG File Catalog Specialized File Catalogs 27

28 Resource provisioning service DIRAC attempts to provide access to all kinds of existing resources and services useful for its users At the same time it provides its own solutions, e.g. catalogs, storage and computing element, transfer service, etc. By design services from different providers can be used together in parallel not necessarily replacing one another Example: use of DFC and LFC together, see below The final choice of the services to use is left to the user 28

29 Interware } DIRAC has all the necessary components to build ad-hoc grid infrastructures interconnecting computing resources of different types. This allows to speak about the DIRAC interware. 29

30 30 Interfaces

31 DIRAC: Secure Web Portal Focus on the Web Portal as the main user tool for interactions with the grid Intuitive desktop application like interface Ajax, Tornado, ExtJS Javascript library Monitoring and control of all activities User registration, proxy upload User job monitoring and manipulation, downloading results Data manipulation and downloads DIRAC Systems configuration and management Secure access Standard grid certificates Fine grained authorization rules 31

32 32 Web Portal examples

33 Application Web Portals Specific application portals can be built in the DIRAC Web Portal framework Community Application Servers DIRAC RESTful interface Language neutral Suitable to use with portals written in Java, PHP, etc Other interfaces include Extensive Python API A rich set of command line tools ( >200 commands ) Available on UNIX-like systems 33

34 34 DIRAC Users: large communities

35 LHCb Collaboration 35 Up to 100K concurrent jobs in ~120 distinct sites Equivalent to running a virtual computing center with a power of 100K CPU cores Further optimizations to increase the capacity are possible Hardware, database optimizations, service load balancing, etc

36 Experiments: Belle II Combination of the non-grid, grid sites and (commercial) clouds is a requirement 2 GB/s, 40 PB of data in 2019 Belle II grid resources WLCG, OSG grids KEK Computing Center Amazon EC2 cloud First production run is done Thomas Kuhr, Belle II 36

37 Belle II DIRAC Scalability tests Hideki Miyake, KEK 37

38 Community installations ILC/CLIC detector Collaboration, Calice VO Dedicated installation at CERN, 10 servers, DB-OD MySQL server MC simulations DIRAC File Catalog was developed to meet the ILC/CLIC requirements BES III, IHEP, China Using DIRAC DMS: File Replica and Metadata Catalog, Transfer services Dataset management developed for the needs of BES III CTA CTA started as France-Grilles DIRAC service customer Now is using a dedicated installation at PIC, Barcelona Using complex workflows Geant4 Dedicated installation at CERN Validation of MC simulation software releases Alpes Laser Company using dedicated DIRAC setup for simulation of the laser hardware Mostly using commercial Amazon resources 38 DIRAC evaluations by other experiments LSST, Auger, TREND, Daya Bay, Juno Evaluations can be done with general purpose DIRAC services

39 39 DIRAC as a Service

40 DIRAC as a service DIRAC client is easy to install Part of a usual tutorial DIRAC services are easy to install but Needs dedicated hardware for hosting Configuration, maintenance needs expert manpower Monitoring computing resources is a tedious every-day task Small user communities can not afford maintaining dedicated DIRAC services Still need easy access to computing resources Large grid infrastructures can provide DIRAC services for their users. 40

41 Several regional and university campus installations in France Complex maintenance Joint effort to provide France-Grid DIRAC service Hosted by the CC/IN2P3, Lyon, T1 center Distributed team of service administrators 15 VOs, ~100 registered users Mostly biomed applictions 41 France-Grid DIRAC service

42 Multi-VO DIRAC service in IHEP IHEP Computing Center is the main resource provider for the BES experiment The BES Collaboration decided to use DIRAC to build its distributed computing project Developed a good expertize level in developing and managing DIRAC services The BES dedicated DIRAC service is being extended to support other user communities CEPC, Juno, Daya Bay, Non-HEP communities? Reuse of acquired experience Single Administration team Minimization of the user support and maintenance effort 42

43 43 Conclusions

44 Conclusions The computational grids are no more something exotic, they are used in a daily work for various applications Agent based workload management architecture allows to seamlessly integrate different kinds of grids, clouds and other computing resources DIRAC is providing a framework for building distributed computing systems and a rich set of ready to use services. This is used now in a number of DIRAC service projects on a regional and national levels Services based on DIRAC technologies can help users to get started in the world of distributed computations and reveal its full potential 44

DIRAC Services for Grid and Cloud Infrastructures. A.Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille, 29 January 2018, CC/IN2P3, Lyon

DIRAC Services for Grid and Cloud Infrastructures. A.Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille, 29 January 2018, CC/IN2P3, Lyon DIRAC Services for Grid and Cloud Infrastructures A.Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille, 29 January 2018, CC/IN2P3, Lyon Plan DIRAC in a nutshell DIRAC communities Services for multi-community installations

More information

Virtual Imaging Platform - VIP -

Virtual Imaging Platform - VIP - Virtual Imaging Platform - VIP - Sorina Pop Laboratoire Creatis Université de Lyon, CREATIS; CNRS UMR5220; Inserm U1044; INSA-Lyon; Université Lyon 1, France Overview General VIP presentation Performance

More information

Some history Enabling Grids for E-sciencE

Some history Enabling Grids for E-sciencE Outline Grid: architecture and components (on the example of the glite middleware) Slides contributed by Dr. Ariel Garcia Forschungszentrum Karlsruhe Some history Grid and the middleware glite components,

More information

ARC Middleware and its deployment in the distributed Tier1 center by NDGF. Oxana Smirnova Lund University/NDGF Grid 2008, July , Dubna

ARC Middleware and its deployment in the distributed Tier1 center by NDGF. Oxana Smirnova Lund University/NDGF Grid 2008, July , Dubna ARC Middleware and its deployment in the distributed Tier1 center by NDGF Oxana Smirnova Lund University/NDGF Grid 2008, July 1 2008, Dubna Outlook ARC Classic overview NDGF Tier1 Future of ARC: next generation

More information

PoS(EGICF12-EMITC2)032

PoS(EGICF12-EMITC2)032 Experiment Representation at the WLCG Tier-1 Center GridKa Christopher Jung E-mail: christopher.jung@kit.edu E-mail: andreas.petzold@kit.edu Marian Zvada E-mail: marian.zvada@kit.edu The GridKa Computing

More information

February 14, 2006 GSA-WG at GGF16 Athens, Greece. Ignacio Martín Llorente GridWay Project

February 14, 2006 GSA-WG at GGF16 Athens, Greece. Ignacio Martín Llorente GridWay Project February 14, 2006 GSA-WG at GGF16 Athens, Greece GridWay Scheduling Architecture GridWay Project www.gridway.org Distributed Systems Architecture Group Departamento de Arquitectura de Computadores y Automática

More information

Globus and glite Platforms

Globus and glite Platforms Globus and glite Platforms Outline glite introduction services functionality Globus ToolKit overview components architecture glite - middleware a layer between services and resources glite simple view

More information

glideinwms in the Cloud

glideinwms in the Cloud glideinwms in the Cloud ANTHONY TIRADANI AND THE GLIDEINWMS TEAM glideinwms (review of basic principles) Pilot-based WMS that creates an on demand dynamicallysized overlay condor batch system to address

More information

Testing SLURM batch system for a grid farm: functionalities, scalability, performance and how it works with Cream-CE

Testing SLURM batch system for a grid farm: functionalities, scalability, performance and how it works with Cream-CE Testing SLURM batch system for a grid farm: functionalities, scalability, performance and how it works with Cream-CE DONVITO GIACINTO (INFN) ZANGRANDO, LUIGI (INFN) SGARAVATTO, MASSIMO (INFN) REBATTO,

More information

The architecture of the AliEn system

The architecture of the AliEn system www.eu-egee.org The architecture of the AliEn system Predrag Buncic A. J. Peters, P.Saiz, J-F. Grosse-Oetringhaus CHEP 2004 EGEE is a project funded by the European Union under contract IST-2003-508833

More information

Distributed system updates

Distributed system updates Distributed system updates Armando Fella XVII SuperB Workshop and kick off meeting La Biodola (Isola d'elba), Italy from 28 May 2011 to 02 June 2011 1 Presentation Layout Introduction: general view Distributed

More information

A brief history of glite Past, present and future

A brief history of glite Past, present and future A brief history of glite Past, present and future Maria Alandes Pradillo, CERN ESAC E-Science Workshop '10 www.eu-egee.org EGEE and glite are registered trademarks Contents What is glite? Origins of glite

More information

ARC Control Tower: A flexible generic distributed job management framework

ARC Control Tower: A flexible generic distributed job management framework Journal of Physics: Conference Series PAPER OPEN ACCESS ARC Control Tower: A flexible generic distributed job management framework Recent citations - Storageless and caching Tier-2 models in the UK context

More information

Deep Learning Acceleration with MATRIX: A Technical White Paper

Deep Learning Acceleration with MATRIX: A Technical White Paper The Promise of AI The Challenges of the AI Lifecycle and Infrastructure MATRIX Solution Architecture Bitfusion Core Container Management Smart Resourcing Data Volumes Multitenancy Interactive Workspaces

More information

Workload Management draft mandate and workplan

Workload Management draft mandate and workplan CCS Data and Workload Management CERN, Monday 20 Sept 2004 Workload Management draft mandate and workplan Stefano Lacaprara Stefano.Lacaprara@pd.infn.it INFN and Padova University Stefano Lacaprara CCS

More information

HP Cloud Maps for rapid provisioning of infrastructure and applications

HP Cloud Maps for rapid provisioning of infrastructure and applications Technical white paper HP Cloud Maps for rapid provisioning of infrastructure and applications Table of contents Executive summary 2 Introduction 2 What is an HP Cloud Map? 3 HP Cloud Map components 3 Enabling

More information

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS)

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how Dell EMC Elastic Cloud Storage (ECS ) can be used to streamline

More information

GRID Computing at Forschungszentrum Karlsruhe suitable for LOFAR

GRID Computing at Forschungszentrum Karlsruhe suitable for LOFAR Forschungszentrum Karlsruhe In der Helmholtz-Gemeinschaft Institute for Data Processing and Electronics GRID Computing at Forschungszentrum Karlsruhe suitable for LOFAR Rainer Stotzka, Hartmut Gemmeke

More information

IBM Tivoli Workload Automation View, Control and Automate Composite Workloads

IBM Tivoli Workload Automation View, Control and Automate Composite Workloads Tivoli Workload Automation View, Control and Automate Composite Workloads Mark A. Edwards Market Manager Tivoli Workload Automation Corporation Tivoli Workload Automation is used by customers to deliver

More information

EUDAT How manage Data into the Collaborative Data Infrastructure: a general overview of EUDAT services

EUDAT How manage Data into the Collaborative Data Infrastructure: a general overview of EUDAT services EUDAT How manage Data into the Collaborative Data Infrastructure: a general overview of EUDAT services Giovanni Morelli www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020 programme

More information

ORACLE CLOUD MANAGEMENT PACK FOR MIDDLEWARE

ORACLE CLOUD MANAGEMENT PACK FOR MIDDLEWARE ORACLE CLOUD MANAGEMENT PACK FOR MIDDLEWARE Oracle Enterprise Manager is Oracle s strategic integrated enterprise IT management product line. It provides the industry s first complete cloud lifecycle management

More information

Hybrid Cloud. Private and public clouds under a single service

Hybrid Cloud. Private and public clouds under a single service Hybrid Cloud Private and public clouds under a single service Combine the bespoke, costeffective nature of our private cloud with the latest technology and scalability of the public Azure platform and

More information

Realize Your Product Promise

Realize Your Product Promise Realize Your Product Promise ANSYS Enterprise Cloud is a complete simulation platform, delivered in your secure, dedicated environment on the public cloud. Complete and extensible, ANSYS Enterprise Cloud

More information

CMS readiness for multi-core workload scheduling

CMS readiness for multi-core workload scheduling CMS readiness for multi-core workload scheduling Antonio Pérez-Calero Yzquierdo, on behalf of the CMS Collaboration, Computing and Offline, Submission Infrastructure Group CHEP 2016 San Francisco, USA

More information

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB Data Analytics and CERN IT Hadoop Service CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB 1 Data Analytics at Scale The Challenge When you cannot fit your workload in a desktop Data

More information

On-demand provisioning of HEP compute resources on cloud sites and shared HPC centers

On-demand provisioning of HEP compute resources on cloud sites and shared HPC centers On-demand provisioning of HEP compute resources on cloud sites and shared HPC centers Gu nther Erli, Frank Fischer, Georg Fleig, Manuel Giffels, Thomas Hauth, Gu nter Quast, Matthias Schnepf (IEKP), Andreas

More information

Cloud Service Model. Selecting a cloud service model. Different cloud service models within the enterprise

Cloud Service Model. Selecting a cloud service model. Different cloud service models within the enterprise Cloud Service Model Selecting a cloud service model Different cloud service models within the enterprise Single cloud provider AWS for IaaS Azure for PaaS Force fit all solutions into the cloud service

More information

White paper A Reference Model for High Performance Data Analytics(HPDA) using an HPC infrastructure

White paper A Reference Model for High Performance Data Analytics(HPDA) using an HPC infrastructure White paper A Reference Model for High Performance Data Analytics(HPDA) using an HPC infrastructure Discover how to reshape an existing HPC infrastructure to run High Performance Data Analytics (HPDA)

More information

IBM WebSphere Information Integrator Content Edition Version 8.2

IBM WebSphere Information Integrator Content Edition Version 8.2 Introducing content-centric federation IBM Content Edition Version 8.2 Highlights Access a broad range of unstructured information sources as if they were stored and managed in one system Unify multiple

More information

Harvester. Tadashi Maeno (BNL)

Harvester. Tadashi Maeno (BNL) Harvester Tadashi Maeno (BNL) Outline Motivation Design Workflows Plans 2 Motivation 1/2 PanDA currently relies on server-pilot paradigm PanDA server maintains state and manages workflows with various

More information

2nd EUDAT User Forum

2nd EUDAT User Forum 2nd EUDAT User Forum Data staging to HPC Giuseppe Fiameni SuperComputing, Application and Innovation CINECA, Italy 2nd EUDAT User Forum, London 11, 12 March 2013 Agenda Topic Data Staging to HPC- Conveners:

More information

Integrating MATLAB Analytics into Enterprise Applications

Integrating MATLAB Analytics into Enterprise Applications Integrating MATLAB Analytics into Enterprise Applications David Willingham 2015 The MathWorks, Inc. 1 Run this link. http://bit.ly/matlabapp 2 Key Takeaways 1. What is Enterprise Integration 2. What is

More information

Learn How To Implement Cloud on System z. Delivering and optimizing private cloud on System z with Integrated Service Management

Learn How To Implement Cloud on System z. Delivering and optimizing private cloud on System z with Integrated Service Management Learn How To Implement Cloud on System z Delivering and optimizing private cloud on System z with Integrated Service Mike Baskey, Distinguished Engineer, Tivoli z Architecture IBM August 9th, 2012 Session:

More information

SurPaaS Analyzer. Cut your application assessment. Visualize Your Cloud Options. Time by a factor of 10x and Cost by 75% Unique Features

SurPaaS Analyzer. Cut your application assessment. Visualize Your Cloud Options. Time by a factor of 10x and Cost by 75% Unique Features SurPaaS Analyzer Visualize Your Cloud Options DATASHEET Cut your application assessment Time by a factor of 10x and Cost by 75% SurPaaS Analyzer is a Cloud smart decision-enabling tool that analyzes your

More information

AsyncStageOut: Distributed user data management for CMS Analysis

AsyncStageOut: Distributed user data management for CMS Analysis Journal of Physics: Conference Series PAPER OPEN ACCESS AsyncStageOut: Distributed user data management for CMS Analysis To cite this article: H Riahi et al 2015 J. Phys.: Conf. Ser. 664 062052 View the

More information

Executing Multi-workflow simulations on a mixed grid/cloud infrastructure using the SHIWA Technology Peter Kacsuk MTA SZTAKI and Univ.

Executing Multi-workflow simulations on a mixed grid/cloud infrastructure using the SHIWA Technology Peter Kacsuk MTA SZTAKI and Univ. Executing Multi-workflow simulations on a mixed grid/cloud infrastructure using the SHIWA Technology Peter Kacsuk MTA SZTAKI and Univ.of Westminster kacsuk@sztaki.hu What is a multi-workflow simulation?

More information

Reinforcing User Data Analysis with Ganga in the LHC Era: Scalability, Monitoring and User-support

Reinforcing User Data Analysis with Ganga in the LHC Era: Scalability, Monitoring and User-support Reinforcing User Data Analysis with Ganga in the LHC Era: Scalability, Monitoring and User-support Johannes Elmsheuser Ludwig-Maximilians-Universität München on behalf of the GangaDevelopers team and the

More information

Multi-Containers Orchestration with Live Migration and High-Availability for Microservices

Multi-Containers Orchestration with Live Migration and High-Availability for Microservices Multi-Containers Orchestration with Live Migration and High-Availability for Microservices Meet Our Presenters Jay Lyman Research Manager, Cloud Platforms, 451 Research Ruslan Synytsky CEO and Co-founder,

More information

HTCaaS: Leveraging Distributed Supercomputing Infrastructures for Large- Scale Scientific Computing

HTCaaS: Leveraging Distributed Supercomputing Infrastructures for Large- Scale Scientific Computing HTCaaS: Leveraging Distributed Supercomputing Infrastructures for Large- Scale Scientific Computing Jik-Soo Kim, Ph.D National Institute of Supercomputing and Networking(NISN) at KISTI Table of Contents

More information

IBM ICE (Innovation Centre for Education) Welcome to: Unit 1 Overview of delivery models in Cloud Computing. Copyright IBM Corporation

IBM ICE (Innovation Centre for Education) Welcome to: Unit 1 Overview of delivery models in Cloud Computing. Copyright IBM Corporation Welcome to: Unit 1 Overview of delivery models in Cloud Computing 9.1 Unit Objectives After completing this unit, you should be able to: Understand cloud history and cloud computing Describe the anatomy

More information

OGF Europe Tutorial. How to make sustainable a grid enabled e Infrastructure

OGF Europe Tutorial. How to make sustainable a grid enabled e Infrastructure 4th EGEE User Forum/OGF 25 and OGF Europe's 2nd International Event Le Ciminiere, Catania, Sicily, Italy 2-6 March 2009 OGF Europe Tutorial How to make sustainable a grid enabled e Infrastructure by Pasquale

More information

Real Time Monitor of Grid job executions

Real Time Monitor of Grid job executions Journal of Physics: Conference Series Real Time Monitor of Grid job executions To cite this article: D J Colling et al 2010 J. Phys.: Conf. Ser. 219 062020 View the article online for updates and enhancements.

More information

Genomic Data Is Going Google. Ask Bigger Biological Questions

Genomic Data Is Going Google. Ask Bigger Biological Questions Genomic Data Is Going Google Ask Bigger Biological Questions You know your research could have a significant scientific impact and answer questions that may redefine how a disease is diagnosed or treated.

More information

BUILDING A PRIVATE CLOUD

BUILDING A PRIVATE CLOUD INNOVATION CORNER BUILDING A PRIVATE CLOUD How Platform Computing s Platform ISF* Can Help MARK BLACK, CLOUD ARCHITECT, PLATFORM COMPUTING JAY MUELHOEFER, VP OF CLOUD MARKETING, PLATFORM COMPUTING PARVIZ

More information

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes ACCELERATING GENOMIC ANALYSIS ON THE CLOUD Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia

More information

Secure information access is critical & more complex than ever

Secure information access is critical & more complex than ever WHITE PAPER Purpose-built Cloud Platform for Enabling Identity-centric and Internet of Things Solutions Connecting people, systems and things across the extended digital business ecosystem. Secure information

More information

Towards Exascale Across Scales!

Towards Exascale Across Scales! Towards Exascale Across Scales! Shantenu Jha Rutgers Advanced DIstributed Cyberinfrastructure & Applications Laboratory (RADICAL) http://radical.rutgers.edu Big Science to the Long Tail of Science Convergence

More information

Building a Data Lake on AWS

Building a Data Lake on AWS Partner Network EBOOK: Building a Data Lake on AWS Contents What is a Data Lake? Benefits of a Data Lake on AWS Building a Data Lake On AWS Featured Data Lake Partner Bronze Drum Consulting Case Study:Rosetta

More information

Resource Allocation for the French National Grid Initiative

Resource Allocation for the French National Grid Initiative Author manuscript, published in "Lecture notes in computer science 7156 (2012) pp.55-63" DOI : 10.1007/978-3-642-29740-3_8 Resource Allocation for the French National Grid Initiative Gilles Mathieu, Hélène

More information

Adobe Deploys Hadoop as a Service on VMware vsphere

Adobe Deploys Hadoop as a Service on VMware vsphere Adobe Deploys Hadoop as a Service A TECHNICAL CASE STUDY APRIL 2015 Table of Contents A Technical Case Study.... 3 Background... 3 Why Virtualize Hadoop on vsphere?.... 3 The Adobe Marketing Cloud and

More information

Microsoft Azure Cloud Integration Infrastructure & Technical Overview

Microsoft Azure Cloud Integration Infrastructure & Technical Overview Microsoft Azure Cloud Integration Infrastructure & Technical Overview MessageSolution, Inc. 1851 McCarthy Blvd. Suite 105 Milpitas, CA 95035 Tel: +001 408 383 0100 E: AMTeam@MessageSolution.com W: www.messagesolution.com

More information

Compiere ERP Starter Kit. Prepared by Tenth Planet

Compiere ERP Starter Kit. Prepared by Tenth Planet Compiere ERP Starter Kit Prepared by Tenth Planet info@tenthplanet.in www.tenthplanet.in 1. Compiere ERP - an Overview...3 1. Core ERP Modules... 4 2. Available on Amazon Cloud... 4 3. Multi-server Support...

More information

IBM Platform LSF & PCM-AE Dynamische Anpassung von HPC Clustern für Sondernutzung und Forschungskollaborationen

IBM Platform LSF & PCM-AE Dynamische Anpassung von HPC Clustern für Sondernutzung und Forschungskollaborationen IBM Platform LSF & PCM-AE Dynamische Anpassung von HPC Clustern für Sondernutzung und Forschungskollaborationen ZKI Meeting 2012 - Düsseldorf Heiko Lehmann Account Manager Heiko.lehmann@de.ibm.com Bernhard

More information

SAS and Hadoop Technology: Overview

SAS and Hadoop Technology: Overview SAS and Hadoop Technology: Overview SAS Documentation September 19, 2017 The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2015. SAS and Hadoop Technology: Overview.

More information

Origin and Evolution of the Spanish NGI

Origin and Evolution of the Spanish NGI Thanks to: V. Hernández, I. Campos, I. Martín Llorente, I.Blanquer, J.Gomes Origin and Evolution of the Spanish NGI Jesús Marco de Lucas [marco (at) ifca.unican.es] CSIC Research Professor at Instituto

More information

Scalability and High Performance with MicroStrategy 10

Scalability and High Performance with MicroStrategy 10 Scalability and High Performance with MicroStrategy 10 Enterprise Analytics and Mobility at Scale. Copyright Information All Contents Copyright 2017 MicroStrategy Incorporated. All Rights Reserved. Trademark

More information

May 2012 Oracle Spatial User Conference

May 2012 Oracle Spatial User Conference 1 May 2012 Oracle Spatial User Conference May 23, 2012 Ronald Reagan Building and International Trade Center Washington, DC USA Eamon Walsh CTO, espatial GIS Software as a Service for Business using Oracle

More information

World Leading Storage Cloud at ETH Zürich

World Leading Storage Cloud at ETH Zürich Felix Sutter Dr. Tilo Steiger IT Architect, IBM Switzerland Ltd Head of Storage Services, ETH Zürich Informatikdienste World Leading Storage Cloud at ETH Zürich Agenda The challenges From IT Silos to Cloud

More information

Kepion Solution vs. The Rest. A Comparison White Paper

Kepion Solution vs. The Rest. A Comparison White Paper Kepion Solution vs. The Rest A Comparison White Paper In the Business Intelligence industry, Kepion competes everyday with BI vendors such as IBM Cognos, Oracle Hyperion and SAP BusinessObjects. At first

More information

QuickSpecs. Why use Platform LSF?

QuickSpecs. Why use Platform LSF? Overview 8 is a job management scheduler from Platform Computing Inc. Why use? A powerful workload manager for demanding, distributed and mission-critical computing environments. Includes a comprehensive

More information

Product presentation. Fujitsu HPC Gateway SC 16. November Copyright 2016 FUJITSU

Product presentation. Fujitsu HPC Gateway SC 16. November Copyright 2016 FUJITSU Product presentation Fujitsu HPC Gateway SC 16 November 2016 0 Copyright 2016 FUJITSU In Brief: HPC Gateway Highlights 1 Copyright 2016 FUJITSU Convergent Stakeholder Needs HPC GATEWAY Intelligent Application

More information

Architectural Design. Objectives

Architectural Design. Objectives Architectural Design Ian Sommerville 2004 Software Engineering, 7th edition. Chapter 11 Slide 1 Objectives To introduce architectural design and to discuss its importance To explain the architectural design

More information

Single Euro Payments Area

Single Euro Payments Area Single Euro Payments Area Background The Single Euro Payments Area (SEPA) is a payment-integration initiative of the European Union for simplification of bank transfers. As of March 2012, SEPA consists

More information

Dell Advanced Infrastructure Manager (AIM) Automating and standardizing cross-domain IT processes

Dell Advanced Infrastructure Manager (AIM) Automating and standardizing cross-domain IT processes Systems Automating and standardizing cross-domain IT processes By Hal Clark The combination of Dell Advanced Infrastructure Manager (AIM) and BMC Atrium Orchestrator enables the creation of automated,

More information

Cisco Workload Optimization Manager: Setup and Use Cases

Cisco Workload Optimization Manager: Setup and Use Cases Cisco Workload Optimization Manager: Setup and Use Cases 2017 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public Information. Page 1 of 49 Contents Introduction Minimum requirements

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. 1 Copyright 2011, Oracle and/or its affiliates. All rights ORACLE PRODUCT LOGO Virtualization and Cloud Deployments of Oracle E-Business Suite Ivo Dujmović, Director, Applications Development 2 Copyright

More information

Dashboard for the LHC experiments.

Dashboard for the LHC experiments. Dashboard for the LHC experiments. J. Andreeva 1, S. Belov 2, A. Berejnoj 3, C. Cirstoiu 1,4, Y.Chen 5, T.Chen 5, S. Chiu 5, M. De Francisco De Miguel 1, A. Ivanchenko 1, B. Gaidioz 1, J. Herrala 1, M.

More information

NSF {Program (NSF ) first announced on August 20, 2004} Program Officers: Frederica Darema Helen Gill Brett Fleisch

NSF {Program (NSF ) first announced on August 20, 2004} Program Officers: Frederica Darema Helen Gill Brett Fleisch NSF07-504 {Program (NSF04-609 ) first announced on August 20, 2004} Program Officers: Frederica Darema Helen Gill Brett Fleisch Computer Systems Research Program: Components and Thematic Areas Advanced

More information

InfoSphere DataStage Grid Solution

InfoSphere DataStage Grid Solution InfoSphere DataStage Grid Solution Julius Lerm IBM Information Management 1 2011 IBM Corporation What is Grid Computing? Grid Computing doesn t mean the same thing to all people. GRID Definitions include:

More information

Datasheet FUJITSU Software UForge AppCenter 3.7

Datasheet FUJITSU Software UForge AppCenter 3.7 Datasheet FUJITSU Software UForge AppCenter 3.7 Hybrid IT Application Delivery and Migration Hybrid IT Application Delivery Hybrid IT adoption continues to grow at a rapid pace. Enterprises are looking

More information

Connected Geophysical Solutions Real-Time Interactive Seismic Modeling & Imaging

Connected Geophysical Solutions Real-Time Interactive Seismic Modeling & Imaging Connected Geophysical Solutions Real-Time Interactive Seismic Modeling & Imaging September 2011 Seismic Imaging For ninety years, any given oil company s profitability has relied largely on the low risk

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

BlackPearl Customer Created Clients for Media & Entertainment Using Free & Open Source Tools

BlackPearl Customer Created Clients for Media & Entertainment Using Free & Open Source Tools BlackPearl Customer Created Clients for Media & Entertainment Using Free & Open Source Tools Contents ABSTRACT... 3 INTRODUCTION... 3 BULDING A CUSTOMER CREATED CLIENT... 5 BlackPearl Simulator... 5 Eon

More information

Scheduling and Resource Management in Grids

Scheduling and Resource Management in Grids Scheduling and Resource Management in Grids ASCI Course A14: Advanced Grid Programming Models Ozan Sonmez and Dick Epema may 13, 2009 1 Outline Resource Management Introduction A framework for resource

More information

Nuances of Low Cost Effective LIMS Implementation: Can SaaS be an option?

Nuances of Low Cost Effective LIMS Implementation: Can SaaS be an option? February 2010 Nuances of Low Cost Effective LIMS Implementation: Can SaaS be an option? By Padmanabhan Nagalingam Contents Abstract 2 What is LIMS? 3 What is Cloud Computing? 3 LIMS through Cloud Computing

More information

Service-oriented Architecture with BS2000/OSD

Service-oriented Architecture with BS2000/OSD Service-oriented Architecture with BS2000/OSD Issue April 2009 Pages 6 Introduction BS2000 applications are deployed to handle core processes in industrial and commercial organizations and public authorities

More information

In Cloud, Can Scientific Communities Benefit from the Economies of Scale?

In Cloud, Can Scientific Communities Benefit from the Economies of Scale? PRELIMINARY VERSION IS PUBLISHED ON SC-MTAGS 09 WITH THE TITLE OF IN CLOUD, DO MTC OR HTC SERVICE PROVIDERS BENEFIT FROM THE ECONOMIES OF SCALE? 1 In Cloud, Can Scientific Communities Benefit from the

More information

Oracle Platform as a Service and Infrastructure as a Service Public Cloud Service Descriptions-Metered & Non-Metered.

Oracle Platform as a Service and Infrastructure as a Service Public Cloud Service Descriptions-Metered & Non-Metered. Oracle Platform as a Service and Infrastructure as a Service Public Cloud Service Descriptions-Metered & Non-Metered November 20, 2017 Contents GLOSSARY PUBLIC CLOUD SERVICES-NON-METERED... 9 API Call...

More information

CLOUD COMPUTING- A NEW EDGE TO TECHNOLOGY

CLOUD COMPUTING- A NEW EDGE TO TECHNOLOGY CLOUD COMPUTING- A NEW EDGE TO TECHNOLOGY Prof. Pragati Goel Asso. Prof. MCA Dept., Sterling Institute of Management Studies, Nerul, Navi Mumbai. Navi Mumbai, India Email: goelpragati78@gmail.com The Cloud

More information

Research Data Management

Research Data Management globus online Research Data Management www.globusonline.org Rachana Ananthakrishnan University of Chicago & Argonne National Lab We started with technology proven in many large-scale grids GridFTP GRAM

More information

Data Center Operating System (DCOS) IBM Platform Solutions

Data Center Operating System (DCOS) IBM Platform Solutions April 2015 Data Center Operating System (DCOS) IBM Platform Solutions Agenda Market Context DCOS Definitions IBM Platform Overview DCOS Adoption in IBM Spark on EGO EGO-Mesos Integration 2 Market Context

More information

Introducing FUJITSU Software Systemwalker Centric Manager V15.0

Introducing FUJITSU Software Systemwalker Centric Manager V15.0 Introducing FUJITSU Software Systemwalker Centric Manager V15.0 < Version 1.0 > November 2013 FUJITSU LIMITED Contents Integrated Monitoring Required in Virtualization/Server Integration Characteristics

More information

Sizing SAP Central Process Scheduling 8.0 by Redwood

Sizing SAP Central Process Scheduling 8.0 by Redwood Sizing SAP Central Process Scheduling 8.0 by Redwood Released for SAP Customers and Partners January 2012 Copyright 2012 SAP AG. All rights reserved. No part of this publication may be reproduced or transmitted

More information

Science as a Service Accelerating Scientific Discovery using Cloud

Science as a Service Accelerating Scientific Discovery using Cloud Science as a Service Accelerating Scientific Discovery using Cloud Ravi Madduri madduri@anl.gov Internet2 Global Summit 2016, Chicago Outline Scientific Discovery Process The Case for Cloud Science as

More information

DIET: New Developments and Recent Results

DIET: New Developments and Recent Results A. Amar 1, R. Bolze 1, A. Bouteiller 1, A. Chis 1, Y. Caniou 1, E. Caron 1, P.K. Chouhan 1, G.L. Mahec 2, H. Dail 1, B. Depardon 1, F. Desprez 1, J. S. Gay 1, A. Su 1 LIP Laboratory (UMR CNRS, ENS Lyon,

More information

IBM Storwize Family Scaling Capabilities and Value

IBM Storwize Family Scaling Capabilities and Value Technology Insight Paper IBM Storwize Family Scaling Capabilities and Value By Randy Kerns January, 2013 Enabling you to make the best technology decisions IBM Storwize Family Scaling Capabilities and

More information

Superlink-online and BOINC

Superlink-online and BOINC Distributed Systems Laboratory Computational Biology Laboratory Superlink-online and BOINC Artyom Sharov, Mark Silberstein, Dan Geiger, Assaf Schuster CS Department, Technion www.eu-egee.org The 3rd Pan-Galactic

More information

2017 IBM Elastic Storage Server - Update

2017 IBM Elastic Storage Server - Update IBM Elastic Storage Server - Update Falk Steinbrueck Program Manager, IBM Systems Storage Disclaimer IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without

More information

Oracle Big Data Cloud Service

Oracle Big Data Cloud Service Oracle Big Data Cloud Service Delivering Hadoop, Spark and Data Science with Oracle Security and Cloud Simplicity Oracle Big Data Cloud Service is an automated service that provides a highpowered environment

More information

Optimize your FLUENT environment with Platform LSF CAE Edition

Optimize your FLUENT environment with Platform LSF CAE Edition Optimize your FLUENT environment with Platform LSF CAE Edition Accelerating FLUENT CFD Simulations ANSYS, Inc. is a global leader in the field of computer-aided engineering (CAE). The FLUENT software from

More information

Service Management for the Mobile Mainframe Delivered via Cloud Lunch and Learn

Service Management for the Mobile Mainframe Delivered via Cloud Lunch and Learn Service Management for the Mobile Mainframe Delivered via Cloud Lunch and Learn Mike Baskey, DE, Cloud and Smarter Infrastructure IBM August 15, 2013 Session 14345 Mainframe applications increasingly used

More information

Ultimus Adaptive BPM Suite 8 Product Overview

Ultimus Adaptive BPM Suite 8 Product Overview Accelerate Performance Adaptive BPM Suite 8 Product Overview Contact Information 15000 Weston Parkway Cary, North Carolina 27513 USA Telephone: +1 919 678-0900 Fax: +1 919 678-0901 Accelerate Performance

More information

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Leveraging Oracle Big Data Discovery to Master CERN s Data Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Manuel Martin Marquez Intel IoT Ignition Lab Cloud and

More information

Service Virtualization

Service Virtualization Service Virtualization A faster, more efficient and less costly way to develop and test enterprise-class applications As cloud and mobile computing gain rapid acceptance, IT departments are expected to

More information

EMC ATMOS. Managing big data in the cloud A PROVEN WAY TO INCORPORATE CLOUD BENEFITS INTO YOUR BUSINESS ATMOS FEATURES ESSENTIALS

EMC ATMOS. Managing big data in the cloud A PROVEN WAY TO INCORPORATE CLOUD BENEFITS INTO YOUR BUSINESS ATMOS FEATURES ESSENTIALS EMC ATMOS Managing big data in the cloud ESSENTIALS Purpose-built cloud storage platform designed for unlimited global scale Intelligently automates management of content through highly flexible policies

More information

Trasformare il Business con Soluzioni Cloud

Trasformare il Business con Soluzioni Cloud Trasformare il Business con Soluzioni Cloud Marco Sebastiani Product Manager, IBM Tivoli Cloud Solutions 1 What is different about cloud computing? Without cloud computing With cloud computing Virtualized

More information

glite Workload Management System Performance Measurements

glite Workload Management System Performance Measurements EGEE-PUB-26-36 glite Workload Management System Performance Measurements Svraka, N (IPB) et al Revised on 2 February 27 Proceedings of IV INDEL, Banjaluka EGEE is a project funded by the European Commission

More information

POWER PORTFOLIO PRODUCT BROCHURE

POWER PORTFOLIO PRODUCT BROCHURE POWER PORTFOLIO PRODUCT BROCHURE 2 Power Portfolio Change is constant and its impacts can affect you personally. Whether man-made or naturally occurring, our changing landscape influences your plans and

More information

PeopleSoft on Oracle Ravello Cloud Service ORACLE WHITE PAPER AUGUST 2017

PeopleSoft on Oracle Ravello Cloud Service ORACLE WHITE PAPER AUGUST 2017 PeopleSoft on Oracle Ravello Cloud Service ORACLE WHITE PAPER AUGUST 2017 Oracle Ravello Cloud Service Oracle Ravello is an overlay cloud that enables enterprises to run their VMware and KVM applications,

More information

SAP Cloud Platform Pricing and Packages

SAP Cloud Platform Pricing and Packages Platform Pricing and Packages Get Started Packages Fast. Easy. Cost-effective. Get familiar and up-and-running with Platform in no time flat. Intended for non-production use. Designed to help users become

More information