Integrating New Cost Model into HMA-Based Grid Resource Scheduling

Similar documents
Trust Based Grid Scheduling Algorithm for Commercial Grids

Research on Architecture and Key Technology for Service-Oriented Workflow Performance Analysis

Incentive-Based P2P Scheduling in Grid Computing

Enhancing Resource Reservation by Applying Defer Time in FCFS Scheme for Grid Computing Environments

Availability Modeling of Grid Computing Environments Using SANs

Performance Evaluation of Scheduling Policies for Volunteer Computing

RISK MANAGEMENT OF RESOURCE ALLOCATION IN GRID COMPUTING

Resource Scheduling in Hybrid Grid Environment

Trust and Reputation Based Association Among Grid Entities

An Evaluation of Economy-based Resource Trading and Scheduling on Computational Power Grids for Parameter Sweep Applications

QoS-based Scheduling for Task Management in Grid Computing

Application-adaptive resource scheduling in a computational grid *

Economic Scheduling in Grid Computing

GRID META BROKER SELECTION STRATEGIES FOR JOB RESERVATION AND BIDDING

American Journal of Engineering Research (AJER) Implementing Short Term Scheduler and Preemptive Algorithm in Grid Computing Scheduling

Path Optimization for Inter-Switch Handoff in Wireless ATM Networks

CROWNBench: A Grid Performance Testing System Using Customizable Synthetic Workload

WEB SERVICES COMPOSING BY MULTIAGENT NEGOTIATION

Inferring Social Ties across Heterogeneous Networks

AGORA: An Architecture for Strategyproof Computing in Grids

Web Customer Modeling for Automated Session Prioritization on High Traffic Sites

Packet Scheduling in Cloud by Employing Genetic Algorithm

Enhanced Algorithms for Multi-Site Scheduling

A Proactive Fault Tolerant Strategy for Desktop Grid

Our MCMC algorithm is based on approach adopted by Rutz and Trusov (2011) and Rutz et al. (2012).

Modeling of Parameters in Supercomputer Workloads

Resource Allocation Strategies in a 2-level Hierarchical Grid System

A logistic regression model for Semantic Web service matchmaking

Cascading Behavior in Networks. Anand Swaminathan, Liangzhe Chen CS /23/2013

Mixed Constraint Fuzzy Transshipment Problem

JOB AND APPLICATION-LEVEL SCHEDULING IN DISTRIBUTED COMPUTING

Railroad Yard Location Optimization Using A Genetic Algorithm

Modeling of Parameters in Supercomputer Workloads

Towards Modelling-Based Self-adaptive Resource Allocation in Multi-tiers Cloud Systems

On the Impact of Reservations from the Grid on Planning-Based Resource Management

The Job Assignment Problem: A Study in Parallel and Distributed Machine Learning

A RESOURCE LEASING POLICY FOR ON-DEMAND COMPUTING

AN ADVANCED IWD BASED HYPER-HEURISTIC WORKFLOW SCHEDULING IN COMPUTATIONAL GRID

An Autonomic Service Oriented Architecture in Computational Engineering Framework

A self-configuration model for execution management in Grid computing

Near-Balanced Incomplete Block Designs with An Application to Poster Competitions

PRIORITISATION OF ROAD CORRIDORS BY DEVELOPING COMPOSITE INDEX

Chapter 2 Research on Logistics Center Layout Based on SLP

Supporting Peer-To-Peer Collaboration Through Trust

Decision Resource Management and Scheduling on the Grid

Automatic Neighboring BS List Generation Scheme for Femtocell Network

Expert System Applied to High-Speed Railway Track Circuit Coding and Its Simulation

Application of Data-Mining and Knowledge Discovery in Automotive Data Engineering

Design and Development of Enhanced Optimization Techniques based on Ant Colony Systems

Mapping of Self-Organization Properties and Non-Functional Requirements in Smart Grids

Decentralized Scheduling of Bursty Workload on Computing Grids

A Method for Determining Inventory Policy Parameters for a Single Product under Limited Space

CMS Conference Report

The Design and Implementation of Railway Freight Applications on China's Railway Freight Data Grid Platform

Job Type Approach for Deciding Job Scheduling in Grid Computing Systems

SOFTWARE AGENT AND CLOUD COMPUTING: A BRIEF REVIEW Gambang, Pahang, Malaysia 2 College of Information Technology, Universiti Tenaga Nasional,

CHAPTER 4 A NEW ALTERNATE METHOD OF TRANS-SHIPMENT PROBLEM

COMPARING VARIOUS WORKFLOW ALGORITHMS WITH SIMULATED ANNEALING TECHNIQUE

Time Optimal Profit Maximization in a Social Network

Operation and supply chain management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology Madras

Trust-based IoT Participatory Sensing for Hazard Detection and Response

The Staffing Problem of the N-Design Multi-Skill Call Center Based on Queuing Model

TRANSPORTATION PROBLEM AND VARIANTS

GROUPING BASED USER DEMAND AWARE JOB SCHEDULING APPROACH FOR COMPUTATIONAL GRID

Enterprise Output Management For Banking, Finance, and Insurance

Scalability Investigations on Communication Traffic in Distributed Routing of Autonomous Logistic Objects

Autonomic Approach to Planning and Scheduling in Networked Small Factories

Scheduling and Resource Management in Grids

15th ICCRTS. The Evolution of C2. Title of Paper: Network Centric Simulation Architecture. Topic(s): 6, 1, 8

Optimal Design, Evaluation, and Analysis of AGV Transportation Systems Based on Various Transportation Demands

QoS-Aware Routing in Wireless Networks Using Aerial Vehicles

A case for cooperative and incentive-based federation of distributed clusters

Negotiation Strategies Considering Opportunity Functions for Grid Scheduling

Asymptotic Analysis of Real-Time Queues

Design and Implementation of Office Automation System based on Web Service Framework and Data Mining Techniques. He Huang1, a

On the Comparison of CPLEX-Computed Job Schedules with the Self-Tuning dynp Job Scheduler

A case for cooperative and incentive-based federation of distributed clusters

Managing Risk of Inaccurate Runtime Estimates for Deadline Constrained Job Admission Control in Clusters

Job co-allocation strategies for multiple high performance computing clusters

System Virtualization and Efficient ID Transmission Method for RFID Tag Infrastructure Network

Distributed Accounting on the Grid

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

Applications of Intelligent Agent Technology to the Grid

REA VALUE CHAIN AND SUPPLY CHAIN

Group #2 Project Final Report: Information Flows on Twitter

Two Level Job-Scheduling Strategies for a Computational Grid

Design of intelligent logistics information platform based on wireless sensor RFID technology. Hai Ye 1, a

Design and Implementation of Heterogeneous Workflow System Integration Mode Based on SOA Framework

A Deadline and Budget Constrained Cost-Time Optimisation Algorithm for Scheduling Task Farming Applications on Global Grids

SCHEDULING JOBS ON GRID COMPUTING USING FIREFLY ALGORITHM

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds.

C. Wohlin, P. Runeson and A. Wesslén, "Software Reliability Estimations through Usage Analysis of Software Specifications and Designs", International

COST ESTIMATION OF ADVANCE RESERVATIONS OVER QUEUED JOBS: A QUANTITATIVE STUDY

CONVERGENCE OF CLOUD COMPUTING, SERVICE ORIENTED ARCHITECTURE AND ENTERPRISE ARCHITECTURE

Service Selection in Web Service Composition: A Comparative Review of Existing Approaches

Deterministic Crowding, Recombination And Self-Similarity

COLLABORATIVE WORKSPACE OVER SERVICE-ORIENTED GRID

Goodbye to Fixed Bandwidth Reservation: Job Scheduling with Elastic Bandwidth Reservation in Clouds

Decision Support System for reducing Post-Harvest Loss in Supply Chain Logistics using geographical Information System in Agriculture

Intelligent Agent Supported Flexible Workflow Monitoring System

Transcription:

Integrating New Cost Model into HMA-Based Grid Resource Scheduling Jun-yan Zhang, Fan Min, and Guo-wei Yang College of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610051, China {jyzhang, minfan, gwyang}@uestc.edu.cn Abstract. Grid systems can provide a virtual framework for management and scheduling of resources across different domains. This paper proposes an HMA-base grid resource scheduling system to implement resource finding and scheduling. A new cost model is also given, considering resource finding cost and resource deciding cost beyond traditional model. In succession, the new cost model is integrated into the HMA-base grid resource scheduling system. Our experiment shows that optimal solution under traditional cost model is no longer optimal under our model. Keywords. Grid, resource scheduling, Agent, cost model 1 Introduction In traditional distributed computing environments (DCEs), resource management systems (RMSs) were primarily responsible for allocating resources for tasks [3]. They also performed functions such as resource discovery and monitoring to support their primary roles. With large amount of distributed resources and users, grid systems can provide a virtual framework for management and scheduling of resources across different domains, and they have been the focus of much research activities in recent years. A computational grid is an emerging computing infrastructure that enables effective access to high performance computing resources [5]. Resource management and scheduling are key grid services, where issues of utilizing grid resources reasonably by minimizing total cost represent a common concern for most grid infrastructure and scheduling algorithm developers. Resource management and scheduling in grid systems [1] [2] is challenging due to: (a) geographical distribution of resources; (b) resource heterogeneity; (c) autonomously administered grid domains having their own resource policies and practices; and (d) grid domains using different access and cost models. In this paper, we adopt Hierarchical Multi-Agent-based (HMA-based) methodology to grid resource scheduling, achieved by integrating cost into the HMA-base grid resource scheduling system to implement minimal cost resource scheduling. M. Li et al. (Eds.): GCC 2003, LNCS 3033, pp. 652 659, 2004. Springer-Verlag Berlin Heidelberg 2004

Integrating New Cost Model into HMA-Based Grid Resource Scheduling 653 The paper is organized as follows: Section 2 introduces the traditional cost model. In section 3, the HMA-based grid resource scheduling system is described. In section 4, we integrate new cost model into our HMA-based grid resource scheduling systems. Comparing experimental results are included in section 5 and the paper concludes in section 6. 2 Traditional Cost Model The traditional cost model of the Internet forms the basis of most call admission, routing and reservation algorithms today. The model combines the cost classifying, switching, queuing and scheduling at a node with the cost of transmission over the next link in one abstract figure associated with each node-link pair (See Figure 1)[6]. Based on this model, the total cost of a system can be defined as: C = Cs + Cq + Cl (1) Where, C denotes the total cost of system; Cs denotes the cost of switching and scheduling; Cq denotes the cost of queuing; Cl denotes the cost of link. Fig. 1. Traditional Cost Model Although simple, this model has proven effective and robust in designing many popular network protocols. Shortest Path Routing, for example, uses this cost model to find the route between a pair of origin-destination nodes by minimizing the sum of per-hop costs.

654 J.-y. Zhang, F. Min, and G.-w. Yang 3 HMA-Based Grid Resource Scheduling System In our resource scheduling system, we consider the whole grid system as a Global grid, which is made up of n Grid Domains. Let GD i (i = 1, 2,, n) denote the ith Grid Domain (See Figure 2). The GD i is an autonomous, administrative and interactive entity consisting of a set of resources, services and users managed by a single Management Agent (MAgent i ). In our system, we divide each GD into two subdomains: (a) a resource domain (RD) which signifies the resources within the GD; and (b) a user domain (UD) which signifies the users within the GD. Fig. 2. HMA-Based Grid Resource Scheduling System First of all, we define γ = (γi, j) n k as system resource matrix, where γi, j specifies how many type j resources GD i possesses. We also define ξ = (ξi, j) n k as system processing power matrix, where ξi, j specifies processing power of GD i for the jth type of resource, clearly ξij 0 for any legal i, j. In this system, Resource Deciding Agent (RDAgent) maintains an n n table D which records the distance between GDs, where D ij denotes the distance between GD i and GD j. It also holds a quadruple Θ (θ, γθ, ξ, μ) to describe each GD in detail provides by each MAgent. Where, (a) θ denotes serial number of current GD, θ = 1, 2,, n; (b) γθ = [γθ1, γθ2,, γθk] specifies local resources; (c) ξθ = [ξθ1, ξθ2,, ξθk] denotes the processing power of current GD; (d) μ = [μ1, μ1,, μl] denotes the users of UDθ, μ i (i 0)signifies the ith user within UDθ. At the beginning, RDAgent initializes the distance table D and quadruple Θ (θ, γ, ξ, μ) according to the information submitted by all existed MAgents. When RDAgent polls MAgents on its own initiative, MAgents would refresh the table and the quadru-

Integrating New Cost Model into HMA-Based Grid Resource Scheduling 655 Fig. 3. A New Cost Model ple if the update happens within current GD. Otherwise, MAgents needn t submit the update and only keep the update information locally. Let TR denote a task requirement originated from a GD, it includes the explicit user IP address, and its original GD. It also includes the information of resource requirement. The requirement amount of each type of resource is denoted as x 1, x 2,, x K respectively. After a TR is recognized and analyzed by Resource Scheduling Agent (RSAgent), RSAgent orders Resource Finding Agent (RFAgent) to find the location of resources which can meet the requirement TR. At first, RFAgent queries RDAgent for resource information. Because RDAgent has a distance table D and a quadruple Θ, after RFAgent finishes the finding processing, it can get a resource deployment matrix RDP n k. The columns specify RS and the rows specify Grid Domains. If RS m locates at GR θ and γ θm x m, we let RDP θm =(γ θm, D iθ ), else we let RDP θm = (0, ). Now RFAgent examines this matrix RDP n k, if RFAgent finds a column whose elements are all (0, ), it will ask the RDAgent to update table D and quadruple Θ. After update table D and quadruple Θ, if matrix RDP n k still does not satisfy the TR, the next time update will start after a period of time (which we set to 5 minutes). If appropriate resource combination cannot be found after some times (which we set to 3 times), RFAgent concludes that the TR cannot be met and the task cannot be implemented, and sends this message to RSAgent. RSAgent will cancel this task and notify the user. If appropriate resource combination can be found, RFAgent sends this resource combination information to RSAgent. RSAgent will schedule resources according to this information and return result to the user, which procedure will be given in the next section. 4 Integrating New Cost Model into HMA-Based Grid Resource Scheduling System Generally speaking, resource schedulers and resource managers tend to choose the nearest resource because they make decisions depending upon traditional cost model [4]. Now we present a new cost model based on grid systems; therefore, the cost of resource finding and cost of resource deciding will affect the total cost significantly.

656 J.-y. Zhang, F. Min, and G.-w. Yang Thus, the system total cost C consists of three parts: cost of resource finding C F, cost of resource deciding C D, and cost of resource scheduling Cs. Its architecture is illustrated with Figure 3. On the view of Figure 3, C S can be further divided into processing cost C P and transmission cost C T. The total cost of resource scheduling C can be defined as: C = C F + C D + C P + C T (2) Now we can integrate the new cost model into HMA-based grid resource scheduling systems to implement reasonable scheduling with minimal cost. On receiving a TR, RSAgent will forward resources and processing power requirement to RFAgent. RFAgent examines this matrix RDP n k, if it finds a column whose elements are all (0, ), it will ask the RDAgent to update table D and quadruple Θ. We use integer variable count to record the update times. After refreshing table D and quadruple Θ, if matrix RDP n k still does not satisfy the TR, the next time update will start after a period of time (about 5 minutes). Therefore, the cost of finding resources can be measured by count and denoted as C F. That is to say, C F = count. If count > M (M is a constant and M >0. In our paper, we set M = 3), then let C F =, RFAgent consider the TR cannot be met and tell this message to RSAgent. RSAgent will cancel this task and notify the user. If only one element is not (0, ) in each column, this means only one kind combination of resources and Grid Domains can satisfy TR. So RSAgent has no choice but select this combination as the only solution regardless of cost C. If there is more than one resource combinations can satisfy TR, RDAgent will list all possible resource combinations and make decisions which combination to be chosen. The combinations can be signified as follows: CB (TR) = [RA, RA,, RA ] (3) 1 2 K where RAi represents the GD which offers the ith resource, 0 for no requirement of respective resource. The requirement of each type of resource can be met by some GD. Each type of resources is selected independently, so we need choose among at most n values each time. Accordingly, the time complexity is O(n k) instead of O(n k ). If some CB α which satisfies TR and includes λ GD, which means there are λ kinds resources can be provided locally, and (k λ) kinds resources locate at other GDs. We can define C D as: C D = (k λ)/k (4) Resource scheduling cost C S is made up of processing cost C P and transmission cost C T. The stronger the processing power of GD, the smaller the C P of GD. For ξ i, j represents computation power of GDi for resource j, then k ξ = i= 1 is the total computational power when choosing respective combination. ξ i, j (5)

Integrating New Cost Model into HMA-Based Grid Resource Scheduling 657 Accordingly, we use processing power s reciprocal to signify C P, i.e. C P = 1/ξ. Because we assume the task to be processed cannot be divided further, the GD is unique. The father the distance between two GDs, the larger the C T. Accordingly, we use logarithm of the distance between GDs and GD i to signify C T, so we have: k C T = log Dij j= 1 (6) Table 1. Distance between GD i (unit: hop) GD 1 GD 2 GD 3 GD 4 GD 5 GD 6 GD 7 GD 8 GD 9 GD 10 GD 1 0 1 1 3 4 2 2 4 3 3 GD 2 0 1 2 3 2 1 3 4 2 GD 3 0 2 3 1 2 3 2 4 GD 4 0 1 1 1 3 2 2 GD 5 0 2 2 2 1 3 GD 6 0 2 2 1 3 GD 7 0 2 3 1 GD 8 0 1 1 GD 9 0 2 GD 10 0 Table 2. Initial Information about all GDs Table 3. Task Requirements Information γθ ξθ μθ Task1 γ13= 5 γ21 = 3 γ 53 = 4 5 6 2 μ51 Task2 γ34 = 1 γ71 = 2 3 9 μ92 Task3 γ12 = 9 γ31 = 7 γ81 = 1 γ10,1 = 5 1 3 2 7 μ13 Task4 γ11 = 2 γ44 = 4 7 8 μ74 Task5 γ61 = 4 γ92 = 5 γ10,3 = 1 1 8 3 μ61 Task6 γ33 = 6 γ73 = 3 γ81 = 1 γ94 = 1 7 3 2 1 μ45 Task7 γ24 = 2 γ62 = 9 4 1 μ10,1 Task8 γ41 = 3 γ83 = 3 γ93 = 6 5 3 3 μ33

658 J.-y. Zhang, F. Min, and G.-w. Yang 3 2.5 traditional cost model average cost 2 1.5 new cost model 1 0.5 10 20 30 40 50 60 70 80 90 sum of resource requirement Fig. 4. Performance comparison Based on equation (2), the system total resource scheduling cost is: C = count + (k λ)/k + 1/ξ + log k Dij j= 1 (7) RDAgent will choose the CB with minimal C and recommend to RSAgent. RSAgent would perform scheduling according this CB and return results to corresponding user. In this paper, we don t compare costs of two tasks, so how many resources (such as communication traffic, computing overhead and so on) are needed is not our concern when we calculate total cost C. That is to say, the requirement we compute is relative but not absolute. 5 Comparing Experiment We simulate a system containing 10 GDs. 4 types of resource are provided. Distance between these GDs are listed in Table 1, initial information about all GDs are listed in Table 2, and task requirements information are listed in Table 3. Figure 4 shows average cost of tasks under two conditions: 1, optimal solution with traditional cost model and 2, optimal solution with new cost model. Here we can see that optimal solution under traditional cost model may not be optimal under new cost model. 6 Conclusion With the growing popularity of middleware dedicated at making so-called grids of processing and storage resources, network based computing will soon offer to users a

Integrating New Cost Model into HMA-Based Grid Resource Scheduling 659 dramatic increase in the available aggregate processing power. We propose an HMAbased grid resource scheduling system and a new cost model is given. Especially, we formalize resource information and task requirement in detail. Comparing experiment shows that optimal solution under traditional cost model is no longer optimal under our model. References [1] I. Foster, C. Kesselman, and S. Tuecke, The anatomy of the Grid: Enabling scalable virtual organizations, Int l Journal on Supercomputer Applications, 2001. [2] K. Krauter, R. Buyya, and M. Maheswaran, A taxonomy and survey of Grid resource management systems, Software Practice and Experiance, Vol. 32, No. 2, Feb 2002, pp. 135 164. [3] F. Berman, R. Wolski, S. Figueira, J. Schopf, and G. Shao, Application-level scheduling on distributed heterogeneous networks, in Proc. 1996 Supercomputing, Pittsburgh, PA, USA, 1996. [4] B. Davie, S. Casner, C. Iturradle, D. Oran, J. Wroelawski, Integrated Services in the presence of Compressible Flows, Internet Draft, February 1999. [5] I. Foster and C. Kesselman, The GRID: Blueprint for a New Computing Infrastructure, Morgan-Kaufmann, 1998. [6] Kazem Najafi and Alberto Leon-Garcia, A Novel Cost Model for Active Networks, Communication Technology Proceedings 2000, vol.2, pp. 1073 1080.