A Negotiation Meta Strategy Combining Trade-off and Concession Moves

Size: px

Start display at page:

Download "A Negotiation Meta Strategy Combining Trade-off and Concession Moves"

Margaret Gaines
5 years ago
Views:

1 A Negotiation Meta Strategy Combining Trade-off and Concession Moves Raquel Ros, Carles Sierra Artificial Intelligence Research Institute IIIA Spanish Council for Scientific Research CSIC Bellaterra, Barcelona, Catalonia, Spain. Abstract. In this paper we present a meta strategy that combines two negotiation tactics. The first one based on concessions, and the second one, a trade-off tactic. The goal of this work is to demonstrate by eperimental analysis that the combination of different negotiation tactics allows agents to improve the negotiation process and as a result, to obtain more satisfactory agreements. The scenario proposed is based on two agents, a buyer and a seller, which negotiate over four issues. The paper presents the results and analysis of the meta strategy s behaviour. 1 Introduction During the last years automated negotiation has become an important challenge in the MAS field. It is the main key for autonomous agent interaction. In a multi agent system we find autonomous agents who decide which actions to eecute, when and how. In consequence it is often the case that their own interests conflict with others agents interests. To solve these conflicts, we must equip them with appropriate negotiation strategies; negotiation is not ust a protocol which can be used for agent communication, but also a way to achieve agreements when agents have conflicting interests. To specify a negotiation process we must define [4]: Negotiation Protocols: set of rules that govern the interaction (who can participate, which are the negotiation states, what events cause negotiation states to change and what are the valid actions of the participants in each particular state). Negotiation Obects: the range of issues over which an agreement must be reached. Agents Decision Making Models: the decision making apparatus the participants employ to act in line with the negotiation protocol in order to achieve their obectives. Basically, how agents negotiate. We can define the negotiation process as a distributed search through a space of potential agreements (Figure 1). The dimensionality of the space is determined by the structure of the negotiation obect. If we consider each attribute of our negotiation obect to have a separate dimension, we clearly see that the space in Figure 1 concerns two attributes. The preferences of the participants are represented by regions in the negotiation space. If an intersection between these regions eist, then a possible solution to the conflict may be found. The way to reach this solution can be done by interchanging proposals. Formally, a proposal is a solution to the negotiation problem. Each proposal can

2 be represented as a point (or region) in the negotiation space. The negotiation process consists of receiving the others agents proposals and responding to them with a new proposal or an acceptance. The process terminates when the participants find a mutually acceptable point in the negotiation space or when the protocol dictates that the search should be terminated (for whatever reason) without reaching an agreement. Researchers have proposed different negotiation models. The aim of this work is to combine two eisting models in order to improve the negotiation process, i.e. to propose more satisfactory offers for the agents. As a result, we epect to increase the agents utilities obtained by the agreement achieved. The idea is to switch from one model to the other trying to eploit as much as possible their advantages and to avoid their disadvantages. The first one, we will call it negoengine, is based on concessions [1], and the second one, the trade-off strategy [2] where multiple decision variables are traded-off against one another (e.g., paying a higher price in order to obtain an earlier delivery date or waiting longer in order to obtain a higher quality service). We also propose a modification to the trade-off algorithm in order to improve its performance. Somehow we try to guess the opponent s preferences from the negotiation dialogue in order to propose more acceptable offers. To summarise, this paper presents a step towards the combination of already eisting models and a modification of one them to compute more satisfactory offers. The rest of the paper is organised as follows. Section 2 gives a brief summary of the research done until now on negotiation. Section 3 describes the basic notation used through the paper and the strategies mentioned before. Section 4 details the modification of the trade-off algorithm and introduces the meta strategy proposed. Section 5 presents a scenario for the eperiments and the results obtained. Finally, section 6 shows the conclusions and future work. Ai Ai O Ai s current region of acceptability Ai s initial region of acceptability Previous offer Current offer A1 A3 O O A2 Fig. 1. Negotiation space

3 2 Related Work During the past years different negotiation approaches have been studied in the fields of game theory and artificial intelligence. Game theory generally assumes agents have complete information about their opponents preferences. However, in real environments, this situation is unrealistic. For this reason, researches in the AI field have designed new techniques to solve the negotiation problem with incomplete information and uncertainty. Faratin, et al. use heuristic functions to compute the proposals to offer at each time [1]. Parsons, Sierra and Jennings propose an argumentation model where agents echange proposals and counter-proposals arguing why they reect an offer. [11]. Mugdal and Vassileva propose a sequential decision making in which agents use a preference model of the user incorporating the risk attitude. The decision making is modelled using an influence diagram [9]. Zeng and Sycara propose a sequential decision model which is able to learn. For this purpose, they model the beliefs about the negotiation environment and the participating agents under a probabilistic framework using Bayesian learning representation and updating mechanism [14]. Li, Giampapa and Sycara study the impact of outside options during a negotiation process. They claim that an outside option affects the negotiation strategy via its impact on the reservation price [6]. Some work has also been done regarding agents with firm deadlines as private information. Fatima, Wooldridge and Jennings search for the optimal strategy to be selected based on the remaining negotiation time [3]. Sandholm and Vulkan show that the only sequential equilibrium outcome is one where the agents wait until the first deadline, at which point that agent concedes everything to the other [12]. For an etensive review on bilateral negotiation see [5]. Research has been mainly focused over different parts of the whole negotiation problem. However, trying to integrate all these parts in one general model is still a task to complete and few proposals can be found in the literature. Among them, Lopes et al. present a generic negotiation model. Their main goal is the integration of two models: an individual behaviour model and a negotiation model based on concessions [7]. As we already mentioned, much work has been done in epanding the negotiation process along different dimensions, for instance, time constraints, outside options, multilateral negotiations, etc. But little work has been done regarding the integration of already designed tactics. In this contet, this paper addresses the integration of two negotiation tactics [1,2] in order to improve the outcome. Some modifications have been done on the trade-off algorithm to learn the opponent s preferences in order to compute more satisfactory offers. 3 Negotiation Strategies In order to understand the notation used in previous models [1,2], we firstly describe their basics. Then, in the net subsections, we make a quick review of the negotiation models. Let i (i {a, b}) represent the negotiating agents and ( 1,..., n) be the decision variables under negotiation (attributes of our negotiation obect). Negotiations can range over quantitative (e.g. price, delivery time, and penalty) or qualitative (e.g. quality of service) decision variables. Quantitative decision variables are defined over a real domain (i.e. i Di = [mini, mai ]). Qualitative decision variables are defined

4 over a partially ordered set (i.e. i Di = {q 1, q 2,..., q p }). Each agent has a scoring function V i : Di [0, 1] that gives the score it assigns to a value of decision variable in the range of its acceptable values. For convenience, scores are kept in the interval [0, 1]. The relative importance that an agent assigns to each decision variable under negotiation is modelled as a weight, w i, that gives the importance of decision variable for agent i. We assume the weights of both agents are normalised, i.e. 1 n wi = 1, for all i {a, b}. An agent s scoring function for a contract, = ( 1,.... n ) in the multi-dimensional space defined by the decision variables value ranges, is then defined as: V i () = 1 n wi V i( ). We assume both parties have a deadline by when they must complete the negotiation. This time can be different for each agent and if its deadline passes the agent withdraws from the negotiation. An agent accepts a proposal when the value of the offered contract is higher than the offer the agent is ready to send at that moment in time. A negotiation thread between agents a and b at time t n is a finite sequence of proposals from one agent to the other ordered over time: X tn a b = (t1 a b, t2 b a, t3 a b,...) Optionally, the las element of the sequence is {accept, reect}. 3.1 NegoEngine This subsection describes the first negotiation model (for more details refer to [1]). It is based on defining a set of tactics to be used, either one at a time or as a combination of them. Tactics are the set of functions that determine how to compute the value of a decision variable. For instance: Time dependent: as time passes, the agent will concede more rapidly trying to achieve an agreement before arriving to the deadline. The value to be uttered by agent a for a decision variable at time t, with 0 t t a ma is computed as follows: { min t a = + α a (t)(ma a mina ) (1) min a + (1 αa (t))(ma a mina ) (2) (1) if V a is a decreasing function (2) if V a is an increasing function where α a is a function depending on time and parametrised by a value β R +. ( t α a (t) = t a ma For each value of β, infinite number of possible tactics can be represented. However, two qualitatively different classes can be identified: boulware tactics if β < 1, and conceder tactics if β > 1. ) 1 β

5 Behaviour dependent or Imitative: to imitate the opponent s behaviour. tn+1 = min a if P mina ma a if P > maa P otherwise The parameter P determines the type of imitation to be performed. We can find the following families: Relative Tit-For-Tat: the agent reproduces, in percentage terms, the behaviour that its opponent performed δ 1 steps ago. P = t n 2δ t n 2δ+2 tn 1 Absolute Tit-For-Tat (Absolute-TFT): the same as before, but in absolute terms. P = tn 1 + t n 2δ t n 2δ+2 Averaged Tit-For-Tat (Average-TFT): the agent applies the average of percentages of changes in a window of size λ 1 of its opponents history. P = t n 2λ tn 1 tn Once we define the tactics to be used during the negotiation process,we also define a combination strategy. We compute the values for the decision variables under negotiation according to each tactic. The final value of each decision variable is a linear combination of these values. To represent this linear combination we use a matri of weights Γ. Each column represents a tactic and each row, a decision variable. The matri value γ pm represents the weight assigned to the tactic m for the decision variable p. During the negotiation the Γ matri may change and then, the behaviour of the negotiating agent. 3.2 Trade-off The main idea of this tactic is to find a proposal with the same utility as the previous one offered, but epecting to be more acceptable for its opponent (for more details, see [2]). The problem here is how to determinate which offer may increase the opponent s utility, without knowing its preferences. Given an agent a, who receives a proposal y from agent b, the mechanism should allow agent a to choose a new proposal to offer to its opponent which fulfils two conditions: 1. the new proposal must have the same utility as the offer previously proposed, (this is called a s aspiration level); 2. the new proposal must be the most similar to the offer y proposed by b.

6 This way, on the one hand we maintain our aspiration level, and on the other hand, we maimise the probability of acceptance of our offer as similarity is in many cases correlated with utility. Regarding the aspiration level, we define the iso-curves, which are curves formed by all the proposals with the same utility value for an agent: iso a (θ) = { V a () = θ} From this set of proposals, the agent must now choose one. To find the most similar one we use similarity functions which are based on criteria evaluation functions. These evaluation functions determine how much a given element matches the criteria, i.e. h : D [0, 1]. Thus a similarity function between to values induced by a single criteria h can be defined as: Sim h (, y) = 1 h() h(y). In some cases, multiple criteria can be used to compute the similarity between two values. To aggregate the individual similarities Sim hi a weighted means procedure is employed. Given a domain of values D, a similarity between two values, y D over m criteria is defined as: Sim (, y ) = 1 i m w i (1 h i ( ) h i (y ) ) where 1 i m w i = 1 is the set of weights representing the importance of the criteria functions in the computation of similarity. Finally, the similarity between two contracts and y over the set of decision variables J for agent a is defined as: Sim(, y) = J w a Sim (, y ) Formalising the trade-off tactic, given the proposal offered by agent a, and a subsequent offer y received from agent b, where θ = V a (), agent a makes trade-off the following way: trade-off a (, y) = arg ma z iso a(θ) {Sim(z, y)} The algorithm proposed in 3.2 performs an iterated hill-climbing search in a landscape of possible contracts. The search begins with the last offer received from our opponent and generates a set of N proposals that lie closer to the iso-curve. At the end of each iteration, the most similar contract is selected. The algorithm terminates when the iso-curve is reached after S steps. We can see the algorithm steps in Figure 2. 4 Contributions Net we eplain the contributions of this work. First we detail the modification the trade-off algorithm and then we eplain the meta strategy proposed.

7 y iso-curve X E step 2 step 3 step 1 Fig. 2. Schema of the trade-off algorithm with N=3 and S= Modification of the trade-off algorithm We first eplain in more detail the steps performed by the algorithm used in 3.2 to compute new proposals. Given the offer y proposed by agent b in time t i and the previous proposal offered by agent a in time t i 1 with V a (y) < V a (), the algorithm must compute a new proposal to offer in time t i+1, where V a ( ) = V a (). The idea is to increase the utility of the proposal y, V a (y), until it achieves the current aspiration level (V a ()) after S steps. For simplicity, from now on we assume a single step (S = 1). As eplained in the beginning of the section, the utility of a proposal is the sum of the issues weighted utilities, V a( ), under negotiation. Thus, if we increase each individual utility, we also increase the whole proposal s utility. First, the new proposal is initialised, = y. Then, the algorithm chooses an issue and increments its utility (either increasing o decreasing its value according to the utility function s monotony). If the aspiration level is not reached yet, V a ( ) < V a (), a second issue is chosen and a new value is computed. The process continues until the iso-curve is reached or no issues are left. In the first case, the algorithm finishes and returns the new proposal. Otherwise, the loop begins again with the first issue. It is easy to see that the order in which each issue is selected affects the final outcome. Given the list of decision variables, the first ones have a higher probability to be modified than the last ones. Thus, in order to propose more satisfactory offers to our opponent, we propose to order the issues according to the opponent s guessed preferences. During human negotiation, it is easy to notice that the most important variables are the ones with less variations between offers. For eample, if we are not interested in time delivery, it makes no difference to us to change it as our opponent demands it. But, in the case of the price issue, it is important to us to try to keep it as stable as possible with small variations. Then, from our opponent s contracts history we can deduce somehow its own preferences. With this information we can propose more satisfactory offers variating first the values of those issues that are not so important to our opponent. If we order the decision variables following our opponent s preferences (first those less preferred), the algorithm will begin modifying

8 those that are not so important. In the best case, if the new proposal already achieves the current aspiration level, no more changes will be needed and the most preferred issues may maintain their original values. In the worst case, all issues will be modified. But even in this case, the utility gain needed to achieve the current level will be lower when the most preferred issues are computed. As a summary, we use the similarity approach presented in [2] but using as much as possible the knowledge about our opponents preferences. We bias the eploration in the similarity landscape. We then define the variability of a decision variable in a window of size m of the contracts history as: f() = m 2 i=0 tn 2i t n 2(i+1) (m 1) ma with ma = ma(d ) min(d ), m > 1 and t the current time. The resulting algorithm includes the computation of the issues variability and the storing of the offers proposed by the opponent. Thus, the main steps are: Algorithm Smart Trade-off 1. Store received proposal y in the contract history 2. For each decision variable i do Compute variability(i) 3. Order the decision variables based on their variability 4. Compute a new offer using the trade-off algorithm 4.2 Meta Strategy First, we review the advantages and disadvantages of both models. On one hand, the negoengine tactic allows us to compute offers considering the remaining time to end the negotiation process and our opponent s behaviour, both important aspects to consider when trying to achieve an agreement. The disadvantage of this model is that every offer proposed is a concession; this means that our aspiration level decreases in every step of the negotiation process. On the other hand, the trade-off algorithm advantage is that it searches all possible offers which maintain our aspiration level. Thus, our utility gain does not decrease during the negotiation until it is deliberately indicated. An eternal mechanism is defined to decrease the current aspiration level to achieve an agreement (otherwise, if we never concede, the chance of achieving an agreement is minimum). Faratin et al. proposed to decrease the aspiration level by a predefined amount whenever a deadlock was detected. The problem is that other aspects, as time, are not taken into account. After reviewing the advantages and drawbacks of the models, we now proceed to describe the meta strategy designed. The main idea is to eploit as much as possible the current aspiration level. If no agreement is reached in a given negotiation step, we reduce our aspiration level epecting to find, in a lower level, a new proposal that

9 satisfies both participants. To manage this behaviour the agent applies a trade-off tactic to maintain the aspiration level until a deadlock is achieved. A deadlock is detected when the last offer proposed by the opponent does not improve the utility of the offer proposed two steps before. Then, the negoengine tactic is used in order to decrease the current aspiration level. Using this strategy ensures us to concede in a more rational way, considering the remaining time to end the negotiation and our opponent s behaviour. Net eample shows the meta strategy behaviour from the initial state until a deadlock situation is detected: t V a () V a (y) t t t t t t t where V a ( ) is agent a s utility function,, agent a s proposals, and y corresponds to agent b s proposals. The initial aspiration level is set to 0.8. At time t 0 agent a proposes an offer. Then, agent b offers a proposal in time t 1 with a utility value of 0.2 for agent a. The process continues until time t 5, where b s utility proposal decreases compared to the proposal received in time t 3. The meta strategy detects the deadlock situation. Thus, in time t 6 agent a computes the new proposal using the negoengine tactic, decreasing the current aspiration level to Algorithm Meta Strategy 1. While deadline is not reached, t ma, or no agreement is found, V a () V a (y), do (a) Given the last offer proposed by agent a, compute θ θ = V a () (b) If no deadlock then propose a new offer using the smart trade-off tactic. else propose a new offer using the negoengine tactic. 2. If the deadline t ma is reached then withdraw and terminate. Else accept the proposal y and terminate. 5 Eperiments In this section we first present the scenario used in our eperimentation. Then the eperiments realized are eplained, and finally we proceed on the analysis of the results obtained. The eperiments involve two players, a and b bargaining over fabric products. The decision variables under negotiation are color, material, price and time delivery. Even

10 though color and material issues are discrete decision variables, to simplify the scenario description we assume that all are modelled as a continuous domain. Regarding the color issue, values are ordered based on color temperature (meaning 0 for etreme cold colors and increasing with the warmth of the color). The same way, material issue is based on wrinkle resistance (0 means no wrinkle resistance at all, increasing as more resistant is the material). D c = [0, 5] D m = [0, 4] D p = [30euros, 70euros] D d = [5days, 15days] The weight vectors representing the agents preferences for each decision variable are fied during the negotiation: W a = [0.35, 0.15, 0.45, 0.05]and W b = [0.10, 0.15, 0.40, 0.35], where each weight corresponds to color, material, price and delivery time issues. Regarding to the evaluation functions we use linear functions: Vc a() = 5 Vc b 5 () = 5 Vm() a = 4 Vm() b = 4 4 Vp a 70 () = V p b 30 () = () = () = V a d 15 5 V b d 15 5 And finally, the similarity functions shared by both agents. Similarity for price and delivery are each based on two criteria: low and high price, h lp and h hp respectively; and fast and slow time delivery, h fd and h sd. The weights are defined as follows: wlp a = 0.8, whp a = 0.2, wa fd = 0.8 and wa sd = 0.2, for agent a; and wb lp = 0.2, wb hp = 0.8, wfd b = 0.2 and wb lp = 0.8, for agent b. 1 > h hp () = 80 [20, 100] 0 < 20 1 < 5 30 h fd () = 25 [5, 30] 0 > 30 1 < h lp () = 80 [20, 100] 0 > > 30 5 h sd () = 25 [5, 30] 0 < 5 Color and material similarity are represented as linear function based on a single criteria: h() = min ma min The eperiments involve a complete negotiation process. An agent makes its first offer and the opponent responds with another one. The interaction continues until an agreement is found or the negotiation time epires. The first offer proposed by the

11 agents is computed with the negoengine tactic and then different combinations of tactics are used to compare the performance of our meta strategy. Thus, we define the net types of agents: NegoTO agent: this agent employs the meta strategy defined on section 4.2. That is, applying the trade-off tactic until it reaches a deadlock, and then making an offer with the negoengine tactic. Random agent: the net strategy to compute the new offer is chosen randomly. For instance: negoengine, trade-off, trade-off, negoengine, trade-off, negoengine, negoengine,... Sequential agent: altering both strategies throughout the negotiation process, one at a time: negoengine, trade-off, negoengine, trade-off, negoengine, trade-off,... TO agent: in this case, the agent only applies the trade-off tactic while the utility of the offer received is higher than the previous one received. Otherwise, the aspiration level is decreased by a fied 0.05 and a new proposal is generated. Nego agent: this agent only uses the negoengine tactic during the negotiation. Regarding the negoengine tactic, we model five types of behaviours: very boulware (B), boulware (b), neutral (n), conceder (c) and very conceder (C). We also combine two tactics: time dependent and relative tit-for-tat. The weights of the Γ matri are γ td = 0.1 and γ tft = 0.9 respectively. Two measures were obtained during the eperimentation: 1. utility product: once an agreement is achieved, the product of the utilities obtained by both participants is computed. 2. utility difference: once an agreement is achieved, the difference of the utilities obtained is computed. The first measure indicates us the oint outcome, while the second one, indicates the distance between both utilities. There is an important relation between these two measures and compromise should be taken in account. Even though a high oint outcome is epected, it is also important that the difference between both utilities is low. For eample, an agent can obtain an utility of 0.75, while the other one, The oint utility would be 0.36, which is quite good. But the difference is also high enough to consider the contract as an unsatisfactory agreement (with an assumption of equal negotiation power). For this reason, we will evaluate the results obtained, not only based on the utility product, but also on the utility difference. We realized 100 bilateral negotiations for every pair of agents to obtain an average of the outcome utility (each eecution will variate as the trade-off algorithm includes a randomness): NegoTO agent vs. NegoTO agent, NegoTO agent vs. Random agent, Sequential agent vs. NegoTO agent, Sequential agent vs. Random, and so on. We also modified the behaviour of every pair, changing from a very conceder one, to a very boulware to study its influence on the final outcome. The negotiation deadline was fied to 40 steps for both agents. The net tables show the averages of the utility outcomes for both agents (V a () corresponds to the utility obtained by the reference agent indicated in the table s caption and V i (), to the utility obtained by the rest of the agents a i ), the utility products and the utility differences obtained with neutral behaviour in all cases.

12 We can clearly see that in general the NegoTO agents improve the negotiations achieving satisfactory agreements for both participants. As epected, negotiations performed by at least one NegoTO agent fulfil the desired properties, i.e. high utilities and low utility differences. As we can see on table 1 (top) the highest utility product (0.360) is obtained between a NegoTO agent and a TO agent, but the utility difference (0.244) also increases. This means that while the NegoTO agent achieves a higher utility, its opponent achieves a lower one. The best equilibrium is found when two NegoTO agents negotiate together (0.350 for the utility product and for the utility difference). Also notice that comparing to the rest of the agents utilities, the NegoTO agent achieves in all cases, the higher one (V a () > V i (), where a is the NegoTO agent, and i, all the rest). agent i V a () V i () NegoTO Random Sequential TO Nego agent i V a () V i () NegoTO Random Sequential TO Nego Table 1. Where refers to the utility product, and, to the utility difference. Left table: utility measures obtained by a NegoTO agent vs. all the rest. Right table: Utility measures obtained by a Sequential agent vs. all the rest. Neutral behaviour in all cases. In table 1 (bottom), where the reference agent is the Sequential agent again we confirm the results shown before. The NegoTO agent achieves the higher utility products and the lower utility differences. While the Sequential agent, ecept for the NegoTO agent, always obtains a higher utility compared to its opponents utilities. This means that competing with other agents, the sequential meta strategy does a quite good performance obtaining advantages among the rest. Regarding the Random agent the best combination is obtained when negotiating against a TO agent. But as depicted on table 2 (top) comparing the utility obtained in the agreement against the NegoTO agent and the Sequential agent, it finalises the negotiation with a lower utility gain. Thus, we can say that it cannot improve the other agents performance. In table 2 (middle) we observe that the TO agent achieves the higher utility product against itself, and also the lower utility difference, but it cannot improve the others final utility gain, ecept when negotiating with a Nego agent. Finally, and also as epected, the Nego agent is the one with the worst performance. This is obvious as it has a concession strategy where it does not look for an improvement. It only tries to achieve an agreement as soon as possible and conceding as much as it can. We can see the results in table 2 (bottom). Figure 3 depicts two eamples of a complete negotiation process. We represent the proposals offered by the buyer (NegoTO agent) in red circles and the ones offered by the seller (agent i ) in blue crosses. The ais represents the utility perceived by the seller, while the y ais, represents the buyer s utility (in the range [0,1]). After analysing the

13 agent i V a () V i () NegoTO Random Sequential TO Nego agent i V a () V i () NegoTO Random Sequential TO Nego agent V a () V i () NegoTO Random Sequential TO Nego Table 2. Where refers to the utility product, and, to the utility difference. Top left table: utility measures obtained by a Random agent vs. all the rest. Top right table: utility measures obtained by a TO agent vs. all the rest. Bottom table: Utility measures obtained by a Nego agent vs. all the rest. Neutral behaviour in all cases. complete process between a NegoTO agent and the rest of the agents, we can observe two situations: in the best case, the NegoTO agent tries to maintain the current utility gain as much as it can. If its opponent always concedes, offering better proposals at each step, the NegoTO agent continues sending proposals computed with the trade-off algorithm as long as the negotiation process lasts. Finally the agreement ends without modifying (or modifying very little) the initial aspiration level. in the worst case, the NegoTO agent behaves similar to a sequential strategy. Suppose we have a very boulware opponent which makes few concessions. In the beginning the NegoTO agent proposes an offer using the trade-off algorithm. Net a deadlock is detected (because the offers received do not improve previous ones), and a new proposal is generated with the negoengine tactic. In the net step it tries again with the trade-off tactic. Then, if a new deadlock occurs, the negoengine tactic is employed one more time. The process is repeated again and again until an agreement is reached or agents withdraw. It is also important to mention that changing the behaviour of the negoengine tactic did not really affect the final outcome. The reason is that the participation of this strategy during the negotiation process is mainly used to finish the eecution on time, and not to improve the final outcome. As more conceder is the behaviour, the faster the agreement is achieved. In situations where the negotiation time is significant to evaluate the performance of the negotiation, the agent s behaviour must be taken into account. If a time cost is introduced into the model, the negoengine parameters would need to be modified in order to reduce the necessary time to reach an agreement, allowing maybe greater concessions in the proposals computed. As the trade-off algorithm does not take

14 into account the time factor the meta strategy should also be modified to switch from one strategy to the other one when time cost increases. More eperiments were realized modifying other parameters as the issues weights, (w i, the domains, Di i, and utilities functions, V. They did not caused a significance variation on the results shown Fig. 3. Negotiation process between two players. On the left, NegoTO vs. NegoTO; and on the right, NegoTO vs. Sequential. 6 Conclusions and Future Work This paper presents the design of meta strategies to combine negotiation tactics. More precisely, a model based on concession functions and a trade-off tactic. As we have seen, the first ones try to achieve an agreement decreasing the epected utility of the final proposal accepted by the participants, while the latter searches a satisfactory proposal maintaining the utility as much as possible. Combining tactics allows agents to better adapt to different situations. In this article we described two simple combinations (random and sequential), and a more accurate one, the meta strategy showed on section 4.2. After the eperiments we could see that the meta strategy developed obtains better results than the other combinations. In the worst case it behaves as a sequential strategy, which also performs quite well. In the best case, it eploits as much as possible the trade-off tactic in order to maintain the current aspiration level. We also presented a mechanism to detect our opponent s preference in order to propose more satisfactory offers. As a consequence, the probability of acceptance is increases. As future work we propose the refinement of the parameters involved in both models. For this purpose we suggest genetic algorithms due to the huge quantity of parameters of the models. It would be also interesting to include other negotiation models, such as argumentation based models [11]. In these models agents respond not only with a new proposal, but also with an argument eplaining why they produced that offer or why they reect the received one. This way, agents can get to know its opponents interests and thus propose them more satisfactory offers. More modifications can be eperimented, as in [13], where fuzzy-logic is used to model the utility function. Also,

15 bilateral negotiation could be transformed into a multilateral negotiation. Instead of performing negotiations between two agents, an agent could negotiate against more than one agent. Finally, including time restrictions to achieve an agreement could be done. In this paper, time influence is only represented during the eecution of the negoengine tactic, where the new proposal is computed based on a time dependent tactic. It would be interesting to introduce a cost function in the meta strategy to consider time as the negotiation process progresses. References 1. P. Faratin, C. Sierra, N.R. Jennings. Negotiation decision functions for autonomous agents. In Robotics and Autonomous Systems 24, pages , P. Faratin, C.Sierra, N.R. Jennings. Using Similarity Criteria to Make Issue Trade-Offs in Automated Negotiations. In Artificial Intelligence 142, pages , S. Fatima, M. Wooldridge, N.R. Jennings. Optimal Negotiation Strategies for Agents with Incomplete Information. In Proc. 8th Int. Workshop on Agent Theories, Architectures and Languages (ATAL), Seattle, USA, pages 53-68, N.R. Jennings, P. Faratin, A.R. Lomuscio, S. Parsons, C. Sierra and M. Wooldrige. Automated Negotiation: Prospects, Methods and Challenges. In International Journal of Group Decision and Negotiation, pages , C. Li, J. Giampapa, K. Sycara. A Review of Research Literature on Bilateral Negotiations. In Tech. report CMU-RI-TR-03-41, Robotics Institute, Carnegie Mellon University, November, C. Li, J. Giampapa, K. Sycara. Bilateral Negotiation Decisions with Uncertain Dynamic Outside Options. In The First IEEE International Workshop on Electronic Contracting, IEEE Computer Society, Los Alamitos, California (USA), pages , July F. Lopes, N. Mamede, A. Q. Novais, H. Coelho. A negotiation model for autonomous computational agents: Formal description and empirical evaluation. In Journal of Intelligent and Fuzzy Systems, 12, , N. Matos, C. Sierra. Evolutionary Computing and Negotiation Agents. In Agent Mediated Electronic Commerce, n pages , C. Mudgal and J. Vassileva. Bilateral Negotiation with Incomplete and Uncertain Information: A Decision-Theoretic Approach Using a Model of the Opponent. Proceedings of the 4th International Workshop on Cooperative Information Agents IV, The Future of Information Agents in Cyberspace, pages , P. Noriega, C. Sierra. Agent-Mediated Integrative Negotiation for Retail Electronic Commerce. In Agent Mediated Electronic Commerce, pages 70-90, Minneapolis, Minnesota, may S. Parsons, C. Sierra and N.R. Jennings. Agents that reason and negotiate by arguing. In Journal of Logic and Computation, pages , T. Sandholm, N. Vulkan. Bargaining with Deadlines. In Proceedings of the National Conference on Artificial Intelligence (AAAI), pages 44-51, Orlando, FL, Frank Teuteberg. Eperimental Evaluation of a Model for Multilateral Negotiation with Fuzzy Preferences on an Agent-based Marketplace. In Electronic Markets, Volume 13(1):21-32, D. Zeng and K. Sycara. Bayesian Learning in Negotiation. International Journal of Human- Computer Studies, 48: , 1998.