Post-Classical Game Theory: Opportunities for IS Researchers

Size: px

Start display at page:

Download "Post-Classical Game Theory: Opportunities for IS Researchers"

Quentin Andrews
5 years ago
Views:

1 1 / 40 Post-Classical Game Theory: Opportunities for IS Researchers Steven O. Kimbrough University of Pennsylvania kimbrough [á] wharton.upenn.edu and Karlsruhe Service Research Institute, KIT CSWIM, 30 June 2012,

2 2 / 40 Outline 1 A Grand Challenge 2 The Making of the Challenge 3 How to Proceed? 4 Example: Oligopoly Markets Bertrand Competition Cournot Competition 5 Discussion 6 End Matter

3 3 / 40 The design, monitoring, and maintenance of social institutions. A grand challenge of our time. (Why is this an IS (IS/ICT/IM) topic?) Goal of this talk: to introduce and motivate a program of research in IS, one that addresses this grand challenge. Note: I am not saying that this is the only way forward. It is one of several, non-competing alternatives. Here, I will draw extensively on results from post-classical game theory.

4 4 / 40 See my book: [Kimbrough, 2012] here will be routed separately from ance to review and make corrections AGENTS, GAMES, AND EVOLUTION AGENTS, GAMES, AND EVOLUTION Strategies at Work and Play Kimbrough K11564 Steven Orla Kimbrough

5 5 / 40 Classical game theory 1 Observe a CSI in the wild. 2 Model the CSI rigorously as a game. 3 Assume ideal rationality predict outcome as equilibrium. 4 Find the equilibri(um/a) of the game. CSI: context of strategic interaction (interdependent decision making). What s not to like? There are more things in heaven and earth, Horatio, Than are dreamt of in your philosophy. (Long story. Includes prediction failures, implausible assumptions, and narrowness of scope. Checkers?)

6 6 / 40 Post-classical game theory 1 Observe a CSI in the wild. 2 Model the CSI rigorously as a game. NB. Model may be specified by rules or by procedures. 3 Undertake a strategy acquisition process. (Or observe it.) Exogenously: Discover a satisfactory consideration set of strategies/policies of play. Endogenously: Specify a learning / procedural regime whereby the agents (players) acquire strategies/policies of play. 4 Find the behavior of the resulting system. What happens when the players/agents with their acquired strategies play in the model? (Note: Address both strategy selection questions and institutional design questions.)

7 7 / 40 On the demand side... New technologies and globalization New possibilities for kinds of markets and other institutions. Some requirements: quickly and easily formed institutions, not supported by advertising. Example: Social Clouds (explored at KSRI). Increasing complexity in existing institutions, evisceration or capture of governments, increasing size and potential for extraordinary returns Urgent need to monitor and maintain institutions, to prevent their capture by special interests and their becoming extractive [Acemoglu and Robinson, 2012]. Examples: California electricity deregulation. Financial services, banking?

8 8 / 40 On the theory side: severe problems Thinking of the economy as a thermodynamic machine (an energy-driven system) is out of the mainstream. Also insufficiently factored into current thinking about design, monitoring and management of institutions: Externalities. Such as air pollution. Common pool resource problems. Such as protecting the atmosphere, preserving a fishery, maintaining social capital. Environmental sustainability. Monitoring and maintaining the well-being of environmental resources. The account of rationality assumed by theory is not viable. With full rationality and full knowledge. Even the assumption of equilibrium in models (in both economics and in game theory) is problematic.

9 9 / 40 Just looking at equilibrium Many challenges, such as time to reach it, too many equilibria. More fundamentally: Unless a given game has a self-evident way to play, self-evident to the participants, the notion of a Nash equilibrium has no particular claim upon our attention. [Kreps, 1990, page 31] And even more fundamentally...

10 It is simply not rational to play the equilibrium strategy ,15 12,17 10,15 8,13 6,11 4,9 4 17,12 13,13 10,15 8,13 6,11 4,9 3 15,10 15,10 11,11 8,13 6,11 4,9 2 13,8 13,8 13,8 9,9 6,11 4,9 1 11,6 11,6 11,6 11,6 7,7 4,9 0 9,4 9,4 9,4 9,4 9,4 5,5 Table: A cascading Prisoner s Dilemma in strategic form. Five rounds of Axelrod s stage game. Variations of GRIM TRIGGER. [Kimbrough, 2012, page 414]. If i plays n, then i wants to play n 1, n > / 40

11 11 / 40 How to Proceed? And what is the role for IS? Broadly, philosophical position is Pragmatism. Think: If you can t make one, then you don t know how it works. So, build realistic models (not merely stylized models), see how they track reality. Principles of simplicity and minimalism. Simple models, minimal rationality, then elaborate as appropriate. Claim: Real progress will require procedural, computationally feasible models, tested by real data (field, experimental). ICT? Most institutions of import will be non-trivially mediated by ICT systems. Required: design, monitoring, adjustment, transparency. Needed: Substantial social immune systems. Think: beyond autonomic computing.

12 12 / 40 Can you be a little more specific? A good model is Braitenberg s Vehicles

13 13 / 40 Begin simply, study in depth, elaborate. Cycle

14 14 / 40 Begin simply, study in depth, elaborate. Cycle

15 15 / 40 Begin simply, study in depth, elaborate. Cycle

16 16 / 40 After multiple cycles Complex systems, emergent properties. We know how it works because we built it. Substantial biological verisimilitude demonstrated in the book. (Not merely stylized models.) Much detailed knowledge gained about the model systems. Doors opened for a program of research. OK, what about an example involving an institution?

17 Bertrand Competition Example: Competition on price Each period all firms offer a price and the market takes all demand from the low-price firm. Economics theory: collusion is impossible. Even with just two firms in the market they will compete away their profits. If firm 1 really believes that firm 2 will charge a price ˆp that is greater than the marginal cost, it will always pay firm 1 to cut its price to ˆp ε. But firm 2 can reason the same way! Thus any price higher than marginal cost cannot be an equilibrium; the only equilibrium is the competitive equilibrium. [Varian, 2003, page 488] Note the business literature on this: Don t do it! 17 / 40

18 18 / 40 Bertrand Competition PROBE AND ADJUST [Kimbrough, 2012] A kind of reinforcement learning for a continuous quantity. Episode (round of play). Epoch (a number of episodes). Probe uniformly ±δ anchor value in each episode. Adjust anchor value ±ε at the end of each epoch. +δ δ +ε ε Anchor value Record rewards per episode (up, down) and adjust according to update policy.

19 Bertrand Competition Bertrand (price) competition with PROBE AND ADJUST Both firms using the update policy of Own Returns. Replicates standard theory. 19 / 40

20 Bertrand Competition Bertrand (price) competition with PROBE AND ADJUST Both firms using the update policy of MR-COR. Contradicts standard theory. 20 / 40

21 Bertrand Competition Comments These findings are robust to starting positions, costs to firms, etc. What does matter is the number of firms in the market. Reverts to competitive market after a tipping point, affected by the number of firms, the patience of the firms, and their epoch lengths. See [Kimbrough, 2012, Kimbrough and Murphy, 2009] for detailed discussions, including pseudo-code. See AGEbook/nlogo/OligopolyBidPrice.html for the NetLogo program. This simple model explains tacit collusion and its loss with increasing numbers of firms, etc. It also counsels patience. Think: executive compensation. 21 / 40

22 Cournot Competition Cournot (quantity) competition The other main theoretical model of oligopoly. Quantity competition: Each period firms offer quantities of a good and the market sets the price. Each firm receives a reward that is the product of the quantity it put to the market and the realized market price. 22 / 40

23 Cournot Competition Cournot reference model Roughly: A market for a particular product supplied by n firms. During each time step each of the supplying firms offers quantity Q i (i = 1, 2,..., n) to the market, so that the total supply in a given period is Q = n Q i (1) i=1 The unit price resulting is determined by the demand function P = max{a slope Q, 0} (2) Each firm i receives revenues of P Q i. 23 / 40

24 24 / 40 Cournot Competition Cournot reference model (con t.) Firms may independently and without communication with each other adjust the quantities they offer to the market, their Q i s. In setting their Q i s each firm takes into account its unit cost of production, k i, and the behavior of the other firms. Each firm follows the best response strategy If all of the firms do this they will reach the Cournot equilibrium in which the individual firm Cournot quantities are Qi C (a k i ) (n, k i ) = (3) (n + 1) slope

25 Cournot Competition Is there another way of modeling this? The Cournot conclusion follows mathematically provided you make the best response assumption. But why should you? Behaviorally implausible. What if the agents follow PROBE AND ADJUST in learning to set their quantities? See [Kimbrough and Murphy, 2009], Learning to Collude Tacitly on Production Levels by Oligopolistic Agents or Chapter 10 of Agents, Games, Evolution. 25 / 40

26 Cournot Competition Quantity setting with PROBE AND ADJUST Agents collectively reach the Cournot quantity. That is, they individually and collectively put to the market the total quantity that is predicted by the Cournot model. Without the implausible Cournot assumptions! And under a plausible behavioral procedure. Also observed: number effects consistent with behavioral experiments. 26 / 40

27 27 / 40 Cournot Competition Cournot (quantity) competition with PROBE AND ADJUST Both firms update with Own Returns and settle near the Cournot equilibrium. Replicates the standard theory. Robust generally and specifically to the number of firms.

Cournot Competition Cournot competition with PROBE AND ADJUST Both firms update with Market Returns and settle near the monopoly equilibrium.

28 Cournot Competition Cournot competition with PROBE AND ADJUST Both firms update with Market Returns and settle near the monopoly equilibrium. Contradicts the standard theory. Robust generally and specifically to the number of firms. Market Returns update policy is highly exploitable. 28 / 40

29 Cournot Competition Cournot competition with PROBE AND ADJUST Both firms update with MR-COR and settle near the monopoly equilibrium. Contradicts the standard theory. Robust generally and specifically to the number of firms. 29 / 40

30 Cournot Competition Cournot competition with PROBE AND ADJUST Firm 0 updates with MR-COR, firm 1 with Own Returns. They settle near the Cournot equilibrium. Beyond the scope of the standard theory. 30 / 40

31 Cournot Competition Comments Firm 1 does slightly worse than firm 0. Firm 0 would make both firms better off by switching to MR-COR. = MR-COR is not exploitable. Robust generally and specifically to the number of firms. [Kimbrough, 2012, Kimbrough and Murphy, 2009]. Similar results for supply curve bidding with step functions (electricity markets). [Kimbrough, Murphy, working paper]. Worse, theoretical results: With reaction consistency, Cournot = Bertrand [Kimbrough, Murphy, Smeers working paper]. 31 / 40

32 Cournot Competition Conclusions on oligopoly results Standard oligopoly theory is seriously deficient. The results on display here 1 Give a unified contradiction of oligopoly theory, 2 Provide a credible explanation for why tacit collusion and market power are possible, and 3 Explain the differences between price and quantity competition without having to assume reaction inconsistency in one model and not the other. Even very simple markets need new approaches to investigation. 32 / 40

33 33 / 40 Vehicle 1 Think of the material here on oligopoly markets and PROBE AND ADJUST as analogous to Braitenberg s Vehicle 1. How should we build Vehicle 2?

34 What do we want to know about our markets? And more generally, our social institutions? Can market/institutional power be realized? If so, how? (See above on oligopoly.) What are the social welfare characteristics of the institution? Fairness? Productivity?... Pareto efficiency is hardly sufficient. Stability? Robustness? Resilience? Reconfigurability? Autonomic potential? Credible alternatives and their properties? Externalities? Positive? Negative? Can they be endogenized? If so, how? What are the epistemic burdens placed on participants? How can they be reduced and with what consequences? 34 / 40

35 35 / 40 What do we want to know about our institutions? (continued) Privacy? Liquidity? What needs to be monitored and why? (Both inside and outside of the institution.) How can we monitor what needs to be monitored? How will real people behave with a given institution and what will its resulting behavior be? How can we find effective strategies for acting within a given institution?...

36 36 / 40 Points arising 1 We can easily make the list of interesting questions much longer. 2 Just articulating the list in depth is an important research challenge. 3 While analytic, closed form results are always welcome, we should use whatever methods yield tangible results for designing, monitoring, and maintaining social institutions. Real institutions, not merely stylized models of them. This will surely involve procedural modeling and simulation (and much else). Game theory, economics, and institutional design must be seen as branches of empirical science, not as branches of applied mathematics.

37 37 / 40 And IS? The role of ICT looms large in this grand project. The field of IS is potentially, but not inevitably, a major player. Systems analysis with a higher calling. Look beyond the system to the entire institution.

38 38 / 40 Acemoglu, D. and Robinson, J. A. (2012). Why Nations Fail: The Origins of Power, Prosperity, and Poverty. Crown Publishers, New York, NY. Kimbrough, S. O. (2012). Agents, Games, and Evolution: Strategies at Work and Play. CRC Press, Boca Raton, FL. Kimbrough, S. O. and Murphy, F. H. (2009). Learning to collude tacitly on production levels by oligopolistic agents. Computational Economics, 33(1): and sokpapers/2009/oligopoly-panda-r2.pdf.

39 39 / 40 Kreps, D. M. (1990). Game Theory and Economic Modeling. Clarendon Press, Oxford, England. Varian, H. R. (2003). Intermediate Microeconomics: A Modern Approach. W. W. Norton & Company, New York, NY, sixth edition.

40 40 / 40 $Id: CSWIM-beamer.tex :21:24Z sok $