Study notes on noncooperative game theory, focusing on equilibrium states in two-player finite games. Topics include the battle of the sexes, the war game, individual dominance, and Nash and perfect equilibria in pure and mixed strategies. The notes also discuss strategies, elementary games, and the role of beliefs and preferences in repeated games.
Bernard WALLISER, Ecole Nationale des Ponts et Chaussées, Ecole des Hautes Etudes en Sciences Sociales
The prehistory of game theory is relatively short, devoted to an algorithm for the resolution of extensive-form games (Zermelo) and an equilibrium notion for zero-sum normal-form games (Borel, von Neumann). The theory appeared in an already elaborated form in the pioneering work “Theory of Games and Economic Behavior” (1944), issued from the collaboration between the mathematician von Neumann and the economist Morgenstern. It saw a first period of development during the 1950s, when its main equilibrium concept, the Nash equilibrium, was introduced (Nash). It then fell into neglect for a time, before enjoying a second burst of life in the 1970s with the explicit integration of time (Selten) and uncertainty (Harsanyi) into the equilibrium notions. It enjoyed a further boost in the 1990s, with the explicit internalization of players’ beliefs (Aumann) and the advent of evolutionary game theory.

The central purpose of game theory is to study the strategic relations between supposedly rational players. It thus explores the social structures within which the consequences of a player’s action depend, in a way the player is conscious of, on the actions of the other players. To do so, it takes as its basis the rational model of individual decision, although current work is increasingly focused on the limited rationality of the players. It studies direct multilateral relations between players, not mediated by prior institutions. Game theory is generally divided into two branches, although there are bridges that connect the two. Noncooperative game theory studies the equilibrium states that can result from the autonomous behavior of players unable to define irrevocable contracts. Cooperative game theory studies the results of games governed by both individual and collective criteria of rationality, which may be imposed by an agent at some superior level.
Game theory’s natural field of application is economic theory: the economic system is seen as a huge game between producers and consumers, who transact through the intermediation of the market. It can be more specifically applied to situations outside the realm of perfectly competitive markets, i.e. situations in which the agents acquire some power over the fixing of prices (imperfect competition, auction mechanisms, wage negotiations). It can be applied equally well to relations between the state and agents, or to relations between two states. Nonetheless, it is situated at a level of generality above that of economic theory, for it considers non-specialized, though heterogeneous, agents performing actions of one nature or another within an institution-free context. It can therefore be considered a general matrix for the social sciences and be applied to social relations as studied in political science, military strategy, sociology, or even relations between animals in biology.

In this paper, we shall restrict ourselves to the study of noncooperative game theory. This is the prototype of a formalized social science theory which, though making enormous use of diverse mathematical tools, transmits very simple, even simplistic, literary messages. From a syntactic point of view, it provides explanations of some social phenomena on the basis of a small number of concepts and mechanisms, thus endowing these phenomena with a duly signposted domain of validity. From a semantic point of view, it is the subject of laboratory experimentation, although this is aimed more at testing the consequences of existing theories than at inducing original regularities from the results. From a pragmatic point of view, it provides a unifying language for parties faced with decision-making, helping them better to express their shared problems, but supplying few instructions able to help them solve these problems.

In what follows, we shall only consider two-player games, for the sake of simplicity (thus avoiding the question of coalitions between players). The games will be examined not only from the point of view of the modelizer overlooking them but also from the points of view of the players themselves. Each player is characterized by three “choice determinants”: his opportunities (sets of possible actions), his beliefs (representations of the environment) and his preferences (value judgments on the effects of actions). Interaction between the players is liable to lead to an “equilibrium state”, defined, as it is in mechanics, as a situation that remains stable in the absence of perturbations from the environment.
We shall start with the simplest forms of game and make the model more complex through the gradual introduction of time and uncertainty. In each section, we shall present behavioral assumptions illustrated by simple examples, and then deal with the equilibrium concepts introduced, together with their properties.
We assume that it is possible to isolate a game clearly delimited from the complex situations in which the players are immersed and particularly from the parallel or sequential games in which they may participate. In other words, the description of the game must be self-sufficient in the sense that no external consideration can be introduced subsequently to account for the game. The choice determinants for player i (internalizing the rules of the game) are expressed in the following manner:
the added value of 2 to the utility of being together. The “war game” is a zero-sum game in which two opposing armies, the attacker (player 1) and the defender (player 2), can position their troops in either place A or place B. If the troops are in the same place, the defender wins (utility 1 for the defender, 0 for the attacker); if they are in different places, the attacker wins (utility 1 for the attacker, 0 for the defender).

Battle of the sexes:
1/2      ballet   boxing
boxing   (1,1)    (3,2)
ballet   (2,3)    (0,0)

War game:
1/2      in A     in B
in A     (0,1)    (1,0)
in B     (1,0)    (0,1)

A representative infinite game is the “duopoly game”, which opposes two firms in imperfect competition on a market for one type of good. Each firm i can freely fix its level of production qi and seeks to maximize its profit, which depends on the price p of the good and on the (constant) marginal cost ci: Πi = p qi − ci qi = qi(p − ci). The price is determined by balancing the supply q = q1 + q2 = O(p) and the demand q = D(p). The quantity demanded is assumed to be a decreasing linear function of the price if p is below a certain threshold and zero if the price is above this threshold. The inverse demand p = D⁻¹(q) is such that the price is undetermined (above a certain threshold) if the quantity is zero, and is then a decreasing linear function of the quantity (until it reaches zero). In the useful zone, therefore, we have: p = a − b(q1 + q2). The profit of firm i can thus be written: Πi = qi(a − b(q1 + q2) − ci). This game cannot be represented by a matrix. However, it is still possible to consider matrices with specific values of the quantities produced.
The choice of a player revolves around his “strategies” si. A “pure strategy” is nothing other than an action: si = ai. A “mixed strategy” is a probability distribution on the actions: si = σi, where σi(ai) is the probability allocated to the action ai. The actions to which a strictly positive probability is assigned are called the support of the mixed strategy. It is assumed, provisionally, that a player implements a mixed strategy by drawing an action at random from the corresponding probability distribution. From a mixed strategy he gains the expected utility over all outcomes, evaluated with the (independent) probability distributions of the player and his opponent. While the use of pure strategies requires only that the utilities be ordinal (the utility of a player is defined up to an increasing function), the use of mixed strategies requires that the utilities be cardinal (the utility of a player is defined up to an increasing affine function). It should be noted that the rule of maximization of expected utility, natural in a choice against a nature assumed to be passive, has here been transposed to the choice against a rational and therefore active opponent, thus “naturalizing” the other player.
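As a minimal sketch of this computation (using the battle-of-the-sexes payoffs given above; the helper name is ours), the expected utility of a pair of independent mixed strategies can be evaluated as follows:

```python
# Expected utility of a mixed-strategy profile in a 2x2 game.
# Payoffs are the battle-of-the-sexes matrix from the text:
# rows = player 1 (boxing, ballet), columns = player 2 (ballet, boxing).
U1 = [[1, 3], [2, 0]]  # player 1's utilities
U2 = [[1, 2], [3, 0]]  # player 2's utilities

def expected_utility(U, sigma1, sigma2):
    """Expected utility under independent mixed strategies sigma1, sigma2."""
    return sum(sigma1[a] * sigma2[b] * U[a][b]
               for a in range(len(sigma1)) for b in range(len(sigma2)))

# Player 1 mixes 50/50; player 2 plays ballet for sure (a pure strategy
# being the degenerate mixed strategy putting probability 1 on one action).
print(expected_utility(U1, [0.5, 0.5], [1.0, 0.0]))  # 0.5*1 + 0.5*2 = 1.5
```

Note that the pure-strategy utilities reappear as the degenerate case where each distribution puts all its mass on one action.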
The possible outcomes of a finite game can be represented by a domain in the players’ utility space (a set of points in the case of pure strategies, an often continuous domain in the case of mixed strategies). These outcomes can be compared with the help of a very simple social criterion. One outcome Pareto-dominates another if it provides both players with at least as great a utility (and a strictly greater one for at least one of the two players). An outcome is Pareto-optimal if it is not Pareto-dominated by any other outcome. There is always at least one Pareto-optimal outcome, and they are generally multiple. In the players’ utility space, the Pareto-optimal outcomes correspond to the points on the north-east frontier of possible outcomes. From the modelizer’s point of view, it appears highly desirable to achieve a Pareto-optimal outcome. But in a noncooperative game, in which the players cannot sign binding agreements, nothing guarantees that such an outcome will actually be achieved through an equilibrium.
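The Pareto comparison just described is easy to mechanize; a minimal sketch, applied to the pure-strategy outcomes of the battle of the sexes:

```python
# Pareto comparison of outcomes in the players' utility space.
# One outcome dominates another if every utility is >= and at least one is >.
def pareto_dominates(u, v):
    return all(a >= b for a, b in zip(u, v)) and any(a > b for a, b in zip(u, v))

def pareto_optimal(outcomes):
    """Keep the outcomes that no other outcome Pareto-dominates."""
    return [u for u in outcomes
            if not any(pareto_dominates(v, u) for v in outcomes)]

# Pure-strategy outcomes of the battle of the sexes from the text:
outcomes = [(1, 1), (3, 2), (2, 3), (0, 0)]
print(pareto_optimal(outcomes))  # [(3, 2), (2, 3)]
```

The two surviving points are exactly the “going to the same show” outcomes, i.e. the north-east frontier of this small utility set.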
The first leading principle is that of individual dominance between the pure strategies of a player. A player’s pure strategy si weakly (strongly) dominates another strategy s′i of the same player if it provides him with a (strictly) greater utility against any of the opponent’s strategies (“against any defense”): Ui(si, sj) ≥ Ui(s′i, sj), ∀sj, with strict inequality in the strong case. A player’s strategy is said to be weakly (strongly) dominant if it weakly (strongly) dominates all other strategies. A strategy is said to be weakly (strongly) dominated if it is weakly (strongly) dominated by another strategy. The concept of individual dominance can easily be transferred from pure strategies to mixed strategies. One can observe, however, that a player’s strategy can be dominated by a mixed strategy of this player without being dominated by any pure strategy. Above all, it should be noted that a player’s dominant strategy is defined without any reference to the precise action of his opponent; the player can therefore calculate his dominant strategy without making any assumptions about the behavior of the other.

A first equilibrium concept is that of “equilibrium in weakly (strongly) dominant strategies”. The associated equilibrium state is defined as an outcome in which each player plays a dominant strategy. Of course, such an equilibrium state may not exist. If it does exist, it is generally unique when dominance is strong but may be multiple when dominance is weak. A weaker concept is that of “sophisticated equilibrium”. It is obtained by considering an iterated process of elimination of dominated strategies. In the first round, the dominated strategies of each player are eliminated; in the second round, in the remaining game, the dominated strategies are once again eliminated; the process stops when there are no more dominated strategies. This process is univocal for the elimination of strongly dominated strategies, but not for weakly dominated strategies.
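The iterated elimination process can be sketched as follows for strongly dominated pure strategies; the prisoner’s-dilemma payoffs used here are the usual ones and are an assumption, since the stage matrix itself is not reproduced in this chunk:

```python
# Iterated elimination of strongly dominated pure strategies in a
# two-player finite game with utility matrices U1[i][j], U2[i][j].
def strongly_dominated(U, rows, cols):
    """Rows of U (restricted to rows x cols) strongly dominated by another row."""
    out = set()
    for r in rows:
        for r2 in rows:
            if r2 != r and all(U[r2][c] > U[r][c] for c in cols):
                out.add(r)
                break
    return out

def iterated_elimination(U1, U2):
    rows = set(range(len(U1)))
    cols = set(range(len(U1[0])))
    while True:
        # Player 2's dominance is checked on the transposed matrix.
        U2t = [[U2[r][c] for r in range(len(U2))] for c in range(len(U2[0]))]
        dr = strongly_dominated(U1, rows, cols)
        dc = strongly_dominated(U2t, cols, rows)
        if not dr and not dc:
            return sorted(rows), sorted(cols)
        rows -= dr
        cols -= dc

# Prisoner's dilemma (assumed payoffs): action 0 = confess, 1 = not confess;
# confessing strongly dominates, so only (confess, confess) survives.
U1 = [[1, 4], [0, 3]]
U2 = [[1, 0], [4, 3]]
print(iterated_elimination(U1, U2))  # ([0], [0])
```

Since strong dominance is eliminated, the result is order-independent, in line with the univocity noted above.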
In this latter case, the order of treatment of the players matters, and if two strategies
(correspondences), each strategy of one player being a best reply to the (expected) strategy of the other. Here again, a weaker concept exists, that of “rationalizable equilibrium”. A rationalizable equilibrium state is defined by the fact that each player plays his best reply to the expected strategy of the other player, which is itself expected to be the best reply to some strategy of the first, and so on, until the circle closes at some level. Nash equilibrium is a particular case of rationalizable equilibrium in which the circle closes at the very first level. It can also be demonstrated that if only one rationalizable equilibrium exists, then it is also the sole Nash equilibrium.

However, introduced in this way, the concept of Nash equilibrium is not constructive: the process by which the players can reach an equilibrium state is not made explicit. Everything takes place as if some kind of “Nashian regulator” existed (along the lines of the “Walrasian auctioneer” of economic theory) who calculates the equilibrium and suggests it to the players (which is not sufficient to establish it). In addition, the concept of equilibrium is incomplete: several equilibrium states may exist, between which the theory cannot choose. In a finite game, pure-strategy Nash equilibria may well not exist, just as they may be multiple. In an infinite game, sufficient conditions for the existence of a pure-strategy Nash equilibrium have been established: the set of possible actions of each player is a compact, convex subset of a finite-dimensional real vector space, and the utility function of each player is continuous in the actions of both players and quasi-concave in the action of this player. In a finite game, there is always at least one mixed-strategy equilibrium (the conditions defined above are then met).
A mixed-strategy Nash equilibrium can be calculated by expressing that all the pure strategies in the support of a player’s equilibrium mixed strategy give the same utility against the opponent’s equilibrium mixed strategy.

In the prisoner’s dilemma, the only Nash equilibrium (in pure or mixed strategies) is the outcome in which both players confess. In the battle of the sexes, two asymmetrical Nash equilibria exist in pure strategies, corresponding to the outcomes in which husband and wife go to the same show (an additional symmetrical equilibrium exists in mixed strategies). What occurs here is a “problem of co-selection” in which one of the two equilibria must be chosen, knowing that the first is more favorable to one player and the second is more favorable to the other player. In the war game, there are no Nash equilibria in pure strategies, which raises a “problem of compatibility”; however, there is a Nash equilibrium in mixed strategies, with each player choosing A and B with equal probability. In the duopoly game, with identical marginal costs for the two firms (ci = c), there is one sole Nash equilibrium in pure strategies, the “Cournot equilibrium”, which coincides with the sophisticated equilibrium, and in which the productions are: q1 = q2 = (a − c)/3b. One can also define a “collusive equilibrium” when, contrary to the founding hypothesis of noncooperative games, the two firms can unite to form a monopoly facing the consumers, in which case the productions are: q1 = q2 = (a − c)/4b.
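The Cournot quantities above can be recovered numerically by iterating the firms’ best replies; the parameter values below are illustrative assumptions:

```python
# Symmetric duopoly with inverse demand p = a - b*(q1 + q2) and common
# marginal cost c. Firm i's best reply maximizes qi*(a - b*(qi + qj) - c),
# giving qi = (a - c - b*qj)/(2b); iterating converges to the Cournot point.
a, b, c = 10.0, 1.0, 1.0  # illustrative parameter values (assumptions)

def best_reply(qj):
    return max(0.0, (a - c - b * qj) / (2 * b))

q1 = q2 = 0.0
for _ in range(100):          # fixed-point iteration on the best replies
    q1, q2 = best_reply(q2), best_reply(q1)

cournot = (a - c) / (3 * b)    # Cournot quantity per firm
collusive = (a - c) / (4 * b)  # each firm's share of the monopoly output
print(round(q1, 6), cournot, collusive)  # 3.0 3.0 2.25
```

The iteration converges geometrically here (each step halves the distance to the fixed point), illustrating how the Cournot equilibrium can emerge from repeated best replies.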
We will now consider that the game develops sequentially over a given horizon, and that a single player moves in each period. The choice determinants of the agents are now more complicated:
The game is finite if it allows a finite number of possible histories, and it has a finite horizon if all histories are of finite length. A finite game can be expressed in “extensive form” (or “developed form”) by a “game tree” (a tree being a connected graph with no cycles):
The “entry game”, also known as the “chain-store paradox”, is a finite game in which an incumbent monopolist (player 2) faces a firm that could enter into the same market (player 1). The potential entrant can decide to enter or not to enter; if he enters, the monopolist can decide to be aggressive or pacific. If the potential entrant decides not to enter, the monopolist obtains a utility of 2 corresponding to the value of the market and the potential
delegate his behavior to a passive entity that mechanically implements the desired action. As for the concept of mixed strategy, it can be introduced in two different forms. A “random strategy” is obtained by considering a probability distribution on pure strategies. A “behavioral strategy” is obtained by considering, at each node where it is the player’s move, a probability distribution on the actions possible at this node. These two concepts coincide if the player has a perfect memory, i.e. if he remembers his past actions perfectly. However, the second concept is better at conserving the time aspect of the game and the decentralization of choices that goes with it.

Thanks to the concept of strategy, every extensive-form finite game can be transformed into a strategic-form game. In a tree, each pair of (pure) strategies of the players leads to a well-defined terminal node and therefore to a precise outcome. The game matrix is thus formed by putting the strategies of the two players into lines and columns and writing the corresponding utilities of the players into each box at their intersection. In the opposite direction, one matrix may come from different trees; but in this case the strategic and extensive forms are nevertheless strategically equivalent. Above all, one cannot find a corresponding tree for every matrix; the reason is that a tree assumes that the players have perfect knowledge of past moves, whereas a matrix allows the players to play simultaneously without knowing the opponent’s move. In the entry game, as each player only plays at a single node, each strategy coincides with an action; the strategic form is illustrated below.
In the centipede game, the second player also only plays once and his strategies therefore coincide with his actions; the first player is liable to play twice, and he possesses four strategies, obtained by combining the choices of stopping or continuing at the first and at the third node; note that a strategy foresees what the player would do at the third node even if he has in fact stopped the game at the first node, in other words if he were somehow to be involuntarily parachuted to the third node. In the ultimatum game, the first player’s strategy is his proposal x for dividing up the cake; the second player’s strategy is to set a threshold such that he will accept the first player’s proposal if x is below the threshold, and reject it otherwise.

Entry game (strategic form):
E/M   A        P
¬E    (0,2)    (0,2)
E     (-1,-1)  (1,1)
In the strategic form associated with the extensive form of the game, the concept of Nash equilibrium can still be defined. A Nash equilibrium obeys the principle of “conditional optimality”: at whatever node a player may be, he never has anything to gain from deviating unilaterally in his local move (the rest of his own strategy and his opponent’s strategy being fixed). However,
this concept has nothing to say about what happens outside the equilibrium path. In particular, the equilibrium may rely on incredible threats (or promises). A threat is a sub-strategy announcing the move that one player would make if he were at a certain node, with the aim of dissuading the other player from arriving at this node; a threat is incredible if the player making it has nothing to gain from carrying it out if the node in question is actually reached. We can eliminate these incredible threats by using a stronger concept than that of Nash equilibrium, one which strengthens the property of conditional optimality for a game with finite horizon: a player has nothing to gain by deviating unilaterally from a local move at any of the nodes of the game, whether or not they are on the equilibrium path.

A state that is a Nash equilibrium in the initial game and in every subgame (a subgame being simply a truncated game that starts in any situation where a player has to move) is called a “subgame-perfect equilibrium” (abbreviated to “perfect equilibrium”, although this latter term exists with a different meaning elsewhere). For a finite game in extensive form, the perfect equilibrium is obtained by a “backward induction” procedure on the game tree: at the nodes preceding the terminal nodes, the players concerned choose their best actions; at the antepenultimate nodes, the players concerned choose their best actions, taking into account the actions to be chosen subsequently; and so on until arriving at the initial node. For the finite game expressed in strategic form, the perfect equilibrium is obtained by a corresponding process of sequential elimination of weakly dominated strategies on the game matrix: if a strategy is weakly dominated by another without being equivalent to it, it is eliminated; if two strategies are equivalent, we eliminate the one that contains actions present in the strategies that could previously be eliminated directly.
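The backward induction procedure can be sketched on the entry game (payoffs taken from its strategic form given earlier); the tree encoding is ours:

```python
# Backward induction on a finite game tree in extensive form.
# A node is either ("leaf", (u1, u2)) or ("node", player, {action: child}).
# Players are 0-indexed here: 0 = potential entrant, 1 = monopolist.
def backward_induction(node):
    """Return (utilities, path of actions) of the perfect equilibrium."""
    if node[0] == "leaf":
        return node[1], []
    _, player, children = node
    best = None
    for action, child in children.items():
        utils, path = backward_induction(child)
        # The player moving at this node picks the action maximizing
        # his own component of the continuation utilities.
        if best is None or utils[player] > best[0][player]:
            best = (utils, [action] + path)
    return best

# The entry game: the entrant enters or not; on entry, the monopolist
# is aggressive or pacific (payoffs from the strategic form in the text).
entry_game = ("node", 0, {
    "not enter": ("leaf", (0, 2)),
    "enter": ("node", 1, {
        "aggressive": ("leaf", (-1, -1)),
        "pacific": ("leaf", (1, 1)),
    }),
})
print(backward_induction(entry_game))  # ((1, 1), ['enter', 'pacific'])
```

The recursion solves the monopolist’s node first (pacific beats aggressive once entry has occurred), then the entrant’s node, which is exactly the verbal argument developed for the entry game below.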
Obviously, all perfect equilibrium states are Nash equilibrium states, which leads us to qualify the concept of perfect equilibrium as a “refinement” of the concept of Nash equilibrium. It is in fact a Nash equilibrium that is robust with regard to “trembles” which appear during the implementation of actions. In a finite game, as the process of backward induction is constructive, there is always a perfect equilibrium. Moreover, this equilibrium is unique when the game is generic, in other words when the same player never obtains the same utility at two different terminal nodes (or if one player obtains the same utility, the other obtains the same as well). Of course, this does not mean that all games have a Nash equilibrium in pure strategies: the strategic forms obtained from extensive forms are very particular and, conversely, not all strategic forms can be translated into extensive forms (without uncertainty). In an infinite game, on the other hand, there may be a multiplicity of perfect equilibria.

In the entry game, there are two Nash equilibria. The first is the perfect equilibrium obtained by backward induction: in the second period, if the potential entrant has entered the market, it is in the monopolist’s interest to be pacific; in the first period, knowing this, it is in the potential entrant’s
It can be shown that all outcomes of dynamic Nash equilibria are not only feasible but also individually rational. The aim of the folk theorems is to examine whether this property holds true in the other direction. They are largely based on the threats (and promises) made by a player concerning the consequences for the other player of deviating from the equilibrium path. On the one hand, for repeated games with infinite horizon, the number of Nash or even of perfect equilibria is considerably multiplied in comparison with the basic game. It can be demonstrated that, for a sufficiently large discount factor, all feasible and admissible outcomes of the game can be obtained (up to ε) by a dynamic Nash equilibrium (exactly, when δ = 1). A similar result can be obtained for perfect equilibria. On the other hand, for repeated games with finite horizon, the number of Nash or perfect equilibria is drastically reduced. If all Nash equilibria of the basic game provide exactly the individually rational utility and no more, only the repetition of such a basic equilibrium is a perfect equilibrium of the repeated game. Likewise, if one sole Nash equilibrium exists, the only dynamic perfect equilibrium is the repetition of this Nash equilibrium. However, if the basic game has several Nash equilibria that are not strictly individually rational, the number of possible equilibria in the repeated game is once again greatly multiplied. Thus, under various different assumptions, any feasible and individually rational outcome is once again obtained (up to ε) by a dynamic Nash equilibrium or a perfect equilibrium, if the game horizon is sufficiently long. In the case of the repeated prisoner’s dilemma, if the game horizon is infinite, all feasible outcomes that give each player a minimum utility of 1 can be obtained by a dynamic Nash or perfect equilibrium, if the discount factor δ is high enough.
In particular, it is possible to obtain the symmetrical Pareto-optimal outcome if δ > 1/2, the players refusing to confess in each period of the game. The dynamic Nash equilibrium is supported by the threat of the “grim” strategy, which consists in not confessing in the first round, then continuing not to confess as long as the other player does not confess, but switching to unforgiving, constant confession if the other player should confess just once. The perfect equilibrium is supported by the threat of the “tit for tat” strategy, which consists in not confessing in the first round, then confessing if the other player confessed in the previous round and not confessing if he did not. If the game horizon is finite, only the Nash outcome of the repeated basic game can be obtained, with the two players confessing in each round. For in the last round, it is in the interest of both players to confess; in the penultimate round, the same is true, as the action chosen here will have no effect on the other player’s action in the last round; in the antepenultimate round, the same is therefore true, and so on.
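The δ > 1/2 threshold can be checked by comparing discounted payoffs under the grim strategy; the stage payoffs below (cooperation 3, temptation 5, punishment 1) are assumptions consistent with that threshold, since the stage matrix is not reproduced in this chunk:

```python
# Grim-trigger sustainability check in the repeated prisoner's dilemma.
# Assumed stage payoffs: mutual non-confession 3, mutual confession 1,
# unilateral deviation (confessing against a cooperator) 5.
COOPERATE, DEVIATE_ONCE, PUNISHMENT = 3.0, 5.0, 1.0

def cooperation_sustainable(delta):
    """Deviating gains 5 today but triggers punishment 1 forever after."""
    stay = COOPERATE / (1 - delta)                          # cooperate forever
    deviate = DEVIATE_ONCE + delta * PUNISHMENT / (1 - delta)
    return stay >= deviate

print(cooperation_sustainable(0.6))   # True
print(cooperation_sustainable(0.4))   # False
```

Solving stay ≥ deviate for these payoffs gives 3 ≥ 5 − 4δ, i.e. δ ≥ 1/2, matching the threshold in the text.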
Let us now return to the situation of a static game in which the two players make simultaneous moves. However, assume that for each player, the choice determinants of the other player are partially unknown; the player is then confronted with “structural uncertainty” or “incomplete information”. On the other hand, each player is assumed to know the determinants of his own choice; the players are then said to be in a situation of “asymmetric information”. A player’s uncertainty can firstly concern the opportunities of the other player, if the latter’s means of action are not well known. But this can be reduced to uncertainty about preferences, by treating an action that is not available as an available action with a utility equal to −∞. A player’s uncertainty then concerns the preferences of the other player, if the latter’s utility function is not well known. This in turn can be reduced to uncertainty about beliefs, insofar as each player acts in accordance with a belief about his own preferences. Finally, uncertainty concerns the beliefs of the other player, and in particular his beliefs about the structure of the game (essentially the players’ choice determinants).

To begin with, the structural uncertainty of player i is “parametrized” by assuming that he considers his opponent to be of a certain type tj, drawn from a set Tj of possible types. The type of the opponent thus summarizes the choice determinants that guide his decisions. A difficulty arises in the case of beliefs, for each player has primary uncertainty about the physical elements of the system. The other player then has secondary uncertainty about the uncertainty of his opponent, which involves the players in a system of crossed beliefs about their respective uncertainties. The type is then nothing other than the hierarchy of crossed beliefs of a player concerning the physical elements of the game, and this hierarchy can be constructed in a coherent way.
The set of belief hierarchies of all the players constitutes a “possible world” of a psycho-physical nature. Within a hierarchy, each player is generally expected to know his own type. It is moreover assumed that each player attributes probabilities to the other player’s type, i.e. defines a probability distribution on Tj. The belief hierarchy is then formed of a probability distribution on the physical states of the system, a probability distribution on the other player’s probability distribution, and so on. The probabilistic belief hierarchies are “coherent” when a prior probability distribution exists on the whole set of types and is shared by all the players; it is denoted p(ti, tj). Such a prior probability distribution is justified by the “Harsanyi doctrine”, according to which the players’ current beliefs only differ in relation to the private messages received by them, but are founded on initial common beliefs. Using this common probability, each player calculates the probability of the other’s type, conditionally on his own (known) type: pi(tj | ti) = p(ti, tj)/p(ti). The
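The conditioning formula can be sketched directly; the prior below is a hypothetical example:

```python
# Conditional beliefs from a common prior p(ti, tj), as in the Harsanyi
# doctrine: player i conditions the prior on his own (known) type.
# Hypothetical prior over 2 x 2 types (first component = player i's type):
prior = {("t1", "s1"): 0.4, ("t1", "s2"): 0.2,
         ("t2", "s1"): 0.1, ("t2", "s2"): 0.3}

def belief(ti):
    """p_i(tj | ti) = p(ti, tj) / p(ti), with p(ti) the marginal on ti."""
    marginal = sum(p for (a, _), p in prior.items() if a == ti)
    return {b: p / marginal for (a, b), p in prior.items() if a == ti}

print(belief("t1"))  # ≈ {'s1': 2/3, 's2': 1/3}
```

Note how the two players generally end up with different conditional beliefs even though they share the same prior, since they condition on different private information.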
The two matrices of the lemon game, for a good-quality car and for a bad-quality car respectively:

1/2        buy      not buy
sell       (-2,6)   (0,0)
not sell   (0,0)    (0,0)

1/2        buy      not buy
sell       (4,-4)   (0,0)
not sell   (0,0)    (0,0)

In the duopoly, each firm faces uncertainty concerning different elements of the game structure. The uncertainty may concern, for example, the global demand function; if the latter is linear of the form a − bq, each firm can be uncertain about the parameters a or b. More often, the uncertainty concerns the production cost function of the other firm; if scale returns are constant, this uncertainty bears on the unit cost ci of the other firm (each firm knows its own unit cost). In a standard configuration, the first firm is considered to have a marginal cost c1 that is known by both firms; the second firm has a marginal cost that can take a low value c2′ or a high value c2″ for the first firm, but which is known to the second.
The choices of the players still bear on strategies, but in an extended sense. A strategy of player i is none other than an action of this player conditional on his type: si(ti). As each player knows his own type, the strategy determines univocally the action that he will take. For his opponent, on the other hand, the strategy determines the action the player would take according to the type he is assumed to endorse. The utility function of player i thus depends on his type (and on the other’s type) in two ways: firstly through his strategy and secondly directly: Ui(si(ti), sj(tj), ti, tj). Of course, one could consider a player implementing mixed strategies, but this case is rarely envisaged. A “conjecture” is still one player’s expectation of the other’s strategy. In a Bayesian direct-revelation game, the strategy of a player describes the type announced by the player in relation to his real type.

In the battle of the sexes, there are four conditional strategies for each player, which appear more or less natural: always go to the boxing, always go to the ballet, go to one’s preferred show if one is egoistic and to the other if one is altruistic, go to one’s preferred show if one is altruistic and to the other if one is egoistic. In the lemon game, there are two conditional strategies for the buyer (to buy or not to buy) and four conditional strategies for the seller (always sell, never sell, sell only if the quality is good, sell only if the quality is bad). In the duopoly game, the conditional strategy of the first firm is embodied in its level of production q1, and the conditional strategy of the second firm consists in producing q2′ if its cost is low and q2″ if its cost is high.
The equilibrium concept introduced here is that of “Bayesian equilibrium”. In fact, this is simply the concept of Nash equilibrium applied to the game
extended to type-conditional strategies. At equilibrium, the conditional strategy of a player is the best reply to the conditional strategy of the other player. Finite games (with finite sets of types) always have a Bayesian equilibrium, possibly in mixed strategies. A Bayesian direct-revelation game is said to be “revealing” if the Bayesian equilibrium associated with it is such that each player truthfully announces his real type: si*(ti) = ti. The “revelation principle” asserts that with every Bayesian game, it is possible to associate a revealing Bayesian direct-revelation game (with adapted utility functions) whose Bayesian equilibrium leads to the same outcome as that of the primitive game. Therefore, if one is simply interested in utilities (and not strategies), one can restrict attention to revealing direct-revelation games.

In the extended battle of the sexes, one can firstly observe that an egoistic player has a dominant action, namely to go to his or her preferred show. The only conditional strategies that subsist are therefore the “uniform strategy”, by which a player always goes to his or her preferred show, and the “alternate strategy”, by which a player goes to his or her preferred show if he or she is egoistic and to the preferred show of his or her partner if he or she is altruistic. The first strategy cannot lead to a Bayesian equilibrium because, for an altruist, the best reply to the fact that the other player is going to his preferred show is to follow him or her. The second strategy, on the other hand, leads to a symmetric Bayesian equilibrium, but only if p ≤ 1/4. In this equilibrium, two altruistic players end up, paradoxically, at each other’s preferred shows, two egoistic players each go to their own preferred show, whereas if one player is egoistic and the other altruistic, the egoist goes to his or her preferred show, accompanied by the other player. This equilibrium is revealing, as the action of each player enables one to deduce his or her type.
In the lemon game, the uncertain buyer implicitly considers a matrix which is the average of the two matrices with known types, shown below. In this matrix, the equilibrium is obtained when the seller sells and the buyer buys. It is non-revealing, as the exchange takes place in all cases. In the case of complete information, on the other hand, there is no exchange at the fixed price: the buyer only buys if the car is of good quality, in which case the seller does not sell; the seller only sells when the car is of bad quality, in which case the buyer does not buy.

   1/2        buy      not buy
   sell      (1,1)     (0,0)
   not sell  (0,0)     (0,0)

In the duopoly game, the calculation of a Bayesian equilibrium is more complicated. It ends up with the following quantities produced:
q″₂ = [a + c₁ − (7c″₂ + c′₂)/4]/3b
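This quantity can be checked numerically. The sketch below assumes the standard linear inverse demand p = a − b(q₁ + q₂) with firm 2's cost equal to c′₂ or c″₂ with probability 1/2 each (known only to firm 2); the parameter values are purely illustrative, not from the text.

```python
# Bayesian Cournot duopoly: firm 1's cost c1 is common knowledge; firm 2's
# cost is c2_lo or c2_hi with probability 1/2 each, known only to firm 2.
# Linear inverse demand p = a - b*(q1 + q2); illustrative parameter values.
a, b = 10.0, 1.0
c1, c2_lo, c2_hi = 2.0, 1.0, 3.0
cbar = (c2_lo + c2_hi) / 2          # firm 2's expected cost

# Closed-form Bayesian equilibrium quantities (the q2 formulas follow
# the expression in the text, applied to each realized cost)
q1 = (a - 2*c1 + cbar) / (3*b)
q2_lo = (a + c1 - (7*c2_lo + c2_hi)/4) / (3*b)
q2_hi = (a + c1 - (7*c2_hi + c2_lo)/4) / (3*b)

def br2(q1, c):
    # firm 2's best reply given its realized cost and firm 1's quantity
    return (a - b*q1 - c) / (2*b)

def br1(q2_exp):
    # firm 1's best reply to firm 2's expected quantity
    return (a - b*q2_exp - c1) / (2*b)

# Each equilibrium quantity is indeed a best reply
assert abs(q2_lo - br2(q1, c2_lo)) < 1e-12
assert abs(q2_hi - br2(q1, c2_hi)) < 1e-12
assert abs(q1 - br1((q2_lo + q2_hi) / 2)) < 1e-12
```

Firm 2 replies to firm 1's actual quantity, while firm 1 can only reply to firm 2's expected quantity, which is the source of the asymmetry in the formulas.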
concerning the players’ types is therefore positive for both players. In the lemon game, the utility of each player is 1 under uncertainty whereas it is 0 under certainty. The value of the information communicated to the buyer concerning the quality of the car is negative for both the seller and the buyer. In the duopoly game, two cases appear, depending on whether the cost of the first firm is high or low. If it is low, the transmission to the first firm of the information about the second firm’s cost has a negative value for the first firm and a positive value for the second. If the cost is high, the transmission of the same information has a positive value for both firms.
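The claim that the seller selling and the buyer buying is an equilibrium of the averaged lemon matrix can be checked mechanically; a minimal brute-force best-reply test over the 2×2 matrix given earlier:

```python
# Averaged lemon matrix: rows are the seller's actions, columns the
# buyer's; entries are (seller utility, buyer utility).
payoffs = {
    ("sell", "buy"):         (1, 1),
    ("sell", "not buy"):     (0, 0),
    ("not sell", "buy"):     (0, 0),
    ("not sell", "not buy"): (0, 0),
}

def is_nash(r, c):
    # (r, c) is a Nash equilibrium if no unilateral deviation pays
    u_r, u_c = payoffs[(r, c)]
    row_ok = all(payoffs[(r2, c)][0] <= u_r for r2 in ("sell", "not sell"))
    col_ok = all(payoffs[(r, c2)][1] <= u_c for c2 in ("buy", "not buy"))
    return row_ok and col_ok

print(is_nash("sell", "buy"))       # True
print(is_nash("sell", "not buy"))   # False: the buyer would deviate to buy
```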
Consider now a dynamic context into which some uncertainty is introduced. It is a new form of uncertainty, bearing on the past actions of the players; the player is thus confronted with “factual uncertainty” or “imperfect information”. Here again, it is often assumed that each player has a perfect memory of his own past actions; the player then faces a situation of “asymmetric information”. In fact, imperfect information arises not only when a player cannot observe his opponent’s moves, but also when he is ignorant of past events. As in the theory of individual decision, an additional agent called nature is introduced. Nature can take on states through a random mechanical process, and these states are (provisionally) unknown to the players. It is generally assumed, however, that the players know a prior probability distribution on the states of nature. As the game develops, nature may make successive moves, the corresponding states of nature being correlated or not.
The extensive form of the game, shaped as a tree, is conserved, but uncertainty is represented by “information sets” on the tree. For a player whose turn it is to move, the nodes between which he cannot distinguish (because they correspond to indiscernible past events) are grouped together in one information set. If the player has a perfect memory, the information sets are formed of nodes situated at the same level of the tree. At each node of an information set, of course, the possible action sets of a player must be identical (otherwise he could partially infer from them the node at which he is currently situated). The information sets of one player form a partition of the nodes where he has to move. The situation of perfect information is obtained when each information set contains exactly one node. As for preferences, they are still expressed by the outcomes obtained at the terminal nodes of the tree, i.e.
for all paths consisting of actions of players and states of nature, but of course the corresponding utilities only concern the players.
The representation of a game in extensive form with imperfect information is very widespread. All static games can be expressed in this form, by assuming that one of the players plays first, the other player then playing without knowing what move the first player has made. Of course, there are several ways of proceeding, with each player being represented as playing first in each period. A fortiori, all repeated games can be represented in this form. All static games with incomplete information can also be expressed in this form, provided that the probability distribution on types is coherent. The attribution of types to the players is performed by nature, which plays the first move by carrying out a draw with the help of a prior probability distribution on the players’ types.
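The information-set machinery described above can be sketched as a small data structure; the node names and layout below are illustrative assumptions, not taken from the text, but the consistency check is exactly the constraint stated above (nodes in one information set must offer identical action sets).

```python
from collections import defaultdict

# A minimal sketch of an extensive form with information sets. Each
# decision node records whose turn it is, the actions available, and an
# information-set label; nodes sharing a label are indistinguishable to
# the player who moves there.
nodes = {
    # e.g. player 2's two nodes in simplified poker, after a raise:
    "after_raise_high": {"player": 2, "actions": {"call", "fold"}, "infoset": "P2"},
    "after_raise_low":  {"player": 2, "actions": {"call", "fold"}, "infoset": "P2"},
}

# Group the nodes by information set
infosets = defaultdict(list)
for node in nodes.values():
    infosets[node["infoset"]].append(node)

# Constraint from the text: within one information set, every node must
# belong to the same player and offer the same action set (otherwise the
# player could infer which node he is currently at).
for label, members in infosets.items():
    assert len({m["player"] for m in members}) == 1, label
    first = members[0]["actions"]
    assert all(m["actions"] == first for m in members), label
```

Perfect information corresponds to the degenerate case where every information set contains a single node.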
One typical game is “simplified poker”. At the start of the game, each player pays a sum of 1 into the pot; nature gives the first player a card, high or low with equiprobability, which the second player cannot see. Having looked at his card, the first player can either fold, in which case the second player wins the pot, or raise by paying 2 more into the pot. If he raises, the second player can either fold, in which case the first player wins the pot, or call to see the card by paying 2 into the pot himself. If the second player calls, the first player wins the pot if the card is high and the second wins if it is low.
[Game tree: nature (N) deals a high (H) or low (L) card with equal probability. Player 1 either folds (F), giving (−1,1), or raises (R); player 2, who cannot see the card, either folds (F), giving (1,−1), or calls (C), giving (3,−3) if the card is high and (−3,3) if it is low. Player 2’s two decision nodes form a single information set.]
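From these payoffs one can derive the mixed (behavioral) equilibrium of simplified poker: with a high card, raising dominates folding for player 1; with a low card he bluffs with some probability, and player 2 calls with some probability. The sketch below solves the two indifference conditions (the derivation is standard, not spelled out in this passage):

```python
from fractions import Fraction as F

# Player 1 raises with a high card (dominant); with a low card he
# bluffs (raises) with probability alpha, otherwise folds. Player 2,
# on seeing a raise, calls with probability beta.

def value_low_raise(beta):
    # player 1's payoff from raising with a low card:
    # player 2 folds -> +1, calls -> -3
    return (1 - beta) * F(1) + beta * F(-3)

def value2_call(alpha):
    # player 2's payoff from calling a raise, given player 1 bluffs
    # with probability alpha
    p_high = F(1) / (1 + alpha)          # posterior that the card is high
    return p_high * F(-3) + (1 - p_high) * F(3)

# Indifference conditions pin down the equilibrium probabilities
beta = F(1, 2)    # makes player 1 indifferent: 1 - 4*beta = -1 (= folding)
alpha = F(1, 2)   # makes player 2 indifferent between calling and folding (-1)

assert value_low_raise(beta) == F(-1)   # equals the payoff of folding
assert value2_call(alpha) == F(-1)      # equals the payoff of folding

# Value of the game to player 1: with a high card he raises
# (player 2 folds -> +1, calls -> +3); with a low card he gets -1.
v_high = (1 - beta) * F(1) + beta * F(3)
value = F(1, 2) * v_high + F(1, 2) * F(-1)
print(value)   # 1/2
```

The game thus favors player 1, who extracts a positive expected value from his informational advantage despite bluffing only half the time.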
Another example is a variant of the entry game. We now assume that the potential entrant can choose between not entering, entering softly, or entering forcefully; in this last case he wins the whole market if the monopolist is pacific. The monopolist cannot distinguish between forceful and soft entry, although he obtains different utilities in each case.
[Game tree: the entrant (player 1) chooses between not entering (¬E), giving (0,2), entering softly (Es) or entering forcefully (Ef); the monopolist (player 2), whose two decision nodes form a single information set, then responds pacifically (P) or aggressively (A). An aggressive response gives (−1,−1) after either entry; a pacific response gives (1,1) after soft entry and (2,0) after forceful entry.]
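A brute-force search over the corresponding normal form locates the pure equilibria of this variant. The payoff assignment below is my reading of the tree (aggression yields (−1,−1) after either entry, a pacific response yields (1,1) after soft entry and (2,0) after forceful entry, and staying out yields (0,2)); it should be treated as an illustration rather than the book's table.

```python
from itertools import product

# Normal form of the entry variant: the entrant moves first, the
# monopolist responds without observing whether entry was soft or
# forceful. Entries are (entrant utility, monopolist utility).
U = {
    ("noE", "P"): (0, 2), ("noE", "A"): (0, 2),
    ("Es",  "P"): (1, 1), ("Es",  "A"): (-1, -1),
    ("Ef",  "P"): (2, 0), ("Ef",  "A"): (-1, -1),
}
E_ACTS, M_ACTS = ("noE", "Es", "Ef"), ("P", "A")

def pure_nash():
    eqs = []
    for e, m in product(E_ACTS, M_ACTS):
        ue, um = U[(e, m)]
        if all(U[(e2, m)][0] <= ue for e2 in E_ACTS) and \
           all(U[(e, m2)][1] <= um for m2 in M_ACTS):
            eqs.append((e, m))
    return eqs

print(pure_nash())   # [('noE', 'A'), ('Ef', 'P')]
```

Two pure Nash equilibria emerge: the entrant stays out under an aggressive threat, or enters forcefully against a pacific monopolist; only the second survives the sequential-rationality refinements discussed in connection with imperfect information, since aggression is never a best reply once entry has occurred.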
The pure strategy of a player is now defined as the move made by the player in every information set (i.e. the same move at each node of the information