Descarga la aplicación para disfrutar aún más
Vista previa del material en texto
UC Berkeley Fall 2007 Economics 201A - Section 6 Marina Halac 1 What we learnt this week • Basics: trigger strategy • Classes of games: infinitely repeated games, static Bayesian games • Solution concepts: subgame perfect Nash equilibrium, Bayesian Nash equilibrium • Applications: auction model 2 Problems Problem 1: Infinitely repeated asymmetric prisoners’ dilemma Consider an infinitely repeated game with discount factor δ and the following stage-game C D C 3, 2 -1, 3 D 5, -1 0, 0 (i) What are the payoffs in the worst possible SPNE? (ii) What is the lowest discount factor δ∗ for which the players achieve cooperation in a SPNE? (iii) Is it possible to achieve payoffs better than (0, 0) in a SPNE for discount factors lower than δ∗? Hint: Think about a SPNE in which the players alternate between (C, D) and (D, C) on the equilibrium path. Problem 2: Equilibria with payoffs worse than Nash Consider an infinitely repeated game with discount factor δ and the following stage-game D E D 0,0 -1,-1 E -1,-1 -2,-2 Assume δ > 1/2. Can (E,E) be played in a SPNE? Note that (E,E) gives payoffs (-2,-2) which are lower than the payoffs achieved in the stage-game Nash equilibrium, (0,0). If your answer is yes, propose strategies that sustain (E,E) in some period in a SPNE. If your answer is no, prove it. Problem 3: Cournot oligopoly Let Yk be player k’s payoff in his worst Nash equilibrium of the stage game. Let Dk(a1, ..., aN ) ≡ maxa∈Ak Uk(a1, ..., ak−1, a, ak+1, ..., aN ). In problem 11 of Problem Set Nagano, you are asked to prove the following theorem: Consider a stage-game G and an action profile of G, (a1, ..., aN ), with payoffs (X1, X2, ..., XN ) such that Xk > Yk. Then for all δ > maxk Dk−XkDk−Yk , there exists a SPNE of G(T=∞,δ) where players play (a1, ..., aN ) each period. 1 Now consider n firms in a Cournot oligopoly. Inverse demand is given by P (Q) = a−Q, where Q = q1 + ...+qn. Total costs for firm i are ci(qi) = cqi. Consider the infinitely repeated game based on this stage game. Using the theorem above, find the lowest value of δ such that the firms can sustain the monopoly output level in a SPNE. How does you answer vary with n? Problem 4: Bayesian game C D A x,y 4,5 B 2,y x,3 Let x ∈ {3, 5} and y ∈ {4, 6}, independently drawn. Both values of x and both values of y are equally likely. Suppose that each player knows his own payoff: player 1 observes x and player 2 observes y. Player 1 does not observe y and player 2 does not observe x. Find the BNE. Problem 5: Cournot with asymmetric information (Gibbons 3.2) Consider a Cournot duopoly operating in a market with inverse demand P (Q) = a − Q, where Q = q1 + q2 is the aggregate quantity on the market. Both firms have total costs ci(qi) = cqi, but demand is uncertain: it is high (a = aH) with probability θ and low (a = aL) with probability 1−θ. Furthermore, information is asymmetric: firm 1 knows whether demand is high or low, but firm 2 does not. All of this is common knowledge. The two firms simultaneously choose quantities. What are the strategy spaces for the two firms? Make assumptions concerning aH , aL, θ, and c such that all equilibrium quantities are positive. What is the BNE of this game? Problem 6: Second-price auction (MWG 8B3) Consider the following action (known as second-price or Vickrey auction). An object is auctioned off to I bidders. Bidder i’s valuation of the object is vi. Each bidder observes only his own valuation. The auction rules are that each player submit a bid (a nonnegative number) in a sealed envelope. The envelopes are then opened, and the bidder who has submitted the highest bid gets the object but pays the auctioneer the amount of the second highest bid. If more than one bidder submits the highest bid, each gets the object with equal probability. Show that submitting a bid of vi with certainty is a weakly dominant strategy for bidder i. Also argue that this is bidder i’s unique weakly dominant strategy. Problem 7: First-price auction Consider a first-price sealed-bid auction of an object with two bidders. Each bidder i’s valuation of the object is vi. Each bidder observes only his own valuation. The valuation is distributed uniformly and independently on [0, 1] for each bidder. The auction rules are that each player submit a bid in a sealed envelope. The envelopes are then opened, and the bidder who has submitted the highest bid gets the object and pays the auctioneer the amount of his bid. If the bidders submit the same bid, each gets the object with probability 1/2. Derive the BNE of this auction. Hint: Look for an equilibrium in which bidder i’s strategy is bi(vi) = civi for some constant ci. 2 3 Answers Problem 1 (i) The payoffs in the worst SPNE are (0, 0). They are achieved by the repetition of the stage- game Nash equilibrium (D,D). It is not possible to achieve payoffs worse than 0 since either player can always guarantee himself a payoff of at least 0 by playing D in every period. (ii) According to part (a) the worst available punishment has a payoff of 0 for both players. So for neither player to want to deviate from (C, C), both of the following conditions must hold 5 6 3/(1− δ) ⇔ δ > 2/5 (deviation of player 1) 3 6 2/(1− δ) ⇔ δ > 1/3 (deviation of player 2) We find that δ∗ = 2/5. (iii) Yes. Define the following regimes: regime 1: play (C, D); regime 2: play (D, C). Then consider the following strategies (for each player): Start in regime 1. If the outcome in regime 1 is (C,D), go to regime 2; otherwise, play (D,D) forever. If the outcome in regime 2 is (D,C), go to regime 1; otherwise, play (D,D) forever. We now prove that there exists δ < δ∗ for which these strategies constitute a SPNE with payoffs better than (0,0). In this SPNE, the players alternate between (C,D) and (D,C) as long as neither player deviates, and deviations are punished by the stage-game Nash equilibrium forever. For no player to want to deviate in any regime, the following conditions must hold 3 6 5 − δ + 5δ2 − δ3... = (5− δ)/(1− δ2) always holds (player 1 in regime (D,C)) 0 6 −1 + 3δ − δ2 + 3δ3... = (−1 + 3δ)/(1− δ2) holds if δ > 1/3 (player 2 in regime (D,C)) 0 6 −1 + 5δ − δ2 + 5δ3... = (−1 + 5δ)/(1− δ2) holds if δ > 1/5 (player 1 in regime (C,D)) 2 6 3 − δ + 3δ2 − δ3... = (3− δ)/(1− δ2) always holds (player 2 in regime (C,D)) Note also that no player deviates from (D,D) (in the subgame following deviation) as (D,D) is a Nash equilibrium of the stage game. Hence, a SPNE in which the players alternate between (C, D) and (D, C) exists if δ > 1/3 (note that 1/3 < δ∗). If players start in regime (C, D), they get strictly positive payoffs of (−1 + 5δ)/(1− δ2) and (3− δ)/(1− δ2). Problem 2 The answer is yes. For δ > 1/2, a SPNE that implements (E,E) in some period exists. Define the following regimes: regime 1: play (E,E); regime 2: play (D,D). Consider the following strategies (for each player): Start in regime 1. If outcome is (E,E) in regime 1, go to regime 2. Otherwise, stay in regime 1. Regime 2 is absorbing. We check that neither player wants to deviate in regimes 1 and 2. Regime 2 is a repetition of a stage-game Nash equilibrium, so no player has incentives to deviate. In regime 1, if a player does not deviate, his payoff is −2. If a player deviates once, his payoff is −1− 2δ + 0δ2 + ... = −1− 2δ Hence, a SPNE in which the players play (E,E) in the first period exists if δ > 1/2. (Note that I used the One-Stage Deviation Principle: In a repeated game, a combination of players’ strategies s is a SPNE iff no player can gain by deviating in a single stage, and confirming to s thereafter.) 3 Problem 3 The monopoly output is QM = (a− c)/2. Thus, if the n firms share the monopoly output equally, each produces qM = (a− c)/(2n), with profit πM = (a− c)2 4n A firm’s optimal stage-game deviation, qD, when the n− 1 other firms choose qM is qD = arg max q q(a− (n− 1)a− c 2n − q − c) = n+ 1 4n (a− c) This gives the deviator a profit level of πD = (n+ 1)2 16n2 (a− c)2 The unique stage-gameNE is for each firm to choose qi = arg max qi qi(a− qi − ∑ j 6=i qj − c) = a− ∑ j 6=i qj − c 2 Solving the system of equations, the n-firm Cournot NE outcome involves each firm producing qC = (a− c)/(n+ 1), with profit πC = (a− c)2 (n+ 1)2 Consider the proposed trigger-strategy for each player: Play qM in period 1. In period t, play qM if all firms have played qM in all previous periods; play qC otherwise. Using the theorem above, these trigger strategies constitute a SPNE if δ ≥ π D − πM πD − πC = (n+1)2 16n2 − 14n (n+1)2 16n2 − 1 (n+1)2 ≡ δ∗ Note that δ∗ → 1 as n→∞. Hence, collusion is harder to sustain when there are more firms. Problem 4 Each player has two types: T1 = {t1L, t1H}, where t1L knows that x = 3 and t1H knows that x = 5, and T2 = {t2L, t2H}, where t2L knows that y = 4 and t2H knows that y = 6. To find the BNE, note that for type t1L, playing A strictly dominates playing B (regardless of his belief about player 2’s action, A is t1L’s best response), and for type t2H , playing C strictly dominates playing D. Hence, in any BNE, t1L plays A and t2H plays C. Consider next type t1H . His expected payoff from playing A is 5p1(s2 = C|t1H) + 4(1− p1(s2 = C|t1H)) = 4 + p1(s2 = C|t1H) 4 and his expected payoff from playing B is 2p1(s2 = C|t1H) + 5(1− p1(s2 = C|t1H)) = 5− 3p1(s2 = C|t1H) Thus, for t1H , playing A is a best response iff the probability that player 2 plays C is at least 1/4. Now note that in equilibrium, the probability that player 2 plays C is always greater than 1/4, since for any q ∈ [0, 1], p1(s2 = C|t1H) = p1(t2H |t1H).1 + p1(t2L|t1H)q = 1/2 + q/2 > 1/2 Hence, in any BNE, t1H plays A. Finally, consider type t2L. Regardless of what player 1 does, the payoff from playing C is always y = 4 for this type. His expected payoff from playing D is U2(s∗1L, D; t1L, t2L)p2(t1L|t2L) + U2(s∗1H , D; t1H , t2L)p2(t1H |t2L) = U2(A,D; t1L, t2L)p2(t1L|t2L) + U2(A,D; t1H , t2L)p2(t1H |t2L) = 5 > 4 Hence, in any BNE, t2L plays D. We have found that there exists a unique BNE with (s∗1L, s ∗ 1H , s ∗ 2L, s ∗ 2H) = (A,A,D,C). Problem 5 Firm 1’s strategy space is [0,∞)× [0,∞), and firm 2’s strategy space is [0,∞). The strategies (q∗1L, q ∗ 1H , q ∗ 2) constitute a BNE iff q∗1L = arg maxq q(aL − q − q ∗ 2 − c), q∗1H = arg maxq q(aH − q − q ∗ 2 − c), and q∗2 = arg maxq θq(aH − q − q ∗ 1H − c) + (1− θ)q(aL − q − q∗1L − c) The three first-order conditions are: q∗1L = aL − c− q∗2 2 , q∗1H = aH − c− q∗2 2 , and q∗2 = θaH + (1− θ)aL − c− (θq∗1H + (1− θ)q∗1L) 2 Therefore, the unique BNE is: (q∗1L, q ∗ 1H , q ∗ 2) = ( −θaH + (2 + θ)aL − 2c 6 , (3− θ)aH − (1− θ)aL − 2c 6 , θaH + (1− θ)aL − c 3 ) Problem 6 We prove that bidding vi is at least as good as any other bid no matter what other bids are. Consider an alternative bid bi > vi. Case 1: There is another bid higher than bi. Then neither of the bids wins the auction and so both give a payoff of 0. Case 2: The second highest bid is less than vi. Then bidder i wins the auction and pays the same second highest bid in both cases. Both bids give the same payoff. Case 3: The highest bid of other players is between vi and bi. Then bid vi would lose the auction and give a payoff of 0. Bid bi would win the auction but the bidder would 5 have to pay more than his valuation, obtaining a negative payoff. Hence, bidding bi > vi is weakly dominated by bidding vi. Consider next an alternative bid bi < vi. Case 1: There is another bid higher than vi. Then neither of the bids wins the auction and so both give a payoff of 0. Case 2: The second highest bid is less than bi. Then bidder i wins the auction and pays the same second highest bid in both cases. Both bids give the same payoff. Case 3: The highest bid of other players is between bi and vi. Then bid bi would lose the auction and give a payoff of 0. Bid vi would win the auction and the bidder would have to pay less than his valuation, obtaining a positive payoff. Hence, bidding bi < vi is weakly dominated by bidding vi. This argument implies that bidding vi is the unique weakly dominant strategy for player i. Problem 7 Following the hint, suppose player j’s strategy is bj(vj) = cjvj . For a given value of vi, player i’s best response solves max bi (vi − bi)Pr(bi > bj) = (vi − bi)Pr(bi > cjvj) (Note that we can ignore a tie since Pr(bi = bj) = 0 because bj(vj) = cjvj and vj is uniformly distributed, so bj is uniformly distributed.) Now note that Pr(bi > cjvj) = Pr(vj < bi/cj) = bi/cj where the last equality uses bi/cj ∈ [0, 1], which follows from the fact that player i would not bid below 0 nor above cj . Hence, player i’s best response is bi(vi) = vi/2 The same analysis for player j yields bj(vj) = vj/2 So each player submits a bid equal to half his valuation. This reflects the trade-off that a player faces in an auction: the higher the bid, the more likely to win; the lower the bid, the larger the gain if the player wins. 6
Compartir