Goldsmiths Company Mathematics Course for Teachers. Game Theory. Dr. Reto Mueller

Similar documents
A review of game-theoretic models of road user behaviour

GameTheoryNotes.notebook March 01, Game Theory. First, for each possible outcome, determine it's probability and it's "value"

Table 7: Battle of the Sexes Woman P rize F ight Ballet P rize F ight 2,1 0, 0 Man Ballet 0, 0 1,2. Payoffs to: (Man, Woman).

University of Notre Dame Department of Finance Economics of the Firm Spring 2012

Game Theory (MBA 217) Final Paper. Chow Heavy Industries Ty Chow Kenny Miller Simiso Nzima Scott Winder

ECONOMICS DEPT NON - ASSESSED TEST. For Internal Students of Royal Holloway. Time Allowed: 1 hour. Instructions to candidates:

GameTheoryNotesDay2.notebook March 06, Equilibrium Points (or Saddle Points) in Total conflict games (no collaborating)

A Game Theoretic Study of Attack and Defense in Cyber-Physical Systems

(d) Give a formal definition of a strategy in an extensive form game.

Existence of Nash Equilibria

Mohammad Hossein Manshaei 1393

The Performance-Enhancing Drug Game. by Kjetil K. Haugen. Molde University College Servicebox 8, N-6405 Molde, Norway

We can use a 2 2 array to show all four situations that can arise in a single play of this game, and the results of each situation, as follows:

We can use a 2 2 array to show all four situations that can arise in a single play of this game, and the results of each situation, as follows:

Columbia University. Department of Economics Discussion Paper Series. Auctioning the NFL Overtime Possession. Yeon-Koo Che Terry Hendershott

The Codfish Game. April 21, 2011 Eric Rasmusen,

Extensive Games with Perfect Information

The Economic Principles in Transportation Planning

2012 Petaluma Traffic Safety Report Collisions Rise for 2 nd Year in a Row

A Route Choice Experiment Involving Monetary Payouts and Actual Waiting Times

Vision Zero in Canada. 9 th International Conference on Urban Traffic Safety August 2017

One Dimensional Kinematics Challenge Problems

EXAMINATION ONE. I. Explanation, Theory, and the Scientific Method [10 points]

Models for Pedestrian Behavior

A Road Safety Strategy for Greece

PLEASE MARK YOUR ANSWERS WITH AN X, not a circle! 2. (a) (b) (c) (d) (e) (a) (b) (c) (d) (e) (a) (b) (c) (d) (e)...

TEST OF ENGLISH FOR EDUCATIONAL PURPOSES

A Group of Factors Influencing the Development of the Greeks Volleyball Athletes at School Age

A Game-Theoretic Intelligent Agent for the Board Game Football Strategy

CS/Ec101b (2005) Game Theory Module

Game Theory. 1. Introduction. Bernhard Nebel and Robert Mattmüller. Summer semester Albert-Ludwigs-Universität Freiburg

Level 3 Mathematics and Statistics (Statistics), 2013

CAM Final Report John Scheele Advisor: Paul Ohmann I. Introduction

TOPIC 10: BASIC PROBABILITY AND THE HOT HAND

On the advantage of serving first in a tennis set: four years at Wimbledon

Crash Patterns in Western Australia. Kidd B., Main Roads Western Australia Willett P., Traffic Research Services

Distracted Driving: Keeping Your Employees Safe

TIMETABLING IN SPORTS AND ENTERTAINMENT

Contingent Valuation Methods

Peer Effect in Sports: A Case Study in Speed. Skating and Swimming

LEARNING OBJECTIVES. Overview of Lesson. guided practice Teacher: anticipates, monitors, selects, sequences, and connects student work

PEDESTRIAN AND DRIVER KNOWLEDGE OF PRIORITY RULES FOR VARIOUS TYPES OF CROSSINGS J. Hatfield 1, R.F.S. Job 2 & K. Smith 3

DECISION OF THE CHIEF CIVILIAN DIRECTOR OF THE INDEPENDENT INVESTIGATIONS OFFICE

The Evolution of Transport Planning

Professor Stephen Hawking s World Cup Study for Paddy Power

The price of anarchy on the roads and in football

Road design and Safety philosophy, 1 st Draft

The race for boats. Christian Mullon (IRD-France), Charles Mullon (UFL- Switzerland)

Analysis of Car-Pedestrian Impact Scenarios for the Evaluation of a Pedestrian Sensor System Based on the Accident Data from Sweden

The Effect of Pressure on Mixed-Strategy Play in Tennis: The Effect of Court Surface on Service Decisions

Intersec(ons of the Future: Using Fully Autonomous Vehicles

Roundabouts along Rural Arterials in South Africa

Film Guide for Educators. Designing for Safety

To focus on winning one competition or try to win more: Using a Bayesian network to help decide the optimum strategy for a football club

Examining NBA Crunch Time: The Four Point Problem. Abstract. 1. Introduction

Live by the Three, Die by the Three? The Price of Risk in the NBA

A strategy is strictly dominated if there is some strategy which strictly dominatesit. Game Theory Dr. F. Fatemi Page 100

PLEASE MARK YOUR ANSWERS WITH AN X, not a circle! 1. (a) (b) (c) (d) (e) 2. (a) (b) (c) (d) (e) (a) (b) (c) (d) (e) 4. (a) (b) (c) (d) (e)...

London Safety Camera Partnership

An evidence-based journey towards democratic streets

CONSIDERATION OF DENSITY VARIATIONS IN THE DESIGN OF A VENTILATION SYSTEM FOR ROAD TUNNELS

The Mathematics of Gambling

Antonio La Torre. Anti-Doping Education I Run Clean Workshop. Vilnius 2017

Louis M. Edwards Mathematics Super Bowl Valencia Community College -- April 19, 2002

The New Mexico Lottery Continued. Lotteries II 1/26

Puzzles Choose a Curtain, Duel-ity, Two Point Conversions, and More

EUROPEAN TRANSPORT CONFERENCE 2006 ROAD SAFETY IN LONDON PAST SUCCESSES AND FUTURE INNOVATIONS

Performance Task # 1

Road Safety Partnership

M-58 HIGHWAY ACCESS MANAGEMENT STUDY Mullen Road to Bel-Ray Boulevard. Prepared for CITY OF BELTON. May 2016

Reduction of Speed Limit at Approaches to Railway Level Crossings in WA. Main Roads WA. Presenter - Brian Kidd

MGF 1106 Liberal Arts Mathematics Final Review

Recommended Roadway Plan Section 2 - Land Development and Roadway Access

OV United Soccer Club U-8 Coaches Guide

A Behavioral Theory. Teck H. Ho. & National University of Singapore. Joint Work with Noah Lim, Houston Juanjuan Zhang, MIT. July 2008 Teck H.

The Timing and Deterrence of Terrorist Attacks 1. Keywords: Terrorism, terror capacity, threat, conflict, dynamics, contest success function

Can Roger do, what Roger can? Optimal serving in tennis

20mph. We want to make Edinburgh a better and safer place to live, work and play.

Clinic (U5/U6) Program Adventure Theme Practice Plans

ECO 199 GAMES OF STRATEGY Spring Term 2004 Precept Materials for Week 3 February 16, 17

Study Guide and Intervention

Brazilian torture survivor Ines Etienne Romeu dies

THEORY OF TRAINING, THEORETICAL CONSIDERATIONS WOMEN S RACE WALKING

Lesson 22: Average Rate of Change

Blue Ocean Strategy Innovation Reconsidered. Project Management Institute March 29, 2012 Teri Brown

Tokyo: Simulating Hyperpath-Based Vehicle Navigations and its Impact on Travel Time Reliability

Frequently Asked Questions February 2015 SETUP

CIA Interrogation: Torture or Technique?

Urgent duty driving (UDD) This chapter contains these topics: Overview Legal provisions - defences Responsibilities

by Robert Gifford and Jorge Aranda University of Victoria, British Columbia, Canada

When Betting Odds and Credences Come Apart: More Worries for Dutch Book Arguments

Optimization and Search. Jim Tørresen Optimization and Search

Journal of Chemical and Pharmaceutical Research, 2014, 6(3): Research Article

Postgraduate Programmes Registration and Welcome Presentation Schedule 2018

Introduction Politeness Theory Game Theory Trust and Modals Conclusion. About Face! Jason Quinley, Christopher Ahern.

The key to achievement of a sustainable, safe traffic system lies in the systematic and consistent application of safety principles.

States of Matter. The Behavior of Gases

[ Fundación PONS ] [ Fundación PONS ] [ Expertise PONS ] Investing in road safety

Speed Limit Policy Isle of Wight Council

Frequently asked questions (FAQ) about a borough-wide 20 mph speed limit

Transcription:

Goldsmiths Company Mathematics Course for Teachers Game Theory Dr. Reto Mueller r.mueller@qmul.ac.uk

Game Theory is the study of strategic decision making Game Theory is the study of mathematical models of conflict and cooperation between intelligent rational decision-makers. An alternative name for this theory is interactive decision theory. Game theory is mainly used in economics, political science, and psychology, as well as logic, computer science, and biology. A game is usually represented by a matrix or table which shows the players, strategies, and payoffs.

Game Theory is the study of strategic decision making Game Theory is the study of mathematical models of conflict and cooperation between intelligent rational decision-makers. An alternative name for this theory is interactive decision theory. Game theory is mainly used in economics, political science, and psychology, as well as logic, computer science, and biology. A game is usually represented by a matrix or table which shows the players, strategies, and payoffs. Player B plays strategy left Player B plays strategy right Player A plays 4 \ 3 2 \ 0 strategy up Player A plays 3 \ 2 1 \ 1 strategy down

What is the best strategy? Which strategy should the players pick? Let us look at the previous game again... A \ B left right up 4 \ 3 2 \ 0 down 3 \ 2 1 \ 1

What is the best strategy? Which strategy should the players pick? Let us look at the previous game again... A \ B left right up 4 \ 3 2 \ 0 down 3 \ 2 1 \ 1 A \ B left right up 0 \ 3 2 \ 2 down 1 \ 1 3 \ 0

What is the best strategy? Which strategy should the players pick? Let us look at the previous game again... A \ B left right up 4 \ 3 2 \ 0 down 3 \ 2 1 \ 1 A \ B left right up 0 \ 3 2 \ 2 down 1 \ 1 3 \ 0 In the first game, it is clear that the players will choose (up, left), maximising the profit for both of them. But what about the second game?

What is the best strategy? Which strategy should the players pick? Let us look at the previous game again... A \ B left right up 4 \ 3 2 \ 0 down 3 \ 2 1 \ 1 A \ B left right up 0 \ 3 2 \ 2 down 1 \ 1 3 \ 0 In the first game, it is clear that the players will choose (up, left), maximising the profit for both of them. But what about the second game? Player A does better with the down strategy (no matter what B picks) and player B does better with the left strategy (no matter what A picks). So will they end up picking (down, left) despite the fact that with (up, right) they would both profit more?

Nash equilibrium Definition: Informally, a set of strategies is a Nash equilibrium if no player can do better by unilaterally changing his or her strategy.

Nash equilibrium Definition: Informally, a set of strategies is a Nash equilibrium if no player can do better by unilaterally changing his or her strategy. To see what this means, imagine that each player is told the strategies of the others and then asks himself or herself: Knowing the strategies of the other players, can I benefit by changing my strategy? If any player would answer Yes, then that set of strategies is not a Nash equilibrium. But if every player prefers not to switch (or is indifferent between switching and not) then the set of strategies is a Nash equilibrium.

Nash equilibrium Definition: Informally, a set of strategies is a Nash equilibrium if no player can do better by unilaterally changing his or her strategy. To see what this means, imagine that each player is told the strategies of the others and then asks himself or herself: Knowing the strategies of the other players, can I benefit by changing my strategy? If any player would answer Yes, then that set of strategies is not a Nash equilibrium. But if every player prefers not to switch (or is indifferent between switching and not) then the set of strategies is a Nash equilibrium. A \ B left right up 4 \ 3 2 \ 0 down 3 \ 2 1 \ 1 A \ B left right up 0 \ 3 2 \ 2 down 1 \ 1 3 \ 0

Nash equilibrium If a game has one Nash equilibrium, then all players will play the Nash equilibrium strategy if 1. The players all will do their utmost to maximize their expected payoff as described by the game.

Nash equilibrium If a game has one Nash equilibrium, then all players will play the Nash equilibrium strategy if 1. The players all will do their utmost to maximize their expected payoff as described by the game. 2. The players are flawless in execution.

Nash equilibrium If a game has one Nash equilibrium, then all players will play the Nash equilibrium strategy if 1. The players all will do their utmost to maximize their expected payoff as described by the game. 2. The players are flawless in execution. 3. The players have sufficient intelligence to deduce the solution.

Nash equilibrium If a game has one Nash equilibrium, then all players will play the Nash equilibrium strategy if 1. The players all will do their utmost to maximize their expected payoff as described by the game. 2. The players are flawless in execution. 3. The players have sufficient intelligence to deduce the solution. 4. The players know the strategy of all of the other players.

Nash equilibrium If a game has one Nash equilibrium, then all players will play the Nash equilibrium strategy if 1. The players all will do their utmost to maximize their expected payoff as described by the game. 2. The players are flawless in execution. 3. The players have sufficient intelligence to deduce the solution. 4. The players know the strategy of all of the other players. 5. The players believe that a deviation in their own strategy will not cause deviations by any other players.

Nash equilibrium If a game has one Nash equilibrium, then all players will play the Nash equilibrium strategy if 1. The players all will do their utmost to maximize their expected payoff as described by the game. 2. The players are flawless in execution. 3. The players have sufficient intelligence to deduce the solution. 4. The players know the strategy of all of the other players. 5. The players believe that a deviation in their own strategy will not cause deviations by any other players. 6. There is common knowledge that all players meet these conditions, including this one. So, not only must each player know the other players meet the conditions, but also they must know that they all know that they meet them, and know that they know that they know that they meet them, etc.

Example 1: Prisoner s Dilemma Imagine two prisoners held in separate cells, interrogated simultaneously, and offered deals (lighter jail sentences) for betraying their fellow criminal. [They both go to prison for one year for stealing a car, but the police thinks they also robbed a jewellery shop together.]

Example 1: Prisoner s Dilemma Imagine two prisoners held in separate cells, interrogated simultaneously, and offered deals (lighter jail sentences) for betraying their fellow criminal. [They both go to prison for one year for stealing a car, but the police thinks they also robbed a jewellery shop together.] A \ B stay quiet betray the other stay quiet -1 \ -1-3 \ 0 betray the other 0 \ -3-2 \ -2

Example 1: Prisoner s Dilemma Imagine two prisoners held in separate cells, interrogated simultaneously, and offered deals (lighter jail sentences) for betraying their fellow criminal. [They both go to prison for one year for stealing a car, but the police thinks they also robbed a jewellery shop together.] A \ B stay quiet betray the other stay quiet -1 \ -1-3 \ 0 betray the other 0 \ -3-2 \ -2 While both players would do better staying silent, the rational behaviour for both of them is to betray the other! A Prisoner s Dilemma has a unique Nash equilibrium (which is not however not optimal).

Example 1: Prisoner s Dilemma in the real world 1. Doping in sport. If none of two athletes takes doping, then neither gains an advantage over the other. If only one does, then that athlete gains a significant advantage over the competitor. If both athletes take the drug, however, the benefits cancel out and only the drawbacks remain (the drug might be illegal and dangerous), putting them both in a worse position than if neither had used doping.

Example 1: Prisoner s Dilemma in the real world 1. Doping in sport. If none of two athletes takes doping, then neither gains an advantage over the other. If only one does, then that athlete gains a significant advantage over the competitor. If both athletes take the drug, however, the benefits cancel out and only the drawbacks remain (the drug might be illegal and dangerous), putting them both in a worse position than if neither had used doping. A \ B don t take it take it don t take it 0 \ 0-5 \ 3 take it 3 \ -5-2 \ -2

Example 1: Prisoner s Dilemma in the real world 1. Doping in sport. If none of two athletes takes doping, then neither gains an advantage over the other. If only one does, then that athlete gains a significant advantage over the competitor. If both athletes take the drug, however, the benefits cancel out and only the drawbacks remain (the drug might be illegal and dangerous), putting them both in a worse position than if neither had used doping. A \ B don t take it take it don t take it 0 \ 0-5 \ 3 take it 3 \ -5-2 \ -2 The rational strategy for both athletes is to take the doping!

Example 1: Prisoner s Dilemma in the real world 2. Arms race. During the Cold War the opposing alliances had the choice to arm or disarm. From each side s point of view, disarming whilst their opponent continued to arm would have led to military inferiority and possible annihilation. Conversely, arming whilst their opponent disarmed would have led to superiority. If both sides chose to arm, or if both sides chose to disarm, war can be avoided. However, arming comes at the high cost of developing and maintaining a nuclear arsenal...

Example 1: Prisoner s Dilemma in the real world 2. Arms race. During the Cold War the opposing alliances had the choice to arm or disarm. From each side s point of view, disarming whilst their opponent continued to arm would have led to military inferiority and possible annihilation. Conversely, arming whilst their opponent disarmed would have led to superiority. If both sides chose to arm, or if both sides chose to disarm, war can be avoided. However, arming comes at the high cost of developing and maintaining a nuclear arsenal... 3. Advertising. If only one of two competing companies advertises its product [or invests more in advertising], it gains a huge advantage over the other. If they advertise at the same level (including not advertising at all), then none of them improves compared to the other. But advertising comes at some high cost.

Example 2: Coordination game A typical case for a coordination game is choosing the sides of the road upon which to drive, a social standard which can save lives if it is widely adhered to. In a simplified example, assume that two drivers meet on a narrow dirt road. Both have to swerve in order to avoid a head-on collision. If both execute the same swerving maneuver they will manage to pass each other, but if they choose differing maneuvers they will collide.

Example 2: Coordination game A typical case for a coordination game is choosing the sides of the road upon which to drive, a social standard which can save lives if it is widely adhered to. In a simplified example, assume that two drivers meet on a narrow dirt road. Both have to swerve in order to avoid a head-on collision. If both execute the same swerving maneuver they will manage to pass each other, but if they choose differing maneuvers they will collide. A \ B left right left 1 \ 1-5 \ -5 right -5 \ -5 1 \ 1

Example 2: Coordination game A typical case for a coordination game is choosing the sides of the road upon which to drive, a social standard which can save lives if it is widely adhered to. In a simplified example, assume that two drivers meet on a narrow dirt road. Both have to swerve in order to avoid a head-on collision. If both execute the same swerving maneuver they will manage to pass each other, but if they choose differing maneuvers they will collide. A \ B left right left 1 \ 1-5 \ -5 right -5 \ -5 1 \ 1 There are two Nash equilibria! Once a social standard is achieved, nobody has any interest to change it (unless all the others change as well).

Example 2: Coordination game - Variants 1. Party or stay home?

Example 2: Coordination game - Variants 1. Party or stay home? A \ B Party Stay home Party 5 \ 5-2 \ -2 Stay home -2 \ -2 1 \ 1

Example 2: Coordination game - Variants 1. Party or stay home? A \ B Party Stay home Party 5 \ 5-2 \ -2 Stay home -2 \ -2 1 \ 1 2. Change to compatible technologies?

Example 2: Coordination game - Variants 1. Party or stay home? A \ B Party Stay home Party 5 \ 5-2 \ -2 Stay home -2 \ -2 1 \ 1 2. Change to compatible technologies? A \ B Change Don t Change 5 \ 5-2 \ 2 Don t 2 \ -2 1 \ 1

Example 2: Coordination game - Variants 1. Party or stay home? A \ B Party Stay home Party 5 \ 5-2 \ -2 Stay home -2 \ -2 1 \ 1 2. Change to compatible technologies? A \ B Change Don t Change 5 \ 5-2 \ 2 Don t 2 \ -2 1 \ 1 3. Battle of the sexes (BoS also Bach or Stravinsky )

Example 2: Coordination game - Variants 1. Party or stay home? A \ B Party Stay home Party 5 \ 5-2 \ -2 Stay home -2 \ -2 1 \ 1 2. Change to compatible technologies? A \ B Change Don t Change 5 \ 5-2 \ 2 Don t 2 \ -2 1 \ 1 3. Battle of the sexes (BoS also Bach or Stravinsky ) A \ B Football Opera Football 5 \ 3 0 \ 0 Opera -2 \ -2 3 \ 5

Example 3: Anti-Coordination Game Chicken The game chicken is a game in which two drivers drive towards each other on a collision course: one must swerve, or both may die in the crash, but if one driver swerves and the other does not, the one who swerved will be called a chicken, meaning a coward.

Example 3: Anti-Coordination Game Chicken The game chicken is a game in which two drivers drive towards each other on a collision course: one must swerve, or both may die in the crash, but if one driver swerves and the other does not, the one who swerved will be called a chicken, meaning a coward. A \ B Swerve Don t Swerve 0 \ 0-1 \ 1 Don t 1 \ -1-10 \ -10

Example 3: Anti-Coordination Game Chicken The game chicken is a game in which two drivers drive towards each other on a collision course: one must swerve, or both may die in the crash, but if one driver swerves and the other does not, the one who swerved will be called a chicken, meaning a coward. A \ B Swerve Don t Swerve 0 \ 0-1 \ 1 Don t 1 \ -1-10 \ -10 Like the coordination game, also the anti-coordination game has two Nash equilibria.

Example 4: Competition game This can be illustrated by a two-player game in which both players simultaneously choose an integer from 0 to 3 and they both win the smaller of the two numbers in points. In addition, if one player chooses a larger number than the other, then he/she has to give up two points to the other.

Example 4: Competition game This can be illustrated by a two-player game in which both players simultaneously choose an integer from 0 to 3 and they both win the smaller of the two numbers in points. In addition, if one player chooses a larger number than the other, then he/she has to give up two points to the other. A \ B 0 1 2 3 0 0 \ 0 2 \ -2 2 \ -2 2 \ -2 1-2 \ 2 1 \ 1 3 \ -1 3 \ -1 2-2 \ 2-1 \ 3 2 \ 2 4 \ 0 3-2 \ 2-1 \ 3 0 \ 4 3 \ 3

Example 4: Competition game This can be illustrated by a two-player game in which both players simultaneously choose an integer from 0 to 3 and they both win the smaller of the two numbers in points. In addition, if one player chooses a larger number than the other, then he/she has to give up two points to the other. A \ B 0 1 2 3 0 0 \ 0 2 \ -2 2 \ -2 2 \ -2 1-2 \ 2 1 \ 1 3 \ -1 3 \ -1 2-2 \ 2-1 \ 3 2 \ 2 4 \ 0 3-2 \ 2-1 \ 3 0 \ 4 3 \ 3 Like the coordination game, also the anti-coordination game has two Nash equilibria.

Example 5: Network traffic Every traveler has a choice of 3 strategies, where each strategy is a route from A to D (either ABD, ABCD, or ACD).

Example 5: Network traffic Every traveler has a choice of 3 strategies, where each strategy is a route from A to D (either ABD, ABCD, or ACD). The payoff of each strategy is the travel time of each route. In the graph above, a car travelling via ABD experiences travel time of (1 + x/100) + 2, where x is the number of cars traveling on edge AB. Thus, payoffs for any given strategy depend on the choices of the other players, as is usual. However, the goal in this case is to minimize travel time, not maximize it.

Example 5: Network traffic Equilibrium will occur when the time on all paths is exactly the same. When that happens, no single driver has any incentive to switch routes, since it can only add to his/her travel time. For the graph above, if, for example, 100 cars are travelling from A to D, then equilibrium will occur when 25 drivers travel via ABD, 50 via ABCD, and 25 via ACD. Every driver now has a total travel time of 3.75 (to see this, note that a total of 75 cars take the AB edge, and likewise 75 cars take the CD edge).

Example 5: Network traffic Nash Equilibrium: 25 drivers via ABD, 50 via ABCD, and 25 via ACD. Every driver now has a total travel time of 3.75.

Example 5: Network traffic Nash Equilibrium: 25 drivers via ABD, 50 via ABCD, and 25 via ACD. Every driver now has a total travel time of 3.75. BUT if the 100 cars agreed that 50 travel via ABD and the other 50 through ACD, then travel time for any single car would actually be 3.5 which is less than 3.75.

Example 5: Network traffic Nash Equilibrium: 25 drivers via ABD, 50 via ABCD, and 25 via ACD. Every driver now has a total travel time of 3.75. BUT if the 100 cars agreed that 50 travel via ABD and the other 50 through ACD, then travel time for any single car would actually be 3.5 which is less than 3.75. This is also the Nash equilibrium if the path between B and C is removed, which means that adding an additional possible route can decrease the efficiency of the system (Braess s paradox).

Is there always a Nash equilibrium? In the matching pennies game, both players reveal a penny at the same time. If they match (i.e. both show heads or both show tails), Player A gets both of them. If the don t match, Player B gets them. So the payoff matrix looks as follows:

Is there always a Nash equilibrium? In the matching pennies game, both players reveal a penny at the same time. If they match (i.e. both show heads or both show tails), Player A gets both of them. If the don t match, Player B gets them. So the payoff matrix looks as follows: A \ B Heads Tails Heads 1 \ -1-1 \ 1 Tails -1 \ 1 1 \ -1 Is there a Nash equilibrium?

Is there always a Nash equilibrium? In the matching pennies game, both players reveal a penny at the same time. If they match (i.e. both show heads or both show tails), Player A gets both of them. If the don t match, Player B gets them. So the payoff matrix looks as follows: A \ B Heads Tails Heads 1 \ -1-1 \ 1 Tails -1 \ 1 1 \ -1 Is there a Nash equilibrium? Another example: A \ B Strategy 1 Strategy 2 Strategy 1 0 \ 2 1 \ 1 Strategy 2 2 \ 0 0 \ 2

Mixed strategies The answer to the question is: There is always a Nash equilibrium if one allows mixed strategies, i.e. an assignment of a probability to each pure strategy. [Note: Since probabilities are continuous, there are infinitely many mixed strategies available to a player, even if there are only finitely many pure strategies.]

Mixed strategies The answer to the question is: There is always a Nash equilibrium if one allows mixed strategies, i.e. an assignment of a probability to each pure strategy. [Note: Since probabilities are continuous, there are infinitely many mixed strategies available to a player, even if there are only finitely many pure strategies.] In the game with two players having two strategies, this is easy: Player A chooses with probability p (where 0 p 1) the first and with probability (1 p) the other strategy. Player B picks with probability q (where 0 q 1) the first strategy and with probability (1 q) the second strategy.

Matching pennies A \ B H T qh+(1-q)t H 1 \ -1-1 \ 1 2q-1 \ 1-2q T -1 \ 1 1 \ -1 1-2q \ 2q-1 ph+(1-p)t 2p-1 \ 1-2p 1-2p \ 2p-1... \...

Matching pennies A \ B H T qh+(1-q)t H 1 \ -1-1 \ 1 2q-1 \ 1-2q T -1 \ 1 1 \ -1 1-2q \ 2q-1 ph+(1-p)t 2p-1 \ 1-2p 1-2p \ 2p-1... \... In order for Player B to be willing to randomize, Player A should play in such a way that the expected payoff for each pure strategy of B (and hence also every mixed strategy) should be the same.

Matching pennies A \ B H T qh+(1-q)t H 1 \ -1-1 \ 1 2q-1 \ 1-2q T -1 \ 1 1 \ -1 1-2q \ 2q-1 ph+(1-p)t 2p-1 \ 1-2p 1-2p \ 2p-1... \... In order for Player B to be willing to randomize, Player A should play in such a way that the expected payoff for each pure strategy of B (and hence also every mixed strategy) should be the same. So player A picks p in such a way that 1 2p = 2p 1, i.e. p = 1/2. Similarly, Player B picks q = 1/2.

Matching pennies A \ B H T qh+(1-q)t H 1 \ -1-1 \ 1 2q-1 \ 1-2q T -1 \ 1 1 \ -1 1-2q \ 2q-1 ph+(1-p)t 2p-1 \ 1-2p 1-2p \ 2p-1... \... In order for Player B to be willing to randomize, Player A should play in such a way that the expected payoff for each pure strategy of B (and hence also every mixed strategy) should be the same. So player A picks p in such a way that 1 2p = 2p 1, i.e. p = 1/2. Similarly, Player B picks q = 1/2.We get 1 A \ B H T 2 H + 1 2 T H 1 \ -1-1 \ 1 0 \ 0 T -1 \ 1 1 \ -1 0 \ 0 1 2 H + 1 2 T 0 \ 0 0 \ 0 0 \ 0

Mixed strategies in previous games Coordination game: A \ B left right left 1 \ 1-5 \ -5 right -5 \ -5 1 \ 1

Mixed strategies in previous games Coordination game: A \ B left right left 1 \ 1-5 \ -5 right -5 \ -5 1 \ 1 pl+(1-p)r... \ p-5(1-p)... \ -5p+(1-p) We solve 5 + 6p = 1 6p, which gives p = 1/2. From symmetry reasons, Player B then picks q = 1/2.

Mixed strategies in previous games Coordination game: A \ B left right left 1 \ 1-5 \ -5 right -5 \ -5 1 \ 1 pl+(1-p)r... \ p-5(1-p)... \ -5p+(1-p) We solve 5 + 6p = 1 6p, which gives p = 1/2. From symmetry reasons, Player B then picks q = 1/2.This means that a mixed strategy is randomly going either left or right to avoid the crash. Surprisingly, this is a Nash equilibrium! A \ B L R 1 2 L + 1 2 R L 1 \ 1-5 \ -5-2 \ -2 R -5 \ -5 1 \ 1-2 \ -2 1 2 L + 1 2R -2 \ -2-2 \ -2-2 \ -2

Mixed strategies in previous games Battle of the sexes: A \ B Football Opera Football 5 \ 3 0 \ 0 Opera -2 \ -2 3 \ 5

Mixed strategies in previous games Battle of the sexes: A \ B Football Opera Football 5 \ 3 0 \ 0 Opera -2 \ -2 3 \ 5 pf+(1-p)opera... \ 3p-2(1-p)... \ 0+5(1-p) We solve 2 + 5p = 5 5p, which gives p = 0.7. From symmetry reasons, Player B then picks his favourite option (namely Opera) with probability 0.7.

Mixed strategies in previous games Battle of the sexes: A \ B Football Opera Football 5 \ 3 0 \ 0 Opera -2 \ -2 3 \ 5 pf+(1-p)opera... \ 3p-2(1-p)... \ 0+5(1-p) We solve 2 + 5p = 5 5p, which gives p = 0.7. From symmetry reasons, Player B then picks his favourite option (namely Opera) with probability 0.7.This means that a mixed strategy is going with probability 0.7 to your favourite event! A \ B F O 0.3F + 0.7O F 5 \ 3 0 \ 0 1.5 \ 1.5 O -2 \ -2 3 \ 5 1.5 \ 1.5 0.7F + 0.3O 1.5 \ 1.5 1.5 \ 1.5 1.5 \ 1.5

The Final Problem From the book The Final Problem... where Holmes and Watson attempted to flee England to get away from Professor Moriarty, who was pursuing Holmes for the purpose of killing him before Holmes investigation could bring down the Professor and his criminal empire.

The Final Problem From the book The Final Problem... where Holmes and Watson attempted to flee England to get away from Professor Moriarty, who was pursuing Holmes for the purpose of killing him before Holmes investigation could bring down the Professor and his criminal empire. Holmes elected to flee to the European continent while the London police did their work to wrap up the case he had provided for them. Holmes and Watson took a train and headed to Dover, where they could board a steamer to Calais and leave Moriarty behind and vulnerable. Holmes knew the train schedule, as did Moriarty. There was an intermediate stop in Canterbury. By this time, Moriarty and his confederates were close upon the trail chasing Holmes.

The Final Problem From the book The Final Problem... where Holmes and Watson attempted to flee England to get away from Professor Moriarty, who was pursuing Holmes for the purpose of killing him before Holmes investigation could bring down the Professor and his criminal empire. Holmes elected to flee to the European continent while the London police did their work to wrap up the case he had provided for them. Holmes and Watson took a train and headed to Dover, where they could board a steamer to Calais and leave Moriarty behind and vulnerable. Holmes knew the train schedule, as did Moriarty. There was an intermediate stop in Canterbury. By this time, Moriarty and his confederates were close upon the trail chasing Holmes. Holmes \ Moriarty Out at C Go to D Out at C 0 \ 2 1 \ 1 Go to D 2 \ 0 0 \ 2

The Final Problem Holmes \ Moriarty Out at C Go to D Out at C 0 \ 2 1 \ 1 Go to D 2 \ 0 0 \ 2 What is the strategy that they choose?

The Final Problem Holmes \ Moriarty Out at C Go to D Out at C 0 \ 2 1 \ 1 Go to D 2 \ 0 0 \ 2 What is the strategy that they choose? SOLUTION: Holmes gets out at Canterbury with probability 2/3. Moriarty on the other hand stays on the train with probability 2/3.

The Final Problem Holmes \ Moriarty Out at C Go to D Out at C 0 \ 2 1 \ 1 Go to D 2 \ 0 0 \ 2 What is the strategy that they choose? SOLUTION: Holmes gets out at Canterbury with probability 2/3. Moriarty on the other hand stays on the train with probability 2/3. SPOILER: In the book, Holmes gets indeed out at Canterbury while Moriarty s special pursuit train sped through hoping to catch Holmes in Dover and kill him there.