Mohammad Hossein Manshaei 1393

Similar documents
ECONOMICS DEPT NON - ASSESSED TEST. For Internal Students of Royal Holloway. Time Allowed: 1 hour. Instructions to candidates:

Existence of Nash Equilibria

Goldsmiths Company Mathematics Course for Teachers. Game Theory. Dr. Reto Mueller

(d) Give a formal definition of a strategy in an extensive form game.

Game Theory (MBA 217) Final Paper. Chow Heavy Industries Ty Chow Kenny Miller Simiso Nzima Scott Winder

Planning and Acting in Partially Observable Stochastic Domains

Columbia University. Department of Economics Discussion Paper Series. Auctioning the NFL Overtime Possession. Yeon-Koo Che Terry Hendershott

CS/Ec101b (2005) Game Theory Module

Taking Your Class for a Walk, Randomly

Extensive Games with Perfect Information

Designing Mechanisms for Reliable Internet-based Computing

Table 7: Battle of the Sexes Woman P rize F ight Ballet P rize F ight 2,1 0, 0 Man Ballet 0, 0 1,2. Payoffs to: (Man, Woman).

A Game Theoretic Study of Attack and Defense in Cyber-Physical Systems

Lift generation: Some misconceptions and truths about Lift

Wing-Body Combinations

Lesson 14: Games of Chance and Expected Value

Dynamic Matching Pennies with Asymmetries: An Application to NFL Play-Calling

Lift for a Finite Wing. all real wings are finite in span (airfoils are considered as infinite in the span)

The Tennis Player s Posture The Value of Balance

Lesson 3 Part 1 of 2. Demonstrating and Describing the Forehand Drive Components. Purpose: National Tennis Academy

GameTheoryNotesDay2.notebook March 06, Equilibrium Points (or Saddle Points) in Total conflict games (no collaborating)

University of Notre Dame Department of Finance Economics of the Firm Spring 2012

A Characterization of Equilibrium Set of Persuasion Games with Binary Actions

Extra: What is the minimum (fewest) number of train cars that it would take to hold all those passengers at once, if each car holds 12 passengers?

Fantasy Golf Made Easy

Opleiding Informatica

ECO 199 GAMES OF STRATEGY Spring Term 2004 Precept Materials for Week 3 February 16, 17

Quality Planning for Software Development

Sport statistics: how to assemble your team elivian.nl high quality, poorly written update history current(v2.1):7-oct-16, original(v1):13-sep-14

Primary Objectives. Content Standards (CCSS) Mathematical Practices (CCMP) Materials

LOW PRESSURE EFFUSION OF GASES adapted by Luke Hanley and Mike Trenary

July 2011, Number 62 ALL-WAYS TM NEWSLETTER

A Hare-Lynx Simulation Model

BASIC FUTSAL COACHING PREPARATION

Can Roger do, what Roger can? Optimal serving in tennis

LOW PRESSURE EFFUSION OF GASES revised by Igor Bolotin 03/05/12

Flaws of Averages. BLOSSOMS Module. Supplement to Flaw of Averages #3: The average depends on your perspective

Equation 1: F spring = kx. Where F is the force of the spring, k is the spring constant and x is the displacement of the spring. Equation 2: F = mg

AP Physics 1 Summer Packet Review of Trigonometry used in Physics

ABOUT THE REPORT. This is a sample report. Report should be accurate but is not

Professionals Play Minimax: Appendix

Ranger Walking Initiation Stephanie Schneider 5/15/2012 Final Report for Cornell Ranger Research

Chance and Uncertainty: Probability Theory

The final set in a tennis match: four years at Wimbledon 1

Chapter 12 Practice Test

Perfects Shooting Drill

Guidelines for Applying Multilevel Modeling to the NSCAW Data

Predicting Tennis Match Outcomes Through Classification Shuyang Fang CS074 - Dartmouth College

Training &Development Program

The Handy Book of. Sigrid Schöpe HORSE TRICKS. Easy Training Methods for Great Results. Includes. of the World s Most Popular Tricks!

ISOLATION OF NON-HYDROSTATIC REGIONS WITHIN A BASIN

A fast and complete convection scheme for ocean models. Stefan Rahmstorf. Institut für Meereskunde Kiel Germany. published in. Ocean Modelling Nr.

Teaching styles within handball training Martin Tůma, EHF lecturer

a) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5

FUTSAL 2018: GENERAL INFORMATION

When Betting Odds and Credences Come Apart: More Worries for Dutch Book Arguments

Clutch Hitters Revisited Pete Palmer and Dick Cramer National SABR Convention June 30, 2008

Support Vector Machines: Optimization of Decision Making. Christopher Katinas March 10, 2016

Football is one of the biggest betting scenes in the world. This guide will teach you the basics of betting on football.

Lab 2: Probability. Hot Hands. Template for lab report. Saving your code

Physics 2048 Test 2 Dr. Jeff Saul Spring 2001

THE CHAOS THEORY ROULETTE SYSTEM

[Check Out A Short Video Of The Ratings Software >Here<]

Fairbanks outdoorsman shares his bear safety expertise

Chapter 13 Temperature, Kinetic Theory, and the Gas Laws 497

Lecture 1: Knot Theory

Russell Hunter Roulette Strike Strategy Video Guide Russell Hunter Publishing, Inc.

U6 Division Game Rules

SITUATION TRAINING: Build Baseline Tactics

ASSISTANT REFEREE TRAINING 2015

Statistics for Process Validation Training Master Handbook

Physics 2048 Test 1 Name: Dr. Jeff Saul

INTRODUCTION AND METHODS OF CMAP SOFTWARE BY IPC-ENG

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Wenbing Zhao. Department of Electrical and Computer Engineering

Averages. October 19, Discussion item: When we talk about an average, what exactly do we mean? When are they useful?

NSAA: No Student Access Allowed

Outline. Terminology. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Steps in Capacity Planning and Management

The Performance-Enhancing Drug Game. by Kjetil K. Haugen. Molde University College Servicebox 8, N-6405 Molde, Norway

Fail Operational Controls for an Independent Metering Valve

Examples of Carter Corrected DBDB-V Applied to Acoustic Propagation Modeling

Politecnico di Torino. Porto Institutional Repository

OpenStax-CNX module: m Pascal's Principle. OpenStax. By the end of this section, you will be able to:

PHYS Tutorial 7: Random Walks & Monte Carlo Integration

Inside Baseball Take cues from successful baseball strategies to improve your game in business. By Bernard G. Bena

Regression to the Mean at The Masters Golf Tournament A comparative analysis of regression to the mean on the PGA tour and at the Masters Tournament

Guided Study Program in System Dynamics System Dynamics in Education Project System Dynamics Group MIT Sloan School of Management 1

VOLLEYBALL PLAYBOOK: VOLLEYBALL. Playbook: 4-2

Use of Instrument at Different Altitudes

Prologue. TSUNAMI - To Survive from Tsunami World Scientific Publishing Co. Pte. Ltd.

Basics: KEYS TO COACHING LACROSSE. 1. Players at every level must play every day with both hands.

Handout 24: Game Day 3 Auction Day Analysis

Case Study: How Misinterpreting Probabilities Can Cost You the Game. Kurt W. Rotthoff Seton Hall University Stillman School of Business

Cowboy Fast Draw Scoring Posse (Large Match) Standard Operating Procedures (SOPs), As of

Operational Risk Management: Preventive vs. Corrective Control

Building an NFL performance metric

Section 1: Multiple Choice

General Accreditation Guidance. User checks and maintenance of laboratory balances

Individual Behavior and Beliefs in Parimutuel Betting Markets

AP Lab 11.3 Archimedes Principle

IN THE MATTER OF appeals heard on September 2, 1997, under section 67 of the Customs Act, R.S.C. 1985, c. 1 (2nd Supp.);

Transcription:

Mohammad Hossein Manshaei manshaei@gmail.com 1393

2

We re going to look at a tennis game Assume two players (Federer and Nadal) Where Nadal is at the net Federer RIGHT Nadal right Viewpoint LEFT le- 3

l Nadal r Federer L R 50,50 80,20 90,10 20,80 q (1- q) p (1- p) Have a look at the payoffs E.g.: if Federer chooses L and Nadal guesses wrong and jumps to the r, Federer wins the point 80% of the time Is there any dominated strategy? Is there a pure strategy NE profile? 4

Let s find the mixed strategy NE Lesson 1: Each player s randomization is the best response to the other player s randomization Lesson 2: If players are playing a mixed strategy as part of a NE, then each of the pure strategies involved in the mix must itself be a best response 5

Find a mixture for Nadal and one for Federer that are in equilibrium TRICK: To find Nadal s mix (q) I m going to put myself in Federer s shoes and look at his payoffs And vice-versa for Federer s mix (p) 6

Federer s expected payoffs: E" # U ( Federer L, ( q,1 q) ) $ % = 50q +80(1 q) E" # U Federer ( R, ( q,1 q) ) $ % = 90q + 20(1 q) If Federer is mixing in this NE then the payoff to the left and to the right must be equal, they must both be best responses Otherwise Federer would not be mixing 7

Federer s expected payoffs must be equal: E" # U Federer E" # U Federer ( L, ( q,1 q) ) ( R, ( q,1 q) ) $ % = 50q +80(1 q) $ % = 90q + 20(1 q) 50q +80(1 q) = 90q + 20(1 q) 40q = 60(1 q) q = 0.6 I was able to derive Nadal s mixing probability This is the solution to the equation in one unknown that equates Federer s payoffs in the mix 8

Nadal s expected payoffs: E" # U Nadal E" # U Nadal (( p,1 p),l) $ % (( p,1 p), r) $ % = 50 p +10(1 p) = 20 p +80(1 p) 50p +10(1 p) = 20 p +80(1 p) 30p = 70(1 p) p = 0.7 Similarly, we computed Federer s mixing probability L R l 50,50 80,20 90,10 20,80 q r (1- q) p (1- p) 9

We found the mixed strategy NE: Federer Nadal è [(0.7, 0.3), (0.6, 0.4)] L R l r What would happen if Nadal jumped to the left more often than 0.6? Federer would be better of playing the pure strategy R! What if he jumped less often than 0.6? Federer would be shooting to the L all time! 10

l Nadal r Federer L R 30,70 80,20 90,10 20,80 p (1- p) q (1- q) Suppose a new coach teaches Nadal how to forehand, and the payoff would change accordingly There is still no pure strategy NE What would happen in this game? 11

Let s first let our intuition work Basically Nadal is better at his forehand and when Federer shoots there, Nadal scores more often than before è Direct effect: Nadal should increase his q But, Federer knows Nadal is better at his forehand, hence he will shoot there less often è Indirect effect: Nadal should decrease his q 12

Let s compute again q: E" # U Federer E" # U Federer ( L, ( q,1 q) ) ( R, ( q,1 q) ) $ % = 30q +80(1 q) $ % = 90q + 20(1 q) 30q +80(1 q) = 90q + 20(1 q) 60q = 60(1 q) q = 0.5 We see that in the end Nadal s q went down from 0.6 to 0.5!! The indirect effect was predominant 13

Nadal s expected payoffs: E" # U (( Nadal p,1 p),l) $ % = 70p +10(1 p) E" # U Nadal (( p,1 p), r) $ % = 20p +80(1 p) 50 p +10(1 p) = 20p +80(1 p) 50 p = 70(1 p) p = 7 /12 = 0.5833 < 0.7 The direct effect was predominant Federer will be shooting to the left with less probability 14

We just performed a comparative statistics exercise We looked at a game and found an equilibrium, then we perturbed the original game and found another equilibrium and compared the two NE Suppose Nadal s q had not changed Federer would have never shot to the left But this couldn t be a mixed strategy NE There was a force to put back things at equilibrium and that was the force that pulled down Nadal s q 15

Every Finite Game has a Mixed Strategy Nash Equilibrium Why is this important? Without knowing the existence of an equilibrium, it is difficult (perhaps meaningless) to try to understand its properves. Armed with this theorem, we also know that every finite game has an equilibrium, and thus we can simply try to locate the equilibria. 16

17

l Nadal r Federer L R 50,50 80,20 90,10 20,80 p (1- p) q (1- q) We identified the mixed strategy NE for this game Federer Nadal è [(0.7, 0.3), (0.6, 0.4)] L R l r p* (1-p*) q* (1-q*) 18

How do we actually check that this is indeed an equilibrium? Let s verify that in fact p* is BR(q*) Federer s payoffs: Pure strategy L è 50*0.6 + 80*0.4 = 62 Pure strategy R è 90*0.6 + 20*0.4 = 62 Mix p* è 0.7*62 + 0.3*62 = 62 Nadal s payoffs: Pure Strategy l è 50*0.7 + 10*0.3 = 38 Pure Strategy r è 20*0.7 + 80*0.3 = 38 Mix q* è 0.6*38 + 0.4*38 = 38 Federer has no strictly profitable pure-strategy deviation Note (again): You cannot always win by playing NE 19

But is this enough? There are no pure-strategy deviations, but could there be any other mixes? Any mixed strategy yields a payoff that is a weighted average of the pure strategy payoffs This already tells us: if you didn t find any purestrategy deviations then you ll not find any other mixes that will be profitable To check if a mixed strategy is a NE we only have to check if there are any pure-strategy profitable deviations 20

Since we re in a mixed strategy equilibrium, it must be the case that the payoffs are equal Indeed, if it was not the case, then you shouldn t be randomizing!! 21

After the security problems in the U.S. and worldwide airports due to high risks of attacks, the need for devices capable of inspecting luggage has raised considerably The problem is that there are not enough of such machines Wrong statements have been promoted by local governments: If we put a check device in NY then all attacks will be shifted to Boston, but if we put a check device in Boston, the attacks will be shifted to yet another city è The claim was that whatever the security countermeasure, it would only shift the problem 22

What if you wouldn t notify where you would actually put the check devices, which boils down to randomizing? The hard thing to do in practice is how to mimic randomization!! 23

24

Player 2 M N Player 1 M N 2,1 0,0 0,0 1,2 p (1- p) q 1- q We already know a lot about this game There are two pure-strategy NE: (M,M) and (N,N) We know that there is a problem of coordination We know that without communication, it is possible (and quite probable) that the two players might fail to coordinate 25

Player 1 perspective, find NE q: E" # U ( 1 M, ( q,1 q) ) $ % = 2q + 0(1 q) & ( ' 2q = (1 q) q = 1 E" # U ( 1 N, ( q,1 q) ) $ % = 0q +1(1 q) 3 )( Player 2 perspective, find NE p: E" # U (( 2 p,1 p), M ) $ % =1p + 0(1 p) & ( ' 1p = 2(1 p) p = 2 E" # U (( 2 p,1 p), N) $ % = 0p + 2(1 p) 3 )( 26

Let s check that p=2/3 is indeed a BR for Player1: '!! 1 E U 1 M, # 3, 2 $ $ * ) # &&, = 2 1 ( " " 3% % + 3 + 0 2 - / 3/ '!! 1 E U 1 N, # 3, 2 $ $ * ) # &&, = 0 1. ( " " 3% % + 3 +12 / 3 / 0 E [ U ] 1 = 2 3 2 3 + 1 3 2 3 = 2 3 = 2 3 27

We just found out that there is no strictly profitable pure-strategy deviation è There is no strictly profitable mixed-strategy deviation The mixed strategy NE is: Player 1 Player 2 2 3 1 1,,, 3 3 2 3 p 1- p q 1- q 28

What are the payoffs to players when they play such a mixed strategy NE? u,u =! 2 1 2 3, 2 $ # 3 & " % Why are the payoffs so low? What is the probability for the two players not to meet? è Prob(meet) = 2/3*1/3+1/3*2/3=4/9 è 1- Prob(meet) = 5/9!!! 29

This results seems to confirm our intuition that magically achieving the pure-strategy NE would be not always possible So the real question is: why are those players randomizing in such a way that it is not profitable? 30

Rather than thinking of players actually randomizing over their strategies, we can think of them holding beliefs of what the other players would play What we ve done so far is to find those beliefs that make players indifferent over what they play since they re going to obtain the same payoffs 31

Time-division channel Reward for successful transmission: 1 Cost of transmission: c (0 < c << 1) Green Blue Quiet Transmit Quiet Transmit (0, 0) (0, 1-c) (1-c, 0) (-c, -c) There is no strictly dominating strategy There are two Nash equilibria

p: probability of transmit for Blue q: probability of transmit for Green ublue = p(1 q)(1 c) pqc= p(1 c q) u = q(1 c p) q green 1 BR G (p) objectives Blue: choose p to maximize u blue Green: choose q to maximize u green 1- c BR B (q) p * =1 c, q * =1 c is a Nash equilibrium 33 0 1- c 1 p

802.11 MAC Layer 34

N links with the same physical condition (single-collision domain): 4 3 2 AP 1 N-2 N-1 N MAC Layer π = Probability of Transmission PHY Layer P = Probability of Collision = More than one transmission at the same time = 1 (1- π) N-1 35

Sending unicast packets station has to wait for DIFS before sending data receiver acknowledges at once (after waiting for SIFS) if the packet was received correctly (CRC) automatic retransmission of data packets in case of transmission errors sender receiver other stations DIFS data SIFS waiting time ACK DIFS Contention window data t The ACK is sent right at the end of SIFS (no contention) 36

37

Basically it is a system of two nonlinear equations with two variables p and π: { A mixing over transmission strategy In fact π is the mixing probability of pure strategy of transmission for each mobile user Try to find all NE of a game between N mobile user? 38