Lecture 11 and 12: Probabilistic Ranking

Similar documents
Introduction to Ranking

QATAR EXXONMOBIL OPEN: DAY 3 MEDIA NOTES Wednesday, January 6, 2016

AEGON CHAMPIONSHIPS: DAY 6 MEDIA NOTES Saturday, June 20, 2015

AEGON CHAMPIONSHIPS: DAY 2 MEDIA NOTES Tuesday, June 16, 2015

ROLEX PARIS MASTERS: DAY 3 MEDIA NOTES NADAL, FEDERER TOGETHER AGAIN; RACE TO LONDON HEATS UP

ALL-SPANISH QUARTER-FINAL CLASH HEADLINES BASTAD ACTION

by Court Day seed first time. In the last meeting between Thiem. Simon) and compiling an 18-5 in his Umag finals Croat in the main player in the Top

bet-at-home OPEN: DAY 2 MEDIA NOTES Tuesday, July 28, 2015

MEN S SINGLES QUARTER-FINALS (TOP HALF)

NADAL SEEKS RECORD-TYING 30TH MASTERS 1000 TITLE

MEN S SINGLES QUARTER-FINALS (BOTTOM HALF)

FEATURED MEN S SINGLES CHAMPIONSHIP [1] Rafael Nadal (ESP) vs [28] Kevin Anderson (RSA) Nadal Leads 4-0

ABIERTO MEXICANO LOS CABOS TALKING POINTS

BRISBANE INTERNATIONAL: DAY 8 MEDIA NOTES Sunday, 7 January 2018 AUSTRALIAN NO. 1 KYRGIOS TO BATTLE HARRISON IN BRISBANE FINAL

ERSTE BANK OPEN 500: DAY 7 MEDIA NOTES TSONGA-POUILLE SQUARE OFF IN ALL-FRENCH CHAMPIONSHIP

AEGON CHAMPIONSHIPS: PREVIEW & DAY 1 MEDIA NOTES Monday, June 15, 2015

FEATURED MEN S SINGLES QUARTER-FINALS

HAASE-BAUTISTA AGUT, FERRER-DEL POTRO MEET IN SEMI-FINALS

FIRST-TIME MASTERS 1000 FINALISTS SEEK CINCY TITLE

RAMOS-VINOLAS, VERDASCO HEADLINE BASTAD SECOND-ROUND PLAY

AEGON INTERNATIONAL: DAY 6 MEDIA NOTES Saturday, July 1, 2017 MONFILS LOOKS TO END LONG LOSING STREAK VS. DJOKOVIC

FIRST TIME QUALIFIERS GOFFIN, DIMITROV BATTLE IN CHAMPIONSHIP

FOUR OF TOP SIX SEEDS, LED BY THIEM, ANDERSON IN QUARTERS

ROLEX SHANGHAI MASTERS: DAY 5 MEDIA NOTES REMATCHES AWAIT TOP 4 SEEDS IN SHANGHAI THIRD ROUND

BAUTISTA AGUT, DEL POTRO BATTLE FOR SECOND AUCKLAND TITLE

QATAR EXXONMOBIL OPEN: DAY 6 MEDIA NOTES Saturday, 6 January 2018 MONFILS SEEKS ELUSIVE TITLE, HISTORY ON LINE FOR RUBLEV

INTERNAZIONALI BNL D ITALIA: DAY 7 MEDIA NOTES NADAL AND DJOKOVIC RENEW RECORD RIVALRY IN ROME SEMIS

MEN S SINGLES 4R MATCHES (BOTTOM HALF)

SWISS INDOORS BASEL: DAY 7 MEDIA NOTES FEDERER AIMS FOR 9TH BASEL TITLE, FACES QUALIFIER COPIL IN FINAL

MUTUA MADRID OPEN: DAY 6 MEDIA NOTES NADAL, ZVEREV, EDMUND FACE FAMILIAR FOES IN QUARTER-FINALS

SWISS INDOORS BASEL: DAY 7 MEDIA NOTES FEDERER, DEL POTRO MEET IN 3RD BASEL FINAL, 4TH TIME IN 2017

FEATURED MEN S SINGLES 4R MATCHES

ROLEX PARIS MASTERS: DAY 2 MEDIA NOTES DJOKOVIC COMES FULL CIRCLE IN QUEST FOR WORLD NO. 1

NADAL TO MEET KYRGIOS MEET IN CHINA OPEN CHAMPIONSHIP

FEVER-TREE CHAMPIONSHIPS: DAY 7 MEDIA NOTES TOP SEED CILIC MEETS RESURGENT DJOKOVIC IN LONDON FINAL

Kastle vs. Kastle BRYAN BROTHERS. vs. vs.

NISHIKORI AIMS FOR 3RD TOKYO TITLE AGAINST QUALIFIER MEDVEDEV

UNOFFICIAL WORLD JUNIOR CHAMPION AGE 12

RIO OPEN PRESENTED BY CLARO: PREVIEW & DAY 1 MEDIA NOTES Monday, February 15, 2016

WAWRINKA, M. ZVEREV BATTLE FOR CHAMPIONSHIP

MUTUA MADRID OPEN: DAY 5 MEDIA NOTES NADAL NEARS 34-YEAR-OLD RECORD, CANADIANS CLASH ON DAY 5

BB&T ATLANTA OPEN: DAY 4 MEDIA NOTES

ROLEX SHANGHAI MASTERS: DAY 8 MEDIA NOTES DJOKOVIC MEETS FIRST-TIME MASTERS 1000 FINALIST CORIC FOR TITLE

ARGENTINA OPEN: DAY 7 MEDIA NOTES

MIAMI OPEN PRESENTED BY ITAU: DAY 12 MEDIA NOTES ZVEREV, ISNER EACH SEEK FIRST TITLE OF 2018 IN MIAMI SHOWDOWN

Player Bio Personal Year By Year

RAKUTEN JAPAN OPEN TENNIS CHAMPIONSHIPS MANNARINO LOOKS FOR FIRST ATP TITLE, GOFFIN SECOND IN A ROW

RIO OPEN PRESENTED BY CLARO: DAY 7 MEDIA NOTES SCHWARTZMAN AND VERDASCO VIE FOR GLORY IN RIO OPEN FINAL

BNP PARIBAS OPEN: 16 MARCH MEDIA NOTES SEMI-FEDAL: FEDERER VS. NADAL XXXIX FOLLOWS RAONIC VS. THIEM

INTERNAZIONALI BNL D ITALIA: DAY 3 MEDIA NOTES SHAPOVALOV SEEKS ANOTHER MAJOR MILESTONE IN ROME DEBUT

MERCEDESCUP: DAY 7 MEDIA NOTES Sunday, June 18, 2017 EMIRATES ATP RANKING POINTS AND PRIZE MONEY

WINSTON-SALEM OPEN: DAY 2 MEDIA NOTES Monday, August 24, 2015

@matt_ebden

10 THINGS TO WATCH IN TOKYO

MOSELLE OPEN: DAY 7 MEDIA NOTES Sunday, September 24, 2017

ROLEX PARIS MASTERS: PREVIEW & DAY 1 MEDIA NOTES

MOSELLE OPEN: DAY 5 MEDIA NOTES Friday, September 22, 2017 SEEDS GOFFIN, M. ZVEREV, PAIRE HEADLINE QUARTER-FINALS

MUTUA MADRID OPEN: DAY 8 MEDIA NOTES TOP 10 TUSSLE IN MADRID AS ZVEREV AND THIEM MEET FOR TITLE

APIA INTERNATIONAL SYDNEY: DAY 7 MEDIA NOTES Saturday, January 14, 2017 EVANS, MULLER MEET WITH FIRST ATP WORLD TOUR TITLE AT STAKE

THE BOXES UNDER THE HIGH PATRONAGE OF H.S.H. THE SOVEREIGN PRINCE OF MONACO

SHENZHEN OPEN: DAY 7 MEDIA NOTES Sunday, October 1, 2017 DOLGOPOLOV, GOFFIN TO SQUARE OFF IN CHAMPIONSHIP MATCH

SHANGHAI ROLEX MASTERS: DAY 8 MEDIA NOTES Sunday, October 15, 2017

INTERNAZIONALI BNL D ITALIA TALKING POINTS

BIG SERVERS KARLOVIC, MULLER BATTLE IN CHAMPIONSHIP

INTRUM STOCKHOLM OPEN: DAY 7 MEDIA NOTES Sunday, October 22, 2017 FORMER CHAMPIONS DIMITROV, DEL POTRO MEET IN TITLE MATCH

NEWS. Every singles winner of the

Tennis Coach Roger Federer Us Open 2012 Schedule

ST. PETERSBURG OPEN: DAY 7 MEDIA NOTES Sunday, September 24, 2017

FEATURED MEN S SINGLES 1R MATCHES

ACTIVE PLAYERS FORMER CHAMPIONS OPEN ERA WINNERS. ATPWorldTour.com

OPEN PARC AUVERGNE-RHONE-ALPES LYON: 26 MAY MEDIA NOTES

FEATURED MEN S SINGLES 1R MATCHES In Order of Play By Court

ROLEX MONTE-CARLO MASTERS: DAY 8 MEDIA NOTES

OPEN PARC AUVERGNE-RHONE-ALPES LYON: 25 MAY MEDIA NOTES. [1] Dominic Thiem (AUT) vs. Guillermo Garcia-Lopez (ESP) Thiem Leads 1-0

SYDNEY INTERNATIONAL: DAY 4 MEDIA NOTES Wednesday, 10 January 2018 DE MINAUR, MILLMAN FACE TOUGH TASKS ON DAY 4 IN SYDNEY

The Monte-Carlo Country Club in 1928

BNP PARIBAS OPEN: 14 MARCH MEDIA NOTES 19-YEAR-OLD LUCKY LOSER MAKING NAME FOR HIMSELF IN INDIAN WELLS

MUTUA MADRID OPEN: DAY 3 MEDIA NOTES VERDASCO, FOGNINI, HAASE SEEK MILESTONE WINS IN MADRID

RIO OPEN PRESENTED BY CLARO: DAY 4 MEDIA NOTES TOP 3 SEEDS ALL IN SECOND-ROUND ACTION ON RIO DAY 4

ALBERT Albert

FEATURED MEN S SINGLES 1R MATCHES

TOP 10 STARS ANDERSON AND DEL POTRO MEET FOR ACAPULCO GLORY

TATA OPEN MAHARASHTRA: DAY 6 MEDIA NOTES Saturday, 6 January 2018 SIMON, ANDERSON ATTEMPT TO WIN FIRST ATP TITLE SINCE 2015

TOP TWO SEEDS FEDERER, DIMITROV SQUARE OFF IN FINAL

MUTUA MADRID OPEN: DAY 2 MEDIA NOTES Monday, 7 May 2018 Caja Magica Madrid, Spain 6-13 May 2018 Draw: S-56, D-24 Prize Money: 6,200,860 Surface: Clay

Apr 8 Houston Fayez Sarofim & Co. U.S. Men s Clay Court Championship CL 28 $583,585 $652,245

GERRY WEBER OPEN: DAY 7 MEDIA NOTES NO. 1 FEDERER AND RISING CORIC TO FACE OFF FOR HALLE CROWN

FEDEX ATP HEAD2HEADS: TUESDAY FIRST-ROUND SINGLES MATCHES

SYDNEY INTERNATIONAL: DAY 3 MEDIA NOTES Tuesday, 9 January 2018 SYDNEY NATIVES DE MINAUR, VUKIC OPEN CAMPAIGNS ON DAY 3

MILLENNIUM ESTORIL OPEN: 5 MAY MEDIA NOTES

ATP WORLD TOUR AWARDS ANNOUNCED AT NITTO ATP FINALS OFFICIAL LAUNCH PRESENTED BY MOËT & CHANDON AT TOWER OF LONDON

FIVE TOP 10 PLAYERS AND DEFENDING CHAMPION IN ACTION ON DAY 2

PLAYER BIOGRAPHIES ACTIVE PLAYERS FORMER CHAMPIONS OPEN ERA WINNERS * TOP 100 SINGLES TOP 8 DOUBLES TEAMS TOP 200 #NEXTGENATP

ARGENTINA OPEN: DAY 3 MEDIA NOTES FORMER CHAMP THIEM OPENS CAMPAIGN AS TOP SEED

MILLENNIUM ESTORIL OPEN: 6 MAY MEDIA NOTES

THIEM, CECCHINATO AIM TO KEEP EXCELLING ON CLAY IN HAMBURG

OPEN 13 PROVENCE: 19 FEBRUARY MEDIA NOTES

@dutzee

FEATURED MEN S SINGLES 3R MATCHES

India vs Australia, 5 ODIs, Two T20Is Another chance for both teams to give their players ODI exposure for the WC.

Transcription:

Lecture 11 and 12: Probabilistic Ranking 4F13: Machine Learning Joaquin Quiñonero-Candela and Carl Edward Rasmussen Department of Engineering University of Cambridge http://mlg.eng.cam.ac.uk/teaching/4f13/ Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 1 / 13

Motivation for ranking Competition is central to our lives. It is an innate biological trait, and the driving principle of many sports. In the Ancient Olympics (700 BC), the winners of the events were admired and immortalised in poems and statues. Today in pretty much every sport there are player or team rankings. (Football leagues, Poker tournament rankings, etc). We are going to focus on one example: tennis players, in men singles games. We are going to keep in mind the goal of answering the following question: What is the probability that player 1 defeats player 2? Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 2 / 13

The ATP ranking system for tennis players Men Singles ranking as of 28 December 2011: Rank, Name & Nationality Points Tournaments Played 1 Djokovic, Novak (SRB) 13,675 19 2 Nadal, Rafael (ESP) 9,575 20 3 Federer, Roger (SUI) 8,170 19 4 Murray, Andy (GBR) 7,380 19 5 Ferrer, David (ESP) 4,880 23 6 Tsonga, Jo-Wilfried (FRA) 4,335 25 7 Berdych, Tomas (CZE) 3,700 24 8 Fish, Mardy (USA) 2,965 24 9 Tipsarevic, Janko (SRB) 2,595 28 10 Almagro, Nicolas (ESP) 2,380 27 11 Del Potro, Juan Martin (ARG) 2,315 22 12 Simon, Gilles (FRA) 2,165 28 13 Soderling, Robin (SWE) 2,120 22 14 Roddick, Andy (USA) 1,940 20 15 Monfils, Gael (FRA) 1,935 23 16 Dolgopolov, Alexandr (UKR) 1,925 30 17 Wawrinka, Stanislas (SUI) 1,820 23 18 Isner, John (USA) 1,800 25 19 Gasquet, Richard (FRA) 1,765 21 20 Lopez, Feliciano (ESP) 1,755 28 ATP: Association of Tennis Professionals (www.atpworldtour.com) Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 3 / 13

The ATP ranking system explained (to some degree) Sum of points from best 18 results of the past 52 weeks. Mandatory events: 4 Grand Slams, and 8 Masters 1000 Series events. Best 6 results from International Events (4 of these must be 500 events). Points breakdown for all tournament categories (2012): W F SF QF R16 R32 R64 R128 Q Grand Slams 2000 1200 720 360 180 90 45 10 25 Barclays ATP World Tour Finals *1500 ATP World Tour Masters 1000 1000 600 360 180 90 45 10(25) (10) (1)25 ATP 500 500 300 180 90 45 (20) (2)20 ATP 250 250 150 90 45 20 (5) (3)12 Challenger 125,000 +H 125 75 45 25 10 5 Challenger 125,000 110 65 40 20 9 5 Challenger 100,000 100 60 35 18 8 5 Challenger 75,000 90 55 33 17 8 5 Challenger 50,000 80 48 29 15 7 3 Challenger 35,000 +H 80 48 29 15 6 3 Futures** 15,000 +H 35 20 10 4 1 Futures** 15,000 27 15 8 3 1 Futures** 10,000 18 10 6 2 1 The Grand Slams are the Australian Open, the French Open, Wimbledon, and the US Open. The Masters 1000 Tournaments are: Cincinnati, Indian Wells, Madrid, Miami, Monte-Carlo, Paris, Rome, Shanghai, and Toronto. The Masters 500 Tournaments are: Acapulco, Barcelona, Basel, Beijing, Dubai, Hamburg, Memphis, Rotterdam, Tokyo, Valencia and Washington. The Masters 250 Tournaments are: Atlanta, Auckland, Bangkok, Bastad, Belgrade, Brisbane, Bucharest, Buenos Aires, Casablanca, Chennai, Delray Beach, Doha, Eastbourne, Estoril, Gstaad, Halle, Houston, Kitzbuhel, Kuala Lumpur, London, Los Angeles, Marseille, Metz, Montpellier, Moscow, Munich, Newport, Nice, Sao Paulo, San Jose, s-hertogenbosch, St. Petersburg, Stockholm, Stuttgart, Sydney, Umag, Vienna, Vina del Mar, Winston-Salem, Zagreb and Dusseldorf. Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 4 / 13

A laundry list of objections and open questions Some questions: Rank, Name & Nationality Points 1 Djokovic, Novak (SRB) 13,675 2 Nadal, Rafael (ESP) 9,575 3 Federer, Roger (SUI) 8,170 4 Murray, Andy (GBR) 7,380 Is a player ranked higher than another more likely to win? What is the probability that Murray defeats Djokovic? How much would you (rationally) bet on Murray? And some concerns: The points system ignores who you played against. 6 out of the 18 tournaments don t need to be common to two players. Other examples: Premier League. Meaningless intermediate results throughout the season: doesn t say whom you played and whom you didn t! Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 5 / 13

Towards a probabilistic ranking system What we really want is to infer is a player s skill. Skills must be comparable: a player of higher skill is more likely to win. We want to do probabilistic inference of players skills. We want to be able to compute the probability of a game outcome. A generative model for game outcomes: 1 Take two tennis players with known skills (w i R) Player 1 with skill w 1. Player 2 with skill w 2. 2 Compute the difference between the skills of Player 1 and Player 2: s = w 1 w 2 3 Add noise (n N(0, 1)) to account for performance inconsistency: t = s + n 4 The game outcome is given by y = sign(t) y = +1 means Player 1 wins. y = 1 means Player 2 wins. Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 6 / 13

TrueSkill, a probabilistic skill rating system w 1 and w 2 are the skills of Players 1 and 2. We treat them in a Bayesian way: prior p(w i ) = N(w i ; µ i, σ 2 i) s = w 1 w 2 is the skill difference. t N(t; s, 1) is the performance difference. The probability of outcome y given known skills is: p(y w 1, w 2 ) = p(y t)p(t s)p(s w 1, w 2 )dsdt likelihood The posterior over skills given the game outcome is: p(w 1, w 2 y) = p(w 1 )p(w 2 )p(y w 1, w 2 ) p(w1 )p(w 2 )p(y w 1, w 2 )dw 1 dw 2 TrueSkill : A Bayesian Skill Rating System. Herbrich, Minka and Graepel, NIPS19, 2007. Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 7 / 13

Φ(a) is the Gaussian cumulative distribution function, or probit function. Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 8 / 13 The likelihood in detail p(y w 1, w 2 ) = p(y t)p(t s)p(s w 1, w 2 )dsdt = p(y t)p(t w 1, w 2 )dt = = = y = = + + δ(y sign(t))n(t; w 1 w 2, 1)dt δ(1 sign(yt))n(yt; y(w 1 w 2 ), 1)dt +y y + + 0 δ(1 sign(z))n(z; y(w 1 w 2 ), 1)dz (use z yt) δ(1 sign(z))n(z; y(w 1 w 2 ), 1)dz N(z; y(w 1 w 2 ), 1)dz = = Φ(y(w 1 w 2 )) y(w1 w 2 ) N(x; 0, 1)dx (use x y(w 1 w 2 ) z) ( a where Φ(a) = N(x; 0, 1)dx )

The likelihood in a picture Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 9 / 13

An intractable posterior The joint posterior distribution over skills does not have a closed form: p(w 1, w 2 y) = N(w 1 ; µ 1, σ 2 1 )N(w 2; µ 2, σ 2 2 )Φ(y(w 1 w 2 )) N(w1 ; µ 1, σ 2 1 )N(w 2; µ 2, σ 2 2 )Φ(y(w 1 w 2 ))dw 1 dw 2 w 1 and w 2 become correlated, the posterior does not factorise. The posterior is no longer a Gaussian density function. The normalising constant of the posterior, the prior over y does have closed form: ( p(y) = N(w 1 ; µ 1, σ 2 1)N(w 2 ; µ 2, σ 2 y(µ1 µ 2 ) ) 2)Φ(y(w 1 w 2 ))dw 1 dw 2 = Φ 1 + σ 2 1 + σ2 2 This is a smoother version of the likelihood p(y w 1, w 2 ). Can you explain why? Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 10 / 13

Gibbs sampling for the TrueSkill model We have g = 1,..., N games where I g : id of Player 1 and J g : id of Player 2. The outcome of game g is y g = +1 if I g wins, y g = 1 if J g wins. Gibbs sampling alternates between sampling skills w = [w 1,..., w M ] conditional on fixed performance differences t = [t 1,..., t N ], and sampling t conditional on fixed w. 1 Initialise w, e.g. from the prior p(w). 2 Sample the performance differences from p(t g w Ig, w Jg, y g ) δ(y g sign(t g ))N(t g ; w Ig w Jg, 1) 3 Jointly sample the skills from 4 Go back to step 2. p(w t) }{{} N(w;µ,Σ) p(w) }{{} g=1 N(w;µ 0,Σ 0 ) N p(t g w Ig, w Jg ) }{{} N(w;µ g,σ g ) Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 11 / 13

Gaussian identities The conditional for the performance is both Gaussian in t g and proportional to a Gaussian in w p(t g w Ig, w Jg ) exp ( 1 2 (w I g w Jg t g ) 2) ( ( N 1 w Ig µ 1 ) [ 1 1 2 w Jg µ 2 1 1 with µ 1 µ 2 = t g. Notice that [ 1 1 1 1 ] ( µ 1 µ 2 ) = ( t g t g ) ] ( w Ig µ 1 ) ) w Jg µ 2 Remember that for products of Gaussians precisions add up, and means weighted by precisions (natural parameters) also add up: N(w; µ a, Σ a )N(w; µ b, Σ b ) = z c N ( w; Σ c (Σ 1 a µ a + Σ 1 µ b), Σ c = (Σ 1 b a + Σ 1 b ) 1) Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 12 / 13

Conditional posterior over skills given performances We can now compute the covariance and the mean of the conditional posterior. N Σ 1 = Σ 1 0 + Σ 1 g µ = Σ ( N Σ 1 0 µ ) 0 + Σ 1 g µ g g=1 } {{ } Σ 1 To compute the mean it is useful to note that: N ( µ i = t g δ(i Ig ) δ(i J g ) ) g=1 And for the covariance we note that: N [ Σ 1 ] ii = δ(i I g ) + δ(i J g ) g=1 [ Σ 1 ] i j = g=1 } {{ } µ N δ(i I g )δ(j J g ) + δ(i J g )δ(j I g ) g=1 Quiñonero-Candela & Rasmussen (CUED) Lecture 11 and 12: Probabilistic Ranking 13 / 13