Opening up the court (surface) in tennis grand slams

Opening up the court (surface) in tennis grand slams
Kayla Frisoli, Shannon Gallagher, and Amanda Luby
Department of Statistics & Data Science, Carnegie Mellon University
CMSAC, October 20, 2018

Tennis, anyone?
Roger Federer @ Wimbledon
GOAT? 20 grand slam titles
Grass extraordinaire (8 GS @ Wim.)

Tennis, anyone?
Rafael Nadal @ French Open
GOAT? 17 grand slam titles
King of Clay (11 GS @ FO)

Tennis, anyone?
Serena Williams @ US Open, @ Australian Open
GOAT? 23 grand slam titles
Jack of all trades (6 GS @ USO) (7 GS @ AO)

The grand slams are played on distinct surfaces, which may affect player performance.

Grand Slam        Surface                     Known top player
Australian Open   Plexicushion (hard court)   Serena Williams
French Open       clay                        Rafael Nadal
Wimbledon         grass                       Roger Federer
US Open           DecoTurf (hard court)       Serena Williams

Are these real or perceived effects?
Do these effects vary by player?

Two data sets, two perspectives
Data from Jeff Sackmann's website (https://github.com/jeffsackmann)
Accessed via the R `deuce` package (Kovalchik, S. 2017)

Grand Slam Results (GS)
  2013-2017
  One row = one match
  5,080 matches
  4 GS, 7 rounds each
  Match and game scores

Grand Slam Point by Point (PbP)
  2013-2017
  One row = one point
  720,465 points from 3,066 matches
  Missingness (not every match has point-level data)
  Additional variables: winners, aces, unforced errors (UEs), etc.
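The row counts on this slide can be checked with a little arithmetic. The sketch below is illustrative only: it assumes single-elimination singles draws for two leagues (ATP and WTA), and the record layout and the helper `to_player_rows` are hypothetical, not the `deuce` schema. It also shows how one match row becomes two player-level rows, which is where the n = 10,160 used in the models comes from.

```python
# Illustrative sketch: reproduces the row counts quoted on the slide.
ROUNDS = 7    # rounds per singles draw (from the slide)
SLAMS = 4     # four grand slams per year
LEAGUES = 2   # assumption: one ATP and one WTA singles draw per slam
YEARS = 5     # 2013-2017 inclusive

# A single-elimination draw with 7 rounds has 64 + 32 + ... + 1 = 127 matches.
matches_per_draw = 2 ** ROUNDS - 1
gs_matches = matches_per_draw * SLAMS * LEAGUES * YEARS


def to_player_rows(match):
    """Split one match row into two player-level rows (hypothetical layout)."""
    shared = {"tournament": match["tournament"]}
    winner = {"player": match["winner"], "opponent": match["loser"],
              "did_win": 1, **shared}
    loser = {"player": match["loser"], "opponent": match["winner"],
             "did_win": 0, **shared}
    return [winner, loser]


example = {"winner": "Player A", "loser": "Player B", "tournament": "Wimbledon"}
rows = to_player_rows(example)

print(gs_matches)        # 5080 matches in the GS data
print(2 * gs_matches)    # 10160 player-level rows
print(2 * 3066)          # 6132 player-level rows with point-by-point data
```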

Exploring tournament differences
Players perform differently at Wimbledon, as displayed by the clustering of purple points.

Spaniards outperform on clay
Spanish players win more at the French Open, even though their rankings are worse on average.

We build a series of models to assess the match effects of court surface and individual players.

Approach 1
  Data: GS data, n = 10,160
  Y: Did win? (Yes = 1, No = 0)
  Fixed X: league, country, year, late round, opponent rank, rank
  Random X: surface
  Regression: logistic
  Conclusions: no significant effects besides rank / opponent rank

Approach 2
  Data: GS data %>% join(pbp), n = 6,132 (aces), n = 6,132 (net), n = 6,132 (UE)
  Y: aces, points won at net, UE
  Fixed X: league, country, year, late round, opponent rank, rank
  Random X: surface
  Regression: linear
  Conclusions: significant surface effects for Williams, Federer, Nadal

Player effects vary by court surface
Williams and Federer have more aces in general, and the most on grass and hard court.
Federer makes far fewer unforced errors on grass, compared both to other players and to himself on other surfaces.

Approach 3
  Data: GS data %>% join(pbp) %>% filter(player == {Player}), n = 75 (Nadal), n = 83 (Federer), n = 59 (Williams)
  Y: % points won
  Fixed X: league, country, year, late round, opponent rank, rank, average service speed, winners, unforced errors, break points won, net points won, etc.
  Regression: linear
  Conclusions: significant effects vary by players of interest (Williams, Federer, Nadal)
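The `join` and `filter` steps in the Data column can be sketched as follows. The authors worked in R; this is a plain-Python illustration only, and every field name, record, and helper (`summarise_points`, `join_gs_pbp`, `filter_player`) is hypothetical. It assumes the point-level rows are first summarised per match and player, then inner-joined to the player-level match rows, so matches without point-by-point data drop out.

```python
from collections import defaultdict

# Hypothetical toy records; the field names are assumptions, not the deuce schema.
gs_rows = [
    {"match_id": "m1", "player": "Player A", "tournament": "Wimbledon", "did_win": 1},
    {"match_id": "m1", "player": "Player B", "tournament": "Wimbledon", "did_win": 0},
    {"match_id": "m2", "player": "Player A", "tournament": "US Open", "did_win": 1},
]
pbp_points = [  # one row = one point; match m2 has no point-level data
    {"match_id": "m1", "point_winner": "Player A", "ace": True},
    {"match_id": "m1", "point_winner": "Player A", "ace": False},
    {"match_id": "m1", "point_winner": "Player B", "ace": False},
    {"match_id": "m1", "point_winner": "Player A", "ace": False},
]


def summarise_points(points):
    """Aggregate point rows into per-(match, player) counts and % points won."""
    totals = defaultdict(int)
    stats = defaultdict(lambda: {"aces": 0, "points_won": 0})
    for p in points:
        totals[p["match_id"]] += 1
        key = (p["match_id"], p["point_winner"])
        stats[key]["points_won"] += 1
        if p["ace"]:  # an ace is always won by the server
            stats[key]["aces"] += 1
    for (match_id, _player), s in stats.items():
        s["pct_points_won"] = 100.0 * s["points_won"] / totals[match_id]
    return dict(stats)


def join_gs_pbp(gs, pbp_stats):
    """Inner join: keep only player-match rows that have point-level stats."""
    joined = []
    for row in gs:
        key = (row["match_id"], row["player"])
        if key in pbp_stats:
            joined.append({**row, **pbp_stats[key]})
    return joined


def filter_player(rows, player):
    """Keep the rows for one player of interest (Approach 3)."""
    return [r for r in rows if r["player"] == player]


joined = join_gs_pbp(gs_rows, summarise_points(pbp_points))
player_rows = filter_player(joined, "Player A")
print(len(joined), len(player_rows))
```

The inner join is what shrinks the sample from the GS rows to the rows with point-level data, and the per-player filter is why the individual models rest on fewer than 100 matches each.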

Federer, Nadal, and Williams: most available data and most detailed individual models

Federer
  Model finding: expected % points won at US Open is greater than at Wimbledon if W/UE (winners to unforced errors) is large
  Interpretation: on average, better at Wimbledon, but given peak performance, better at US Open

Nadal
  Model finding: expected % points won decreases as % of points won at net increases
  Interpretation: indicative of a change of strategy

Williams
  Model finding: expected % points won at French Open is greater than at Australian Open if % of aces increases by 1%
  Interpretation: serving well at French Open is more important than serving well at Australian Open
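One way to read the Federer finding is as an interaction between tournament and the W/UE ratio. The form below is an assumed illustration restricted to two tournaments; the slides do not give the fitted specification or any coefficients, so the symbols $U$, $R$, and $\beta_0, \dots, \beta_3$ are hypothetical.

```latex
% Y = % points won, U = 1 at the US Open and 0 at Wimbledon, R = W/UE
E[Y \mid U, R] = \beta_0 + \beta_1 U + \beta_2 R + \beta_3 (U \times R)

% Difference between the two tournaments at a given W/UE ratio
E[Y \mid U = 1, R] - E[Y \mid U = 0, R] = \beta_1 + \beta_3 R

% With \beta_1 < 0 and \beta_3 > 0 the US Open comes out ahead only when
R > -\frac{\beta_1}{\beta_3}
```

Under these assumed signs, Wimbledon is better at typical W/UE values while the US Open is better when W/UE is large, which is the pattern the slide describes.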

Conclusions
- Surface effects are not apparent until we utilize tennis-specific features (e.g. unforced errors, aces), and they vary across players
- With full, feature-rich player data, we can make more interesting conclusions for individual players (e.g. Williams, Federer, Nadal)
- Our data are only available for some matches; we need more detailed tennis data for modeling lower-tier players
- We are in talks with the Chief Technology Officer for the US Tennis Association

Game. Set. Match.
https://github.com/shannong19/courtsports
Kayla Frisoli, @stat_frizz, http://stat.cmu.edu/~kfrisoli
Shannon Gallagher, @shannonkgallagh, http://stat.cmu.edu/~sgallagh/
Amanda Luby, @amandaluby, http://stat.cmu.edu/~aluby/

Modeling win probability: only rank is significant
Outcome: wins
Predictors: ATP, IOC, late round, rank, opponent rank, court, year
$\operatorname{logit}(P(Y = 1 \mid X)) = \beta_1 X_{\text{fixed}} + \beta_2 X_{\text{random}}$
No significant player-level effects
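To make the logit link concrete, the sketch below converts a linear predictor into a win probability. It is not the fitted model: the coefficient values are invented for illustration, only rank and opponent rank are included, and the random court effect is reduced to an optional additive term (`surface_effect`, a hypothetical name) that defaults to zero.

```python
import math


def logit(p):
    """Log-odds of a probability p in (0, 1)."""
    return math.log(p / (1.0 - p))


def inv_logit(eta):
    """Inverse of logit: maps a linear predictor to a probability."""
    return 1.0 / (1.0 + math.exp(-eta))


# Hypothetical coefficients, chosen only so the example is easy to follow.
COEF = {"intercept": 0.0, "rank": -0.01, "opponent_rank": 0.01}


def win_probability(rank, opponent_rank, surface_effect=0.0):
    """P(Y = 1 | X) under the assumed coefficients.

    surface_effect stands in for the random court intercept; the slides
    report no significant effect, so it defaults to zero here.
    """
    eta = (COEF["intercept"]
           + COEF["rank"] * rank
           + COEF["opponent_rank"] * opponent_rank
           + surface_effect)
    return inv_logit(eta)


print(win_probability(rank=10, opponent_rank=10))   # evenly matched players
print(win_probability(rank=1, opponent_rank=100))   # strong favourite
```

A better (numerically smaller) rank than the opponent pushes the linear predictor above zero and the probability above one half, which matches the slide's conclusion that rank and opponent rank drive the match outcome.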

Does surface matter? For whom?
Do results differ across the three surface types (grass, clay, hard)? Yes.
How useful is including tennis-specific features (e.g. winners, aces, unforced errors)? Quite useful.
Are there player-level effects in performance on different surfaces? Only when looking at tennis-specific outcomes.

Modeling of Individuals: Details
Linear regression: $E[(\%\ \text{Points Won})_{\text{Player}}] = \beta X_{\text{Player}}$
Covariates (X) include opponent ranking, surface type, average service speed, winners, unforced errors, break points won, net points won, etc.
Models fit using forward-backward stepwise regression
Best model for each player chosen with AIC
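Stepwise selection by AIC can be sketched in a few lines. This is a minimal pure-Python illustration under stated assumptions, not the authors' code: it uses the Gaussian AIC up to an additive constant, n ln(RSS/n) + 2k, fits ordinary least squares through the normal equations, and runs on made-up data. The function and column names (`ols_rss`, `aic`, `stepwise`, `w_ue`, `opp_rank`) are hypothetical.

```python
import math


def ols_rss(X, y):
    """Residual sum of squares of a least-squares fit (X includes the intercept)."""
    k = len(X[0])
    # Normal equations (X'X) b = X'y, stored as an augmented matrix.
    M = [[sum(r[i] * r[j] for r in X) for j in range(k)]
         + [sum(r[i] * yi for r, yi in zip(X, y))] for i in range(k)]
    for col in range(k):  # Gaussian elimination with partial pivoting
        pivot = max(range(col, k), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, k):
            f = M[r][col] / M[col][col]
            for j in range(col, k + 1):
                M[r][j] -= f * M[col][j]
    beta = [0.0] * k
    for i in reversed(range(k)):  # back substitution
        beta[i] = (M[i][k] - sum(M[i][j] * beta[j] for j in range(i + 1, k))) / M[i][i]
    return sum((yi - sum(b * x for b, x in zip(beta, r))) ** 2 for r, yi in zip(X, y))


def aic(data, y, cols):
    """Gaussian AIC up to a constant: n * ln(RSS / n) + 2 * (number of coefficients)."""
    n = len(y)
    X = [[1.0] + [row[c] for c in cols] for row in data]
    rss = max(ols_rss(X, y), 1e-12)  # guard against log(0) on a perfect fit
    return n * math.log(rss / n) + 2 * (len(cols) + 1)


def stepwise(data, y, candidates):
    """Forward-backward search: add or drop one covariate while AIC improves."""
    selected, best = [], aic(data, y, [])
    while True:
        moves = [selected + [c] for c in candidates if c not in selected]
        moves += [[s for s in selected if s != d] for d in selected]
        if not moves:
            return selected, best
        score, move = min(((aic(data, y, m), m) for m in moves), key=lambda t: t[0])
        if score >= best - 1e-9:  # no move improves AIC: stop
            return selected, best
        selected, best = move, score


# Made-up data: % points won depends on W/UE, not on opponent rank.
w_ue = [0.8, 1.0, 1.2, 1.4, 1.6, 1.8, 2.0, 2.2, 2.4, 2.6]
opp_rank = [5, 80, 33, 12, 60, 7, 45, 90, 21, 3]
wobble = [0.5, -0.5, -0.5, 0.5, 0.5, -0.5, -0.5, 0.5, 0.5, -0.5]
data = [{"w_ue": w, "opp_rank": float(r)} for w, r in zip(w_ue, opp_rank)]
y = [45.0 + 6.0 * w + e for w, e in zip(w_ue, wobble)]

selected, best_aic = stepwise(data, y, ["w_ue", "opp_rank"])
print(selected, round(best_aic, 2))
```

With roughly 60 to 80 matches per player, the penalty term matters: each extra covariate has to reduce the residual variance enough to pay for its 2 AIC points.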

Logistic model (GS data): $\operatorname{logit}(P(Y = 1 \mid X)) = \beta_1 X_{\text{fixed}} + \beta_2 X_{\text{random}}$
  Y: Winner? (1 = yes, 0 = no)
  X_fixed: ATP, IOC, late round, *rank, *opponent rank, year
  X_random: court

Linear model: $E(Y \mid X) = \beta_1 X_{\text{fixed}} + \beta_2 X_{\text{random}}$
  Y: number of aces, number of net points won, or number of unforced errors
  X_fixed: ATP, IOC, late round, *rank, *opponent rank, year
  X_random: court

Model 3: $E(Y \mid X) = \beta_1 X_{\text{fixed}}$
  Y: % points won by Federer, % points won by Nadal, % points won by Williams
  X_fixed: opponent ranking, surface type, average service speed, winners, unforced errors, break points won, net points won, etc.

The grand slams are played on distinct surfaces, which may affect player performance.

Grand Slam        Surface
Australian Open   Plexicushion (hard court)
French Open       clay
Wimbledon         grass
US Open           DecoTurf (hard court)

Federer, Nadal, and Williams: most available data and most detailed individual models
Federer: higher E[points won] at Wimbledon compared to other slams on average; when W/UE is large, more E[points won] at US Open compared to Wimbledon
Nadal: E[points won] decreases as volley points won increase
Williams: E[points won] increases more with number of aces at French Open compared to Australian Open