Quantifying Foul Tilt of NBA Players

Katherine Evans, PhD
Verily Life Sciences
CMU Sports Analytics Conference 2018

Disclaimer: Opinions expressed are solely my own and do not express the views or opinions of my employer.

Introduction

What Do I Mean By Tilt?
Tilt is a term that originated in poker. It refers to a general state of being in which a person is confused or frustrated and then adopts a suboptimal strategy, typically over-aggression. Over-aggression then often leads to more suboptimal play, which leads to more frustration. How could we measure tilt?

Boogie Sure Does Get Upset Foul A Lot

Season  Games  Fouls
10-11   81     332
11-12   64     257
12-13   75     269
13-14   71     270
14-15   59     241
15-16   65     236
16-17   72     278
17-18   48     183

Do Fouls Beget Fouls?
Once a player commits a foul, is he more likely to commit another one? Moreover, is a player more likely to commit a 5th foul once he has 4 than he is to commit a 2nd foul once he has 1?

Modeling
We can consider a survival model and look at the failure time for each foul, i.e. the time it takes a player to commit his 1st foul, 2nd foul, etc. If, for example, the time between the 2nd and 3rd foul is significantly longer than the time between the 4th and 5th foul, we might have evidence of some sort of tilt. Time here means player time (time on the court), not game clock time. We can use a stratified Cox model to analyze the data. Also, we are going to ignore technical fouls for now; they are probably a bigger indicator of tilt, but way less frequent.
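As a rough sketch of the "failure time for each foul" idea, the snippet below turns one player's foul times in a single game into one interval per foul, with the last interval marked as censored if the game ended first. The function name, the row layout, and the sample numbers are illustrative assumptions, not the format used in the talk.

```python
def foul_intervals(foul_times, total_played, max_fouls=6):
    """Build (foul_number, start, stop, event) rows in player time (seconds).

    event is 1 if the foul was committed at `stop`, 0 if the interval was
    cut off (censored) because the player's time on court ran out.
    """
    rows = []
    start = 0.0
    for k, t in enumerate(foul_times, start=1):
        rows.append((k, start, t, 1))
        start = t
    # The next foul was never observed unless the player fouled out.
    if len(foul_times) < max_fouls and total_played > start:
        rows.append((len(foul_times) + 1, start, total_played, 0))
    return rows


# Hypothetical game: fouls after 120 s, 600 s and 1240 s of play, 2020 s played.
rows = foul_intervals([120, 600, 1240], total_played=2020)
gaps = [stop - start for _, start, stop, _ in rows]
```

Each row's gap (`stop - start`) is the quantity compared across foul numbers; the final row only says the 4th foul took longer than 780 seconds, not how long.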

Data

Data
Play-by-play data and box-score data from the NBA for the 2011-2012 through 2017-2018 seasons. This data is publicly available from nba.com; cleaned-up data is available at eightthirtyfour.com. Using the box-score data and the substitutions in the play-by-play for each game, we can determine the amount of time any given player has actively played in the current game at each event in the play-by-play data. We also generated censoring times using the maximum playing time for a given player, because games end, censoring observations.
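One way the conversion from game clock to player time could work is sketched below. It assumes substitutions have already been reduced to on-court stints given as (start, end) pairs in elapsed game seconds; that representation, the function name, and the numbers are hypothetical. Taking the censoring time as the player's total time on court in the game is one reading of "maximum playing time".

```python
def player_time_at(stints, game_sec):
    """Seconds the player has been on court by elapsed game time `game_sec`."""
    played = 0.0
    for start, end in sorted(stints):
        if game_sec <= start:
            break  # this stint and all later ones begin after the event
        played += min(game_sec, end) - start
    return played


# Hypothetical stints (elapsed game seconds) and foul events for one game.
stints = [(0, 400), (700, 1440), (2000, 2880)]
foul_game_secs = [120, 900, 2100]

foul_player_secs = [player_time_at(stints, t) for t in foul_game_secs]
# Assumed censoring time: total time on court in this game.
censor_time = sum(end - start for start, end in stints)
```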

Basic Analysis

Censoring
If games were infinitely long, and players continued to play, we would observe every player until he fouled out. Many fouls are censored due to the end of follow-up time. Censoring biases the numbers we see: the graph looks like it supports our hypothesis, but the underlying data is biased because of censoring.
[Figure: Rate densities when we ignore censoring]
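A small simulation can illustrate why ignoring censoring is misleading. In the sketch below, fouls arrive at a constant rate with no tilt at all, yet the average observed gap still shrinks for later fouls, because a 4th foul only fits into the available playing time when the earlier gaps were short. The foul rate, minutes played, and memoryless assumption are illustrative choices, not estimates from the NBA data.

```python
import random


def simulate_observed_gaps(rate=1 / 600, minutes_played=30, n_games=20000, seed=0):
    """Mean observed gap (seconds) before foul k, ignoring censored gaps."""
    rng = random.Random(seed)
    limit = minutes_played * 60
    gaps = {k: [] for k in range(1, 7)}
    for _ in range(n_games):
        clock = 0.0
        for k in range(1, 7):  # a player fouls out on his 6th foul
            gap = rng.expovariate(rate)  # same hazard for every foul: no tilt
            clock += gap
            if clock > limit:
                break  # playing time ran out first, so foul k is censored
            gaps[k].append(gap)
    return {k: sum(v) / len(v) for k, v in gaps.items() if v}


mean_gap = simulate_observed_gaps()
for k in sorted(mean_gap):
    print(f"foul {k}: mean observed gap {mean_gap[k]:.0f} s")
```

The true mean gap is 600 seconds for every foul in this setup, so any downward trend in the printed values comes from censoring alone.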

Kaplan-Meier Curves
A survival curve, in general, is used to map the length of time that elapses before an event occurs. Here, the curves give the probability that a player has survived to a certain time without committing a particular numbered foul. The graph is now less biased, but still problematic.
[Figure: KM curves for DeMarcus Cousins]
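For readers unfamiliar with the estimator, here is a minimal pure-Python sketch of the Kaplan-Meier product-limit calculation. The durations and event flags are made up for illustration; in practice a survival-analysis library would normally be used instead.

```python
def kaplan_meier(durations, observed):
    """Return [(time, survival probability)] at each distinct event time.

    durations: time until the foul or until censoring
    observed:  1 if the foul was seen, 0 if the observation was censored
    """
    data = sorted(zip(durations, observed))
    n_at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        events = 0
        leaving = 0
        # Group all observations that share the same time.
        while i < len(data) and data[i][0] == t:
            events += data[i][1]
            leaving += 1
            i += 1
        if events:
            surv *= 1 - events / n_at_risk
            curve.append((t, surv))
        n_at_risk -= leaving
    return curve


# Hypothetical minutes of play until a given foul; 0 marks a censored game.
curve = kaplan_meier([5, 8, 8, 12, 15, 20], [1, 1, 0, 1, 0, 1])
```

Censored games still count in the at-risk denominator up to the time they drop out, which is what distinguishes this from simply averaging the observed foul times.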

Basic Summary
Basic summary statistics give some insight: time to foul seems to shrink. But again, these numbers do not account for censoring. E.g., Boogie had 72 games where he committed only 1 or 2 fouls. We need to look only at games with a large number of fouls.

Subset the Data
Instead, we will examine the games for each player in which he committed a minimum of five fouls, and limit our analysis to the first four fouls. We do this, because of the censoring, to ensure we have comparable games. This foul restriction gives us a larger sample size, though it keeps us from understanding how players accrue their 5th and 6th fouls.
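The subsetting rule can be sketched in a few lines. The dictionary layout (game id mapped to foul times in player seconds) and the sample games are hypothetical.

```python
def subset_games(fouls_by_game, min_fouls=5, keep_first=4):
    """Keep games with at least `min_fouls` fouls, truncated to the first few."""
    return {
        game: sorted(times)[:keep_first]
        for game, times in fouls_by_game.items()
        if len(times) >= min_fouls
    }


# Hypothetical foul times (player seconds) for three games.
fouls_by_game = {
    "game_a": [95, 410, 730, 1100, 1500],
    "game_b": [300, 1200],
    "game_c": [60, 200, 650, 900, 1300, 1700],
}
subset = subset_games(fouls_by_game)
```

Because every retained game has a 5th foul, none of the first four fouls in the subset is censored.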

Better Analysis

Kaplan-Meier Curves Part 2
Now we can see a pretty distinct ordering in how fouls are accrued. If there were no tilt, then we'd expect to see fouls occur a lot more randomly. This is evidence towards our hypothesis, though certainly not proof.
[Figure: KM curves for DeMarcus Cousins]

KM Curves for More Players
[Figure: KM curves for Al Horford]
[Figure: KM curves for Cousins, Horford, Howard, Bogut, Chandler, M. Gasol, B. Lopez, & R. Lopez pooled]

Time for Some Math
A conditional risk set model, since we have repeated events (fouls). But really a stratified Cox model, since we distinguish between fouls. The model's terms are the covariates (e.g. score, game time, etc.), the survival time, and the baseline hazard.
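For reference, a textbook form of a stratified Cox hazard is written out below. The notation is an assumption rather than a copy of the slide: $k$ indexes the foul number (the stratum), $t$ is the survival time in player time, $h_{0k}(t)$ is the baseline hazard for foul $k$, and $X$ holds covariates such as score and game time.

```latex
h_k(t \mid X) = h_{0k}(t)\,\exp\!\left(\beta^{\top} X\right), \qquad k = 1, 2, \dots
```

In the conditional risk set setup, a player only enters the risk set for foul $k$ once he has committed foul $k-1$.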

Cox Model Results
Let's also control for some other variables: score and game time. We see that fouls 2, 3, and 4 are each committed sooner than the prior foul. Hazard ratio = exp(coeff_k - coeff_(k-1)). E.g., when Cousins has 3 fouls, he is 3.74 times (374%) as likely to commit a foul at any given time as when he has only 2 fouls.
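The hazard-ratio formula is easy to check numerically. The coefficients below are hypothetical, chosen so the ratio reproduces the Cousins figure read as 3.74 (374%); the fitted coefficients themselves are not given in the slides.

```python
import math


def hazard_ratio(coef_k, coef_prev):
    """Hazard ratio between consecutive foul counts: exp(coef_k - coef_prev)."""
    return math.exp(coef_k - coef_prev)


# Hypothetical coefficients; only their difference matters for the ratio.
coef_two_fouls = 0.50
coef_three_fouls = coef_two_fouls + math.log(3.74)

ratio = hazard_ratio(coef_three_fouls, coef_two_fouls)
print(f"hazard ratio: {ratio:.2f}")
```

A ratio above 1 means the instantaneous foul rate is higher with the larger foul count, holding the other covariates fixed.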

Further Thoughts
Oh god, it's so much more complicated. All this subsetting allows for some comparable fouls, but then we are ignoring all the games where he didn't foul very much. What about hacking? What about the other side of the court? Players may get mad if they are fouled. Censoring makes these sorts of problems really hard.

Thank You!
Udam Saini - my amazing collaborator. All our data is available at eightthirtyfour.com
Sam - for tricking me into giving this talk
My job - for having an office nearby and for letting me do sports as a side gig
Mike Lopez - even though he's not here
Jackie Bradley Jr - for nine RBIs with two outs
And, of course, DeMarcus Cousins - a constant source of inspiration and entertainment

Thanks!
Contact me: causalkathy@gmail.com, @CausalKathy on Twitter, www.causalkathy.com