Revisiting the Hot Hand Theory with Free Throw Data in a Multivariate Framework

Similar documents
The Cold Facts About the "Hot Hand" in Basketball

Do Steph Curry and Klay Thompson Have Hot Hands?

Extreme Shooters in the NBA

The Hot Hand Myth In Professional Basketball. Jonathan J. Koehler* University of Texas at Austin. Caryn A. Conley** New York University

Does Momentum Exist in Competitive Volleyball?

download.file(" destfile = "kobe.rdata")

The Hot Hand, Competitive Experience, and Performance Differences by Gender

NBA TEAM SYNERGY RESEARCH REPORT 1

arxiv: v2 [stat.ap] 4 Nov 2017

Lab 2: Probability. Hot Hands. Template for lab report. Saving your code

Horseshoe pitchers hot hands

Hot Hands and Equilibrium

This page intentionally left blank

Performance under Pressure in the NBA. Zheng Cao Oregon State University. Joseph Price Brigham Young University

How to Win in the NBA Playoffs: A Statistical Analysis

STUDY BACKGROUND. Trends in NCAA Student-Athlete Gambling Behaviors and Attitudes. Executive Summary

1 Streaks of Successes in Sports

The final set in a tennis match: four years at Wimbledon 1

TOPIC 10: BASIC PROBABILITY AND THE HOT HAND

Legendre et al Appendices and Supplements, p. 1

The Efficacy of Momentum-Stopping Timeouts on Short-Term Performance in the National Basketball Association

Pairwise Comparison Models: A Two-Tiered Approach to Predicting Wins and Losses for NBA Games

On the advantage of serving first in a tennis set: four years at Wimbledon

Existence of Nash Equilibria

Two Machine Learning Approaches to Understand the NBA Data

Home Team Advantage in the NBA: The Effect of Fan Attendance on Performance

Journal of Quantitative Analysis in Sports

Professionals Play Minimax: Appendix

Evaluating NBA Shooting Ability using Shot Location

PREDICTING the outcomes of sporting events

No Christoph Bühren and Stefan Krabel. Individual Performance after Success and Failure - A Natural Experiment

Game Theory (MBA 217) Final Paper. Chow Heavy Industries Ty Chow Kenny Miller Simiso Nzima Scott Winder

Game Rules BASIC GAME. Game Setup NO-DICE VERSION. Play Cards and Time Clock

Behavior under Social Pressure: Empty Italian Stadiums and Referee Bias

Predicting Momentum Shifts in NBA Games

Examining NBA Crunch Time: The Four Point Problem. Abstract. 1. Introduction

WHAT CAN WE LEARN FROM COMPETITION ANALYSIS AT THE 1999 PAN PACIFIC SWIMMING CHAMPIONSHIPS?

QED. Queen s Economics Department Working Paper No Correcting for bias in hot hand analysis: Analyzing performance streaks in youth golf

Pierce 0. Measuring How NBA Players Were Paid in the Season Based on Previous Season Play

Investigation of Winning Factors of Miami Heat in NBA Playoff Season

Regression to the Mean at The Masters Golf Tournament A comparative analysis of regression to the mean on the PGA tour and at the Masters Tournament

Our Shining Moment: Hierarchical Clustering to Determine NCAA Tournament Seeding

A Novel Approach to Predicting the Results of NBA Matches

Fit to Be Tied: The Incentive Effects of Overtime Rules in Professional Hockey

STUDY OF PSYCHOMOTOR VARIABLES OF BASKETBALL PLAYERS AT DIFFERENT LEVELS OF COMPETITIONS

Chapter 5: Methods and Philosophy of Statistical Process Control

Volume 37, Issue 3. Elite marathon runners: do East Africans utilize different strategies than the rest of the world?

1. Answer this student s question: Is a random sample of 5% of the students at my school large enough, or should I use 10%?

Team 1. Lars Eller vs. Montreal Canadiens. Submissions on behalf of Montreal Canadiens (Team Side)

FYG Backing for Work Zone Signs

Using Spatio-Temporal Data To Create A Shot Probability Model

BASKETBALL. Game Rules. Use an official basketball, though any size will do. Consider using smaller balls for smaller kids.

Fishery Resource Grant Program Final Report 2010

Five Great Activities Using Spinners. 1. In the circle, which cell will you most likely spin the most times? Try it.

Ups and Downs: Team Performance in Best-of-Seven Playoff Series

Opleiding Informatica

Expected Value 3.1. On April 14, 1993, during halftime of a basketball game between the. One-and-One Free-Throws

When Falling Just Short is a Good Thing: the Effect of Past Performance on Improvement.

The probability of winning a high school football game.

Navigate to the golf data folder and make it your working directory. Load the data by typing

Clutch Hitters Revisited Pete Palmer and Dick Cramer National SABR Convention June 30, 2008

Cabrillo College Transportation Study

Using Actual Betting Percentages to Analyze Sportsbook Behavior: The Canadian and Arena Football Leagues

DESIGNING PERCEPTUAL MOTOR SKILL DEVELOPMENT PROGRAMS FOR SQUASH

Reference Points, Prospect Theory and Momentum on the PGA Tour

Quantitative Literacy: Thinking Between the Lines

Average Runs per inning,

Should bonus points be included in the Six Nations Championship?

Decision Making in American Football: Evidence from 7 Years of NFL Data

Lab Report Outline the Bones of the Story

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question:

How Effective is Change of Pace Bowling in Cricket?

The Effect Touches, Post Touches, and Dribbles Have on Offense for Men's Division I Basketball

One of the most-celebrated feats

Decision-Making under the Gambler s Fallacy:

Bhagwant N. Persaud* Richard A. Retting Craig Lyon* Anne T. McCartt. May *Consultant to the Insurance Institute for Highway Safety

Section 5.1 Randomness, Probability, and Simulation

Prize Structures, Gender, and Performance in Competitive Marathon Running

Journal of Quantitative Analysis in Sports

Predictors for Winning in Men s Professional Tennis

Additional On-base Worth 3x Additional Slugging?

Jonathan White Paper Title: An Analysis of the Relationship between Pressure and Performance in Major League Baseball Players

AN ANALYSIS OF TEAM STATISTICS IN AUSTRALIAN RULES FOOTBALL. Andrew Patterson and Stephen R. Clarke 1. Abstract 1. INTRODUCTION

An Empirical Comparison of Regression Analysis Strategies with Discrete Ordinal Variables

Round Numbers as Goals Evidence From Baseball, SAT Takers, and the Lab

Attack-Tempo and Attack-Type as predictors of attack point made by opposite players in Volleyball

SPORTS TEAM QUALITY AS A CONTEXT FOR UNDERSTANDING VARIABILITY

Figure 1. Winning percentage when leading by indicated margin after each inning,

An Analysis of Factors Contributing to Wins in the National Hockey League

Racial Bias in the NBA: Implications in Betting Markets

Basketball data science

Is Home-Field Advantage Driven by the Fans? Evidence from Across the Ocean. Anne Anders 1 John E. Walker Department of Economics Clemson University

Journal of Quantitative Analysis in Sports

Math SL Internal Assessment What is the relationship between free throw shooting percentage and 3 point shooting percentages?

Wildlife Ad Awareness & Attitudes Survey 2015

ABSTRACT MOMENTUM ON THE PGA TOUR. Kyle Savage, PhD, Professor Seppo E. Iso-Ahola, Kinesiology

It s conventional sabermetric wisdom that players

Analyzing the Influence of Player Tracking Statistics on Winning Basketball Teams

*Author for Correspondence

HSIS. Association of Selected Intersection Factors With Red-Light-Running Crashes. State Databases Used SUMMARY REPORT

Transcription:

Calhoun: The NPS Institutional Archive DSpace Repository Faculty and Researchers Faculty and Researchers Collection 2010 Revisiting the Hot Hand Theory with Free Throw Data in a Multivariate Framework Arkes, Jeremy Journal of Quantitative Analysis in Sports, Volume 6, Issue 1, 2010 http://hdl.handle.net/10945/43641 Downloaded from NPS Archive: Calhoun

Revisiting the Hot Hand Theory with Free Throw Data in a Multivariate Framework Journal of Quantitative Analysis in Sports, Volume 6, Issue 1 2010 Article 2 Jeremy Arkes Naval Postgraduate School Ingersoll-335 Monterey, CA 93943 arkes@nps.edu Abstract Despite the conventional wisdom among players and fans of the existence of the hot hand in basketball, studies have found only no evidence or weak evidence for the hot hand in game situations, although stronger evidence in controlled settings. These studies have considered both free s and field goals. Given the heterogeneous nature of field goals and several potential sources that could cause a positive or negative correlation between consecutive shots (such as having a weak defender), free s may provide for a more controlled setting to test for the hot hand. Almost all studies have tested for the hot hand at the individual level in univariate frameworks, and as some have pointed out, the studies may not have had enough power to detect the hot hand. In this study, I use a sample based on every free attempted during the 2005-06 NBA season. I used a multivariate framework with individual player fixed effects. I find that hitting the first free is associated with a 2- to 3-percentage-points higher probability of hitting the second free. Furthermore, the infrequent foul-shooters have more streakiness in their shooting than frequent foul-shooters. 1. Introduction

A common perception among basketball fans, players, and commentators is the existence of the hot hand or streak shooting, where a player has a higher probability of making the next shot had he or she made a few shots in a row. Many people who play basketball believe in the hot hand, as sometimes they just have a better feel for the ball, are more confident in their ability to make the shot, and have better concentration. And teams will often find the player who has made a few shots in a row to take the next shot, under the assumption that they have a higher probability than anyone else of making the next shot (Burns, 2004). Despite this conventional wisdom that the hot hand in basketball is a real phenomenon, there has been little evidence supporting the theory. Researchers have used runs tests, binomial tests, and conditional probability tests, and generally found that the streaks of consecutive made shots are statistically natural. That is, such runs of consecutive made shots occur about as frequently as they would be expected given the natural variation in the probability of making a shot. One potential problem with these studies was insufficient power. Many test players individually rather than in a larger framework. In this study, I test for conditional probabilities within a pooled, multivariate framework. I examine free s from the 2005-06 season. With tens of thousands of observations in my sample, I find statistically significant evidence for the hot hand, as hitting a prior free leads to a 2- to 3- percentage-point higher probability of hitting a given free. 2. Literature Review Bar-Eli et al. (2006) provide a nice review of the literature on the hot hand in all sports. For basketball, the studies examining the hot hand in basketball have used three types of shots: field goals, free, and shots in a controlled experiment. Testing for the hot hand with field goals could be problematic because of several possible sources of correlation between the probability of success for consecutive shots. The direction of this correlation, however, is ambiguous. If a player has a weak defender on him who allows him to drive easily to the basket for a lay-up or to take open shots, then there would be a positive correlation between the success of consecutive shots. On the other hand, several factors could contribute to a negative correlation between the success of consecuvitve field goal. Players may naturally mix up their shot selection so that they will drive for a lay-up (perhaps an easier shot) on one possession and then be more likely to shoot from outside (a more difficult shot) on the next possession. Similarly, if a player misses a few shots from the outside and loses confidence, he may be more likely to wait for an easy shot to get his confidence back. In addition, one could argue that if a player hits one shot from outside, then the defender will play him tighter on the next opportunity so that he may not get as easy a shot. These factors could contribute to the findings of no evidence for the hot hand, when field goal data are used, as discussed below. 1

Free data does not have these problems because each shot is the same, without any defenders. Consecutive shots could, however, be correlated due to a few factors. First, the pressure of a free could be positively correlated, as it would depend on the game situation. Second, a player may be fatigued, and this fatigue may only slightly ease between the first and second free s. Thus, free s are not perfect for testing for the hot hand, but it may be optimal over using field goals. Shots from a controlled experiment, therefore, may allow for the cleanest test of the hot hand. However, shooting uncontested in a gym is a far different situation from shooting in a game situation. Thus, the applicability of such results to a game situation is uncertain. In the seminal paper on the hot hand, Gilovich et al. (1985) used all three types of data: free shooting among the Philadelphia 76ers, Boston Celtics, and Cornell University basketball teams, field goals among several field goals among players from the New York Knicks and New Jersey Nets, and a controlled shooting experiment among the Cornell University men s and women s teams. Analyzing conditional probability, run counts, and serial correlations, they find no evidence for the hot hand. In the only other study on free s in game situations, Wardrop (1995) finds no statistically significant evidence across individuals members of the Boston Celtics (over the 1980-81 and 1981-82 seasons) that players were more likely to make their second free had they made their first free. Pooling all players, however, the data do suggest that making the first free is correlated with making the second free, as players hit 78.9% of their free s after a make, but only 74.3% after a miss. However, pooling data like this, without any control for the players, naturally produces such a result because the good free shooters will have many observations in which he hits both the first and second free s, while the poor shooters will have many observations in which he misses both. If the best and worst free shooters (Larry Bird and Rick Robey) are taken out of the data, then the 4.6% difference is reduced to a 3.4% difference. [[IS THIS A VALID CRITIQUE??]] Other studies testing for differences in conditional probabilities have also failed to find evidence for the hot hand. Tversky and Gilovich (1989) corrected and analyzed data, originally collected by Larkey et al. (1989), on streaks that occur in short periods of time among 18 of the elite players in the game. They find no evidence for the hot hand. Another study using field goal data was conducted for the website 82games.com. Among NBA players in the 2005-06 season who had at least 200 field goals, 17 had a higher shooting percentage after missing the last shot than after making the last attempt by at least 10 percentage points, while no players had at least a 10-percentage-point higher shooting percentage after making the last shot than after missing the last shot. 1 Two other interesting studies used experiments. Koehler and Conley (2003) examined 23 players in the NBA s 3-point shooting contests from 1994 to 1997. This is a quasiexperiment because there is no defender. However, players typically are rushed to get 25 1 These statistics come from http://www.82games.com/random25.htm. 2

shots off in a minute, so for the last several shots, fatigue may set in or they may be rushed. Still, they find that only two of the players demonstrated any unusual streaks of success. Wardrop (1999) uses a controlled setting for a female basketball player at the University of Wisconsin. She shot 100 shots on 20 different days. Wardrop (1999) finds statistically significant evidence against the hypothesis that the shots follow a Bernoulli trial. In summary, there is no evidence that there is a hot hand for field goals in game situations. However, there was some evidence finding significant evidence for the hot hand based on a controlled experiment and on pooling free s for players on the Boston Celtics. It is uncertain whether the results from a controlled experiment would translate to game situations. And the strategy of pooling players without controls for the players is questionable. A common critique of the studies has been that the statistical tests are too weak to detect significance (Miyoshi, 2000; Wardrop, 1999; Frame et al., 2003; Dorsey-Palmateer and Smith, 2004). For example, Miyoshi (2000) used simulations to show that, when certain parameters are set to realistic levels for the sensitivity tests used in Gilovich et al. (1985), then the runs test would detect only 12% of all instances of the hot hand. In this paper, I get around the power problem by using data on all NBA players from the 2005-06 season. I pool all players into one model, while controlling for each player in a multivariate framework. 3. Methods The data for this analysis were provided by 82games.com. The data include detailed information on all 64,698 free s taken in the 2005-06 season. The variables include an indicator for the player, whether the player made the free, what sequence of a set of free s each shot is (e.g., 1 of 1, 1 of 2, etc.), which quarter of the game the shot took place in, how many free s the player made in the prior 1, 2, 3, 4, and 5, how many field goals the player made in the prior 1, 2, and 3, and how many of the past 10 free s the team attempted were made. For a first look at the data, I examined the 89 players who had at least 50 sets of two or more free s. Comparing probabilities using individual-level conditional probabilities, six players had a significantly (at the 10% level) higher probability of hitting a second free after making the first one relative to missing the first one, while two players had significantly higher probabilities of making a free after a miss. Whereas past studies typically used runs tests for individual players, I combine all players into one model in a multivariate, fixed-effects framework. Specifically I use fixed-effect logit models to estimate the effect of making previous free s on the probability of 3

making a given free. Thus, the estimates represent within-player comparisons. Such a framework provides much greater power for testing for the hot hand than previous analyses. Furthermore, it allows me to control for other factors, such as the quarter of the game during which the free occurs. I use several variants of the model to address different questions and to test for the sensitivity of the results. First, I estimate whether a player is more likely to make a second free in a set of free s had he made the first free. I estimate this model for all players (435 who remain in the regression sample). And to test whether frequent or infrequent foul-shooters are more subject to the hot hand (or streakiness), I estimate the model separately for the 213 players who had fewer than 100 free s during the season and the 113 players who had at least 200 free s. The next set of models will then examine whether success on a free relies on success on prior to the last free. This set of models uses the first free in a set as the outcome, which reduces potential problems from a possible correlation between free s in a set due to being in the same game situation and level of fatigue. In these models, I examine how success on up to the last 5 free s affects the probability of making the current free. 4. Results Table 1 presents the marginal effects and their standard errors for the main set of models. In these models, I examine the second of all sets of two or three free s. I test whether a player is more likely to make the second free if he had made the first free. The only other control variables are quarter dummy variables. The first column shows the results for all players without fixed effects, with all 28,240 second free s analyzed for 485 different players. The next three models add player fixed effects to the model. The fixed-effect model with all players, in the second column, loses 172 observations for 50 players who made all or made none of their free s. The third column shows the results for those 213 players who had fewer than 100 free s during the season; and the fourth column shows the results for the 113 players who had more than 200 free s during the season. The comparison of the estimates shows whether regular free shooters are more or less likely to be subject to the hot hand than infrequent free shooters. The evidence for all players and for the frequent and infrequent free shooters all provides evidence for the hot hand. Not controlling for player fixed effects, the estimated effect of hitting the first free is a 6.4-percentage-point increase in the probability of hitting the second free. However, when player fixed effects are included, hitting the first free is estimated to raise the probability that a player will hit the second free by an estimated 2.9 percentage points for all players. Thus, it appears that not including player fixed effects, as Waldrop (1995) had not done, could cause an upward bias in the estimated hot hand effect. Separating players by free frequency, hitting the first free is estimated to increase the probability of hitting the second 4

shot by 2.7 percentage points for infrequent shooters, and by 5.0 percentage points for frequent shooters. The estimates are statistically significant at the 5-percent level for infrequent shooters and at the one-percent level for all players and frequent shooters. However, the difference in the estimates between the frequent and infrequent shooters is not statistically significant. As mentioned earlier, one potential criticism of examining the correlation of success for free s in a set of two free s is that the situation of the set of free s could affect the probability of success for both free s. I can reduce this potential problem by estimating how success on prior free sets could affect the probability that a person makes a free. In Table 2, I examine the first free (of a single free or a set of free s). Being the first free, the correlation to the previous free s due to being in a similar situation (fatigue and game pressure) should be significantly lower. The first column shows how making the prior free affects the probability of making the current free, with the sample for this model being all first free s when the player had at least one prior free in the game. The next column estimates how the number of free s made in the prior two affects the probability of making the current free, with the sample this time requiring that the player had at least two prior free s. The following columns are for the number of made free s in the prior 3, 4, and 5. The estimated effect of making the prior free is a 1.9-percentage-point increase in the probability of making the current free, which is significant at the 10-percent level. This estimate is smaller than the 2.9-percentage-point effect from the prior free within the same set of free s. The estimate for the number of made free s in the prior two is significant at the 5-percent level. Making one of the prior two free s increases the probability of making a free (relative to not making any of the prior two ) by 1.5 percentage points, while making both of the last two free s increases the probability by 3.0 percentage points. The estimates for the number of free s made in the last 3, 4, and 5 were all insignificant. Interestingly, a free following a made field goal with a foul (an And-one situation) does not have a significant effect on the probability of making the free. One application of the hot hand, or an adrenaline effect, would be that making a shot as one is getting fouled is indicative of shooting well or having adrenaline. However, it does not appear to be correlated with success on the foul line. Caveats One drawback of the approach in this study is that a cold hand could potentially drive the results. That is, the finding of a higher probability of making a shot after making the previous shot could be the result of a player having periods in which he made a low percentage of shots but not any particular periods in which he made an extraordinarily 5

high percentage of shots. So, the evidence presented in this paper may be for the existence of the cold hand and not necessarily the hot hand. 5. Conclusions Previous studies testing for the hot hand have mostly been based on testing individuals. Thus, many studies have not had sufficient power to detect any evidence for the hot hand. The only convincing study that had detected the hot hand was based on a controlled experimental setting, which may not translate well to a game situation. In this study, I use a pooled, multivariate framework to provide greater power in testing for the hot hand in free shooting among NBA players. In contrast to past studies for game situations, I find strong evidence for the hot hand, as hitting the prior free increases the probability of making a free by 2 to 3 percentage points. The hot hand appears to be stronger for infrequent than for frequent foul-shooters. Furthermore, whereas success on the prior two free s increases the probability of making a free, success on free s prior to that apparently does not matter. 6

References Bar-Eli M, Avugos S, Raab M (2006), Twenty years of hot hand research: Review and critique, Psychology of Sport and Exercise 7(6), 525-553. Burns, B. D. (2004). Heuristics as beliefs and as behaviors: The adaptiveness of the hot hand. Cognitive Psychology, 48, 295 331. Dorsey-Palmateer, R., & Smith, G. (2004). Bowlers hot hands. The American Statistician, 58, 38 45. Frame, D., Hughson, E., & Leach, J. C. (2003). Runs, regimes, and rationality: The hot hand strikes back. Working paper. Gilovich, T., Vallone, R., & Tversky, A. (1985). The hot hand in basketball: On the misperception of random sequences. Cognitive Psychology, 17, 295 314. Larkey, P. D., Smith, R. A., & Kadane, J. B. (1989). It s okay to believe in the hot hand. Chance, 2, 22 30. Miyoshi, H. (2000). Is the hot hands phenomenon a misperception of random events? Japanese Psychological Research, 42, 128 133. Tversky, A., & Gilovich, T. (1989b). The hot hand : Statistical reality or cognitive illusion? Chance, 2, 31 34. Wardrop, R. L. (1995). Simpson s paradox and the hot hand in basketball. The American Statistician, 49, 24 28. Wardrop, R. L. (1999). Statistical tests for the hot-hand in basketball in a controlled setting. http://www.stat.wisc.edu/_wardrop/papers/tr1007.pdf. 7

Table 1: Marginal effects from fixed-effect logit models (Sample = second of two free s) Dependent variable = whether made the foul shot No player fixed effects All made last free 0.064*** (0.006) 2nd quarter -0.001 (0.008) 3 rd quarter -0.011 (0.008) 4 th quarter -0.005 (0.007) All 0.029*** (0.008) -0.003 (0.011) -0.019* (0.011) -0.020* (0.011) Player fixed effects Less than 100 free At least 200 s in free s season in season 0.027** 0.050*** (0.011) (0.019) 0.011-0.038 (0.014) (0.030) -0.004-0.070** (0.014) -0.011 (0.014) (0.030) -0.038 (0.029) # observations 28,240 28,068 4149 17,225 # players 485 435 213 113 8

Table 2: Marginal effects from fixed-effect logit models (Sample = first free in a set and having at least 1, 2, 3, 4, or 5 prior free in the game) Dependent variable = whether made the foul shot made last free # free s made in last 2 # free s made in last 3 # free s made in last 4 # free s made in last 5 2nd quarter 3rd quarter 4th quarter And-one (following a made field goal) Had at least 1 prior free attempt 0.019* (0.010) -0.027 (0.017) 0.000 (0.016) -0.006 (0.016) 0.008 (0.012) Had at least 2 prior free 0.015** (0.007) -0.039** (0.018) -0.006 (0.018) -0.013 (0.017) 0.002 (0.012) Had at least 3 prior free 0.010 (0.008) -0.008 (0.029) 0.024 (0.026) 0.006 (0.026) 0.002 (0.016) Had at least 4 prior free 0.008 (0.007) -0.003 (0.033) 0.031 (0.030) 0.015 (0.029) -0.004 (0.017) Had at least 5 prior free # observations 19,516 17,458 10,733 8794 5609 # players 383 373 283 260 200 0.000 (0.008) -0.002 (0.046) 0.038 (0.040) 0.020 (0.039) -0.015 (0.022) 9