Having the Lead vs. Lagging Behind : The Incentive Effect of Handicaps in Tournaments

Similar documents
Gender Differences in Competition: Evidence from Swimming Data Shoko Yamane 1 Ryohei Hayashi 2

Peer Effect in Sports: A Case Study in Speed. Skating and Swimming

Measuring the Sorting and Incentive Effects of Tournament Prizes

WHAT CAN WE LEARN FROM COMPETITION ANALYSIS AT THE 1999 PAN PACIFIC SWIMMING CHAMPIONSHIPS?

Deciding When to Quit: Reference-Dependence over Slot Machine Outcomes

Prize Structures, Gender, and Performance in Competitive Marathon Running

Existence of Nash Equilibria

Are Competitors Forward Looking in Strategic Interactions? Evidence from the Field

Are Workers Rewarded for Inconsistent Performance?

Fit to Be Tied: The Incentive Effects of Overtime Rules in Professional Hockey

Measuring the Sorting and Incentive Effects of Tournament Prizes

Gamblers Favor Skewness, Not Risk: Further Evidence from United States Lottery Games

Legendre et al Appendices and Supplements, p. 1

Generating Power in the Pool: An Analysis of Strength Conditioning and its Effect on Athlete Performance

Competitive Performance of Elite Olympic-Distance Triathletes: Reliability and Smallest Worthwhile Enhancement

Potential, Prizes and Performance: Testing Tournament Theory with Professional Tennis Data

Behavior under Social Pressure: Empty Italian Stadiums and Referee Bias

Working Paper No

Racial Bias in the NBA: Implications in Betting Markets

Teams as Superstars: Effort and Risk Taking in Rank-Order Tournaments for Women and Men. Mario Lackner. Working Paper No December 2016

Journal of Quantitative Analysis in Sports

Basic Wage for Soccer Players in Japan :

The final set in a tennis match: four years at Wimbledon 1

What Causes the Favorite-Longshot Bias? Further Evidence from Tennis

Men in Black: The impact of new contracts on football referees performances

Running head: DATA ANALYSIS AND INTERPRETATION 1

An Analysis of the Effects of Long-Term Contracts on Performance in Major League Baseball

Chapter 2: Modeling Distributions of Data

Volume 37, Issue 3. Elite marathon runners: do East Africans utilize different strategies than the rest of the world?

Lesson 14: Modeling Relationships with a Line

Using Actual Betting Percentages to Analyze Sportsbook Behavior: The Canadian and Arena Football Leagues

MAXIMUM ECONOMIC YIELD AND RESOURCE ALLOCATION IN THE SPINY LOBSTER INDUSTRY* Joel S. Williams and Fred J. Prochaska

!!!!!!!!!!!!! One Score All or All Score One? Points Scored Inequality and NBA Team Performance Patryk Perkowski Fall 2013 Professor Ryan Edwards!

Effects of Incentives: Evidence from Major League Baseball. Guy Stevens April 27, 2013

The Effect of Newspaper Entry and Exit on Electoral Politics Matthew Gentzkow, Jesse M. Shapiro, and Michael Sinkinson Web Appendix

Staking plans in sports betting under unknown true probabilities of the event

Analysis of recent swim performances at the 2013 FINA World Championship: Counsilman Center, Dept. Kinesiology, Indiana University

What does it take to produce an Olympic champion? A nation naturally

Is Home-Field Advantage Driven by the Fans? Evidence from Across the Ocean. Anne Anders 1 John E. Walker Department of Economics Clemson University

LOW PRESSURE EFFUSION OF GASES revised by Igor Bolotin 03/05/12

Keywords: multiple linear regression; pedestrian crossing delay; right-turn car flow; the number of pedestrians;

BENDIGO EAST SWIMMING CLUB INC. Reg No 000A281 RACING RULES

Returns to Skill in Professional Golf: A Quantile Regression Approach

A Hare-Lynx Simulation Model

Tournament Selection Efficiency: An Analysis of the PGA TOUR s. FedExCup 1

IMPROVING POPULATION MANAGEMENT AND HARVEST QUOTAS OF MOOSE IN RUSSIA

The relationship between payroll and performance disparity in major league baseball: an alternative measure. Abstract

Station heterogeneity and the dynamics of retail gasoline prices

Extensive Games with Perfect Information

Superstar Effects in Golf Tournaments. Zachary Lazow, Colgate University

Does the Three-Point Rule Make Soccer More Exciting? Evidence from a Regression Discontinuity Design

EE 364B: Wind Farm Layout Optimization via Sequential Convex Programming

Emergence of a professional sports league and human capital formation for sports: The Japanese Professional Football League.

Rank and Performance in Dynamic Tournaments: Evidence from the PGA Tour

League Quality and Attendance:

Gizachew Tiruneh, Ph. D., Department of Political Science, University of Central Arkansas, Conway, Arkansas

Journal of Quantitative Analysis in Sports

Original Article. Correlation analysis of a horse-betting portfolio: the international official horse show (CSIO) of Gijón

Utilization of the spare capacity of exclusive bus lanes based on a dynamic allocation strategy

Journal of Human Sport and Exercise E-ISSN: Universidad de Alicante España

IHS AP Statistics Chapter 2 Modeling Distributions of Data MP1

University of Kassel Swim Start Research

Columbia University. Department of Economics Discussion Paper Series. Auctioning the NFL Overtime Possession. Yeon-Koo Che Terry Hendershott

Journal of Sports Economics 2000; 1; 299

STAT 625: 2000 Olympic Diving Exploration

MICROSIMULATION USING FOR CAPACITY ANALYSIS OF ROUNDABOUTS IN REAL CONDITIONS

Basketball data science

WIND-INDUCED LOADS OVER DOUBLE CANTILEVER BRIDGES UNDER CONSTRUCTION

Marginal Satisfaction of Recreational Hunters Red Deer Harvests Geoffrey N. Kerr Lincoln University

save percentages? (Name) (University)

Department of Economics Working Paper

SPATIAL STATISTICS A SPATIAL ANALYSIS AND COMPARISON OF NBA PLAYERS. Introduction

Jodi M. Cossor and Bruce R. Mason Australian lnstitute of Sport Biomechanics Department, Canberra, Australia

EXPLORING MOTIVATION AND TOURIST TYPOLOGY: THE CASE OF KOREAN GOLF TOURISTS TRAVELLING IN THE ASIA PACIFIC. Jae Hak Kim

Vorgee Endurance 1000

Quitters Never Win: The (Adverse) Incentive E ects of Competing with Superstars

Working Paper no Sequential Teamwork in Competitive Environments: Theory and Evidence from Swimming Data

The pth percentile of a distribution is the value with p percent of the observations less than it.

Contingent Valuation Methods

65+ Men, Women One Man, One Woman 75+ Men, Women One Man, One Woman. Participants may compete in only one division at Ontario 55+ Summer Games.

The Effects of Altitude on Soccer Match Outcomes

Novel empirical correlations for estimation of bubble point pressure, saturated viscosity and gas solubility of crude oils

International Discrimination in NBA

The risk assessment of ships manoeuvring on the waterways based on generalised simulation data

EFFICIENCY OF TRIPLE LEFT-TURN LANES AT SIGNALIZED INTERSECTIONS

Real-Time Electricity Pricing

OXYGEN POWER By Jack Daniels, Jimmy Gilbert, 1979

Chapter 12 Practice Test

Title: Modeling Crossing Behavior of Drivers and Pedestrians at Uncontrolled Intersections and Mid-block Crossings

A Group of Factors Influencing the Development of the Greeks Volleyball Athletes at School Age

Lab 3: Pumps in Series & Pumps in Parallel

DOI /HORIZONS.B P23 UDC : (497.11) PEDESTRIAN CROSSING BEHAVIOUR AT UNSIGNALIZED CROSSINGS 1

On the association of inrun velocity and jumping width in ski. jumping

Kinetic Energy Analysis for Soccer Players and Soccer Matches

E. Agu, M. Kasperski Ruhr-University Bochum Department of Civil and Environmental Engineering Sciences

The Impact of Managerial Change on Performance. The Role of Team Heterogeneity

Navigate to the golf data folder and make it your working directory. Load the data by typing

A IMPROVED VOGEL S APPROXIMATIO METHOD FOR THE TRA SPORTATIO PROBLEM. Serdar Korukoğlu 1 and Serkan Ballı 2.

Revisiting the Hot Hand Theory with Free Throw Data in a Multivariate Framework

Validation Study of Gas Solubility Correlations at bubble point pressure for Some Libyan Crude Oils Using Three chosen Correlations

Transcription:

Having the Lead vs. Lagging Behind : The Incentive Effect of Handicaps in Tournaments Andreas Steinmayr a, Rudi Stracke b, and Dainis Zegners c a LMU Munich b Ministry of Finance, NRW c University of Cologne February 4, 2018 Abstract This article seeks to answer whether handicaps can restore optimal effort provision by agents in heterogeneous contests. To this end, we study data from swimming relay competitions where swimmers with heterogeneous abilities inherit leads or lags from their previous team members. Inheriting a lead or lag corresponds to a head start or handicap affecting the effective level of heterogeneity in races. Our results suggest that as predicted by theory appropriately chosen handicaps can restore optimal effort provision: In our data, swimmers who receive a lead (lag) that corresponds to a third of their ability disadvantage (advantage) exert the same effort as swimmers in balanced races. We thank Oliver Gürtler and audiences at Technical-University Munich, LMU Munich, and the LEOH Workshop on Organizational Economics in 2017 at the University of Cologne for their valuable comments. Corresponding author: Dainis Zegners, University of Cologne, Faculty of Management, Economics and Social Sciences, Digital Transformation and Value Creation Group, Universitätsstraße 91, D-50931 Köln, e-mail: dainis.zegners@uni-koeln.de.

1 Introduction Tournaments are widely used to motivate agents in the workplace and in contexts such as sports, science and innovation, or elections. However, a disadvantage of using tournaments to motivate agents is that heterogeneity in ability may reduce agents effort provision. Lowerability agents have a decreased incentive to exert high effort as their probability to win the tournament against a higher-ability agent is low even if they exert high effort. Higher-ability agents also have a decreased incentive to exert high effort as their probability to win the tournament against a lower-ability agent is high even if they exert low effort. Therefore, the larger the heterogeneity in agents abilities in a tournament, the lower the effort they exert. This prediction originally made by Lazear and Rosen (1981) has been confirmed both in the lab (Schotter and Weigelt, 1992) and in the field (Brown, 2011; Sunde, 2009). A natural remedy to the problem of decreased effort provision in heterogeneous tournaments already studied theoretically by Lazear and Rosen is to reduce the effective level of heterogeneity by introducing head starts or handicaps to level the playing field. An example of such heterogeneity-reducing interventions are affirmative action policies that have the goal to remedy the effects of discrimination (Feinberg, 2003). Although such policies are widely used, empirical evidence on their impact on effort provision is limited. Existing studies either focus on tournaments without monetary incentives (Franke, 2012), on particular types of agents such as children or adolescents (Calsamiglia et al., 2013; Sutter et al., 2016), or are confined to lab-environments (Schotter and Weigelt, 1992; Balafoutas and Sutter, 2012; Leibbrandt et al., 2017). In this paper, we seek to fill this gap in the literature by using data from amateur and professional swimming team competitions. We study the performance of swimmers of heterogeneous abilities who inherit leads or lags from their previous team members. We observe races where inherited leads or lags partly or perfectly compensate for ability differences, as well as races where they increases the effective level of heterogeneity. Since inherited leads or lags affect the chances of swimmers to win a race, theory predicts that effort is maximized 2

when leads or lags compensate for ability differences between swimmers. Our results show that swimmers strongly react to ability-compensating head starts and handicaps: Both favorites and underdogs increase their effort when inherited leads or lags reduce the effective level of heterogeneity in races. In terms of size, our estimations suggest that inheriting a lead or lag that reduces heterogeneity by a third restores optimal effort provision. This suggests that heterogeneous contests with an appropriately designed handicap structure can be as effective as balanced contests in incentivizing agents to exert maximal effort. We contribute mainly to the literature studying contests as a device to motivate agents. The theoretical literature on contests was initiated by Lazear and Rosen (1981), who already proposed handicaps as a remedy to restore effort provision in heterogeneous contests. Other early theoretical contest papers are Green and Stokey (1983), Nalebuff and Stiglitz (1983) and O Keeffe et al. (1984). See Konrad (2009) for an overview of the literature on contests. Theoretical papers that deal more specifically with heterogeneity in contests are Baik (1994), Moldovanu and Sela (2001) and Szymanski and Valletti (2005). There is also a small theoretical literature on handicaps in tournaments (e.g. Denter and Sisak, 2016) and on biased or unbalanced contests, where one of the contestants has an advantage over other contestants (Imhof and Kräkel, 2016; Drugov and Ryvkin, 2017). The effects of heterogeneity in contests has also been studied empirically both in the lab (Bull et al., 1987; Schotter and Weigelt, 1992) and in the field using data from sports competitions (Brown, 2011; Sunde, 2009; Bach et al., 2009; Nieken and Stegh, 2010; Deutscher et al., 2013; Berger and Nieken, 2016; Bamieh, 2016), internal firm records (Cowgill, 2015) or data from online platforms (Boudreau et al., 2016; Lemus and Marshall, 2017). Particular close to our paper is Franke (2012), who studies handicaps in Amateur Golf competitions, and Brown and Chowdhury (2017), who study handicaps in horse-racing focusing on the incidence of sabotage. 3

Figure 1: The Effect of Head Starts and Handicaps on Effort by Relative Ability (Theory) Effort 1.0 upper envelope 0.8 0.6 a 1 a 2 5 a 1 a 2 0 a 1 a 2 5 0.4 0.2 10 5 0 5 10 Note: The figure plots equation (3) for a 1 a 2 { 5, 0, 5} as a function of the head start h, assuming M = 1 and c(e) = 39 8 e2. 2 Model In this section, we present a simple theoretical model to guide our empirical analysis. Consider the simplest Lazear and Rosen (1981) tournament with two agents 1 and 2 who compete for a prize M that is awarded to the agent whose performance is relatively higher. The performance y i of agent i {1; 2} reads y i (e i ) = a i + e i + u i, (1) where a i is the ability, e i is the effort, and u i is the random performance component of agent i. Consider the optimization problem of agent i who maximizes her expected pay-off Π i (e i, e j ) = p[y i (e i ) > y j (e j ) + h] M c(e i ), (2) where i j, p[ ] is the probability that the prize is awarded to agent i, h is a head start (or handicap when negative) awarded to agent j, and c( ) is the cost of effort function. Assuming that the random performance components of both agents are independent draws from a normal distribution with mean zero and that the cost function is sufficiently convex, 4

equilibrium efforts in the unique symmetric pure strategy Nash equilibrium satisfy e e 1 = e 2 = c [g(a 1 a 2 h) M] 1, (3) where g( ) is the density of the normal distribution, and c [ ] 1 is the inverse function of the first derivative of the cost function. As the density is symmetric and has its maximum at zero, the equilibrium effort is strictly decreasing in the absolute value of a 1 a 2 h for a given prize M. Figure 1 shows this relation graphically: Effort is highest whenever the head start or handicap just compensates for the difference in ability between both agents, and the upper envelope is a line with slope zero through e max = c [g(0) M] 1. The model provides the following predictions that we will take to the data: 1. Agents with an ability disadvantage (underdogs) increase their effort when a head start increases their probability of winning the contest. 1 2. Agents with an ability advantage (favorites) increase their effort when a handicap decreases their probability of winning the contest. 3. Combined effort of agents is highest whenever contests are balanced in terms of ability or handicaps and head starts fully compensate for differences in ability. Note that there is an interaction between ability differences and head starts / handicaps: Whether a head start or handicap increases or decreases agents effort depends on whether the agent is a favorite or an underdog. 3 Empirical Setting and Data We use data from amateur and professional swimming competitions collected by the website swimrankings.net that are based on information from the European Swimming Federation 1 As long as it does not overcompensate the underdog, i.e. making it the favorite. 5

(LEN). 2 Relay competitions in swimming are perfectly suited to analyze the effect of handicaps on effort provision for several reasons: First, the performance of swimmers in individual competition races provides us with a proxy for ability, since we know how much time any athlete needs to swim a certain distance within the same environment. Second, abilities of competitors are likely to be common knowledge among swimmers. In particular, a swimmer in a relay competition is likely to know the ability of her immediate competitors. In addition, a swimmer in a relay competition can assess the lead or lag, respectively, to competing teams before choosing effort. 3 Our data set comprises 6,193 relay races from the years 2000-2015. In a relay competition, a team consists of four swimmers that race consecutively 4 Types of relays in our data set include 4 x 50m freestyle competitions (36 percent), 4 x 50m medley 5 competitions (34 percent), 4 x 100m medley competitions (19 percent) and 4 x 100m freestyle competitions (11 percent). For each relay competition, our data set includes the individual swimming times of each swimmer, their gender and birth year, and the length of the pool (25m or 50m). To concentrate on cases where winning a competition is the main goal of a team, we only look at final races in our analysis. We also exclude timed finals, where teams compete in separate races and the team with the fastest racing time across races is declared the overall winner. Additional to the relay competitions, we also have data on swimming times of swimmers in individual competitions in the same and other meetings throughout a year. We will use these individual swimming times to compute the ability of each swimmer within a swim style on a certain distance. To mimic most closely the set-up of the theoretical model in the previous section, we focus 2 For a detailed data description, see Neugart and Richiardi (2013), who use the same data to investigate free-riding in sequential team production settings. Another recent paper using data from swimming competitions is Jiang (2016), who studies peer effects. 3 Imhof and Kräkel (2016) distinguish between ex-ante unbalanced tournaments, where ability differences are known to agents at the start of the tournament, and ex-post unbalanced tournaments, where agents find out about ability differences at the end of the tournament. We argue that swimming relay competitions are best thought of as ex-ante unbalanced. 4 We exclude relay competitions with more than four swimmers from our analysis. 5 In a medley swimming competition, each consecutive swimmer swims the distance in a different swimming style: butterfly stroke, backstroke, breaststroke or freestyle. 6

only on the two leading swimmers in a race in our analysis. Distance to and the ability of the swimmer placed third will be included as controls in our regression analyses. We exclude the starting leg of each relay, i.e. the first swimmers, as neither swimmer inherits a lead or lag and it is not clear which two swimmers are leading. 3.1 Computing Effort, Ability and Handicaps Even though effort is not directly observable in our data, we observe the performance of all swimmers in the relay competition. In what follows, we compute effort using the time taken by a swimmer to complete his or her leg of the race compared to the average time of the same swimmer within the same year in individual competitions for the same swim style, same distance and pool type. To make the measure comparable across different types of races, effort is normalized by the average time of all swimmers in relay competitions in the same year, swimming the same distance within the same swim style, gender, pool type and age-group. Formally, we calculate effort for swimmer i in race k as effort i,k = average individual time i,same year relay time i,j average relay time all swimmers, same year. (4) By subtracting the relay time from the average time in individual races, our measure increases the less time a swimmer needs to complete the relay race i.e. the faster he or she swims. To compute relative ability of two swimmers competing in a relay, we calculate the difference in time taken by both swimmers to complete the same distance within the same swim style within the same meeting as the focal relay. We normalize the difference between the times taken by both swimmers again using the average time of all swimmers in relay competitions in the same year, swimming the same distance within the same swim style, gender, pool type and age-group. Formally, we calculate relative ability of swimmer i competing 7

against swimmer j in race k as relative ability i,j,k = individual time j,same meeting individual time i,same meeting average relay time all swimmers, same year (5) By using the average individual time within the same year excluding the current meeting to compute effort, and the individual time from the current meeting to compute relative ability, we avoid any spurious correlation between both measures e.g. induced by the form of a swimmer. The lead or lag of swimmer i competing against swimmer j in race k is computed as: lead/lag i,j,k = starting time j,k starting time i,k average relay time all swimmers, same year. (6) The starting time is the sum of the times taken by previous swimmers of the same team, therefore our measure is positive when a swimmer inherits a lead and negative when he or she inherits a lag. Using the same normalization, we also compute the lead over and the ability of the swimmer positioned third in a race, which we will include as controls in our regression analysis. 6 3.2 Outliers The distribution of our three measures for effort, relative ability and lead/lag shown in figure 2 indicates the presence of outliers in our data. Inspecting several of these outliers showed that they are likely to be the result of measurement problems such as entry errors. Therefore, we chose to truncate the data by excluding observations that are outside the 5 and 95 percentiles of the empirical distributions of the effort, relative ability and lead/lag measures. 6 We use the absolute ability but not the ability difference to the third swimmer as the ability difference would be strongly correlated with our main measure of the difference in ability between the first and the second swimmer. The reason is that a swimmer with a high (low) ability often also has an ability advantage (disadvantage) over the third swimmer. 8

Figure 2: Histograms of Untruncated Distributions of Effort, Relative Ability and Lead/Lag. Density 0 2 4 6 8 10-1 -.5 0.5 1 effort Density 0 2 4 6-1 -.5 0.5 1 Relative Ability Density 0 1 2 3 4-2 0 2 4 Lag or Lead Figure 3: Histograms of Truncated Distributions of Effort, Relative Ability and Lead/Lag. Density 0 5 10 15 -.1 -.05 0.05.1 effort Density 0 2 4 6 -.2 -.1 0.1.2 Relative Ability Density 0 2 4 6 8 -.4 -.2 0.2.4 Lag or Lead Such truncating of data is a common strategy for dealing with outliers and measurement error (e.g. Angrist and Krueger, 1999; Acemoglu et al., 2012). Figure 3 shows histograms of the truncated empirical distributions. The empirical distribution of the lead/lag measure appears to be more skewed, probably due to leads and lags accumulating over consecutive swimmers in a race. 3.3 Summary Statistics Table 1 shows the summary statistics for the 15,097 observations in our final data set. Effort has a positive mean of 0.0122 suggesting that swimmers exert higher effort when competing in a relay as compared to individual competitions. This effect, however, is driven by the fact that the second, third, and fourth swimmer in a relay start upon observing their previous team member touching the wall, which is called a flying start. This decreases their reaction 9

Table 1: Summary Statistics N Mean Sd Min Max Effort 15097 0.0122 0.0278-0.0841 0.0844 Relative Ability 15097 0.00563 0.0621-0.142 0.171 Lead or Lag 15097 0.00272 0.0801-0.259 0.285 Lead over 3rd Swimmer 15097 0.0474 0.0449 0 0.188 Relative Ability 3rd Swimmer 15097 0.960 0.0913 0.644 1.159 Age Group Adult 7208 Below 15 7842 Above 30 42 Total 15092 Swim Style Breaststroke 3196 Butterfly 2527 Freestyle 9374 Total 15097 Type Relay 4 x 100m Freestyle 1606 4 x 100m Medley 3514 4 x 50m Freestyle 5309 4 x 50m Medley 4668 Total 15097 Order Swimmer 2 5792 3 4862 4 4443 Total 15097 time as compared to starts in individual competitions, where swimmers start upon hearing the starting signal. 4 Results In this section, we present the results of our empirical analysis. We start by examining the effects of heterogeneity in relative ability, head starts and handicaps separately, ignoring their theoretically predicted interaction. In a next step, we examine their interaction by looking at the dichotomized effects, i.e. the effects of head starts and handicaps on favorites and underdogs, ignoring additional heterogeneity in these measures. Then, we examine the effects of heterogeneity in our measures. Finally, we present results from semi-parametric 10

estimations of the effects of relative ability and head starts, providing a clearer picture of their interaction. 4.1 Main Effects of Heterogeneity in Ability and Head Starts Figure 4 shows means of effort by relative ability (upper panel) and leads or lags (lower panel). Inspecting these graphs hints at effort being decreasing in both the heterogeneity in relative ability and the size of the lead or lag that a swimmer inherits. In both cases, effort seems to be highest close to zero i.e. in balanced contests. Table 2 shows the results of linear panel regressions including either athlete-level random or fixed effects. For 6986 out of 9908 athletes our data includes only one observation, limiting the amount of within-athlete variation and statistical power of the fixed effects estimation. Throughout the paper, we therefore prefer to present results using both random and fixed effects. As can be seen in table 2 in columns (1) and (2), heterogeneity in ability has a statistical significant negative effect on effort when included as a main effect (p < 0.01 and p < 0.1). In columns (3) and (4), we examine the effect of heterogeneity in ability separately for favorites and underdogs by including an interaction with a dummy indicating whether an athlete is a favorite. The results indicate that heterogeneity in ability has only an effect for favorites., although the result is only statistically significant in case of the random effects regression (p < 0.01). 7 Table 3 shows regression results adding the distance between competitors as a measure of the absolute size of head starts and handicaps. It has a statistically significant negative effect in all specifications (p < 0.01), indicating that a larger distance between competitors decreases effort provision. In columns (3) and (4), an interaction between the distance 7 There are papers in experimental economics showing that underdogs exert higher effort in heterogeneous contests than favorites (Bull et al., 1987). In the field, Bach et al. (2009), Berger and Nieken (2016) and Nieken and Stegh (2010) also find that favorites reduces effort when facing underdogs but not vice-versa. Results seem, however, to be mixed in literature: According to Nieken and Stegh: While Bull et al. (1987) report that in their experiment the mean effort level of disadvantaged subjects was higher than equilibrium effort, Fershtman and Gneezy (2011) show that quitting may depend on the relation of tournament incentives and social costs of quitting. Regarding our setting in hockey the social costs of reducing effort might be higher for underdogs as those teams are the presumable losers of the match. 11

Figure 4: Means of Effort by Relative Ability and Handicaps or Head Starts. Effort (Mean) -.01 -.005 0.005.01.015.02.025.03 -.15 -.1 -.05 0.05.1.15 Relative Ability Effort (Mean) -.01 -.005 0.005.01.015.02.025.03 -.15 -.1 -.05 0.05.1.15 Handicap (Lag) or Head Start (Lead) Notes: Error bars show 95 percent confidence bands. between competitors and whether a swimmer is leading is added, therefore measuring whether a head start has a different effect than a handicap. The coefficient on the interaction is positive and statistically significant in case of the random effects model (p < 0.05), indicating that a larger head start reduces effort provision to a lower degree than a handicap. Overall, we conclude that heterogeneity in ability and the distance between competitors has the predicted effects of decreasing effort provision. 4.2 Interaction Between Ability and Head Starts Now we turn to examining the interaction between ability, head starts and handicaps. This is the central test of the theory: Head starts should increase the effort of underdogs as they compensate for their disadvantage in ability, reducing the effective level of heterogeneity in a contest. By the same argument, handicaps should increase the effort of favorites Our analyses in the previous section, looking at ability and head starts separately, ignored this interaction. We test these predictions by examining how swimmers react to leads or lags inherited from previous team members depending on whether they compete against a faster or slower swimmer, i.e. whether they are the favorite or the underdog in a given match-up. Figure 5 shows a comparison of average effort between swimmers with ability advantages and disadvantages inheriting leads or lags. The main effects of head starts and handicaps 12

Table 2: Impact of Heterogeneity in Relative Ability Dependent variable: Effort (1) (2) (3) (4) Heterogeneity Ability 0.021 0.017 0.003 0.000 (0.006) (0.010) (0.009) (0.015) Favorite Heterogeneity Ability 0.037 0.023 (0.008) (0.015) Athlete RE X X Athlete FE X X Number Athletes 9908 9908 9908 9908 R 2 Overall 0.045 0.043 0.047 0.045 Observations 15092 15092 15092 15092 Clustered standard errors in parentheses p < 0.1, p < 0.05, p < 0.01 Note: This table shows results from linear random and fixed effects regressions. Each individual observation is an athlete observed within a race. Additional controls include the distance to the swimmer placed third, the ability of the third swimmer, athletes age group, the swim style, the type of relay and the order of the swimmer. Only swimmers placed first and second in a relay are included. Standard errors are clustered on the athlete-level. Table 3: Impact of Head starts and Handicaps Dependent variable: Effort (1) (2) (3) (4) Heterogeneity Ability 0.009 0.003 0.015 0.000 (0.009) (0.015) (0.009) (0.015) Favorite Heterogeneity Ability 0.035 0.019 0.039 0.012 (0.008) (0.015) (0.008) (0.015) Distance btw. Competitors 0.048 0.052 0.053 0.040 (0.005) (0.007) (0.006) (0.010) Head Start (Lead) Distance btw. Competitors 0.013 0.016 (0.006) (0.010) Athlete RE X X Athlete FE X X Number Athletes 9908 9908 9888 9888 R 2 Overall 0.053 0.051 0.052 0.048 Observations 15092 15092 15055 15055 Clustered standard errors in parentheses p < 0.1, p < 0.05, p < 0.01 Note: This table shows results from linear random and fixed effects regressions. Each individual observation is an athlete observed within a race. Additional controls include the distance to the swimmer placed third, the ability of the third swimmer, athletes age group, the swim style, the type of relay and the order of the swimmer. Only swimmers placed first and second in a relay are included. Standard errors are clustered on the athlete-level. 13

are as predicted: Underdogs increase their effort by 0.39 units (p < 0.01) when inheriting a lead as compared to underdogs who inherit a lag. Favorites increase their effort by 0.20 units (p < 0.01) when inheriting a lag as compared to favorites inheriting a lead. As all measures are normalized using the average relay time of all swimmers, and leading swimmers lead on average by 5.8 percent of the average relay time, the effect sizes correspond to 4 to 7 percent of an average lead, a sizable effect. Examining the same comparisons within a regression analysis in table 4 confirms these main effects. Figure 6 shows means of effort plotted against how large a head start or handicap of a favorite or underdogs is. Visually inspecting these graphs also indicates that effort of favorites is higher when they inherit a handicap in the form of lag, and effort of underdogs is higher when they inherit a head start in the form of a lead. Table 5 shows regression results of the impact of handicaps on effort for favorites only. The estimated coefficient on the dummy indicating whether a favorite inherits a handicap in the form of a lag is positive and statistically significant (p < 0.05) in columns (1) and (2). This, however, is no longer the case when adding a continuous measure of the distance between competitors and an interaction between this measure and whether a swimmer is lagging in columns (3) and (4). Here, the results are not conclusive, as both the dummy and the interaction are statistically insignificant. Table 6 shows regression results of the impact of head starts on effort for underdogs only. The estimated coefficient on a dummy indicating whether an underdog is leading is positive and statistically significant (p < 0.01) in the random effects model (column 1) although it is not statistically significant in the fixed effects model (column 2). The coefficient remains statistically significant (p < 0.05) when adding the continuous measure of the distance between competitors into the random effects model in column (3). In the fixed effects model in column (4), both the main effect and the interaction between the distance between competitors remain statistically insignificant. Overall, we find evidence that there is an effect of head starts and handicaps on effort as 14

Figure 5: Effort for Favorites and Underdogs by Head Start (Lead) and Handicap (Lags) Mean of Effort.01.012.014.016.018 Underdog Favorite Handicap (Lag) Head Start (Lead) Notes: Error bars show 95 percent confidence intervals. predicted by the theory, however, only in the form of a main effect of leading or lagging, but not in the form of differential impacts of the continuous measures of the lead or lag. 4.3 Semi-Parametric Estimation In this section, we turn to a semi-parametric estimation of the effects of heterogeneity in ability and head starts and handicaps. This is a more appropriate approach compared to the previous parametric regression analyses as it allows for a more flexible estimation of the interaction between ability, head starts and handicaps. According to the model, head starts and handicaps increase effort when they reduce the effective level of heterogeneity in a contest but decrease effort when they increase heterogeneity. However, when this will be actually the case in our empirical setting depends also on how swimmers react to ability differences, which also has to be estimated from the data. 15

Table 4: Impact of Head Starts and Handicaps - Dichotomous Effects Dependent variable: Effort (1) (2) (3) (4) Favorite 0.002 0.001 0.001 0.001 (0.000) (0.001) (0.001) (0.001) Handicap x Favorite 0.002 0.003 (0.001) (0.001) Headstart x Underdog 0.003 0.003 (0.001) (0.001) Athlete RE X X Athlete FE X X Number Athletes 9908 9908 9888 9888 R 2 Overall 0.045 0.044 0.048 0.046 Observations 15092 15092 15055 15055 Clustered standard errors in parentheses p < 0.1, p < 0.05, p < 0.01 Note: This table shows results from linear random and fixed effects regressions. Each individual observation is an athlete observed within a race. Additional controls include the distance to the swimmer placed third, the ability of the third swimmer, athletes age group, the swim style, the type of relay and the order of the swimmer. Only swimmers placed first and second in a relay are included. Standard errors are clustered on the athlete-level. Figure 6: Means of Effort by Handicaps or Head Starts - Favorites and Underdogs (a) Favorites (b) Underdogs Effort (Mean) - Only Favorites -.015 -.01 -.005 0.005.01.015.02.025.03 -.15 -.1 -.05 0.05.1.15 Handicap (Lag) or Head Start (Lead) Effort (Mean) - Only Underdogs -.015 -.01 -.005 0.005.01.015.02.025.03 -.15 -.1 -.05 0.05.1.15 Handicap (Lag) or Head Start (Lead) Notes: Error bars show 95 percent confidence intervals. 16

Table 5: Impact of Heterogeneity in Ability and Handicaps - Favorites Only Dependent variable: Effort (1) (2) (3) (4) Heterogeneity Ability 0.033 0.026 0.027 0.020 (0.008) (0.014) (0.008) (0.014) Handicap (Lag) 0.001 0.003 0.001 0.001 (0.001) (0.001) (0.001) (0.001) Distance btw. Competitors 0.041 0.061 (0.007) (0.012) Handicap (Lag)=1 Distance btw. Competitors 0.005 0.009 (0.012) (0.022) Athlete RE X X Athlete FE X X Number Athletes 5724 5724 5724 5724 R 2 Overall 0.046 0.043 0.053 0.049 Observations 7966 7966 7966 7966 Clustered standard errors in parentheses p < 0.1, p < 0.05, p < 0.01 Note: This table shows results from linear random and fixed effects regressions. Each individual observation is an athlete observed within a race. Additional controls include the distance to the swimmer placed third, the ability of the third swimmer, athletes age group, the swim style, the type of relay and the order of the swimmer. Only swimmers placed first and second in a relay are included. Standard errors are clustered on the athlete-level. Table 6: Impact of Heterogeneity in Ability and Head Starts - Underdogs Only Dependent variable: Effort (1) (2) (3) (4) Heterogeneity Ability 0.011 0.002 0.019 0.001 (0.011) (0.024) (0.011) (0.024) Head Start (Lead) 0.004 0.002 0.002 0.003 (0.001) (0.001) (0.001) (0.002) Distance btw. Competitors 0.060 0.035 (0.009) (0.020) Head Start (Lead) Distance btw. Competitors 0.020 0.014 (0.014) (0.027) Athlete RE X X Athlete FE X X Number Athletes 5710 5710 5710 5710 R 2 Overall 0.054 0.045 0.063 0.052 Observations 7089 7089 7089 7089 Clustered standard errors in parentheses p < 0.1, p < 0.05, p < 0.01 17

To account for this interaction and to estimate it flexibly, we use the following semiparametric approach: In a first step, we estimate a parametric linear regression model using the same specification and control variables as in the previous section with athlete-level fixed effects, but without including ability and leads or lags as dependent variables. In a next step, we use the residuals from this model to estimate the relationship between effort and relative ability, and the lead/lag of swimmer i competing against swimmer j in race k as residual(effort i,j,k ) = f(relative ability i,j,k, lead/lag i,j,k ) (7) where f(.) is a flexible function estimated non-parametrically using the residuals. Figure 7 shows a contour plot of the surface of f(.) estimated using local polynomial regression smoothing (Cleveland and Loader, 1996). In general, the estimated function appears to be concave along the lag/lead dimension, with effort being maximal for a lead/lag between -0.05 and 0.05. The green line tracing the maxima of effort support our previous results and show that favorites exert their highest effort when receiving a lag, while underdogs exert their highest effort when receiving a lead. The slope of the green line indicates that receiving a handicap (head start) that corresponds to approximately a third of the ability advantage (disadvantage) restores maximum effort provision of swimmers in our sample. Although the estimated function appears to be double-peaked with two local maxima close to the extremes of the relative ability and lag/lead distribution, approximately 50 percent of peaks within 100 bootstrap samples of the data where located close to origin where contests are balanced in terms of relative ability and head starts / handicaps. Using the estimated function f(.), we can also compute the contest that maximizes combined effort of the favorite and the underdog. Figure 8 shows the contour plot of the surface of the estimated combined effort function as a function of the relative ability disadvantage of the underdog and the head start / handicap given to the underdog. Combined effort in general is maximized when the underdog receives a handicap corresponding to approximately half of his or her ability disadvantage. Although the global peak of combined effort lies close 18

Figure 7: Semi-parametrically Estimated Impact of Ability and Head Starts on Effort Relative Ability 0.15 0.10 0.05 0.00 0.05 0.10 0.15 0.00085 0.00105 0.002 0.0012 0.0014 0.0017 0.00185 0.00205 0.00225 0.0024 0.0006 0.00145 0.0016 0.0008 0.00195 0.0021 0.0009 0.0011 0.00175 0.00035 0.00055 0.00115 0.0013 0.00165 0.0018 0.00015 0.0003 0.0005 0.0007 0.00135 0.0015 0.00005 0.0001 0.00025 0.00045 0.00125 0.00025 0.00005 0.0002 0.00095 0.00015 0.001 0.0001 0 0.00065 0.00075 0.0003 0.0002 0.0004 0.00035 0.00045 0.0008 0.00075 0.0007 0.0004 0.00065 0.0006 0.00055 0.0005 0.0005 Maximum Effort for Fixed Ability Global Peak (Actual Data) 0.002 Global Peak (Bootstrap Samples) 0.00055 0.0006 0.2 0.1 0.0 0.1 0.2 0.00035 0.0003 0.00005 0.0003 0.00045 0.00205 0.0017 0.0014 0.0011 0.00035 0.0001 0.00015 0.0024 0.0021 0.0028 0.00175 0.00145 0.00115 0.00085 0.0006 0.0004 0.00245 0.0032 0.00285 0.00215 0.0018 0.0015 0.0012 0.0009 0.00065 0.00015 0.0001 0.00025 0.0004 0.0002 0.0025 0.0033 0.0029 0.00365 0.00325 0.0022 0.0037 0.00295 0.00255 0.00185 0.00155 0.00125 0.00095 0.0007 0.00045 0.0002 0.00005 0 0.00415 0.00225 0.0046 0.0042 0.00375 0.00335 0.003 0.0026 0.0019 0.0016 0.0013 0.001 0.00075 0.0005 0.00025 0.0051 0.0047 0.0043 0.00385 0.00345 0.00305 0.00265 0.0023 0.00195 0.00165 0.00135 0.00105 0.0008 0.00055 Lag or Lead Notes: This figure shows a contour plot of the surface of semi-parametrically estimated impact of relative ability and head starts or handicaps on the effort of swimmers. The function is estimated using local polynomial regression smoothing (LOESS) on the residuals from a linear regression controlling for age-group, swim-style, relay-type, starting order, distance to and ability of third swimmer and athlete-level fixed effects. For each point on the (x, y) grid where the function is estimated, 90 percent of observations in its neighborhood are utilized for estimation using tricubic weighting The blue dotted line connects the maxima of the function for every fixed value of relative ability. Bootstrap samples for computing alternative peaks where simulated by resampling the data and re-estimating both the linear regression model and the non-parametric estimation on the residuals. 19

to the edge of the empirical distribution where contests are very unbalanced, about 50 percent of global peaks in a sample of 100 bootstrap samples lie close to the origin where contests are balanced. A conservative interpretation of this result is that an unbalanced contest is equally likely to induce optimal combined effort, given that head starts and handicaps are appropriately chosen to compensate for ability differences. 5 Conclusion Our empirical results show that handicaps restore incentives in heterogeneous tournaments as theory predicts. In our sample, giving a disadvantaged swimmer a head start that compensates for his or her ability differences restores maximal effort. In terms of optimizing combined effort of contestants, we find that unbalanced contests are equally likely to induce optimal combined effort, given that head starts and handicaps are appropriately chosen to compensate for ability differences between contestants. 20

Figure 8: Effort-Maximizing Contests Relative Ability of Underdog 0.14 0.12 0.10 0.08 0.06 0.04 0.02 0.00 0.0044 0.0043 0.0048 0.0047 0.0046 0.0045 0.0051 0.005 0.0049 0.0054 0.0053 0.0052 0.0059 0.0058 0.0057 0.0062 0.0061 0.0065 0.0064 0.0068 0.0056 0.0055 0.007 0.0072 0.0038 0.0037 0.0042 0.0041 0.004 0.0063 0.0036 0.0035 0.0034 0.0039 0.0031 0.003 0.0033 0.0025 0.0028 0.0023 0.0027 0.002 0.0029 0.0032 0.0022 0.0018 0.0026 0.0021 0.0024 0.0014 0.0017 0.0013 0.0015 0.0019 0.001 0.0011 0.0012 0.0016 0.0006 0.0007 0.0009 Global Peak (Actual Data) Global Peak (Bootstrap Samples) 0.0008 0.0002 0.0004 0.0005 0 0.0003 0.0001 0.0001 0.0003 0.0004 0.0002 0.0005 0.0006 0.0007 0.0008 0.0009 0.001 0.0011 0.0012 0.0013 0.0008 0.0014 0.0005 0.0009 0.0007 0 0.0004 0.0002 0.0004 0.0002 0.0006 0.0012 0.0001 0.0006 0.0003 0.0018 0.0003 0.0008 0.0027 0.0017 0.0001 0.0024 0.0021 0.0007 0.0033 0.0015 0.0005 0.003 0.0029 0.0026 0.0023 0.002 0.0014 0.0011 0.0009 0.004 0.0037 0.0034 0.0031 0.0028 0.0025 0.0022 0.0019 0.0016 0.0013 0.001 0.2 0.1 0.0 0.1 0.2 Lag or Lead of Underdog Notes: This figure shows a contour plot of the surface of the combined effort of the underdog and favorite as a function of relative ability of the underdog and the head start / handicap she receives. Effort is estimated using the same semi-parametric estimation as in figure 7. Bootstrap samples to compute alternative peaks where also resampled using the same procedure as previously. 21

References Acemoglu, D., Johnson, S., and Robinson, J. A. (2012). The colonial origins of comparative development: An empirical investigation: Reply. American Economic Review, 102 (6), 3077 3110. Angrist, J. D., and Krueger, A. B. (1999). Empirical strategies in labor economics. In Handbook of labor economics, vol. 3, 1277 1366, Elsevier. Bach, N., Gürtler, O., and Prinz, J. (2009). Incentive Effects in Tournaments with Heterogeneous Competitors - an Analysis of the Olympic Rowing Regatta in Sydney 2000. Management Revue, 20 (3), 239 253. Baik, K. (1994). Effort Levels in Contests with Two Asymmetric Players. Southern Economic Journal, 61, 367 378. Balafoutas, L., and Sutter, M. (2012). Affirmative action policies promote women and do not harm efficiency in the laboratory. Science, 335 (6068), 579 582. Bamieh, O. (2016). The Adverse Incentive Effects of Heterogeneity in Tournaments Empirical Evidence from The German Bundesliga. Working Paper. Berger, J., and Nieken, P. (2016). Heterogeneous contestants and the intensity of tournaments: An empirical investigation. Journal of Sports Economics, 17 (7), 631 660. Boudreau, K. J., Lakhani, K. R., and Menietti, M. (2016). Performance responses to competition across skill levels in rank-order tournaments: field evidence and implications for tournament design. The RAND Journal of Economics, 47 (1), 140 165. Brown, A., and Chowdhury, S. M. (2017). The hidden perils of affirmative action: Sabotage in handicap contests. Journal of Economic Behavior and Organization, 133, 273 284. Brown, J. (2011). Quitters Never Win: The (Adverse) Incentive Effects of Competing with Superstars. Journal of Political Economy, 119, 982 1013. Bull, C., Schotter, A., and Weigelt, K. (1987). Tournaments and Piece Rates: An Experimental Study. Journal of Political Economy, 95, 1 33. Calsamiglia, C., Franke, J., and Rey-Biel, P. (2013). The Incentive Effects of Affirmative Action in a Real Effort Tournament. Journal of Public Economics, 98, 15 31. Cleveland, W. S., and Loader, C. (1996). Smoothing by local regression: Principles and methods. In Statistical Theory and Computational Aspects of Smoothing, 10 49, Springer. Cowgill, B. (2015). Competition and productivity in employee promotion contests. Working Paper. Denter, P., and Sisak, D. (2016). Head starts in dynamic tournaments? Economics Letters, 149, 94 97. 22

Deutscher, C., Frick, B., Gürtler, O., and Prinz, J. (2013). Sabotage in tournaments with heterogeneous contestants: Empirical evidence from the soccer pitch. The Scandinavian Journal of Economics, 115 (4), 1138 1157. Drugov, M., and Ryvkin, D. (2017). Biased Contests for Symmetric Players. Games and Economic Behavior, 103, 116 144. Feinberg, W. (2003). Affirmative action. In The Oxford Handbook of Practical Ethics. Fershtman, C., and Gneezy, U. (2011). The tradeoff between performance and quitting in high power tournaments. Journal of the European Economic Association, 9 (2), 318 336. Franke, J. (2012). The Incentive Effects of Leveling the Playing Field - An Empirical Analysis of Amateur Golf Tournaments. Applied Economics, 65, 1193 1200. Green, J., and Stokey, N. (1983). A Comparison of Tournaments and Contracts. Journal of Political Economy, 91, 349 365. Imhof, L., and Kräkel, M. (2016). Ex post unbalanced tournaments. The RAND Journal of Economics, 47 (1), 73 98. Jiang, L. (2016). Splash with a teammate: Peer effects in high-stakes tournaments. Working Paper. Konrad, K. (2009). Strategy and Dynamics in Contests. Oxford University Press. Lazear, E. P., and Rosen, S. (1981). Rank-Order Tournaments as Optimal Labor Contracts. Journal of Political Economy, 89, 841 864. Leibbrandt, A., Wang, L. C., and Foo, C. (2017). Gender quotas, competitions, and peer review: Experimental evidence on the lash against women. Management Science. Lemus, J., and Marshall, G. (2017). Dynamic tournament design: An application to prediction contests. Working Paper. Moldovanu, B., and Sela, A. (2001). The Optimal Allocation of Prizes in Contests. American Economic Review, 91 (3), 542 558. Nalebuff, B. J., and Stiglitz, J. E. (1983). Prizes and Incentives: Towards a General Theory of Compensation and Competition. The Bell Journal of Economics, 14, 21 43. Neugart, M., and Richiardi, M. G. (2013). Sequential Teamwork in Competitive Environments: Theory and Evidence from Swimming Data. European Economic Review, 63, 186 205. Nieken, P., and Stegh, M. (2010). Incentive effects in asymmetric tournaments empirical evidence from the german hockey league. SFB/TR 15 Discussion Paper. O Keeffe, M., Viscusi, W. K., and Zeckhauser, R. J. (1984). Economic Contests: Comparative Reward Schemes. Journal of Labor Economics, 2, 27 56. 23

Schotter, A., and Weigelt, K. (1992). Asymmetric Tournaments, Equal Opportunity Laws, and Affirmative Action: Some Experimental Results. Quarterly Journal of Economics, 107 (2), 511 539. Sunde, U. (2009). Heterogeneity and Performance in Tournaments: A Test for Incentive Effects using Professional Tennis Data. Applied Economics, 41, 3199 3208. Sutter, M., Glätzle-Rützler, D., Balafoutas, L., and Czermak, S. (2016). Cancelling out early age gender differences in competition: an analysis of policy interventions. Experimental Economics, 19 (2), 412 432. Szymanski, S., and Valletti, T. M. (2005). Incentive Effects of Second Prizes. European Journal of Political Economy, 21 (2), 467 481. 24