Percentage. Year. The Myth of the Closer. By David W. Smith Presented July 29, 2016 SABR46, Miami, Florida

Similar documents
Figure 1. Winning percentage when leading by indicated margin after each inning,

Table 1. Average runs in each inning for home and road teams,

Average Runs per inning,

Why We Should Use the Bullpen Differently

Team Number 6. Tommy Hanson v. Atlanta Braves. Side represented: Atlanta Braves

Matt Halper 12/10/14 Stats 50. The Batting Pitcher:

Starting pitching staffs and pitching rotations. By David W. Smith Presented July 9, 2011 at SABR41, Long Beach, California

2014 National Baseball Arbitration Competition

Expansion: does it add muscle or fat? by June 26, 1999

Major League Baseball Offensive Production in the Designated Hitter Era (1973 Present)

The Rise in Infield Hits

It s conventional sabermetric wisdom that players

JEFF SAMARDZIJA CHICAGO CUBS BRIEF FOR THE CHICAGO CUBS TEAM 4

Jenrry Mejia v. New York Mets Submission on Behalf of the New York Mets Midpoint: $2.6M Submission by Team 32

When Should Bonds be Walked Intentionally?

A Remark on Baserunning risk: Waiting Can Cost you the Game

2013 National Baseball Arbitration Competition. Tommy Hanson v. Atlanta Braves. Submission on behalf of Atlanta Braves. Submitted by Team 28

Machine Learning an American Pastime

Opleiding Informatica

LEE COUNTY WOMEN S TENNIS LEAGUE

How to Make, Interpret and Use a Simple Plot

Shedding Light on Motion Episode 4: Graphing Motion

2014 Tulane National Baseball Arbitration Competition Jeff Samardzija v. Chicago Cubs (MLB)

College Teaching Methods & Styles Journal First Quarter 2007 Volume 3, Number 1

What s UP in the. Pacific Ocean? Learning Objectives

Clutch Hitters Revisited Pete Palmer and Dick Cramer National SABR Convention June 30, 2008

2014 National Baseball Arbitration Competition

Draft - 4/17/2004. A Batting Average: Does It Represent Ability or Luck?

Future Expectations for Over-Performing Teams

Jenrry Mejia v. New York Mets Submission on Behalf of New York Mets Midpoint: $2.6 Million Submission by Team 18

BABE: THE SULTAN OF PITCHING STATS? by. August 2010 MIDDLEBURY COLLEGE ECONOMICS DISCUSSION PAPER NO

Calculation of Trail Usage from Counter Data

A V C A - B A D G E R R E G I O N E D U C A T I O N A L T I P O F T H E W E E K

LEE COUNTY WOMEN S TENNIS LEAGUE

Nebraska Births Report: A look at births, fertility rates, and natural change

Modern volleyball aspects

Bicycle and Pedestrian Level of Service Evaluation

Additional On-base Worth 3x Additional Slugging?

Travel and Rider Characteristics for Metrobus

July 2011, Number 62 ALL-WAYS TM NEWSLETTER

Assessment Summary Report Gulf of Mexico Red Snapper SEDAR 7

The Aging Curve(s) Jerry Meyer, Central Maryland YMCA Masters (CMYM)

Golfers in Colorado: The Role of Golf in Recreational and Tourism Lifestyles and Expenditures

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together

a) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5

A Brief Shutdown Innings Study. Bob Mecca

Review of A Detailed Investigation of Crash Risk Reduction Resulting from Red Light Cameras in Small Urban Areas by M. Burkey and K.

OPERATIONAL AMV PRODUCTS DERIVED WITH METEOSAT-6 RAPID SCAN DATA. Arthur de Smet. EUMETSAT, Am Kavalleriesand 31, D Darmstadt, Germany ABSTRACT

DP/Flex Chalk Talk. Dee Abrahamson. NCAA Softball Secretary Rules Editor. September Dee SRE 9-13

2013 National Baseball Arbitration Competition

Eventing Risk Management Programme Statistics ( )

A New Chart for Pitchers and My Top 10 Pitching Thoughts Cindy Bristow - Softball Excellence

SAP Predictive Analysis and the MLB Post Season

The potential impact of high density docking stations on Bike Share distribution activities Biella Research June 2016

Shotgun Chokes Which choke should I use?

A Markov Model of Baseball: Applications to Two Sluggers

Capital Bikeshare 2011 Member Survey Executive Summary

GREG PITCHER COMPLETE RECORD

THE BOOK--Playing The Percentages In Baseball

2014 NATIONAL BASEBALL ARBITRATION COMPETITION

The Impact of TennCare: A Survey of Recipients 2009

Analyses of the Scoring of Writing Essays For the Pennsylvania System of Student Assessment

PREDICTING the outcomes of sporting events

Non-motorized Transportation Planning Resource Book Mayor s Task Force on Walking and Bicycling City of Lansing, Michigan Spring 2007 pg.

When you think of baseball, you think of a game that never changes, right? The

Crease Play. The Crease Defined. Teaching Crease Play. Individual Crease Play

IN BASEBALL PITCHERS CHRIS O LEARY

Winning Trainers 2.0. Numbers. Proof that Winning Trainers 2.0 Works!

Growth: Humans & Surf Clams

What is going on in modern volleyball

Hitting with Runners in Scoring Position

SCIENTIFIC COMMITTEE THIRTEENTH REGULAR SESSION. Rarotonga, Cook Islands 9-17 August 2017

Chapter 2 - Displaying and Describing Categorical Data

Chapter 3 - Displaying and Describing Categorical Data

TG GUIDELINES CONCERNING CALIBRATION INTERVALS AND RECALIBRATION

12. School travel Introduction. Part III Chapter 12. School travel

Insurance. The practical application. of credibility models in the rating of Health

At each type of conflict location, the risk is affected by certain parameters:

Should pitchers bat 9th?

There are 3 sections to the home page shown in the first screenshot below. These are:

Baseball Scorekeeping for First Timers

Regression to the Mean at The Masters Golf Tournament A comparative analysis of regression to the mean on the PGA tour and at the Masters Tournament

WHAT CAN WE LEARN FROM COMPETITION ANALYSIS AT THE 1999 PAN PACIFIC SWIMMING CHAMPIONSHIPS?

Exemplar for Internal Achievement Standard. Mathematics and Statistics Level 1

Men s Best Shots Poll

Running head: DATA ANALYSIS AND INTERPRETATION 1

Pitching Performance and Age

Anatomy of a Homer. Purpose. Required Equipment/Supplies. Optional Equipment/Supplies. Discussion

Gouwan Strike English Manual

Effectiveness of Red Light Cameras in Chicago: An Exploratory Analysis

Analysis of iracing Race Summary and Lap Data

2013 Tulane National Baseball Arbitration Competition

Los Angeles District 4 Data Analysis Report

Revision UCL Reconstruction. No Disclosures

Queue analysis for the toll station of the Öresund fixed link. Pontus Matstoms *

2018 Winter League N.L. Web Draft Packet

I. General Coaching Tips

A Hare-Lynx Simulation Model

Application Block Library Fan Control Optimization

The Influence of Free-Agent Filing on MLB Player Performance. Evan C. Holden Paul M. Sommers. June 2005

Transcription:

The Myth of the Closer By David W. Smith Presented July 29, 216 SABR46, Miami, Florida Every team spends much effort and money to select its closer, the pitcher who enters in the ninth inning to seal the deal and nail down a win. Of course, this now universal use of a closer only became the norm relatively recently and the reason given is simple: teams feel their chance of winning is increased by having this ace specialist. I decided to look at this assumption more closely and came to a probably startling conclusion: it isn t true that closers increase the chance for a team to win, or at least it is marginally true at best. I will start by presenting a graph that looks a lot like a conclusion. Then I will work backward and take it apart to look for explanations. As a starting point, I will use the term closer to refer to a pitcher entering in the 9 th inning in a save situation but we will return to examine that definition a little later. Here is the summary in Figure 1, covering 1912 to 215. Figure 1. Closer Entry and Relation to Winning, 1912-215 1. Percentage 8. 6. 4. 2.. 191 193 195 197 199 21 The blue line is the percentage of games in which a team entered the 9 th inning with a lead of three runs or fewer, the modern definition of a save situation. The red line shows the percentage of those games in which a relief pitcher was brought in to start the 9 th. Finally the green line is winning percentage in these games. There are four immediate conclusions. First, the chance of a team starting the 9 th inning in a save situation (about 25%) is essentially unchanged over the past 14 seasons, covering 176791 games. Second, from 1912 to about 198, there was very little change in the frequency of bringing in a new pitcher, being stable at about 1 %. Third, things started to change in 198 and there is a dramatic increase in using a new pitcher and it

now happens over 9% of the time. Fourth and most importantly the chance of winning the game in these situations has shown amazingly little variation at a bit over 9%. No matter when one wishes to claim the era of the closer began, it is clear from this graph that closers do not increase the chance of winning the game. Of course, this analysis does not explain individual games, but on the aggregate level, it is clear that in terms of wins, there is no longterm benefit to the closer strategy. One must wonder why it is employed so universally when there is no advantage to doing so. Although it would be tempting to stop the analysis here with these conclusions, I decided to go a bit further to see if I could deconstruct the results to elucidate underlying features that may have led to the current pattern. The first thing I did is pretty obvious, which is to look at the three different kinds of save situation, namely leads of one, two, or three runs. These results are in Figure 2. Figure 2. Winning Percentage as Function of 9 th Inning Lead when Reliever Entered, 1912-215 1 Percentage 8 6 4 2 191 193 195 197 199 21 The scale is the same as in Figure 1 to facilitate comparisons. The expected pattern is observed, namely that entering with a three run lead translates to a win an average of 97% of the time, two runs, 93% and one run, 84%. The winning percentages presented so far are for all 9 th inning save situations. In order to explore the closer effect more explicitly, I subdivided these appearances as follows: 1. New relief pitcher. 2. Relief pitcher who was already in the game or starter still in. I present the results from 198 through 215, since it is clear from the first graph that this period is the time of rapid change in 9 th inning relief pitcher usage and is the appropriate focus to address the effect of closers. Figure 3. Winning percentage with and without a reliever to start 9 th inning, 198-215.

Percentage Winning Percentage With and Without Reliever to Start 9th 1 8 6 4 2 198 199 2 21 There really are two lines in this graph, with the blue line for games with a reliever to start the 9 th and the red line when a new pitcher does not start the inning. The percentage scale is the same as in previous graphs for the sake of comparison. However, to see the picture more clearly, Figure 4 presents the same data, but with the percentage axis expanded. Figure 4. Winning percentage with and without a reliever to start 9 th inning, 198-215. Percentage 1 95 9 85 Winning Percentage With and Without Reliever to Start 9th 8 198 199 2 21 It is now clearer that there are two separate lines, with more variation in the games with the new pitcher. The average winning percentages over these years are 9.8 when a reliever starts the 9 th inning and 91.2 when the same pitcher continues from the 8 th, whether starter or reliever.

I expanded this result by separating the 9 th inning relievers into closers and non-closers. Of course, this requires a definition of closer and there are many ways to do that. I finally settled on the following two criteria: 1. Minimum of 4 relief appearances (pro-rated for strike years) 2. Minimum of 5% of appearances in save situations. How many men met both of these criteria from 198-215? Figure 5. Percentage of teams with closer, 198-215 Percentage Percentage of Teams with Closer 1 8 6 4 2 198 199 2 21 The results are expressed as a percentage because there were different numbers of teams in this period, due to expansions. Based on these results, it looks like the closer era did not begin in earnest until around 199 or even later. For example, in 198, there were only six pitchers who met these criteria and 211 is the only year that all the teams had a pitcher who did. What about individual pitchers? Who were closers for the most seasons? There are 11 men who had at least 1 seasons as a closer by my definition. They are: Table 1. Closers for most seasons, 198-215 Mariano Rivera 16 Trevor Hoffman 15 Lee Smith 13 John Franco 12 Billy Wagner 12 Dennis Eckersley 1 Roberto Hernandez 1 Troy Percival 1 Joe Nathan 1

Jonathan Papelbon 1 Huston Street 1 Using this definition of closer, I revisited the previous graph which showed a small benefit to not bringing in a new pitcher for the 9 th inning. Figure 5 separates the 9 th inning relievers into ace closers and others. Figure 6. Winning percentage in relation to using an ace closer to start the 9 th inning. Percentage Winning Percentage With and Without Ace Closer to Start 9th 1 8 6 4 2 198 199 2 21 The blue line is the winning percentage when an ace closer enters and the red is the result when a someone else starts the 9th. There is an advantage to using the ace here, with a winning percentage difference of 92 vs 88. All of my analysis to this point has addressed the effect of 9 th inning pitcher usage on the chance of a team s winning the game, which is of course the ultimate purpose and should drive these decisions. However, there is another aspect to consider, which is the performance of pitchers in the 9 th inning, which will also quite reasonably affect a manager s decision to change pitchers or not. I chose two parameters to examine the effectiveness of pitchers in the 9 th inning WHIP (walks plus hits per inning) ERA WHIP is more directly related to the individual pitcher, whereas ERA can depend on subsequent pitchers dealing with men left on base. These two measures were applied to three categories of pitcher: 1. Starters still in the game in the 9 th. 2. Relievers not starting the 9 th (may already be in game or enter during 9 th ). 3. Relievers starting the 9 th.

First, let s look at the overall variation in WHIP (blue line) and ERA (red line) from 198-215, as shown in Figure 7. Figure 7. WHIP and ERA for all pitchers in 9 th inning, 198-215 WHIP and ERA WHIP and ERA in 9th Inning 5 4 3 2 1 198 199 2 21 ERA clearly varies more over these 36 seasons than WHIP does. Table 2 summarizes these results and compares them to overall performance in all innings. Table 2. WHIP and ERA, 198-215 WHIP ERA Overall 1.37 4.16 9 th inning 1.29 3.6 Therefore, the 9 th inning performance in these two measures has been is a bit better than the rest of the game, especially in ERA. What about the three categories of pitchers I defined a moment ago? Table 3 summarizes these breakdowns. Table 3. WHIP and ERA in 9 th inning, 198-215 WHIP ERA Starter 1.27 3.9 Reliever not entering to begin ninth 1.33 3.56 Reliever entering to begin ninth 1.28 3.54

There is a big variation in the results for the starters in recent years because so few starters are still pitching in the 9 th and the sample size is therefore very small, leading to more noise in the data. So both WHIP and ERA are lower for the pitchers selected to start the 9 th compared to other relievers, but not dramatically so. In terms of WHIP the relievers are only slightly different from the starters, but there is a larger difference in ERA. I suspect the ERA difference arises in part from starters leaving the game with men on base who later score. At this point, it is clear to me that the modern use of a closer to increase the chance of winning is a myth. Of course, it is actually much more complicated than that. The modern era of ubiquitous closer usage has other consequences. The well-known pattern for many teams is now: Starter pitchers 6 innings, hopefully leaving with a lead 7 th Inning specialist enters 8 th Inning specialist enters, also termed the setup man Closer finishes off the win Any given game will likely not follow that pattern, but it is enough of a template that teams organize their staffs around it. It is common now to refer to the back end of the bullpen which means all three of these specialists, not just the closer. There are two main reasons I hear offered for this new reality: 1) it results in fewer innings and pitches, especially for starters; and 2) each man in the bullpen knows his role and will presumably maximize his success with the appropriate mental preparation. I cannot speak to the mental aspect, although I am rather skeptical about it, but distributing the workload in a predictable way to lessen the physical strain makes sense to me. How did we get to this pattern? After all, as I showed above, modern closer usage is a relatively recent phenomenon (2-25 years) and not all teams got there at the same time. I decided to review some long term trends which I have reported on before, as have others. The percentage of complete games has dropped steadily since 191, as shown in Figure 8. Figure 8. Percentage of Complete Games, 191-215

Percentage Complete Games, 191-215 1 8 6 4 2 19 192 194 196 198 2 There are a couple of interesting blips during both World Wars and the advent of the DH. The high point was 87.6% complete games in 192 and the low is 2.1%, set in 215. The corresponding piece of information is the number of relievers per game, as shown in Figure 9. Figure 9. Relief Pitchers pre Game, 191-215 Relievers per Game Relief Pitchers, 191-215 4 3 2 1 19 192 194 196 198 2 As expected, this graph is essentially the mirror image of the previous one, including the blips. The low value was.15 relievers per game in 191, which is about one reliever for every 7 starts and the high value was 215, when it reached 3.1 per game, the all-time high. One of the fascinating aspects of this pattern concerns the advent of the DH in 1973. To illustrate this in more detail, I focused on the years 1965 to 1995. The percentages for each league are shown in Figure 1.

Figure 1. Complete Game Percentage by League, 1965-1995. 4 AL and NL Complete Games, 1965-95 Percentage 3 2 1 1965 197 1975 198 1985 199 1995 The AL values are in blue and the NL in red. From 1965-1972, the two leagues were similar in complete game percentage, but in 1973 the AL percentage jumped way up for the first time in over 2 years, while the NL dropped. It is easy to explain this as the AL managers were adjusting to the realities of having the DH, which was implemented in 1973. However over the next15 years, the gap continually narrowed and the two leagues are now nearly identical. This correlates nicely with the increased relief pitcher usage so that the different world for starting pitchers when there is a DH no longer carries over to the end of the game. There are other meaningful measures for the shifting balance between starters and relievers, including ERA, innings pitched, and batters faced per game. ERA has varied a great deal during the last 96 seasons, as shown in Figure 11. Figure 11. Major League ERA, 192-215

ERA Major League ERA 5 4.5 4 3.5 3 2.5 192 194 196 198 2 The high value of 4.81 came in 193 and the low point of 2.98 occurred in 1968, the of the Pitcher. The 215 value of 3.96 is almost identical to the 96-year average of 3.98. Of course, my focus here is on relief pitchers so I calculated relief and starter ERA separately. Rather than present a graph with two lines that can be noisy and hard to interpret, I present in Figure 12, the difference between them, with the reported value being reliever ERA minus starter ERA. Figure 12. Difference in Starter and Reliever ERA, 192-215 Reliever - Starter ERA Difference.6.4.2 -.2 -.4 -.6 -.8 192 194 196 198 2 The heavy red line in the middle is at zero which would occur if starters and relievers had the same ERA. In fact, we see that with very few exceptions (1938-1941 and 1952-1953), from 192 through 1953, the starters had the better ERA. Since 1954, the relievers have had better ERA every year except 1968 and 1969, when starters were lower by very small amounts:.2 and.5, respectively. This is a remarkable result and goes to the question of how individual

pitchers are chosen to be starters or relievers. My summary is that from 192 to 1953, the pattern was that the starters had better performance by this measure and since then the balance has switched and relievers now have lower ERA, with recent values in the vicinity of.4 runs better, or nearly half a run a game. The biggest difference was.62 runs in 1982. This fits with the notion that relievers used to be those pitchers who weren t good enough to start, but now relievers are groomed (not just closers) for these roles. Perhaps they do well because they know they will have shorter stints, perhaps leading to higher velocity on a more consistent basis. This pattern of progressively shorter outings is shown clearly in Figure 13. Figure 13. Batters Faced per Appearance by Starters and Relievers, 1912-215 BF per Appearance Starter and Relief Batters Faced 35 3 25 2 15 1 5 191 193 195 197 199 21 The continuous decline is obvious with starters dropping from a high of 3.7 to the current low of 24.5 and relievers decreasing from 12.8 to 4.3. These lines are not strictly additive since as we saw in Figure 9, there are over three times as many relief pitchers per game in 215 than in 1912 and the data in Figure 13 are calculated on a per appearance basis. Presumably the shorter outings may save wear and tear on pitchers arms. In addition, as I have reported before (Figure 11 in http://retrosheet.org/research/smithd/batting%2order%2lineup26.pdf), pitchers have a significant advantage against batters in their first matchup of each game. That is, batters learn during the game. In the last 25 years, the number of times a relief pitcher faces a given batter twice is vanishingly small, especially for those who pitch at the end of the game. This is a major contributing factor to the improved ERA for relievers. Finally, I must address the statistical category of saves. It is common to hear announcers in the 8 th inning explain that the closer will only enter in a save situation and that if the lead were to grow to four runs, then someone else would pitch the ninth. This situation is often accompanied by a view of the bullpen showing two pitchers warming up side by side. Of all the aspects of closer usage, this is the one that troubles me the most. The decision of which pitcher to bring in

is strongly determined by a statistical category, the save. I can think of no other example in which a decision is driven by something other than the chance to win the game. Cardinal manager Mike Matheny said: The save stat cannot be ignored; there are contracts involved. Others have addressed the use of closers: Bill James suggests that using an ace in the 7 th inning in a crucial situation may be more important; I agree. Mike Emeigh believes that closers are more important in extra innings. Wayne Towers has a presentation tomorrow afternoon on the possible changing meaning of high save totals. Conclusions 1. The entry of a new pitcher to start the 9 th inning has increased dramatically since 198. 2. The presence of this new pitcher has had almost no effect on a team s chances to win. 3. Ace closers bring slightly more wins than other 9 th inning pitchers (92% vs 88%) 4. Performance of 9 th inning pitchers is almost indistinguishable between closers and others. 5. Increased use 9 th inning pitchers correlates with overall increase of relief pitchers. 6. Pitchers have had progressively shorter stints for over 1 years. 7. Current pattern of closer usage is not justified by their contributions to team wins.