Figure 1. Winning percentage when leading by indicated margin after each inning,

Similar documents
Percentage. Year. The Myth of the Closer. By David W. Smith Presented July 29, 2016 SABR46, Miami, Florida

Table 1. Average runs in each inning for home and road teams,

Average Runs per inning,

Starting pitching staffs and pitching rotations. By David W. Smith Presented July 9, 2011 at SABR41, Long Beach, California

Why We Should Use the Bullpen Differently

Expansion: does it add muscle or fat? by June 26, 1999

Major League Baseball Offensive Production in the Designated Hitter Era (1973 Present)

Matt Halper 12/10/14 Stats 50. The Batting Pitcher:

Additional On-base Worth 3x Additional Slugging?

Clutch Hitters Revisited Pete Palmer and Dick Cramer National SABR Convention June 30, 2008

The Influence of Free-Agent Filing on MLB Player Performance. Evan C. Holden Paul M. Sommers. June 2005

Minimum Quality Standards in Baseball and the Paradoxical Disappearance of the.400 Hitter

JEFF SAMARDZIJA CHICAGO CUBS BRIEF FOR THE CHICAGO CUBS TEAM 4

Team Number 6. Tommy Hanson v. Atlanta Braves. Side represented: Atlanta Braves

The Rise in Infield Hits

2014 National Baseball Arbitration Competition

When Should Bonds be Walked Intentionally?

Jenrry Mejia v. New York Mets Submission on Behalf of the New York Mets Midpoint: $2.6M Submission by Team 32

College Teaching Methods & Styles Journal First Quarter 2007 Volume 3, Number 1

Machine Learning an American Pastime

Chapter 2 - Displaying and Describing Categorical Data

Chapter 3 - Displaying and Describing Categorical Data

Do Clutch Hitters Exist?

It s conventional sabermetric wisdom that players

BABE: THE SULTAN OF PITCHING STATS? by. August 2010 MIDDLEBURY COLLEGE ECONOMICS DISCUSSION PAPER NO

Salary correlations with batting performance

A Brief Shutdown Innings Study. Bob Mecca

Jonathan White Paper Title: An Analysis of the Relationship between Pressure and Performance in Major League Baseball Players

Simulating Major League Baseball Games

A V C A - B A D G E R R E G I O N E D U C A T I O N A L T I P O F T H E W E E K

A Hare-Lynx Simulation Model

B aseball is America s pastime, and with attendance and

Alpine Safety Research, German Alpine Club

Lab Report Outline the Bones of the Story

2014 National Baseball Arbitration Competition

TRAFFIC CRASHES involving BICYCLISTS

Analysis of performance at the 2007 Cricket World Cup

Los Angeles District 4 Data Analysis Report

a) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5

Lesson 22: Average Rate of Change

2013 National Baseball Arbitration Competition. Tommy Hanson v. Atlanta Braves. Submission on behalf of Atlanta Braves. Submitted by Team 28

Raymond HV Gallucci, PhD, PE;

How Effective is Change of Pace Bowling in Cricket?

Quantitative Literacy: Thinking Between the Lines

A Remark on Baserunning risk: Waiting Can Cost you the Game

An average pitcher's PG = 50. Higher numbers are worse, and lower are better. Great seasons will have negative PG ratings.

When you think of baseball, you think of a game that never changes, right? The

Draft - 4/17/2004. A Batting Average: Does It Represent Ability or Luck?

Dick Bowdler Acoustic Consultant

Chapter. 1 Who s the Best Hitter? Averages

6 NATIONS 2004 STATISTICAL REVIEW AND MATCH ANALYSIS

Compression Study: City, State. City Convention & Visitors Bureau. Prepared for

Chapter 3 Displaying and Describing Categorical Data

CHAPTER 1 ORGANIZATION OF DATA SETS

2018 Winter League N.L. Web Draft Packet

Should pitchers bat 9th?

What is going on in modern volleyball

The Changing Hitting Performance Profile In the Major League, September 2007 MIDDLEBURY COLLEGE ECONOMICS DISCUSSION PAPER NO.

The LEAGUE THAT MATTERS Rules effective January 2018 Page 1 of 5. League Rules effective as of 20 January 2018

WORLD. Geographic Trend Report for GMAT Examinees

The Impact of TennCare: A Survey of Recipients 2009

A PRIMER ON BAYESIAN STATISTICS BY T. S. MEANS

Pitching Performance and Age

Internet Technology Fundamentals. To use a passing score at the percentiles listed below:

Chapter 2: Modeling Distributions of Data

The benefits of improved car secondary safety

A Markov Model of Baseball: Applications to Two Sluggers

Pitching Performance and Age

Acknowledgement: Author is indebted to Dr. Jennifer Kaplan, Dr. Parthanil Roy and Dr Ashoke Sinha for allowing him to use/edit many of their slides.

ECO 199 GAMES OF STRATEGY Spring Term 2004 Precept Materials for Week 3 February 16, 17

Insurance. The practical application. of credibility models in the rating of Health

MLB SHOWDOWN DCI Floor Rules Tournament Season Effective June 15, 2000

UnOPS (OPS w/o E minus OPS w/ E)

Analysis of the Article Entitled: Improved Cube Handling in Races: Insights with Isight

How to Make, Interpret and Use a Simple Plot

2014 Tulane National Baseball Arbitration Competition Jeff Samardzija v. Chicago Cubs (MLB)

DP/Flex Chalk Talk. Dee Abrahamson. NCAA Softball Secretary Rules Editor. September Dee SRE 9-13

Bulldog Baseball Player Guide

CS 221 PROJECT FINAL

AMERICA S MAIL LEAGUE CONSTITUTION. Updated October 13, (2011 points of emphasis or changes in BOLD)

2014 National Baseball Arbitration Competition

Two Machine Learning Approaches to Understand the NBA Data

The pth percentile of a distribution is the value with p percent of the observations less than it.

AggPro: The Aggregate Projection System

Surf Survey Summary Report

2010 TRAVEL TIME REPORT

2015 Winter Combined League Web Draft Rule Packet (USING YEARS )

Reduction of Speed Limit at Approaches to Railway Level Crossings in WA. Main Roads WA. Presenter - Brian Kidd

Forecasting Baseball

Practice Test Unit 6B/11A/11B: Probability and Logic

2013 Tulane National Baseball Arbitration Competition

Analysis of iracing Race Summary and Lap Data

Quantitative Risk Analysis (QRA)

Should bonus points be included in the Six Nations Championship?

Practice Test Unit 06B 11A: Probability, Permutations and Combinations. Practice Test Unit 11B: Data Analysis

SAP Predictive Analysis and the MLB Post Season

Relationship Between Missing Ballast and Development of Track Geometry Defects

Effectiveness of Red Light Cameras in Chicago: An Exploratory Analysis

NordFoU: External Influences on Spray Patterns (EPAS) Report 16: Wind exposure on the test road at Bygholm

Relative Value of On-Base Pct. and Slugging Avg.

Transcription:

The 7 th Inning Is The Key By David W. Smith Presented June, 7 SABR47, New York, New York It is now nearly universal for teams with a 9 th inning lead of three runs or fewer (the definition of a save situation since 975), to bring in a specialist (a closer ) to pitch. This strategy reflects the belief that the chances of winning the game are enhanced by the pitching change. I demonstrated last year that there is in fact no significant improvement in the chance of winning by this reliance on new pitchers for the 9 th inning. For those who did not attend my presentation on closers at last year s SABR convention, you can find the complete paper on the Retrosheet site at: (http://retrosheet.org/research/smithd/mythofthecloser.pdf). Since so few games are decided in the 9 th inning, the logical question is to ask at what point in the game does the winning team take the decisive lead. We know that more runs are scored in the first inning than in any other, especially by the home team. I examined this in detail at the Houston SABR meeting in 4. That presentation is also on the Retrosheet site at: (http://retrosheet.org/research/smithd/whydohometeamsscoresomuchinthefirstinning.pdf).. Although the lead certainly can change hands several times in a given game, the aggregate pattern is that once a team gets a lead it most often maintains it. Before we get to the details, I present my usual summary of the size of the database I used. It is: Table. Data used in present study Seasons: 7 (9 6) Games 8, 75 As for maintaining the lead, Figure shows a team which is ahead by even one run after the first inning wins over 6% of the time. Figure. Winning percentage when leading by indicated margin after each inning, 9-6.

Percentage 95 9 85 8 75 7 65 6 Percentage Games Won 4 5 6 7 8 Inning Lead > any That percentage increases with each passing inning so that the team with a one-run lead at the end of the 8 th wins over 85% of the time. This pattern was found to be remarkably consistent across all 7 years studied even though total scoring changed significantly in different eras over this last century plus, as we shall see in a few minutes. It is reasonable to ask how much the size of the lead matters and that is addressed on the same graph by adding three more lines. These are the probabilities of winning the game when leading by two, three, or more than three runs at the end of the indicated inning. I saw no point in generating separate lines for four, five, six run leads, etc, so they were all combined. As the game proceeds, the value of the lead increases although the rise is not steady, with a proportionally greater value in the last three innings. As the size of the lead increases, so does the translation to a win, with a first inning lead of over three runs resulting in a win 88% of the time. The final line for this figure combines all these different values to give the win probability for a lead of any size. This is a more linear response than the individual leads. Note that after the sixth inning, the team with a lead - no matter what the margin is - wins 86% of the time. It is arbitrary to be sure, but a situation in which the chance of winning exceeds 85% is my candidate for the "deciding point" of a game with a decisive lead, although that may be too ambitious a term. I mentioned that these patterns of lead and probability of winning were constant across the 7 seasons and I address that in the next two figures. Figure shows the probability of winning from 9 to 6 when leading by any amount after the sixth inning.

Figure. Percentage Games Won Won With Lead After 6 95 9 85 8 75 7 65 6 9 9 94 96 98 The results for other innings show the same narrow range of variation. Rather than present eight separate slides, one for each inning, I prepared Figure which shows the average value for each inning across the 7 years I studied. This is the same line as the summary line in Figure.

Figure. Percentage Won With Lead After Each Inning Percentage Won 95 9 85 8 75 7 65 6 4 5 6 7 8 Inning The conclusion is inescapable. A lead is always important, no matter how early it comes and the larger the lead, the more likely it will lead to a win. It is good to remember at this point that these are aggregate values from the more than 8, games I examined. I do not claim that any individual game will follow this pattern, although there is still value in this general result. I will continue the analysis with the end of the sixth inning as my deciding point, hence the title of this presentation. My approach is very much concerned with starting pitcher usage and effectiveness, which are two distinct measures. My paper on closers showed a major change in reliever usage starting in 98 which I will summarize below. Starters are no longer routinely expected to pitch 9 or even 8 innings as bullpens have a variety of "roles" with a general design of using them in sequence after starter has hopefully completed six. This is connected to pitch counts as well, but I want to look at very long term trends and pitch data are only routinely available beginning in 988. I will start with the baseline of changes in starting pitcher usage. Figure 4 shows two measures of usage.

Figure 4. CG Percentage Complete Game % and Innings Pitched 9 8 8 7 6 6 5 4 4 9 9 94 96 98 Innings Pitched The first measure is the percentage of complete games and for this line the data go back to 9, which uses the left-hand vertical axis. The high point was 87.6% in 9 and the low was.7 % in 6. In fact, last year was the first in history with fewer than complete games, as the two leagues combined for a total of 8 (9 NL, 44 AL). The second measure, on the right-hand vertical axis, is the average innings pitched per start. This also shows a fairly steady decline since 9, but much more gradual. There are interesting blips in both lines in WWI, WWII, and the years just before the advent of the Designated Hitter. The value of 5.65 innings pitched per start in 6 is the lowest in history and it has been below 6. for 7 of the last 8 years, with only crossing that line with an average of 6.. Of course, there is a corresponding increase in relief pitcher innings, but as I have shown before (Figure 9 in http://retrosheet.org/research/smithd/mythofthecloser.pdf ) there is also a clear pattern of increased number of relief pitchers used in each game, so that the average relief appearance now consists of facing 4. batters, while the average starter sees 4.5 batters per start. In addition to patterns of pitcher usage, we must consider performance. The first way I chose to look at this is ERA. As many people have shown, there is a great deal of annual variation as Figure 5 shows.

Figure 5. Runs, ERA, WHIP and OPS Runs, ERA, WHIP and OPS 6 5 4 9 9 94 96 98 Runs ERA WHIP OPS From 9 (first year that earned runs were officially recorded) through 6 the range was a high of 5.5 per team per game in 9 and a low of.68, which came in 97, not in 968! This pattern reflects total scoring per team as the second line on the graph indicates, but although the lines mirror each other, the difference between the two has changed as unearned runs have become much less common. In 9 there was just under one unearned run per game per team (.97) and in 6, this value was down to.9 as errors have become progressively less common. The pattern of ERA changes is interesting in light of the other performance measures I calculated, which are WHIP (walks plus hits per inning) and OPS (on-base average plus slugging average). As the last lines in Figure 5 show, WHIP and OPS have been much more stable than ERA, although there are certainly differences. For WHIP, the high was.5 in 9 and the low was.9 in 968. For OPS, the range is a low of.6 in 98 and a high of.79 in 9, to no one's surprise! I find it surprising that scoring has varied so much more than what I like to call the components of scoring over the last century, especially when these component measures have been so stable. This relation of having a lead to the chance of winning is discussed commonly in broadcasts, usually in terms such as "Our team is and 5 when they lead after six innings". That is a winning percentage of.869 which is certainly impressive, but what is usually not mentioned is what I think is the proper comparison or context, namely that all teams combined have a winning percentage of around.86 when they lead after six. So it is wise to take pronouncements by announcers with a grain of salt. What I am mostly interested in is what effect pitcher usage has on this relation between having a lead and winning the game. Before I do that, I must digress to another aspect that is not often mentioned. That is, how often do these situations occur? Remember that Figure showed how increasingly important having the lead is, but we need the context of how often each occurs. Figure 6 shows us how often a lead of each size occurs after each inning.

Figure 6. 6 5 4 Occurrence of Each Lead 4 5 6 7 8 Inning Lead > The first line shows that the most common situation after the first inning is a tie score, which occurs in 5% of all games. Of these games, 9% are -, which is hardly a surprise since most would predict that a scoreless first inning is the most frequent occurrence. The average game quickly leaves the realm of being tied with a steady but nonlinear decline in tied games with a low of % being tied after 8 innings, very few of which are still -, of course. As the ties disappear, what patterns arise? The next three lines present the data for leads of one, two, three, or more than three runs after each inning. Games with a one-run lead have an interesting bump to just over % in the second inning, followed by a very steady decline through the rest of the game. Leads of two runs show a gentle increase through the fourth inning and then a very slight drop thereafter. Similarly games with three-run leads show a similar gradual rise, but not much decline. However, the biggest surprise to me occurs with leads of more than runs. A lead of this size is rare after the first inning (4%), but rises very steadily throughout the rest of the game until they reach 7% of all games after the 8th. I suggest that a game with a score differential of more than three runs is not a close game in any meaningful sense and furthermore that after the sixth inning, these large leads collectively predominate and that dominance continues to rise. The frequent occurrence of a large lead raises another question, namely what is the average margin of victory? It turns out to be larger than most people suspect. First, Table shows the five most common margins of victory from 9 to 6 with the percentage of each score.

Table. Most common final scores, 9-6. Final Score Occurrences Percentage - 45 5.8 4-7 5.6-87 4.7 5-4 7895 4.4 4-664.7 There were differences over the eras, but the general pattern holds pretty well. Note that the two most common are very close in frequency and these are the two that vie for the lead in most seasons. These are all close games (one or two runs), but together this group is only 4% of all games. Figure 7 shows the frequency of each winning margin up to. Figure 7. Frequency of Each Winning Margin 5 5 5 4 5 6 7 8 9 Margin The clear leader is the one-run win with each larger margin being less. I stopped at runs because almost 98% of all games are decided by runs or fewer, which is hardly a surprise. The games decided by one, two, or three runs are collectively 6% of the total So what is the average margin of victory? In light of the dominance of one-run wins, it may be surprising to see in Figure 8 that although it has varied, it has almost always been over, with a grand average of.4. We can conclude that it is necessary to be careful not to confuse the most common with the average.

Figure 8. 4 Average Winning Margin Margin How does all this fit into pitcher usage? One of the more dramatic findings I had last year was that starting in 98, there was a surge in bringing in a fresh pitcher to start the ninth inning in a save situation. Figure 9 is an extract of the data from Figure of last year s paper Figure 9. 9 9 95 97 99 Ninth Inning Pitchers and Wins 8 6 4 97 98 99 Beginning in 98, there was an increase in the entry of new pitchers to start the ninth inning in save situations. However, there is no change in the chance of winning the game, which led me to seriously question the now nearly universal use of closers in these situations (again that is an aggregate statement and does not refer to individual games). Given my emphasis in the current presentation on the significance of the first six innings, I decided to revisit this question of

bringing in a new pitcher to start the seventh. Since the change in closer pattern began in 98, I decided to analyze the seventh inning relievers the same way. These results are in Figures,, and. First I determined how often the different lead situations occurred, Figure. Percentage of games with leads of,,, or more after sixth inning. Lead After 6 Innings 4 5 5 5 98 99 Lead > The lines appear in a possibly unexpected order, but remember from before that the large leads predominate at this point in the game. There is quite a bit of statistical noise in these lines but the general pattern shows no clear trends. Figure shows how often a new pitcher was brought in, again as a function of the size of the lead. Figure. Percentage of games with a lead that have new pitcher to start seventh inning. New Pitcher To Start 7th 4 5 5 5 98 99 Lead >

I divided these entries based on the size of the lead since a direct comparison to the save situation of the ninth inning is not appropriate. There has certainly been an increasing trend of bringing in a reliever in these situations, but the three types of close game are similar in pattern, although there is a lot of noise. The fourth line is the result for large leads. The new reliever is called upon less often here, but the shape of the curve is similar, continuing the usage change I first demonstrated with the ninth inning. Finally, of course, we have to ask how effective these substitutions are in securing a win. That is addressed in Figure. Figure. Percentage of games won after sixth inning lead of one, two, or three runs. Games Won 95 9 85 8 75 7 65 6 98 99 Lead > The results are remarkably similar to the ninth inning closer results from my 6 presentation, namely that these new patterns of usage have had little effect on the final outcome. Conclusions:. The patterns of scoring and winning margin have been remarkably stable across the last 7 seasons.. Getting the lead early, preferably a big one, is enormously important.. The changing use of starters and therefore relievers, has not mattered very much. 4. The point of decision, as shown with a win probability of more than 85% is the end of the sixth inning.