Predicting Baseball Win/Loss Records from Player Projections


Connor Daly
November 29

1 Introduction

When forecasting future results in Major League Baseball (MLB), there are essentially two sources from which you can derive your predictions: teams and players. How do players perform individually, and how do their collective actions combine to form a team's results? Several methods of both types exist, but they are often shrouded in proprietary formulas. Currently, several mature and highly sophisticated player projection systems are used to forecast season results. None are especially transparent about their methodology. Here I set out to develop the simplest possible player-based team projection system and try to add one basic improvement.

2 Predicting Wins and Losses

2.1 Team-Based Projections

One approach to such forecasting is to analyze team performance in head-to-head matchups. A common implementation of this approach is known as an Elo rating and prediction system.

Elo systems start by assigning teams an average rating. After games are played, winning teams' ratings increase and losing teams' ratings decrease relative to the expected outcome of the matchup. Expected outcomes are determined by the difference in rating between the two teams. If a very good team almost loses to a really bad team, its rating will only increase slightly. If an underdog pulls off an upset, however, it will earn relatively more points. As more games are played, older games become progressively less meaningful. Essentially, this prediction method considers only who you played, what the margin of victory was, and where the game was played (home field advantage is adjusted for). Using Monte Carlo simulations, one can predict the outcomes of individual seasons for each team. Between seasons, team ratings are regressed toward the mean. For a detailed explanation of a baseball Elo model, see FiveThirtyEight [Boi].

The main advantage of this kind of team-based approach is that it can capture some of the hard-to-pin-down factors that make teams more than just the sum of their parts. Without figuring out what the secret sauce is, this method estimates the sum total contributions of ownership, coaches, team philosophy, and countless other factors. The method does have a significant downside, however, in that it can't take advantage of known changes in team dynamics, such as changes in players and coaches. If I know Babe Ruth is leaving the Yankees after a particular season, I probably want to project them differently than I would have otherwise. This model fails to capture that.

2.2 Player-Based Projections

Baseball enjoys a unique advantage over other major American sports in that it is significantly easier to decouple the performance of individual players to determine who was ultimately responsible for creating a certain result.
If a batter hits a home run, we can say with a high degree of certainty that the batter and the pitcher combined to cause this event. By looking at the large number of batter/pitcher matchups, we can gauge the relative skill of each by their performance against a wide variety of opponents. On the other hand, a sport such as football presents significant challenges to gauging the true skill of individual players. Looking at the running game, how can one intelligently and objectively pass out credit and blame? If a running back runs for a seven yard gain on a toss sweep to the right, how much credit should the left guard receive? Decoupling in baseball isn't perfect, but compared to other sports, it's much easier.

2.2.1 Wins Above Replacement

A foundational pillar of sabermetrics, the empirical, quantitative analysis of baseball, is the concept of wins above replacement (WAR). Essentially, the idea is that all meaningful baseball statistics must measure how events on the field help or hurt a team's chances of winning in expectation. Games are won by scoring runs and preventing runs from being scored. Thus, every event can be understood in the context of runs allowed or runs created. This idea can be hard to grasp at first. How many runs does a home run create? One? Rather counter-intuitively, the generally accepted value is around 1.4 runs. How is this? Well, not only did the batter score himself, but he will also have batted in any runners on base. You must also consider the possibility that, had the batter made an out instead of scoring these base runners, following batters could have driven them in. Using real playing data, we can determine the expected run-creating or run-subtracting value of every event in baseball. See Table 1 for a complete breakdown of the run value of such events.

By looking at the total contributions of a player over the course of a season, we can sum up the expected run contributions of every event the player caused. Now we need to compare our player against a baseline. A first intuition might be to compare the player to league average. However, defining league average to be a baseline of zero runs created sells league average players short.
A league average player is better than approximately half of the players in the league. That's valuable production! Instead, we scale our player's contribution against the idea of a replacement level player. The production of a replacement level player is intended to be equivalent to the contributions of an infinitely replaceable minimum salary veteran or minor league free agent. For reference, a team of replacement players is defined by Fangraphs to win approximately 48 games over the course of a 162 game season. Using this replacement level, we determine the original player's runs above replacement. Next, we convert the runs above replacement into wins using the number of runs per win in an average game. Finally, we scale this calculated WAR to the number of possible wins in a season, so that the sum of all WAR and replacement-level wins equals the total number of wins in a season. On average, a player's context-free stats would have earned his team a number of extra wins equal to his WAR, compared with a replacement level player in his place. There is a finite pool of WAR for all players. When one player performs better, less WAR is allocated to the rest of the players.

Unfortunately for the reader, there are several variants of WAR, and all define things slightly differently. Several rely on inexplicably chosen constants or proprietary formulas. The main basis of my calculations is Fangraphs WAR, but I did make some alterations, which will be explained later. For more in-depth explanations of WAR and its underpinnings, see [Joh85], [Tom06], and [Fanb].

2.2.2 WAR to Wins

By projecting a season's worth of players' expected WAR contributions, we can group players by target year team and take the sum total of their contributions. The combined total of their WAR should help predict the team's actual number of wins. This relationship isn't necessarily one-to-one, as will be discussed in 4.5. This method benefits from being able to track players as they change teams.
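The chain from event run values to WAR described in this section can be sketched in Python. This is only an illustration under stated assumptions: the function names are hypothetical, and every number other than the roughly 1.4-run value of a home run (the run values for singles and outs, the replacement rate per plate appearance, and the runs-per-win figure) is a placeholder rather than a value from the paper.

```python
# Sketch of turning event counts into wins above replacement (WAR).
# Function names and all inputs are illustrative assumptions, not the author's code.

def runs_created(event_counts, run_values):
    """Sum the expected run value of every event a player caused."""
    return sum(run_values[event] * count for event, count in event_counts.items())


def runs_above_replacement(player_runs, plate_appearances, replacement_runs_per_pa):
    """Compare a player's runs with what a replacement level player would be
    expected to produce over the same number of plate appearances."""
    return player_runs - replacement_runs_per_pa * plate_appearances


def wins_above_replacement(rar, runs_per_win):
    """Convert runs above replacement into wins."""
    return rar / runs_per_win
```

With placeholder run values of 1.4 for a home run, 0.5 for a single and -0.3 for an out, a player with 10 home runs, 100 singles and 300 outs is worth -26 runs relative to zero; against a placeholder replacement rate of -0.1 runs per plate appearance over 410 plate appearances he is 15 runs above replacement, or 1.5 WAR at 10 runs per win.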

Table 1: Run Values by Event

Event                  Run Value    Event                    Run Value
Home Run                            Balk
Triple                              Intentional Walk
Double                              Stolen Base
Error                               Defensive Indifference
Single                              Bunt
Interference                        Sacrifice Bunt
Hit By Pitch                        Pickoff
Non-intentional Walk                Out
Passed Ball                         Strikeout
Wild Pitch                          Caught Stealing

Empirical measurements of the run value of events. Data from [Tom06].

3 Projecting Players

To create a season-long team projection system based on player projections, the first step is to project players. Essentially, you need to look at a player's past performance and predict how he will perform in the future. Some methods of doing this are highly sophisticated, others quite simple. Systems like Baseball Prospectus's PECOTA, Dan Szymborski's ZiPS, and Chris Mitchell's KATOH all combine many variables and various calculations to compute projected outcomes. PECOTA in particular is based primarily on player similarity scores. It uses various metrics to find comparable players for a given to-be-projected player and uses the performance of those comparables to infer a trajectory for the targeted player's future performance. Although its general methodology has been discussed, its specific implementation is proprietary. On the other end of the sophistication spectrum is perhaps the simplest possible projection system: Marcel the Monkey.

3.1 Marcel the Monkey

Marcel the Monkey, or simply Marcel, is a player projection system invented by Tom Tango [Tan]. It sets out to be the simplest possible player projection system. Essentially, it takes a weighted average of a player's last three years (5/4/3 for batters and 3/2/1 for pitchers), regresses the player toward the mean by 1200 plate appearances, and applies an aging curve that increases a player's skills until age 29, after which they begin to decline. These projections make no attempt to differentiate by team, league, or position, with the exception that some different constants are used for starting pitchers and relief pitchers. Rather than calculating counting stats such as hits or home runs directly, Marcel projects rate stats like hits or home runs per plate appearance. Plate appearances for batters are calculated from the previous two years and then added to a baseline of 200 plate appearances. Thus, all players are projected to have at least 200 plate appearances in the target year, even a player who may have retired two years prior. When translating from player to team projections, this is controlled for by setting rosters with the actual players who played on teams in the target year.

A note about pitchers: pitchers are projected per inning pitched rather than per plate appearance. Starting pitchers are projected to a minimum of 60 innings and relievers to a minimum of 25. A pitcher's starter or relief role is defined by the ratio of games started to games played. A starter has started more than half his appearances in the given period.

Marcel player projections are the foundation of my Marcel-based projection system. The first phase of my project centered around implementing a Marcel projection scheme in R for both batters and pitchers. Older seasons don't contain the same amount of statistical data that modern seasons do. Because of this, I am only able to create Marcel projections for seasons from 1955 onwards.
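The batter side of Marcel described above can be sketched as follows. This is a hedged illustration in Python rather than the R implementation the paper refers to. The 5/4/3 weights, the 1200 plate appearance regression, the age-29 peak and the 200 plate appearance baseline come from the text; the playing-time coefficients (0.5 and 0.1) and the aging rates (0.006 and 0.003 per year) are assumptions taken from common descriptions of Marcel, and all function names are hypothetical.

```python
# Minimal Marcel-style batter projection sketch (assumptions noted inline).

def marcel_rate(stat_by_year, pa_by_year, league_rate,
                weights=(5, 4, 3), regression_pa=1200):
    """Weighted per-PA rate over the last three seasons (most recent first),
    regressed toward the league rate by adding `regression_pa` plate
    appearances of league-average performance."""
    weighted_stat = sum(w * s for w, s in zip(weights, stat_by_year))
    weighted_pa = sum(w * p for w, p in zip(weights, pa_by_year))
    return (weighted_stat + regression_pa * league_rate) / (weighted_pa + regression_pa)


def projected_pa(pa_last_year, pa_two_years_ago, baseline=200):
    """Playing time from the previous two seasons plus a 200 PA baseline.
    The 0.5 and 0.1 coefficients are an assumption, not stated in the text."""
    return 0.5 * pa_last_year + 0.1 * pa_two_years_ago + baseline


def age_multiplier(age, peak_age=29, rise=0.006, decline=0.003):
    """Aging curve: skills improve until the peak age, then decline.
    The per-year rates are assumed values."""
    if age < peak_age:
        return 1.0 + (peak_age - age) * rise
    return 1.0 - (age - peak_age) * decline
```

For example, a batter with 30 home runs in 600 plate appearances in each of the last three years (a rate of .050) in a league averaging .030 projects to 396/8400, about .047 home runs per plate appearance before aging, and a player with no plate appearances in the previous two seasons still projects to 200.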
4 Marcel the Monkey to Marcel the Manager

After developing my Marcel projections, the next step in projecting team seasonal results was to group the players into teams and sum their accomplishments.

4.1 Season Lengths

Prior to 1961, both the American and National Leagues played a 154 game season before later switching to 162. Other regular seasons have also been shortened, such as by a player strike. As such, all projections must account for varying season lengths. All reported accuracy statistics are scaled to a 162 game season.

4.2 Adjustments to WAR Calculation

Although I generally followed the standard calculations for Fangraphs WAR, my calculations diverge enough to be considered significantly different. For position players, I only considered batting runs created, not fielding runs or baserunning runs. The numbers also aren't position, league, or park adjusted. Because WAR is designed to be a retrospective statistic and my numbers are forward looking, I did my best to remove them from all possible context. My projections don't take things like park factors or league adjustments into account, so neither should my WAR calculations. Fielding runs were not calculated because advanced fielding statistics are not provided in the Lahman database I used to gather my projections. See A.1 for more information on data sources.

4.3 New Season's Rosters

Rosters for target year teams were assembled by looking at batting statistics for the following (target) year. I defined being on a team for that year as having at least one plate appearance for that team, with that team being the first the player appeared with that year. Because I drew my batting stats from the Lahman database (see A.1), I could only project through the 2016 season. I will soon be able to predict the 2017 season when the 2017 version of the Lahman database is published, likely in the coming weeks. To predict a season before it actually happens, I would need to add a new source of data to determine which players to include on a roster.
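The roster rule in 4.3 and the season-length scaling in 4.1 can be sketched as below. The row layout (dictionaries with `player_id`, `team_id`, `stint` and `pa` keys) is a hypothetical stand-in for a season's batting table, not the actual Lahman schema, and treating "first team" as the lowest-numbered stint with at least one plate appearance is an assumption about how the rule was applied.

```python
# Sketch: assign each player to the first team he batted for in the target year,
# and scale win totals to a 162 game season. Field names are hypothetical.

def assign_rosters(batting_rows):
    """Return {team_id: [player_id, ...]} using each player's earliest stint
    with at least one plate appearance."""
    first_row = {}
    for row in batting_rows:
        if row["pa"] < 1:
            continue  # no plate appearance for this team, so not counted
        pid = row["player_id"]
        if pid not in first_row or row["stint"] < first_row[pid]["stint"]:
            first_row[pid] = row
    rosters = {}
    for pid, row in first_row.items():
        rosters.setdefault(row["team_id"], []).append(pid)
    return rosters


def scale_to_162(wins, games_played):
    """Express a win total as if the season had been 162 games long."""
    return wins * 162.0 / games_played
```

A player traded mid-season is therefore counted only for the team of his first stint, and a 77-win team in a 154 game season scales to 81 wins.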

4.4 Rescaling WAR

Going by Fangraphs' definition of WAR, a team of only replacement players is expected to achieve a winning percentage that corresponds to about 48 wins over a 162 game season; however, not all seasons since 1955 contained 162 games. Additionally, not all seasons feature 30 teams. Hence, the number of available wins for players to earn fluctuates from year to year. To calculate the available WAR in year x:

\mathrm{WAR}(x) = \mathrm{NumTeams}(x) \cdot \mathrm{NumGames}(x) \cdot \left( \tfrac{1}{2} - \mathrm{ReplacementLevel} \right) \qquad (1)

where NumTeams(x) is the number of teams playing in year x, NumGames(x) is the mode of teams' games played in year x, and ReplacementLevel is the winning percentage for a team of exclusively replacement level players. WAR is then divided so that 57% is allocated to position players and 43% to pitchers. Once the total amount of WAR has been allocated, players must be scaled so that their projected WAR sums to the amount of WAR available in the season.

4.5 Correcting Diminishing Returns

The next step in projecting teams is to sum individual player WAR to establish a team WAR. Once that has been done, we can add a team's total WAR to the season's per-team replacement win total. The resulting win total is that team's win projection for the year. You could stop there; however, doing so makes a key incorrect assumption about win totals, namely that win totals increase linearly with run differentials. Unfortunately, that is not the case. As you will see in the results section, there are clear diminishing returns at the extreme ends of the projection spectrum. The more WAR a team adds over a projected 81 wins (a .500 season), the more the model will overestimate the value of that WAR in predicting the number of wins. Similarly, the further a team's WAR falls below .500, the more the model will underestimate the team. The relationship between WAR and wins is not entirely linear!
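Equation (1), the 57/43 split, the rescaling of player projections and the uncorrected team total can be sketched in Python as follows. The function names are hypothetical, and the replacement level of 48/162 is taken from the "about 48 wins" figure in the text rather than from an exact published constant.

```python
# Sketch of the WAR pool and uncorrected team win total (names are illustrative).

def available_war(num_teams, num_games, replacement_level):
    """Equation (1): wins available above replacement in a season."""
    return num_teams * num_games * (0.5 - replacement_level)


def split_war(total_war, position_share=0.57):
    """Allocate 57% of the pool to position players and 43% to pitchers."""
    return total_war * position_share, total_war * (1.0 - position_share)


def rescale_to_pool(projected_war, pool):
    """Scale player projections so they sum to the available pool."""
    factor = pool / sum(projected_war.values())
    return {player: war * factor for player, war in projected_war.items()}


def team_projected_wins(player_wars, num_games, replacement_level):
    """Uncorrected projection: replacement-level wins plus summed player WAR."""
    return num_games * replacement_level + sum(player_wars)
```

With 30 teams, 162 games and a replacement level of 48/162, the pool is 30 * (81 - 48) = 990 wins, of which about 564 go to position players and 426 to pitchers.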

A simple solution to this problem is to apply a correcting function to the projections. I looked at applying two different correction models to the data, one linear and one cubic. I used simultaneous perturbation stochastic approximation (SPSA) to help determine the parameters [Spa03]. For a more detailed explanation of model selection, see the Results section.

4.6 Measuring Correctness

4.6.1 Validity and Verification

When constructing mathematical models of reality, one must always ask two questions: is the model correctly implemented, and does the model actually represent some semblance of reality? To answer the first question, we will look at publicly available data on Marcel player projections and wins above replacement. For the second, we will construct a loss function to measure how predictive our model can be.

The first step in model verification is to ensure that my implementation of Marcel projections matches the intended projections of the method's creator. Thankfully, he has many years of Marcel projections posted on his website [Tan]. Although our numbers aren't in complete agreement, they appear to be within an acceptable bound. Differences are on the order of one or two per stat and are likely due to implementation details such as digit precision and rounding decisions.

To verify I'm computing WAR correctly, I compared my projected WAR totals to the actual WAR earned in the target season. Looking mostly at the top of the board, I checked that my WAR projections seemed to be a reasonable weighted average of the previous three years. If a player averaged three WAR per year and was projected for six, I'd know something was off. I did recognize, however, that there would likely be reasonably large divergences for players who were extreme defensively, either extremely good or extremely bad. In aggregate, the WAR totals seemed to match up pretty well, but I don't have a rigorous calculation showing this is true.
Finally, to verify I aggregated team projections correctly, I looked at the sum of all projected wins per year and compared it to the total number of available wins. I made sure the calculated number was within a couple of wins of the actual. I allowed small differences because in some years teams play a different number of games and rounding can cause a win or two to fall through the cracks. The projections will still be reasonable.

4.6.2 The Loss Function

When measuring the validity of the model, it may seem tempting to say that we can measure its accuracy directly. But what exactly is it that our model is trying to measure? Are we trying to predict actual wins and losses, or are we trying to predict true talent, which can only be measured noisily via wins and losses? I would argue that we are attempting to ascertain true talent, which we measure noisily through wins and losses. Thus, we define our loss function y(θ):

y(\theta) = L(\theta) + \epsilon(\theta) \qquad (2)

y(\theta) = \frac{162}{n} \sum_{i=1}^{n} \frac{\left| \hat{x}_i - x_i \right|}{\mathrm{NumGames}(i)} \qquad (3)

where L(θ) measures the loss of the prediction's ability to measure true talent and ε(θ) is a noise term. The more concrete version specifies that the loss can be measured as the mean absolute error of the model's predictions scaled to a 162 game season. That is, for teams numbered 1, ..., i, ..., n, \hat{x}_i is the model's predicted number of wins for team i, x_i is the team's actual number of wins, and NumGames(i) is the number of games played by team i. This computes the mean absolute error over all teams, scaled to 162 games, which allows us to compare results from teams who played seasons of different lengths.

5 Results

Without applying any correction model, I was able to achieve a loss of 8.16 wins per team per 162 game season. See Figure 1 for a visual representation of the results. Although not perfect, there is a clear trend line between the predictions and results. The residuals from the one-to-one line, however, appear to show a positive trend, meaning the model is overestimating teams at the right end of the graph and underestimating teams at the left. We can attempt to correct for this.

Figure 1: Results of Uncorrected Projections. (a) Uncorrected Projections. (b) Residuals for Uncorrected Projections.

5.1 Calculating Correction Parameters

I used two different models to attempt to apply corrections: one linear and one cubic. For both, I used SPSA to determine the optimal parameter values. I chose a linear and a cubic model because I assumed that the correction needed to be reasonably antisymmetric around 81 projected wins, corresponding to a .500 record. Both a negatively sloped linear function and a cubic function could provide that correction. I picked my initial parameters by guessing a scaling factor and choosing the other terms such that the x-intercept was 81. I attempted to find the correction such that:

\mathrm{CorrectedWins} = \mathrm{ProjectedWins} + \mathrm{Correction} \qquad (4)
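The loss in equation (3) and the corrected total in equation (4) can be sketched in Python as below. The function names are hypothetical, and `correction` stands for whichever correction model is being evaluated.

```python
# Sketch of the scaled mean absolute error (equation 3) and corrected wins (equation 4).

def scaled_mae(predicted_wins, actual_wins, games_played):
    """Mean absolute error in wins, with each team scaled to a 162 game season."""
    n = len(predicted_wins)
    total = sum(abs(p - a) / g
                for p, a, g in zip(predicted_wins, actual_wins, games_played))
    return 162.0 / n * total


def corrected_wins(projected_wins, correction):
    """Equation (4): add a correction computed from the projected win total."""
    return projected_wins + correction(projected_wins)
```

For two teams projected at 90 and 70 wins that actually won 85 and 77, in seasons of 162 and 154 games respectively, the loss is 81 * (5/162 + 7/154), about 6.18 wins per 162 games.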

5.1.1 Linear Model

For the linear model, I modelled \mathrm{Correction} = \beta_0 \cdot \mathrm{proj.wins} - \beta_1, starting with an initial β value set [.25, 20.25]. The initial β values were determined by manually guessing and checking a few test values. After a million runs with parameters A = 1000, a = .01, c = .015, α = 0.602, γ = 0.101, and a Bernoulli distribution (+1, -1) for my deltas, I determined my optimal value to be [ , ]. This resulted in a loss of 7.77 wins per team per 162 game season. I used the gain sequence provided in [Spa03], so I know that the gain sequence conditions for convergence are satisfied. By using a Bernoulli distribution for my deltas, as in [Spa03], I've satisfied the conditions on the deltas. The rest of the conditions are unknowable without knowledge of L, but it seems reasonable that it is sufficiently smooth and bounded.

5.1.2 Cubic Model

For the cubic model, I used vertex form to model \mathrm{Correction} = \beta_0 \left( \mathrm{proj.wins} - \beta_1 \right)^3, starting with an initial β value set [.01, 81]. The initial β values were determined by manually guessing and checking a few test values. After a million runs with parameters A = 1000, a = .0001, c = .0015, α = 0.602, γ = 0.101, and a Bernoulli distribution (+1, -1) for my deltas, I determined my optimal value to be [ , ]. This resulted in a loss of 8.02 wins per team per 162 game season. I used a scalar multiple of the gain sequence provided in [Spa03], so I know that the gain sequence conditions for convergence are satisfied. By using a Bernoulli distribution for my deltas, as in [Spa03], I've satisfied the conditions on the deltas. The rest of the conditions are unknowable without knowledge of L, but it seems reasonable that it is sufficiently smooth and bounded.
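A generic SPSA loop with the gain sequences and Bernoulli perturbations named above can be sketched in Python as follows. This is a textbook-style sketch, not the code used for the results. The two correction functions assume the forms β0 · proj.wins − β1 and β0 (proj.wins − β1)^3, which put the x-intercept at 81 for the stated starting values; the function names and the `seed` argument are hypothetical.

```python
import random

# Sketch of SPSA with gains a_k = a / (k + 1 + A)^alpha and c_k = c / (k + 1)^gamma.

def linear_correction(beta, projected_wins):
    """Assumed linear form: beta0 * proj.wins - beta1."""
    return beta[0] * projected_wins - beta[1]


def cubic_correction(beta, projected_wins):
    """Vertex-form cubic: beta0 * (proj.wins - beta1)^3."""
    return beta[0] * (projected_wins - beta[1]) ** 3


def spsa_minimize(loss, theta0, n_iter, a, c, A,
                  alpha=0.602, gamma=0.101, seed=0):
    """Minimize `loss` using two loss evaluations per iteration."""
    rng = random.Random(seed)
    theta = list(theta0)
    for k in range(n_iter):
        a_k = a / (k + 1 + A) ** alpha
        c_k = c / (k + 1) ** gamma
        # Bernoulli (+1, -1) perturbation applied to all parameters at once.
        delta = [rng.choice((-1.0, 1.0)) for _ in theta]
        theta_plus = [t + c_k * d for t, d in zip(theta, delta)]
        theta_minus = [t - c_k * d for t, d in zip(theta, delta)]
        diff = loss(theta_plus) - loss(theta_minus)
        # Simultaneous-perturbation gradient estimate, then a descent step.
        theta = [t - a_k * diff / (2.0 * c_k * d) for t, d in zip(theta, delta)]
    return theta
```

In use, `loss` would wrap the scaled mean absolute error of the corrected projections as a function of β, for example `lambda beta: scaled_loss(beta, data)` for some user-supplied `scaled_loss`. Because the step size a_k applies to every parameter, parameters on very different scales (a slope near .25 and an intercept near 20) may need rescaling or separate tuning.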

5.2 Results with Corrections

After analyzing both the linear and the cubic model, I needed to decide which to use for my corrections. I decided to use cross validation to determine which model to use. I used three different test sets that were created by grouping the data points by their position modulo three. Performing the same SPSA calculations as in the individual trials but with 10,000 runs, I found the linear model had an average loss of 7.82 wins per 162 game season and the cubic model had 8.04 wins per 162 game season. I decided to use the linear model for my corrections.

Although this helped reduce our loss function, our corrected model still isn't perfect. Notably, the corrected left tail in Figure 2a isn't as well predicted as in the uncorrected version. Overall, though, the corrected model shows noticeable year-to-year improvements over the uncorrected model, as in Figure 2b.

Figure 2: Looking at Corrected Projections. (a) Corrected Projections. (b) Year to Year Correction Improvement.
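The modulo-three split can be sketched in Python as below. It assumes that each fold is held out in turn while the parameters are fit on the other two, which the text does not state explicitly; `fit` and `loss` are hypothetical callables standing in for the SPSA fit and the scaled loss.

```python
# Sketch of cross validation with folds formed by position modulo k.

def modulo_folds(n_points, k=3):
    """Group indices 0..n_points-1 by their position modulo k."""
    return [[i for i in range(n_points) if i % k == fold] for fold in range(k)]


def cross_validate(data, fit, loss, k=3):
    """Average held-out loss over the k modulo folds."""
    losses = []
    for test_idx in modulo_folds(len(data), k):
        held_out = set(test_idx)
        train = [d for i, d in enumerate(data) if i not in held_out]
        test = [data[i] for i in test_idx]
        params = fit(train)           # e.g. SPSA on the training points
        losses.append(loss(params, test))
    return sum(losses) / len(losses)
```

Running this once with a linear fit and once with a cubic fit gives two average held-out losses that can be compared directly.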

5.3 Perfect and Perfectly Imperfect Knowledge

So how good is the model actually? We know we can achieve an average loss of under eight games per season, but is that any good? Suppose we knew nothing about individual MLB teams and instead only knew the distribution of MLB records. We can assume that it is approximately normal, and by definition it will have an average winning percentage of .500. The standard deviation in winning percentage corresponds to around 11.3 wins per 162 game season. If we randomly assign teams a winning percentage from this distribution, we end up with a loss of around 13 wins per 162 game season. Similarly, if we were to project a .500 record for every team, we'd be off by about 9.5 wins per 162 game season.

In contrast, how good a projection could we ever hope to get? The best predictor of how many wins a team accrues turns out to be an estimate based solely on its runs scored and runs allowed. These estimates are called Pythagorean win projections, the most accurate of which is referred to as the Pythagenpat win total [Pro]. If we had perfect knowledge of how many runs a team would score and allow, we could use their Pythagenpat wins to predict their record, as in Figure 3. Yet even with this perfect knowledge, we can only come within 3.18 wins per 162 game season. If we consider projecting all teams to a .500 record (9.5 wins) to be the low point and 3.18 wins to be a theoretical upper bound, our model appears to have achieved 27% of all possible knowledge gain, since a loss of just under 7.8 wins closes roughly 1.7 of the 6.3 wins separating the two bounds.

Figure 3: Predicting Wins from Pythagenpat Wins.

6 Park Effects

Now, I attempt to add one final improvement to the model: park effects. Essentially, not all ballparks in Major League Baseball are created equal; they have different dimensions and atmospheric effects that make some parks easier to score runs in than others. Using park effects data (see A.1), I deflated all player stats to remove park effects before computing their Marcel projection. After their season was projected, I looked at their destination home ballpark and inflated their numbers to reflect their new home. Surprisingly, this made my projections worse across the board, only beating my standard Marcel model twice in 60 years, as shown in Figure 4. A clear shift occurs in the results: in 1974, more detailed park effects were released, which led to improved predictions. Although park effects are certainly real, I'm left to conclude that, averaged out over a very large sample of players, the current level of granularity is too coarse to be very predictive.

7 Conclusions

At the start of this, I wanted to build the dumbest player-based projection model possible and see if I could improve it. Beyond a simple error correction, I couldn't in the short time I had. Although my model may be dumb enough for a monkey, it is still reasonably predictive and appears likely to hold up against far more sophisticated predictions.

Figure 4: Comparing Projections With and Without Park Effects.

Table 2: 2016 Projection Comparison. Systems compared: Marcel, PECOTA, FanGraphs, Davenport, Banished to the Pen, Essays, Composite.

7.1 Comparison to Other Models

Unfortunately, many of the data points required to do a full multi-year model comparison lie behind paywalls or aren't easily searchable on the internet. Baseball Prospectus has currently taken down the seasonal PECOTA projections as they upgrade their site. We can, however, look at the year 2016. After training my model on earlier seasons, it predicts 2016 with a loss of 6 wins per 162 game season. See Table 2 for how it stacked up against the competition. Basically, Marcel projections went toe-to-toe with the best of the best. Data is courtesy of [Aus].

7.2 Challenges and Future Directions

There are several limitations to my model, some mathematical, some sabermetric. First, my measurements of WAR only look at batting and defense-independent pitching. This removes skills related to baserunning and fielding from the game, which causes players with fielding or baserunning talent significantly different from league average to be incorrectly valued. Secondly, Marcel doesn't do a great job of adjusting for playing time. Every player is projected for a minimum of 200 plate appearances with no regard for their expected role on the team. More intelligently modelling fielding and baserunning skill, as well as better adjusting for playing time, could significantly improve the model. Another simple improvement would be to add a more robust aging curve. Different positions tend to age differently, so a position-specific aging curve could help. I'd also like a better way to manage roster data so I can project current rosters into the future without relying on Lahman data.

Mathematically, I would have liked to run better SPSA optimizations. For the amount of time I ran them, I wasn't able to move my final parameters very far from my initial guess. This meant I needed to check a lot of values by hand to figure out the best place to start the optimization. Better choices of SPSA parameters and longer running times likely would have helped. Additionally, my model doesn't account well for uncertainty. Marcel has a way to measure reliability based on how much of the player's projection comes from his own stats versus how much it is regressed toward the mean. I would have liked to add a similar component that could perhaps provide confidence intervals around a team's projection.

A Appendix

A.1 Sources of Data

Seasonal batting and pitching data was obtained from the Lahman database [Lah]. I made use of seasons through 2016, which was the last published year with entries at the time of writing. Park effect factors came from Fangraphs [Fana]. Full detailed factors were available after 1973; earlier years only had basic effects available.

References

[Joh85] John Thorn and Pete Palmer. The Hidden Game of Baseball. University of Chicago Press.
[Spa03] James C. Spall. Introduction to Stochastic Search and Optimization. Wiley-Interscience Series in Discrete Mathematics and Optimization. John Wiley and Sons.
[Tom06] Tom M. Tango, Mitchel G. Lichtman, and Andrew E. Dolphin. The Book: Playing the Percentages in Baseball. TMA Press.
[Aus] Darius Austin. Evaluating the 2016 Season Preview Predictions.
[Boi] Jay Boice. How Our 2017 MLB Predictions Work. url: com/features/how-our-2017-mlb-predictions-work/.
[Fana] Fangraphs. Park Factors. url: pf&teamid=0&season=2012.
[Fanb] Fangraphs. WAR for Position Players. url: https://www.fangraphs.com/library/war/war-position-players/.
[Lah] Sean Lahman. Lahman's Baseball Database. url: baseball-archive/statistics/.
[Pro] Baseball Prospectus. Pythagenpat. url: com/glossary/index.php?mode=viewstat&stat=136.
[Tan] Tom Tango. The 2004 Marcels. url: stud0346.shtml.


More information

PREDICTING the outcomes of sporting events

PREDICTING the outcomes of sporting events CS 229 FINAL PROJECT, AUTUMN 2014 1 Predicting National Basketball Association Winners Jasper Lin, Logan Short, and Vishnu Sundaresan Abstract We used National Basketball Associations box scores from 1991-1998

More information

1. OVERVIEW OF METHOD

1. OVERVIEW OF METHOD 1. OVERVIEW OF METHOD The method used to compute tennis rankings for Iowa girls high school tennis http://ighs-tennis.com/ is based on the Elo rating system (section 1.1) as adopted by the World Chess

More information

TOP OF THE TENTH Instructions

TOP OF THE TENTH Instructions Instructions is based on the original Extra Innings which was developed by Jack Kavanaugh with enhancements from various gamers, as well as many ideas I ve had bouncing around in my head since I started

More information

Opleiding Informatica

Opleiding Informatica Opleiding Informatica Determining Good Tactics for a Football Game using Raw Positional Data Davey Verhoef Supervisors: Arno Knobbe Rens Meerhoff BACHELOR THESIS Leiden Institute of Advanced Computer Science

More information
