Salary correlations with batting performance

By: Jaime Craig, Avery Heilbron, Kasey Kirschner, Luke Rector, Will Kunin

Introduction

Many teams pay very high prices to acquire the players they need to be as good as they can be. High-budget teams like the New York Yankees often seem to be very successful, but is the high price tag worth the improvement in performance? We compared salaries against a range of batting statistics: batting average, on-base percentage, slugging percentage, on-base plus slugging, home runs, strikeouts, stolen bases, runs created, and BABIP (batting average on balls in play). We predicted that higher salaries would correlate with better batting performance.

We also divided players into three groups by salary: the low range, up to $1 million per year; the middle range, from $1 million to $10 million per year; and the high range, greater than $10 million per year. We expected a stronger correlation between batting performance and salary for players in the high salary range than for players in the lower ranges.

Low Salary Below $1 million

Figure 1 is a correlation plot between salary and batting statistics for players making below $1 million. None of the plots shows a significant correlation between salary and batting statistics, and the players earning the lowest salaries show the lowest correlations of the three groups. The overall trend for low salary players, which would be expected, is a negative correlation between salary and batting performance. This negative correlation is likely a result of players being paid according to their specific performance, or the data may reflect underpaid rookies who have not yet bloomed in the major leagues.
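The segmentation and correlation steps described in the introduction can be sketched roughly as follows. This is a minimal illustration, not the authors' actual code: the function names, the record layout (a dict with a "salary" key plus one key per statistic), the handling of salaries exactly at $1 million or $10 million, and the use of the Pearson coefficient are all assumptions, since the report does not specify them.

```python
from math import sqrt

# Salary cutoffs taken from the text; boundary handling is an assumption.
LOW_MAX = 1_000_000
HIGH_MIN = 10_000_000


def salary_tier(salary):
    """Assign a yearly salary to the low, medium, or high tier."""
    if salary < LOW_MAX:
        return "low"
    if salary <= HIGH_MIN:
        return "medium"
    return "high"


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return float("nan")  # r is undefined when one variable is constant
    return cov / sqrt(var_x * var_y)


def correlations_by_tier(players, stat):
    """Correlate salary with one statistic separately within each tier."""
    grouped = {}
    for player in players:
        grouped.setdefault(salary_tier(player["salary"]), []).append(player)
    return {
        tier: pearson_r([p["salary"] for p in group], [p[stat] for p in group])
        for tier, group in grouped.items()
        if len(group) >= 3  # skip tiers with too few players to correlate
    }


if __name__ == "__main__":
    # Made-up records for illustration only; not data from the study.
    sample = [
        {"salary": 2_000_000, "HR": 12},
        {"salary": 4_500_000, "HR": 21},
        {"salary": 7_000_000, "HR": 18},
        {"salary": 9_000_000, "HR": 30},
    ]
    print(correlations_by_tier(sample, "HR"))
```

Computing the coefficient within each tier separately, rather than once over all players, is what allows the comparison between the low, medium, and high ranges.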

Figure 1. Correlation plot of players with salaries under $1 million against offensive statistics

Medium Salary Between $1-$10 million

For medium salary players, whom we categorized as earning $1-$10 million per year, salary has a relatively high correlation with batting performance (see figure 2) compared with low salary and high salary players. The highest correlation for medium salary players is with the "Runs Created" (RC) statistic, with home runs (HR), slugging percentage (SLG), and on-base plus slugging (OPS) following closely behind. Statistics often used to represent batting performance, like batting average (AVG) and on-base percentage (OBP), showed little correlation with salary, suggesting that medium salary batters are being paid for power hitting.

Figure 2. Correlation plot of players with salaries between $1 million and $10 million against offensive statistics
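For readers less familiar with the abbreviations above, the rate statistics can be derived from a player's counting statistics roughly as follows. This is a sketch using the standard textbook definitions; the function name and argument names are hypothetical, and the "basic" Runs Created formula is an assumption, since the report does not say which version of RC was used.

```python
def batting_stats(ab, h, doubles, triples, hr, bb, hbp, sf, so):
    """Derive common rate statistics from one player's season totals.

    ab: at-bats, h: hits, bb: walks, hbp: hit by pitch,
    sf: sacrifice flies, so: strikeouts.
    """
    singles = h - doubles - triples - hr
    total_bases = singles + 2 * doubles + 3 * triples + 4 * hr

    avg = h / ab                                  # batting average
    obp = (h + bb + hbp) / (ab + bb + hbp + sf)   # on-base percentage
    slg = total_bases / ab                        # slugging percentage
    ops = obp + slg                               # on-base plus slugging
    babip = (h - hr) / (ab - so - hr + sf)        # average on balls in play
    # Basic Runs Created; assumed here, other RC variants exist.
    rc = (h + bb) * total_bases / (ab + bb)

    return {"AVG": avg, "OBP": obp, "SLG": slg,
            "OPS": ops, "BABIP": babip, "RC": rc}


if __name__ == "__main__":
    # Made-up season line for illustration only.
    print(batting_stats(ab=500, h=150, doubles=30, triples=5, hr=25,
                        bb=50, hbp=5, sf=5, so=100))
```

SLG, OPS, and RC all weight extra-base hits heavily, which fits the observation that the statistics most correlated with salary in this range are the power-oriented ones.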

High Salary Over $10 million

We categorized high salary players as those who earned more than $10 million each season. Examining the correlations between salary and the different offensive statistics, we were surprised to find that a high salary is not necessarily indicative of strong offensive performance (figure 3). For this top salary bracket, the highest correlations were for HR and "Runs Created", both at 0.18. The correlation with HR can be explained by the fact that high-HR hitters are generally considered stars and thus command a higher salary. Salary's correlation with "Runs Created" also makes sense because teams place a great deal of value on runs. Interestingly, the correlations between salary and both HR and Runs Created are much lower than the same correlations for medium salary players.

Figure 3. Correlation plot of players with salaries above $10 million against offensive statistics

Figure 4. Correlation plot of all players with salaries against offensive statistics

In the contemporary era, we found relatively low correlations between batting performance and salary, even when players were segmented into three salary ranges. The lowest correlations were in the range below $1 million annually; most hitting metrics in fact had negative correlations with salary. The highest correlations were in the middle earning range, where runs created, home runs, on-base plus slugging, and slugging percentage held the highest values. In the highest earning range ($10M+), these statistics correlate less strongly than in the middle range, although home runs, on-base plus slugging, and on-base percentage saw only slight drops. Runs created saw a significant drop in correlation from the middle range to the high range, as did stolen bases. In the high salary range, batting average, on-base percentage, and home runs appear to be the most highly regarded, as they are among the most predictive metrics and also show the smallest drops from the middle earning range.

With the majority of Major League players earning in the middle range, we can see that clubs value run creation the most for these players, followed closely by home runs. This study also gives insight into what makes high salary players earn what they do, and how they differ from players in the other salary ranges.

While we could compare correlation values between salary ranges and batting statistics, our data showed no values remotely close to 1. The highest correlation value we found (0.35) was between salary and "runs created" in the combined correlation plot.
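To put the reported coefficients in perspective, one can square them. Assuming they are Pearson coefficients (the report does not say which coefficient was used) and that the relationship is roughly linear, the square of r is the share of the variance in one variable that a straight-line fit on the other would account for:

```latex
r_{\text{all players, RC}}^{2} = 0.35^{2} \approx 0.12,
\qquad
r_{\text{high salary, HR}}^{2} = 0.18^{2} \approx 0.03
```

Under those assumptions, even the strongest relationship in the study accounts for only about 12% of the variation, and the strongest one in the high salary range for about 3%.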