George F. Will, Men at Work

Similar documents
1988 Graphics Section Poster Session. Displaying Analysis of Baseball Salaries

New York Yankees (34-31) 6, Seattle Mariners (34-32) 3 June 12, 2014

SAP Predictive Analysis and the MLB Post Season

Lesson 3 Pre-Visit Teams & Players by the Numbers

Stats in Algebra, Oh My!

Seattle Mariners (42-49) 4, New York Yankees (49-41) 3 July 18, 2015

1: MONEYBALL S ECTION ECTION 1: AP STATISTICS ASSIGNMENT: NAME: 1. In 1991, what was the total payroll for:

Paul M. Sommers. March 2010 MIDDLEBURY COLLEGE ECONOMICS DISCUSSION PAPER NO

Seattle Mariners (36-42) 7, San Diego Padres (37-43) 0 July 1, 2015

Seattle Mariners (52-45) 3, Los Angeles Angels (58-38) 2 July 19, 2014

Additional On-base Worth 3x Additional Slugging?

Teaching Math Through Sports

Texas Rangers (15-9) 6, Seattle Mariners (9-14) 3 April 26, 2014

Major League Baseball Offensive Production in the Designated Hitter Era (1973 Present)

Los Angeles Angels (47-39) 7, Seattle Mariners (40-47) 3 July 10, 2015

Lesson 1 Pre-Visit Batting Average

Minnesota Twins (7-10) 8, Seattle Mariners (7-10) 5 April 25, 2015

Houston Astros (7-14) 5, Seattle Mariners (7-13) 2 April 22, 2014

Chapter. 1 Who s the Best Hitter? Averages

One could argue that the United States is sports driven. Many cities are passionate and

Seattle Mariners (15-15) 4, Oakland Athletics (19-13) 2 May 5, 2014

6-8th GRADE WORKBOOK CLAYTON KERSHAW HEIGHT: 6 3 WEIGHT: 220 BATS: LEFT THROWS: LEFT BORN: 3/19/1988 MLB DEBUT: 5/25/2008

2014 MAJOR LEAGUE LEAGUE BASEBALL ATTENDANCE NOTES

Scotty s Spring Training

Baltimore Orioles (57-45) 2, Seattle Mariners (53-50) 1 July 25, 2014

2013 National Baseball Arbitration Competition. Tommy Hanson v. Atlanta Braves. Submission on behalf of Atlanta Braves. Submitted by Team 28

Do Clutch Hitters Exist?

PROMOS / CONCEPT=MLB ON FS1

Relative Value of On-Base Pct. and Slugging Avg.

American League Ballpark

Double Play System 1.0

2014 Tulane Baseball Arbitration Competition Eric Hosmer v. Kansas City Royals (MLB)

Boston Red Sox (18-19) 4, Seattle Mariners (16-20) 2 May 16, 2015

Seattle Mariners (39-45) 7, Detroit Tigers (42-41) 6 July 7, 2015

2019 LSU BASEBALL Overall Statistics for LSU (as of Feb 24, 2019) (All games Sorted by Batting avg) (All games Sorted by Earned run avg)

NFL SCHEDULE SAMPLE. Green Bay

Triple Lite Baseball

GUIDE TO BASIC SCORING

Seattle Mariners (43-51) 11, Detroit Tigers (46-47) 9 July 21, 2015

OYO Baseball Hall of Fame Collector s Checklist

2016 MAJOR LEAGUE BASEBALL ATTENDANCE HIGHLIGHTS

ISDS 4141 Sample Data Mining Work. Tool Used: SAS Enterprise Guide

BABE: THE SULTAN OF PITCHING STATS? by. August 2010 MIDDLEBURY COLLEGE ECONOMICS DISCUSSION PAPER NO

Southern U. Baseball 2017 Overall Statistics for Southern U. (as of Apr 01, 2017) (All games Sorted by Batting avg)

2017 International Baseball Tournament. Scorekeeping Hints

PHILLIES RECORD WHEN THEY (2016)

2017 BALTIMORE ORIOLES SUPPLEMENTAL BIOS

Los Angeles Dodgers (17-13) vs. Miami Marlins (15-14) Friday, May 02, 2014 Marlins Park, Miami, FL

(56.3%) AL (60%) (62%) (69%) (+4149) 7* 9-5 (64%) +450 (400% ROI

1977 Boston Red Sox. Record: t-2nd Place American League East Manager: Don Zimmer

Seattle Mariners (16-19) 2, Boston Red Sox (17-19) 1 May 15, 2015

NFL SCHEDULE SAMPLE. Green Bay

2017 BALTIMORE ORIOLES SUPPLEMENTAL BIOS

NFL SCHEDULE SAMPLE. Green Bay

Lesson 2 Pre-Visit Big Business of the Big Leagues

Seattle Mariners (42-36) 8, Boston Red Sox (35-43) 2 June 24, 2014

MONEYBALL. The Power of Sports Analytics The Analytics Edge

OFFICIAL RULEBOOK. Version 1.08

A Markov Model for Baseball with Applications

KANSAS CITY ROYALS POSTGAME NOTES

CHICAGO WHITE SOX POSTGAME NOTES. BOSTON RED SOX (16-11) at CHICAGO WHITE SOX (19-9) Wednesday, May 4, 2016 U.S. Cellular Field, Chicago, Ill.

Lesson 5 Post-Visit Do Big League Salaries Equal Big Wins?

A market in which freedom is limited by a reserve rule distributes players about as a free market would.

Since the National League started in 1876, there have been

2014 Baltimore Orioles

Offensive & Defensive Tactics. Plan Development & Analysis

#35 CODY BELLINGER #58 EDWARD PAREDES

EMU Baseball vs. Kansas, March 1, 2013

Average Runs per inning,

May the best (statistically chosen) team win! Danielle Pope

Expansion: does it add muscle or fat? by June 26, 1999

Defining Greatness A Hall of Fame Handbook

THE BIRD ON S.T.E.M.

MINNESOTA TWINS (70-64) VS. KANSAS CITY ROYALS (66-67) FRIDAY, SEPTEMBER 1, 2017 TARGET FIELD MINNEAPOLIS, MN

George Brett - #5. Third Baseman, Brett s Major League Career Statistics

2013 Baltimore Orioles

MAJOR LEAGUE BASEBALL 2014 ATTENDANCE ANALYSIS. Compiled and Written by David P. Kronheim.

CARLOS GONZÁLEZ. Outfielder C, GONZÁLEZ. ROCKIES.com Twitter.com/Rockies Twitter.com/RockiesPR 97

Player AVG GP-GS AB R H 2B 3B HR RBI TB SLG% BB HBP SO GDP OB% SF SH SB-ATT PO A E FLD%

Arizona Diamondbacks Turner Field Atlanta Braves Chase Field Turner Field - Advanced Ballpark Factors

Baseball Basics for Brits

Rating Player Performance - The Old Argument of Who is Bes

Descriptive Statistics Project Is there a home field advantage in major league baseball?

EMU Baseball vs. Kansas Game 1, March 2, 2013

Traveling Salesperson Problem and. its Applications for the Optimum Scheduling

a) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5

When Should Bonds be Walked Intentionally?

2012 Baltimore Orioles

Future Expectations for Over-Performing Teams

A Competitive Edge? The Impact of State Income Taxes on the Acquisition of Free Agents by Major League Baseball Franchises

OFFICIAL RULEBOOK. Version 1.16

Should pitchers bat 9th?

An average pitcher's PG = 50. Higher numbers are worse, and lower are better. Great seasons will have negative PG ratings.

1982 Atlanta Braves. Record: st Place National League West Manager: Joe Torre

KANSAS CITY ROYALS POSTGAME NOTES

The Automated ScoreBook New Mexico Highlands at New Mexico Lobos Feb 08, 2003 at Albuquerque, N.M. (Lobo Field)

1991 Boston Red Sox. Record: t-2nd Place American League East Manager: Joe Morgan

NFL SCHEDULE SAMPLE. Green Bay

2004 Baltimore Orioles

NFL SCHEDULE SAMPLE. Green Bay

Transcription:

Part of baseball s charm is the illusion it offers that all aspects of it can be completely reduced to numerical expressions and printed in agate type in the sport section. George F. Will, Men at Work

Tom O Brien

The Official Rules of Major League Baseball 1.02 The objective of each team is to win by scoring more runs than their opponent 1.03 The winner of the game shall be that team which shall have scored, in accordance with these rules, the greater number of runs at the conclusion of a regulation game.

What are the important factors in winning games? Suppose our team outdoes the opposition in category X. What is the probability that we will win the game?

Event Probability, % W/L Runs created 82.6 4.75 Hits + Walks 79.6 3.90 Total Bases 79.5 3.88 Hits 76.3 3.22 Walks 66.1 1.95 (Errors) 63.9 1.77 Stolen Bases 61.1 1.58

The Society for American Baseball Research (SABR) SABeRmetrics Sabermetrics is the mathematical and statistical analysis of baseball records - Bill James

Offense 50% Defense 50% Offense Runs Created Linear Weights OPS Modified Defense Pitching - ~40% Fielding - ~10%

Wins above Replacement Player (WAR) Team Data Runs Scored Ratio or Difference Predict Number of Games Won Individual Data Runs Allowed Other Calculate "Runs per Win" Stuff Defensive Stats Hits Walks Runs Extra Bases Created Pitching Stats Outs Define Runs "Replacement above WAR Player" (R. P.) R. P.

The Importance of Felix Hernandez Cy Young Award 2001 2012 Won-Lost Percentages of Starting Pitchers (starters have won 23 of 24 awards) 2010 AL: Felix Hernandez, Seattle Mariners Won 13, Lost 12 (others average 20-6) Team record 61-101 (worst in AL) Felix produced 4.24 wins above team, 6.8 WAR ERA 2.27; best in league Pitched 250 innings; 230 K s 2009: Felix went 19-5.880.870.828.828.821.818.808.808.800.783.778.769.760.760.759.731.731.724.682.677.667.667.520

1 2

Position 1B 2B 3B SS LF CF RF Player Albert Pujols Mark Teixeira Chase Utley (Scott Rolen) Adrian Beltre * Evan Longoria Brendan Ryan Brett Gardner Michael Bourn (Andruw Jones) Jason Heyward Ichiro Suzuki * ( ) Retired 2012 * Beyond peak/dh

Wins above Replacement Player (WAR) Team Data Runs Scored Ratio or Difference Predict Number of Games Won Individual Data Runs Allowed Other Calculate "Runs per Win" Stuff Defensive Stats Hits Walks Runs Extra Bases Created Pitching Stats Outs Define Runs "Replacement above WAR Player" (R. P.) R. P.

RC = (H + W) x (TB) / (AB + W) Rearranged: RC = [(H + W) / (AB + W)] x (TB) RC = OBP x (SP x AB) Quality: OBP x SP Quantity: At Bats

RC = (H + W CS) x (TB + 0.55 x SB) / (AB + W) = A x B / C A Reaching base B Advancement C Opportunities

A factor: B factor: H + W CS becomes H + W CS + HBP GIDP TB + 0.55 x SB becomes TB + 0.26(TBB IBB + HBP) + 0.52(SH + SF + SB) C factor: AB + W becomes AB + W + HBP + SH + SF Bill James has about 15 different technical versions use depends on availability of various stats

The Runs Created Formulas 1964 AL Team Runs Basic SB version Tech Boston 688 735 732 725 Detroit 699 690 691 707 New York 730 695 698 705 Minnesota 737 764 763 775 Baltimore 679 667 667 671 Cleveland 689 658 653 666 Chicago 642 614 614 643 Los Angeles 544 559 555 552 Kansas City 621 644 643 645 Washington 578 559 556 566 Average 661 658 657 665 Std Error of Estimate 60.3 26.9 26.5 22.1 Analysis of Variance Basic 80.1 Tech 6.5 Residue 13.4

Runs Created 1985 NL Team A B C RC R StL 1863 2432 6182 733 747 NY 1807 2390 6248 691 695 Mtl 1671 2295 6053 634 633 Chi 1809 2426 6177 710 686 Phl 1720 2340 6122 657 667 Pgh 1677 2135 6099 587 568 4012 3996 LA 1838 2382 6222 704 682 Cin 1778 2322 6143 672 677 Hou 1774 2386 6192 684 706 SD 1774 2238 6150 646 650 Atl 1728 2234 6206 622 632 SF 1612 2123 6063 564 556 3892 3903 7904 7899 Division East 10547 14018 36881 4009 3996 West 10504 13685 36976 3888 3903 7897 7899 League 21051 27703 73857 7896 7899

Wins above Replacement Player (WAR) Team Data Runs Scored Ratio or Difference Predict Number of Games Won Individual Data Runs Allowed Other Calculate "Runs per Win" Stuff Defensive Stats Hits Walks Runs Extra Bases Created Pitching Stats Outs Define Runs "Replacement above WAR Player" (R. P.) R. P.

Batting, Pitching, & Fielding Statistics Franchise Encyclopedia: 1997 / 1999 114-48, Finished 1st in AL East (Schedule and Results) View League Standings and Leaders Manager: Joe Torre (114-48) Scored 965 runs, Allowed 656 runs. Pythagorean W-L: 108-54 Ballparks: Yankee Stadium II & Shea Stadium Attendance: 2,955,193 (3rd of 14) Park Factors Over 100 favors batters, under 100 favors pitchers. Multi-year: Batting - 97, Pitching - 95 one-year: Batting - 100, Pitching - 97 Postseason: Won World Series (4-0) over San Diego Padres Won AL Championship Series (4-2) over Cleveland Indians Won AL Division Series (3-0) over Texas Rangers

Batting, Pitching, & Fielding Statistics Franchise Encyclopedia: 1997 / 1999 114-48, Finished 1st in AL East (Schedule and Results) View League Standings and Leaders Manager: Joe Torre (114-48) Scored 965 runs, Allowed 656 runs. Pythagorean W-L: 108-54 Ballparks: Yankee Stadium II & Shea Stadium Attendance: 2,955,193 (3rd of 14) Park Factors Over 100 favors batters, under 100 favors pitchers. Multi-year: Batting - 97, Pitching - 95 one-year: Batting - 100, Pitching - 97 Postseason: Won World Series (4-0) over San Diego Padres Won AL Championship Series (4-2) over Cleveland Indians Won AL Division Series (3-0) over Texas Rangers

Percentage = (runs scored) 2 / [(runs scored) 2 + (runs allowed) 2 ] P = R 2 / (R 2 + S 2 ) Has found its way into popular culture ****** Variations: P = r 2 / (r 2 + 1) (where r = R/S) W/L = R 2 /S 2 (or W/L = r 2 )

P = R 2 / (R 2 + S 2 ) Pythagorean theorem involves an assumption. The best exponent may not be 2.00000000. A generalization: P = R n / (R n + S n ) By trial, the best value of n is 1.80-1.85. 1.83 usually assumed. Advantage: (Slightly) more accurate Disadvantages: More complex Can t call it Pythagorean

Linear Results shown by straight line Percentage Coordinates are scoring percentage and winning percentage Neutrality Line must go through (0.500, 0.500) Slope The best value for the slope of the line is close to 1.8

W/(W + L) 0.5 = 1.8[R/(R + S) 0.5] P = W/(W + L) = (1.4R 0.4S)/(R + S) Δ = R S P = (R + 0.4Δ)/(R + S)

The Beer-Mat Test

Situation Proportional (recognize value of runs) Sabermetric (weight value of runs) % of variance AL NL 75.1 67.1 22.3 24.0 Residue 2.5 8.9

It depends on the average scoring level Let s call the average number of runs, by both teams, ρ Tabulate ratio of runs required to average level ρ Method r= 1 1.1 1.25* Pythagorean 1 1.007 1.038 James Index 1.093 1.099 1.125 Linear Percentage 1.111 1.111 1.111 (*or 0.8) (Values are runs required for an incremental win, divided by ρ)

OPS RPG Correlation

Base-Stealing Performance 1950-2010

1) Reduce prep time (simple windup now prevalent) 2) Reduce time to get the ball to the fielder (quick turnaround): Time Success Rate (sec) (%) <3.25 61.4 3.25-3.4 68.7 3.4-3.55 73.9 >3.55 77.1

Thank You Slides by Kathleen Jelliffe

Alex Rodriguez 115.5 Albert Pujols 91.8 Derek Jeter 72.3 [2012 Hall-of-Famers] [70.2-70.6] Carlos Beltran 65.7 Roy Halladay 65.2 Adrian Beltre 65.0 Todd Helton 61.6 Andy Pettitte 58.6 Ichiro Suzuki 57.1 Tim Hudson 56.5 Above first line: in top 50, all-time Above second line: in top 100, all-time (through April 2013)