May the best (statistically chosen) team win! Danielle Pope


The Burning Question: What does the Pythagorean Expectation tell us, and how can the Pythagorean Expectation be improved?

Pythagorean Expectation of Winning Percentage
Formula invented by Bill James to estimate how many games a baseball team "should" have won, based on the number of runs it scored and allowed:
x^2/(x^2+y^2), where x = runs scored and y = runs allowed
The name derives from the formula's resemblance to the Pythagorean theorem, which gives the length of the hypotenuse of a right triangle from the lengths of its other two sides.

Example of Pythagorean Expectation Formula
2001 Arizona Diamondbacks
x = runs scored = 818
y = runs allowed = 677
P.E. winning percentage: x^2/(x^2+y^2) = (818^2)/(818^2+677^2) = .593
This means their winning percentage (WP) would be expected to be .593, when it was actually .568.

Pythagorean Expectation Continued
162 games in a season
(.593)*162 = 96 wins
Pythagorean record: 96-66
Actual record: 92-70
Based on the Pythagorean Expectation, the Diamondbacks should have won 4 more games than they actually did.
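The arithmetic on the two slides above can be checked with a short Python sketch. The function name and the `exponent` parameter are illustrative choices, not part of the original presentation, and rounding to whole wins follows the slide.

```python
def pythagorean_expectation(runs_scored, runs_allowed, exponent=2.0):
    """Expected winning percentage from runs scored and runs allowed."""
    scored = runs_scored ** exponent
    allowed = runs_allowed ** exponent
    return scored / (scored + allowed)


# 2001 Arizona Diamondbacks figures quoted on the slide.
GAMES_IN_SEASON = 162
expected_wp = pythagorean_expectation(818, 677)
expected_wins = round(expected_wp * GAMES_IN_SEASON)
expected_losses = GAMES_IN_SEASON - expected_wins

print(f"expected WP: {expected_wp:.3f}")
print(f"expected record: {expected_wins}-{expected_losses}")
```

Keeping the exponent as a parameter makes it easy to reuse the same function when other exponents are tried.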

Next Objective: Is there something significant about using the exponent 2?

Which exponent is best?
Methodology
Collect league statistics for teams from 1906 to 2006: runs scored, runs allowed, and winning percentages.
Find the Pythagorean Expectation of each team.
Determine which exponent works best by finding the sum of squares of the residuals over all teams, and repeating this for each exponent chosen.

Which exponent is best? Pythagorean Expectation, NL 2006

TEAM   RUNS SCORED   RUNS ALLOWED   WIN PERCENTAGE   PYTHAGOREAN EXPECTATION
ARI    773           788            0.469            0.490
ATL    849           805            0.488            0.527
CHC    716           834            0.407            0.424
CIN    749           801            0.494            0.466
COL    813           812            0.469            0.501
FLA    758           772            0.481            0.491
HOU    735           719            0.506            0.511
LAD    820           751            0.543            0.544
MIL    730           833            0.463            0.434
NYM    834           731            0.599            0.566
PHI    865           812            0.525            0.532
PIT    691           597            0.414            0.573
SDP    731           679            0.543            0.537
SFG    746           790            0.472            0.471
STL    781           762            0.516            0.512
WSN    746           872            0.438            0.423
TOTAL  12337         12358

Which exponent is best? Pythagorean Expectation, AL 2006

TEAM   RUNS SCORED   RUNS ALLOWED   WIN PERCENTAGE   PYTHAGOREAN EXPECTATION
BAL    768           899            0.432            0.422
BOS    820           825            0.531            0.497
CHW    868           794            0.556            0.544
CLE    870           782            0.481            0.553
DET    822           675            0.586            0.597
KCR    757           971            0.383            0.378
LAA    766           732            0.549            0.523
MIN    801           683            0.593            0.579
NYY    930           767            0.599            0.595
OAK    771           727            0.579            0.529
SEA    756           792            0.481            0.477
TBD    689           856            0.377            0.393
TEX    835           784            0.494            0.531
TOR    809           754            0.537            0.535
TOTAL  11262         11041

Which exponent is best?
Residual = Pythagorean WP - Actual WP
Example (2001 Arizona Diamondbacks): .593 - .568 = .025
Square of residual = (Pythagorean WP - Actual WP)^2
Example: (.025)^2 = .000625
The sum of squares is the total of these squared residuals over all teams.

Sum of Squares, exponent 2 (2006)

NATIONAL   SUM SQ(2)
ARI        0.000457603
ATL        0.001488676
CHC        0.000299624
CIN        0.000756837
COL        0.000999533
FLA        9.70312E-05
HOU        2.50279E-05
LAD        6.99742E-07
MIL        0.000818759
NYM        0.001120184
PHI        4.3198E-05
PIT        0.025152281
SDP        3.8077E-05
SFG        3.87315E-07
STL        1.36028E-05
WSN        0.000237299
TOTAL      0.03154882

AMERICAN   SUM SQ(2)
BAL        0.00010205
BOS        0.001158687
CHW        0.000133713
CLE        0.005201012
DET        0.000126755
KCR        2.47199E-05
LAA        0.000692466
MIN        0.000195573
NYY        1.46399E-05
OAK        0.002465403
SEA        1.80051E-05
TBD        0.000261071
TEX        0.001403978
TOR        3.4402E-06
TOTAL      0.011801513
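As a rough check on the residual calculation, the following Python sketch recomputes the NL 2006 sum of squares from the runs scored, runs allowed, and rounded winning percentages listed in the NL 2006 table. The function and variable names are illustrative assumptions, and the original work was presumably done in a spreadsheet or calculator rather than in code.

```python
# (runs scored, runs allowed, actual winning percentage) from the NL 2006 table.
NL_2006 = {
    "ARI": (773, 788, 0.469), "ATL": (849, 805, 0.488),
    "CHC": (716, 834, 0.407), "CIN": (749, 801, 0.494),
    "COL": (813, 812, 0.469), "FLA": (758, 772, 0.481),
    "HOU": (735, 719, 0.506), "LAD": (820, 751, 0.543),
    "MIL": (730, 833, 0.463), "NYM": (834, 731, 0.599),
    "PHI": (865, 812, 0.525), "PIT": (691, 597, 0.414),
    "SDP": (731, 679, 0.543), "SFG": (746, 790, 0.472),
    "STL": (781, 762, 0.516), "WSN": (746, 872, 0.438),
}


def pythagorean_expectation(runs_scored, runs_allowed, exponent):
    """Expected winning percentage for a given exponent."""
    scored = runs_scored ** exponent
    allowed = runs_allowed ** exponent
    return scored / (scored + allowed)


def sum_of_squares(teams, exponent):
    """Sum of squared residuals (Pythagorean WP - actual WP) over all teams."""
    total = 0.0
    for runs_scored, runs_allowed, actual_wp in teams.values():
        residual = pythagorean_expectation(runs_scored, runs_allowed, exponent) - actual_wp
        total += residual ** 2
    return total


nl_total = sum_of_squares(NL_2006, 2.0)
print(f"NL 2006 sum of squares, exponent 2: {nl_total:.6f}")
```

Passing a different exponent to `sum_of_squares` repeats the calculation for that exponent, which is the comparison the methodology calls for.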

Which exponent is best?
Add the sum of squares for each league, each year, to get the total sum of squares for exponent 2:

2006 NL   .03154882
2006 AL   .011801513
2001 NL   .012981895
2001 AL   .004850473
1996 NL   .004758341
1996 AL   .005388855
1991 NL   .00386536
1991 AL   .002701488
1921 NL   .001985464
1921 AL   .006720416
1916 NL   .003344076
1916 AL   .002204864
1911 NL   .014133613
1911 AL   .009422904
1906 NL   .007804765
1906 AL   .010944075

TOTAL SS: .329895525

Which exponent is best?
Try using 1.9 as the exponent in the formula: x^1.9/(x^1.9+y^1.9)

Sum of Squares, exponent 1.9 (2006)

NL         SUM SQ(1.9)
ARI        0.000478383
ATL        0.001388048
CHC        0.000442695
CIN        0.000667695
COL        0.000997588
FLA        0.000106251
HOU        1.98276E-05
LAD        1.80875E-06
MIL        0.000643551
NYM        0.001347683
PHI        2.49778E-05
PIT        0.024028863
SDP        6.40952E-05
SFG        6.49431E-07
STL        1.85206E-05
WSN        0.000134372
TOTAL      0.030365007

AL         SUM SQ(1.9)
BAL        3.91382E-05
BOS        0.001148364
CHW        0.000189734
CLE        0.004827567
DET        4.23841E-05
KCR        8.08756E-07
LAA        0.000753369
MIN        0.000319488
NYY        7.1868E-05
OAK        0.002612954
SEA        9.50262E-06
TBD        0.000455716
TEX        0.001288815
TOR        1.30081E-05
TOTAL      0.011772716

Which exponent is best?
Add the sum of squares for each league, each year, to get the total sum of squares for exponent 1.9:

2006 NL   .030365007
2006 AL   .011772716
2001 NL   .012521547
2001 AL   .005637482
1996 NL   .004973493
1996 AL   .005096314
1991 NL   .004029142
1991 AL   .002883718
1921 NL   .001580867
1921 AL   .006314575
1916 NL   .003114606
1916 AL   .002560682
1911 NL   .012795209
1911 AL   .009427654
1906 NL   .007010311
1906 AL   .009578747

TOTAL SS: .317387682

Since an exponent of 1.9 gives a smaller total sum of squares than an exponent of 2, what about trying many additional exponents to see if there is an even better estimate?

Sum of Squares Results

Exponent   Sum of Squares Total
1.5        0.403156
1.6        0.360806
1.7        0.332567
1.8        0.318185
1.83       0.316532
1.84       0.316251
1.85       0.316105
1.86       0.316094
1.87       0.316208
1.9        0.317388
2          0.329895
2.1        0.355416
2.2        0.393649
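Picking the best exponent from this table is a simple minimum search. A minimal Python sketch, with the totals copied from the table and the variable names chosen only for illustration:

```python
# Total sum of squares for each exponent tried, copied from the results table.
totals = {
    1.5: 0.403156, 1.6: 0.360806, 1.7: 0.332567, 1.8: 0.318185,
    1.83: 0.316532, 1.84: 0.316251, 1.85: 0.316105, 1.86: 0.316094,
    1.87: 0.316208, 1.9: 0.317388, 2.0: 0.329895, 2.1: 0.355416,
    2.2: 0.393649,
}

# The best exponent is the one with the smallest total sum of squares.
best_exponent = min(totals, key=totals.get)
print(best_exponent, totals[best_exponent])
```

This only compares the exponents that were actually tried, so the true minimum could lie between two of them.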

[Figure: Sum of Squares. Total sum of squares (vertical axis) plotted against Pythagorean Expectation exponent values (horizontal axis).]

Which exponent is best?
Finding the quadratic regression through the points on the graph gives:
y = .6717258774x^2 - 2.497560796x + 2.637625204
The vertex of this parabola, at x = 2.497560796/(2 × .6717258774), is approximately 1.86, so 1.86 is the exponent chosen as giving the best result.
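The regression step can be sketched in plain Python. The original presentation used a calculator's quadratic regression. The version below is an assumed equivalent that solves the least-squares normal equations directly, with `fit_quadratic` as a hypothetical helper name.

```python
def fit_quadratic(xs, ys):
    """Least-squares fit of y = a*x^2 + b*x + c via the normal equations."""
    # Power sums S_k = sum(x^k) and moments T_k = sum(x^k * y).
    s = [sum(x ** k for x in xs) for k in range(5)]
    t = [sum((x ** k) * y for x, y in zip(xs, ys)) for k in range(3)]
    # Augmented 3x3 linear system in the unknowns (a, b, c).
    m = [
        [s[4], s[3], s[2], t[2]],
        [s[3], s[2], s[1], t[1]],
        [s[2], s[1], s[0], t[0]],
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for row in range(3):
            if row != col:
                factor = m[row][col] / m[col][col]
                m[row] = [rv - factor * cv for rv, cv in zip(m[row], m[col])]
    return tuple(m[i][3] / m[i][i] for i in range(3))


# Exponents tried and their total sums of squares, from the results table.
exponents = [1.5, 1.6, 1.7, 1.8, 1.83, 1.84, 1.85, 1.86, 1.87, 1.9, 2.0, 2.1, 2.2]
totals = [0.403156, 0.360806, 0.332567, 0.318185, 0.316532, 0.316251, 0.316105,
          0.316094, 0.316208, 0.317388, 0.329895, 0.355416, 0.393649]

a, b, c = fit_quadratic(exponents, totals)
vertex = -b / (2 * a)  # x-coordinate of the parabola's minimum
print(f"y = {a:.6f}x^2 + {b:.6f}x + {c:.6f}, vertex at x = {vertex:.3f}")
```

The fitted coefficients may differ slightly from the calculator's in the last digits, but the vertex should land close to 1.86.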

[Figure: Sum of Squares. Total sum of squares (vertical axis) plotted against Pythagorean Expectation exponent values (horizontal axis), with the fitted parabola y = .6717258774x^2 - 2.497560796x + 2.637625204.]
On the calculator, this quadratic equation is the parabola of best fit for these points. The parabola passes very close to all of the points.

Conclusion
Based on the sums of squares for the years used, an exponent of 1.86 gives a better estimate than an exponent of 2.
Several baseball sites that use the Pythagorean Expectation also report using exponents other than 2, without giving their reasons:
"Empirically, this formula correlates fairly well with how baseball teams actually perform, although an exponent of 1.81 is slightly more accurate." - Wikipedia
"Many sabermetricians feel the results will be more accurate if, instead of SQUARING the numbers, those results are actually calculated to the power of 1.83." - Bryan P. Douglass, FantasyBaseball.com

Pitching Wins Championships
"The menu for a championship in baseball is short. Good defense and good pitching." - Rob Parker, Detroit News
"It's unusual for a below-average pitching team to make it to the World Series, but you can be a below-average offensive team and make it." - Jeff Merron and David Schoenfield, ESPN

Pitching Wins Championships
The Pythagorean Expectation suggests why pitching might win a championship. Take the partial derivatives of the Pythagorean Expectation to find how scoring one more run and allowing one more run each affect the overall winning percentage.

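Partial Derivative for Pythagorean Expectation with respect to x
With W = x^2/(x^2+y^2):
∂W/∂x = 2xy^2/(x^2+y^2)^2 = y * (2xy/(x^2+y^2)^2)

Partial Derivative for Pythagorean Expectation with respect to y
∂W/∂y = -2x^2y/(x^2+y^2)^2 = -x * (2xy/(x^2+y^2)^2)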

Comparison between the partial derivatives
Notice that the expressions inside the parentheses are identical, so the two derivatives differ only in the factor outside the parentheses and in sign.
x = runs scored
y = runs allowed
When x > y (winning teams, with a winning percentage above .500), the team improves its winning percentage more by allowing one fewer run than by scoring one more run. This may be why many say pitching wins championships.
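The comparison can be checked numerically. The Python sketch below assumes the exponent-2 formula W = x^2/(x^2+y^2) and uses its standard partial derivatives, 2xy^2/(x^2+y^2)^2 with respect to x and -2x^2y/(x^2+y^2)^2 with respect to y. The function names are illustrative, and the 2001 Diamondbacks run totals are reused as sample input.

```python
def win_pct(x, y):
    """Pythagorean Expectation with exponent 2."""
    return x ** 2 / (x ** 2 + y ** 2)


def d_win_d_scored(x, y):
    """Partial derivative of win_pct with respect to runs scored (x)."""
    return 2 * x * y ** 2 / (x ** 2 + y ** 2) ** 2


def d_win_d_allowed(x, y):
    """Partial derivative of win_pct with respect to runs allowed (y)."""
    return -2 * x ** 2 * y / (x ** 2 + y ** 2) ** 2


# A winning team (x > y): the 2001 Arizona Diamondbacks.
x, y = 818, 677
gain_from_scoring = d_win_d_scored(x, y)        # effect of one more run scored
gain_from_preventing = -d_win_d_allowed(x, y)   # effect of one fewer run allowed

print(f"one more run scored:   +{gain_from_scoring:.6f}")
print(f"one fewer run allowed: +{gain_from_preventing:.6f}")
print(f"ratio: {gain_from_preventing / gain_from_scoring:.4f} (equals x/y)")
```

The ratio of the two effects is x/y, so the advantage of run prevention grows as a team's run differential grows, and it reverses for teams that allow more runs than they score.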

Does Data from the World Series Illustrate That Fact?
To find out, calculate the mean (µ) and standard deviation (σ) of runs scored and of runs allowed for all teams in the league, and use them to find two z-scores for the league championship team. Then compare the two z-scores to determine whether the team with the better pitching z-score (runs allowed) in fact won the World Series.

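Z-Score Calculations
First, find the mean and standard deviation of runs scored and runs allowed for each league, each year.
Next, plug those numbers into the z-score formula.
Z-Score: z = (x - µ)/σ, where x is the team's value, µ is the league mean, and σ is the league standard deviation.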
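A minimal Python sketch of the z-score step, using the NL 2006 runs-scored column from the earlier table. It assumes the sample standard deviation (`statistics.stdev`), since the slides do not say whether the sample or the population form was used. The names are illustrative.

```python
from statistics import mean, stdev

# NL 2006 runs scored, copied from the NL 2006 table.
nl_2006_runs_scored = {
    "ARI": 773, "ATL": 849, "CHC": 716, "CIN": 749, "COL": 813, "FLA": 758,
    "HOU": 735, "LAD": 820, "MIL": 730, "NYM": 834, "PHI": 865, "PIT": 691,
    "SDP": 731, "SFG": 746, "STL": 781, "WSN": 746,
}


def z_score(value, league_values):
    """Standard score of one team's value relative to the whole league.

    Assumption: uses the sample standard deviation (n - 1 in the denominator).
    """
    return (value - mean(league_values)) / stdev(league_values)


league = list(nl_2006_runs_scored.values())
stl_z = z_score(nl_2006_runs_scored["STL"], league)
print(f"STL runs-scored z-score: {stl_z:.4f}")
```

A positive z-score for runs scored means above-average hitting. For runs allowed, a negative z-score is the better result, since it means fewer runs allowed than the league average.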

YEAR   NL    Z-SCORE RS     Z-SCORE RA     AL    Z-SCORE RS     Z-SCORE RA
2006   STL    0.194733344   -1.638839605   DET    0.290790956   -1.412501077
2001   ARI    0.780931288   -1.151858161   NYY    0.205456501   -0.777743611
1996   ATL    0.204840227   -1.233741957   NYY   -0.012615617   -0.944884237
1991   ATL    1.564589336   -0.423192354   MIN    0.707827798   -1.191288926
1986   NYM    1.996219461   -1.471314568   BOS    0.779098388   -0.893670799
1981   LAD    0.69635972    -1.447698188   NYY   -0.319454682   -2.030715838
1976   CIN    2.339930493   -0.175717437   NYY    1.24462412    -1.372467953
1971   PIT    1.865925943   -0.577659308   BAL    1.543223353   -1.533702254
1966   LAD   -0.789771733   -1.906833885   BAL    1.860599854   -0.43042547
1961   CIN    0.185148906   -0.75615602    NYY    1.395009492   -1.41882585
1956   BRO    0.772041345   -1.047387634   NYY    1.401748248   -0.779630861
1951   NYG    0.930312174   -0.760341691   NYY    1.293369515   -1.00588321
1946   STL    1.434625642   -1.038400753   BOS    1.773559645   -0.562098783
1941   BRO    1.558896226   -0.971415458   NYY    1.184313269   -1.321640337
1936   NYG    0.207181637   -1.230004921   NYY    1.729289013   -1.192730924
1931   STL    1.164011433   -1.081042178   PHA    0.432341967   -1.682430111
1926   STL    1.652217289   -0.267554315   NYY    1.317940177   -0.149208513
1921   PIT   -0.145545839   -1.078596345   NYY    1.305893272   -0.978745281
1916   BRO    1.029260748   -0.963833913   BOS   -0.371233954   -1.035533309
1911   NYG    0.943844174   -0.965239152   PHA    1.570683422   -1.374934551
1906   CHC    1.621410355   -1.865425992   CHW    0.122730548   -1.158494878

How do the winners win?
Pitching (7): 1906, 1916, 1921, 1941, 1946, 1991, 2006
Hitting (7): 1931, 1936, 1956, 1966, 1971, 1976, 1981
Both (6): 1911, 1926, 1951, 1961, 1986, 2001
Neither (1): 1996
Based on the z-scores calculated in this research, there is not enough evidence to conclude that pitching, in fact, wins championships.

Rounding 3rd and Heading for Home
The resulting data shows that the Pythagorean Expectation formula can be improved by using a slightly more complicated exponent than 2. Which exponent works best may vary from year to year, but over the data used, 1.86 seems the best fit.
Partial differentiation of the Pythagorean Expectation shows that, to become an even better winning team, you are better off allowing one fewer run than scoring one more run.
The saying "pitching wins championships" doesn't necessarily hold true for the data collected in this project. Pitching certainly helps with getting to the championship, but doesn't ensure a win.

References
Baseball Reference. Sports Reference, Inc. 2006. October 2006. <http://www.baseball-reference.com/>.
Douglass, Bryan P. FB501: Advanced Sabermetrics. Fantasy Baseball. 24 Feb. 2005. Nov. 2006. <http://www.fantasybaseball.com/modules/wfsection/article.php?articleid=71>.
Merron, Jeff and David Schoenfield. Playoff Theories: do they hold up? ESPN.com. 29 Sept. 2005. Nov. 2006. <http://sports.espn.go.com/espn/page2/story?page=baseball/one/051003>.
Pythagorean Expectation. Wikipedia. 26 Oct. 2006. Nov. 2006. <http://en.wikipedia.org/wiki/pythagorean_expectation>.