Bayesball: A Bayesian Hierarchical Model for Evaluating Fielding in Baseball

Size: px
Start display at page:

Download "Bayesball: A Bayesian Hierarchical Model for Evaluating Fielding in Baseball"

Transcription

1 Bayesball: A Bayesian Hierarchical Model for Evaluating Fielding in Baseball Shane T. Jensen Department of Statistics, The Wharton School, University of Pennsylvania May 23, 2011 Joint Work with Kenny Shirley, James Piette and Abraham Wyner

2 Quantifying Fielding Performance in Baseball Overall goal: accurate evaluation of the fielding performance of each major league baseball player Official MLB Method: Errors and Fielding Percentage Only punishes for bad plays, does not reward for good plays or account for difficulty of successful plays Better Methods: Ultimate Zone Rating (M. Lichtman) and Probabilistic Model of Range (D. Pinto) divide field into large zones and tabulates of successful vs. unsuccessful plays for each fielder within zones Weakness of these methods: field not treated as single continuous surface Each zone/slice is large which limits ability to detect small differences between fielders

3 Baseball Info Solutions (BIS) Data Ball-in-play data available via Baseball Info Solutions 7 seasons (02-08) with balls-in-play (BIP) per year Three BIP types: 42% grounders, 33% flys, 25% liners Each BIP is mapped to a more resolute area than zones of previous methods BIP velocity information as ordinal category Flyballs Caught by CF Flyballs Not Caught by CF Y Coordinate Y Coordinate X Coordinate X Coordinate

4 Smooth Fielding Curves High-resolution data allows us to fit smooth fielding curves to the continuous playing surface Flyballs Liners Groundballs Y Coordinate Y Coordinate Y Coordinate X Coordinate X Coordinate X Coordinate Instead of non-parametric contours, we will use a hierarchical parametric model for fielding curves Hierarchical models allow for principled sharing of information within and between individual players

5 Bernoulli Successes and Failures The outcome of each play is either a success or failure: { 1 if the j th BIP hit to the i th player leads to out S ij = 0 if the j th BIP hit to the i th player leads to hit Observed successes and failures are modeled as Bernoulli realizations from an underlying probability: S ij Bernoulli(p ij ) Each p ij is a function of available data for that BIP: (x, y) ij location, velocity V ij and type of the BIP These probability functions will be smooth parametric curves that can vary between different players

6 Representation for Different BIP Types Two-dimensional curves needed for flys/liners: success depends on velocity, direction and distance to BIP One-dimensional curves needed for grounders: success depends on velocity, direction and angle to BIP Flyballs and Liners Grounders Y Coordinate Backward Forward CF Location at (0,324) Distance BIP Location Y Coordinate SS Location Grounder Trajectory Left Right θ Angle X Coordinate X Coordinate

7 Probit function for each smooth curve Probit regression used to model smooth curves for probability p ij of successfully fielding BIP j by player i Φ( ) = CDF of Normal distribution Probit function for fly-balls/liners: p ij = Φ(β i0 + β i1 D ij + β i2 D ij F ij + β i3 D ij V ij + β i4 D ij V ij F ij ) = Φ(X ij β i ) D ij = distance to BIP, V ij = vel, F ij = 1 if forward (vs. back) Probit function for grounders: p ij = Φ(β i0 + β i1 θ ij + β i2 θ ij L ij + β i3 θ ij V ij + β i4 θ ij V ij L ij ) = Φ(X ij β i ) θ ij = angle to BIP, V ij = velocity, L ij = 1 if left (vs. right)

8 Fitting Aggregate Model at each Position Standard probit regression software can be used to calculate MLE of coefficients β Can fit coefficients β + of aggregate model using data from all players at each position. Baseline probability of making a play for each infield position Probability B SS P 2B 1B Degrees Nice to compare between positions, but more interesting to compare between individual players at each position!

9 Individual Models for Grounders Can fit MLE coefficients β i of player-specific model using only data for player i Compare infielders based on MLE curves for grounders P(Success) for Everett, Jeter vs. average SS 1.0 P(Success) Average Jeter Everett rd Base SS Location 2nd Base 1st Base Degrees from SS

10 Individual Models for Fly/Liners Can again fit MLE coefficients β i of player-specific model using only data for player i Compare outfielders based on MLE curves for flys or liners

11 Problems with MLE Estimation Ideally want curve variability, not just MLE estimate Small samples for some players leads to unstable estimates of individual curves 2005 Centerfielder Curves Success Probability Individual Aggregate Distance Bayesian hierarchical model: shares information (and models variability) instead of separate estimation of player

12 Bayesian Hierarchical Model Player-specific coefficient vector now share common prior distribution across all players at a position: β i Normal(µ,Σ) µ is unknown mean vector and Σ unknown covariance matrix shared across all players Need prior for shared population parameters µ and Σ Assume a priori independence of β i components so Σ has off-diagonal elements of zero and diagonal elements σ 2 k Use recommended (Gelman, 2006) non-informative prior distribution: p(µ k,σ k ) 1 k = 0,...,4 Note that a priori independence will still lead to posterior dependence between β i components.

13 Model Implementation For each position and BIP-type, we need to estimate full posterior distribution of parameters p(β,µ,σ S, X) p(s β, X) p(β µ,σ) p(µ,σ) We estimate each posterior distribution using MCMC methods, specifically Gibbs sampling Iteratively sample from conditional distribution of each parameter set given other parameter values Data augmentation for Gibbs sampling with probit models: augment data with Z ij Normal(X ij β i, 1) and observe that P(S ij = 1) = Φ(X ij β i ) = P(Z ij 0) Z are a representation of our outcome data S that is now conjugate with the rest of our hierarchical model

14 Gibbs Sampling Steps 1 Sample Z ij N(X ij β i, 1) with truncation: sample Z ij until Z ij 0 (if S ij = 1) or Z ij < 0 (if S ij = 0) 2 Sample player coefficients β i from β i Normal [(Σ 1 + X i X i ) 1 (Σ 1 µ + X i Z i ), (Σ 1 + X i X i ) 1] 3 Sample population means µ k from ( m ) i=1 µ k Normal β ik, σ2 k m m k = 0,...,4 4 Sample population variances σk 2 from m (β σk 2 ik µ k ) 2 Inv Gamma (m 1) i=1, 2 2 k = 0,...,4

15 Results for Population Parameters As an example, examine µ for flyballs hit to CFs in 2005 Estimates and 95% Intervals for µ Intercept Distance Velocity Forward Posterior Coefficient Not surprisingly, coefficients for distance and velocity are negative: decreasing P(catch) with higher values Distance is clearly the predictor that explains the most variation in the outcome

16 Examining Shrinkage of Player Coefficients Hierarchical model shrinks each β i towards population µ Intercept Distance No Pooling No Pooling Partial Pooling Partial Pooling Complete Pooling Complete Pooling No Pooling Velocity No Pooling Forward Partial Pooling Partial Pooling Complete Pooling Complete Pooling No pooling = MLE, Partial pooling = Hierarchical model, Complete pooling = µ

17 Examining Shrinkage of Player Curves We can also examine shrinkage of player-specific curves (a) No Pooling (MLE)) (b) Partial Pooling (c) Complete Pooling P(Catch) P(Catch) P(Catch) Distance Distance Amount of shrinkage depends on sample size n i and variation in performance through X i and Z i Still heterogeneity between players even after shrinkage Distance

18 Numerical Summary of Overall Performance Beyond comparing curves between players, we can derive an overall numerical estimate of fielder performance SAFE: Spatial Aggregate Fielding Evaluation For each player, aggregate differences between individual curve (based on β i ) and overall curve (based on µ) Aggregation done by numerical integration over fine grid of values (1D grid for grounders, 2D grid for flys/liners) Can calculate SAFE for each sample from posterior distribution of β i, giving us the posterior mean and 95% posterior interval of SAFE for each player

19 Differential Weighting in SAFE Our full aggregation also weights grid points by BIP frequency, run value, and shared consequence (a) P(Success) for Jeter vs. Average (b) Density Estimate of Grounder Angle 1.0 P(Success) Average Jeter Density 0.0 3rd Base SS Location 2nd Base 1st Base 3rd Base SS Location 2nd Base 1st Base Degrees from SS (c) Run Consequence for Grounders Degrees from SS (d) Shared Responsibility of SS Runs Responsibility Fraction rd Base SS Location 2nd Base 1st Base 0.0 3rd Base SS Location 2nd Base 1st Base Degrees from SS Degrees from SS SAFE value: runs saved/cost of fielder vs. average

20 Results for Corner Infielders: Best/Worst Posterior SAFE values Ten Best 1B Player-Years Ten Best 3B Player-Years Name and Year Mean 95% Interval Name and Year Mean 95% Interval Doug Mientkiewicz, ( 2.8, 11.3 ) Marco Scutaro, ( 10.0, 16.6 ) Andy Phillips, ( 2.6, 11.4 ) Mark Bellhorn, ( 4.0, 17.1 ) Rich Aurilia, ( 2.7, 10.2 ) Hank Blalock, ( 4.2, 16.5 ) Albert Pujols, ( 3.1, 8.2 ) Sean Burroughs, ( 3.4, 14.2 ) Doug Mientkiewicz, ( 1.8, 9.1 ) David Bell, ( 1.7, 13.3 ) Albert Pujols, ( 1.9, 8.1 ) Scott Rolen, ( 1.9, 12.1 ) Kendry Morales, ( -0.5, 10.3 ) Hank Blalock, ( 1.4, 11.3 ) Ken Harvey, ( 1.5, 8.0 ) Damian Rolls, ( 0.1, 13.6 ) Howie Kendrick, ( -0.8, 9.6 ) Pedro Feliz, ( 0.5, 13.3 ) Albert Pujols, ( 1.0, 6.8 ) Joe Crede, ( 0.0, 15.8 ) Ten Worst 1B Player-Years Ten Worst 3B Player-Years Name and Year Mean 95% Interval Name and Year Mean 95% Interval Richie Sexson, ( -8.2, -1.9 ) Eric Munson, ( -12.4, -2.8 ) Robert Fick, ( -11.3, 2.0 ) Michael Cuddyer, ( -11.4, -2.9 ) Mo Vaughn, ( -9.7, -0.3 ) Michael Cuddyer, ( -14.1, -2.3 ) Dmitri Young, ( -9.9, 0.1 ) Garrett Atkins, ( -12.4, -2.4 ) Tony Clark, ( -11.7, -1.6 ) Fernando Tatis, ( -14.2, -2.0 ) Fred McGriff, ( -9.4, -2.8 ) Chone Figgins, ( -18.7, -1.4 ) Mike Jacobs, ( -9.4, -2.9 ) Travis Fryman, ( -15.2, -4.4 ) Ben Broussard, ( -10.4, -2.2 ) Joe Randa, ( -17.3, -2.8 ) Nomar Garciaparra, ( -11.1, -3.5 ) Ryan Braun, ( -17.4, -2.9 ) Jason Giambi, ( -13.4, -3.2 ) Jose Bautista, ( -17.4, -5.9 )

21 Results for Middle Infielders: Best/Worst Posterior SAFE values Ten Best 2B Player-Years Ten Best SS Player-Years Name and Year Mean 95% Interval Name and Year Mean 95% Interval Julius Matos, ( 12.4, 22.1 ) Pokey Reese, ( 12.0, 31.2 ) Erick Aybar, ( 10.0, 24.6 ) Adam Everett, ( 10.4, 27.4 ) Junior Spivey, ( 4.7, 27.1 ) Adam Everett, ( 9.0, 21.8 ) Tony Graffanino, ( 4.6, 27.6 ) Craig Counsell, ( 6.9, 21.1 ) Adam Kennedy, ( 1.7, 18.6 ) Jorge Velandia, ( 3.0, 24.0 ) Willie Bloomquist, ( 4.3, 17.8 ) Alex Cora, ( 3.0, 24.6 ) Jose Valentin, ( 4.2, 17.9 ) Alex Rodriguez, ( 3.5, 24.4 ) Chase Utley, ( 5.7, 17.5 ) Maicer Izturis, ( 3.8, 22.2 ) Chase Utley, ( 3.1, 17.7 ) Marco Scutaro, ( 4.0, 20.1 ) Craig Counsell, ( 5.3, 18.0 ) Brent Lillibridge, ( 5.0, 19.1 ) Ten Worst 2B Player-Years Ten Worst SS Player-Years Name and Year Mean 95% Interval Name and Year Mean 95% Interval Ronnie Belliard, ( -19.5, 2.6 ) Erick Almonte, ( -26.9, 2.3 ) Geoff Blum, ( -17.5, -1.7 ) Derek Jeter, ( -21.7, -5.8 ) Miguel Cairo, ( -17.9, -3.1 ) Michael Morse, ( -23.0, -4.5 ) Terry Shumpert, ( -22.2, 0.7 ) Damian Jackson, ( -30.6, -3.5 ) Roberto Alomar, ( -19.3, -4.6 ) Brandon Fahey, ( -22.4, -8.2 ) Enrique Wilson, ( -18.9, -6.2 ) Marco Scutaro, ( -22.0, ) Alberto Callaspo, ( -20.4, -4.5 ) Derek Jeter, ( -24.8, -6.4 ) Dave Berg, ( -25.1, -2.4 ) Michael Young, ( -23.6, -7.2 ) Luis Rivas, ( -20.9, -6.4 ) Josh Wilson, ( -26.5, -6.4 ) Bret Boone, ( -22.4, -8.1 ) Derek Jeter, ( -29.1, -9.2 )

22 Results for Outfielders: Best/Worst Posterior SAFE values Ten Best Left Fielders Ten Best Center Fielders Ten Best Right Fielders Name and Year Mean 95% Interval Name and Year Mean 95% Interval Name and Year Mean 95% Interval E Brown, ( 2.2, 27.9 ) J Michaels, ( 3.3, 32.5 ) G Matthews Jr., ( 5.7, 22.3 ) D Dellucci, ( 5.7, 20.4 ) C Figgins, ( 3.8, 31.2 ) D Mohr, ( 2.3, 28.0 ) R Johnson, ( 2.3, 21.0 ) J Hairston Jr., ( 0.3, 28.6 ) T Nixon, ( 3.3, 18.1 ) C Crisp, ( 4.1, 17.8 ) A Jones, ( 2.2, 20.7 ) G Matthews Jr., ( 2.6, 19.0 ) S Hairston, ( 1.1, 23.5 ) D Glanville, ( -3.1, 30.5 ) R Langerhans, ( 4.6, 19.3 ) S Podsednik, ( 6.1, 17.8 ) J Payton, ( 0.0, 17.8 ) T Nixon, ( 0.7, 19.4 ) M Byrd, ( 0.5, 22.7 ) J Edmonds, ( -0.5, 20.5 ) A Escobar, ( -0.1, 19.3 ) G Vaughn, ( 1.9, 16.9 ) J Gathright, ( -6.6, 25.0 ) A Ochoa, ( -2.3, 20.9 ) O Palmeiro, ( 0.8, 22.2 ) D Erstad, ( -1.2, 20.7 ) E Marrero, ( -0.9, 20.7 ) T Long, ( -0.8, 21.7 ) C Patterson, ( 1.9, 17.9 ) J Drew, ( -1.4, 20.8 ) Ten Worst Left Fielders Ten Worst Center Fielders Ten Worst Right Fielders Name and Year Mean 95% Interval Name and Year Mean 95% Interval Name and Year Mean 95% Interval K Mench, ( -19.4, -2.3 ) C Hermansen, ( -23.6, 4.3 ) G Kapler, ( -13.2, -1.6 ) K Mench, ( -14.9, -7.3 ) D Roberts, ( -21.0, 2.2 ) L Walker, ( -17.2, 1.2 ) A Piatt, ( -16.8, -6.9 ) R Ledee, ( -19.6, 0.5 ) J Guillen, ( -17.0, 0.7 ) L Berkman, ( -16.7, -8.2 ) K Griffey Jr., ( -24.4, -1.3 ) K Mench, ( -17.7, 0.7 ) M Ramirez, ( -19.1, -5.4 ) B Williams, ( -24.5, -3.1 ) E Kingsale, ( -13.1, -2.9 ) R Sierra, ( -16.2, ) S Green, ( -28.3, 2.8 ) W Pena, ( -17.6, 0.2 ) B Kielty, ( -16.7, -9.1 ) L Terrero, ( -29.4, 5.8 ) C Wilson, ( -22.1, -2.6 ) T Womack, ( -25.0, -5.0 ) B Williams, ( -23.4, -5.3 ) J Gonzalez, ( -16.4, ) L Berkman, ( -18.9, ) M Grissom, ( -34.2, -9.4 ) M Tucker, ( -21.8, -8.1 ) M Ramirez, ( -24.8, ) J Cruz, ( -36.2, -5.4 ) Sheffield, ( -21.6, -9.5 )

23 Sharing Information Over Time Want to share information across multiple years within each fielder (up to 7 seasons per player) Constant Ability Model: each season for a player is a realization from same underlying ability γ i β it Normal(γ i,τ 2 ) If players are consistent across time, our estimation of γ i should be more precise than seasonal β it s Still share information between players at the same position with common prior on γ i γ i Normal(µ,σ 2 )

24 Constant Ability Model Adding level to previous hierarchy to group player-seasons by player Position (µ) = average curve across that position. Player-Overall (γ i ) = average curve for player i. Player-Season (β it ) = average curve for player i in year t.

25 Constant Ability Model: Infielder Examples Infielder Grounders SAFE runs Chase Utley Luis Castillo Derek Jeter Overall Individual Years

26 Constant Ability Model: Outfielder Examples Outfielder Fly Balls Andruw Jones Jim Edmonds Manny Ramirez SAFE runs Overall Individual Years

27 Alternative: Moving Average By Age Model Assume that player ability is tied to an overall age-specific mean µ a for all players at that age. β ia Normal(µ a,σ 2 ) Information is shared across all seasons of players at that particular age.

28 Moving Average Model: Infielder Examples

29 Moving Average Model: Outfielder Examples

30 Model Comparison: Posterior predictive validation Predicted deviation: predicted probability of an out versus the actual outcome for all BIPs in a test dataset. 1 Split data into training ( 02-07) and testing ( 08). 2 Training data: get posterior samples from each model for each player 3 Testing data: calculate probability p ij of an out on each BIP using posterior samples 4 Predicted deviation is difference between predicted probability and actual outcome: D i = j y ij p ij, n i 5 Predicted deviation averaged across all posterior samples Compare models with Win% = proportion of players for which each model has smallest predicted deviations

31 Model Comparison: Results Moving average vs. Constant vs. Combined vs. MLE MA model performs best for most BIP x positions

32 Alternative: Autoregressive Model Add more individual specificity by modeling evolution over time within an individual fielder β it Normal(φ ait β i,t 1, τ 2 a it ) Player age a it incorporated through φ ait and τ ait parameters Allows autocorrelation within a player, but there is also age-dependent discounting that is shared across players through the φ a parameters. Can use dynamic linear model framework along with Kalman sampler implementation (current work)

33 Summary BIP data allows more detailed examination of differences between players Parametric approach: smooth probability function reduces variance of results by sharing information between all points near to a fielder Hierarchical structure also allows sharing of information between players at each position SAFE run value aggregates individual differences while weighting for BIP frequency, run value, and shared consequence between positions Lots of recent work on extending model to account for multiple seasons over time per player

34 Thank you! Paper: Bayesball: a Bayesian hierarchical model for evaluating fielding in major league baseball. Annals of Applied Statistics 3: SAFE Project Website: stjensen/research/safe.html Google search: shane jensen safe

35 Player Starting Positions Glossed over reality: we don t actually know where fielders are located when a particular BIP is hit! Each distance/angle is an estimated value, not a known value since starting location is not truly known Starting location for each position fixed at point with highest overall success probability Problematic when fielders are systematically different in their positioning e.g. 1B when there is a man on base Ideally, would build information about defensive shifts and men on base into model

36 Differences between Ballparks Current analysis does not take into account differences in the playing field for different parks Could impact both evaluation of infielders (turf vs. grass) and outfielders (different outfield shapes) Park-specific BIP densities could account for differences in shape but may have higher variance (less data)

37 Internal Validation: Posterior Predictive Simulation Can explore model fit with posterior predictive simulation Compare statistic of observed data to same statistic from data generated from model Repeatedly simulating datasets from model gives a posterior predictive distribution for observed statistic Look for systematic differences between data generated from model vs. observed data Substantial differences suggest aspect of data that model is failing to capture As an example, compare our hierarchical model (partial pooling) to complete pooling model with no heterogeneity between players

38 Posterior Predictive: Partial Pooling vs. Complete Pooling 500 datasets from partial & complete pooling models Statistic: % of flyballs caught for best CF minus worst CF Statistic for observed data indicated by vertical line Percentage Difference Distributions Partial Pooling Complete Pooling 15 Density Percentage Difference Partial pooling much better fit to observed data

39 External Validation: Comparison to UZR Decent overall agreement between SAFE and UZR Overall correlation between SAFE and UZR around 0.4 No gold standard for comparison, but can examine correlation between years (2002 vs. 2003) POS SAFE UZR DIFF 1B B B SS CF LF RF TOTAL Outfield is a big strength for SAFE, Infield has poorer performance (except for good 3B correlation)

Statistics Lecture 25. Advanced Statistical Research in Baseball. Administrative Notes. Measuring Fielding in Baseball: Present and Future

Statistics Lecture 25. Advanced Statistical Research in Baseball. Administrative Notes. Measuring Fielding in Baseball: Present and Future Administrative Notes Statistics 111 - Lecture 25 Advanced Statistical Research in Baseball Homework 7 due in recitation on Friday, April 24 Recitation on Friday, April 24 th is mandatory Recitation on

More information

By Shane T. Jensen, Kenneth E. Shirley and Abraham J. Wyner. University of Pennsylvania

By Shane T. Jensen, Kenneth E. Shirley and Abraham J. Wyner. University of Pennsylvania The Annals of Applied Statistics 2009, Vol. 3, No. 2, 491 520 DOI: 10.1214/08-AOAS228 c Institute of Mathematical Statistics, 2009 BAYESBALL: A BAYESIAN HIERARCHICAL MODEL FOR EVALUATING FIELDING IN MAJOR

More information

Bayesball: A Bayesian Hierarchical Model for Evaluating Fielding in Major League Baseball

Bayesball: A Bayesian Hierarchical Model for Evaluating Fielding in Major League Baseball University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 2009 Bayesball: A Bayesian Hierarchical Model for Evaluating Fielding in Major League Baseball Shane T. Jensen University

More information

Journal of Quantitative Analysis in Sports

Journal of Quantitative Analysis in Sports Journal of Quantitative Analysis in Sports Manuscript 1463 Estimating Fielding Ability in Baseball Players Over Time James Piette, University of Pennsylvania Shane T. Jensen, University of Pennsylvania

More information

Evaluating Throwing Ability in Baseball

Evaluating Throwing Ability in Baseball University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 7-2007 Evaluating Throwing Ability in Baseball Matthew Carruth University of Pennsylvania Shane T. Jensen University

More information

A Bayesian Hierarchical Model of Pitch Framing in Major League Baseball

A Bayesian Hierarchical Model of Pitch Framing in Major League Baseball A Bayesian Hierarchical Model of Pitch Framing in Major League Baseball Sameer K. Deshpande and Abraham J. Wyner Statistics Department, The Wharton School University of Pennsylvania 1 August 2016 Introduction

More information

Quantifying the Effect of The Shift in Major League Baseball

Quantifying the Effect of The Shift in Major League Baseball Bard College Bard Digital Commons Senior Projects Spring 2017 Bard Undergraduate Senior Projects 2017 Quantifying the Effect of The Shift in Major League Baseball Christopher John Hawke Jr. Bard College

More information

Hierarchical Bayesian Modeling of Hitting Performance in Baseball

Hierarchical Bayesian Modeling of Hitting Performance in Baseball Bayesian Analysis (2004) 1, Number 1, pp. 1 17 Hierarchical Bayesian Modeling of Hitting Performance in Baseball Shane T. Jensen Department of Statistics The Wharton School University of Pennsylvania stjensen@wharton.upenn.edu

More information

arxiv: v1 [stat.ap] 22 May 2007

arxiv: v1 [stat.ap] 22 May 2007 EVALUATING THROWING ABILITY IN BASEBALL MATTHEW CARRUTH and SHANE T. JENSEN 2 Abstract arxiv:0705.3257v [stat.a] 22 May 2007 We present a quantitative analysis of throwing ability for major league outfielders

More information

Measuring Baseball Defensive Value using Statcast Data

Measuring Baseball Defensive Value using Statcast Data Measuring Baseball Defensive Value using Statcast Data by Drew Jordan Department of Statistical Science Duke University Date: Approved: Colin Rundel, Supervisor Sayan Mukherjee Brian Hare Thesis submitted

More information

arxiv: v2 [stat.ap] 1 Jun 2007

arxiv: v2 [stat.ap] 1 Jun 2007 EVALUATING THROWING ABILITY IN BASEBALL MATTHEW CARRUTH 1 and SHANE T. JENSEN 2 Abstract arxiv:0705.3257v2 [stat.a] 1 Jun 2007 We present a quantitative analysis of throwing ability for major league outfielders

More information

arxiv: v1 [stat.ap] 9 Feb 2009

arxiv: v1 [stat.ap] 9 Feb 2009 HIERARCHICAL BAYESIAN MODELING OF HITTING PERFORMANCE IN BASEBALL SHANE T. JENSEN 1, BLAKE MCSHANE 2 and ABRAHAM J. WYNER 3 Abstract arxiv:0902.1360v1 [stat.ap] 9 Feb 2009 We have developed a sophisticated

More information

George F. Will, Men at Work

George F. Will, Men at Work Part of baseball s charm is the illusion it offers that all aspects of it can be completely reduced to numerical expressions and printed in agate type in the sport section. George F. Will, Men at Work

More information

The Intrinsic Value of a Batted Ball Technical Details

The Intrinsic Value of a Batted Ball Technical Details The Intrinsic Value of a Batted Ball Technical Details Glenn Healey, EECS Department University of California, Irvine, CA 9617 Given a set of observed batted balls and their outcomes, we develop a method

More information

Lesson 1 Pre-Visit Batting Average

Lesson 1 Pre-Visit Batting Average Lesson 1 Pre-Visit Batting Average Objective: Students will be able to: Calculate a batting average. Review prior knowledge of conversion between fractions, decimals, and percentages. Use basic linear

More information

Business and housing market cycles in the euro area: a multivariate unobserved component approach

Business and housing market cycles in the euro area: a multivariate unobserved component approach Business and housing market cycles in the euro area: a multivariate unobserved component approach Laurent Ferrara (a) and Siem Jan Koopman (b) http://staff.feweb.vu.nl/koopman (a) Banque de France (b)

More information

Hitting with Runners in Scoring Position

Hitting with Runners in Scoring Position Hitting with Runners in Scoring Position Jim Albert Department of Mathematics and Statistics Bowling Green State University November 25, 2001 Abstract Sportscasters typically tell us about the batting

More information

Looking at Spacings to Assess Streakiness

Looking at Spacings to Assess Streakiness Looking at Spacings to Assess Streakiness Jim Albert Department of Mathematics and Statistics Bowling Green State University September 2013 September 2013 1 / 43 The Problem Collect hitting data for all

More information

Minimum Mean-Square Error (MMSE) and Linear MMSE (LMMSE) Estimation

Minimum Mean-Square Error (MMSE) and Linear MMSE (LMMSE) Estimation Minimum Mean-Square Error (MMSE) and Linear MMSE (LMMSE) Estimation Outline: MMSE estimation, Linear MMSE (LMMSE) estimation, Geometric formulation of LMMSE estimation and orthogonality principle. Reading:

More information

PREDICTING the outcomes of sporting events

PREDICTING the outcomes of sporting events CS 229 FINAL PROJECT, AUTUMN 2014 1 Predicting National Basketball Association Winners Jasper Lin, Logan Short, and Vishnu Sundaresan Abstract We used National Basketball Associations box scores from 1991-1998

More information

Game 6 Sox Score 4 in the 4th

Game 6 Sox Score 4 in the 4th Game 6 Sox Score 4 in the 4th Millar doubles and Varitek singles him home. 1-0. Cabrera Singles. Runners at 1 st and 2 nd. Bellhorn hits a fan interference double. Francona appeals. Umps huddle. Its ruled

More information

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together Statistics 111 - Lecture 7 Exploring Data Numerical Summaries for Relationships between Variables Administrative Notes Homework 1 due in recitation: Friday, Feb. 5 Homework 2 now posted on course website:

More information

Advanced Putting Metrics in Golf

Advanced Putting Metrics in Golf Advanced Putting Metrics in Golf Kasra Yousefi and Tim B. Swartz Abstract Using ShotLink data that records information on every stroke taken on the PGA Tour, this paper introduces a new metric to assess

More information

Building an NFL performance metric

Building an NFL performance metric Building an NFL performance metric Seonghyun Paik (spaik1@stanford.edu) December 16, 2016 I. Introduction In current pro sports, many statistical methods are applied to evaluate player s performance and

More information

Chapter 20. Planning Accelerated Life Tests. William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University

Chapter 20. Planning Accelerated Life Tests. William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University Chapter 20 Planning Accelerated Life Tests William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University Copyright 1998-2008 W. Q. Meeker and L. A. Escobar. Based on the authors

More information

Real-Time Electricity Pricing

Real-Time Electricity Pricing Real-Time Electricity Pricing Xi Chen, Jonathan Hosking and Soumyadip Ghosh IBM Watson Research Center / Northwestern University Yorktown Heights, NY, USA X. Chen, J. Hosking & S. Ghosh (IBM) Real-Time

More information

Seattle Mariners (52-45) 3, Los Angeles Angels (58-38) 2 July 19, 2014

Seattle Mariners (52-45) 3, Los Angeles Angels (58-38) 2 July 19, 2014 Seattle Mariners (52-45) 3, Los Angeles Angels (58-38) 2 1 2 3 4 5 6 7 8 9 10 11 12 R H E Seattle 0 0 0 0 0 1 0 0 0 0 0 2 3 9 1 LA Angels 0 0 0 0 0 0 1 0 0 0 0 1 2 5 0 Seattle AVG AB R H 2B 3B HR RBI BB

More information

Running head: DATA ANALYSIS AND INTERPRETATION 1

Running head: DATA ANALYSIS AND INTERPRETATION 1 Running head: DATA ANALYSIS AND INTERPRETATION 1 Data Analysis and Interpretation Final Project Vernon Tilly Jr. University of Central Oklahoma DATA ANALYSIS AND INTERPRETATION 2 Owners of the various

More information

A point-based Bayesian hierarchical model to predict the outcome of tennis matches

A point-based Bayesian hierarchical model to predict the outcome of tennis matches A point-based Bayesian hierarchical model to predict the outcome of tennis matches Martin Ingram, Silverpond September 21, 2017 Introduction Predicting tennis matches is of interest for a number of applications:

More information

The Star Fleet Battles Tournament Report Version 1.0

The Star Fleet Battles Tournament Report Version 1.0 The Star Fleet Battles Tournament Report Version.0 Robert W. Schirmer Battlegroup Baltimore Ellicott City, Maryland USA 5 September 20 Introduction The Star Fleet Battles Tournament Report is intended

More information

arxiv: v2 [stat.ap] 9 Sep 2017

arxiv: v2 [stat.ap] 9 Sep 2017 A Hierarchical Bayesian Model of Pitch Framing Sameer K. Deshpande and Abraham J. Wyner The Wharton School University of Pennsylvania arxiv:1704.00823v2 [stat.ap] 9 Sep 2017 9 September 2017 Abstract Since

More information

Clutch Hitters Revisited Pete Palmer and Dick Cramer National SABR Convention June 30, 2008

Clutch Hitters Revisited Pete Palmer and Dick Cramer National SABR Convention June 30, 2008 Clutch Hitters Revisited Pete Palmer and Dick Cramer National SABR Convention June 30, 2008 Do clutch hitters exist? More precisely, are there any batters whose performance in critical game situations

More information

Richard S. Marken The RAND Corporation

Richard S. Marken The RAND Corporation Fielder s Choice: A Unified Theory of Catching Fly Balls Richard S. Marken The RAND Corporation February 11, 2002 Mailing Address: 1700 Main Street Santa Monica, CA 90407-2138 Phone: 310 393-0411 ext 7971

More information

(Under the Direction of Cheolwoo Park) ABSTRACT. Major League Baseball is a sport complete with a multitude of statistics to evaluate a player s

(Under the Direction of Cheolwoo Park) ABSTRACT. Major League Baseball is a sport complete with a multitude of statistics to evaluate a player s PENALIZED REGRESSION MODELS FOR MAJOR LEAGUE BASEBALL METRICS by MUSHIMIE LONA PANDA (Under the Direction of Cheolwoo Park) ABSTRACT Major League Baseball is a sport complete with a multitude of statistics

More information

Evaluating NBA Shooting Ability using Shot Location

Evaluating NBA Shooting Ability using Shot Location Evaluating NBA Shooting Ability using Shot Location Dennis Lock December 16, 2013 There are many statistics that evaluate the performance of NBA players, including some that attempt to measure a players

More information

Lesson 2 - Pre-Visit A World of Baseball Talent

Lesson 2 - Pre-Visit A World of Baseball Talent Lesson 2 - Pre-Visit A World of Baseball Talent Objective: Students will be able to: Identify that players from all over the world come to the United States to play professional baseball. Research and

More information

Journal of Quantitative Analysis in Sports

Journal of Quantitative Analysis in Sports Journal of Quantitative Analysis in Sports Volume 4, Issue 1 2008 Article 3 Streaky Hitting in Baseball Jim Albert Bowling Green State University, albert@bgnet.bgsu.edu Copyright c 2008 The Berkeley Electronic

More information

SPATIAL STATISTICS A SPATIAL ANALYSIS AND COMPARISON OF NBA PLAYERS. Introduction

SPATIAL STATISTICS A SPATIAL ANALYSIS AND COMPARISON OF NBA PLAYERS. Introduction A SPATIAL ANALYSIS AND COMPARISON OF NBA PLAYERS KELLIN RUMSEY Introduction The 2016 National Basketball Association championship featured two of the leagues biggest names. The Golden State Warriors Stephen

More information

Random Coefficient Models for Time-Series Cross-Section Data: Monte Carlo Experiments

Random Coefficient Models for Time-Series Cross-Section Data: Monte Carlo Experiments Random Coefficient Models for Time-Series Cross-Section Data: Monte Carlo Experiments Nathaniel Beck Jonathan N. Katz Draft of April 16, 2006 We gratefully acknowledge the financial support of the National

More information

BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG

BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG GOAL OF PROJECT The goal is to predict the winners between college men s basketball teams competing in the 2018 (NCAA) s March

More information

Simulating Major League Baseball Games

Simulating Major League Baseball Games ABSTRACT Paper 2875-2018 Simulating Major League Baseball Games Justin Long, Slippery Rock University; Brad Schweitzer, Slippery Rock University; Christy Crute Ph.D, Slippery Rock University The game of

More information

COACH WORKBOOK. Contexts: Module: Outfield Play. Competition Development. Coaching Association of Canada, 2008 Baseball Canada 20011

COACH WORKBOOK. Contexts: Module: Outfield Play. Competition Development. Coaching Association of Canada, 2008 Baseball Canada 20011 COACH WORKBOOK Contexts: Competition Development Module: Outfield Play Coaching Association of Canada, 2008 Baseball Canada 20011 1 2 Unless otherwise indicated, images and pictures contained in this document

More information

arxiv: v1 [stat.ap] 23 Nov 2009

arxiv: v1 [stat.ap] 23 Nov 2009 A Bayesian Variable Selection Approach to Major League Baseball Hitting Metrics arxiv:0911.4503v1 [stat.ap] 23 Nov 2009 Blakeley B. McShane, Alexander Braunstein, James Piette, and Shane T. Jensen Department

More information

Pairwise Comparison Models: A Two-Tiered Approach to Predicting Wins and Losses for NBA Games

Pairwise Comparison Models: A Two-Tiered Approach to Predicting Wins and Losses for NBA Games Pairwise Comparison Models: A Two-Tiered Approach to Predicting Wins and Losses for NBA Games Tony Liu Introduction The broad aim of this project is to use the Bradley Terry pairwise comparison model as

More information

McFarlane Sports Picks MLB, NASCAR, NBA, NFL, & NHL

McFarlane Sports Picks MLB, NASCAR, NBA, NFL, & NHL Full Name: Phone: E-mail: Guidelines for Selling Your Last Updated: April 14, 2017 Address: Delivery W730 State Road 35 Address: Fountain City, WI 54629 Tel: 608.687.7572 ext: 3 How did you hear about

More information

Estimating an NBA Player s Impact on is Team s Chances of Winning

Estimating an NBA Player s Impact on is Team s Chances of Winning University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 6-216 Estimating an NBA Player s Impact on is Team s Chances of Winning Sameer K. Deshpande University of Pennsylvania

More information

A Novel Approach to Predicting the Results of NBA Matches

A Novel Approach to Predicting the Results of NBA Matches A Novel Approach to Predicting the Results of NBA Matches Omid Aryan Stanford University aryano@stanford.edu Ali Reza Sharafat Stanford University sharafat@stanford.edu Abstract The current paper presents

More information

EDGAR MARTINEZ: HALL OF FAME CANDIDATE

EDGAR MARTINEZ: HALL OF FAME CANDIDATE EDGAR MARTINEZ: HALL OF FAME CANDIDATE At the end of the 2004 baseball season, Edgar Martinez announced his retirement after 18 years in the Major Leagues, all with one team - the Seattle Mariners. One

More information

Discussion: Illusions of Sparsity by Giorgio Primiceri

Discussion: Illusions of Sparsity by Giorgio Primiceri Discussion: Illusions of Sparsity by Giorgio Primiceri Pablo Guerron-Quintana Boston College June, 2018 Pablo Guerron-Quintana Discussion 1 / 17 Executive Summary Motivation: Is regular coke (dense model)

More information

A Combined Recruitment Index for Demersal Juvenile Cod in NAFO Divisions 3K and 3L

A Combined Recruitment Index for Demersal Juvenile Cod in NAFO Divisions 3K and 3L NAFO Sci. Coun. Studies, 29: 23 29 A Combined Recruitment Index for Demersal Juvenile Cod in NAFO Divisions 3K and 3L David C. Schneider Ocean Sciences Centre, Memorial University St. John's, Newfoundland,

More information

SCIENTIFIC COMMITTEE SEVENTH REGULAR SESSION August 2011 Pohnpei, Federated States of Micronesia

SCIENTIFIC COMMITTEE SEVENTH REGULAR SESSION August 2011 Pohnpei, Federated States of Micronesia SCIENTIFIC COMMITTEE SEVENTH REGULAR SESSION 9-17 August 2011 Pohnpei, Federated States of Micronesia CPUE of skipjack for the Japanese offshore pole and line using GPS and catch data WCPFC-SC7-2011/SA-WP-09

More information

COMPLETING THE RESULTS OF THE 2013 BOSTON MARATHON

COMPLETING THE RESULTS OF THE 2013 BOSTON MARATHON COMPLETING THE RESULTS OF THE 2013 BOSTON MARATHON Dorit Hammerling 1, Matthew Cefalu 2, Jessi Cisewski 3, Francesca Dominici 2, Giovanni Parmigiani 2,4, Charles Paulson 5, Richard Smith 1,6 1 Statistical

More information

Finding your feet: modelling the batting abilities of cricketers using Gaussian processes

Finding your feet: modelling the batting abilities of cricketers using Gaussian processes Finding your feet: modelling the batting abilities of cricketers using Gaussian processes Oliver Stevenson & Brendon Brewer PhD candidate, Department of Statistics, University of Auckland o.stevenson@auckland.ac.nz

More information

intended velocity ( u k arm movements

intended velocity ( u k arm movements Fig. A Complete Brain-Machine Interface B Human Subjects Closed-Loop Simulator ensemble action potentials (n k ) ensemble action potentials (n k ) primary motor cortex simulated primary motor cortex neuroprosthetic

More information

Seattle Mariners (15-15) 4, Oakland Athletics (19-13) 2 May 5, 2014

Seattle Mariners (15-15) 4, Oakland Athletics (19-13) 2 May 5, 2014 Seattle Mariners (15-15) 4, Oakland Athletics (19-13) 2 1 2 3 4 5 6 7 8 9 R H E Seattle 2 0 0 0 1 1 0 0 0 4 9 0 Oakland 0 0 0 2 0 0 0 0 0 2 4 0 Seattle AVG AB R H 2B 3B HR RBI BB SO PO A Saunders, M, CF-LF-RF.274

More information

The probability of winning a high school football game.

The probability of winning a high school football game. Columbus State University CSU epress Faculty Bibliography 2008 The probability of winning a high school football game. Jennifer Brown Follow this and additional works at: http://csuepress.columbusstate.edu/bibliography_faculty

More information

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

Random Coefficient Models for Time-Series Cross-Section Data: Monte Carlo Experiments of Finite Sample Properties

Random Coefficient Models for Time-Series Cross-Section Data: Monte Carlo Experiments of Finite Sample Properties Random Coefficient Models for Time-Series Cross-Section Data: Monte Carlo Experiments of Finite Sample Properties Nathaniel Beck Jonathan N. Katz Draft of April 1, 2006 We gratefully acknowledge the financial

More information

1. OVERVIEW OF METHOD

1. OVERVIEW OF METHOD 1. OVERVIEW OF METHOD The method used to compute tennis rankings for Iowa girls high school tennis http://ighs-tennis.com/ is based on the Elo rating system (section 1.1) as adopted by the World Chess

More information

Recovery of sea level fields of the last decades from altimetry and tide gauge data

Recovery of sea level fields of the last decades from altimetry and tide gauge data Recovery of sea level fields of the last decades from altimetry and tide gauge data Francisco M. Calafat, Damià Gomis, Ananda Pascual, Marta Marcos and Simón Ruiz Mediterranean Institute for Adavanced

More information

Robust specification testing in regression: the FRESET test and autocorrelated disturbances

Robust specification testing in regression: the FRESET test and autocorrelated disturbances Robust specification testing in regression: the FRESET test and autocorrelated disturbances Linda F. DeBenedictis and David E. A. Giles * Policy and Research Division, Ministry of Human Resources, 614

More information

Safety Assessment of Installing Traffic Signals at High-Speed Expressway Intersections

Safety Assessment of Installing Traffic Signals at High-Speed Expressway Intersections Safety Assessment of Installing Traffic Signals at High-Speed Expressway Intersections Todd Knox Center for Transportation Research and Education Iowa State University 2901 South Loop Drive, Suite 3100

More information

Predicting the Draft and Career Success of Tight Ends in the National Football League

Predicting the Draft and Career Success of Tight Ends in the National Football League University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 10-2014 Predicting the Draft and Career Success of Tight Ends in the National Football League Jason Mulholland University

More information

Equation 1: F spring = kx. Where F is the force of the spring, k is the spring constant and x is the displacement of the spring. Equation 2: F = mg

Equation 1: F spring = kx. Where F is the force of the spring, k is the spring constant and x is the displacement of the spring. Equation 2: F = mg 1 Introduction Relationship between Spring Constant and Length of Bungee Cord In this experiment, we aimed to model the behavior of the bungee cord that will be used in the Bungee Challenge. Specifically,

More information

Name May 3, 2007 Math Probability and Statistics

Name May 3, 2007 Math Probability and Statistics Name May 3, 2007 Math 341 - Probability and Statistics Long Exam IV Instructions: Please include all relevant work to get full credit. Encircle your final answers. 1. An article in Professional Geographer

More information

Mixture Models & EM. Nicholas Ruozzi University of Texas at Dallas. based on the slides of Vibhav Gogate

Mixture Models & EM. Nicholas Ruozzi University of Texas at Dallas. based on the slides of Vibhav Gogate Mixture Models & EM Nicholas Ruozzi University of Texas at Dallas based on the slides of Vibhav Gogate Previously We looked at -means and hierarchical clustering as mechanisms for unsupervised learning

More information

Calculation of Trail Usage from Counter Data

Calculation of Trail Usage from Counter Data 1. Introduction 1 Calculation of Trail Usage from Counter Data 1/17/17 Stephen Martin, Ph.D. Automatic counters are used on trails to measure how many people are using the trail. A fundamental question

More information

Decentralized Autonomous Control of a Myriapod Locomotion Robot

Decentralized Autonomous Control of a Myriapod Locomotion Robot Decentralized utonomous Control of a Myriapod Locomotion Robot hmet Onat Sabanci University, Turkey onat@sabanciuniv.edu Kazuo Tsuchiya Kyoto University, Japan tsuchiya@kuaero.kyoto-u.ac.jp Katsuyoshi

More information

Chapter 12 Practice Test

Chapter 12 Practice Test Chapter 12 Practice Test 1. Which of the following is not one of the conditions that must be satisfied in order to perform inference about the slope of a least-squares regression line? (a) For each value

More information

23 RD INTERNATIONAL SYMPOSIUM ON BALLISTICS TARRAGONA, SPAIN APRIL 2007

23 RD INTERNATIONAL SYMPOSIUM ON BALLISTICS TARRAGONA, SPAIN APRIL 2007 23 RD INTERNATIONAL SYMPOSIUM ON BALLISTICS TARRAGONA, SPAIN 16-20 APRIL 2007 AN INVESTIGATION INTO THE INTERRELATION BETWEEN THE INTERNAL AND EXTERNAL BALLISTICS OF FIRING A TP-T TANK AMMUNITION M. H.

More information

Predicting the use of the sacrifice bunt in Major League Baseball BUDT 714 May 10, 2007

Predicting the use of the sacrifice bunt in Major League Baseball BUDT 714 May 10, 2007 Predicting the use of the sacrifice bunt in Major League Baseball BUDT 714 May 10, 2007 Group 6 Charles Gallagher Brian Gilbert Neelay Mehta Chao Rao Executive Summary Background When a runner is on-base

More information

Nonlife Actuarial Models. Chapter 7 Bühlmann Credibility

Nonlife Actuarial Models. Chapter 7 Bühlmann Credibility Nonlife Actuarial Models Chapter 7 Bühlmann Credibility Learning Objectives 1. Basic framework of Bühlmann credibility 2. Variance decomposition 3. Expected value of the process variance 4. Variance of

More information

Process Dynamics, Operations, and Control Lecture Notes - 20

Process Dynamics, Operations, and Control Lecture Notes - 20 Lesson 0. Control valves 0.0 Context Controller output is a signal that varies between 0 and 100%. Putting this signal to use requires a final control element, a device that responds to the controller

More information

Which On-Base Percentage Shows. the Highest True Ability of a. Baseball Player?

Which On-Base Percentage Shows. the Highest True Ability of a. Baseball Player? Which On-Base Percentage Shows the Highest True Ability of a Baseball Player? January 31, 2018 Abstract This paper looks at the true on-base ability of a baseball player given their on-base percentage.

More information

Legendre et al Appendices and Supplements, p. 1

Legendre et al Appendices and Supplements, p. 1 Legendre et al. 2010 Appendices and Supplements, p. 1 Appendices and Supplement to: Legendre, P., M. De Cáceres, and D. Borcard. 2010. Community surveys through space and time: testing the space-time interaction

More information

Process Simulator Evaluates Blower and Valve Control Strategies for WWTP Aeration

Process Simulator Evaluates Blower and Valve Control Strategies for WWTP Aeration Process Simulator Evaluates Blower and Valve Control Strategies for WWTP Aeration inshare By Steve Kestel and Tilo Stahl (BioChem Technology) and Matthew Gray (Keystone Engineering Group) Introduction

More information

Instant Replay: Investigating statistical analysis in sports.

Instant Replay: Investigating statistical analysis in sports. Instant Replay: Investigating statistical analysis in sports. arxiv:1102.5549v4 [stat.ap] 11 Oct 2011 Gagan Sidhu, Department of Computing Science, University of Alberta, Edmonton, Canada November 5, 2018

More information

Section I: Multiple Choice Select the best answer for each problem.

Section I: Multiple Choice Select the best answer for each problem. Inference for Linear Regression Review Section I: Multiple Choice Select the best answer for each problem. 1. Which of the following is NOT one of the conditions that must be satisfied in order to perform

More information

SAP Predictive Analysis and the MLB Post Season

SAP Predictive Analysis and the MLB Post Season SAP Predictive Analysis and the MLB Post Season Since September is drawing to a close and October is rapidly approaching, I decided to hunt down some baseball data and see if we can draw any insights on

More information

Problem 1 Yankees VS. Mets Comparing distributions without graphing

Problem 1 Yankees VS. Mets Comparing distributions without graphing Yankees vs. Mets Class Problem 1 Yankees VS. Mets Comparing distributions without graphing Below are two data sets that represent the 2008 salaries for the New York Yankees and the New York Mets, the two

More information

Development of Decision Support Tools to Assess Pedestrian and Bicycle Safety: Development of Safety Performance Function

Development of Decision Support Tools to Assess Pedestrian and Bicycle Safety: Development of Safety Performance Function Development of Decision Support Tools to Assess Pedestrian and Bicycle Safety: Development of Safety Performance Function Valerian Kwigizile, Jun Oh, Ron Van Houten, & Keneth Kwayu INTRODUCTION 2 OVERVIEW

More information

Author s Name Name of the Paper Session. Positioning Committee. Marine Technology Society. DYNAMIC POSITIONING CONFERENCE September 18-19, 2001

Author s Name Name of the Paper Session. Positioning Committee. Marine Technology Society. DYNAMIC POSITIONING CONFERENCE September 18-19, 2001 Author s Name Name of the Paper Session PDynamic Positioning Committee Marine Technology Society DYNAMIC POSITIONING CONFERENCE September 18-19, 2001 POWER PLANT SESSION A New Concept for Fuel Tight DP

More information

Estimating breeding value

Estimating breeding value Estimating breeding value Key terms Estimated breeding value (EBV) Heritability Contemporary groups Using information from relatives Accuracy Which animal to select? Animal P1 P2 EBV sire's perform. Index

More information

GLMM standardisation of the commercial abalone CPUE for Zones A-D over the period

GLMM standardisation of the commercial abalone CPUE for Zones A-D over the period GLMM standardisation of the commercial abalone for Zones A-D over the period 1980 2015 Anabela Brandão and Doug S. Butterworth Marine Resource Assessment & Management Group (MARAM) Department of Mathematics

More information

Has the NFL s Rooney Rule Efforts Leveled the Field for African American Head Coach Candidates?

Has the NFL s Rooney Rule Efforts Leveled the Field for African American Head Coach Candidates? University of Pennsylvania ScholarlyCommons PSC Working Paper Series 7-15-2010 Has the NFL s Rooney Rule Efforts Leveled the Field for African American Head Coach Candidates? Janice F. Madden University

More information

LESSON PLAN. Name of Activity. Evaluation. Time. Points. Tryout Plan #3 0:00 0:40 1:10. 1:40 Team Tactical Game

LESSON PLAN. Name of Activity. Evaluation. Time. Points. Tryout Plan #3 0:00 0:40 1:10. 1:40 Team Tactical Game Tryout Plan #3 Time 0:00 Name of Activity LESSON PLAN Description Key Teaching Points Warm - Up Cornerstone Dynamic Warm-Up* 1.) Movement Prep 2.) Improve Agility 3.) Rotational Explosive Movement 0:15

More information

Bayesian model averaging with change points to assess the impact of vaccination and public health interventions

Bayesian model averaging with change points to assess the impact of vaccination and public health interventions Bayesian model averaging with change points to assess the impact of vaccination and public health interventions SUPPLEMENTARY METHODS Data sources U.S. hospitalization data were obtained from the Healthcare

More information

Winter Steelhead Redd to Fish conversions, Spawning Ground Survey Data

Winter Steelhead Redd to Fish conversions, Spawning Ground Survey Data Winter Steelhead Redd to Fish conversions, Spawning Ground Survey Data Oregon Department of Fish and Wildlife (ODFW); Corvallis Research Office Oregon Adult Salmonid Inventory and Sampling Project (June

More information

Lesson 2 - Pre-Visit A World of Baseball Talent

Lesson 2 - Pre-Visit A World of Baseball Talent Lesson 2 - Pre-Visit A World of Baseball Talent Objective: Students will be able to: Identify that players from a variety of cultures come to the United States to play professional baseball. Recognize

More information

Determination of the Design Load for Structural Safety Assessment against Gas Explosion in Offshore Topside

Determination of the Design Load for Structural Safety Assessment against Gas Explosion in Offshore Topside Determination of the Design Load for Structural Safety Assessment against Gas Explosion in Offshore Topside Migyeong Kim a, Gyusung Kim a, *, Jongjin Jung a and Wooseung Sim a a Advanced Technology Institute,

More information

Announcements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions

Announcements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions Announcements Announcements Lecture 19: Inference for SLR & Statistics 101 Mine Çetinkaya-Rundel April 3, 2012 HW 7 due Thursday. Correlation guessing game - ends on April 12 at noon. Winner will be announced

More information

Journal of Quantitative Analysis in Sports Manuscript 1039

Journal of Quantitative Analysis in Sports Manuscript 1039 An Article Submitted to Journal of Quantitative Analysis in Sports Manuscript 1039 A Simple and Flexible Rating Method for Predicting Success in the NCAA Basketball Tournament Brady T. West University

More information

arxiv: v1 [stat.ap] 18 Oct 2018

arxiv: v1 [stat.ap] 18 Oct 2018 Augmenting Adjusted Plus-Minus in Soccer with FIFA Ratings Francesca Matano *, Lee F. Richardson *, Taylor Pospisil *, Collin Eubanks *,**, and Jining Qin * arxiv:1810.08032v1 [stat.ap] 18 Oct 2018 * Department

More information

OREGON DEPARTMENT OF FISH AND WILDLIFE SUMMARY OF COUGAR POPULATION MODEL AND EFFECTS OF LETHAL CONTROL

OREGON DEPARTMENT OF FISH AND WILDLIFE SUMMARY OF COUGAR POPULATION MODEL AND EFFECTS OF LETHAL CONTROL OREGON DEPARTMENT OF FISH AND WILDLIFE SUMMARY OF COUGAR POPULATION MODEL ODFW is authorized to reduce human-cougar conflict, livestock depredation, and benefit native ungulate populations through the

More information

IMO REVISION OF THE INTACT STABILITY CODE. Proposal of methodology of direct assessment for stability under dead ship condition. Submitted by Japan

IMO REVISION OF THE INTACT STABILITY CODE. Proposal of methodology of direct assessment for stability under dead ship condition. Submitted by Japan INTERNATIONAL MARITIME ORGANIZATION E IMO SUB-COMMITTEE ON STABILITY AND LOAD LINES AND ON FISHING VESSELS SAFETY 49th session Agenda item 5 SLF 49/5/5 19 May 2006 Original: ENGLISH REVISION OF THE INTACT

More information

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question:

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question: Data Set 7: Bioerosion by Parrotfish Background Bioerosion of coral reefs results from animals taking bites out of the calcium-carbonate skeleton of the reef. Parrotfishes are major bioerosion agents,

More information

Reconciling disparate 20th Century Indo-Pacific ocean temperature trends in the instrumental record and in CMIP5

Reconciling disparate 20th Century Indo-Pacific ocean temperature trends in the instrumental record and in CMIP5 Reconciling disparate 20th Century Indo-Pacific ocean temperature trends in the instrumental record and in CMIP5 Matt Newman and Amy Solomon CIRES/CDC, University of Colorado and NOAA/ESRL/PSD Solomon,

More information

Machine Learning an American Pastime

Machine Learning an American Pastime Nikhil Bhargava, Andy Fang, Peter Tseng CS 229 Paper Machine Learning an American Pastime I. Introduction Baseball has been a popular American sport that has steadily gained worldwide appreciation in the

More information

Trouble With The Curve: Improving MLB Pitch Classification

Trouble With The Curve: Improving MLB Pitch Classification Trouble With The Curve: Improving MLB Pitch Classification Michael A. Pane Samuel L. Ventura Rebecca C. Steorts A.C. Thomas arxiv:134.1756v1 [stat.ap] 5 Apr 213 April 8, 213 Abstract The PITCHf/x database

More information

E. Agu, M. Kasperski Ruhr-University Bochum Department of Civil and Environmental Engineering Sciences

E. Agu, M. Kasperski Ruhr-University Bochum Department of Civil and Environmental Engineering Sciences EACWE 5 Florence, Italy 19 th 23 rd July 29 Flying Sphere image Museo Ideale L. Da Vinci Chasing gust fronts - wind measurements at the airport Munich, Germany E. Agu, M. Kasperski Ruhr-University Bochum

More information