The probability of winning a high school football game.

Similar documents
The factors affecting team performance in the NFL: does off-field conduct matter? Abstract

NCAA Football Statistics Summary

The Drivers of Success in the NFL: Differences in Factors Affecting the Probability of Winning Based on First Half Performance

Predicting the Total Number of Points Scored in NFL Games

Modeling Fantasy Football Quarterbacks

save percentages? (Name) (University)

Evaluating The Best. Exploring the Relationship between Tom Brady s True and Observed Talent

Building an NFL performance metric

SPORTSCIENCE sportsci.org

Coach s Clipboard

An Analysis of Factors Contributing to Wins in the National Hockey League

ANALYSIS OF SIGNIFICANT FACTORS IN DIVISION I MEN S COLLEGE BASKETBALL AND DEVELOPMENT OF A PREDICTIVE MODEL

NBA TEAM SYNERGY RESEARCH REPORT 1

Friday, October 14, 2011 (07:02 PM) League game at Highland

Has the NFL s Rooney Rule Efforts Leveled the Field for African American Head Coach Candidates?

Check here if you're new to football, having a difficult time following the games or if you just need to look up some terms.

Scoring summary Qtr Time Scoring play And HHS Andrean 1 8:17 #38 10 yd run (#32 kick) 7 0 Andrean 2 11:50 #3 33 yd pass to #82 (#32 kick) 14 0

DISMAS Evaluation: Dr. Elizabeth C. McMullan. Grambling State University

An examination of try scoring in rugby union: a review of international rugby statistics.

Volleyball. Volleyball

yellow orange Game play

Running head: DATA ANALYSIS AND INTERPRETATION 1

Driv e accu racy. Green s in regul ation

Using Actual Betting Percentages to Analyze Sportsbook Behavior: The Canadian and Arena Football Leagues

CAPSULIZED RULES FOR YOUTH FLAG FOOTBALL LEAGUE January 1, 2017

Quick Start Guide. Quick Start Guide

2010 Football Statistician's Manual

Journal of Human Sport and Exercise E-ISSN: Universidad de Alicante España

GREY BOARD. Coach s Clipboard

360,000 student-athletes 23 sports at

A Novel Approach to Predicting the Results of NBA Matches

Determinants of college hockey attendance

Football Statisticians Manual

University of Nevada, Reno. The Effects of Changes in Major League Baseball Playoff Format: End of Season Attendance

The Rise in Infield Hits

GHS 2 TOL TOL 2 MCHS. GHS Leaders. 4th Quarter. MCHS Leaders. 1st down and 10 to go from the +39 yard line CURRENT DRIVE: 1 PLAYS YARDS.

2018 FOOTBALL STATISTICIANS MANUAL

Department of Economics Working Paper

HUNTSVILLE FOOTBALL LEAGUE SPRING 2017 RULES

Is lung capacity affected by smoking, sport, height or gender. Table of contents

4th Down Expected Points

FLAG FOOTBALL PACKET # 31 INSTRUCTIONS INTRODUCTION HISTORY OF THE GAME

4-3 Rate of Change and Slope. Warm Up. 1. Find the x- and y-intercepts of 2x 5y = 20. Describe the correlation shown by the scatter plot. 2.

c. Smoking/Vaping: The KFFSC Draft will be a non-smoking & non-vaping event.

THE YANKEES EFFECT: THE IMPACT OF INTERLEAGUE PLAY AND THE UNBALANCED SCHEDULE ON MAJOR LEAGUE BASEBALL ATTENDANCE

This page intentionally left blank

How to Win in the NBA Playoffs: A Statistical Analysis

NCAA 265-8/04 FM 04

AN ANALYSIS OF TEAM STATISTICS IN AUSTRALIAN RULES FOOTBALL. Andrew Patterson and Stephen R. Clarke 1. Abstract 1. INTRODUCTION

Navigate to the golf data folder and make it your working directory. Load the data by typing

Fantasy Football 2018 Rules

Referee Training Illegal Rush: Test Your Knowledge

Equation 1: F spring = kx. Where F is the force of the spring, k is the spring constant and x is the displacement of the spring. Equation 2: F = mg

ENDTHEZONE RULES NO FIGHTING PLAYERS EJECTED AND ANY GAMES DECIDED ARE AT THE DISCRETION OF THE REFEREES AND/OR LEAGUE OFFICIALS.

Pierce 0. Measuring How NBA Players Were Paid in the Season Based on Previous Season Play

2017 NWRAA Flag Football Rules

Predicting the Draft and Career Success of Tight Ends in the National Football League

2017 PFF RUSHING REPORT

PREDICTING the outcomes of sporting events

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together

Guide for Statisticians

The relationship between payroll and performance disparity in major league baseball: an alternative measure. Abstract

CSAL Flag Football Rules 2018 (updated 9/4/18)

Investigation of Winning Factors of Miami Heat in NBA Playoff Season

Jr Skyland Football Conference (JSFC) 2014 SKYLAND Division Football Rules For Varsity & JV Version , June 1

INTRODUCTION Page 2 SETTINGS

Team Records. Single Game Offense

Should bonus points be included in the Six Nations Championship?

The API Score discussed earlier is the variable that was explained. This measures the

Fit to Be Tied: The Incentive Effects of Overtime Rules in Professional Hockey

LIVE BETTING ULTRA RULES

GAME SETUP: There are two teams orange and grey. Each team has 12 offense and 12 defense play cards.

Attack-Tempo and Attack-Type as predictors of attack point made by opposite players in Volleyball

An Analysis of the Effects of Long-Term Contracts on Performance in Major League Baseball

Rules of Play REDZONE! I. DESCRIPTION OF THE GAME: II. EQUIPMENT INCLUDED: III. PLAY OF THE GAME: IV. HOW TO USE THE GAME EQUIPMENT: 1.

CORRELATION BETWEEN THE RESULT EFFICIENCY INDICATORS OF SUCCESS IN TABLE TENNIS

HOSTING DISTRICT/PLAYOFF GAMES:

STYFL Flag Football Rules

Stats 2002: Probabilities for Wins and Losses of Online Gambling

Journal of Quantitative Analysis in Sports Manuscript 1039

GATEWAY FOOTBALL LEAGUE 2011 LOCAL RULES

Pike Youth Football League Game and Operation Rules

HOSTING DISTRICT/PLAYOFF GAMES:

Flag Football Rules. Section I

CAASC Flag Football GUIDELINES & RULES

OK Youth Football League Rules

Department of Economics Working Paper Series

Game Format. High School 7 v 7 Game Format ONE HAND TOUCH (OPTIONAL 7th-8th Grade Division only but will not qualify for the National Tournament.

Does Gun Control Reduce Criminal Violence? An Econometric Evaluation of Canadian Firearm Laws.

Marist Jr. War Eagles. League Rules

Estimating the Probability of Winning an NFL Game Using Random Forests

RULES & REGULATIONS THE BASICS

Rules of Play REDZONE! I. DESCRIPTION OF THE GAME: III. PLAY OF THE GAME: IV. HOW TO USE THE GAME EQUIPMENT: 1. Game Dice:

Analysis of the Impacts of Transit Signal Priority on Bus Bunching and Performance

General: Flag Football Rules *Revised 7/20/16

Revisiting the Hot Hand Theory with Free Throw Data in a Multivariate Framework

!!!!!!!!!!!!! One Score All or All Score One? Points Scored Inequality and NBA Team Performance Patryk Perkowski Fall 2013 Professor Ryan Edwards!

Prokopios Chatzakis, National and Kapodistrian University of Athens, Faculty of Physical Education and Sport Science 1

2013 Central KY Wilderness Trace Youth Football Conference Rules 1. Members: 2. Board of Directors: 3. Rule Changes:

2017 Norman Optimist 7 v 7 PASSING LEAGUE RULES

Transcription:

Columbus State University CSU epress Faculty Bibliography 2008 The probability of winning a high school football game. Jennifer Brown Follow this and additional works at: http://csuepress.columbusstate.edu/bibliography_faculty Part of the Mathematics Commons Recommended Citation Brown, Jennifer, "The probability of winning a high school football game." (2008). Faculty Bibliography. 935. http://csuepress.columbusstate.edu/bibliography_faculty/935 This Presentation is brought to you for free and open access by CSU epress. It has been accepted for inclusion in Faculty Bibliography by an authorized administrator of CSU epress.

Running Head: THE PROBABILITY OF WINNING The Probability of Winning a High School Football Game Jennifer L. Bell Auburn University This paper, which is part of a series, was intended to demonstrate the application of nonparametric statistics while appealing to the general interest in football. Non-parametric Statistics Workshop EERA 2008 (Hilton Head, SC) J. Bell 1

The Probability of Winning a High School Football Game After a weekend filled with football games, the media presents a wide range of statistics to explain the game s ultimate result, win or lose. During a high school, college, or professional football game, the sports broadcasters will discuss probabilities, frequencies, and overall season statistics regarding offensive or defensive plays (e.g., third down and four, field goal from fifty yards, or goal line stand by the defense). Even though the general game format is the same for high school, college, and professional football, rules and regulations differ depending on the level, conference, and division. Wagner (1985) wanted to explain the margin of victory by the significant statistics according to three multiple regression models and eliminate inferior or insignificant statistics that do not affect the margin of victory or were highly correlated to the other standard statistics. The participants were 90 Top 20 Division 1 schools and 98 professional games, who were randomly selected, from the 1985 season. The author hypothesized that the margin of victory can be explained based on these nine independent variables: (a) difference in the number of first downs; (b) difference in rushing yardage; (c) difference in passing yardage; (d) difference in return yardage; (e) difference in penalty yardage; (f) difference in the number of turnovers; (g) difference in the number of quarterback sacks; (h) difference in the time of possession; and (i) home field designation. For college football games, Wagner (1985) found passing and rushing yards increased the margin of victory. Moreover, rushing yards increased the margin of victory more than passing yards, but turnovers decreased the margin of victory. His results revealed that time of possession, number of first downs, penalty yards, and home field advantage were not significant. Actually,

time of possession had a negative effect on the margin of victory. Comparable to the college results, the results for the professional football games suggested that rushing yards had more of an effect on the margin of victory than passing yards. Two differences existed between the college and professional games: number of first downs and quarterback sacks. These variables were significant for the professional games but not for the college games. According to Wagner, the margin of victory was more easily explained for college games than professional games (R 2 for the college model was 72.35%, and R 2 for the professional model was 56.87%). Similar to the Wagner (1985) study, Willoughby (2002) examined 191 Canadian Football League (CFL) games between 1989 and 1995 using logistic regression. He predicted the outcome of the CFL game by the differences in rushing yardage, passing yardage, interceptions, fumbles recoveries, and quarterback sacks. The results suggested that the better teams in the CFL were more likely to win the game based on rushing yardage, passing yardage, and the number of interceptions, which was parallel to the findings of Wagner. By using the logistic regression model, the average prediction probability of the model was 85%, which exceeded the R 2 values in the Wagner study. Research Question In the Southeast, on any given Friday night during the months of August through November, the local high school s stadium will be filled with football fans of all ages. Often, the high stakes involved with these high school games are overlooked compared to college and professional games. There have been a number of studies (Wagner, 1985; Willoughby, 2002) about predicting the outcome of professional and college football games. Are the same variables used to determine the probability of winning a game with college and professional teams applicable to high school football teams?

Methods Participants The participants for this study were four Division 5-A high school football teams from a southeastern state. They were selected from a total population of 69 high schools. Using the school s enrollment from the previous school year, the state s high school association determined division classification. Division 5-A included the top 20% of schools based on large student populations and membership in their organization. These schools are divided into eight different geographic regions. Participants were selected based on regional designation and availability of game box statistics. A total of 46 individual game box statistics were collected from the 2006 regular and postseason seasons with 31 overall wins (67.4%) and 15 overall losses (32.6%). If 1 of the 4 participants played another team within the same set of selected participants, that game was entered for the winner of the football game and was not entered for the loser of the game. Measures During the football games, a certified teacher from the high school or a member of the booster club stood on the sidelines and recorded the results of each offensive and defensive play. After the game, the team s statistician analyzed the recorded yardage and attempts. The amount of information available for each team varied depending on the team statistician s preferred game box statistics.

Procedures The individual game box statistics were collected by the researcher from the team s webmaster or head coach. The predictor variables includes difference between the team s and the opponent s (a) total passing yards; (b) rushing yards; (c) penalty yards; (d) fumbles; and (e) first downs. Results Predictors of Win/Lose Descriptives. Descriptives for each predictor variable were assessed by the Win/Lose dependent variable. For the entire sample, the difference in total passing yards ranged from -199 to 89 with a mean of -37.83 and a standard deviation of 74.64. The mean score for the difference in total rushing yards was 86.91 with a standard deviation of 134.05. The range of scores was from -171 to 457. The range for the difference in total penalty yards was -54 to 62 with a mean of -0.20 and a standard deviation of 26.40. For the difference in the number of fumbles, the mean was -0.54 with a standard deviation of 1.59, and the values ranged from -6 to 4. The difference in the number of first downs had a range of -12 to 17 and a mean of 2.39 with a standard deviation of 7.25. For the five predictors, an independent sample t test was conducted. Table 1 displays the means and standard deviations for each predictor by Win/Lose in addition to the t test results.

Table 1 Means and Standard Deviations for each Predictor by Win/Lose Win Lose Predictor M SD M SD t rushing yards 126.06 124.88 6.00 118.02 3.11** passing yards -40.26 72.23-32.80 81.80-0.31 penalty yards 0.77 26.66-2.20 26.68 0.36 number of fumbles -0.74 1.61-0.13 1.51-1.23 number of first downs 3.87 6.58-0.67 7.83 2.06* Note. *p <.05. **p <.01. To determine the relationship between the predictor variable, Win/Lose, and the five predicting variables, bivariate correlations were conducted. The difference in total rushing yards and the difference in the number of first downs were significantly and positively correlated (r =.73; p <.001). The negative relationship between the difference in total rushing yards and the difference in total passing yards was significant (r = -.50; p <.001), meaning as the difference increased for rushing yards the difference decreased for passing yards. In addition, the difference in total rushing yards had a significant and negative relationship with the dependent variable, Win/Lose, (r = -.43; p <.01). The intercorrelations for each predictor variable are presented in Table 2.

Table 2 Intercorrelations for Win/Lose and Predictor Variables Predictor Win/Lose passing yards rushing yards penalty yards number of fumbles number of first downs Win/Lose -- rushing yards passing yards penalty yards number of fumbles number of first downs Note. *p <.05. **p <.01. -.43** --.05 -.50** -- -.05.37** -.17 --.18 -.13.19 -.30* -- -.30*.73** -.00.09.05 -- Prediction of Win/Lose. After the initial descriptives and intercorrelations, a logistic regression analysis was conducted using Win/Lose as the dichotomous dependent variable. The Hosmer and Lemeshow Goodness of Fit Test revealed no significant difference between the observed and expected values ( 2 = 2.74; p =.91). The overall predicting percentage for the logistic regression model was 80.4%. The most significant predictors for a win were the difference in total rushing yards, the difference in total passing yards, and the difference in the number of first downs. Table 3 displays the summary of the full regression analysis including the unstandardized coefficients, Odds ratio, and Wald statistics for each predictor.

Table 3 Summary of Logistic Regression Analysis for Variables Predicting Win/Lose (N=46) Predictor B SE B Odds ratio Wald statistic rushing yards -0.04 0.01 0.97 8.42** passing yards penalty yards number of fumbles number of first downs Constant Note. Overall Predicting Percentage 80.4%. *p <.05. **p <.01. -0.03 0.01 0.97 6.33** 0.04 0.02 1.05 3.66 0.74 0.43 2.09 2.87 0.32 0.15 1.37 4.41* 0.27 0.50 1.31 0.29 Discussion Previous studies (Wagner, 1985; Willoughby, 2002) found the number of first downs was not significant for college football games and was a significant predictor for professional football games. The results of this study indicate that the number of first downs was a significant predicting variable for high school football games. In addition, these findings support the previous studies regarding the significant contributions of rushing and passing yardage to winning a football game. Based on the results of this study, the likelihood of winning a high school football game depends upon increased rushing yardage and decreased passing yardage. Thus, these findings suggest that the predictor variables used in determining the probability of winning college and professional games are applicable to high school football games. One significant outlier was found to have a z score of 3.33. In this game, the model predicted a win, but the game was lost. After further inspection of the raw data, the researcher hypothesized that the number of penalties and penalty yards (6 penalties for 50 yards versus 3

penalties for 30 yards) possibly caused the 17 to 20 loss. The participant had 165 total rushing yards and 15 first downs to the opponent s 66 rushing yards and 4 first downs. The literature (Wagner, 1985; Willoughby, 2002) addressed the possible significance of the difference between the teams quarterback sacks, number of interceptions, and time of possession, but those individual statistics were not available for all football teams; therefore, those variables were not included in the analysis. Future research should be considered using quarterback sacks, number of interceptions, and time of possession. Another limitation to this study was the highly correlated predictor variables of total rushing yards and number of first downs. Since the sample included various games involving the same team, future research could randomly select one week of individual game box statistics for all high school football games within the state.

References Wagner, G. O. (1985). College and professional football scores: A multiple regression analysis. American Economist, 31(1), 33-37. Willoughby, K. A. (2002). Winning games in Canadian football: A logistic regression analysis. The College Mathematics Journal, 33(3), 215-220.