Navigate to the golf data folder and make it your working directory. Load the data by typing
|
|
- Brendan Watson
- 6 years ago
- Views:
Transcription
1 Golf Analysis 1.1 Introduction In a round, golfers have a number of choices to make. For a particular shot, is it better to use the longest club available to try to reach the green, or would it be better to use a shorter club that would be more likely to end up in the fairway? On a par five, should the intent from the start be to be on in regulation? Or is it worth the risk to make it on in two? And for a given course, what clubs should be kept in the bag? Here, we re going to take a look at how particular shots affect the score on a hole. We re also going to look at some plotting tools to visualize the data we have. 1.2 Data Navigate to the golf data folder and make it your working directory. Load the data by typing tps1. data = read. csv (" tps1. csv ", header = TRUE ) You should check to see what is contained in the dataset. One way that might make this easier is to type tps1. data [ which ( tps1. data $ Player. Last. Name == " Mickelson "),] This should bring up four lines of output from the data. Observing that the Shot column has entries 1, 2, 3, and 4, we see that each row is a shot on this particular hole. Now, what do some of the other columns mean? The easiest way to find out is to open Shot Detail Field Defs.pdf, which has descriptions for each of the columns. Look through this to get a better sense of what data is encoded in the file. Exercise What are the units of the X, Y, and Z coordinate columns? What do the columns show when the ball is in the hole? Is the location of the tee box given in the coordinates? The data presented here is from the first hole of the Torrey Pines South Course. A map of the course is available in the file tps_scorecard.pdf. Since we have coordinates, let s try to visualize the shot data. plot ( tps1. data $X. Coordinate, tps1. data $Y. Coordinate )
2 2 Figure 1.1 The X and Y coordinates for the shot data. The plot you obtain should look like Figure 1.1. As we can see, this isn t a particularly helpful plot. The problem here is that we need to get rid of the points that are 0. Exercise Use the which function to define a new data frame which does not include any shots for which the X coordinate is 0. Call this tps1.nonzero. Plot the X and Y coordinates of this new data frame. What do you get? Does the concentration of shot locations make sense? Exercise In Exercise 1.2.2, you should have plotted the shots. In the lower left corner, there is a large cluster of points and then two slightly smaller clusters. What is the reason for this? You may want to use the unique function on the column of dates. 1.3 A First Analysis: Distance off the Tee Suppose we re interested in knowing how the features of a drive correspond to a golfer s score on this hole. One place to begin an analysis would be based on drive distance A Simple Linear Regression First, define a data frame called tps1. first.shot which consists of all shots on the first shots. Then, define a vector called distance.yards which is the distance of the shot in
3 3 yards. 1 Now, we can define a linear model by typing golf. model = lm( tps1. first. shot $ Hole. Score ~ distance. yards ) From here, we can get a lot of information about the fit of the model by using the summary function. > summary ( golf. model ) Call : lm( formula = tps1. first. shot $ Hole. Score ~ distance. yards ) Residuals : Min 1Q Median 3Q Max Coefficients : Estimate Std. Error t value Pr ( > t ) ( Intercept ) < 2e -16 *** distance. yards ** --- Signif. codes : 0 *** ** 0.01 * 0.05 "." Residual standard error : on 301 degrees of freedom Multiple R- squared : , Adjusted R- squared : F- statistic : on 1 and 301 DF, p- value : This is quite a bit of information, and we don t have the tools to understand all of it in this course. 2 The basic idea though is that we are modeling the score a golfer receives on the hole as a random variable Y i, and we are assuming here that it follows the formula Y i = β 0 + β 1 X i + ε i (1.1) where X i is the distance of the drive, ε i is a normal random variable with mean 0, and β 0 and β 1 are constants. The point of the regression is to find β 0 and β 1. So in this particular case, we have Y i = X i + ε i. (1.2) As a sanity check, note that the coefficient β 1 is negative. This means that longer drives are correlated with lower scores on the hole. So, suppose we had shots of 250, 275, and 300 yards. What would the expected scores on the hole be? Turning to Equation (1.2), we would get E[Y 1 ] = (250) = We could similarly compute the other values. Alternatively, R has the built-in command predict. To use this function, we define a new data frame with the necessary predictors. In this case, that s just the distance of the shot. This gives 1 It s not strictly necessary to define the distance in yards. But we do it here because tee shots are usually measured in yards for interpretability. 2 Honestly, it would take about a year of an introductory statistics course to cover all of this material.
4 4 > newdata = data. frame ( distance. yards = c (250, 275, 300) ) > predict ( golf. model, newdata ) which agree with our earlier results. Exercise What are the predicted scores for drives of 240, 280, and 310 yards? Diagnostics When we wrote Equation (1.1) earlier, we were making some pretty strong assumptions about the model. Exercise What are some of the assumptions necessary for a linear model? In particular, what can be said about each individual Y i? How about the Y i collectively? Now, we re going to take the step of looking at our fit to the data. In general, looking at the data before fitting a model is bad practice. Why? Well, humans fit all sorts of complicated models to data, and after that has happened, there s no real way to make any statistical guarantees. In any case, we can type the following > plot ( distance. yards, tps1. first. shot $ Hole. Score ) > abline ( golf. model ) This should produce Figure 1.2 Figure 1.2 Hole score as a function of the yardage of the drive.
5 5 This is not a good fit. However, we could sort of imagine this happening. For one, the response variable, which is the score in this case, only takes a couple values. Second, the trend line doesn t seem to fit the data particularly well. If we want to do better, we ll have to throw more refined data at this problem. Exercise Suppose we had data from the entire round. How could we modify the analysis so we would get data that would be closer to normal? Why would it be closer to normal? What assumptions would we have to make that we haven t made for this analysis? 1.4 A Second Look We re going to try once again to fit the score on a hole as a function of the length of the drive, but we want data that will be more normal. Load the new data by typing full. round = read. csv (" TorreyPinesSouth. csv ") Exercise How many unique first names are there among the golfers? How many unique last names are there among the golfers? How many golfers are represented in the data? Exercise Write a function hole. scores which takes a player identification number and returns the vector of scores on par four holes. Write a function drive. distances which takes a player identification number and returns the vector of drive distances in yards on par four holes. You should be able to type > hole. scores (1810) [1] > drive. distances (1810) [1] [9] Exercise Write a function that returns the vector of averages scores on par four holes and the vector of average drive distances on par four holes for all golfers. Call the function round.info. Exercise Define the linear model average.model so that score is a function of drive distance. Plot the data and the fitted line. Does it look more reasonable to fit a linear model now? How well does the line fit the data? There are other refinements one could use for this. In particular, we might be interested in the effect that hitting into the rough or a fairway trap has on the outcome for the hole. You could continue analyzing this further by dividing the data appropriately.
Lab 11: Introduction to Linear Regression
Lab 11: Introduction to Linear Regression Batter up The movie Moneyball focuses on the quest for the secret of success in baseball. It follows a low-budget team, the Oakland Athletics, who believed that
More informationChapter 12 Practice Test
Chapter 12 Practice Test 1. Which of the following is not one of the conditions that must be satisfied in order to perform inference about the slope of a least-squares regression line? (a) For each value
More informationSection I: Multiple Choice Select the best answer for each problem.
Inference for Linear Regression Review Section I: Multiple Choice Select the best answer for each problem. 1. Which of the following is NOT one of the conditions that must be satisfied in order to perform
More informationWhy We Should Use the Bullpen Differently
Why We Should Use the Bullpen Differently A look into how the bullpen can be better used to save runs in Major League Baseball. Andrew Soncrant Statistics 157 Final Report University of California, Berkeley
More informationy ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together
Statistics 111 - Lecture 7 Exploring Data Numerical Summaries for Relationships between Variables Administrative Notes Homework 1 due in recitation: Friday, Feb. 5 Homework 2 now posted on course website:
More informationPredicting the Total Number of Points Scored in NFL Games
Predicting the Total Number of Points Scored in NFL Games Max Flores (mflores7@stanford.edu), Ajay Sohmshetty (ajay14@stanford.edu) CS 229 Fall 2014 1 Introduction Predicting the outcome of National Football
More information100-Meter Dash Olympic Winning Times: Will Women Be As Fast As Men?
100-Meter Dash Olympic Winning Times: Will Women Be As Fast As Men? The 100 Meter Dash has been an Olympic event since its very establishment in 1896(1928 for women). The reigning 100-meter Olympic champion
More informationPitching Performance and Age
Pitching Performance and Age By: Jaime Craig, Avery Heilbron, Kasey Kirschner, Luke Rector, Will Kunin Introduction April 13, 2016 Many of the oldest players and players with the most longevity of the
More informationLesson 14: Modeling Relationships with a Line
Exploratory Activity: Line of Best Fit Revisited 1. Use the link http://illuminations.nctm.org/activity.aspx?id=4186 to explore how the line of best fit changes depending on your data set. A. Enter any
More informationSample Final Exam MAT 128/SOC 251, Spring 2018
Sample Final Exam MAT 128/SOC 251, Spring 2018 Name: Each question is worth 10 points. You are allowed one 8 1/2 x 11 sheet of paper with hand-written notes on both sides. 1. The CSV file citieshistpop.csv
More informationPitching Performance and Age
Pitching Performance and Age Jaime Craig, Avery Heilbron, Kasey Kirschner, Luke Rector and Will Kunin Introduction April 13, 2016 Many of the oldest and most long- term players of the game are pitchers.
More informationMidterm Exam 1, section 2. Thursday, September hour, 15 minutes
San Francisco State University Michael Bar ECON 312 Fall 2018 Midterm Exam 1, section 2 Thursday, September 27 1 hour, 15 minutes Name: Instructions 1. This is closed book, closed notes exam. 2. You can
More information100-Meter Dash Olympic Winning Times: Will Women Be As Fast As Men?
100-Meter Dash Olympic Winning Times: Will Women Be As Fast As Men? The 100 Meter Dash has been an Olympic event since its very establishment in 1896(1928 for women). The reigning 100-meter Olympic champion
More informationAnnouncements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions
Announcements Announcements Lecture 19: Inference for SLR & Statistics 101 Mine Çetinkaya-Rundel April 3, 2012 HW 7 due Thursday. Correlation guessing game - ends on April 12 at noon. Winner will be announced
More informationEconomic Value of Celebrity Endorsements:
Economic Value of Celebrity Endorsements: Tiger Woods Impact on Sales of Nike Golf Balls September 27, 2012 On Line Appendix The Golf Equipments Golf Bags Golf bags are designed to transport the golf clubs
More informationDistancei = BrandAi + 2 BrandBi + 3 BrandCi + i
. Suppose that the United States Golf Associate (USGA) wants to compare the mean distances traveled by four brands of golf balls when struck by a driver. A completely randomized design is employed with
More informationDriv e accu racy. Green s in regul ation
LEARNING ACTIVITIES FOR PART II COMPILED Statistical and Measurement Concepts We are providing a database from selected characteristics of golfers on the PGA Tour. Data are for 3 of the players, based
More informationa) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5
Prof. C. M. Dalton ECN 209A Spring 2015 Practice Problems (After HW1, HW2, before HW3) CORRECTED VERSION Question 1. Draw and describe a relationship with heteroskedastic errors. Support your claim with
More informationCorrelation and regression using the Lahman database for baseball Michael Lopez, Skidmore College
Correlation and regression using the Lahman database for baseball Michael Lopez, Skidmore College Overview The Lahman package is a gold mine for statisticians interested in studying baseball. In today
More informationMinimal influence of wind and tidal height on underwater noise in Haro Strait
Minimal influence of wind and tidal height on underwater noise in Haro Strait Introduction Scott Veirs, Beam Reach Val Veirs, Colorado College December 2, 2007 Assessing the effect of wind and currents
More informationAnnouncements. % College graduate vs. % Hispanic in LA. % College educated vs. % Hispanic in LA. Problem Set 10 Due Wednesday.
Announcements Announcements UNIT 7: MULTIPLE LINEAR REGRESSION LECTURE 1: INTRODUCTION TO MLR STATISTICS 101 Problem Set 10 Due Wednesday Nicole Dalzell June 15, 2015 Statistics 101 (Nicole Dalzell) U7
More information27Quantify Predictability U10L9. April 13, 2015
1 QUANTIFYING PREDICTABILITY Exercise #1: Make sure that your calculator has its r value on. 2 Exercise #2: In the following exercises four data sets with equal x values are given to illustrate different
More informationLecture 22: Multiple Regression (Ordinary Least Squares -- OLS)
Statistics 22_multiple_regression.pdf Michael Hallstone, Ph.D. hallston@hawaii.edu Lecture 22: Multiple Regression (Ordinary Least Squares -- OLS) Some Common Sense Assumptions for Multiple Regression
More informationAnnouncements. Unit 7: Multiple Linear Regression Lecture 3: Case Study. From last lab. Predicting income
Announcements Announcements Unit 7: Multiple Linear Regression Lecture 3: Case Study Statistics 101 Mine Çetinkaya-Rundel April 18, 2013 OH: Sunday: Virtual OH, 3-4pm - you ll receive an email invitation
More informationComputing the Probability of Scoring a 2 in Disc Golf Revised November 20, 2014 Steve West Disc Golf, LLC
Computing the Probability of Scoring a 2 in Disc Golf Revised November 20, 2014 Steve West Disc Golf, LLC Data: Scores from 2014 Am Worlds and related tournaments. The number of player-rounds contributing
More informationMath SL Internal Assessment What is the relationship between free throw shooting percentage and 3 point shooting percentages?
Math SL Internal Assessment What is the relationship between free throw shooting percentage and 3 point shooting percentages? fts6 Introduction : Basketball is a sport where the players have to be adept
More informationRunning head: DATA ANALYSIS AND INTERPRETATION 1
Running head: DATA ANALYSIS AND INTERPRETATION 1 Data Analysis and Interpretation Final Project Vernon Tilly Jr. University of Central Oklahoma DATA ANALYSIS AND INTERPRETATION 2 Owners of the various
More informationBuilding an NFL performance metric
Building an NFL performance metric Seonghyun Paik (spaik1@stanford.edu) December 16, 2016 I. Introduction In current pro sports, many statistical methods are applied to evaluate player s performance and
More informationUnit 4: Inference for numerical variables Lecture 3: ANOVA
Unit 4: Inference for numerical variables Lecture 3: ANOVA Statistics 101 Thomas Leininger June 10, 2013 Announcements Announcements Proposals due tomorrow. Will be returned to you by Wednesday. You MUST
More information1wsSMAM 319 Some Examples of Graphical Display of Data
1wsSMAM 319 Some Examples of Graphical Display of Data 1. Lands End employs numerous persons to take phone orders. Computers on which orders are entered also automatically collect data on phone activity.
More informationCopy of my report. Why am I giving this talk. Overview. State highway network
Road Surface characteristics and traffic accident rates on New Zealand s state highway network Robert Davies Statistics Research Associates http://www.statsresearch.co.nz Copy of my report There is a copy
More informationBoyle s Law: Pressure-Volume. Relationship in Gases
Boyle s Law: Pressure-Volume Relationship in Gases The primary objective of this experiment is to determine the relationship between the pressure and volume of a confined gas. The gas we use will be air,
More informationCS 221 PROJECT FINAL
CS 221 PROJECT FINAL STUART SY AND YUSHI HOMMA 1. INTRODUCTION OF TASK ESPN fantasy baseball is a common pastime for many Americans, which, coincidentally, defines a problem whose solution could potentially
More information1. Answer this student s question: Is a random sample of 5% of the students at my school large enough, or should I use 10%?
Econ 57 Gary Smith Fall 2011 Final Examination (150 minutes) No calculators allowed. Just set up your answers, for example, P = 49/52. BE SURE TO EXPLAIN YOUR REASONING. If you want extra time, you can
More informationBoyle s Law: Pressure-Volume Relationship in Gases. PRELAB QUESTIONS (Answer on your own notebook paper)
Boyle s Law: Pressure-Volume Relationship in Gases Experiment 18 GRADE LEVEL INDICATORS Construct, interpret and apply physical and conceptual models that represent or explain systems, objects, events
More informationBoyle s Law: Pressure-Volume Relationship in Gases
Boyle s Law: Pressure-Volume Relationship in Gases Computer 6 The primary objective of this experiment is to determine the relationship between the pressure and volume of a confined gas. The gas we use
More informationSession 2: Introduction to Multilevel Modeling Using SPSS
Session 2: Introduction to Multilevel Modeling Using SPSS Exercise 1 Description of Data: exerc1 This is a dataset from Kasia Kordas s research. It is data collected on 457 children clustered in schools.
More informationModel Selection Erwan Le Pennec Fall 2015
Model Selection Erwan Le Pennec Fall 2015 library("dplyr") library("ggplot2") library("ggfortify") library("reshape2") Model Selection We will now use another classical dataset birthwt which corresponds
More informationMapping a course for Pocket Caddy
Contents: 1. Mapping overview 2. Mapping your course o 2.1. Locating the course o 2.2. Mapping the holes o 2.3. Opening the template file and naming the course o 2.4. Mapping the greens o 2.5. Mapping
More informationASTERISK OR EXCLAMATION POINT?: Power Hitting in Major League Baseball from 1950 Through the Steroid Era. Gary Evans Stat 201B Winter, 2010
ASTERISK OR EXCLAMATION POINT?: Power Hitting in Major League Baseball from 1950 Through the Steroid Era by Gary Evans Stat 201B Winter, 2010 Introduction: After a playerʼs strike in 1994 which resulted
More informationEvaluating NBA Shooting Ability using Shot Location
Evaluating NBA Shooting Ability using Shot Location Dennis Lock December 16, 2013 There are many statistics that evaluate the performance of NBA players, including some that attempt to measure a players
More informationISDS 4141 Sample Data Mining Work. Tool Used: SAS Enterprise Guide
ISDS 4141 Sample Data Mining Work Taylor C. Veillon Tool Used: SAS Enterprise Guide You may have seen the movie, Moneyball, about the Oakland A s baseball team and general manager, Billy Beane, who focused
More informationGrade: 8. Author(s): Hope Phillips
Title: Tying Knots: An Introductory Activity for Writing Equations in Slope-Intercept Form Prior Knowledge Needed: Grade: 8 Author(s): Hope Phillips BIG Idea: Linear Equations how to analyze data from
More informationThe Reliability of Intrinsic Batted Ball Statistics Appendix
The Reliability of ntrinsic Batted Ball Statistics Appendix Glenn Healey, EECS Department University of California, rvine, CA 92617 Given information about batted balls for a set of players, we review
More informationQuantitative Methods for Economics Tutorial 6. Katherine Eyal
Quantitative Methods for Economics Tutorial 6 Katherine Eyal TUTORIAL 6 13 September 2010 ECO3021S Part A: Problems 1. (a) In 1857, the German statistician Ernst Engel formulated his famous law: Households
More informationThe MACC Handicap System
MACC Racing Technical Memo The MACC Handicap System Mike Sayers Overview of the MACC Handicap... 1 Racer Handicap Variability... 2 Racer Handicap Averages... 2 Expected Variations in Handicap... 2 MACC
More information8th Grade. Data.
1 8th Grade Data 2015 11 20 www.njctl.org 2 Table of Contents click on the topic to go to that section Two Variable Data Line of Best Fit Determining the Prediction Equation Two Way Table Glossary Teacher
More informationGuide to Computing Minitab commands used in labs (mtbcode.out)
Guide to Computing Minitab commands used in labs (mtbcode.out) A full listing of Minitab commands can be found by invoking the HELP command while running Minitab. A reference card, with listing of available
More informationAPPENDIX A COMPUTATIONALLY GENERATED RANDOM DIGITS 748 APPENDIX C CHI-SQUARE RIGHT-HAND TAIL PROBABILITIES 754
IV Appendices APPENDIX A COMPUTATIONALLY GENERATED RANDOM DIGITS 748 APPENDIX B RANDOM NUMBER TABLES 750 APPENDIX C CHI-SQUARE RIGHT-HAND TAIL PROBABILITIES 754 APPENDIX D LINEAR INTERPOLATION 755 APPENDIX
More informationNBA TEAM SYNERGY RESEARCH REPORT 1
NBA TEAM SYNERGY RESEARCH REPORT 1 NBA Team Synergy and Style of Play Analysis Karrie Lopshire, Michael Avendano, Amy Lee Wang University of California Los Angeles June 3, 2016 NBA TEAM SYNERGY RESEARCH
More informationIDENTIFYING SUBJECTIVE VALUE IN WOMEN S COLLEGE GOLF RECRUITING REGARDLESS OF SOCIO-ECONOMIC CLASS. Victoria Allred
IDENTIFYING SUBJECTIVE VALUE IN WOMEN S COLLEGE GOLF RECRUITING REGARDLESS OF SOCIO-ECONOMIC CLASS by Victoria Allred A Senior Honors Project Presented to the Honors College East Carolina University In
More informationAnalysis of Variance. Copyright 2014 Pearson Education, Inc.
Analysis of Variance 12-1 Learning Outcomes Outcome 1. Understand the basic logic of analysis of variance. Outcome 2. Perform a hypothesis test for a single-factor design using analysis of variance manually
More informationBoyle s Law. Pressure-Volume Relationship in Gases. Figure 1
Boyle s Law Pressure-Volume Relationship in Gases The primary objective of this experiment is to determine the relationship between the pressure and volume of a confined gas. The gas we use will be air,
More informationWarmupweek. Etiquette. Technical. Rules EAGLES LESSON1.
LESSON1 Warmupweek It may have been some time since you have been golfing. Can you remember the basics? Use this space to write a few notes about Grip.............................................. Posture...........................................
More informationTwo Machine Learning Approaches to Understand the NBA Data
Two Machine Learning Approaches to Understand the NBA Data Panagiotis Lolas December 14, 2017 1 Introduction In this project, I consider applications of machine learning in the analysis of nba data. To
More informationOzobot Bit Classroom Application: Boyle s Law Simulation
OZO AP P EAM TR T S BO RO VE D Ozobot Bit Classroom Application: Boyle s Law Simulation Created by Richard Born Associate Professor Emeritus Northern Illinois University richb@rborn.org Topics Chemistry,
More informationThe 19 th hole - 18 Card Micro Golf game. # Of Players: 1 or 2 players. Game time: min per game if a 2 player game.
The 19 th hole - 18 Card Micro Golf game # Of Players: 1 or 2 players Game time: 30 45 min per game if a 2 player game. Components [4] - TEE BOXES [4] - PUTTING GREENS [6] - FAIRWAY [2] - Player golf ball
More informationHitting The Driver Made Easy
Table of Contents Introduction 3 Equipment 4 The Golf Shaft 4 The Golf Grip 5 The Club Head 6 Equipment Summary 7 The Setup And Swing 9 The Grip 10 Ball Position 12 Alignment 12 Posture 14 The Take-Away
More information1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data.
1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data. Green Blue Brown Blue Blue Brown Blue Blue Blue Green Blue Brown Blue Brown Brown Blue
More informationRobust specification testing in regression: the FRESET test and autocorrelated disturbances
Robust specification testing in regression: the FRESET test and autocorrelated disturbances Linda F. DeBenedictis and David E. A. Giles * Policy and Research Division, Ministry of Human Resources, 614
More informationE. Agu, M. Kasperski Ruhr-University Bochum Department of Civil and Environmental Engineering Sciences
EACWE 5 Florence, Italy 19 th 23 rd July 29 Flying Sphere image Museo Ideale L. Da Vinci Chasing gust fronts - wind measurements at the airport Munich, Germany E. Agu, M. Kasperski Ruhr-University Bochum
More informationAlgebra 1 Unit 6 Study Guide
Name: Period: Date: Use this data to answer questions #1. The grades for the last algebra test were: 12, 48, 55, 57, 60, 61, 65, 65, 68, 71, 74, 74, 74, 80, 81, 81, 87, 92, 93 1a. Find the 5 number summary
More informationDarrell Klassen Inner Circle
Darrell Klassen Inner Circle --- SCORING REPORT PART #2 --- BREAKING 80 Just for your information, if you have reached the place in your game where you are shooting consistently in the 80 s, you are probably
More informationLesson 16: More on Modeling Relationships with a Line
Student Outcomes Students use the least squares line to predict values for a given data set. Students use residuals to evaluate the accuracy of predictions based on the least squares line. Lesson Notes
More informationAverages. October 19, Discussion item: When we talk about an average, what exactly do we mean? When are they useful?
Averages October 19, 2005 Discussion item: When we talk about an average, what exactly do we mean? When are they useful? 1 The Arithmetic Mean When we talk about an average, we can mean different things
More informationA Novel Approach to Predicting the Results of NBA Matches
A Novel Approach to Predicting the Results of NBA Matches Omid Aryan Stanford University aryano@stanford.edu Ali Reza Sharafat Stanford University sharafat@stanford.edu Abstract The current paper presents
More informationCOMPLETING THE RESULTS OF THE 2013 BOSTON MARATHON
COMPLETING THE RESULTS OF THE 2013 BOSTON MARATHON Dorit Hammerling 1, Matthew Cefalu 2, Jessi Cisewski 3, Francesca Dominici 2, Giovanni Parmigiani 2,4, Charles Paulson 5, Richard Smith 1,6 1 Statistical
More informationEvaluating The Best. Exploring the Relationship between Tom Brady s True and Observed Talent
Evaluating The Best Exploring the Relationship between Tom Brady s True and Observed Talent Heather Glenny, Emily Clancy, and Alex Monahan MCS 100: Mathematics of Sports Spring 2016 Tom Brady s recently
More informationPredictors for Winning in Men s Professional Tennis
Predictors for Winning in Men s Professional Tennis Abstract In this project, we use logistic regression, combined with AIC and BIC criteria, to find an optimal model in R for predicting the outcome of
More informationIs Tiger Woods Loss Averse? Persistent Bias in the Face of Experience, Competition, and High Stakes. Devin G. Pope and Maurice E.
Is Tiger Woods Loss Averse? Persistent Bias in the Face of Experience, Competition, and High Stakes Devin G. Pope and Maurice E. Schweitzer Web Appendix Appendix Figure 1 replicates Figure 2 of the paper
More informationMJA Rev 10/17/2011 1:53:00 PM
Problem 8-2 (as stated in RSM Simplified) Leonard Lye, Professor of Engineering and Applied Science at Memorial University of Newfoundland contributed the following case study. It is based on the DOE Golfer,
More informationMIS0855: Data Science In-Class Exercise: Working with Pivot Tables in Tableau
MIS0855: Data Science In-Class Exercise: Working with Pivot Tables in Tableau Objective: Work with dimensional data to navigate a data set Learning Outcomes: Summarize a table of data organized along dimensions
More informationJournal of Human Sport and Exercise E-ISSN: Universidad de Alicante España
Journal of Human Sport and Exercise E-ISSN: 1988-5202 jhse@ua.es Universidad de Alicante España SOÓS, ISTVÁN; FLORES MARTÍNEZ, JOSÉ CARLOS; SZABO, ATTILA Before the Rio Games: A retrospective evaluation
More informationEffect of homegrown players on professional sports teams
Effect of homegrown players on professional sports teams ISYE 2028 Rahul Patel 902949215 Problem Description: Football is commonly referred to as America s favorite pastime. However, for thousands of people
More informationEstimating the Probability of Winning an NFL Game Using Random Forests
Estimating the Probability of Winning an NFL Game Using Random Forests Dale Zimmerman February 17, 2017 2 Brian Burke s NFL win probability metric May be found at www.advancednflstats.com, but the site
More informationEfficiency Wages in Major League Baseball Starting. Pitchers Greg Madonia
Efficiency Wages in Major League Baseball Starting Pitchers 1998-2001 Greg Madonia Statement of Problem Free agency has existed in Major League Baseball (MLB) since 1974. This is a mechanism that allows
More informationHomework Exercises Problem Set 1 (chapter 2)
1 Name: Homework Exercises Problem Set 1 (chapter 2) Exercise 2.5.1 The 10 third-grade students at Lake Wobegone elementary school took a spelling test with 10 words, each worth one point. Given that the
More informationCalculation of Trail Usage from Counter Data
1. Introduction 1 Calculation of Trail Usage from Counter Data 1/17/17 Stephen Martin, Ph.D. Automatic counters are used on trails to measure how many people are using the trail. A fundamental question
More informationReturns to Skill in Professional Golf: A Quantile Regression Approach
International Journal of Sport Finance, 2010, 5, 167-180, 2010 West Virginia University Returns to Skill in Professional Golf: A Quantile Regression Approach Leo H. Kahane 1 1 Providence College Leo H.
More informationNCSS Statistical Software
Chapter 256 Introduction This procedure computes summary statistics and common non-parametric, single-sample runs tests for a series of n numeric, binary, or categorical data values. For numeric data,
More informationOne-factor ANOVA by example
ANOVA One-factor ANOVA by example 2 One-factor ANOVA by visual inspection 3 4 One-factor ANOVA H 0 H 0 : µ 1 = µ 2 = µ 3 = H A : not all means are equal 5 One-factor ANOVA but why not t-tests t-tests?
More informationTHE USGA HANDICAP SYSTEM. Reference Guide
THE USGA HANDICAP SYSTEM Reference Guide THE USGA HANDICAP SYSTEM Reference Guide Copyright 2017 United States Golf Association All Rights Reserved Printed in the United States of America The USGA Handicap
More informationTying Knots. Approximate time: 1-2 days depending on time spent on calculator instructions.
Tying Knots Objective: Students will find a linear model to fit data. Students will compare and interpret different slopes and intercepts in a context. Students will discuss domain and range: as discrete
More informationDATA SCIENCE SUMMER UNI VIENNA
Prerequisites - You have installed Tableau Desktop on your computer. Available here: http://www.tableau.com/academic/students - You have downloaded the data (athlete_events.csv) available here: https://www.kaggle.com/heesoo37/120-years-of-olympic-historyathletes-and-results
More informationBASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG
BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG GOAL OF PROJECT The goal is to predict the winners between college men s basketball teams competing in the 2018 (NCAA) s March
More informationGolfLogix: Golf GPS. User Guide for: BlackBerry Curve. Version 1.0. Software Release , 8330, 8350i, 8800, 8820, 8830, 8900
GolfLogix: Golf GPS User Guide for: BlackBerry Curve 8310, 8330, 8350i, 8800, 8820, 8830, 8900 Version 1.0 Software Release 1.0 1 Table of Contents Topic Page Topic Page Getting Started Statistics Downloading
More informationReal-Time Electricity Pricing
Real-Time Electricity Pricing Xi Chen, Jonathan Hosking and Soumyadip Ghosh IBM Watson Research Center / Northwestern University Yorktown Heights, NY, USA X. Chen, J. Hosking & S. Ghosh (IBM) Real-Time
More informationAccident data analysis using Statistical methods A case study of Indian Highway
Accident data analysis using Statistical methods A case study of Indian Highway Rahul Badgujar 1, Priyam Mishra 2, Mayank Chandra 3, Sayali Sandbhor 4, Humera Khanum 5 1,2,3 Undergraduate scholars, Department
More informationCompetitive Performance of Elite Olympic-Distance Triathletes: Reliability and Smallest Worthwhile Enhancement
SPORTSCIENCE sportsci.org Original Research / Performance Competitive Performance of Elite Olympic-Distance Triathletes: Reliability and Smallest Worthwhile Enhancement Carl D Paton, Will G Hopkins Sportscience
More informationIs It Truly a Building Ground? A Returns to Skill and Learning by Doing Study of the PGA Tour and the Web.com Tour
Skidmore College Creative Matter Economics Student Theses and Capstone Projects Economics 2017 Is It Truly a Building Ground? A Returns to Skill and Learning by Doing Study of the PGA Tour and the Web.com
More informationHandicap Differential = (Adjusted Gross Score - USGA Course Rating) x 113 / USGA Slope Rating
FORMULAS Section 10 USGA HANDICAP FORMULA A Handicap Index is the USGA's mark which is used to indicate a measurement of a player's potential scoring ability on a course of standard difficulty. Potential
More informationTracking of Large-Scale Wave Motions
Tracking of Large-Scale Wave Motions Nikki Barbee, Adam Cale, Justin Wittrock Dr. William Gutowski Meteorology 44 Fall 29 This semester we have observed large scale wave patterns in both the Northern and
More informationTaking Your Class for a Walk, Randomly
Taking Your Class for a Walk, Randomly Daniel Kaplan Macalester College Oct. 27, 2009 Overview of the Activity You are going to turn your students into an ensemble of random walkers. They will start at
More informationGolfLogix: Golf GPS. User Guide for: iphone 3G & 3GS. Version 1.0. Software Release 1.0
GolfLogix: Golf GPS User Guide for: iphone 3G & 3GS Version 1.0 Software Release 1.0 1 Table of Contents Topic Page Topic Page Getting Started Statistics Downloading the GolfLogix Application 4 Statistic
More informationLegendre et al Appendices and Supplements, p. 1
Legendre et al. 2010 Appendices and Supplements, p. 1 Appendices and Supplement to: Legendre, P., M. De Cáceres, and D. Borcard. 2010. Community surveys through space and time: testing the space-time interaction
More informationThe Effect of Newspaper Entry and Exit on Electoral Politics Matthew Gentzkow, Jesse M. Shapiro, and Michael Sinkinson Web Appendix
The Effect of Newspaper Entry and Exit on Electoral Politics Matthew Gentzkow, Jesse M. Shapiro, and Michael Sinkinson Web Appendix 1 1 Sources of Voting Data Our primary source for county-level voting
More informationBiostatistics & SAS programming
Biostatistics & SAS programming Kevin Zhang March 6, 2017 ANOVA 1 Two groups only Independent groups T test Comparison One subject belongs to only one groups and observed only once Thus the observations
More informationTitle: 4-Way-Stop Wait-Time Prediction Group members (1): David Held
Title: 4-Way-Stop Wait-Time Prediction Group members (1): David Held As part of my research in Sebastian Thrun's autonomous driving team, my goal is to predict the wait-time for a car at a 4-way intersection.
More informationWhen Falling Just Short is a Good Thing: the Effect of Past Performance on Improvement.
Title: When Falling Just Short is a Good Thing: the Effect of Past Performance on Improvement. Mariya Burdina (corresponding author) Associate Professor of Economics University of Central Oklahoma 100
More informationKelsey Schroeder and Roberto Argüello June 3, 2016 MCS 100 Final Project Paper Predicting the Winner of The Masters Abstract This paper presents a
Kelsey Schroeder and Roberto Argüello June 3, 2016 MCS 100 Final Project Paper Predicting the Winner of The Masters Abstract This paper presents a new way of predicting who will finish at the top of the
More information