Driv e accu racy. Green s in regul ation

Similar documents
Stats 2002: Probabilities for Wins and Losses of Online Gambling

Chapter 12 Practice Test

Running head: DATA ANALYSIS AND INTERPRETATION 1

Case Processing Summary. Cases Valid Missing Total N Percent N Percent N Percent % 0 0.0% % % 0 0.0%

Navigate to the golf data folder and make it your working directory. Load the data by typing

Statistical Analysis of PGA Tour Skill Rankings USGA Research and Test Center June 1, 2007

Section I: Multiple Choice Select the best answer for each problem.

Confidence Interval Notes Calculating Confidence Intervals

Lab 11: Introduction to Linear Regression

Sample Final Exam MAT 128/SOC 251, Spring 2018

Announcements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions

Analysis of Variance. Copyright 2014 Pearson Education, Inc.

Contingent Valuation Methods

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together

The probability of winning a high school football game.

Week 7 One-way ANOVA

Lesson 14: Modeling Relationships with a Line

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question:

Kelsey Schroeder and Roberto Argüello June 3, 2016 MCS 100 Final Project Paper Predicting the Winner of The Masters Abstract This paper presents a

PSY201: Chapter 5: The Normal Curve and Standard Scores

Unit 4: Inference for numerical variables Lecture 3: ANOVA

Guide to Computing Minitab commands used in labs (mtbcode.out)

Name May 3, 2007 Math Probability and Statistics

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA

Boyle s Law. Pressure-Volume Relationship in Gases. Figure 1

Modeling Pedestrian Volumes on College Campuses

MJA Rev 10/17/2011 1:53:00 PM

Distancei = BrandAi + 2 BrandBi + 3 BrandCi + i

Algebra 1 Unit 6 Study Guide

Pitching Performance and Age

IDENTIFYING SUBJECTIVE VALUE IN WOMEN S COLLEGE GOLF RECRUITING REGARDLESS OF SOCIO-ECONOMIC CLASS. Victoria Allred

Legendre et al Appendices and Supplements, p. 1

Building an NFL performance metric

STANDARD SCORES AND THE NORMAL DISTRIBUTION

Example 1: One Way ANOVA in MINITAB

In addition to reading this assignment, also read Appendices A and B.

Psychology - Mr. Callaway/Mundy s Mill HS Unit Research Methods - Statistics

The MACC Handicap System

Name Date Period. E) Lowest score: 67, mean: 104, median: 112, range: 83, IQR: 102, Q1: 46, SD: 17

Stat 139 Homework 3 Solutions, Spring 2015

Practice Test Unit 6B/11A/11B: Probability and Logic

% per year Age (years)

Practice Test Unit 06B 11A: Probability, Permutations and Combinations. Practice Test Unit 11B: Data Analysis

Handicap Differential = (Adjusted Gross Score - USGA Course Rating) x 113 / USGA Slope Rating

a) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5

Is Tiger Woods Loss Averse? Persistent Bias in the Face of Experience, Competition, and High Stakes. Devin G. Pope and Maurice E.

Lab Report Outline the Bones of the Story

Boyle s Law: Pressure-Volume Relationship in Gases. PRELAB QUESTIONS (Answer on your own notebook paper)

Boyle s Law: Pressure-Volume Relationship in Gases

NCSS Statistical Software

The pth percentile of a distribution is the value with p percent of the observations less than it.

Select Boxplot -> Multiple Y's (simple) and select all variable names.

Accident data analysis using Statistical methods A case study of Indian Highway

Boyle s Law: Pressure-Volume. Relationship in Gases

Predictors for Winning in Men s Professional Tennis

Chapter 2: Modeling Distributions of Data

Name Student Activity

North Point - Advance Placement Statistics Summer Assignment

CS 7641 A (Machine Learning) Sethuraman K, Parameswaran Raman, Vijay Ramakrishnan

Lecture 22: Multiple Regression (Ordinary Least Squares -- OLS)

Journal of Human Sport and Exercise E-ISSN: Universidad de Alicante España

Gizachew Tiruneh, Ph. D., Department of Political Science, University of Central Arkansas, Conway, Arkansas

ASTERISK OR EXCLAMATION POINT?: Power Hitting in Major League Baseball from 1950 Through the Steroid Era. Gary Evans Stat 201B Winter, 2010

STAT 101 Assignment 1

Quantitative Literacy: Thinking Between the Lines

Boyle s Law: Pressure-Volume Relationship in Gases

NBA TEAM SYNERGY RESEARCH REPORT 1

HSIS. Association of Selected Intersection Factors With Red-Light-Running Crashes. State Databases Used SUMMARY REPORT

8th Grade. Data.

Ozobot Bit Classroom Application: Boyle s Law Simulation

Exploring Measures of Central Tendency (mean, median and mode) Exploring range as a measure of dispersion

PGA Tour Scores as a Gaussian Random Variable

APPROACH RUN VELOCITIES OF FEMALE POLE VAULTERS

ANOVA - Implementation.

TRIP GENERATION RATES FOR SOUTH AFRICAN GOLF CLUBS AND ESTATES

One-factor ANOVA by example

IHS AP Statistics Chapter 2 Modeling Distributions of Data MP1

Math SL Internal Assessment What is the relationship between free throw shooting percentage and 3 point shooting percentages?

Solutionbank S1 Edexcel AS and A Level Modular Mathematics

A few things to remember about ANOVA

March Madness Basketball Tournament

The Project The project involved developing a simulation model that determines outcome probabilities in professional golf tournaments.

STT 315 Section /19/2014

DISMAS Evaluation: Dr. Elizabeth C. McMullan. Grambling State University

Pitching Performance and Age

ISDS 4141 Sample Data Mining Work. Tool Used: SAS Enterprise Guide

Returns to Skill in Professional Golf: A Quantile Regression Approach

That pesky golf game and the dreaded stats class

Diameter in cm. Bubble Number. Bubble Number Diameter in cm

March Madness Basketball Tournament

Correlation and regression using the Lahman database for baseball Michael Lopez, Skidmore College

Universitas Sumatera Utara

Journal of Quantitative Analysis in Sports

The Reliability of Intrinsic Batted Ball Statistics Appendix

ISyE 6414 Regression Analysis

Is lung capacity affected by smoking, sport, height or gender. Table of contents

A decision maker subjectively assigned the following probabilities to the four outcomes of an experiment: P(E 1) =.10, P(E 2) =.

Taking Your Class for a Walk, Randomly

Internet Technology Fundamentals. To use a passing score at the percentiles listed below:

Unit4: Inferencefornumericaldata 4. ANOVA. Sta Spring Duke University, Department of Statistical Science

Transcription:

LEARNING ACTIVITIES FOR PART II COMPILED Statistical and Measurement Concepts We are providing a database from selected characteristics of golfers on the PGA Tour. Data are for 3 of the players, based on their end-of-year (2) information. The players are in three groups according to money won: group (players -), group 2 (players 2-3), and group 3 (players 4 to 5) (see table A, or the Excel file named PGA datasheet for the full data set). The variables listed by column in the database are player name, money won ($), world rank, average score, drive distance, drive accuracy, greens hit in regulation (%), putts per round, scrambling, sand saves (%), group membership (, 2, or 3 as listed previously), and whether the golfer is from the United States or another country. Table A Data on PGA Players Ordered by Money Won ( to, 2 to 3, 4 to 5) Name Money won Wo rld ran k Aver age scor e Driv e dista nce Driv e accu racy Green s in regul ation Pu tts pe r ro un d Scram bling Sa nd sa ve s Grou ping by mon ey U.S. vs. interna tional Singh 66 94. 5. 69.5 297. 59.4 5 6.45 29. 47 5.92 45.. 2. Micke lson 5 75. 3. 69. 7 295. 7 55.2 7 65. 2. 74 6.42 62.5.. Garcia 452 24. 2. 69. 2 294. 6 59.3 9 67.6 29. 6 57.59 57. 2. 2. Perry 46637 4. 69. 296. 6.9 67.47 29. 57.57 5.. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

94. 3 7 25. Kim 46562 65.. 69.2 3. 9 5.3 4 65.7 2. 5 59.32 5.3 5.. Villeg as 44226 4. 7. 69.4 9 293. 3 5. 5 64.6 2. 97 53.52 54.6. 2. Harrin gton 4335 5. 4. 69.2 296. 3 59.3 7 6.67 2. 4 6.2 5. 6. 2. Cink 39793. 6. 7. 2 296. 9 55.2 7 66.94 29. 6 55.6 5. 3.. Leona rd 39435 42. 23. 69.7 7 2. 4 67.7 2 66.6 2. 5 6.7 55. 7.. Allenb y 3667. 27. 69.7 3 29. 7 65.6 4 7.4 3. 7 55.26 46.4 9. 2. Els 25372 9. 9. 69.9 296. 56. 6.33 29. 2 56.6 54.3 7 2. 2. Petter sson 2525 3. 62. 7.2 4 26. 59. 7 63.54 2. 59. 53. 3 2. 2. Apple 2446 3. 69.7 29. 5. 6.9 2. 6.24 56.3 2. 2. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

by 3. 3 9 9 55 Strick er 2433 4. 5. 69.9 23. 6 56.2 5 63. 2. 76 6.3 52.3 4 2.. Camp bell 2447 7. 65. 7. 4 29. 9 65.6 6.44 29. 5 54.6 43.4 2.. Weekl ey 2397 5. 4. 7.4 29. 7 64.7 5 67.7 3. 9 57. 5.3 9 2.. Traha n 2343 6. 74. 7.3 3 29. 3 65.3 66.25 29. 52 55.69 42.4 2.. Ames 2257 7. 37. 69.9 9 23. 62.7 2 65.4 2. 99 5.72 5.7 6 2. 2. Duke 223 5.. 7.3 24. 9 62.2 7 64. 2. 79 57.2 5.9 6 2.. Hart 22 7. 54. 7.5 9 275. 5 6. 66. 2. 3 6.2 63.7 2.. Thom pson 693 29. 56. 7. 4 295. 4 66. 5 63.4 29. 75 57.75 4.7 3.. D Johns 79 95. 43. 7.2 2 39. 7 53. 5 63.7 29. 4 5.5 42.4 3.. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

on Perez 756 3. 2. 7. 294. 2 63.2 7 66.9 29. 3 56.2 55. 3.. Mayfa ir 756 3. 92. 7.5 24. 72. 6 6.2 29. 4 6.7 54.6 3.. Clark 722 3. 2. 7.3 2. 64.9 63.6 29. 97 5.32 54.5 5 3. 2. Bryan t 79 53. 5. 7.2 279. 5 73. 7 64. 29. 47 59. 4.3 3.. Pampl ing 729 52. 56. 7. 2. 65.6 5 63.6 29. 3 56. 53.4 6 3. 2. Love 6952 37.. 7.3 3. 3 5.2 2 64.74 29. 5 55.56 5. 3.. Badde ley 6655 7. 47. 7.2 29. 3 59.4 5 62.2 2. 3 5.52 52. 3. 2. Kelly 6524. 2. 7.9 4 277. 66.2 63.36 29. 39 57.76 4.6 3.. Chapter 6 From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Becoming Acquainted With Statistical Concepts We provide a database from selected characteristics of golfers on the PGA Tour. Data are for 3 of the players, based on their end-of-year (2) information. The players are in three groups according to money won: group (players to ), group 2 (players 2 to 3), and group 3 (players 4 to 5). The variables listed by column in the database are player name, money won ($), world rank, average score, driving distance, driving accuracy (%), greens hit in regulation (%), putts per round, scrambling, sand saves (%), group membership by money earned (, 2, or 3 as listed previously), and whether they are from the United States or another country. You ll find the database in Handout, Activity 2. (The database is also available as an Excel file named PGA datasheet. ) The first thing to do with every data set is evaluate the characteristics of the data set. In the following exercise we have used SPSS 22. to evaluate whether the data appear normal for scoring average. In SPSS we do the following: Use the pull-down menu for Analyze. Select the Descriptive Statistics bar. Select the Explore bar. You can now add average score to the Dependent List. Click on Statistics and check Descriptives with a 95% confidence interval and click Continue. Click OK to run the statistics. You can see from table 6. and figure 6. (and from the SPSS results) that the data are reasonably normal in distribution. The mean (7.) and standard deviation (.54) for scoring average are also depicted in table 6.. Note, however, that the standard deviation is very small relative to the mean, because the range of scores is small (69.2 to 7.22). Skewness is slightly positive and kurtosis is slightly negative but are within normal expectations. Table 6. Descriptives Statistic Standard error Scoring Mean 7.27.952 From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

average 95% confidence interval for mean Lower bound 69.2 Upper bound 7.242 5% trimmed mean 7.754 Median 7.6 Variance.29 Standard deviation.53964 Minimum 69.2 Maximum 7.22 Range 2. Interquartile range.66 Skewness.2.427 Kurtosis.297.33 The normality of the data is further confirmed by looking at the histogram (with normal curve overlaid) shown in figure 6.. We got this figure in SPSS the following way: Use the pull-down menu for Analyze. Select the Descriptive Statistics bar. Select the Frequencies bar. Select average score for Variable(s). From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Click the chart option and select Histograms with Show Normal Curve on Histogram checked. Click Continue and then click OK to get the chart. Figure 6. Histogram with overlay of normal curve for scoring averages. We suggest that you do this exercise with the students in your research methods class. We have found it effective to do this as a group if you have the facilities to do so. We reserve our computer lab where each machine has SPSS loaded. We then give each student the entire data set (Handout, Activity 2 or the Excel file named PGA datasheet ) as an SPSS file. This can be done using a thumb drive or, more efficiently, as an e-mail attachment. We bring a laptop with SPSS and the data loaded on it and use an LCD projector and From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

screen to show each step in the process. We do this on the laptop, and students (working alone or they can work in pairs if there are not enough computers) do it on their own machines and print a hard copy for annotation. We point out each of the features so they can annotate on the hard copy. If you don t have the resources to do this in a group, you can go over the process in class using a laptop and LCD projector, and then send each student a PDF of the results to complete the assignment outside of class. Chapter 7 Statistical Issues in Research Planning and Evaluation Review the information from activity 2 in chapter 6. One of the most important statistical skills is to estimate power (chances of rejecting a false null hypothesis) when reading or planning studies. We can use our database on PGA golfers to demonstrate this process. Suppose you would like to do a study on college golf teams in 29 to evaluate if they differ from the top PGA players in saves of par from the sand trap (the variable sand saves in the Excel file named PGA datasheet ). From the data, you could calculate the data on the PGA top players; in 2 they saved par 5.% of the time from the sand trap (SD = 5.%). You would like to know if the 29 college golf players in the Southeastern conference differ significantly from that percentage of par saves from the sand. You know the average percent of sand saves among SEC college golfers in 2 was 4% (SD = 6.%). How many SEC and PGA players from 2 would you need sand save data on to produce a significant difference (p =.5) with a power of.? To do this analysis, do the following: Go to the Power Calculator at www.stat.ubc.ca/~rollin/stats/ssize/n2.html. Bullet Calculate Sample Size (for specified Power). Enter the value of the first mean, PGA players, 5.. Enter the value for the second mean, SEC golfers, 4. Enter sigma, a common standard deviation, by entering the 2 SD for the SEC golfers, 6. (that is the most conservative estimate since it is larger). Select a -Sided Test since you are interested in whether the SEC golfers have lower values than the PGA players. Confirm that alpha is set at.5 and power at.. Click Calculate to get the needed size per group. The program will then calculate that the sample size needed from the 2 PGA and SEC golfers is 32 players for each group. Thus, if the differences between the SEC and PGA golfers in saving par from sand From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

traps in 29 is about as it was in 2, you will need to collect data on about 32 players from each group. It s always safe to be conservative here, as we have been, by using the larger standard deviation, and it might be wise to collect data on about 35 in each group. Chapter Relationships Among Variables First, it might be useful to have students see a graphic display of a high and low relationship between two variables. To do this in SPSS, do the following: Click on the Analyze pull-down menu. Select the Regression bar. Select the Curve Estimation bar. Enter money won as the dependent variable (y axis) and average score as the independent (x axis). Make sure Linear is checked in the model box. Click OK to get the figure. You will now see what we show in figure.. The relationship is a strong one (only the leading money winner, V.J. Singh, is very far off the regression line). Also note that the relationship is negative: More money won is associated with a lower average score. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Figure. High but negative relationship between scoring average and money earned on PGA Tour. To demonstrate a positive relationship, go through the same process, but substitute world rank and average score for the dependent and independent variables. Figure.2 shows this positive relationship. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Figure.2 Strong positive relationship between average score and world rank. Do the following to demonstrate the correlations between each pair of the variables in the data set: Click on the Analyze pull-down menu. Click on the Correlate bar. Click on the Bivariate bar. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Enter the nine continuous variables (all but Grouping by money and U.S. vs. international. ) Click on OK to get the correlation matrix. You will see output that is similar to table., where there is a relationship, r, between each pair of the variables (e.g., money won vs. average score, money won vs. driving distance). There are several that have strong relationships, some positive and some negative:.7 between money won and average score. That is, lower scoring averages are associated with more money that a player has won. This is really a positive relationship, but it looks negative because the more money, the better, while the lower the scoring average, the better..594 between driving distance and drive accuracy. This is a true negative relationship. Longer drives are associated with not hitting as accurately..52 between average score and world rank is another highly positive relationship. There are several moderate relationships:.439 between putts per round and sand saves percentage. That is, fewer putts per round are associated with a higher percentage of sand saves..373 between driving distance and money earned, reflecting that generally hitting longer drives is associated with earning more money..367 between drive accuracy and earning more money. This reflects a positive association between these two variables. Finally, there are some relationships that are low and unimportant:.3 between drive accuracy and scrambling.29 between driving distance and putts per round.33 between money won and % sand saves Table. Correlation Matrix Among all Variables (N = 3) Variable s Mon ey won World rank Avera ge score Drive distan ce Drive accura cy Greens in regulati Putts per round Scrambli ng Sand saves From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

on Money won..77* ** -.7** *.373*.367*.343 -.94.45.33 World rank..52* **.3.36*.7.323.35.47 * Average score..26.37.73.35.297.34 Drive distance. -.594** *.4.29 -.53** -.2 Drive accuracy (%)..294.466 **.3.77 Greens in regulatio n (%)..549* *.2.247 Putts per round..449*.439 * Scrambli ng..569* ** Sand saves (%). ***p <. **p <. *p <.5. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Using Regression in Prediction In chapter we discussed regression, which is the correlation between two variables and the fact that a straight line may be used to fit the data set as in figure.. The data (DV = drive distance, IV = drive accuracy) for this example are shown below in figure.3. You can observe that the relationship is negative. Longer drives are associated with lower accuracy. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Figure.3 Negative relationship between drive distance and drive accuracy. In SPSS we do the following: Click on the Analyze pull-down bar. Click on Regression. Click on Linear. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Enter driving distance as the dependent variable and driving accuracy as the independent variable. That is, we want to develop a regression (with a prediction equation) for estimating driving distance from driving accuracy. Click OK. You will first see a table similar to table.2 that shows the relationship (prediction) between the two variables. Table.2 Model Summary Model r r 2 Adjusted r 2 Standard error of the estimate.594 (a).353.33 6.37 a Predictors: (Constant), drive accuracy. Table.2 shows the calculated r =.594 and r 2 =.353; 35% of the variance in drive distance is associated with drive accuracy. Note the standard error of the estimate is 6.37. Table.3 shows the test of that correlation (i.e., is it significant?). You can see that the correlation is significant as the F (,2) = 5.29, p <.. Table.3 ANOVA(b) Model Sum of squares df Mean square R Significance Regression 624.49 624.49 5.29.(a) Residual 42.25 2 4.5 Total 766.76 29 From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

a Predictors: (Constant), drive accuracy. b Dependent variable: drive distance. Table.4 is the prediction equation: Y = 34..94X Table.4 Coefficients(a) Unstandardized coefficients Standardized coefficients Model B Standard error Beta t Significance (Constant) 34. 4.92796 23.36. Drive accuracy.94.24.594 3.9. a Dependent variable: drive distance. Or you can calculate a predicted score by taking 34. and subtracting the drive accuracy times.94. For example, Rod Pampling s drive accuracy percentage is 65.65. If we multiply 65.65.94, we get 6.7. Subtract 6.7 from 34. and the estimated distance for Rob Pampling is 27.. His real average distance is 2, so you can see our estimated score is very close to his real score. Remember, all we are doing here is estimating the straight line based on the average data for each variable for all the golfers. However, this estimate has to be considered as plus or minus the standard error of the estimate (given previously as 6.37) that we showed you how to calculate in chapter. Thus, we would expect his actual average score to fall between 293.5 and 2.7 and, in fact, it does. Using Multiple Regression Multiple regression is similar to regression except there is more than one predictor variable in the multiple regression that is calculated. In the previous section where we used regression, the predictor (or X) variable was drive accuracy and the criterion variable was drive distance (or Y). In this part we will use several predictors (X variables) money won, average score, drive distance, drive accuracy, greens hit in regulation, scrambling, and sand saves to putts per round (single criterion, or Y variable). In SPSS, do the following: Click on the Analyze pull-down menu. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Click on the Regression bar. Click on the Linear bar. Enter the dependent variable (putts per round) and the independent variables (money won, average score, drive distance, drive accuracy, greens hit in regulation, scrambling, and sand saves). Change Method (below the Independent(s) box) to Forward. Click OK. The forward method means the program selects the best single predictor first, then the second best predictor (after variance accounted for by the first predictor is removed), and so on, until there are no predictors that contribute significantly to explaining variance associated with money earned. If you return to table. and look at the correlation of each variable with putts per round, you can see that greens in regulation is the best single predictor (.549). Thus, that is the variable that will be selected first in multiple regression. The question then is what variable(s) will still significantly predict putts per round when the variance associated with the relationship between greens in regulation and putts per round has been considered? Information summarized from the SPSS output in table.5 shows that, in addition to the contribution made by greens in regulation (3.2% of the variance accounted for in putts per round), average score accounted for an additional 2.4% (which was significant), bringing the total variable accounted for by the two predictors to 5.6%, which was significant, F(2,27) = 3.55, p <.. None of the remaining predictors accounted for any significant portion of the variance after these two (greens in regulation and average score) were used in the equation. Table.5 Contribution to Putts Per Round Model r r 2 r 2 chance Step Greens in regulation.549.32.32 Step 2* Average score.72.56.24 *F(2,27) = 3.55, p <. (test of the total linear combination). If one wanted to use this weighed combination of greens in regulation plus average score to predict putts per round, the regression equations would be as follows: Y = -9.76 +.35 (greens in regulation) +.43 From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

(average score) You can have students run this example (or any other combination of predictors and criterion) by using the steps listed previously. Chapter 9 Differences Among Groups In this section we continue to use the data on professional golfers to provide SPSS examples of calculating differences among groups. Our grouping variable will be the level of performance on the PGA tour: group is top money winners, group 2 is money winners ranked 2 to 3, and group 3 is money winners ranked 4 to 5. The last column in the data set has numbers assigned matching those groupings. Independent t Test This analysis using SPSS compares the means of the top money winners (group ) with the players ranked 2 to 3 (group 2) on average birdies per round. The means and standard deviations for the two groups are as follows: Players ranked to : M = 69.53, SD =.3 Players ranked 2 to 3: M = 7.56, SD =.4 t() = 6.46, p =. To run these same data on SPSS, do the following: Click on the Analyze drop-down menu. Click on the Compare Means bar. Select Independent t test. Enter the Test Variable of average score and the Grouping Variable of grouping by money. Click on Define Groups and enter and 3 to indicate the two groups. Click OK to run the analysis. One-Way (Simple) Analysis of Variance (ANOVA) In this analysis we will compare all three groups: top players (), players ranked 2 to 3 (2), and players ranked 4 to 5 (3) on driving distance. As you can see from the following results, the outcome is quite interesting: From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Players ranked to : M = 294.5, SD = 5.2, confidence intervals = 2.4 to 3.9 Players ranked 2 to 3: M = 27.4, SD = 5., confidence intervals = 275.5 to 296. Players ranked 4 to 5: M = 29., SD =.3, confidence intervals = 277. to 39.7 Despite our intuition, the three groups are not different with an F(2, 27) = 2.3, p =.. This is confirmed by the Bonferroni tests. Interestingly, the second group has the lowest mean, but because of the relatively large standard deviations there are no differences. To run these same data on SPSS, do the following: Click on the Analyze drop-down menu. Click on the Compare Means bar. Select One-Way ANOVA. Enter the Dependent List of driving distance and the Grouping Variable of grouping by money. Click on Posthoc and check Bonferroni and then click Continue. Click OK to run the analysis. We should note that we got the descriptive statistics by using Analyze, Descriptive Statistics, and Explore combination that we reviewed in chapter 6. Chapter Nonparametric Techniques In using nonparametric procedures, SPSS easily permits data transformation to ranks and has a variety of procedures for completing nonparametric statistics. When data are not normal, these statistics would be used. For example, money won has a rather unusual distribution (see figure.). From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Figure. Histogram for money earned per golfer. For money won, skewness (.7) is above expectations for normal data (the kurtosis =.252). Based on this information, and to make the assignment more interesting, we can change the variables, money won and drive accuracy, to ranks using the SPSS by doing the following: Use the Transform pull down menu. Click the Rank Cases bar. Enter the two variables (money won and drive accuracy). From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.

Click OK to transfer the data to ranks. This procedure results in two new columns of data on the SPSS data sheet that are the two variables with data ranked. To do a nonparametric correlation in SPSS, do the following: In the Analyze pull-down menu, click on Correlate. Click on Bivariate. Enter the ranked variables: rank of money won, and rank of drive accuracy. Check the box for Spearman to do a Spearman s rank-order correlation. Click OK to run the analysis. If we return to the correlation matrix presented as table., we can see that the correlation between these two variables using the original data is.367, while the correlation using ranked data is.3. If we square each of those (.367 squared = 3.5% of variance;.3 squared = 66.% of variance), the gain in variance accounted for by changing the non-normal data to ranks was 52.6%, or nearly four times as large. The nonparametric statistic is large and the significant value is much larger, demonstrating the reasons nonparametric statistics are valuable. From J.R. Thomas, J.K. Nelson, and S.J. Silverman. 25, Research methods in physical activity instructor guide, 7th ed.