One-way ANOVA: round, narrow, wide

5/4/2009 9:19:18 AM Retrieving project from file: 'C:\DOCUMENTS AND SETTINGS\BOB S\DESKTOP\RJS\COURSES\MTAB\FIRSTBASE.MPJ' ========================================================================== This is an analysis of data on running times by three methods of rounding first base in baseball. See Hollander and Wolfe, 1973 (or later edition). The methods are round out, narrow angle, and wide angle. Each of 22 different runners performed roundings of first base using each of the three methods, and the running times were recorded. Here is the data as entered into Minitab. The 2 nd column gives the times, with the 1 st column giving the runner for that time and the 3 rd column giving which method was used for that time. Columns 4, 5, and 6 give this same information in unstacked form for each of the 22 runners. runner times subs round narrow wide 1 5.40 round 5.40 5.50 5.55 2 5.85 round 5.85 5.70 5.75 3 5.20 round 5.20 5.60 5.50 4 5.55 round 5.55 5.50 5.40 5 5.90 round 5.90 5.85 5.70 6 5.45 round 5.45 5.55 5.60 7 5.40 round 5.40 5.40 5.35 8 5.45 round 5.45 5.50 5.35 9 5.25 round 5.25 5.15 5.00 10 5.85 round 5.85 5.80 5.70 11 5.25 round 5.25 5.20 5.10 12 5.65 round 5.65 5.55 5.45 13 5.60 round 5.60 5.35 5.45 14 5.05 round 5.05 5.00 4.95 15 5.50 round 5.50 5.50 5.40 16 5.45 round 5.45 5.55 5.50 17 5.55 round 5.55 5.55 5.35 18 5.45 round 5.45 5.50 5.55 19 5.50 round 5.50 5.45 5.25 20 5.65 round 5.65 5.60 5.40 21 5.70 round 5.70 5.65 5.55 22 6.30 round 6.30 6.30 6.25 1 5.50 narrow 2 5.70 narrow 3 5.60 narrow 4 5.50 narrow 5 5.85 narrow 6 5.55 narrow 7 5.40 narrow 8 5.50 narrow 9 5.15 narrow 10 5.80 narrow 11 5.20 narrow 12 5.55 narrow 13 5.35 narrow 14 5.00 narrow 15 5.50 narrow 16 5.55 narrow 17 5.55 narrow 18 5.50 narrow 19 5.45 narrow 20 5.60 narrow 21 5.65 narrow 22 6.30 narrow 1 5.55 wide 2 5.75 wide

3 5.50 wide 4 5.40 wide 5 5.70 wide 6 5.60 wide 7 5.35 wide 8 5.35 wide 9 5.00 wide 10 5.70 wide 11 5.10 wide 12 5.45 wide 13 5.45 wide 14 4.95 wide 15 5.40 wide 16 5.50 wide 17 5.35 wide 18 5.55 wide 19 5.25 wide 20 5.40 wide 21 5.55 wide 22 6.25 wide The null hypothesis is that the average running times for each method are the same. As we discussed in class, this design calls for a two-way ANOVA, thus eliminating the effect of the factor runner. However, for purposes of later comparison, let us see what one-way ANOVA gives when we treat this data as three independent samples of size 22 corresponding to the three methods. That is, ignoring the information that each runner involves has run each of the three methods. MTB > AOVOneway 'round' 'narrow' 'wide'. One-way ANOVA: round, narrow, wide Source DF SS MS F P Factor 2 0.0937 0.0469 0.65 0.525 Error 63 4.5316 0.0719 Total 65 4.6253 S = 0.2682 R-Sq = 2.03% R-Sq(adj) = 0.00% Level N Mean StDev --+---------+---------+---------+------- round 22 5.5432 0.2718 (-------------*-------------) narrow 22 5.5341 0.2598 (--------------*-------------) wide 22 5.4591 0.2728 (-------------*--------------) --+---------+---------+---------+------- 5.360 5.440 5.520 5.600 = 0.2682 We see that the null hypothesis is accepted with p-value of 0.525. The three associated confidence intervals for the true means heavily overlap.

Next we carry out a nonparametric rank-based version of one-way ANOVA. MTB > Kruskal-Wallis 'times' 'subs'. Kruskal-Wallis Test: times versus subs Kruskal-Wallis Test on times subs N Median Ave Rank Z narrow 22 5.525 36.2 0.80 round 22 5.500 35.3 0.54 wide 22 5.450 29.0-1.35 Overall 66 33.5 H = 1.84 DF = 2 P = 0.399 H = 1.85 DF = 2 P = 0.396 (adjusted for ties) Again, the null hypothesis is accepted. Now let us carry out a two-way ANOVA. MTB > Twoway 'times' 'runner' 'subs'; SUBC> Means 'runner' 'subs'. Two-way ANOVA: times versus runner, subs Source DF SS MS F P runner 21 4.21864 0.200887 26.96 0.000 subs 2 0.09371 0.046856 6.29 0.004 Error 42 0.31295 0.007451 Total 65 4.62530 S = 0.08632 R-Sq = 93.23% R-Sq(adj) = 89.53% There are now two null hypotheses: (1) that runners all have the same average running time, and (2) that the three methods all have the same running time. The above analysis shows that both hypotheses are rejected with very low p- values.

Below we see the individual CIs for the average running times for the 22 runners. Note that some pairs of intervals are nonoverlapping. This supports the conclusion that the average running times are not the same for all the runners. Runner 14 is fastest on average, runner 22 is slowest. runner Mean --------+---------+---------+---------+- 1 5.48333 (-*--) 2 5.76667 (-*--) 3 5.43333 (--*-) 4 5.48333 (-*--) 5 5.81667 (-*--) 6 5.53333 (-*--) 7 5.38333 (--*-) 8 5.43333 (--*-) 9 5.13333 (-*--) 10 5.78333 (--*-) 11 5.18333 (--*-) 12 5.55000 (--*-) 13 5.46667 (--*-) 14 5.00000 (--*--) 15 5.46667 (--*-) 16 5.50000 (-*--) 17 5.48333 (-*--) 18 5.50000 (-*--) 19 5.40000 (--*--) 20 5.55000 (--*-) 21 5.63333 (--*-) 22 6.28333 (-*--) --------+---------+---------+---------+- 5.20 5.60 6.00 6.40 Next we look at the individual CIs for the average running times for the three methods. We see that some pairs of intervals are nonoverlapping, indicating that the mean running times are not the same for the three methods. Based on means, the wide angle method outperforms the other two methods, which are very close to each other, and this finding has statistical significance, although it is based on a normality assumption. subs Mean -----+---------+---------+---------+---- narrow 5.53409 (---------*--------) round 5.54318 (--------*--------) wide 5.45909 (---------*--------) -----+---------+---------+---------+---- 5.440 5.480 5.520 5.560

Finally, we perform a nonparametric rank-based version of two-way ANOVA. MTB > Friedman 'times' 'subs' 'runner'. Friedman Test: times versus subs blocked by runner S = 10.64 DF = 2 P = 0.005 S = 11.14 DF = 2 P = 0.004 (adjusted for ties) Sum of subs N Est Median Ranks narrow 22 5.5250 47.0 round 22 5.5667 53.0 wide 22 5.4333 32.0 Grand median = 5.5083 Again, both null hypotheses are rejected. We see again that the wide angle method outperforms the other two methods, this time based on medians, and that this finding has statistical significance, this time without requiring a normality assumption. HAPPY RUNNING!