URL: <http://dx.doi.org/ / >

Similar documents
Journal of Quantitative Analysis in Sports

A Statistical Evaluation of the Decathlon Scoring Systems

DEVELOPING COMBINED EVENT ATHLETES IN THE COLLEGIATE SYSTEM. Nate Davis Assistant Coach Combined Events, PV & HJ University of Wisconsin

IAAF Team Athletics. Championships

ANB Event Specifications OUTDOOR EVENTS

The programme shall comprise the following 20 (twenty) events for Men and 20 (twenty) events for Women:

A Comparative Analysis of Motor Fitness Components among Sprinters, Throwers and Jumpers

A Machine Learning Approach to Predicting Winning Patterns in Track Cycling Omnium

Influence of the size of a nation s population on performances in athletics

Competitive Performance of Elite Olympic-Distance Triathletes: Reliability and Smallest Worthwhile Enhancement

Measuring the Gender Differences in Athletics

save percentages? (Name) (University)

2015 AAU JUNIOR OLYMPIC GAMES NORFOLK STATE UNIVERSITY, NORFOLK, VIRGINIA MULTI-EVENT/TRACK & FIELD MEET SCHEDULE

Open Research Online The Open University s repository of research publications and other research outputs

SCHEDULE OF EVENTS: MULTI EVENTS. Day One Thursday, June 21, 2018

Online Companion to Using Simulation to Help Manage the Pace of Play in Golf

EUROPEAN ATHLETICS TEAM CHAMPIONSHIPS 601. PROMOTION AND RIGHTS

FINAL SCHEDULE as of 07/07/2017

Citation for published version (APA): Canudas Romo, V. (2003). Decomposition Methods in Demography Groningen: s.n.

SAMPLE MAT Proceedings of the 10th International Conference on Stability of Ships

Available online at ScienceDirect. Procedia Engineering 112 (2015 )

Biomechanics of Parkour: The Vertical Wall-Run Technique

COMPARISON STUDY BETWEEN THE EFFICIENY OF THE START TECHNIQUES IN THE ROMANIAN COMPETITIVE SWIMMING

ATHLETICS AUSTRALIA SELECTION POLICY COMMONWEALTH GAMES GLASGOW, SCOTLAND 27 th July to 2 nd AUGUST 2014

EXPLORING MOTIVATION AND TOURIST TYPOLOGY: THE CASE OF KOREAN GOLF TOURISTS TRAVELLING IN THE ASIA PACIFIC. Jae Hak Kim

Factors that affect the motion of a vehicle along a surface

Sprint Hurdles SPRINT HURDLE RACES: A KINEMATIC ANALYSIS

ATHLETICS AUSTRALIA SELECTION POLICY OAA OCEANIA AREA CHAMPIONSHIPS TOWNSVILLE, AUSTRALIA 25 TO 28 JUNE 2019

Georgian University GEOMEDI. Abstract. In this article we perform theoretical analysis of long jumps with the purpose to find

Kinetic Energy Analysis for Soccer Players and Soccer Matches

Track and Field. 1. Age Categories: Men and Women: 55 to 59, 60 to 64, 65 to 69, 70 to 74, 75 to 79, 80 to 84, 85 to 89, 90 to 94, 95 to 99, 100+

ATHLETICS AUSTRALIA SELECTION POLICY OAA OCEANIA AREA CHAMPIONSHIPS TOWNSVILLE, AUSTRALIA 25 TO 28 JUNE 2019

Mathematical model to crouch start in athletics

On the association of inrun velocity and jumping width in ski. jumping

superior in performance in the 100 m dash. If these

TECHNICAL NOTE THROUGH KERBSIDE LANE UTILISATION AT SIGNALISED INTERSECTIONS

ITU PARATRIATHLON Fair Start System

The effect of cognitive intervention on stroke distance in age-group swimmers

2017 IAAF World Championships 4-13 August 2017 London, UK. Selection Policy published February 2017

Evaluating the Influence of R3 Treatments on Fishing License Sales in Pennsylvania

ATHLETICS AUSTRALIA SELECTION POLICY WORLD JUNIOR CHAMPIONSHIPS EUGENE, USA 22 nd to 27 th July 2014

Athletics Australia SELECTION POLICY IAAF WORLD U20 CHAMPIONSHIPS TAMPERE, FINLAND 10 TH to 15 TH July 2018

ScienceDirect. Rebounding strategies in basketball

An Exploratory Study of Psychomotor Abilities among Cricket Players of Different Level of Achievement

APPLICATION OF FILMING AND MOTION ANALYSIS IN MOVEMENT STUDIES. Xie Wei Sports Medicine and Research Center, Singapore Sports Council, Singapore

Track and Field. 1. Age Categories: Men and Women: 55 to 59, 60 to 64, 65 to 69, 70 to 74, 75 to 79, 80 to 84, 85 to 89, 90 to 94, 95 to 99, 100+

This file is part of the following reference:

2017 PAN AM U20 CHAMPIONSHIPS SELECTION CRITERIA

The effect of pack formation at the 2005 world orienteering championships

Track and Field. 55+ BC Games

Biomechanical analysis of the medalists in the 10,000 metres at the 2007 World Championships in Athletics

AUTHOR ABSTRACT. Athlete Development - Reflections on the Pathway from Potential to Performance. by Frank Dick ESSAY

ASA TEAM SELECTION CRITERIA

Spec Towns Invitational April 6-7,2018 Teams: Georgia, Ole Miss, Wisconsin, Vanderbilt, Albany State Location: Spec Towns Track 1 Selig Cir, Athens,

Packet Pick Up Friday, April 6 th from 9am-12pm at the track.

MALTESE RECORDS Regulations for Ratification General Conditions

ABSTRACT AUTHOR. Kinematic Analysis of the Women's 400m Hurdles. by Kenny Guex. he women's 400m hurdles is a relatively

Legendre et al Appendices and Supplements, p. 1

SPORTS PARTICIPATION CHANGES IN LONDON FOLLOWING LONDON 2012 WITH NATIONAL AND INTERNATIONAL COMPARISONS

ATHLETICS AUSTRALIA SELECTION POLICY 28 th FISU WORLD UNIVERSITY GAMES GWANGJU, KOREA 3 rd to 14 th JULY 2015

EAST REGIONAL TRACK AND FIELD GUIDELINES. Revised - January 2010

Using Markov Chains to Analyze a Volleyball Rally

Examples of Carter Corrected DBDB-V Applied to Acoustic Propagation Modeling

2012 AAU JUNIOR OLYMPIC GAMES TURNER STADIUM, HUMBLE HIGH SCHOOL, HUMBLE, TEXAS MULTI-EVENTS/RACEWALK

RULES OF COMPETITION - TRACK AND FIELD

A Case Study of Leadership in Women s Intercollegiate Softball. By: DIANE L. GILL and JEAN L. PERRY

2018 IAAF U20 CHAMPIONSHIPS SELECTION CRITERIA

European Journal of Physics. Related content ADDENDUM. To cite this article: J J Hernández-Gómez et al 2018 Eur. J. Phys.

2017 European Athletics Indoor Championships 3-5 March 2017 Belgrade, SRB. Selection Policy published October 2016

Trouble With The Curve: Improving MLB Pitch Classification

Insurance. The practical application. of credibility models in the rating of Health

Discriminant analysis of game-related statistics between basketball guards, forwards and centres in three professional leagues

Tennis Plots: Game, Set, and Match

Analyses of the Scoring of Writing Essays For the Pennsylvania System of Student Assessment

WHAT CAN WE LEARN FROM COMPETITION ANALYSIS AT THE 1999 PAN PACIFIC SWIMMING CHAMPIONSHIPS?

THE IMPULSE-STEP IN THE JAVELIN THROW

ATHLETICS AUSTRALIA SELECTION POLICY 2017 WORLD PARA ATHLETICS WORLD CHAMPIONSHIPS LONDON, UNITED KINGDOM JULY 2017

discus throw hammer throw high jump hurdle relay race pole vault shot put marathon javelin long jump 100 metre sprint

A COMPARISON OF SELECTED BIOMECHANICAL PARAMETERS OF FRONT ROW SPIKE BETWEEN SHORT SET AND HIGH SET BALL

A Comparative Study of Running Agility, Jumping Ability and Throwing Ability among Cricket Players

Technical Regulations 2018 Youth Olympic Games Athletics Competition Buenos Aires, ARG

Moycarkey Coolcroo A.C.

Proceedings of the Institution of Mechanical Engineers, Part P: Journal of Sports Engineering and Technology

Tutorial for the. Total Vertical Uncertainty Analysis Tool in NaviModel3

The Aging Curve(s) Jerry Meyer, Central Maryland YMCA Masters (CMYM)

Application of Bayesian Networks to Shopping Assistance

ATHLETICS AUSTRALIA SELECTION POLICY 16 th IAAF WORLD CHAMPIONSHIPS LONDON, GREAT BRITAIN 4 th to 13 th AUGUST 2017

REPORT. A comparative study of the mechanical and biomechanical behaviour of natural turf and hybrid turf for the practise of sports

The case study was drafted by Rachel Aldred on behalf of the PCT team.

The Scottish Schools' Athletic Association

Selection Criteria Changes April 14, 2016 The following was removed from section 5.2, then revised and added to section 6.1: NOTE: Failure to achieve

TECHNICAL REGULATIONS FOR BALTIC TEAM CHAMPIONSHIPS IN ATHLETICS

BEYOND SPEED IN THE DECATHLON

DP Ice Model Test of Arctic Drillship

Post impact trajectory of vehicles at rural intersections

An examination of try scoring in rugby union: a review of international rugby statistics.

USATF 2008 SUPER CLINIC CRITICAL GENERAL PERFORMANCE DESCRIPTORS FOR THE SHORT HURDLES. Ralph Mann, Ph.D.

arxiv:physics/ v1 [physics.pop-ph] 13 Feb 2001

Journal of Quantitative Analysis in Sports

Transcription:

Citation: Woolf, Anne, Ansley, Les and Bidgood, Penelope (2007) Grouping of decathlon disciplines. Journal of Quantitative Analysis in Sports, 3 (4). ISSN 1559-0410 Published by: Walter de Gruyter URL: http://dx.doi.org/10.2202/1559-0410.1057 <http://dx.doi.org/10.2202/1559-0410.1057> This version was downloaded from Northumbria Research Link: http://nrl.northumbria.ac.uk/504/ Northumbria University has developed Northumbria Research Link (NRL) to enable users to access the University s research output. Copyright and moral rights for items on NRL are retained by the individual author(s) and/or other copyright owners. Single copies of full items can be reproduced, displayed or performed, and given to third parties in any format or medium for personal research or study, educational, or not-for-profit purposes without prior permission or charge, provided the authors, title and full bibliographic details are given, as well as a hyperlink and/or URL to the original metadata page. The content must not be changed in any way. Full items must not be sold commercially in any format or medium without formal permission of the copyright holder. The full policy is available online: http://nrl.northumbria.ac.uk/policies.html This document may differ from the final, published version of the research and has been made available online in accordance with publisher policies. To read and/or cite from the published version of the research, please visit the publisher s website (a subscription may be required.)

Journal of Quantitative Analysis in Sports Volume 3, Issue 4 2007 Article 5 Grouping of Decathlon Disciplines Anne Woolf, Kingston University Les Ansley, Kingston University Penelope Bidgood, Kingston University Recommended Citation: Woolf, Anne; Ansley, Les; and Bidgood, Penelope (2007) "Grouping of Decathlon Disciplines," Journal of Quantitative Analysis in Sports: Vol. 3: Iss. 4, Article 5. DOI: 10.2202/1559-0410.1057 2007 American Statistical Association. All rights reserved.

Grouping of Decathlon Disciplines Anne Woolf, Les Ansley, and Penelope Bidgood Abstract The 10 disciplines in the decathlon can be broadly characterised as running, jumping and throwing. However, these simplistic characteristics may not represent the groupings defined by performances in the decathlon. The identification of groups may reveal a recondite advantage for athletes who excel in particular disciplines. Therefore this study used cluster analysis to determine the groupings inherent within the decathlon disciplines. The data set was derived from the top 173 decathletes between the years 1986 to 2005. Six clustering methods were applied to a Euclidean proximity matrix. The highest number of clusters common to all the methods was accepted as the solution. All six methods produced the same 3-cluster ([100m 400m 110H LJ PV HJ][SP DT JT][1500m]), 4-cluster ([100m 400m 110H LJ PV][SP DT JT][HJ][1500m]) and 5-cluster ([100m 400m 110mH LJ][SP DT JT][PV][HJ][1500m]) solutions. Stability tests confirmed the consistency of all the solutions. The 10 disciplines of the decathlon form into five groupings, which can be adequately explained from a physiological perspective. The clustering suggests that athletes who perform better at the sprint/track disciplines may obtain an advantage in the decathlon. KEYWORDS: athletics, cluster analysis, personal best, classical scaling

Woolf et al.: Grouping of Decathlon Disciplines Introduction The decathlon takes place over two days and consists of a combination of ten track and field disciplines the order of which is invariant: day 1: 100 m race (100m), long jump (LJ), shot put (SP), high jump (HJ) and 400 m race (400m); day 2: 110 m hurdles (110mH), discus (DT), pole vault (PV), javelin (JT) and 1500 m race (1500m). Actual performance results in terms of time in the track disciplines and distance or height in the field disciplines are converted into point scores using scoring equations and tables formulated by the International Amateur Athletics Federation (IAAF). These equations and tables are derived from ranking lists of the best performances of both decathletes and specialised athletes in the ten disciplines that comprise the decathlon. The principle of the scoring system is an attempt to reward performances of equal merit across the different disciplines with equal points. Simplistically the disciplines that make up the decathlon could be considered as providing three broad categories for grouping the disciplines: running (100m, 110mH, 400m and 1500m); jumping (LJ, HJ, PV); and throwing (SP, JT, DT). However, these categories are unlikely to represent the groupings based upon participants performance in the disciplines; and indeed anecdotally decathletes are frequently classified by speed and skill levels as either a sprinter/jumper or a thrower/pole vaulter (Kenny et al, 2005). From the analysis of five sets of World Championship decathlon data (1991 to 1999) Cox and Dunn (2002) commented that the discus throw and the shot-put throw were always grouped together while the running events (with the exception of the 1500 m) were often grouped together. They identified a 3-cluster grouping of the decathlon disciplines. Overall, these groupings were not defined by the three characteristics of running, jumping and throwing although no explanation was proffered to rationalise the groupings and the composition of the clusters were not consistent between sample sub-populations. In a subsequent analysis of decathlon data from the 1999, 2001 and 2003 World Championships and the 2000 and 2004 Olympic Games we found that better consistency of grouping and cluster content was obtained with a 5-cluster solution rather than a 3-cluster solution (unpublished data). The clustering or grouping of the disciplines within the decathlon may have practical implications in terms of the advantages gained by athletes who are relatively better in the disciplines of one cluster than those disciplines contained within another cluster. Since decathlon data can be modelled as multivariate data, cluster analysis can be used to identify and analyse the groupings (or clusters) that may exist among the disciplines. In the analyses by both Cox and Dunn (2001) and ourselves the sample sizes were defined by the number of athletes finishing the decathlon in any year and 1

Journal of Quantitative Analysis in Sports, Vol. 3 [2007], Iss. 4, Art. 5 varied between 11 and 27. These sample sizes are small when used in the context of cluster analysis and major championships do not often produce personal best performances. We therefore reasoned that an improved solution might be obtained by using a larger sample based on personal best performances of elite athletes. Accordingly, we hypothesise that the cluster analysis of the top personal best decathlon performances will provide a basis for the examination of groupings among the ten decathlon disciplines. Methods Data set All personal best decathlon results that achieved over 8,000 points in international competition from 1966 to 2005 were included in the original sample set (Matthews, 2005 and Decathlon2000) (permission to use the data was obtained). However, the technical specification of the men s javelin was changed in January 1986 but this change is not reflected in the IAAF scoring tables, which were last updated in 1985. As a consequence points are harder to achieve in the javelin since 1986. For this reason only the personal best results achieved from January 1986 onwards were selected for subsequent analyses. This produced a subset of 173 personal best performances and included the results from 131 meetings. Only seven of these personal best performances were achieved in Olympic competitions and only 14 achieved in World Championship competitions. Cluster analysis Cluster analysis is an exploratory technique that is completely numerical using only the geometric properties of the data. The variables within the cluster analysis were defined by the ten disciplines of the decathlon event. The use of point scores rather than the actual distances, heights or times provided the necessary standardisation of the data in this analysis. Proximity measure In order to carry out a cluster analysis the similarity, or the distance, between every pair of variables must be defined and measured and this information stored in a proximity matrix for the sample. Weighting a variable in cluster analysis would give it greater or lesser importance than other variables when determining discipline proximity. Since the underlying premise of the IAAF scoring system is DOI: 10.2202/1559-0410.1057 2

Woolf et al.: Grouping of Decathlon Disciplines that all disciplines carry equal weight, and the decathletes train to the scoring tables, we have assumed equal weighting for all disciplines in the decathlon. Cox and Dunn (2002) used the Pearson correlation as the basis of the measure of similarity between events and defined their proximity matrix in terms of the correlation coefficients. However, there is an inherent danger in using correlation to establish general behaviour since correlation does not denote causation (Kenny et al. 2005). Also, we have previously used both the Pearson correlation proximity matrix and the squared Euclidean proximity matrix and found that the solutions from the clustering methods based on the squared Euclidean distance measure were more consistent than those based on the Pearson correlation similarity measure (unpublished data). Therefore this study has used only the squared Euclidean measure to define the distance between the variables. Clustering method Everitt et al. (2001, p177) state that it is generally impossible a priori to anticipate what clustering methods are likely to lead to interesting and informative classifications and moreover different clustering methods do not always produce the same results on the same data. Therefore a solution was sought based on a consensus from six of the most widely used agglomerative clustering methods as applied to the Euclidian proximity matrix. The six methods were single linkage; complete linkage; average linkage; centroid linkage; median linkage and Ward s method. The primary difference between the clustering methods is the approach they use to measure the inter-group distance between clusters of variables or between one variable and a cluster of variables. A graphical depiction of the clustering process is illustrated by the example dendrogram in Figure 1. With each of the variables initially represented as a separate cluster on the x-axis, the dendrogram in Figure 1 depicts how the individual variables are joined to form clusters or moved into an existing cluster at the appropriate similarity level, until all the events are amalgamated into a single cluster. At level 1, A and B are joined into a cluster resulting in 5 clusters. At level 2, C is assigned to a second cluster with A and B resulting in 4 clusters. E and F are clustered together at level 3 and D is assigned to the cluster containing A, B and C in level 4, leaving only two clusters. At level 5, the final step, the two remaining clusters are amalgamated into a single cluster. The order of the variables along the x-axis is determined by the clustering process. 3

Journal of Quantitative Analysis in Sports, Vol. 3 [2007], Iss. 4, Art. 5 Figure 1 Depiction of the process to produce a 1-cluster dendrogram solution. The cluster analysis presents a hierarchal clustering and it remains for the investigator to decide on the similarity level at which the number of clusters best represent the data. This decision is purely subjective since there is no methodology that can definitively determine the correct number of clusters although an examination of the classical scaling plot may help. Classical scaling is an algebraic reconstruction method for finding a configuration of points from the proximity matrix which is particularly appropriate when the distances are Euclidean. The generation of the squared Euclidean proximity matrix and the subsequent cluster analysis using each of the six methods was carried out using Minitab version 14. Trials runs were made using each of the six methods to produce a single-cluster solution from the data. The single-cluster solutions were examined to determine the consistency among the six clustering methods. Solutions were selected from the levels at which cluster content was common to the majority of the methods. Everitt et al. (2001, p179) caution that where several different levels are feasible, the highest number of clusters should be selected. The cluster content of the selected solutions was examined in the physiological context of decathlon disciplines. DOI: 10.2202/1559-0410.1057 4

Woolf et al.: Grouping of Decathlon Disciplines Cluster analysis was carried out on the decathlon data with the disciplines defining the variables and the point scores as the observations on those variables; the input data was the 173x10 Euclidean proximity matrix of discipline point scores. A classical scaling technique was applied to the data in the Euclidean proximity matrix. Results Consistency Identical single-cluster solutions were produced from average linkage, centroid linkage, median linkage and Ward s method (Figure 2a); the solutions from the complete linkage (Figure 2b) and single linkage (Figure 2c) methods differed slightly. The complete linkage solution differed from the other five in assigning the LJ and the 110mH to a cluster of their own before merging that cluster with the cluster containing the 100m and 400m disciplines. All the clustering methods except the single linkage assigned the ten decathlon events between two main groups. The single linkage method retained the 1500m as a single unassigned event throughout. All six solutions assigned the first three track running disciplines to a different group from the three throwing disciplines and assigned the long jump to the same subgroup as the three running disciplines (Figures 2a, b & c). Figure 2a Representative dendrogram of Average Linkage, Centroid Linkage, Median Linkage and Ward s Method displaying the 3-, 4- and 5-cluster solutions of the decathlon disciplines for the top 173 personal best decathlon scores. 5

Journal of Quantitative Analysis in Sports, Vol. 3 [2007], Iss. 4, Art. 5 Figure 2b Dendrogram of Complete Linkage displaying the 3-, 4- and 5-cluster solutions of the decathlon disciplines for the top 173 personal best decathlon scores. Figure 2c Dendrogram of Single Linkage displaying the 3-, 4- and 5-cluster solutions of the decathlon disciplines for the top 173 personal best decathlon scores. DOI: 10.2202/1559-0410.1057 6

Woolf et al.: Grouping of Decathlon Disciplines By seeking a consensus among the six methods and bisecting the dendrograms at these levels it was observed that all six methods produced the same 3-cluster and 4-cluster and 5-cluster solutions: [100m 400m 110mH LJ][SP DT JT][PV][HJ][1500m] 5-cluster solution [100m 400m 110H LJ PV][SP DT JT][HJ][1500m] 4-cluster solution [100m 400m 110H LJ PV HJ][SP DT JT][1500m] 3-cluster solution Stability Stability tests were carried out on all three solutions; it might have been possible to eliminate one or two of the three possible solutions if they had been found to be less stable than the others. The stability of the 3-, 4- and 5-cluster solutions was investigated with a series of stability tests carried out using the average linkage method and Ward s method. The all-time best sample was split into two subsets comprising the first 100 entries and the remaining 65 entries, respectively. A cluster solution was obtained from each subset. Both subsets reproduced the original solution with both methods. Further stability tests were carried out by removing the point score data appropriate to the events in any of one, two or three of the clusters at random from the input data and generating a 4, 3 or 2-cluster solution from the reduced proximity matrix. All these tests produced reduced solutions consistent with the original. In a final test of stability the Euclidean proximity matrix was regenerated with the point score data rearranged as if the PV, JT and 1500m were the first three events of the decathlon. The solutions from the altered data were consistent with the original solutions indicating that the 3-, 4- and 5-cluster solutions were all stable. The solution for the average method from this test is shown in Figure 3. It was observed that the structure of the dendrogram, unlike the content of the clusters, is dependent on the sequence in which the data is stored in the Euclidean proximity matrix (Figure 2a vs. Figure 3). 7

Journal of Quantitative Analysis in Sports, Vol. 3 [2007], Iss. 4, Art. 5 Figure 3 Representative dendrogram of Average Linkage and Ward s Method stability tests of the decathlon disciplines for the top 173 personal best decathlon scores where the data was restructured to present the PV, JT and 1500m as the first three disciplines. Classical scaling The classical scaling plot for the decathlon data sample depicts the grouping of the decathlon disciplines into two main clusters on either side of the plot and confirms the possible option of 3, 4 or 5 clusters (Figure 4). DOI: 10.2202/1559-0410.1057 8

Woolf et al.: Grouping of Decathlon Disciplines Figure 4 Classical scaling plot of the decathlon disciplines for the top 173 personal best decathlon scores. Discussion The cluster analysis of the all time best decathlon results presented three solutions that were all equally stable. The fundamental difference between the three solutions was whether the pole vault and the high jump were clustered with other disciplines or left in clusters of their own - as represented by the 5-cluster solution. The 5-cluster solution was chosen as the most plausible based on the principle that since it presents the highest number of clusters common to all the methods it is the solution that is least likely to force disciplines into contrived clusters (Everitt, 2001). The 5-cluster solution derived from the large data set in this study has the same cluster content as a 5-cluster solution displayed in the dendrogram presented by Cox and Dunn (2002) where the combined data from five World Championship competitions was used. Cox and Dunn (2002) used single linkage, complete linkage and Ward s cluster methods in their analysis. Based on the smaller samples from the five individual competitions they found that complete linkage and Ward s method 9

Journal of Quantitative Analysis in Sports, Vol. 3 [2007], Iss. 4, Art. 5 gave similar, consistent results. Cox and Dunn (2002) also concluded that [t]he high jump, pole vault, javelin and 1500 metre race were not so consistent with cluster membership (p.181). Based on our previous work with smaller samples we would mostly agree with their conclusion; however, it is notable that the larger sample size from the all time best scores has produced a consistent 5-cluster solution in which the javelin is always assigned to the same cluster as the discus and the shot put. Assuming a case can be made for the existence of groupings among the decathlon disciplines does the 5-cluster solution make physiological sense for the decathlon? Closer examination of the clustering of the disciplines provides some interesting insights. Analysis of the disciplines grouped in the first cluster i.e. [100m, 400m, 110mH, LJ] reveals that success in all four disciplines is dependent on fast running speeds on the flat (Hay 1994). Furthermore, the cross-over between success in sprints and long jump is further borne out in the athletic area where no less than three sprinters have won Olympic medals in both the 100 m and the long jump, namely Marion Jones, Carl Lewis and Jesse Owen. The separation the 1500m from the other three running disciplines in the first cluster supports anecdotal evidence that the 1500m is different in the decathlon. There are a number of possible explanations for this observation. The 1500 m race is the only middle distance event in the decathlon event; all the other events are sprint and power events. Therefore the type of athlete that competes in the decathlon is probably not suited to the rigors of the 1500 m race. Furthermore, although the 1500m was initially included in the decathlon as a test of stamina, it is the final event on the second day by which time the athletes will be tired and the overall positions have usually been decided. Therefore in most instances, the athletes have little motivation to do more than just maintain their position. The three disciplines grouped in the second cluster are the three throwing disciplines (SP, DT and JT). Although the actions of the SP, DT and JT can be described as a thrust, whip and pull, respectively (Cromwell, 1949 p.301), the principles that govern the throwing techniques are the same i.e. speed, angle of release and (in the case of the DT and JT) aerodynamics (Dyson, 1971 p.184). Furthermore the proximal-to-distal sequential order of the kinetic chain is common to the SP, DT and JT (Hay, 1994). That mastery of the fundamentals in one throwing discipline can overlap into mastery of other throwing disciplines is evident in the individual competitive arena where athletes have achieved success in two or more of the throwing events e.g. Hannes Hoply (SP and DT), Ralph Rose (SP, DT and JT), John Godina (SP and DT), Uwe Hohn (DT and JT) and Scott Russell (DT and JT). Notably, the two events whose success is dependent on defying gravity are grouped in separate clusters. The extremely technical nature of the pole vault (Hubbard, 1980) and high jump (Dapena and Chung, 1988) possibly result in the DOI: 10.2202/1559-0410.1057 10

Woolf et al.: Grouping of Decathlon Disciplines scenario where the investment in the extra time in mastering the technique at the expense of time spent on other disciplines does not provide adequate returns in performance (points) improvements. Van Damme et al (2002) described this trade-off in terms of the evolutionary principle of allocation where excellence in one task can only be attained at the expense of average performance in all other tasks (p.755). Therefore the athletes would rather spend their time concentrating on improving performances in their preferred disciplines. Cox and Dunn (2002) posited that the decathlon favours those athletes who do well in the field events. However, the composition of the clusters suggests that athletes who perform better in the sprint/track disciplines may be afforded an advantage as there are four disciplines in which they are likely to do well; whereas only three of the field disciplines are contained in the next largest cluster. This assumption is further strengthened by the observation that the average ranking of the top ten decathletes in the disciplines that comprise the first cluster [100m 400m 110mH LJ] is 11 th, while their average ranking in the disciplines that comprise the second cluster [SP DT JT] is 27 th. This trend continues for the top twenty positions where the average ranking for the [100m 400m 110mH LJ] cluster and [SP DT JT] cluster disciplines is 18 th and 44 th, respectively. This indicates that the athletes who perform better in the disciplines that comprise the first cluster are overrepresented in the top twenty personal best performances compared with those athletes who excel at the disciplines contained in the second cluster (Figure 5). Average ranking for disciplines within clusters 100 75 50 25 0 [100m 400m 110mH LJ] [SP DT JT] 1-10 11-20 21-30 31-40 41-50 51-60 61-70 71-80 81-90 Overall personal best ranking Figure 5 Comparison of the average ranking of disciplines contained within two clusters [100m 400m 110mH LJ] and [SP DT JT] in relation to the overall ranking of decathletes in the top 90. 11

Journal of Quantitative Analysis in Sports, Vol. 3 [2007], Iss. 4, Art. 5 Conclusion Although defining the number of clusters remains subjective the results from this study support a 5-cluster solution. The clusters are not entirely defined by the three overt characteristics of the disciplines, but can be adequately explained from a teleological perspective. Furthermore, it seems that sprint track athletes may have an advantage in the decathlon event since they are more likely to do well in a greater number of events than field athletes. References Cox, T.F. & Dunn, R.T. (2002). An analysis of decathlon data. Journal of the Royal Statistical Society Series D, 51, 179-187. Cromwell, D.B. (1949). Champion Technique in Track and Field (2 nd ed). New York, Whittlesey House. Dapena, J. & Chung, C.S. (1988). Vertical and radial motions of the body during the take-off phase of high jumping. Medicine and Science in Sports and Exercise, 20, 290-302. Decathlon 2000. http// www.decathlon2000.ee/eng/statistics.php?art=834 (Last accessed 24/10/2006) Dyson, G. (1971). The Mechanics of Athletics (5 th ed). London, University of London Press. Everitt, B., Landau, S. & Leese, M. (2001). Cluster Analysis (4 th ed). London, Arnold. Hay, J.G. (1994). The Biomechanics of Sports Techniques (4 th ed). New Jersey, Prentice-Hall. Hubbard, M. (1980). Dynamics of the pole vault. Journal of Biomechanics, 13, 965-976. Kenny, I.C., Sprevak, D., Sharp, C. & Boreham, C. (2005) Journal of Quantitative Analysis in Sports, 1, Article 5. DOI: 10.2202/1559-0410.1057 12

Woolf et al.: Grouping of Decathlon Disciplines Matthews, P. (2005). International Athletics Annual 1995. Cheltenham: SportsBooks. Van Damme, R., Wilson, R. S., Vanhooydonck B. & Aerts, P. (2002). Evolutionary biology: Performance constraints in decathletes. Nature, 415, 755. 13