PREDICTION VERSUS REALITY: THE USE OF MATHEMATICAL MODELS TO PREDICT ELITE PERFORMANCE IN SWIMMING AND ATHLETICS AT THE OLYMPIC GAMES

Similar documents
MATHEMATICAL MODELS TO PREDICT ELITE PERFORMANCE IN SWIMMING AND ATHLETICS AT THE

WHAT CAN WE LEARN FROM COMPETITION ANALYSIS AT THE 1999 PAN PACIFIC SWIMMING CHAMPIONSHIPS?

The Aging Curve(s) Jerry Meyer, Central Maryland YMCA Masters (CMYM)

Competitive Performance of Elite Olympic-Distance Triathletes: Reliability and Smallest Worthwhile Enhancement

Beyond the High-Tech Suits: Predicting 2012 Olympic Swim Performances

Generating Power in the Pool: An Analysis of Strength Conditioning and its Effect on Athlete Performance

Lesson 14: Modeling Relationships with a Line

Bioequivalence: Saving money with generic drugs

APPROACH RUN VELOCITIES OF FEMALE POLE VAULTERS

Atmospheric Rossby Waves in Fall 2011: Analysis of Zonal Wind Speed and 500hPa Heights in the Northern and Southern Hemispheres

A Hare-Lynx Simulation Model

A REVIEW OF AGE ADJUSTMENT FOR MASTERS SWIMMERS

Shedding Light on Motion Episode 4: Graphing Motion

A Nomogram Of Performances In Endurance Running Based On Logarithmic Model Of Péronnet-Thibault

Chapter 12 Practice Test

Modelling World Record Data for the Men s 100-Metre Dash. Can the Data collected Predict the Ultimate in Human Achievement for this Event?

Citation for published version (APA): Canudas Romo, V. (2003). Decomposition Methods in Demography Groningen: s.n.

USING A CALCULATOR TO INVESTIGATE WHETHER A LINEAR, QUADRATIC OR EXPONENTIAL FUNCTION BEST FITS A SET OF BIVARIATE NUMERICAL DATA

OXYGEN POWER By Jack Daniels, Jimmy Gilbert, 1979

A Statistical Evaluation of the Decathlon Scoring Systems

GENETICS OF RACING PERFORMANCE IN THE AMERICAN QUARTER HORSE: II. ADJUSTMENT FACTORS AND CONTEMPORARY GROUPS 1'2

The Development and Prediction of Athletic Performance in Freestyle Swimming

Traffic safety developments in Poland

COMPARISON STUDY BETWEEN THE EFFICIENY OF THE START TECHNIQUES IN THE ROMANIAN COMPETITIVE SWIMMING

save percentages? (Name) (University)

An Update on the Age of National-level American Swimmers

Legendre et al Appendices and Supplements, p. 1

Applying Hooke s Law to Multiple Bungee Cords. Introduction

Calculation of Trail Usage from Counter Data

BODY FORM INFLUENCES ON THE DRAG EXPERIENCED BY JUNIOR SWIMMERS. Australia, Perth, Australia

Journal of Chemical and Pharmaceutical Research, 2014, 6(3): Research Article

Ozobot Bit Classroom Application: Boyle s Law Simulation

100-Meter Dash Olympic Winning Times: Will Women Be As Fast As Men?

Teaching Notes. Contextualised task 35 The 100 Metre Race

Is lung capacity affected by smoking, sport, height or gender. Table of contents

Team /08/2016. Report What is the fastest humanly possible time for the Olympic 100m freestyle event? Summary

Running head: DATA ANALYSIS AND INTERPRETATION 1

Section I: Multiple Choice Select the best answer for each problem.

Changes in speed and efficiency in the front crawl swimming technique at 100m track

Jodi M. Cossor and Bruce R. Mason Australian lnstitute of Sport Biomechanics Department, Canberra, Australia

STANDARD SCORES AND THE NORMAL DISTRIBUTION

Review of A Detailed Investigation of Crash Risk Reduction Resulting from Red Light Cameras in Small Urban Areas by M. Burkey and K.

DEVELOPMENT OF A SET OF TRIP GENERATION MODELS FOR TRAVEL DEMAND ESTIMATION IN THE COLOMBO METROPOLITAN REGION

Equation 1: F spring = kx. Where F is the force of the spring, k is the spring constant and x is the displacement of the spring. Equation 2: F = mg

The MACC Handicap System

SPRINTING CHARACTERISTICS OF WOMEN S 100 METER FINALS AT THE IAAF WORLD CHAMPIONSHOPS DAEGU 2011

Sprint Hurdles SPRINT HURDLE RACES: A KINEMATIC ANALYSIS

9A Gas volume and pressure are indirectly proportional.

Lab Report Outline the Bones of the Story

Minimal influence of wind and tidal height on underwater noise in Haro Strait

Journal of Quantitative Analysis in Sports

LOW PRESSURE EFFUSION OF GASES adapted by Luke Hanley and Mike Trenary

Athletics at the Beijing Olympics: how much faster can anyone run?

Georgian University GEOMEDI. Abstract. In this article we perform theoretical analysis of long jumps with the purpose to find

SCIENTIFIC COMMITTEE SEVENTH REGULAR SESSION August 2011 Pohnpei, Federated States of Micronesia

100-Meter Dash Olympic Winning Times: Will Women Be As Fast As Men?

27Quantify Predictability U10L9. April 13, 2015

Analysis of recent swim performances at the 2013 FINA World Championship: Counsilman Center, Dept. Kinesiology, Indiana University

LOW PRESSURE EFFUSION OF GASES revised by Igor Bolotin 03/05/12

REAL LIFE GRAPHS M.K. HOME TUITION. Mathematics Revision Guides Level: GCSE Higher Tier

AN ASSESSMENT OF NEW JERSEY RESIDENT HUNTER OPINION ON CROSSBOW USE

Science in Sport. 106 How fast is the wind speed? Read. Count/Tachometer Any EASYSENSE. Sensors: Loggers: Logging time: EasyLog

Averages. October 19, Discussion item: When we talk about an average, what exactly do we mean? When are they useful?

Gas volume and pressure are indirectly proportional.

Grade: 8. Author(s): Hope Phillips

Resistance to Fatigue and Success in Competitive Swimming. Steven C. Myers, Los Alamos High School Jake A. Bailey, University of Northern Colorado

DOI /HORIZONS.B P23 UDC : (497.11) PEDESTRIAN CROSSING BEHAVIOUR AT UNSIGNALIZED CROSSINGS 1

March Madness Basketball Tournament

Kinematics, Impulse, and Human Running

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together

March Madness Basketball Tournament

The Incremental Evolution of Gaits for Hexapod Robots

On the association of inrun velocity and jumping width in ski. jumping

Pressure of the atmosphere varies with elevation and weather conditions. Barometer- device used to measure atmospheric pressure.

Journal of Human Sport and Exercise E-ISSN: Universidad de Alicante España

Influence of the size of a nation s population on performances in athletics

ABSTRACT AUTHOR. Kinematic Analysis of the Women's 400m Hurdles. by Kenny Guex. he women's 400m hurdles is a relatively

Lecture 22: Multiple Regression (Ordinary Least Squares -- OLS)

Surf Clams: Latitude & Growth

2008 Study Update, Part 3 By Dr. Ed Ashby

Stats 2002: Probabilities for Wins and Losses of Online Gambling

The Physical and Physiological Characteristics of 3x3. Results of Medical Study & Scientific Test

Novel empirical correlations for estimation of bubble point pressure, saturated viscosity and gas solubility of crude oils

FIGURE 1: Lag in Time from Point of Selection Decisions in Nucleus Herds to When Market Pigs are Sold in Commercial Herds

A Machine Learning Approach to Predicting Winning Patterns in Track Cycling Omnium

Best Practices in Mathematics Education STATISTICS MODULES

The Reliability of Intrinsic Batted Ball Statistics Appendix

13.1!"#$#%"&'%()$*+%,+-.$+/*$#

SIMULTANEOUS RECORDINGS OF VELOCITY AND VIDEO DURING SWIMMING

Lesson 16: More on Modeling Relationships with a Line

TRIP GENERATION RATES FOR SOUTH AFRICAN GOLF CLUBS AND ESTATES

Boyle s Law: Pressure-Volume Relationship in Gases. PRELAB QUESTIONS (Answer on your own notebook paper)

USING A PROBABILITY APPROACH TO RANK IN-LINE INSPECTION ANOMALIES FOR EXCAVATION AND FOR SETTING REINSPECTION INTERVALS

2008 Excellence in Mathematics Contest Team Project B. School Name: Group Members:

Algebra I: A Fresh Approach. By Christy Walters

NEURAL AND SENSORY APPLICATIONS OF INTERVENTION IN IMPROVING TECHNIQUE AT HIGH SPEED AND RACE PACE ASCTA CONFERENCE - BROADBEACH MAY 2014

INFLUENCE OF ENVIRONMENTAL PARAMETERS ON FISHERY

Zooplankton Migration Patterns at Scotton Landing: Behavioral Adaptations written by Lauren Zodl, University of Delaware

Lane Bias in Elite-level Swimming Competition. Indiana University 1025 E 7 th St. USA

A Study of Olympic Winning Times

Transcription:

PREDICTION VERSUS REALITY: THE USE OF MATHEMATICAL MODELS TO PREDICT ELITE PERFORMANCE IN SWIMMING AND ATHLETICS AT THE OLYMPIC GAMES Dr Tim Heazlewood ACU National

Dedicated to Gavin Lackey Gavin is doing the hard metres at present.

http://www.olympic.org/uk/games/past/index_uk.asp?o LGT=1&OLGY=2000

http://www.olympic.org/uk/games/past/index_uk.asp?o LGT=1&OLGY=2004

Beijing 2008

Sports Science Predictions The prediction of future athletic performance by humans is a recurring theme during the Olympiad year, as well as forming the basis for some stimulating crystal ball gazing in some of the learned sports science journals and in the mass media.

Sports Science Predictions Mathematics and science are based on the principles of description and more importantly prediction. The ability to make substantive and accurate predictions of future elite level sports performance indicates that such approaches reflect good science.

Sports Science Predictions Often these predictions are purely speculative and are not based upon any substantial evidence, rather they are based on the belief that records are made to be broken and that performances must continue to improve over time.

Sports Science Predictions The accessibility of data in the form of results from Olympic Games, world records and world best performances in a specific year allows the analysis of performances in any number of events. From these analyses, changes in performance over time can be observed and predictions of future performance can be made utilising the process of mathematical extrapolation.

Review of Literature A number of researchers have attempted to predict future performances by deriving and applying a number of mathematical statistical models based on past performances in athletics. Prendergast (1990) applied the average speeds of world record times to determine a mathematical model for world records.

Review of Literature The records or data used in the analysis spanned a 10 year period. Following his analysis, Prendergast (1990) raised the question of whether any further improvements can be expected or if the limits of human performance have been reached.

Review of Literature The sports of athletics (Heazlewood and Lackey, 1996) and swimming (Lackey and Heazlewood, 1998) have been addressed in this manner. The knowledge of future levels of sporting performance has been identified by Banister and Calvert (1980) as beneficial in the areas of talent identification, both long and short term goal setting, and training program development.

Review of Literature In addition, expected levels of future performance are often used in the selection of national representative teams where performance criteria are explicitly stated in terms of times and distances (Athletics Australia, 2004).

Review of Literature Some researchers such as Péronnet and Thibault (1989) postulate that some performances such as human male 100m sprinting is limited to the low 9 seconds. Whereas Seiler (referred to by Hopkins, 2000) envisages no limits on improvements based on data reflecting progression of records over the last 50 years.

Review of Literature According to Seiler improvements per decade have been approximately 1% for sprinting, 1.5% for distance running, 2-3% for jumping, 5% for pole vault, 5% for swimming and 10% for skiing for male athletes, whereas female sprint times may have already peaked. The differences for males and females it is thought to reflect the impact of successful drugs in sport testing on females.

Review of Literature A similar crossover effect, where predicted female performances would exceed male performances, was noted for the 400m and high jump. The crossover effect was based on trends in athletic performances obtained prior to 1996; where in some events female improvements were more rapid than males.

Review of Literature In the sport of swimming (Lackey & Heazlewood, 1998), a similar crossover effect was observed for the 50m freestyle where predicted zero time was the year 2994 for men and 2700 for women. The concept that athletes will complete 100m sprints on land and 50m sprints in water in zero seconds appears unrealistic, however mathematical model based on actual data do derive these interesting predictions.

In swimming the men s 50m freestyle was inverse, 100m freestyle compound, 200m sigmoidal, and the 400m and 1500m freestyle cubic. For the women s freestyle events the 50m was inverse, 100m cubic, 200m sigmoidal, 400m cubic and 800m sigmoidal. Review of Literature The curves that fit the data have also displayed interesting findings as no one curve fits all the data sets. Different events displayed different curves or mathematical functions (Lackey & Heazlewood, 1998) of best fit.

Review of Literature In athletics for the men s events the mathematical functions (Heazlewood and Lackey, 1996) were 100m inverse, 400m sigmoidal, long jump cubic and the high jump displayed four functions (compound, logistic, exponential and growth). In the women s events the mathematical functions were 100m cubic, 400m sigmoidal, long jump inverse and high jump displayed four functions (compound, logistic, exponential and growth).

In their model (Heazlewood and Lackey, 1996), the women s times would be faster than men by 2060 where it was predicted the finalist at the Olympic Games would average 9.58s for men and 9.57 for women respectively. Review of Literature The predictions of Heazlewood and Lackey (1996) paradoxically predicted the men s 100m to improve to zero by year 5038 and the women s 100m to reach zero by year 2429, which indicates a more rapid improvement over time for women sprinters.

Review of Literature A similar crossover effect, where predicted female performances would exceed male performances, was noted for the 400m and high jump. The crossover effect was based on trends in athletic performances obtained prior to 1996; where in some events female improvements were more rapid than males.

Review of Literature In the sport of swimming (Lackey and Heazlewood, 1998), a similar crossover effect was observed for the 50m freestyle where predicted zero time was the year 2994 for men and 2700 for women. The concept that athletes will complete 100m sprints on land and 50m sprints in water in zero seconds appears unrealistic, however mathematical model based on actual data do derive these interesting predictions.

In swimming the men s 50m freestyle was inverse, 100m freestyle compound, 200m sigmoidal, and the 400m and 1500m freestyle cubic. For the women s freestyle events the 50m was inverse, 100m cubic, 200m sigmoidal, 400m cubic and 800m sigmoidal. Review of Literature The curves that fit the data have also displayed interesting findings as no one curve fits all the data sets. Different events displayed different curves or mathematical functions (Lackey and Heazlewood, 1998) of best fit.

Review of Literature In athletics for the men s events the mathematical functions (Heazlewood & Lackey, 1996) were 100m inverse, 400m sigmoidal, long jump cubic and the high jump displayed four functions (compound, logistic, exponential and growth). In the women s events the mathematical functions were 100m cubic, 400m sigmoidal, long jump inverse and high jump displayed four functions (compound, logistic, exponential and growth).

Review of Literature This may indicate that different events are dependent upon different factors that are being trained differently or factors underpinning performance evolving in slightly different ways. This has resulted in different curves or mathematical functions that reflect these improvements in training or phylogenetic changes over time.

Review of Literature However, at some point in time how accurately the predictive models reflect reality can be assessed. Since the models of Heazlewood and Lackey (1996) for athletics and Lackey and Heazlewood (1998) for swimming were derived, the 2000, 2004 and 2008 Olympic Games have occurred.

Review of Literature Hindsight or real data can now enable the assessment of these models over a short timeframe, that is, 10-12 years. Assessing the accuracy of the models predicting performances hundreds or thousands of years into the future will be based on the research interests of future mathematicians, sports scientist and computer scientists.

Research Problem The current research problem is how well the actual times and distances achieved by athletes at the 2000, 2004 and 2008 Olympic Games fit the predicted model for athletics and swimming based on the Heazlewood and Lackey (1996) and Lackey and Heazlewood (1998) prediction equations?

Methods The previous models were based on following model fit criteria used by Heazlewood and Lackey (1996) for athletics and Lackey and Heazlewood (1998) for swimming. The average time and distances for the finalist in each event were utilised to generate the data for the statistical analysis for curve estimation. Potentially both linear and non-linear functions can be derived.

Methods The mean score of the actual performances from the 2000, 2004 and 2008 Olympic Games finalists were compared with the predicted values in the athletic events selected in this study (Wikipedia, 2006 & Beijing 2008 website). Times for the 100m and 400m were in seconds and distances for the long and high jump were in metres.

Methods The results for the finalists in current Olympic freestyle swimming events (50m, 100m, 200m, 400m, 800m women and 1500m men) were collected from internet based results (Wikipedia, 2006 & Beijing 2008 website). Times were recorded to one hundredth of a second which is the recording method used by Federation Internationale de Natation Amateur or FINA (1997).

Methods These times were then converted from a minutes and seconds format to a seconds only format to facilitate calculations when applying the regression methods. The mean of the finalists in each event for each year in the study was then calculated. The mean was used as it is a measure that is representative of all scores in each group (Rothstein, 1986).

Methods A world record holder s performance may be far in advance of that of any other competitor and not be representative of overall performance in an event. For example, the women s 400m freestyle world record as set by Tracey Wickham in 1978 was not bettered until 1988 at the Seoul Olympic Games (Wallechinsky, 1996).

Methods In the swimming pool the factor of wind resistance is not considered significant and as such wind readings are not required for swim records. In athletics, assistive and resistive winds are thought to influence performance in events such as the 100m and long jump and the wind variable can be corrected to assess performance in still air conditions.

Methods The wind correction calculations are not presented in this paper just the times and distances reported for the athletic events, however correcting for the influence of wind may result in slightly different values for the original data.

Statistical Analyses The means were then included as a data set for each event for each Olympic Games for analysis using the Statistical Package for the Social Sciences (SPSS) program version 6.1 (Norušis, 1993 & 1994) to derive a number of possible regression equations. A number of criteria were used to evaluate the goodness of fit of each derived function for each individual event.

General Method of Determining the Appropriate Regression Models Start with an initial estimated value for each variable in the equation. Generate the curve defined by the initial values. Calculate the sum-of-squares (the sum of the squares of the vertical distances of the points from the curve). Adjust the variables to make the curve come closer to the data points. There are several algorithms for adjusting the variables. The most commonly used method was derived by Levenberg and Marquardt (often called simply the Marquardt method).

General Method of Determining the Appropriate Regression Models Adjust the variables again so that the curve comes even closer to the points. Keep adjusting the variables until the adjustments make virtually no difference in the sum-of-squares. Report the best-fit results. The precise values you obtain will depend in part on the initial values chosen in step 1 and the stopping criteria of step 5. This means that repeat analyses of the same data will not always give exactly the same results.

General Method of Determining the Appropriate Regression Models To investigate the hypotheses of model fit and prediction, the eleven regression models were individually applied to each of the athletic and swimming events. The regression equation that produced the best fit for each event, that is, produced the highest coefficient of determination (abbreviated as R 2 ), was then determined from these eleven equations. The specific criteria to select the regression equation of best were the magnitude of R 2, the significance of the analysis of variance alpha or p-value and the residuals.

The Coefficient of Determination The coefficient of determination (R 2 ) is a measure of accuracy of the model used. A coefficient of determination of 1.00 indicates a perfectly fitting model where the predicted values match the actual values for each independent variable (Norušis, 1993 & 1994). Where more than one model was able to be selected due to an equal R 2, the simplest model was used under the principle of parsimony, that is, the avoidance of waste and following the simplest explanatory model.

The Coefficient of Determination Do not over interpret a high R 2. It does not mean that you have chosen the equation that best describes the data. It also does not mean that the fit is unique - other values of the variables may generate a curve that fits just as well.

Residuals The residuals are the difference between the actual value and the predicted value for each case, using the regression equation (Norušis, 1993 & 1994) and the smaller the residual, the better the fit of the model. For each model the residuals were generated by the SPSS program. A large number of positive residuals indicate that the prediction is an over estimation (faster than the actual performance) and a large number of negative residuals indicates an underestimation (slower time than the actual performance).

Level of Significance The level of significance, or p value, is a representation of the relationship between the model and the data. The smaller the p-value, the higher the level of significance and the greater the relationship where a small p-value indicates a small possibility that the closeness of the predicted values to the actual values due to chance is small.

Logical Acceptance Based on Extrapolations The ability of the model to generate extrapolations that appear to be reasonable when compared to previous means was also taken into consideration. When a model generated extrapolations that appear to be inconsistent with the actual results this model was discarded and the model with the next highest coefficient of determination was selected.

Applying the Model of Best Fit After selection of the model to be used, according to the criteria previously stated, the equation of best fit was determined by applying the derived constants and coefficients to the generic formula for that model. Using this equation, a prediction of the mean result for the event at each Olympiad was calculated. At this stage, graphs representing the means of past and future performances for each event in each Olympiad were also generated in addition to predicted means using the appropriate regression equation.

Final Predictions for the Years 2000 and 2004 To predict the level of performance in the year 2000, 2004 and 2008 the data set that provided the greatest accuracy was chosen and the data from 1996 re-included in the data set, where appropriate. A series of regressions were made using the best fitting model and data set for each event. Using the constants and coefficients generated by regression models the future predictions were then calculated.

Final Predictions for the Years 2000 and 2004 It is important to note that in some events the average for a complete field of competitors was not always possible due to disqualifications or injury. In the case of injury, a competitor did not finish the event. This situation only occurred in a few events. At this point in time no attempt was made to reevaluate the 1996 and 1998 models based on inclusion of 2000, 2004 and 2008 data.

Results The data for the predicted values and the actual values are provided in the table for each event. Table 1 indicates the events, mathematical functions, equations and R 2 values derived from the Heazlewood and Lackey (1996) for athletic events of the men s and women s 100m, 400m, long jump and high jump.

Results The trends in the mathematical functions indicate the men s and women s 400m and the high jump show identical trends in changes in performance over time, where the 400m was sigmoidal and the high jump was compound, logistic, exponential and growth.

Results In the majority of events the explained variance or R 2 values were statistically very significant (p<0.01). It is interesting to note the R 2 values were highest for the men s and women s high jump (0.94) and lowest for the men s 100m (0.66) and men s long jump (0.78).

Table 1. Athletic events, type of mathematical function, equations and R 2 values. Event Regression Type Equation * R 2** Men s 100m Inverse Y = b 0 + (b 1 /t) 0.659 Women s 100m Cubic Y = b 0 + b 3 t 3 0.902 Men s 400m S Y = e (bo + b1/t) 0.907 Women s 400m S Y = e (bo + b1/t) 0.843 Men s Long Jump Cubic Y = b 0 + b 3 t 3 0.777 Women s Long Jump Inverse Y = b 0 + (b 1 /t) 0.894 Men s High Jump Compound Logistic Exponential Growth Y = b 0 (b 1 ) t Y = b 0 e b1t Y = e bob1t 0.944 0.944 0.944 0.944 ** All R 2 values in table 1 significant at p<0.05. * Where b 0 = a constant b 1, b 3 = regression coefficients t = year y = mean result for each event Women s High Jump Compound Logistic Exponential Growth Y = b 0 (b 1 ) t Y = b 0 e b1t Y = e bob1t 0.947 0.947 0.947 0.947

Results Table 2 indicates the predicted performances and the actual performances achieved from the 2000, 2004 and 2008 Olympics Games. It can be observed that both 100m times for men exceeded the prediction, whereas the female 100m times were well below the predicted values. For the 400m, long jump and high jump both male and female athletes were below the predicted time and distances.

Results However, the men s actual 400m times and long jump distances did show improvements from 2000 to 2004. In the women s 400m there was a performance decline and in the women s and men s high jump performances remained relatively static from 2000, 2004 and 2008.

Table 2. The predicted and actual performances for men s and women s 100m, 400m, long jump and high jump for 2000, 2004 and 2008 Olympics. Year Predict 100m men 100m women 400m men 400m women Long J men Long J women High J men High J women 2000 10.07s 10.84s 43.84s 48.29s 8.36m 7.27m 2.42m 2.05m 2004 10.03s 10.76s 43.63s 47.78s 8.43m 7.39m 2.45m 2.09m 2008 10.00s 10.68s 43.41s 47.28s 8.49m 7.50m 2.48m 2.12m Year Actual 100m 100m 400ms 400m Long j Long J High J High J 2000 10.05s 11.15s 44.92s 49.92s 8.26m 6.80m 2.32m 1.98m 2004 9.93s 11.04s 44.67s 50.00s 8.33m 6.92m 2.31m 1.98m 2008 9.92s 11.05s 44.87s 50.16s 8.17m 6.81m 2.31m 2.00m

Year Predict - Actual Performance Difference and % Error Across Events 100m men 2000.02s/.019% Trend Actual higher 100m women -.031s/ 2.85% Actual lower 400m men -1.08s/ 2.46% Actual lower 400m women -1.63s/ 3.37% Actual lower Long J men 0.1m/ 1.19% Actual lower Long J women 0.47m/ 6.40% Actual lower High J men 0.1m/ 4.13% Actual lower High J women 0.07m/ 3.41% Actual lower 2004 0.1s/ 0.99% -0.28s/ 2.6% -1.04s/ 2.38% -2.22s/ 4.64% 0.1m/ 1.18% 0.47m/ 6.35% 0.14m/ 5.71% 0.11m/ 5.26% Trend Actual higher Actual lower Actual lower Actual lower Actual lower Actual lower Actual lower Actual lower 2008 0.08s/ 0.8% -0.37s/ 3.46% -1.46s/ 3.36% -2.88s/ 6.09% 0.32m/ 3.77% 0.69m/ 9.20% 0.17m/ 6.85% 0.12m/ 5.66% Trend Actual higher Actual lower Actual lower Actual lower Actual lower Actual lower Actual lower Actual lower

Trends in Percentage Error Athletics 2000-2008 2008 Year Men Women 2000 1.95 4.01 2004 2.56 4.71 2008 3.69 6.10

Results It is interesting to note in athletics, as the years progress the percentage error is increasing. However, the trend is more pronounced with the female athletes.

Figure 1. Men 100m(s) inverse curve fit including 2000 and 2004 data. Where R2 =.991, p<.001.

Figure 2. Women 100m(s) cubic curve fit including 2000 and 2004 data. Where R2 =.899,. p<.001.

Results The mathematical models derived for swimming from pre-1998 data (Lackey and Heazlewood, 1998) for the men s and women s 50m, 100m, 200m, 400m, women s 800m and men s 1500m freestyle events are displayed in table 3. The functions for the 50m (inverse), 200m (sigmoidal) and 400m (cubic) are the same for both men and women, however the functions for the 100m (men compound and women cubic) and the longer distances displayed their own specific function (800m women sigmoidal and men 1500m cubic).

Results The non-significance for the men s and women s 50m freestyle equations and R 2 values is a result of the small degrees of freedom when calculating the level of significance, due to the 50m freestyle only being included in the Olympic swimming program from 1988.

Table 3. Men s and women s freestyle swimming events, type of mathematical function and predictive equations. Gender Event Type Equation R 2 Men s 50m Inverse Y= 44.77+134064.19/year 0.543 ns 100m Compound Y= 42747.22 X.99 year 0.972 sig 200m Sigmoidal Y= e -1.33+11971.98/year 0.862 sig 400m Cubic Y= 1257.26-8.79.year +6.53 x 10-7 x year 3 0.977 sig 1500m Cubic Y=40457.13 27.31.year + 1.88 x 10-6 x year 3 0.963 sig Women s 50m Inverse Y=-71.68 + 193578/year 0.732 ns 100m Cubic Y=7417.11 5.48.year + 4.5 x 10-7 x year 3 0.959 sig 200m Sigmoidal Y= e -1.26 + 8451.02/year 0.812 sig 400m Cubic Y= 1565 1.68 x 10-7 x year 3 0.976 sig 800m Sigmoidal Y= e -1.26 + 14938.8/year 0.692 sig

Results The comparison of the predicted times with the actual times for each event and displayed in table 4, indicates congruence for the men s 50m and women s 50m and 100m. In all other events for both men and women the predicted times are faster than actual times, indicating rate of progress in these events appears to have slowed down based on data up to 1996. This indicates the prediction equations over estimated the rates of improvement.

Table 4. Freestyle actual 1996 means, predicted and actual means for 2000, 2004 and 2008 Olympics. All times in seconds. Event Freestyle Actual 1996 (S) Predicted/ Actual 2000 (S) Predicted/ Actual 2004 (S) Predicted/ Actual 2008 (S) Men s 50m 22.47 22.26/22.19 22.13/22.11 22.00/21.57 100m 49.31 48.17/48.95 47.53/48.80 46.88/47.77 200m 108.40 105.57/107.43 104.32/106.47 103.08/108.81 400m 230.57 219.03/226.21 215.29/225.92 211.68/223.72 1500m 910..29 854.72/901.67 835.76/898.05 817.16/888.61 Women s 50m 25.37 25.11/25.01 24.91/24.96 24.72/24.36 100m 55.37 54.81/54.76 54.53/54.58 54.33/53.80 200m 119..95 117.90/118.89 116.91/118.69 115.93/116.35 400m 250.00 238.98/249.46 233.91/248.05 228.96/244.89 800m 514.89 496.38/507.82 489.03/509.06 481.83/504.42

2000 Ave % Across Events Men=2.49%, Women=1.60% Event Freestyle Difference % Difference Trend Men s 50m 0.07s 0.31 Actual higher 100m -0.78s 1.62 Actual lower 200m -1.86s 1.76 Actual lower 400m -7.18s 3.28 Actual lower 1500m -46.95s 5.49 Actual lower Women s 50m 0.1s 0.39 Actual higher 100m 0.05s 0.09 Actual higher 200m -0.99s 0.84 Actual lower 400m -10.48s 4.38 Actual lower

2004 Ave % Across Events Men=3.44%, Women=2.39% Event Freestyle Difference % Difference Trend Men s 50m 0.02s 0.09 Actual higher 100m -1.27s 2.67 Actual lower 200m -2.15s 2.06 Actual lower 400m -10.63s 4.94 Actual lower 1500m -62.27s 7.45 Actual lower Women s 50m -0.05s 0.20 Actual lower 100m -0.05s 0.09 Actual lower 200m -1.78s 1.52 Actual lower 400m -14.14s 6.04 Actual lower 800m -20.03s 4.09 Actual lower

2008 Ave % Across Events Men=4.76%, Women=2.89% Event Freestyle Difference % Difference Trend Men s 50m 0.43s 1.95 Actual higher 100m -0.89s 1.89 Actual lower 200m -5.73s 5.56 Actual lower 400m -12.04s 5.68 Actual lower 1500m -71.45s 8.74 Actual lower Women s 50m 0.36s 1.46 Actual higher 100m 0.53s 0.97 Actual higher 200m -0.42s 0.36 Actual lower 400m -15.93s 6.96 Actual lower 800m -22.59s 4.69 Actual lower

Results The predictions were closer for the shorter swimming events where men s 50m and women s 50m and 100m, where actual times are almost identical to predicted times. In both men and women, as the swim distances increased, the accuracy of the predictive model decreased, where predicted times were 4.5-7% faster than actual times achieved. For example, the predicted men s 1500m of 489.03s for 2004 was 7% faster than the actual time of 509.06s.

Trends in Percentage Error Swimming 2000-2008 2008 Year Men Women 2000 2.49 1.60 2004 3.44 2.39 2008 4.76 2.89

Results It is interesting to note that as the years progress the average percentage error is increasing. However, the reverse occurred where the greater percentage error was associated with the male swimmers. Somewhat a reverse of the trend observed with male and female athletes.

Discussion The results indicate the ability to predict performances into the near future based on past performances is possible, however both sets of results derived from athletics and swimming indicate that in many events the improvements in performances were overestimated.

Discussion Although improvement in the majority of athletic and swimming events studied did occur in absolute times or distances, the rate of improvement was slower than predicted by the numerous equations generated.

Discussion The reality is, in some events performances have been static such as men s and women s high jump and women s 200m freestyle or actually declined such as women s 400m and women s 800m freestyle. The two events where actual performances exceeded, very slightly the predicted performances, were the men s and women s 100m and men s 50m freestyle.

Discussion Recent emphasis on successful drug testing may have impacted more on women athletes than men, where the slowing down, and in some cases the declines in actual performances were noted for women. Performances are expected to improve over a period of time due to a number of interacting sports scientific, ontogenetic (lifespan) and pharmacological factors, such as:

Discussion The use of more efficient running, jumping and swimming techniques, a biomechanical construct. Improved training programs, which are exercise physiological and functional anatomical construct. Enlarged population of athletes due to increased participation by more nations from which high performance athletes and swimmers are drawn. This will result in an increased sample from the human gene pool, a genetic construct.

Discussion Improved talent identification programs designed and implemented by national sporting organisations and sports institutes that will select and develop tomorrow s high performance athletes. Changes in human physiology, such as the recent ontogenetic trends of increasing height and weight in Australia. The use of performance enhancing drugs legal or illegal, especially androgenic-anabolic steroids and human growth hormone, which have a masculinising effect on women or the use of neutraceuticals (functional foods).

Discussion However, the exact mathematical trends these improvements and interactions take can be plotted mathematically to reveal future trends across athletic and swimming events. In effect, the actual performance is a summary of the complex interactions of all these factors. New research paradigms to decompose the performance factors into their relative contributions to each event as well as how these are changing mathematically over time will challenge future statisticians and mathematicians.

Discussion In some cases the event can be predicted with a high degree of accuracy, such as the sprint swimming events for both men and women, whereas in other events such as 400m women and 800m freestyle are not predicted well as the events are currently displaying performance declines which were not identified by the mathematical models.

Discussion It must be emphasised that all the models across all events indicated consistent improvements over time.

Discussion The primary purposes behind this type of predictive research are that we might understand the realistic limits to human improvement in many sports. To set new and realistic goals that athletes will have to achieve to make representative teams and Olympic finals. To provide a more coherent understanding of what performances of the past suggest about performances of the future.

Discussion To understand if different events represent changes which reflect developments of human biomechanical, exercise physiological, motor learning and sport psychological functions as expressed in sport. As a intellectual exercise to understand more completely the complex trends that underpin human evolution and training adaptation that are expressed in the sports arena.

Conclusion As a heuristic exercise the derivation of mathematical-statistical models that predict changes in human sporting performance both in the near and distant future occupies definitely the minds of mathematicians and statisticians and if we get it right we will have a crystal ball into the future of sport. The problem is it just takes time to find out how good we are at solving such problems.

Appendices and Figures

Correcting for a poorly fitting model utilising nonlinear regression Potential Problem The equation simply does not describe the data The initial values are too far from their correct values The range of X values is too narrow to define the curve completely You have not collected enough data in a critical range of X values Your data are very scattered and do not really define a curve The equation includes more than one component, but your data don't follow a multi-component model Your numbers are too large Your numbers are too large Solution Try a different equation Enter different initial values. If you are using a user-defined equation, check the rules for initial values If possible, collect more data. Otherwise, hold one of the variables to a constant value Collect more data in the important regions Try to collect less scattered data. If you are combining several experiments, normalize the data for each experiment to an internal control Use a simpler equation If your Y values are huge, change the units. If your Y values are tiny, change the units.

11.0 10.8 Time in Seconds 10.6 10.4 10.2 Observed 10.0 1920 1930 1940 1950 1960 1970 1980 1990 2000 Inverse Olympic Year Figure 1. Graph of the means of the Men s s 100m with line of best fit.

12.4 12.2 12.0 Time in Seconds 11.8 11.6 11.4 11.2 11.0 Observed 10.8 1920 1930 1940 1950 1960 1970 1980 1990 2000 Cubic Olympic Year Figure 2. Graph of the means of the Women s s 100m with line of best fit.

49 48 Time in Seconds 47 46 45 Observed 44 1920 1930 1940 1950 1960 1970 1980 1990 2000 S Olympic Year Figure 3. Graph of the means of the Men s s 400m with line of best fit.

60 58 Time in Seconds 56 54 52 50 Observed 48 1920 1930 1940 1950 1960 1970 1980 1990 2000 S Olympic Year Figure 4. Graph of the means of the Women s s 400m with line of best fit.

8.4 8.2 Length in Metres 8.0 7.8 7.6 7.4 7.2 Observed 7.0 1920 1930 1940 1950 1960 1970 1980 1990 2000 Cubic Olympic year Figure 5. Graph of the means of the Men s s Long Jump with line of best fit.

7.5 7.0 Length in Metres 6.5 6.0 5.5 Observed 5.0 1920 1930 1940 1950 1960 1970 1980 1990 2000 Inverse Olympic Year Figure 6. Graph of the means of the Women s s Long Jump with line of best fit.

2.4 2.3 Height in Metres 2.2 2.1 2.0 1.9 Observed Compound Grow th Exponential 1.8 1920 1930 1940 1950 1960 1970 1980 1990 2000 Logistic Olympic Year Figure 7. Graph of the means of the Men s s High Jump with line of best fit.

2.0 1.9 Height in Metres 1.8 1.7 1.6 1.5 Observed Compound Grow th Exponential 1.4 1920 1930 1940 1950 1960 1970 1980 1990 2000 Logistic Olympic Year Figure 8. Graph of the means of the Women s s High Jump with line of best fit.

26 25 24 Time in Seconds 23 22 1880 1900 1920 1940 1960 1980 2000 2020 Observed Inverse Year Figure 9. Men s s 50m Freestyle. R2 = 0.543, p=.473, using all data.

31 30 29 28 Time in Seconds 27 26 25 1880 1900 1920 1940 1960 1980 2000 2020 Observed Inverse Year Figure 10. Women s s 50m Freestyle. R2 = 0.732, p=.347, using all data.

70 60 Time in Seconds 50 40 1880 1900 1920 1940 1960 1980 2000 2020 Observed Compound Year Figure 11. Men s s 100m Freestyle. R2 =0.972, p<.001, using the hand-timing data set.

100 90 80 Time in Seconds 70 60 50 1880 1900 1920 1940 1960 1980 2000 2020 Observed Cubic Year Figure 12. Women s s 100m Freestyle. R2 = 0.959, p<.001, using the hand-timing data set.

150 140 130 Time in Seconds 120 110 100 1880 1900 1920 1940 1960 1980 2000 2020 Observed S Year Figure 13. Men s s 200m Freestyle. R2 = 0.862, p<.001, using all data.

150 140 130 Time in Seconds 120 110 1880 1900 1920 1940 1960 1980 2000 2020 Observed S Year Figure 14. Women s s 200m Freestyle. R2 = 0.812, p<.037, using boycott data set.

380 360 340 320 300 280 Time in Seconds 260 240 220 200 1880 1900 1920 1940 1960 1980 2000 2020 Observed Cubic Year Figure 15. Men s s 400m Freestyle. R2 = 0.977, p<.001, using all data.

500 400 Time in Seconds 300 200 1880 1900 1920 1940 1960 1980 2000 2020 Observed Cubic Year Figure 16. Women s s 400m Freestyle. R2 = 0.968, p<.001, using the steroid free data set.

1500 1400 1300 1200 Time in Seconds 1100 1000 900 800 1880 1900 1920 1940 1960 1980 2000 2020 Observed Cubic Year Figure 17. Men s s 1500m Freestyle. R2 = 0.963, p<.001, using all data.

800 700 600 Time in Seconds 500 400 1880 1900 1920 1940 1960 1980 2000 2020 Observed S Year Figure 18. Women s s 800m Freestyle. R2 = 0.692, p=.04, using the boycott data set.

Figure 19. Men 100m inverse curve fit including 2000 and 2004 data. Where R2 =.991, p<.001.

Figure 20. Women 100m cubic curve fit including 2000 and 2004 data. Where R2 =.899,. p<.001.