Reflecting Against Perception: Data Analysis of IPL Batsman

Similar documents
ANALYSIS AND COMPARATIVE STUDY OF 10 YEARS CRICKET SPORTS DATA OF INDIAN PREMIERE LEAGUE (IPL) USING R PROGRAMMING

Honest Mirror: Quantitative Assessment of Player Performances in an ODI Cricket Match

A new Machine Learning based Deep Performance Index for Ranking IPL T20 Cricketers

Cricket Team Selection and Analysis by Using DEA Algorithm in Python

Data Analytics based Deep Mayo Predictor for IPL-9

How Effective is Change of Pace Bowling in Cricket?

ANALYSIS OF PICTORIAL COVERAGE OF FEMALE ATHLETE IN PRINT MEDIA AS SEX OBJECT

Preview. icc world twenty20 SHOWTIME

Bernoulli Runs Using Book Cricket to Evaluate Cricketers

SOURAV GANGULY. Published A different kettle of fish but India can do well in SA

Application of Queuing Theory to ICC One Day Internationals

arxiv: v1 [stat.ap] 15 Jul 2011

A Fair Target Score Calculation Method for Reduced-Over One day and T20 International Cricket Matches

Ranking Cricket Teams through Runs and Wickets

Sharpe Ratio & Information Ratio By Prof. Simply Simple

Impact of Power Play Overs on the Outcome of Twenty20 Cricket Match

Twitter Analysis of IPL cricket match using GICA method

Dynamic Winner Prediction in Twenty20 Cricket: Based on Relative Team Strengths

Sachin Tendulkar Profile

Longitudinal Linear Mixed Effect Model: An Application in Analyzing Age Effects in Twenty20 Cricket

The selection of a cricket team cannot be fair unless the best available performance

The Indian Premier League: What are the factors that determine player value?

S ANAND CHIEF DATA SCIENTIST

How To Win The Ashes A Statistician s Guide

The DLS methodology: answers to frequently asked questions (updated 29 September 2018)

A Statistical Model for Ideal Team Selection for A National Cricket Squad

Analysis of performance at the 2007 Cricket World Cup

AHP-Neural Network Based Player Price Estimation in IPL

International Journal of Research in Science and Technology Volume 1, Issue 1: October - December, 2014

Tactics for Twenty20 Cricket

Eduheal Foundation 1

Finding your feet: modelling the batting abilities of cricketers using Gaussian processes

Nadiya Afzal 1, Mohd. Sadim 2

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 03, 2016 ISSN (online):

Effect of Depth of Periphery Beams on Behavior of Grid Beams on Grid Floor

Stats 2002: Probabilities for Wins and Losses of Online Gambling

Tournament Statistics

A COMPARATIVE STUDY OF MENTAL TOUGHNESS AND WILL TO WIN AMONG BATSMEN AND BOWLERS IN CRICKET

DOWNLOAD OR READ : WORLD CRICKET RECORDS 2013 CHRIS HAWKES PDF EBOOK EPUB MOBI

Effects of directionality on wind load and response predictions

Modelling the distribution of first innings runs in T20 Cricket

The relationship between payroll and performance disparity in major league baseball: an alternative measure. Abstract

CRICKET ONTOLOGY. Project Description :- Instructor :- Prof: Navjyothi Singh

Predicting Outcome of Indian Premier League (IPL) Matches Using Machine Learning

Winning Ways. Sport Stats Plus Winning Ways - Page 1 of 8

EFFECT OF SELECTED EXERCISES ON EXPLOSIVE STRENGTH, SPEED, ENDURANCE AND AGILITY OF MEDIUM FAST BOWLERS IN CRICKET

History Of Cricket Players An Article

A study on the effect of limb length and arm strength on the ball release velocity in cricket

Football Rules How To Play World Cup 2011 Games Cricket

Ongoing Match Prediction in T20 International

DATA MINING ON CRICKET DATA SET FOR PREDICTING THE RESULTS. Sushant Murdeshwar

Using Spatio-Temporal Data To Create A Shot Probability Model

Influence of rounding corners on unsteady flow and heat transfer around a square cylinder

Safety Behavior for Cycling : Application Theory of Planned Behavior

History Of Cricket Players An Article

The Cricket on the Hearth Wikipedia The Cricket on the Hearth A Fairy Tale of Home is a novella by Charles Dickens, published by Bradbury and Evans,

A study on Nh-11 to find out the causes and High accident Zone between Jaipur to Sikar

Running head: DATA ANALYSIS AND INTERPRETATION 1

Journal of Emerging Trends in Computing and Information Sciences

Legendre et al Appendices and Supplements, p. 1

An Exploratory Study of Psychomotor Abilities among Cricket Players of Different Level of Achievement

Swinburne Research Bank

Tournament Report. presents. Eventus Corporate Services India Pvt. Ltd. 51/15, Old Rajinder Nagar, New Delhi Business built on Passion!

PGA Tour Scores as a Gaussian Random Variable

An Investigation of Synergy between Batsmen in Opening Partnerships

What the ICC Cricket World Cup Tells Marketers About the Internet Today in Asia and the West

D.O.I: Assistant Prof., University of Guilan, Rasht, Iran

!!!!!!!!!!!!! One Score All or All Score One? Points Scored Inequality and NBA Team Performance Patryk Perkowski Fall 2013 Professor Ryan Edwards!

1. Answer this student s question: Is a random sample of 5% of the students at my school large enough, or should I use 10%?

Some Issues in the Calculation of Batting Averages: Ranking (and Re-Ranking) the Top 50 Batsmen in Test Cricket,

International Journal of Technical Research and Applications e-issn: , Volume 4, Issue 3 (May-June, 2016), PP.

Analysis of Curling Team Strategy and Tactics using Curling Informatics

Optimal Lineups in Twenty20 Cricket

Novel empirical correlations for estimation of bubble point pressure, saturated viscosity and gas solubility of crude oils

save percentages? (Name) (University)

Which On-Base Percentage Shows. the Highest True Ability of a. Baseball Player?

No ball ready reckoner Umpire who may call Non-striker s end Square leg Front foot infringement Yes No Back foot infringement Yes No More than

SIMULATION OF THE FLOW FIELD CHARACTERISTICS OF TRANSIENT FLOW

IBPS PO PDF. by: 25 important questions of Puzzles (Part-I) (Logical Reasoning) Contact no. :

Cricketer Kevin Pietersen s popularity rises as interest in England cricket team drops

From the President s Desk 3. Report of the Secretary 4. In Remembrance 6. World Champions - ICC Under-19 Cricket World Cup 9. Farewell...

RELIABILITY ASSESSMENT, STATIC AND DYNAMIC RESPONSE OF TRANSMISSION LINE TOWER: A COMPARATIVE STUDY

. Feb 8, When Sakshi Singh Rawat, an intern at Taj Bengal, was introduced to. Mahendra Singh Dhoni speaks to teammate Sourav Ganguly during.

Additional On-base Worth 3x Additional Slugging?

PEDESTRIAN CRASH MODEL FOR VEHICLE SPEED CALCULATION AT ROAD ACCIDENT

WINNING AND SCORE PREDICTOR (WASP) TOOL

Measuring Customer Discrimination: Evidence from the Professional Cricket League in India

Opleiding Informatica

BABE: THE SULTAN OF PITCHING STATS? by. August 2010 MIDDLEBURY COLLEGE ECONOMICS DISCUSSION PAPER NO

PRESSURE DISTRIBUTION OF SMALL WIND TURBINE BLADE WITH WINGLETS ON ROTATING CONDITION USING WIND TUNNEL

Predicting Players Performance in the Game of Cricket Using Machine Learning. Niravkumar Pandey

Notational Analysis - Performance Indicators. Mike Hughes, Centre for Performance Analysis, University of Wales Institute Cardiff.

The validity of a rigid body model of a cricket ball-bat impact

Analysis of Gait Characteristics Changes in Normal Walking and Fast Walking Of the Elderly People

COVERAGE LIST 2018 PRE-MATCH BETTING MARKETS LIVE ODDS MARKETS. Premium Cricket. Coverage and Markets

Building an NFL performance metric

Has the NFL s Rooney Rule Efforts Leveled the Field for African American Head Coach Candidates?

CS 7641 A (Machine Learning) Sethuraman K, Parameswaran Raman, Vijay Ramakrishnan

EXPERIMENTAL ANALYSIS OF FIN TUBE AND BARE TUBE TYPE TUBE-IN-TUBE HEAT EXCHANGER

DGT Cricket Club Presents

Transcription:

International Journal of Engineering Science Invention ISSN (Online): 2319 6734, ISSN (Print): 2319 6726 Volume 3 Issue 6ǁ June 2014 ǁ PP.07-11 Reflecting Against Perception: Data Analysis of IPL Batsman Amit Kumar 1, Dr. Ritu Sindhu 2, 1,2, Computer Science Engineering, SGTIET, Gurgaon/ Maharishi Dayanand University, Rohtak, India ABSTRACT: This paper analyzes the (Performance of the batsman based upon the run scored different condition during IPL) using data of cricket, particularly IPL. It uses some predictive modeling to identify the trend and it explains different pattern developed by mining of that captured data. These pattern are either deviated or against our perception about them. This paper discuss various analysis created from IPL cricket data from 2008-2014. So in short it converts data into relevant information, which can help better understanding of the game & player s performance as raw data doesn t portrait the complete picture/aspect of any player. KEY WORD: Data analytics, Data mining, sports data analysis, cricket, IPL, predictive modeling. I. INTRODUCTION Cricket is played globally; It is the most popular game in India. It is played in various format like Test series 1,2, One day 3 and twenty- twenty (T-20) 4. In twenty twenty (T-20) is a recently developed format in which each team bat for twenty overs. It can be played in day and night format also. Due to its short duration it is becoming more popular than other format. Indian Premier League (IPL) which was started in year 2008 5, which is hosted (administrated by BCCI -Board of Cricket Control of India ) is much popular than any other domestic tournament. Sportsmen from all over the world participate in this tournament, and viewers from all over the world follow the tournament. Figure-1: indicates that ODI and twenty twenty format is played more than other format. IPL matches which is a approx 60 days tournament, in 5 years around 524 matches were played and number of run scored per match is higher than any other format. Source: www.iplt20.com, http://www.icc-cricket.com, http://www.espncricinfo.com/ This paper analyzes the data of IPL by creating different performance matrices it discuss some experiments that analyze batting performance of a given player. It also discusses the use of impact model matrices to analyze the performance of a batsman. 7 Page

Table I: The IPL so far in numbers: Year Matches Runs Wkts RPO 2014 60 18909 671 8.2 2013 76 22541 909 7.67 2012 75 22,453 857 7.82 2011 73 21,154 813 7.72 2010 60 18,864 720 8.12 2009 57 16,320 697 7.48 2008 58 17,937 689 8.3 Source: http://www.espncricinfo.com/, http://www.iplt20.com, Wkts:wickets, RPO:run rate per over II. CAPTURING THE DATA Secondary data has been taken from the official website of IPL T20 6, ICC 7 and espncricinfo 8. A top twenty batsman data is captured and analyzed. Data has been collected from the year 2008(inception year) till 2014 (recently ended season) for table-3 data used is 2008-2012 taken from www.iplt20.com. MATRIX Unlike other conventional matrices that measure bating performance (such as Total no of runs, batting average, strike rate, etc) Continuous Adjusted Average and batting impact are used. Continuous Adjusted Average Continuous-adjusted average (CAA) Matrix is used to rank the top batsmen of all time in Test cricket. In this paper we have used it in the context of IPLs. As discussed in Borooah and Mangan 9, just looking at the average of a batsman over a period of time might not provide the full picture of how the batsman has performed 10. Averages can be inflated because of a few high scores in non relevant matches or against weaker teams or non critical games. It is important to measure how consistent the batsman was in his performance. To measure Continuous, the Gini coefficient is used (Gini 1912) 11. The Gini coefficient measures the inequality among values of a frequency distribution. The values can range from 0 to1. A low value indicates a more equal distribution (with 0 corresponding to complete equality), and a high value indicates a more unequal distribution (with1 corresponding to complete inequality). In the context of one-day international cricket, the Gini coefficient is computed as Where G is the Gini coefficient, N is the number of complete innings, μ is the average score, and R is the number of runs scored by the batsman in the ith innings. Then the Continuous-adjusted average (CAA) is computed as CAA= µ*(1-g) (2) Where μ is the Continuous-unadjusted average (that is the regular average). Batsman impacting Score (BIS) Batsman impacting Score (BIS) evaluates performance of a player in reference to a match. It takes into account not only how many runs a player has scored but also the pace at which he scored the runs and the conditions under which he scored the runs. This is similar to some of the metrics discussed on the Impact Index website (Impact Index Cricket Pvt. Ltd.) 11. A BI score is assigned to every player who batted in a given match based on the following aspects of his performance: 8 Page

Runs Impact Score (RIS) This metric measures the ratio of the runs scored by the player against the mean runs for all players for the match. The RIS for a player p in match m is computed as Where RISpm represents the runs scored by player p in match m, and N is the total number of players who batted in the match. Strike Rate Impact Score (SRIS) This metric assigns a positive score to the player if his strike rate is above the mean strike rate for the match and a negative score if it is below. If his strike rate equals the mean strike rate of the match, then his SRIS is equal to zero. The SRIS for a player p in match m is computed as Where SRISpm is the strike rate of player p in match m, and is the number of balls faced by player I in match m. Pressure Impact Score (PrIS) The score for this metric is a product of two variables: (a) the situation in which the player comes in to bat and (b) The player s RIS. The first variable is called the Pressure Factor (PF), which depends on the difficulty of the situation when the batsman comes to bat. A situation s difficulty is measured in terms of the number of wickets that have fallen and the mean runs for the match. The PrIS for a player pin match m is computed as Where PFm is the number of wickets that have fallen in the innings when player P comes to bat in match m. is the score of team when player p comes to bat in match m. If player p is an opening batsman, then it is always 0. To account for this all opening batsmen are given a PF of 0.5. In other words, he is expected to score at least half the number of mean runs for the match. Chasing Impact Score (CHIS) This special Score is assigned to a player for staying not out in the second innings of a successful chase. If a player satisfies this criterion then his CHIS is equal to his RIS, otherwise It is equal to 0. Based on the preceding four scores, the overall batting impact score (BI) of a player p in a match is computed as Where BIpm is the highest batting score to match m and is used as a normalization factor. Here it is obvious by table-2 that S K Raina whose is more consistent than Rohit G Sharma but by ranking of Average, he is no.2 but according to CAA it is oblivious that he is no 1. Similarly Utthappa, Marsh is over ranked, while SR Tendulkar, G Gambhir is under ranked due to experimental error in Average, although they have consistently performed during the season. 9 Page

Table -II: The Analysis of Ranking of players who played in the year 2008-14 S. No. Name of player Rank by Avg. Average Continuous adjacent average(caa) CAA Difference in rank. 1 S K Raina 2 51.84 39.98 1 1 2 R G Sharma 1 52.95 38.09 2-1 3 G Gambhir 6 47.00 32.34 3 3 4 C H Gayle 3 51.01 31.98 4-1 5 V Kohli 7 46.30 30.58 5 2 6 P V utthapa 4 47.68 28.54 6-2 7 V Sehwag 8 45.54 27.89 7 1 8 MS DHONI 10 42.10 26.00 8 2 9 JH KALLIS 5 47.45 24.70 9-4 10 SR TENDULKAR* 12 42.08 24.20 10 2 11 S DHAWAN 9 44.02 23.73 11-2 12 RDRAVID* 11 42.44 23.03 12-1 13 Y. PATHAN 18 34.89 22.06 13 5 14 AC GILCHRIST * 17 35.09 21.45 14 3 15 KD KARTHEE 13 41.21 20.68 15-2 16 AB D-VILLIARSA 14 39.92 20.66 16-2 17 SR WATSON 16 35.07 20.33 17-1 18 MARSH 15 38.74 19.57 18-3 19 DA WARNER 20 33.59 19.46 19 3 20 HUSSEY 19 34.08 19.26 20-1 S.No. Name of player* Average Table III: Batting Ranking Matrices of players Rank by Avg. Continuous adjacent average BI Rank by CABI Difference in rank. 1 Chris Gayle 52.28 2 65.54 1 1 2 Virender Sehwag 53.85 1 46.70 2-1 3 Shane Watson 47.00 6 43.23 3 3 4 David Warner 51.65 3 39.42 4-1 5 Shaun Marsh 46.89 7 36.03 5 2 6 Gautam Gambhir 48.67 4 33.85 6-2 7 Shikhar Dhawan 45.02 8 33.30 7 1 8 Kevin Pietersen 43.80 9 31.25 8 1 9 Sachin Tendulkar 47.55 5 31.09 9-4 10 Suresh Raina 42.37 10 29.84 10 0 Fig.2 Graph for Batting Ranking of Players from Table-III 10 Page

Graphical representation of data in any sports 12 From Table 3 it is obvious that Chris Gayle who has been ranked two by average system have performed consistently and he has been a match winner. He has performed in critical matches second surprise is SR Tendulkar who failed to click in crucial matches. Same thing is with Gautam Gambhir. He has performed below par in critical matches from 2010-2012 in IPL matches III. CONCLUSION From the above discussion it can be concluded that there are various factor other than average which contribute to the find out real performance indicator. So while analyzing any data we should use advance matrices to get the real projection. The consistent performance is more important than any other random score. but there are limitation for any predictive and performance model, so all possible factor should be factor in before reaching on any conclusion. REFERENCES [1] Classification of Official Cricket [2] Standard test match playing conditions http://icclive.s3.amazonaws.com/cms/media/about_docs/524ac43be2acastandard%20test%20match.pdf [3] Standard ONE-day International match playing conditions http://icc-live.s3.amazonaws.com/cms/media/about_docs/524ac4ae08b48-04%20standard%20odi_2013_19%2009%2013.pdf [4] Standard TWENTY20 International match playing conditions http://icc-live.s3.amazonaws.com/cms/media/about_docs/524ac4f092e53-05%20standard%20t20_2013_19%2009%2013.pdf [5] History: all about IPL http://sports.in.msn.com/cricket/ipl-5/history/aboutipl.aspx [6] http://www.iplt20.com/ [7] http://www.icc-cricket.com/ [8] http://www.espncricinfo.com/ [9] Borooah, V.K. and Mangan, J., (2012). Measuring Competitive Balance in Sports using Generalized Entropy with an Application to English Premier League Football, Applied Economics, 44, 1093-1102. [10] Analysis of Batting Performance in Cricket using Individual and Moving Range (MR) Control Charts: Muhammad Daniyal1,Tahir Nawaz1, Iqra mubeen 2, Muhammad Aleem, ISSN 1750-9823 (print)international Journal of Sports Science and Engineering, Vol. 06 (2012) No. 04, pp. 195-202. [11] Inequality Analysis, The Gini Index: Lorenzo Giovanni Bellù, Agricultural Policy Support Service, Policy Assistance Division, FAO, Rome, Italy https://www.princeton.edu/~achaney/tmve/wiki100k/docs/gini_coefficient.html [12] Comparison of bowlers, batsmen and all-rounders in cricket using graphical displays, Paul J. van Staden technical report, University of Pretoria, Faculty of natural and agricultural sciences, Department of statistics, ISBN: 978-1-86854-733-3 11 Page