Friday, August 20, 2010

Correlating Type of Bowler and Strokes used by Batsman

Most commonly used strokes by batsman are as follows:


  • Straight Drive/Square Drive
  • Square Cut/late Cut
  • Hook & Pull
  • Sweep
  • Glance
Now one can compare these strokes of the batsman with the type of bowler. The representation will look something as shown below:

## Drives Cuts Leave/Block Sweep Leg Glance Pull & Hook
Fast Bowler
Medium Pace Bowler
Off-spinner
Leg –Spinner
All ypes of Bowler

Thursday, August 5, 2010

Correlating Type of Bowler by scoring areas of a batsman

This is very important part of SWOT analysis of a batsman. Understanding the scoring distribution of batsman. Lets consider a batsman

## Off-side On-Side
Fast Bowler 193 257
Medium Pace Bowler 131 156
Off-spinner 156 243
Leg –Spinner 78 276
All types of Bowlers 558 932

We see that he is predominantly an onside player as shown below:

## Off-side On-Side
Fast Bowler 42.89% 57.11%
Medium Pace Bowler 45.64% 54.36%
Off-spinner 39.10% 60.90%
Leg –Spinner 22.03% 77.97%
All types of Bowlers 37.45% 62.55%

The batsman scores 62% of runs on-side. More we find that spinners are his heavy target on-side. The best strategy is to attack this batsman on the off-side and expose his weakness. From a batsman perspective he needs work on his off-side.

Also we see the distribution by bowlers on onside and offside (Column percentages take here)

## Off-side On-Side
Fast Bowler 34.59% 27.58%
Medium Pace Bowler 23.48% 16.74%
Off-spinner 27.96% 26.07%
Leg –Spinner 13.98% 29.61%

Here it is clear he is comfortable with pacemen on off-side and spinners on leg side. Using this information opposition can plan bowling and fielding attack for the batsman.

One can go even a step-further and try to understand on-side and off-side areas.

Offside can be broken down into

1. Third Man

a. Deep

b. Square

c. Fine

d. Short

2. Mid-off

a. Short

b. Deep

c. Silly –Mid-off

3. Long-Off

a. Straight

b. Wide

4. Cover

a. Short

b. Extra

c. Deep-Extra

d. Deep

5. Point

a. Backward

b. Deep Backward

c. Forward

d. Cover

e. Deep

f. Deep Cover

g. Silly Point

6. Gully

7. Slip

a. First

b. Second

c. Third

d. Leg

e. Fly

Even Onside can be broken into multiple-categories like:

1. Long-on

a. Straight

b. Wide

2. Mid-on

a. Deep

b. Short

3. Mid-wicket

a. Deep

b. Deep Forward

c. Short

4. Square Leg

a. Forward

b. Backward

c. Deep

d. Deep Forward

e. Deep Backward

5. Fine Leg

a. Short

b. Deep

c. Square

d. Backward Short Leg

e. Long

f. Straight

6. Leg Gully

SWOT analysis of Batsman (Strength, Weakness, Opportunities and Threats of Batsman):

It is very important to understand Strength, Weakness, Opportunities and Threats from a batsman perspective. This we can call as SWOT of a batsman. SWOT can help:

1. Understanding batsman capability by scoring regions on the field

2. Understanding batsman strokes on the field

3. Understanding batsman dismissals by category

All these can be done with the nature and type of bowling attack under consideration.

Advantages of SWOT:

One can understand the strength and weakness of batsman in-terms of scoring, strokes and dismissals. This can help

1. Batsman to improve his playing strategy

2. Opposition to fine tune bowling and fielding tactics

Tuesday, August 3, 2010

Dimensions to Evaluate a Batsman

In our previous posts, we defined median, mode and standard deviation of players. In this post, i want to bring to attention multiple dimensions along which a player can be evaluated. Here are they !

i. by Position (Opening, First Down, Second Down, etc)

ii. by Bowler (Fast/Medium/Spin)

iii. by opposition (Aus, WI, NZ, Pak, Ind, SL , Eng, SA etc)

iv. by Location (Home/Abroad)

v. by outcome of matches (Winning/Losing)

vi. by position of the field (on-side/off-side)

vii. by Innings (Batting First/Chase)

viii. by stage of the innings (Power-Play, Middle-overs, Slog)

ix. by Boundaries, Singles, Twos, Threes

x. by current form indicators (Last 5/10 matches, Previous Tournament) - This is very important to assess the "current form". For example mean, median, mode and standard deviation of player in last 5 matches/10 matches/previous tournament.

All these dimensions can be used along with mean, median,mode and standard deviation. For example: Mean, Median,Mode and Standard Deviation of Sachin Tendulkar for Australia, England, WI, Pak,SL,Eng and SA teams will help us understand the performance of Sachin across all these teams. We can compare these numbers with overall mean, median,mode and standard deviation score for sachin. It helps understand against which teams he has performed well and which teams he hasn't matched the average figures.

Sunday, August 1, 2010

COMPARISON OF PLAYERS in-terms of MEAN, MEDIAN, MODE AND STANDARD DEVIATION

Lets us consider previous example and see how we can compare 2 batsmen (Player A and Player B) based on mean, median, mode and standard deviation


Player A

Player B

Mean Score

20.57

20.57

Median Score

20

24

Mode Score

20-24

25-29

Standard Deviation of Player

14.47

5.31

Traditionally cricket statistics will tell Player A and Player B are same because batting average (Mean score) is same. However data shows that Player B has higher Median and Mode score compared to Player A. Also Standard deviation of player A is 3 times as much as Player B, showing that Player B is more consistent.

Using these statistics we can conclude Player B is better in-terms of consistency and performance.

Wednesday, October 8, 2008

New Metrics for Measuring Batsman Characteristics

Currently most of the batsmen in all formats of cricket are evaluated based on batting average (This is called as Mean score of a batsman). In this article I plan define 3 more fundamental aspects to batsman analysis, i.e.

1. Median score of a batsman (Middle -score of batsman arrived by ordered scores of his/her innings)

2. Mode score of a batsman (Most commonly scored runs by Batsman)

3. Standard Deviation of a batsman (Describes Batsman’s variability)

We will in due course see the application of these statistics to evaluate batsmen.

Let’s formally define all these. Let’s consider scores of 2 batsmen A and B as shown below for analysis:

T-20 scores

Player A

Player B

Match 1

0

15

Match 2

11

24

Match 3

20

25

Match 4

35

12

Match 5

24

17

Match 6

45

25

Match 7

9

26

Mean -- "a value that is computed by dividing the sum of a set of terms by the number of terms" (from the Merriam-Webster Dictionary)

Mean of a batsman is nothing but the batting average statistic which is the most commonly used metric to measure batsman characteristic. In the above case, this will be computed as follows:

Batting Average/Mean score of a batsman A is (0+11+20+35+24+45+9)/7 =20.57

Batting Average/Mean score of a batsman B is (15+24+25+12+17+25+26)/7 =20.57

Median -- "a value in an ordered set of values below and above which there is an equal number of values or which is the arithmetic mean of the two middle values if there is no one middle number" (from the Merriam-Webster Dictionary).

Let’s consider the above example, we find that Median of Player A is 20 whereas Median of Player B is 24. The methodology is very simple, I simply arrange the scores in descending order and the middle most value is median.

Player A

Player B

0

12

9

15

11

17

20

24

24

25

35

25

45

26

Sometime we may not exactly have a middle most value. There could be 2 scores which may be considered for median. The best way to address that problem is to take averages of those two scores.

Mode -- "the most frequent value of a set of data" (from the Merriam-Webster Dictionary).
Consider player A and Player B, we can

Let’s consider the mode calculation for Player A and Player B. Instead of calculating mode as per statistical definitions, I have introduced a range based mode concept as shown below:

Mode Categories

Player A

Player B

0--4

1


5--9

1


10--14

1

1

15--19


2

20--24

2

1

25--29


3

30--34



35--39

1


40--44



45--49

1


In the above example we see that the mode score for Player A is 20-24 where as for Player B is 25-29

Standard Deviation -- a measure of the dispersion of a frequency distribution that is the square root of the arithmetic mean of the squares of the deviation of each of the class frequencies from the arithmetic mean of the frequency distribution (from the Merriam-Webster Dictionary).

The standard deviation can be computed using the below formula:

SD= square root of[(sum of X squared -((sum of X)*(sum of X)/N))/N]

Basically this formula uses sum of square difference between score and means (ยต) batsman scores. This actually represents the variation of the batsman. If we substitute the value for Player A and Player B, we get:

Player A – standard deviation is 14.47

Player B – standard deviation is 5.31

What is Cricmetrics and why do we need it ?

I have been an analytics professional for over 5 years now. I have seen how analytics and metrics have redefined the businesses. Thomas Davenport talks about this exciting area in his new bookhttp://www.amazon.com/Competing-Analytics-New-Science-Winning/dp/1422103323

Cricket has been a favorite sport for Indians and many other Asian countries. Of course we have countries like Australia, NZ, England, West Indies, South Africa and many more countries into this sport for quite sometime.

Statistics have been used for quite sometime in cricket. For example, metrics like batting averages, strike rate all describe the ability of batsman or his behaviour in some sense. Now the point is that can we take this analysis to next level to answer questions like?

1. What can increase the probability of a Team winning?

2. What is optimum mix of Team given opposition and pitch conditions?

3. Are there better metrics to measure batsman and bowlers?

Understanding the game and players with better metrics can not only help the teams to win games but also the Leagues like IPL, ICL, etc to evaluate player for the price they pay and measure the RoI (Return on Investment).

In the next 10 years, analytics and statistics will be extensively used in cricket to measure player, team, and league performances. It will help clubs to reduce costs and increase their win rates. Very soon every league will have experienced analytical consultant (who understand statistics with cricket) to make team strategy and effect the outcome of games.

Welcome to the world of cricmetrics. According to me cricmetrics is a science of using mathematical and statistical analysis of cricket records. For example it can help us to understand players, teams and environment like pitches, effect of day Vs night games, Spinners Vs Pacers, what player is suited in what kind of roles, how to bowl to the weakness of a player, etc.

Currently on-field captain has to make many decisions but very often we see small mistakes can make big difference. For instance bowling change depending on opposition batsman & condition of match (pitch, over being played etc) can give captains great insights to take a right decision.

So welcome to this world of cricmetrics where i plan to discuss variety of analytical applications in cricket. I want all viewers to add their thoughts, so that it can refine my ideas in this exciting field cricket!

Keep checking this space for exciting metrics and analytics on cricket :). Feedbacks is what can take this science to new level :)