The research design used in this study is derived from Forrest & Mchale (2007). First, for all 37.220 players the odds are turned into winning probabilities, by using the next formula:
Winning Probability = 1/odds
For example: if Federer has a winning odd of 1,05, the win probability is 1/1.05 = 95%. This is done for all 37.220 players who played a match in ATP and WTA tournaments over the last 5 years.
The next step is to divide all the odds in categories based on their probability. The next 20 categories will be used:
Table 6: Probability categories
1.
|
0-5%
|
11.
|
50-55%
|
2.
|
5-10%
|
12.
|
55-60%
|
3.
|
10-15%
|
13.
|
60-65%
|
4.
|
15-20%
|
14.
|
65-70%
|
5.
|
20-25%
|
15.
|
70-75%
|
6.
|
25-30%
|
16.
|
75-80%
|
7.
|
30-35%
|
17.
|
80-85%
|
8.
|
35-40%
|
18.
|
85-90%
|
9.
|
40-45%
|
19.
|
90-95%
|
10.
|
45-50%
|
20.
|
95-100%
|
In Forrest & McHale (2007) they use probability categories of 10%, which means this study is more specific by using probability categories of 5%.
After the distribution in categories, the mean return per category was calculated. The return will be determined by betting 1 euro per match.
Mean return of category i: ∑ returns
Ni
Statistical tests
First calculate the standard deviation of every category. Then there will be a T-test for every category.
T-statistic = Mean Return
Standard Deviation /
Results
This section will provide the results and findings of the study as described in section 4. More specifically, section 5.1 will presents the results of all ATP and WTA tournaments in the past 5 years (2009-2013). To see if there is a difference in favorite longshot bias between men (ATP) and women (WTA), in section 5.2 the ATP and WTA tournaments are investigated separately. In section 5.3 the trend over the last 5 years is studied. In the last section the results of the influence of surfaces on the favorite longshot bias are presented.
In every section a table with the results is presented which contains the probability, the mean return and the standard deviation for every category. In the last column the result of the t-test for every category is calculated. The t-test is significant when it is below -2 or above 2 which means -2 < t < 2 is insignificant. In the analyzing the results, we look at the categories 1 to 7 for the underdogs and categories 15 to 20 for the favorites. When there are many significant t-values we can conclude that there is a favorite longshot bias.
Results ATP and WTA tournament (2009-2013)
Appendix A shows an overview of all ATP and WTA tournaments in the last 5 years (2009-2013). In total 18.610 matches are played during this tournament. Every player has an odd, so this means that there are 37.220 odds in this study. Below you find the outcomes of all the ATP and WTA tournaments in the last 5 years (2009-2013).
Table 7: Results of all ATP and WTA tournaments in the last 5 years (2009-2013)
All ATP and WTA tournaments 2009-2013
|
Category
|
Proba-bility
|
Number of games
|
Mean return category
|
Standard deviation
|
T-test
|
1
|
1-5%
|
122
|
-0,470
|
3,354
|
-1,548
|
2
|
5-10%
|
1.034
|
-0,483
|
2,502
|
-6,209
|
3
|
10-15%
|
1.274
|
-0,345
|
2,182
|
-5,647
|
4
|
15-20%
|
1.518
|
-0,168
|
2,036
|
-3,218
|
5
|
20-25%
|
1.980
|
-0,163
|
1,736
|
-4,178
|
6
|
25-30%
|
2.083
|
-0,179
|
1,523
|
-5,351
|
7
|
30-35%
|
2.308
|
-0,113
|
1,395
|
-3,883
|
8
|
35-40%
|
2.392
|
-0,070
|
1,274
|
-2,684
|
9
|
40-45%
|
2.443
|
-0,085
|
1,149
|
-3,670
|
10
|
45-50%
|
2.300
|
-0,040
|
1,055
|
-1,838
|
11
|
50-55%
|
1.791
|
-0,064
|
0,954
|
-2,826
|
12
|
55-60%
|
2.205
|
-0,079
|
0,865
|
-4,288
|
13
|
60-65%
|
2.606
|
-0,035
|
0,782
|
-2,273
|
14
|
65-70%
|
2.386
|
-0,056
|
0,710
|
-3,829
|
15
|
70-75%
|
2.324
|
-0,036
|
0,633
|
-2,760
|
16
|
75-80%
|
2.087
|
-0,015
|
0,553
|
-1,255
|
17
|
80-85%
|
2.063
|
-0,021
|
0,480
|
-1,970
|
18
|
85-90%
|
1.482
|
-0,023
|
0,404
|
-2,191
|
19
|
90-95%
|
1.561
|
-0,029
|
0,334
|
-3,481
|
20
|
95-100%
|
1.261
|
-0,013
|
0,203
|
-2,186
|
Total
|
|
37.220
|
-2,487
|
24,124
|
|
ATP and WTA tournaments are played at the highest level of professional tennis. The past five years 18.610 matches were played. Those matches were divided in probability categories. Probability category 1 contains the biggest ‘underdogs’ . The betting market gives those players 1 – 5% chance to win the game. The other way around, category 20 (chance: 95-100%) are the most favorite players. In the fifth column ‘Mean return category’, the mean return when betting 1 euro on all the players in the category is given. In every category this contains a negative value, this means that in every category you lose money.
What stands out in table 1 is the pattern of the mean return. Lower probability categories contains a more negative return, and higher probability categories a lower negative return. For every betted euro in category 1, the bettor receives €0,53 back. Betting one euro in category 20 will one average give €0,987 back. To make it more clear the graph below is added.
Graph 1: Development of the mean return of all ATP and WTA tournaments in the last 5 years (2009-2013)
The mean return line gets more to the 0-point on the X-as while the category becomes higher.
As said before, every category had a negative return, which means that there is no chance to make a profit on betting on the tennis tournaments. Even though, betting on favorite players will generate the least negative return and is the best strategy possible.
Looking at the last column we see a lot of t-values below -2 of above 2 which means a lot of significant results. In category 1 to 7 there are 6 significant values, which is an indication for a favorite longshot bias. However, in the categories 15 to 20 we see also 5 significant values. Based on this you cannot really speak of a bias.
Share with your friends: |