manicmagpie
A slice of fried gold
{This is going to be a long post, hopefully being the stats board there will be some who will find it interesting}
Hey guys, crowd stats has always been my little obsession, but of course with the uneven draw it's always been difficult to directly compare the yearly totals of each club.
During this off season I've been playing around with some methods of equalising the figures to try and remove the discrepancies of the draw. This method is the best I can come up with:
For each club, I've calculated the long term average (2000 onwards) against each opponent on each ground both as the home team and the away team. For each club I then take the average of those averages, splitting the opponents into derby and interstate clubs.
Taking the example of Collingwood as home team at the MCG, they average against Vic clubs:
The average of those averages is 59,715. I feel this represents the expected crowd for a Collingwood home game at the MCG against another Vic club. The same method for Collingwood at home at the MCG against non-Vic clubs gives a figure of 42,222.
Once all these calculations are done for every team on every applicable ground, you can calculate the expected attendance of the entire season for each club.
Using Essendon's fixture from last season as an example: they started with North at Docklands (Away), Port at Docklands (H) and GC at Carrara (A).
Therefore you find North's home crowd average at Docklands for derby games (33,854) + Port's away average at Docklands for interstate games (24,533) + GC's home average at Carrara for interstate games (15,779). This means that if every club had this draw they would average 74,166 across the 3 games. Essendon actually got 97,005 to these games, which is 131% of the average.
Extending this to all 2012 games for all clubs yields the following results:
Problems
The main problem is the small sample size for several matchups. The most obvious difficulty being that there are two new clubs with new grounds that provide no meaningful long term average.
WC and Adelaide's percentages are possibly overstated as they share a ground with just one other team which consistently gets smaller crowds. Therefore all their home games look good by comparison.
There is also no allowance for timeslots. This is because:
a) even I can't be bothered with finding a separate average for each matchup at each ground for each different timeslot;
b) it would dilute the sample sizes to a useless degree; and
c) I think the effect of timeslots on crowd sizes is usually overstated. My next project might be to try and identify the effect, if any that the timeslot has. I have only had a look at Collingwood's figures by timeslot and there is no discernible pattern to them.
Hopefully I've explained the methodology clearly enough. Anyone who had the patience to read through all of this I would welcome your feedback.
Hey guys, crowd stats has always been my little obsession, but of course with the uneven draw it's always been difficult to directly compare the yearly totals of each club.
During this off season I've been playing around with some methods of equalising the figures to try and remove the discrepancies of the draw. This method is the best I can come up with:
For each club, I've calculated the long term average (2000 onwards) against each opponent on each ground both as the home team and the away team. For each club I then take the average of those averages, splitting the opponents into derby and interstate clubs.
Taking the example of Collingwood as home team at the MCG, they average against Vic clubs:
Code:
Carlton 70,250
Essendon 76,662
Geelong 67,646
Hawthorn 60,668
Melbourne 46,622
North Melb 52,625
Richmond 59,642
St Kilda 53,698
W Bulldogs 49,614
The average of those averages is 59,715. I feel this represents the expected crowd for a Collingwood home game at the MCG against another Vic club. The same method for Collingwood at home at the MCG against non-Vic clubs gives a figure of 42,222.
Once all these calculations are done for every team on every applicable ground, you can calculate the expected attendance of the entire season for each club.
Using Essendon's fixture from last season as an example: they started with North at Docklands (Away), Port at Docklands (H) and GC at Carrara (A).
Therefore you find North's home crowd average at Docklands for derby games (33,854) + Port's away average at Docklands for interstate games (24,533) + GC's home average at Carrara for interstate games (15,779). This means that if every club had this draw they would average 74,166 across the 3 games. Essendon actually got 97,005 to these games, which is 131% of the average.
Extending this to all 2012 games for all clubs yields the following results:
Code:
Actual Expected Total %
Collingwood 1,207,743 867,108 139.28%
Essendon 994,993 817,081 121.77%
Hawthorn 891,025 790,256 112.75%
Carlton 965,334 857,690 112.55%
Richmond 871,504 795,072 109.61%
West Coast 754,425 693,161 108.84%
Adelaide 659,884 639,499 103.19%
Geelong 760,616 744,665 102.14%
Sydney 595,825 609,557 97.75%
Fremantle 663,264 685,407 96.77%
Gold Coast 387,289 423,802 91.38%
St Kilda 695,296 763,040 91.12%
GWS 349,870 398,589 87.78%
North Melb 567,363 663,913 85.46%
Bris Lions 514,380 608,020 84.60%
Melbourne 618,341 759,398 81.43%
W Bulldogs 526,959 724,482 72.74%
Port Adel 453,641 637,665 71.14%
Problems
The main problem is the small sample size for several matchups. The most obvious difficulty being that there are two new clubs with new grounds that provide no meaningful long term average.
WC and Adelaide's percentages are possibly overstated as they share a ground with just one other team which consistently gets smaller crowds. Therefore all their home games look good by comparison.
There is also no allowance for timeslots. This is because:
a) even I can't be bothered with finding a separate average for each matchup at each ground for each different timeslot;
b) it would dilute the sample sizes to a useless degree; and
c) I think the effect of timeslots on crowd sizes is usually overstated. My next project might be to try and identify the effect, if any that the timeslot has. I have only had a look at Collingwood's figures by timeslot and there is no discernible pattern to them.
Hopefully I've explained the methodology clearly enough. Anyone who had the patience to read through all of this I would welcome your feedback.