The other day I ran into this analysis made by Google about web authoring statistics.
It is quite interesting to know which tags are frequently used, and what attributes are used the most with those tags, what is the number of css classes used by page, as well as how much have fscked up authoring applications markup has invaded the web :)