Sampling a Billion Web Pages…

2 Comments
25 Jan

… is something you can do if you’re Google. In Dec 2005 (I guess they like to do this kinda stuff during the holidays??), they took a sample of a billion web pages, parsed out all the HTML tags, and then aggregated the results to look at which tags were used the most often. The results makes for a pretty interesting read. I will have to say that I think an angel in heaven loses its wings every time someone does a “Save as HTML…” from MS Office. Looking at the results, there are a lot of flightless angels out there right now :( Also, the <font> tag is the 16th most popular tag out there. Boo.

Posted on Wednesday, Jan 25th 2006 at 9:21 am

2 comments

  1. # joe vasquez Jan 25, 2006

    down with the font tag!

  2. # Christina Jan 27, 2006

    *gasp* the horror.

Leave a comment