Ever wonder which one of the two search giants spiders more of an individual HTML page? According to ResearchBuzz, it's Yahoo.
If you search Yahoo for aardvark apple zither zephyr originurlextension:html...you'll find that filesizes are listed with search results, and the filesizes listed are well over 150K — I see page sizes of over 800K listed here! At least one of the pages listed, at 173K, appears from its cache to be fully indexed..
Searching Google for the same query, according to ResearchBuzz, produces a maximum of 101K from an individual HTML page. ResearchBuzz recommends using Yahoo over Google when "running searches which might tend to focus on large pages."
Interesting - what do people think on this?








1. To date, The googlebot spiders are much more active than the Yahoo Slurp spiders. Check the server logs of most sites and compare the total numbers of googlebots vs. Yahoo Slurp spider visits, typically the googlebots will have been out to crawl pages three to four times as often as Yahoo.
The googlebot spiders do prefer smaller pages as they are easier to compress, cache, and store. However, to suggest that keyword searchers will not find larger pages in Google is a little misleading.
Thousands of large, dynamic, database-driven URLs, can be found at the top of most major search engine results pages, including Google.
Posted at 4:40AM on Dec 19th 2005 by Jack Roberts