Google Maintains Cache of Banned Domain’s Content but Doesn’t Show it During Ban
August 26th, 2006 by metapilot
For some reason, I thought 1300 pages was the max size of that spam site that was at my domain name before me but I see that as of today, the count’s up to over 8000 pages–

and growing every time I go back to look (10 minutes after I wrote the previous sentence, the count is over 11,00 pages–
(Note the new link to Google’s video search)
Now that I think about it though, these different numbers are most likely due to results coming from different data centers.Obviously, Google maintains all of this page information in the index even though the domain’s been banned for some time. It is appearing that when Google turns the domain back on, the results start back up as though the ban hadn’t existed. What I can’t tell, though, is whether Google continued to crawl that previous site while the ban was in place or if it stopped crawling it once the ban kicked in.
In any case, as the those old pages continue to populate the site:domain search in Google I’m seeing that it is showing the cache for these pages and it shows the “View as HTML” link for the couple of dozen PDF files that had been indexed from the site.
Here’s an example of the fine writing style that filled up those (now) 12,000-plus pages:

I’d guess this is from early or very low quality content auto-generation software that was probably built into the spam site creation software used to deploy the site.
- Posted in Google, SEO Case Studies
