Is anyone finding Google Blog Search to be filled with spam? I just read on digg about how Microsoft sent Mozilla a cake for their birthday, so I went to find a picture. Here's what I saw:
http://blogsearch.google.com/blogsearch?hl=en&q=microsoft+cake+mozilla+birthday
All spam, mostly on blogspot. Take a look at this post: http://bound-yfabim.blogspot.com/ |
someone should rebuild Google Blog security system all over again |
So we have a post that has 100 links and only 100 links. How hard is it to see it's spam? |
It's odd because I'm not finding other queries returning such bad results. I wonder how this particular query is triggering something that's causing that. I mean, all 10 frontpage results are useless spam.
But searching "Mozilla Firefox Browser" yields completely clean results: http://blogsearch.google.com/blogsearch?hl=en&q=mozilla+firefox+browser
In fact, removing any of the terms from that first search I linked removes all the spam! Strange. |
The results you now get are not quite as spammy as they were an hour ago – this very forum thread is now the top result! |
True :)
But the rest is still junk. Though I'm more convinced that this is some kind of glitch after trying out different queries. Especially since, as I mentioned above, removing even one term from my original search gets rid of the spam. I found the secret 4 words for massive spam :) |
Actually, I think the problem is that you happened to search on 4 words of which 2 are known to be widely searched for (Microsoft and Mozilla), so are regularly added to spammy pages to hopefully catch some of those searches.
If you add "-free" to your query, since free is another word which appears on a very high proportion of spam pages for just the same reason, then you have a much better result.
http://blogsearch.google.com/blogsearch?hl=en&ie=UTF-8&q=microsoft+birthday+cake+mozilla+-free&btnG=Search+Blogs |
I also find a lot of spam while using Blog Search. It is much more prevalent if you sort by date instead of sort by relevance. |
I absolutely hate the amount of spam (or "syndicated" blogs, as they often are) in Google Blog Search. You should be able to flag any blog as being spam (using a similar approach to Blog*Spot). |