learning, thinking, blogging.
Google hates Doughnuts (and Advanced Search Strings)
Today, while I was writing my Google Search Guide, I was playing with the “..” google operator, and after some search requests I bumped in this 403 page:
.
… but your query looks similar to automated requests from a computer virus or spyware application. To protect our users, we can’t process your request right now.
We’ll restore your access as quickly as possible, so try again soon. In the meantime, if you suspect that your computer or network has been infected, you might want to run a virus checker or spyware remover to make sure that your systems are free of viruses and other spurious software.
We apologize for the inconvenience, and hope we’ll see you again on Google.
To continue searching, please type the characters you see below:
Wooha, maybe I used a really strange search string…
Nope: my search string was “1..10 doughnuts”,
Ok, others have reported similar behaviors, but they were doing a bit more advanced (and suspect) searches .. what the hell is suspect in a doughnut search?? Anyway, after some research I found something more…
It seems there are two levels of filter… one with the captcha (as in the first picture) and one without the captcha, like the following one (search was inurl:.edu “1..10 doughnut OR doughnuts”) 
I’m ok with the captcha: it’s a bit annoying but if you’re human you can work it out, but what about the non-captcha page? A search engine that prevents you to use it’s own search functions? That’s so lame!
It also seems that some search strings can be directly marked as spammy (like inurl:.edu “1..10 doughnut OR doughnuts” ) while others are marked as spammy if they comes with lots of requests from the same IP. (try searching “1..10 doughnuts” a dozen of times in a spell)
I understand google’s need to protect itself from spammers, but what about our need for doughnuts?
| Print article | This entry was posted by francesco mapelli on 2006/12/17 at 3:58 pm, and is filed under Uncategorized. Follow any responses to this post through RSS 2.0. You can leave a response or trackback from your own site. |
- Google Blogsearch Tips (And a Little Wordpress Hack) | www.mapelli.info
- The Matrix Google Has You | www.mapelli.info
- Google Search Guide
- Google Search Tips (Ultimate Google Guide)-NotFailed
- Str8Blog | Str8Blog
- Google Search Tricks « e-tricks and tips . tk
- Microsoft Office Professional Plus 2010
- writing tips
- website
- Windows Recovery Virus
- Watch Movies Shows Online
- Infos about Leasing and Financing for Computers including Laptops, Notebook Computers, Software Leasing, Server Leasing.
- seo chester
- web design in chester
- reverbnation hits
- product photographers
- All the backlinks you need
- Understanding How Your AC and Home Heater Works « Understanding How Your AC and Home Heater Works
- Mac Adobe
about 5 years ago
It looks likeGoogle fixed this problem. Got some nice donut recipes though
about 5 years ago
Mark, I don’t think the problem has been fixed… I still get those lovely 403 after a few “spammy” searches …
about 5 years ago
A question or two.. (I’ve been awake for almost 36 hours and am brain dead, so heres hoping I dont make an ass of myself)
The query: inurl:.edu “1..10 doughnut OR doughnuts”
wouldn’t .. inurl:.edu “1..10 doughnut” .. work the same, since (IIRC) google should auto look for plurals? Or would plurals not be auto-applied here since quotes were used?
How were you submitting the query? From google’s form, by punching in the url, from some other script? (Im wondering if google checks referers here or not)
Your post doesn’t say explicity, so I’m wondering: After a few “spammy” searches, you got the captcha/denied page.. Did you follow through with the captcha? And if so, after a few more “spammys”, got the denied/denied page?
That is.. Is google keeping a session cookie or something similar? I know google uses cookies to store your settings, but have been too lazy to see if they use a session cookie..
Assuming they do use session cookies, and you _didnt_ follow through with the captcha, that could make the google engine a bit more paranoid, correct? (But you’re a good googler and did do the captcha. :D)
And then theres the bit of [quote]try searching “1..10 doughnuts” a dozen of times in a spell[/quote] .. That’d be hammering.
Ya know. Thinking about it, and I mean _really_ thinking about it. There were no auto-plurals, no session cookies, and no hammering. Google just doesnt like doughnuts.
I’m betting the google techs prefer bagels.
about 5 years ago
drzimp : you won the “Longest Comment Ever” contest!! :D
wouldn’t .. inurl:.edu “1..10 doughnut” .. work the same, since (IIRC) google should auto look for plurals? Or would plurals not be auto-applied here since quotes were used?
if you check, the auto-plural does not works here, i think for the quotes ;)
How were you submitting the query? From google’s form, by punching in the url, from some other script? (Im wondering if google checks referers here or not)
mainly with the firefox search box, but also with direct url and with google homepage… I didn’t notice different behaviors depending on the source of the query…
Your post doesn’t say explicity, so I’m wondering: After a few “spammy” searches, you got the captcha/denied page.. Did you follow through with the captcha? And if so, after a few more “spammys”, got the denied/denied page?
Something like that. If you keep on submitting spammy requests after the first denied page you’ll bump in the no-captcha page, but you can also go directly to the no-captcha if your requests are too complex
I think they use something like a cookie, but don’t know exactly… but if you don’t fill the captcha you have to wait a few minutes before you’re allowed to do a soft-spammy query, but there are no problems with basic queries (like “doughnuts”).
I think that google uses a function that depends from the query and the query-rate: you can be blocked by a lots of simple queries (since it’s something near hammering, as you pointed out) or by a few (or even a single one) complex queries.
Maybe I should add a poll… something like “donuts vs bagels”, and put an end to this never-ending war :)
about 4 years ago
This happened to me not long ago. I think they’re trying to completely control the internets.
about 4 years ago
Advanced search strings, okay, but… donuts?
Nooooooooooooo!!!!!
about 4 years ago
It looks likeGoogle fixed this problem.
about 4 years ago
I don’t know what you understand by ‘google fixed this problem’, because it looks to me like it didn’t. I still get those 403 pages, rarely, but I do. Any thoughts on this?
about 2 years ago
I dont think google solved this problem. It’s really getting on my nerves. I cant even view any pictures from my blog or anything from Blogger!
about 2 years ago
I’ve run into this sort of problem several times. Google doesn’t like it when you get too specific in the search queries. Sometimes it is blatantly evident what you are trying to do, and thus, they try to circumvent your request.
about 2 years ago
I hope Google fixes all these problems pretty soon.
about 2 years ago
I’ve to do some research on this. I’ll read it again.
about 1 year ago
со спамом бороться только грыжу нажить!
about 1 year ago
first the google images, now this… google sucks now…
about 6 months ago
That is odd! Though I do get that occasionally but I suspect it’s due to the fact that I use the Google Sharing firefox plugin which makes my searches anonymous.
about 5 months ago
Well, I suppose this is really old news. Either way, it does seem like they’ve fixed the problem, somewhat. Maybe I just didn’t try it enough times..
about 4 months ago
Wow! This is so odd, I would never expect to come across a search ban of any sort from Google; that is what they are all about! Not quite sure what the exact reasoning in this is or why anyone would make virus just to search Google but at the same time maybe I am just technologically unaware. Very neat though; other than the inconvenience, of course.
Adrian
about 4 months ago
I had to learn this out the hard way. Almost destroyed my site!
about 4 months ago
Amazing article on avoiding common errors : )
Please visit this site if you want to improve your air conditioning bill
about 4 months ago
I hate errors! Our company sells truck scales and also airplane scales and we always try to eliminate errors
about 4 months ago
I’ve never had this problem before and I use “…” searches often because it’s more accurate.
More to the point, why would Google care if software is performing searches? Does it skew data? If so, does it matter? I think not.
But, Google is not the Big G for nothing. Whether searching for music, videos, articles, podcasts … the Big G is king.