Google updated their robots.txt file, excluding the (currently not found) URL http://www.google.com/cbk
What could CBK stand for? |
It's related to Maps Street View.
The URLs look like this: http://google.com/cbk?output=xml&ll=39.7570,-104.9874 http://google.com/cbk?output=overlay&zoom=13&x=1308&y=3169 etc. |
That's just Street View's backend. |
BTW, Immersive Media, the company behind most of th Street View images, announced a month ago in their press release:
"Immersive Media Corp. Continues Massive International Expansion with GeoImmersive™ Imagery Capture of Europe's Major Cities
Immersive Media Corp. (TSXV: IMC) (“IMC”), an advanced digital video imaging company, today announced the continued expansion of its GeoImmersive City Data project into Europe’s major cities. TX Immersive Ltd. (“TXi”), an IMC Certified Service Provider based in the UK, will be spearheading the initiative in capturing 360 degree georeferenced spherical video of major metropolitan areas throughout Europe including downtown cores, key points of interest, major intersections and critical infrastructure. The resulting imagery will be available for licensing and provides a complete, natural street level perspective for use in a variety of applications." http://immersivemedia.com/details.php?id=133
I'm pretty sure Google will use every available Immersive imagery... I'd love to see my house there ;] |
Maybe CBK stands for Control Banks K (capital)? |
cbk stands for "check back" |
May be it is a *C*ontact *B*oo*k*... Who knows but my instincts are telling me so. Though I don't trust it that much always... ;) |
Could someone make a robot that only indexed disallowed sites? |
You could make that, but then people would start to get annoyed pretty quickly and ban the bot from their server, because it's against netiquette. It might be more realistic to say, release a search engine that ignores nofollow, because the acceptance of that is not so universal (actually, Ask.com already ignores nofollow...), or one that ignores the "noodp" meta directive and so on... |
James, I guess so. But you'd have to find links to those disallowed pages which you could then spider. |
Cool. Like the reverse google.cn search engine you could have a reverse Google. Search pages people don't want you to see. |