iEntry 10th Anniversary RSS Newsletter Advertising
Visit Twellow.com

Search Bots Behaving Badly

Post to Twitter Post to Facebook

In a classic bit of Internet history (1998/12/30), a newsgroup member explained the GoogleBot to a poster complaining of its rude behavior.

Timeout: Putting bots in a corner for bad behavior...

"It's a legitimate research project. Unfortunately, it's the worst written piece of crapware you'll ever see crawling your website. It ignores robots.txt (well.... it repeatedly retrieves it, but ignores the contents), ignores robots metatags and headers, gets confused by infinite trees and will suck all your excess bandwidth for significant periods if you let it.

"It's broken. Deny it access.

Cheers,
Steve

(tip from SEORoundTable)

Dan Thies, posting at the SEORoundTable, mentioned a forum thread that told of Microsoft's bot acting in a similarly rude manner.

The forum post, by lundens, states, "the main issues is 175,000 hits and 3 Gig of data just to support msnbot in May thus far is adding up to more than I can afford to use."

"If I were getting hammered like that," said Dan, "I wouldn't rely on robots.txt, I'd block every known MSNbot IP address."

The MSNbot apparently got caught in an infinite loop in a dynamic application part of the poster's website.

Have you noticed MSNbot exhibiting adolescent crawler behavior?

Garrett French is the editor of iEntry's eBusiness channel. You can talk to him directly at WebProWorld, the eBusiness Community Forum.

About the author:
Garrett French is the editor of iEntry's eBusiness channel. You can talk to him directly at WebProWorld, the eBusiness Community Forum.

Comments

Post new comment

The content of this field is kept private and will not be shown publicly.
  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
CAPTCHA
This question is for testing whether you are a human visitor and to prevent automated spam submissions.
14 + 0 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.
Featured Headline
Fake Chrome OS Screenshots Punk Tech Media
Mystery Blogger Comes Clean
5 comments | 23 hours ago
WebProNews on Facebook
 
Subscribe to WebProNews


Send me relevant info