iEntry 10th Anniversary RSS Newsletter Advertising
Visit Twellow.com
Text: Decrease Font Size Increase Font Size | Print Print Article | Share: Delicious Digg StumbleUpon Post to Twitter Post to Facebook
15 commentsSaturday, April 12, 2008

Google Starts Controversial Form Crawling Program

Small number of sites in initial rollout
Googlebot received an update that permits it to complete certain forms, and learn more about the site hosting them.

Websites place content behind forms for the purpose of collecting information from a visitor requesting access to it. The site publisher might want those details for demographic details to improve marketing campaigns, for example.

Google thinks it can present better results to searchers by having access to the URLs behind forms, improving the site's exposure in the process. The Google Webmaster Central blog promised their crawls will be well-behaved:

Only a small number of particularly useful sites receive this treatment, and our crawl agent, the ever-friendly Googlebot, always adheres to robots.txt, nofollow, and noindex directives. That means that if a search form is forbidden in robots.txt, we won't crawl any of the URLs that a form would generate. Similarly, we only retrieve GET forms and avoid forms that require any kind of user information.

However, concerns have been raised about Google crawling forms not marked as forbidden. Kevin Heisler complained at Search Engine Watch the practice could violate the privacy of corporate data.

Though confident in Google's intentions, Heisler thinks potential backlash from corporate interests could be a problem. "The costs to CEOs, CIOs and CTOs at corporations far outweigh the benefits to consumers," he said.

News Tags: Google, Crawler, Privacy, Forms

before too long gbot will

before too long gbot will break through captchas and start signing up as a user. All in the interest of helping the consumer ;)   

Those dang captchas.  I

Those dang captchas.  I could hardly read half of them 'specially  them warpy funky funhouse mirror ones.  Anyway Google's intentions are good. Google knows where to draw the line and is fair about it.   It's all for and about the benefit for users of the internet.

Publish A Comment

The content of this field is kept private and will not be shown publicly.
  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
CAPTCHA
This question is for testing whether you are a human visitor and to prevent automated spam submissions.
6 + 12 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.
SEARCH
Popular WPN Business Resources












Subscribe to WebProNews


Send me relevant info