<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>WebProNews &#187; Googlebot</title>
	<atom:link href="http://www.webpronews.com/tag/googlebot/feed" rel="self" type="application/rss+xml" />
	<link>http://www.webpronews.com</link>
	<description>Breaking News in Tech, Search, Social, &#38; Business</description>
	<lastBuildDate>Mon, 13 Feb 2012 02:13:00 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Google Gets Better At Crawling Smartphone Content</title>
		<link>http://www.webpronews.com/googlebot-smartphones-2011-12</link>
		<comments>http://www.webpronews.com/googlebot-smartphones-2011-12#comments</comments>
		<pubDate>Thu, 15 Dec 2011 20:55:08 +0000</pubDate>
		<dc:creator>Chris Crum</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Googlebot]]></category>
		<category><![CDATA[mobile]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Smartphones]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=84623</guid>
		<description><![CDATA[Google announced some improvements for how it indexes smartphone content in mobile search. Googlebot-Mobile will now crawl with a smartphone user-agent in addition to its previous feature phone user-agents. This, the company says, will allow it to increase its coverage &#8230;]]></description>
			<content:encoded><![CDATA[<p>Google announced some improvements for how it indexes smartphone content in mobile search. Googlebot-Mobile will now crawl with a smartphone user-agent in addition to its previous feature phone user-agents. </p>
<p>This, the company says, will allow it to increase its coverage of smartphone content, which means a better search experience for smartphone users. </p>
<p>“The content crawled by smartphone Googlebot-Mobile will be used primarily to improve the user experience on mobile search,” <a href="http://googlewebmastercentral.blogspot.com/2011/12/introducing-smartphone-googlebot-mobile.html">explains</a> software engineer Yoshikiyo Kato. “For example, the new crawler may discover content specifically optimized to be browsed on smartphones as well as smartphone-specific redirects.”</p>
<p>“One new feature we’re also launching that uses these signals is Skip Redirect for Smartphone-Optimized Pages,” adds Kato. “When we discover a URL in our search results that redirects smartphone users to another URL serving smartphone-optimized content, we change the link target shown in the search results to point directly to the final destination URL. This removes the extra latency the redirect introduces leading to a saving of 0.5-1 seconds on average when visiting landing page for such search results.”</p>
<p>Google stresses that webmaster should treat each Googlebot-Mobile request as they would a human user with the same phone user-agent. Google references a blog post from the Webmaster Central blog earlier this year about making sites more mobile friendly. </p>
<p>In that, Webmaster Trends Analyst Pierre Far <a href="http://googlewebmastercentral.blogspot.com/2011/02/making-websites-mobile-friendly.html">wrote</a>, “To decide which content to serve, assess which content your website has that best serves the phone(s) in the User-agent string.”</p>
<p>The new smartphone user-agent strings are as follows: </p>
<p>Mozilla/5.0 (iPhone; U; CPU iPhone OS 4_1 like Mac OS X; en-us) AppleWebKit/532.9 (KHTML, like Gecko) Version/4.0.5 Mobile/8B117 Safari/6531.22.7 (compatible; Googlebot-Mobile/2.1; +http://www.google.com/bot.html)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/googlebot-smartphones-2011-12/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Making The Googlebot Fetch With URL Submission</title>
		<link>http://www.webpronews.com/making-the-googlebot-fetch-with-url-submission-2011-08</link>
		<comments>http://www.webpronews.com/making-the-googlebot-fetch-with-url-submission-2011-08#comments</comments>
		<pubDate>Wed, 03 Aug 2011 18:17:56 +0000</pubDate>
		<dc:creator>Chris Richardson</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Fetch as Googlebot]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Webmaster Tools]]></category>
		<category><![CDATA[Googlebot]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=72337</guid>
		<description><![CDATA[Is there some new content and/or a new site you want Google to notice sooner rather than later? Well, there&#8217;s an official Google utility for that. With the Fetch as Googlebot URL submission tool, site owners can now directly request &#8230;]]></description>
			<content:encoded><![CDATA[<p>Is there some new content and/or a new site you want Google to notice sooner rather than later?  Well, there&#8217;s an official Google utility for that.</p>
<p>With the Fetch as Googlebot URL submission tool, site owners can now directly request that Google send the web-crawling/indexing Googlebot to the URL that was submitted.  While Google&#8217;s index will usually find these new pages/sites&#8211;especially if they are backlinked&#8211;this method speeds up the process.  In fact, according to the <a href="http://googlewebmastercentral.blogspot.com/2011/08/submit-urls-to-google-with-fetch-as.html">Google Webmaster Central Blog</a>, URL that have been submitted with Fetch as Googlebot are crawled within a day.</p>
<p>There are obvious benefits with this technique, especially if Google&#8217;s index has out-of-date pages for your site and you&#8217;d like to see the content updated.  Clearly, the same is true for new site launches as well.  The sooner it&#8217;s in the Google index, the better.  Quality backlinks and good content are still the key to gaining rank in the index, but not knowing your site will be indexed almost as soon as its launched is a boon for site owners and SEOs, alike.</p>
<p>The blog post details the steps in order to submit a URL to Fetch, and it&#8217;s really quite simple.  So much so, in fact, it would be foolish not to take advantage of the option.</p>
<blockquote><p><em><strong>How to submit a URL</strong><br />
First, use Diagnostics > Fetch As Googlebot to fetch the URL you want to submit to Google. If the URL is successfully fetched you’ll see a new “Submit to index” link appear next to the fetched URL</p>
<p>Once you click “Submit to index” you’ll see a dialog box that allows you to choose whether you want to submit only the one URL, or that URL and all its linked pages.</p>
<p>When submitting <strong>individual URLs</strong>, we have a <strong>maximum limit of 50 submissions per week</strong>; when submitting <strong>URLs with all linked pages</strong>, the limit is <strong>10 submissions per month</strong>.</em></p></blockquote>
<p><center><img src="http://images.ientrymail.com/webpronews/pictures/fetch_googlebot.jpg" alt="Fetch as Googlebot" /></center></p>
<p>In other words, you can submit 50 pages a month or 10 sites.  The post goes on to say that is if your wanting to submit content like images and/or video, use their Sitemap. The Fetch as Googlebot is intended for content that appears in the web search results, also known as text.</p>
<p>Another update allows users to submit unverified URLs to the Googlebot as well.  The difference being, with verified submissions, the person submitting most confirm ownership of the site/URL being submitted.  Unverified submissions, obviously, do not require the same proof.  There&#8217;s even a link provided for these kinds of unverified submissions, which takes you to the <a href="https://www.google.com/webmasters/tools/submit-url?pli=1">Crawl URL page</a>.</p>
<p><center><img src="http://images.ientrymail.com/webpronews/pictures/crawl_URL.jpg" alt="Fetch as Googlebot" /></center><br />
If you&#8217;re a committed site owner and you&#8217;re not taking advantage of these capabilities, you are only cheating yourself and your business.</p>
<p>The video that leads this post features Google&#8217;s Matt Cutts discussing how long it took Googlebot to recrawl a page, and it was posted on May, 2010 on the Google Webmaster Help YouTube page. While not specific, the answer for sites that frequently update content was &#8220;a few days,&#8221; but now, with Fetch as Googlebot, if it&#8217;s that important that your new content is indexed on an even more rapid basis, well, there&#8217;s a utility for that.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/making-the-googlebot-fetch-with-url-submission-2011-08/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Webmasters: Googlebot Caught in Spider Trap, Ignoring Robots.txt</title>
		<link>http://www.webpronews.com/googlebot-spider-trap-2011-08</link>
		<comments>http://www.webpronews.com/googlebot-spider-trap-2011-08#comments</comments>
		<pubDate>Mon, 01 Aug 2011 13:48:45 +0000</pubDate>
		<dc:creator>Chris Crum</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Googlebot]]></category>
		<category><![CDATA[Robots.txt]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spider traps]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=72052</guid>
		<description><![CDATA[Sometimes webmasters set up a spider trap or crawler trap to catch spambots or other crawlers that waste their bandwidth. If some webmasters are right, Googlebot (Google&#8217;s crawler) seems to be having some issues here. In the WebmasterWorld forum, member &#8230;]]></description>
			<content:encoded><![CDATA[<p>Sometimes webmasters set up a spider trap or crawler trap to catch spambots or other crawlers that waste their bandwidth. If some webmasters are right, Googlebot (Google&#8217;s crawler) seems to be having some issues here. </p>
<p>In the WebmasterWorld forum, member Starchild started <a href="http://www.webmasterworld.com/google/4346138.htm">a thread</a> by saying, &#8220;I saw today that Googlebot got caught in a spider trap that it shouldn&#8217;t have as that dir is blocked via robots.txt. I know of at least one other person recently who this has also happened to. Why is GB ignoring robots?&#8221;</p>
<p>Another member suggested that Starchild was mistaken, as such claims have been made in the past, only to find that there were other issues at play. </p>
<p>Starchild responded, however, that it had been in place for &#8220;many months&#8221; with no changes. &#8220;Then I got a notification it was blocked (via the spidertrap notifier). Sure enough, it was. Upon double checking, Google webmaster tools reported a 403 forbidden error. IP was google. I whitelisted it, and Google webmaster tools then gave a success.&#8221;</p>
<p>Another ember, nippi, said they also got hit by it 4 months after setting up a spider trap, which was &#8220;working fine&#8221; until now. </p>
<p>&#8220;The link to the spider trap is rel=Nofollowed, the folder is banned in robot.txt. The spider trap works by banning by ip address, not user agent so its not caused by a faker &#8211; and of course robots.txt was setup up correctly and prior, it was in place days before the spider trap was turned on, and it&#8217;s run with no problems for months,&#8221; nippi added. &#8220;My logs show, it was the real google, from a real google ip address that ignored my robots.txt, ignored rel-nofollow and basically killed my site.&#8221;</p>
<p>We&#8217;ve reached out to Google for comment, and if and when we receive a response. </p>
<p>Meanwhile, Barry Schwartz is <a href="http://www.seroundtable.com/google-admits-fault-13787.html">reporting</a> that one site lost 60% of its traffic instantly, due to a bug in Google&#8217;s algorithm. He points to a <a href="http://www.google.com/support/forum/p/Webmasters/thread?tid=11563538bda38f29&#038;hl=en">Google Webmaster Help forum thread</a> where Google&#8217;s Pierre Far said:</p>
<p><em>I reached out to a team internally and they identified an algorithm that is inadvertently negatively impacting your site and causing the traffic drop. They&#8217;re working on a fix which hopefully will be deployed soon.</em></p>
<p>Google&#8217;s Kaspar Szymanski comment on Schwartz&#8217;s post, &#8220;While we can not guarantee crawling, indexing or ranking of sites, I believe this case shows once again that our Google Help Forum is a great communication channel for webmasters.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/googlebot-spider-trap-2011-08/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Tips for Getting Crawled Faster by Google</title>
		<link>http://www.webpronews.com/tips-for-getting-crawled-faster-by-google-2009-08</link>
		<comments>http://www.webpronews.com/tips-for-getting-crawled-faster-by-google-2009-08#comments</comments>
		<pubDate>Fri, 28 Aug 2009 08:00:19 +0000</pubDate>
		<dc:creator>Chris Crum</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Crawling]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Googlebot]]></category>
		<category><![CDATA[presentations]]></category>
		<category><![CDATA[search engine indexing]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=51224</guid>
		<description><![CDATA[<p>Probably the most important step in getting your site found in a search engine is the one in which the search engine crawls it. There are things that can be done and things that can be avoided to make this process as painless as possible for the search engine, which will in turn, make it as painless as possible for the webmaster.<br />
]]></description>
			<content:encoded><![CDATA[<p>Probably the most important step in getting your site found in a search engine is the one in which the search engine crawls it. There are things that can be done and things that can be avoided to make this process as painless as possible for the search engine, which will in turn, make it as painless as possible for the webmaster.</p>
<p>Since Google dominates the search market share by such a large market share, it is always a good idea to listen to what they have to say about such matters. So when they post a presentation with tips on optimizing crawling and indexing, you&#8217;ll probably want to pay attention. </p>
<p>Google has <a href="http://googlewebmastercentral.blogspot.com/2009/08/optimize-your-crawling-indexing.html">done just that</a>, highlighting things to stay away from, and things you can do to enhance your site&#8217;s crawlability. Here is that presentation with specific examples of URLs.</p>
<p><center></p>
<table>
<tbody>
<tr>
<td><iframe height="342" frameborder="0" width="410" src="http://docs.google.com/present/embed?id=dgk2ft62_18cvjx4nk4"></iframe></td>
</tr>
</tbody>
</table>
<p></center></p>
<p>&quot;<a href="http://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html">The Internet is a big place</a>; new content is being created all the time,&quot; says Google Webmaster Trends Analyst Susan Moskwa. &quot;Google has a finite number of resources, so when faced with the nearly-infinite quantity of content that&#8217;s available online, Googlebot is only able to find and crawl a percentage of that content. Then, of the content we&#8217;ve crawled, we&#8217;re only able to index a portion.&quot;</p>
<p>&quot;URLs are like the bridges between your website and a search engine&#8217;s crawler: crawlers need to be able to find and cross those bridges (i.e., find and crawl your URLs) in order to get to your site&#8217;s content,&quot; continues Moskwa. &quot;If your URLs are complicated or redundant, crawlers are going to spend time tracing and retracing their steps; if your URLs are organized and lead directly to distinct content, crawlers can spend their time accessing your content rather than crawling through empty pages, or crawling the same content over and over via different URLs.&quot;</p>
<p>If you want to get crawled faster by Google, you should <strong>remove user-specific details from URLs.</strong> Specifics of this can be viewed in the slideshow.&nbsp; Basically, URL parameters that don&#8217;t change the content of the page, should be removed and put into a cookie. This will reduce the number of URLs that point to the same content, and speed up crawling. </p>
<p>Google says <strong>infinite spaces are a waste of time and bandwidth for all</strong>, which is why you should consider taking action when you have calendars that link to infinite numbers of past/future dates with unique URLs, or other paginated data. </p>
<p><strong>Tell Google to ignore pages it can&#8217;t crawl</strong>. This includes things like log-in pages, contact forms, shopping carts, and other pages that require users to perform actions that crawlers can&#8217;t perform themselves. You can do this with the robots.txt file. </p>
<p>Finally, <strong>avoid duplicate content</strong> when possible. Google likes to have one URL for each piece of content. They do recognize that this is not always possible though (because of content management systems and what have you), which is why the <a href="http://videos.webpronews.com/2009/02/13/breaking-news-matt-cutts-explains-canonical-tag-from-google-yahoo-microsoft/">canonical link element</a> exists to let you specify the preferred URL for a particular piece of content.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/tips-for-getting-crawled-faster-by-google-2009-08/feed</wfw:commentRss>
		<slash:comments>28</slash:comments>
		</item>
		<item>
		<title>Love Letters from Googlebot</title>
		<link>http://www.webpronews.com/love-letters-from-googlebot-2008-12</link>
		<comments>http://www.webpronews.com/love-letters-from-googlebot-2008-12#comments</comments>
		<pubDate>Thu, 04 Dec 2008 00:22:00 +0000</pubDate>
		<dc:creator>Chris Crum</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Googlebot]]></category>
		<category><![CDATA[if-modified-since]]></category>
		<category><![CDATA[Redirects]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=47902</guid>
		<description><![CDATA[<p>In a mildly weird fashion, Google <a href="http://googlewebmastercentral.blogspot.com/2008/11/date-with-googlebot-part-ii-http-status.html">answered some questions</a> regarding HTTP status codes and &#34;if-modified-since&#34; from users who were either made up or had their names changed to protect their anonymity. Names like Little Jimmy, Temp O'Rary, Janet Crinklenose, and Frankie O'Fore (my apologies to any Janet Crinklenoses who may be out there, but I think I'm safe in my assumption that the name is fictitious).</p>]]></description>
			<content:encoded><![CDATA[<p>In a mildly weird fashion, Google <a href="http://googlewebmastercentral.blogspot.com/2008/11/date-with-googlebot-part-ii-http-status.html">answered some questions</a> regarding HTTP status codes and &quot;if-modified-since&quot; from users who were either made up or had their names changed to protect their anonymity. Names like Little Jimmy, Temp O&#8217;Rary, Janet Crinklenose, and Frankie O&#8217;Fore (my apologies to any Janet Crinklenoses who may be out there, but I think I&#8217;m safe in my assumption that the name is fictitious).</p>
<p> <center><img title="Janet Crinklenose" alt="Janet Crinklenose" src="http://images.ientrymail.com/webpronews/article_pics/janet-crinklenose.jpg" /></center>
<p>&quot;Sorry, guys &#8211; the fluffy distraction of the dating theme stopped me reading it. I&#8217;ll find another, concise article to read about this,&quot; says <a href="http://googlewebmastercentral.blogspot.com/2008/11/date-with-googlebot-part-ii-http-status.html?showComment=1227799980000#c5910895191344226697">one comment</a> on Google&#8217;s post. I guess this isn&#8217;t the one he&#8217;s looking for since I&#8217;m already in my second paragraph about it. But point taken. <b>Let&#8217;s try to pick through this nonsense. </b></p>
<p> One letter talks about cleaning up a site, deleting some old pages, and whether or not 404 pages are ok. &quot;404s are the standard way of telling me that a page no longer exists,&quot; says&#8230;.umm, the GoogleBot. &quot;I won&#8217;t be upset&mdash;it&#8217;s normal that old pages are pruned from websites, or updated to fresher content. Most websites will show a handful of 404s in the Crawl Diagnostics over at Webmaster Tools. It&#8217;s really not a big deal. As long as you have good site architecture with links to all your indexable content, I&#8217;ll be happy, because it means I can find everything I need.&quot;</p>
<p><center><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_o5Na_9269nA/SS3KqnZGEDI/AAAAAAAAB48/HXXWkN5hha4/s1600-h/googlebot-304.jpg"><img width="400" height="300" border="0" style="cursor: pointer;" src="http://4.bp.blogspot.com/_o5Na_9269nA/SS3KqnZGEDI/AAAAAAAAB48/HXXWkN5hha4/s400/googlebot-304.jpg" alt="" id="BLOGGER_PHOTO_ID_5273093572198273074" /></a></center>
<p>The post then goes on to address similar items like 301 and 302 redirects for pages linked to by ohers that no longer exist or have been moved, and dynamic pages with changing content.&nbsp; The GoogleBot says for example: </p>
<p> <i>&quot;Once you&#8217;re indexed, it&#8217;s the polite way to tell your visitors that your address is still the right one, but that the content can temporarily be found elsewhere. In these situations, a 302 (or the rarer &#8217;307 Temporary Redirect&#8217;) would be better. For example, orkut redirects from http://orkut.com to http://google.com/accounts/login?service=orkut, which isn&#8217;t a page that humans would find particularly useful when searching for Orkut***.</p>
<p> It&#8217;s on a different domain, for starters. So, a 302 has been used to tell me that all the content and linking properties of the URL shouldn&#8217;t be updated to the target &#8211; it&#8217;s just a temporary page.</i></p>
<p> You can find more info in the post like how supporting the &quot;If-Modified-Since&quot; header and returning 304 can save bandwidth, and you may find the answers to redirect-related questions you have. That is if you can stomach the presentation of the information.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/love-letters-from-googlebot-2008-12/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Google Provides Valuable Information On Webserving Techniques</title>
		<link>http://www.webpronews.com/google-provides-valuable-information-on-webserving-techniques-2008-06</link>
		<comments>http://www.webpronews.com/google-provides-valuable-information-on-webserving-techniques-2008-06#comments</comments>
		<pubDate>Tue, 03 Jun 2008 20:49:35 +0000</pubDate>
		<dc:creator>Navneet Kaushal</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Googlebot]]></category>
		<category><![CDATA[IP Delivery]]></category>
		<category><![CDATA[Webserving]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=45697</guid>
		<description><![CDATA[<p>At the <a onclick="javascript:urchinTracker('/outbound/googlewebmastercentral.blogspot.com/2008/06/how-google-defines-ip-delivery.html?ref=http_//www.google.com/reader/view/?hl=en');" href="http://googlewebmastercentral.blogspot.com/2008/06/how-google-defines-ip-delivery.html" linkindex="20" set="yes"><u>Google Webmaster Central Blog</u></a>, Google has released some valuable information about webserving techniques, especially related to Googlebot.]]></description>
			<content:encoded><![CDATA[<p>At the <a onclick="javascript:urchinTracker('/outbound/googlewebmastercentral.blogspot.com/2008/06/how-google-defines-ip-delivery.html?ref=http_//www.google.com/reader/view/?hl=en');" href="http://googlewebmastercentral.blogspot.com/2008/06/how-google-defines-ip-delivery.html" linkindex="20" set="yes"><u>Google Webmaster Central Blog</u></a>, Google has released some valuable information about webserving techniques, especially related to Googlebot. This post has been written keeping in mind the numerous information requests that Google had received for IP Delivery, Geo Location and Cloaking techniques.</p>
<p> <center><br /> <object width="425" height="355"><param value="http://www.youtube.com/v/XWfqyy7J34s&amp;hl=en" name="movie" /><param value="transparent" name="wmode" /><embed width="425" height="355" wmode="transparent" type="application/x-shockwave-flash" src="http://www.youtube.com/v/XWfqyy7J34s&amp;hl=en"></embed></object> </center>
<ol>
<li><strong>Geolocation:</strong> It is the process of serving targeted or different content to users on the basis of their locations. Webmasters have the tools to determine a user&#8217;s location from preferences stored in their cookies. This information is related to the user&#8217;s login or their IP address. Such as, if your website is about theater, then you can always use geolocation techniques to highlight Broadway for a user in New York.</li>
<li><strong>IP Delivery:</strong> It is the process of serving targeted or different content to users on the basis of their IP address. IP addresses are meant to provide geographic information. IP delivery is quite similar to geolocation, therefore, the techniques are almost the same.</li>
<li><strong>Cloaking:</strong> It is the method (unethical though) of serving different content to users than to Googlebot. However, this step is considered to be unethical and Google Webmaster Guidelines prohibit Webmasters from using it. If the file that Googlebot crawls is different from the file served to the user then a Webmaster is coined as being in a high-risk category. A program such as md5sum or diff can compute a hash to verify that two different files are identical.</li>
<li><strong>First Click Free:</strong> If the Webmasters follow Google <a onclick="javascript:urchinTracker('/outbound/googlenewsblog.blogspot.com/2007/09/first-click-free.html?ref=http_//www.google.com/reader/view/?hl=en');" href="http://googlenewsblog.blogspot.com/2007/09/first-click-free.html" linkindex="21"><u>First Click Free Policy</u></a>. Then they would be able to include their premium or subscription-based content in Google&#8217;s websearch index without violating Google&#8217;s quality guidelines. Webmasters can allow all users who find their page using Google search to see the full text of the document, even if they have not registered or subscribed. The user&#8217;s first click to the content area is free. But, if the user jumps to another section of the website, then a Webmaster can block the user&#8217;s access to the premium or subscribed content with a login or a payment request.</li>
</ol>
<p>There is also a thread at the <a onclick="javascript:urchinTracker('/outbound/groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/dec3f461bec0f6ab?ref=http_//www.google.com/reader/view/?hl=en');" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/dec3f461bec0f6ab#"><u>Webmaster Help Group</u></a>, that would be quite interesting for all the Webmasters out there.</p>
<p><a href="http://www.pagetrafficblog.com/googles-explains-ip-delievery-geo-location-cloaking/4661/">Comments</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/google-provides-valuable-information-on-webserving-techniques-2008-06/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Googlebot Gets Candid</title>
		<link>http://www.webpronews.com/googlebot-gets-candid-2008-03</link>
		<comments>http://www.webpronews.com/googlebot-gets-candid-2008-03#comments</comments>
		<pubDate>Fri, 07 Mar 2008 20:47:49 +0000</pubDate>
		<dc:creator>Navneet Kaushal</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Google Central Webmaster blog]]></category>
		<category><![CDATA[Googlebot]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=44427</guid>
		<description><![CDATA[<p>Googlebot is like a dream which knows us all , , and soul. Here in this interview, Maile Ohye as the website and Jeremy Lilley as the Googlebot from <a onclick="javascript:urchinTracker('/outbound/googlewebmastercentral.blogspot.com/2008/03/first-date-with-googlebot-headers-and.html?ref=/');" href="http://googlewebmastercentral.blogspot.com/2008/03/first-date-with-googlebot-headers-and.html"><u>Google Central Webmaster blog</u></a> would answer all those questions that you ever had.</p> <p><strong>Question &#38; Answers</strong></p> <p><strong>Googlebot:</strong> ACK</p>]]></description>
			<content:encoded><![CDATA[<p>Googlebot is like a dream which knows us all , , and soul. Here in this interview, Maile Ohye as the website and Jeremy Lilley as the Googlebot from <a onclick="javascript:urchinTracker('/outbound/googlewebmastercentral.blogspot.com/2008/03/first-date-with-googlebot-headers-and.html?ref=/');" href="http://googlewebmastercentral.blogspot.com/2008/03/first-date-with-googlebot-headers-and.html"><u>Google Central Webmaster blog</u></a> would answer all those questions that you ever had.</p>
<p><strong>Question &amp; Answers</strong></p>
<p><strong>Googlebot:</strong> ACK</p>
<p><strong>Website:</strong> Googlebot, you&#8217;re here!</p>
<p><strong>Googlebot:</strong> I am.</p>
<p>GET / HTTP/1.1</p>
<p>Host: example.com</p>
<p>Connection: Keep-alive</p>
<p>Accept: */*</p>
<p>From: googlebot(at)googlebot.com</p>
<p>User-Agent: Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)</p>
<p>Accept-Encoding: gzip,deflate</p>
<p><strong>Website:</strong> Those headers are so flashy! Would you crawl with the same headers if my site were in the U.S., Asia or Europe? Do you ever use different headers?</p>
<p><strong>Googlebot:</strong> My headers are typically consistent world-wide. I&#8217;m trying to see what a page looks like for the default language and settings for the site. Sometimes the User-Agent is different, for instance AdSense fetches use &quot;Mediapartners-Google&quot;:</p>
<p>User-Agent: Mediapartners-Google</p>
<p>Or for image search:</p>
<p>User-Agent: Googlebot-Image/1.0</p>
<p>Wireless fetches often have carrier-specific user agents, whereas Google Reader RSS fetches include extra info such as number of subscribers.</p>
<p>I usually avoid cookies (so no &quot;Cookie:&quot; header) since I don&#8217;t want the content affected too much by session-specific info. And, if a server uses a session id in a dynamic URL rather than a cookie, I can usually figure this out, so that I don&#8217;t end up crawling your same page a million times with a million different session ids.</p>
<p><strong>Website:</strong> I&#8217;m very complex. I have many file types. Your headers say &quot;Accept: */*&quot;. Do you index all URLs or are certain file extensions automatically filtered?</p>
<p><strong>Googlebot:</strong> That depends on what I&#8217;m looking for.</p>
<p>If I&#8217;m indexing for regular web search, and I see links to MP3s and videos, I probably won&#8217;t download those. Similarly, if I see a JPG, I will treat it differently than an HTML or PDF link. For instance, JPG is much less likely to change frequently than HTML, so I will check the JPG for changes less often to save bandwidth. Meanwhile, if I&#8217;m looking for links as Google Scholar, I&#8217;m going to be far more interested in the PDF article than the JPG file. Downloading doodles (like JPGs) and videos of skateboarding dogs is distracting for a scholar&mdash;do you agree?</p>
<p><strong>Website:</strong> Yes, they can be distracting. I&#8217;m in awe of your dedication. I love doodles (JPGs) and find them hard to resist.</p>
<p><strong>Googlebot:</strong> Me, too; I&#8217;m not always so scholarly. When I crawl for image search, I&#8217;m very interested in JPGs. And for news, I&#8217;m mostly looking at HTML and nearby images.</p>
<p>There are also plenty of extensions (exe, dll, zip, dmg&hellip;), that tend to be big and less useful for a search engine.</p>
<p><strong>Website:</strong> If you saw my URL, http://www.example.com/page1.LOL111, would you (whimper whimper) reject it just because it contains an unknown file extension?</p>
<p><strong>Googlebot:</strong> Website, let me give a bit more background. After actually downloading a file, I use the Content-Type header to check whether it really is HTML, an image, text, or something else. If it&#8217;s a special data type like a PDF file, Word document, or Excel spreadsheet, I&#8217;ll make sure it&#8217;s in the valid format and extract the text content. Maybe it has a virus; you never know. If the document or data type is really garbled, there&#8217;s usually not much to do besides discard the content.</p>
<p>So, if I&#8217;m crawling http://www.example.com/page1.LOL111 with an unknown file extension, it&#8217;s likely that I would start to download it. If I can&#8217;t figure out the content type from the header, or it&#8217;s a format that we don&#8217;t index (e.g. mp3), then it&#8217;ll be put aside. Otherwise, we proceed indexing the file.</p>
<p><strong>Website:</strong> My apologies for scrutinizing your style, Googlebot, but I noticed your Accept-Encoding headers say:</p>
<p>Accept-Encoding: gzip,deflate</p>
<p>Can you explain these headers to me?</p>
<p><strong>Googlebot:</strong> Sure. All major search engines and web browsers support gzip compression for content to save bandwidth. Other entries that you might see here include &quot;x-gzip&quot; (the same as &quot;gzip&quot;), &quot;deflate&quot; (which we also support), and &quot;identity&quot; (none).</p>
<p><strong>Website:</strong> Can you talk more about file compression and &quot;Accept-Encoding: gzip,deflate&quot;? Many of my URLs consist of big Flash files and stunning images, not just HTML. Would it help you to crawl faster if I compressed my larger files?</p>
<p><strong>Googlebot:</strong> There&#8217;s not a simple answer to this question. First of all, many file formats, such as swf (Flash), jpg, png, gif, and pdf are already compressed (there are also specialized Flash optimizers).</p>
<p><strong>Website:</strong> Perhaps I&#8217;ve been compressing my Flash files and I didn&#8217;t even know? I&#8217;m obviously very efficient.</p>
<p><strong>Googlebot:</strong> Both Apache and IIS have options to enable gzip and deflate compression, though there&#8217;s a CPU cost involved for the bandwidth saved. Typically, it&#8217;s only enabled for easily compressible text HTML/CSS/PHP content. And it only gets used if the user&#8217;s browser or I (a search engine crawler) allow it. Personally, I prefer &quot;gzip&quot; over &quot;deflate&quot;. Gzip is a slightly more robust encoding &mdash; there is consistently a checksum and a full header, giving me less guess-work than with deflate. Otherwise they&#8217;re very similar compression algorithms.</p>
<p>If you have some spare CPU on your servers, it might be worth experimenting with compression (links: Apache, IIS). But, if you&#8217;re serving dynamic content and your servers are already heavily CPU loaded, you might want to hold off.</p>
<p><strong>Website:</strong> Great information. I&#8217;m really glad you came tonight &mdash; thank goodness my robots.txt allowed it. That file can be like an over-protective parent!</p>
<p><strong>Googlebot:</strong> Ah yes; meeting the parents, the robots.txt. I&#8217;ve met plenty of crazy ones. Some are really just HTML error pages rather than valid robots.txt. Some have infinite redirects all over the place, maybe to totally unrelated sites, while others are just huge and have thousands of different URLs listed individually. Here&#8217;s one unfortunate pattern. The site is normally eager for me to crawl:</p>
<p>User-Agent: *</p>
<p>Allow: /</p>
<p>Then, during a peak time with high user traffic, the site switches the robots.txt to something restrictive:</p>
<p># Can you go away for a while? I&#8217;ll let you back</p>
<p># again in the future. Really, I promise!</p>
<p>User-Agent: *</p>
<p>Disallow: /</p>
<p>The problem with the above robots.txt file-swapping is that once I see the restrictive robots.txt, I may have to start throwing away content I&#8217;ve already crawled in the index. And then I have to recrawl a lot of content once I&#8217;m allowed to hit the site again. At least a 503 response code would&#8217;ve been temporary.</p>
<p>I typically only re-check robots.txt once a day (otherwise on many virtual hosting sites, I&#8217;d be spending a large fraction of my fetches just getting robots.txt, and no date wants to &quot;meet the parents&quot; that often). For webmasters, trying to control crawl rate through robots.txt swapping usually backfires. It&#8217;s better to set the rate to &quot;slower&quot; in Webmaster Tools.</p>
<p><strong>Googlebot:</strong> Website, thanks for all of your questions, you&#8217;ve been wonderful, but I&#8217;m going to have to say &quot;FIN, my love.&quot;</p>
<p><strong>Website:</strong> Oh, Googlebot&hellip; ACK/FIN. <img class="wp-smiley" alt=":)" src="http://www.pagetrafficblog.com/wp-includes/images/smilies/icon_smile.gif" /></p>
<p>Also, for more information on Googlebot read- <a href="http://www.pagetrafficblog.com/is-googlebot-crawling-css-files/1086/"><u>Is GoogleBot Crawling CSS Files</u></a>?</p>
<p><a href="http://www.pagetrafficblog.com/googlebot-gets-candid-now-all-that-you-would-like-know/4330/">Comments</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/googlebot-gets-candid-2008-03/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Some New Tags To Play With</title>
		<link>http://www.webpronews.com/some-new-tags-to-play-with-2007-07</link>
		<comments>http://www.webpronews.com/some-new-tags-to-play-with-2007-07#comments</comments>
		<pubDate>Fri, 13 Jul 2007 19:28:49 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Dan Crow]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Googlebot]]></category>
		<category><![CDATA[HTML]]></category>
		<category><![CDATA[index]]></category>
		<category><![CDATA[Jill Whalen]]></category>
		<category><![CDATA[Tags]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=39134</guid>
		<description><![CDATA[<p>HighRankings.com's Jill Whalen broke some interesting news about tags webmasters can use to control how their content is shown in Google results. The information comes from Dan Crow, director of crawl systems at Google, speaking at the Search Engine Marketing New England event in Rhode Island. <br />
]]></description>
			<content:encoded><![CDATA[<p>HighRankings.com&#8217;s Jill Whalen broke some interesting news about tags webmasters can use to control how their content is shown in Google results. The information comes from Dan Crow, director of crawl systems at Google, speaking at the Search Engine Marketing New England event in Rhode Island. <br />
<span id="more-39134"></span> <br />
Though there was a lot mentioned in <a title="HighRankings.com" href="http://www.highrankings.com/advisor/getting-into-google/">Jill&#8217;s post</a>, including information about PageRank, robots.txt, Webmaster Central tools, Sitemaps, Flash and Ajax, the little-discussed tags already available and a new one soon to be available caught our eye. </p>
<p><strong> Lay off my snippet:</strong> Webmasters can prevent Google from showing certain text in the search results by using a simple &quot;nosnippet&quot; tag. Not that you wouldn&#8217;t want your snippets showing &ndash; just that there may be a reason for it out there somewhere. <br />
<strong><br />
Caching out:</strong> The &quot;noarchive&quot; tag stops Google from showing a cached version of a webpage. <br />
<strong><br />
Limited time only:</strong> Jill seemed pretty excited about a forthcoming tag from Google called &quot;unavailable_after&quot; which sets a self-destruct timer on page information. This is good for coupons, limited time offers, articles intended to be viewed as previews only. </p>
<p>&nbsp;</p></p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/some-new-tags-to-play-with-2007-07/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Controlling How Your Site Is Indexed</title>
		<link>http://www.webpronews.com/controlling-how-your-site-is-indexed-2007-02</link>
		<comments>http://www.webpronews.com/controlling-how-your-site-is-indexed-2007-02#comments</comments>
		<pubDate>Mon, 26 Feb 2007 17:00:46 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Delicious]]></category>
		<category><![CDATA[Digg]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Googlebot]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[Robots.txt]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=35593</guid>
		<description><![CDATA[Over at the Google Blog, Dan Crow is publishing a series of how-to's about controlling what the Googlebot says about your site through the Robots Exclusion Protocol. This is the cheap-and-easy version. <br />
<br />
<strong>Don't Index That Link!</strong><br />
]]></description>
			<content:encoded><![CDATA[<p>Over at the Google Blog, Dan Crow is publishing a series of how-to&#8217;s about controlling what the Googlebot says about your site through the Robots Exclusion Protocol. This is the cheap-and-easy version. </p>
<p><strong>Don&#8217;t Index That Link!</strong></p>
<p>Crow says if you don&#8217;t want the links on specific page indexed, you can use the NOINDEX tag for that page. If, however, doing this requires you to continually add NOINDEX and remove it, say for a continually updated and redirected news page, then you may want to go another way and save yourself some grief. If that news page is found through a gateway page, add a NOFOLLOW tag to the entry page instead so the Googlebot will stay put. </p>
<p>The code should look like this: &lt;META NAME=&quot;ROBOTS&quot; CONTENT=&quot;NOFOLLOW&quot;&gt;</p>
<p><strong>Don&#8217;t Say That! </strong></p>
<p>Sometimes webmasters don&#8217;t want cached versions of webpages showing up in the search results, especially on the chance that the information is dated, or has been updated. Sometimes, webmasters don&#8217;t want the <a href="http://www.webpronews.com/insiderreports/2007/01/18/search-top-is-the-new-top">usually-important</a> &quot;snippet&quot; displayed in the results either. </p>
<p>To keep the Googlebot from creating a cached version on Google&#8217;s servers, Crow says to use the NOARCHIVE tag.</p>
<p>It looks like this: &lt;META NAME=&quot;GOOGLEBOT&quot; CONTENT=&quot;NOARCHIVE&quot;&gt;</p>
<p>For snippets, a NOSNIPPET tag is useful, and kills two birds at once. The NOSNIPPET tag automatically prevents archiving as well. </p>
<p>That code is: &lt;META NAME=&quot;GOOGLEBOT&quot; CONTENT=&quot;NOSNIPPET&quot;&gt;</p>
<p>For the long version, click <a href="http://googleblog.blogspot.com/2007/02/robots-exclusion-protocol.html">here</a>. For the first robots.txt tutorial, click <a href="http://googleblog.blogspot.com/2007/01/controlling-how-search-engines-access.html">here</a>.   </p>
<p>
Add to <a class="printMailTop" false="" return="" window.open="" onclick="window.open('http://del.icio.us/post?v=4&amp;partner=wpn&amp;noui&amp;jump=close&amp;url='+encodeURIComponent(location.href)+'&amp;title='+encodeURIComponent(document.title),'delicious','toolbar=no,width=700,height=400');<br />
return _fcksavedurl=http://del.icio.us/post<br />
onclick=" href="http://del.icio.us/post"><img border="0" src="http://images.ientrymail.com/webpronews/delicious-pic.png" alt="" /> Del.icio.us</a> | <a href="javascript:void<br />
window.open('http://digg.com/submit?phase=2&amp;url='+encodeURIComponent(window.<br />
location.href)+'&amp;ei=UTF-8','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)"><img border="0" src="http://images.ientrymail.com/webpronews/digg-pic.png" alt="" /> Digg</a> | <a href="javascript:location.href='http://reddit.com/submit?url='+encodeURIComponent(location.href)+'&amp;title='+encodeURIComponent(document.title)"><img border="0" src="http://images.ientrymail.com/webpronews/reddit.png" alt="" />Reddit</a> | <a href="javascript:location.href='http://www.furl.net/storeIt.jsp?u='+encodeURIComponent(document.location.href)+'&amp;t='+encodeURIComponent(document.title)+<br />
' '"><img border="0" src="http://images.ientrymail.com/webpronews/furl-pic.png" alt="" /> Furl</a> </p>
<p>Bookmark WebProNews: <a href="http://www.webpronews.com"><img border="0" src="http://images.ientrymail.com/webpronews/wpn-readit.jpg" alt="" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/controlling-how-your-site-is-indexed-2007-02/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Get That Site Back Up Before It&#8217;s Delisted</title>
		<link>http://www.webpronews.com/get-that-site-back-up-before-its-delisted-2007-02</link>
		<comments>http://www.webpronews.com/get-that-site-back-up-before-its-delisted-2007-02#comments</comments>
		<pubDate>Tue, 20 Feb 2007 21:43:27 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[De-listing]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Googlebot]]></category>
		<category><![CDATA[Matt Cutts]]></category>
		<category><![CDATA[Pages]]></category>
		<category><![CDATA[Site Downtime]]></category>
		<category><![CDATA[Vanessa Fox]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=35403</guid>
		<description><![CDATA[If you're site goes down for whatever reason, get it back up quickly or risk being de-listed from Google's index. <br />
<br />
Though it has been thought there may be somewhat of a grace period, it now seems robots only notice what's there when they arrive. <br />
]]></description>
			<content:encoded><![CDATA[<p>If you&#8217;re site goes down for whatever reason, get it back up quickly or risk being de-listed from Google&#8217;s index. </p>
<p>Though it has been thought there may be somewhat of a grace period, it now seems robots only notice what&#8217;s there when they arrive. </p>
<p>SEORoundtable points us to a webmaster struggling with a hosting company&#8217;s reliability raised the question in a Google Groups<a href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/d5a24e4fc5f04261/"> thread</a> about why the webmaster&#8217;s well-ranking site had disappeared from the search results. </p>
<p>Google&#8217;s Vanessa Fox responded verifying that the site had indeed been de-listed: </p>
<blockquote><p><em>If the host is down when Googlebot tries to access your pages, then those pages may disappear from the index until Googlebot can crawl them again. . .</em></p>
<p><em>Googlebot can&#8217;t know if or when a page will return when it gets an error response (whether that&#8217;s a network down, 404, or other error) and since our primary aim is to have quality search results for users, we don&#8217;t want to keep pages that return these types of errors in our index. </em></p>
<p><em>The best way of handling these situations, therefore, is to remove the pages, but continue to try to access them, then return them to the index once we get a valid server response.</em>
</p></blockquote>
<p>
How long it takes before a site to de-listed is unclear, but Fox confirmed that Googlebot would make a few passes before declaring it dead. </p>
<p>This is not a case for a re-inclusion request, however, as once the site is back up and running, it will be crawled again &ndash; though no mention was made as to whether rankings would be restored. </p>
<p>&quot;[I]t does make sense for Google to remove the site, because it may never come back,&quot; said <a href="http://www.seroundtable.com/archives/012437.html">a commentator</a> who really, really wants you to know his name is Vince. </p>
<p>Matt Cutts chimes in and agrees with the Vincemeister: </p>
<blockquote><p><em>VinceVinceVince makes a good point: sometimes temporarily down pages turn into truly-gone-forever pages, so we have to drop those pages at some point.</em></p>
<p><em>But it&#8217;s also true that we go back and revisit those pages pretty often and try to re-crawl them in case the site comes back up. </em>
</p></blockquote>
<p>
Add to <a href="http://del.icio.us/post" onclick="window.open('http://del.icio.us/post?v=4'amp;partner=wpn'amp;noui'amp;jump=close'amp;url='+encodeURIComponent(location.href)+''amp;title='+encodeURIComponent(document.title),'delicious','toolbar=no,width=700,height=400');<br />
return _fcksavedurl=http://del.icio.us/post<br />
onclick=" window.open="" return="" false="" class="printMailTop"><img border="0" alt="" src="http://images.ientrymail.com/webpronews/delicious-pic.png" /> Del.icio.us</a> | <a href="javascript:void<br />
window.open('http://digg.com/submit?phase=2&amp;url='+encodeURIComponent(window.<br />
location.href)+'&amp;ei=UTF-8','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)"><img border="0" alt="" src="http://images.ientrymail.com/webpronews/digg-pic.png" /> Digg</a> | <a href="javascript:location.href='http://reddit.com/submit?url='+encodeURIComponent(location.href)+'&amp;title='+encodeURIComponent(document.title)"><img border="0" alt="" src="http://images.ientrymail.com/webpronews/reddit.png" />Reddit</a> | <a href="javascript:location.href='http://www.furl.net/storeIt.jsp?u='+encodeURIComponent(document.location.href)+'&amp;t='+encodeURIComponent(document.title)+<br />
' '"><img border="0" alt="" src="http://images.ientrymail.com/webpronews/furl-pic.png" /> Furl</a> </p>
<p>
Bookmark WebProNews: <a href="http://www.webpronews.com"><img border="0" alt="" src="http://images.ientrymail.com/webpronews/wpn-readit.jpg" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/get-that-site-back-up-before-its-delisted-2007-02/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using memcached
Database Caching 1/45 queries in 0.024 seconds using memcached
Object Caching 651/764 objects using memcached

Served from: webpronews.com @ 2012-02-12 21:13:47 -->
