<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>WebProNews &#187; spiders</title>
	<atom:link href="http://www.webpronews.com/tag/spiders/feed" rel="self" type="application/rss+xml" />
	<link>http://www.webpronews.com</link>
	<description>Breaking News in Tech, Search, Social, &#38; Business</description>
	<lastBuildDate>Mon, 13 Feb 2012 04:32:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>The Science of Spider Webs</title>
		<link>http://www.webpronews.com/the-science-of-spider-webs-2012-02</link>
		<comments>http://www.webpronews.com/the-science-of-spider-webs-2012-02#comments</comments>
		<pubDate>Thu, 02 Feb 2012 14:20:21 +0000</pubDate>
		<dc:creator>Billy Thomas</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Arachnophobia]]></category>
		<category><![CDATA[Science]]></category>
		<category><![CDATA[Silk]]></category>
		<category><![CDATA[spiders]]></category>
		<category><![CDATA[Webs]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=94627</guid>
		<description><![CDATA[Arachnophobia; the fear of spiders. Most people are afraid of these little guys for varying reasons; some are poisonous and some are just creepy looking. However, these 8-legged architects can build some astonishing structures using their silk. Spider silk is &#8230;]]></description>
			<content:encoded><![CDATA[<p>Arachnophobia; the fear of spiders. Most people are afraid of these little guys for varying reasons; some are poisonous and some are just creepy looking. However, these 8-legged architects can build some astonishing structures using their silk. </p>
<p>Spider silk is a versatile material. For example, the single thread the spider uses to drop down, pull it and it stretches, but with no outside forces acting on it, it stiffens to its original state.</p>
<p>How does this work? You may ask, well, <a href="http://news.sciencemag.org/sciencenow/2012/02/how-to-build-a-hardy-web.html?ref=hp">According to one source</a>, during the elastic stage of the web, the proteins in spider silk are pressed into finite folded structures. So when the spider decides to drop, the structures unfurl, when the folds are gone, the proteins reconfigure into the tough structures referred to as beta-sheet nanocrystals.</p>
<p><iframe width="616" height="462" src="http://www.youtube.com/embed/U6IigFWe7pU?fs=1&#038;feature=oembed" frameborder="0" allowfullscreen></iframe></p>
<p>Scientists performed a study to see how molecular properties impact the entire web. So, a group of researchers sought out a a basic web built by your every day casual spider. With the spider still on the web, the team hung tiny metal wires on the web to simulate an insect caught in the web. When they pulled the wires, the individual spokes of the web broke, but the other threads, acting as almost support structures, didn&#8217;t break.</p>
<p>The results of the experiment really show off the unique ecology of these arachnids. Spider silk is one of the most scientifically interesting natural materials. Other creatures that spin silk, such as silk worms, their silk is made to be more stretchy than anything. A spider&#8217;s silk, however, can be elastic and incredibly strong at different times.</p>
<p>Needless to say spider webs are incredible structures and a useful hunting tool for the creatures. The architecture and pure aesthetic beauty of the web is just an added bonus. Spiders may be creepy to you, but they sure are smart arachnids.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/the-science-of-spider-webs-2012-02/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How Many Spiders Does Google Have?</title>
		<link>http://www.webpronews.com/how-many-spiders-does-google-have-2010-02</link>
		<comments>http://www.webpronews.com/how-many-spiders-does-google-have-2010-02#comments</comments>
		<pubDate>Wed, 03 Feb 2010 22:13:51 +0000</pubDate>
		<dc:creator>Chris Crum</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Matt Cutts]]></category>
		<category><![CDATA[Robots]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spiders]]></category>
		<category><![CDATA[Videos]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=52967</guid>
		<description><![CDATA[<p>Google has posted a short but interesting video to its <a href="http://www.youtube.com/watch?v=usyYXNNBRjc&#38;feature=youtube_gdata">Webmaster Central YouTube channel</a>. A user asked the question, &#34;How many bots/spiders does Google currently have crawling the web?&#34; and Google's Matt Cutts gave his answer. <br />
]]></description>
			<content:encoded><![CDATA[<p>Google has posted a short but interesting video to its <a href="http://www.youtube.com/watch?v=usyYXNNBRjc&amp;feature=youtube_gdata">Webmaster Central YouTube channel</a>. A user asked the question, &quot;How many bots/spiders does Google currently have crawling the web?&quot; and Google&#8217;s Matt Cutts gave his answer. </p>
<p>&quot;It&#8217;s important to realize that it&#8217;s not really actual robots or actual spiders out there&#8230;instead, it&#8217;s banks of machines &#8230;at Google&#8217;s data centers who open up an HTTP connection and request a page and then get it back,&quot; he says. &quot;So any bank of machines (even 50 machines) could easily be requesting a bunch of different content.&quot;</p>
<p><center></p>
<table>
<tbody>
<tr>
<td><object height="340" width="560"><param value="http://www.youtube.com/v/usyYXNNBRjc&amp;hl=en_US&amp;fs=1&amp;" name="movie" /><param value="true" name="allowFullScreen" /><param value="always" name="allowscriptaccess" /><embed height="340" width="560" allowfullscreen="true" allowscriptaccess="always" type="application/x-shockwave-flash" src="http://www.youtube.com/v/usyYXNNBRjc&amp;hl=en_US&amp;fs=1&amp;"></embed></object></td>
</tr>
</tbody>
</table>
<p></center></p>
<p>&quot;We try to refresh a large fraction of the web every few days,&quot; he adds. &quot;So it turns out you really don&#8217;t need a ton of machines. Even a relatively small amount of machines operating in parallel and fetching pages in parallel can really be able to crawl of find new pages on the web in a very quick way.&quot; </p>
<p>Matt says that Google doesn&#8217;t give out the exact number, but that it&#8217;s somewhere between 25 and 1,000. I&#8217;m not sure what you can really do with that information, but it&#8217;s worth hearing a quick rundown of how it works for those who aren&#8217;t real familiar with how Google indexes content.</p>
<p><strong>Related Articles:</strong></p>
<p><span style="font-family: Arial;"><span style="font-size: larger;">&gt; </span></span><a href="../../../../../../topnews/2009/08/20/google-testing-breadcrumb-display-in-serps" style="color: rgb(0, 105, 210); text-decoration: underline;"><span style="font-family: Arial;"><span style="font-size: larger;">Google Rolls Out Breadcrumb Display in SERPs</span></span></a></p>
<p><span style="font-family: Arial;"><span style="font-size: larger;">&gt; </span></span><a href="../../../../../../topnews/2009/12/02/google-makes-it-easier-to-tell-where-results-originate-from" style="color: rgb(0, 105, 210); text-decoration: underline;"><span style="font-family: Arial;"><span style="font-size: larger;">Google Makes it Easier to Tell Where Results Originate From</span></span></a></p>
<p><span style="font-family: Arial;"><span style="font-size: larger;">&gt; </span></span><a href="../../../../../../topnews/2009/09/25/get-more-links-in-your-actual-google-results" style="color: rgb(0, 105, 210); text-decoration: underline;"><span style="font-family: Arial;"><span style="font-size: larger;">Get More Links in Your Actual Google Results</span></span></a></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/how-many-spiders-does-google-have-2010-02/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Google Indexing Sites in 1 Day Again</title>
		<link>http://www.webpronews.com/google-indexing-sites-in-1-day-again-2008-01</link>
		<comments>http://www.webpronews.com/google-indexing-sites-in-1-day-again-2008-01#comments</comments>
		<pubDate>Mon, 07 Jan 2008 20:31:35 +0000</pubDate>
		<dc:creator>Michael Jensen</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[AdWords]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Robots]]></category>
		<category><![CDATA[Robots.txt]]></category>
		<category><![CDATA[spiders]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=43120</guid>
		<description><![CDATA[<p>I created a new site on Friday, and by Saturday exactly 24 hours later it was in Google&#8217;s Index. I posted about this just over a month ago in my post, <a href="http://www.soloseo.com/blog/2007/11/26/7-steps-to-get-your-new-site-indexed-in-24-hours/" title="7 Steps to Get Your New Site Indexed in 24 Hours">7 Steps to Get Your New Site Indexed in 24 Hours</a>. <br /><br />I had a lot of comments about whether or not Adwords was necessary, so I thought I&#8217;d try it again without running Adwords this time. Here&#8217;s how it all played out:</p>]]></description>
			<content:encoded><![CDATA[<p>I created a new site on Friday, and by Saturday exactly 24 hours later it was in Google&rsquo;s Index. I posted about this just over a month ago in my post, <a href="http://www.soloseo.com/blog/2007/11/26/7-steps-to-get-your-new-site-indexed-in-24-hours/" title="7 Steps to Get Your New Site Indexed in 24 Hours">7 Steps to Get Your New Site Indexed in 24 Hours</a>. </p>
<p>I had a lot of comments about whether or not Adwords was necessary, so I thought I&rsquo;d try it again without running Adwords this time. Here&rsquo;s how it all played out:</p>
<p>1) I created 5 pages of content (Home, FAQ, About Us, etc.).</p>
<p>2) I put them in a simple template with site-wide links. I also linked to it from one of my other sites (it&rsquo;s very relevant so it makes sense).</p>
<p>3) I added tagged the site on only 2 social bookmarking sites.</p>
<p>4) Commented in 1 forum, put the URL in one directory (niche specific), and submitted it to Digg.</p>
<p>5) Installed Google Analytics</p>
<p>6) Created a sitemap, <a href="http://www.soloseo.com/blog/2007/04/12/how-to-configure-sitemap-autodiscovery-in-robots-txt/" title=", pinged Google, and put the sitemap in my Robots.txt.">pinged Google, and put the sitemap in my Robots.txt</a>. Logged into Google Webmaster Central and submitted my sitemap there.</p>
<p>When I checked exactly 24 hours later I was sitting in the index and had already begun to get a few visitors from Google.</p>
<p>I had previously done Google adwords out of both necessity (get quick traffic) but also because of the trust factor I believe it gives to Google, and the fact that Google integrates a quality factor into their quality score (so they come to your site and look at it). Obviously this is just one test compared to several others I&rsquo;ve done with Adwords, but it seems its very possible without running some ads.</p>
<p>Anyone else seeing 24 hour indexing for new sites?</p>
<p><a href="http://www.soloseo.com/blog/2008/01/07/24-hour-indexing-new-sites/#respond">Comments</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/google-indexing-sites-in-1-day-again-2008-01/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How Meta Commands Can Help You Love Spiders</title>
		<link>http://www.webpronews.com/how-meta-commands-can-help-you-love-spiders-2008-01</link>
		<comments>http://www.webpronews.com/how-meta-commands-can-help-you-love-spiders-2008-01#comments</comments>
		<pubDate>Fri, 04 Jan 2008 19:27:11 +0000</pubDate>
		<dc:creator>Scott Buresh </dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Meta]]></category>
		<category><![CDATA[meta commands]]></category>
		<category><![CDATA[NoFollow]]></category>
		<category><![CDATA[Robots]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[spiders]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=43074</guid>
		<description><![CDATA[<o:p> </o:p><p>Nearly all search engines utilize spiders (which are also known by their original name, robots) to go out and scour the web looking for web pages. These search engine spiders then bring the data back to be indexed by the engine.</p>]]></description>
			<content:encoded><![CDATA[<p><o:p> </o:p>
<p>Nearly all search engines utilize spiders (which are also known by their original name, robots) to go out and scour the web looking for web pages. These search engine spiders then bring the data back to be indexed by the engine.</p>
<p><o:p></o:p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>Since roughly 1996, individual meta commands have existed that can be used on individual web pages to modify how these search engine spiders behave. The most useful of these commands are fairly universal and respected by almost all search engines. What follows is a list of some of the more popular spider commands and instances in which you might want to use them. <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p><b>&lt;meta name=&quot;robots&quot; content=&quot;index&quot;&gt;<o:p></o:p></b></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>This meta command is one of the most common ones used &ndash; and it is also the least necessary. It tells search engine spiders to come on in and put the page in their index. However, all search engines do this by default anyway. Basically, if you want to put it in there for fun, be my guest, but this command is not giving you any special treatment. All search engines are going to index your page, unless you specifically tell them otherwise.<o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p><b>&lt;meta name=&quot;robots&quot; content=&quot;follow&quot;&gt;<o:p></o:p></b></p>
<p class="MsoNormal"><b><o:p>&nbsp;</o:p></b>The follow command is different from the index command. It basically requests that the search engine spiders follow the links that are on a particular page. Again, however, this piece of code is completely unnecessary because all search engines are going to follow the links on a page, unless otherwise directed. <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p><b>&lt;meta name=&quot;robots&quot; content=&quot;noindex&quot;&gt;<o:p></o:p></b></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>The noindex command, the opposite of the index command, tells search engine spiders not to index the content of a page. It&#8217;s important to note however that search engine spiders will still follow the links on a page that uses only this command. <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>When not used for legitimate purposes, this tag can be dangerous because it can put you at risk for penalization by most, if not all search engines. This is because you can use a noindex tag to hide pages with multiple links that you don&#8217;t want visitors to see but that you do want all search engines to index. <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>There are however some legitimate uses for the noindex command. For example, if you have a dynamic site and you&#8217;ve created static pages to replace some of your dynamic pages, which can make them easier for search engine spiders to access, you could put a noindex tag on the dynamic version. <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>As Google mentions in its Webmaster Help Center:<o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p><i>&quot;Consider creating static copies of <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=34431">dynamic pages</a>. Although the Google index includes dynamic pages, they comprise a small portion of our index. If you suspect that your dynamically generated pages (such as URLs containing question marks) are causing problems for our crawler, you might create static copies of these pages.&quot;<o:p></o:p></i></p>
<p class="MsoNormal"><i><o:p>&nbsp;</o:p></i>In cases like these, it is acceptable to use the &quot;no index&quot; command on the dynamic version of the page, so that your content will not be treated as duplicate. You are not tricking all search engines, you&#8217;re just redirecting them.<o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p><b>&lt;meta name=&quot;robots&quot; content=&quot;nofollow&quot;&gt;<o:p></o:p></b></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>This tag tells search engine spiders that it&#8217;s OK to go ahead and index a page and list it but that they shouldn&#8217;t follow any of the links that are on the page. This can be useful if, for example, you had some partners that requested a link on your site that you felt obligated to give, but you wanted to hold onto as much Page Rank as possible. Now this is of course between you and your own personal god, but you would be able to in effect have a partners page, add the nofollow attribute to the meta tags, and basically not pass on any of your Page Rank to any of the sites to which you are linking. The nofollow command in effect tells all search engines that this is the end of the line. <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p><b>&lt;meta name=&quot;robots&quot; content=&quot;noindex,nofollow&quot;&gt;<o:p></o:p></b></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>Obviously, noindex and nofollow are powerful tags &ndash; and in combination, they can make a page and the subsequent pages to which it links invisible to nearly all search engines.&nbsp; This combination command tells search engine spiders, &quot;Do not read this page; do not follow any of the links on this page; do not include this page in your index.&quot; <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>This command has its beneficial uses. For example, it can be placed on pages on a site that have duplicate content for legitimate reasons. A website might have both a page for the United States and a page for England that cover the same product with exactly the same content. However, nearly all search engines would see this as duplicate content and could devalue both pages. So placing this command on one of them means that search engine spiders will walk on by and you won&#8217;t be penalized. <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p><b>&lt;meta name=&quot;robots&quot; content=&quot;noarchive&quot;&gt;<o:p></o:p></b></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>Finally, almost all search engines today, including Google and Yahoo, offer a cached version of a site alongside its listing that provides a snapshot of what the page used to look like. The noarchive tag, therefore, is available to be used in circumstances where there is content on your website that is of a timely nature and therefore that you might not necessarily want search engine spiders to cache for people to have access to moving forward. <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p>For example, a business might run a one-time special that has a ridiculously low price to drum up some business while things are slow. The business will want to be able to shut that sale down as soon as sales are back up to a solid level. However, it is conceivable that someone could click on the cached version of the business&#8217;s site, see the old deal that was out there, and insist on getting it for themselves. By using the noarchive tag, you are telling search engine spiders, in effect, &quot;This page is subject to frequent changes, and I don&#8217;t want my visitors to have access to some of this content at a later time.&quot; <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p><b>Conclusion<o:p></o:p></b></p>
<p class="MsoNormal"><b><o:p>&nbsp;</o:p></b>The commands discussed above are just a few of the ones in existence, and new ones are being added frequently. While nearly all search engines support these commands, there are still some that don&#8217;t.&nbsp; The ones in this article, however, are fairly universally understood by search engine spiders, no matter from where they originate. As more universal commands are introduced, I will write about them in future articles.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/how-meta-commands-can-help-you-love-spiders-2008-01/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using memcached
Database Caching 1/23 queries in 0.009 seconds using memcached
Object Caching 365/423 objects using memcached

Served from: webpronews.com @ 2012-02-13 03:02:58 -->
