<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>WebProNews &#187; Hadoop</title>
	<atom:link href="http://www.webpronews.com/tag/hadoop/feed" rel="self" type="application/rss+xml" />
	<link>http://www.webpronews.com</link>
	<description>Breaking News in Tech, Search, Social, &#38; Business</description>
	<lastBuildDate>Sun, 12 Feb 2012 23:07:26 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Yahoo Aims to Mainstream Hadoop with New Security and Workflow Offerings</title>
		<link>http://www.webpronews.com/yahoo-aims-to-mainstream-hadoop-with-new-security-and-workflow-features-2010-06</link>
		<comments>http://www.webpronews.com/yahoo-aims-to-mainstream-hadoop-with-new-security-and-workflow-features-2010-06#comments</comments>
		<pubDate>Tue, 29 Jun 2010 14:42:18 +0000</pubDate>
		<dc:creator>Chris Crum</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[enterprise]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Security]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=54478</guid>
		<description><![CDATA[<p>Yahoo made a significant announcement at its <a href="http://developer.yahoo.com/events/hadoopsummit2010/">Hadoop Summit</a> today. The company says it's made significant enhancements to <a href="http://hadoop.apache.org/">the open source software</a>, accelerating the potential for enterprise-wide adoption by mainstream businesses. <br />
]]></description>
			<content:encoded><![CDATA[<p>Yahoo made a significant announcement at its <a href="http://developer.yahoo.com/events/hadoopsummit2010/">Hadoop Summit</a> today. The company says it&#8217;s made significant enhancements to <a href="http://hadoop.apache.org/">the open source software</a>, accelerating the potential for enterprise-wide adoption by mainstream businesses. </p>
<p>&quot;Hadoop is where science meets big data &ndash; it&#8217;s the technical underpinning that powers our innovative consumer and advertiser products on the world&#8217;s most-advanced digital canvas,&quot; says Blake Irving, Yahoo Executive Vice President and Chief Product Officer. &quot;Yahoo!&rsquo;s cloud and Hadoop make it possible for Yahoo! to rapidly personalize our content and advertising, and deliver highly relevant experiences, while maintaining the trust of our 600 million users.&quot;</p>
<p><img align="right" src="http://images.ientrymail.com/webpronews/article_pics/hadoop.jpg" alt="Apache Hadoop" title="Apache Hadoop" style="margin: 10px;" />Yahoo says Hadoop plays a key role in its home page, Yahoo Search, Yahoo Mail, and other properties.</p>
<p>&quot;Businesses across all sectors are looking for ways to leverage the vast quantities of data they are accumulating, and Apache Hadoop is an efficient solution for processing data at scale,&quot; says Melanie Posey, research director at <a href="http://www.idc.com/">IDC Research</a>. &quot;Now organizations of various sizes can leverage Yahoo!&#8217;s Hadoop investment and deployments to run it on their own systems and build out their own Hadoop deployments without starting from scratch on internal science experiments.&quot;</p>
<p>Specifically, Yahoo announced the beta release of Hadoop with Security and Oozie, the company&#8217;s workflow engine for Hadoop. This means enterprises will benefit from better controls for managing business-sensitive data, according to the company.</p>
<p><font size="2" face="Arial"><span style="font-size: 10pt; font-family: Arial;">The Yahoo Distribution of Hadoop with Security (beta) and Oozie are available <a href="http://developer.yahoo.com/hadoop/">through the <span>Yahoo Developer Network</span></a></span></font>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/yahoo-aims-to-mainstream-hadoop-with-new-security-and-workflow-features-2010-06/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Amazon Introduces Elastic MapReduce</title>
		<link>http://www.webpronews.com/amazon-introduces-elastic-mapreduce-2009-04</link>
		<comments>http://www.webpronews.com/amazon-introduces-elastic-mapreduce-2009-04#comments</comments>
		<pubDate>Thu, 02 Apr 2009 14:49:50 +0000</pubDate>
		<dc:creator>Mike Sachoff</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Hadoop]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=49296</guid>
		<description><![CDATA[<p>Amazon Web Services has introduced the public beta of Amazon Elastic MapReduce, a cloud computing service that allows businesses, researchers, data analysts and developers to process large amounts of data.</p>]]></description>
			<content:encoded><![CDATA[<p>Amazon Web Services has introduced the public beta of Amazon Elastic MapReduce, a cloud computing service that allows businesses, researchers, data analysts and developers to process large amounts of data.</p>
<p><a href="http://aws.amazon.com/" title="Amazon MapReduce">Amazon Elastic MapReduce </a>uses Hadoop, a free Java software framework that runs on the company&#8217;s Elastic Compute Cloud (EC2) and Simple Storage Service (S3). Customers can use MapReduce to perform data-intensive tasks for distributed applications such as web indexing, data mining, log file analysis, financial analysis and bioinformatics research.</p>
<p>Customers will pay only for what they use with no up-front payments or commitments.</p>
<p><center><img border="0" title="Pricing" alt="Pricing" src="http://images.ientrymail.com/webpronews/article_pics/amazon-pricing.gif" style="margin: 4px;" /></center></p>
<p>&quot;Some researchers and developers already run Hadoop on Amazon EC2, and many of them have asked for even simpler tools for large-scale data analysis,&quot; said Adam Selipsky, Vice President of Product Management and Developer Relations for Amazon Web Services.</p>
<p>&quot;Amazon Elastic MapReduce makes crunching in the cloud much easier as it dramatically reduces the time, effort, complexity and cost of performing data-intensive tasks.&quot;</p>
<p>The service automatically launches and configures the number and type of EC2 instances that a customer selects. It then kicks off a Hadoop implementation of the MapReduce programming model, which loads large amounts of user input data from S3 and subdivides it for parallel processing. Users can manage and monitor job flows through web service APIs or via AWS Management Console. <br />
&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/amazon-introduces-elastic-mapreduce-2009-04/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Yahoo Gives Hadoop A Big Hug</title>
		<link>http://www.webpronews.com/yahoo-gives-hadoop-a-big-hug-2008-02</link>
		<comments>http://www.webpronews.com/yahoo-gives-hadoop-a-big-hug-2008-02#comments</comments>
		<pubDate>Wed, 20 Feb 2008 14:40:21 +0000</pubDate>
		<dc:creator>Doug Caverly</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=44172</guid>
		<description><![CDATA[<p>Yahoo's done one impressive thing after another since Microsoft threatened to acquire it, and now, in a step that should make the open source community proud, Yahoo has announced that it's running &#34;the world's largest Hadoop application, a 10,000 core Linux cluster producing data used by the Yahoo! Search Webmap.&#34;</p>]]></description>
			<content:encoded><![CDATA[<p>Yahoo&#8217;s done one impressive thing after another since Microsoft threatened to acquire it, and now, in a step that should make the open source community proud, Yahoo has announced that it&#8217;s running &quot;the world&#8217;s largest Hadoop application, a 10,000 core Linux cluster producing data used by the Yahoo! Search Webmap.&quot;</p>
<p><span id="more-44172"></span>
<p>Hadoop comes from the Apache Software Foundation, and acts as a distributed computing platform.&nbsp; Prior to Yahoo&#8217;s announcement, the software wasn&#8217;t hurting for footing; other users included Google, IBM, and Last.fm.&nbsp; Still, this development should ensure Hadoop is embraced to an even greater degree in the future.</p>
<p><img width="180" height="140" border="0" align="right" src="http://images.ientrymail.com/webpronews/article_pics/yahoo_logo.jpg" title="Yahoo Gives Hadoop A Big Hug" alt="Yahoo Gives Hadoop A Big Hug" /></p>
<p>On the <a href="http://www.ysearchblog.com/archives/000521.html" title="&quot;Hadoop Now at the Heart of Every Yahoo! Search&quot;">Yahoo Search Blog</a>, Sean Suchter points out, &quot;Using open source software is a win-win situation for Yahoo! and the wider community.&nbsp; We achieve cost savings, faster processing, reduced maintenance, and increased scale and the community can benefit from the myriad improvements it took to make Hadoop viable for such a large-scale commercial implementation.&quot;&nbsp; Quite an endorsement (and invitation), eh?</p>
<p>Yahoo and Hadoop are also taking steps to make sure everyone hears the news, with related posts appearing on the <a href="http://developer.yahoo.net/blog/archives/2008/02/hadoop_production_yahoo_search_webmap.html" title="&quot;Hadoop running in production on the Yahoo! Search Webmap&quot;">Yahoo Developer Network</a> and <a href="http://developer.yahoo.com/blogs/hadoop/2008/02/yahoo-worlds-largest-production-hadoop.html" title="&quot;Yahoo! Launches World's Largest Hadoop Production Application&quot;">Hadoop and Distributed Computing</a> blogs.&nbsp; An interview between Jeremy Zawodny and two team members adds to the buzz.</p>
<p>But as <a href="http://jeremy.zawodny.com/blog/archives/009992.html" title="&quot;Yahoo! Search running Apache Hadoop on Large Scale&quot;">Zawodny</a> reminds everyone, there&#8217;s more to this than PR.&nbsp; &quot;It&#8217;s not just an experiment or research project,&quot; he writes.&nbsp; &quot;There&#8217;s real money on the line.&quot;&nbsp; And with their record-breaking implementation, Yahoo and Hadoop seem to be handling it well.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/yahoo-gives-hadoop-a-big-hug-2008-02/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yahoo Adds New Blog To Developer Network</title>
		<link>http://www.webpronews.com/yahoo-adds-new-blog-to-developer-network-2007-11</link>
		<comments>http://www.webpronews.com/yahoo-adds-new-blog-to-developer-network-2007-11#comments</comments>
		<pubDate>Wed, 14 Nov 2007 21:17:26 +0000</pubDate>
		<dc:creator>Mike Sachoff</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[developer]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Network]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=41927</guid>
		<description><![CDATA[<p>Jeremy Zawodny of Yahoo announced today that Yahoo will feature a blog on the Yahoo Developer Network focused on Hadoop.</p>
]]></description>
			<content:encoded><![CDATA[<p>Jeremy Zawodny of Yahoo announced today that Yahoo will feature a blog on the Yahoo Developer Network focused on Hadoop.</p>
<p><span id="more-41927"></span></p>
<p><img src="http://images.ientrymail.com/webpronews/article_pics/sm_body/yahoo_blog_developer_network.jpg" align="right" border="0" alt="Jeremy Zawodny" title="Jeremy Zawodny"> He writes, &quot;To make things a bit easier, we decided to start this Hadoop and Distributed Computing <a title="Yahoo" href="http://developer.yahoo.com/blogs/hadoop/">blog</a> on the Yahoo! Developer Network as a place to write about Hadoop and our distributed computing work on a more regular basis. There&#8217;s a lot going on (we already have a backlog of posts!) and we&#8217;re anxious to get the word out.&quot;</p>
<p>Currently featured on the new blog is a video interview Zawodny does with Eric Baldeschwieler about Hadoop and Yahoo.</p>
<p><center><a href="http://aj.600z.com/aj/41545/0/cc?z=1"><img src="http://aj.600z.com/aj/41545/0/vc?z=1&#038;dim=41551" width="336" height="55" border="0"></a></center></p></p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/yahoo-adds-new-blog-to-developer-network-2007-11/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yahoo, Carnegie Mellon Switch On Supercomputer</title>
		<link>http://www.webpronews.com/yahoo-carnegie-mellon-switch-on-supercomputer-2007-11</link>
		<comments>http://www.webpronews.com/yahoo-carnegie-mellon-switch-on-supercomputer-2007-11#comments</comments>
		<pubDate>Mon, 12 Nov 2007 16:08:53 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Carnegie Mellon]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[M45]]></category>
		<category><![CDATA[Supercomputer]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=41804</guid>
		<description><![CDATA[<p>The M45 supercomputer provided by Yahoo opened its ports to its partners at Carnegie Mellon University, where the initiative should help boost research that benefits the broader Internet community.</p>
]]></description>
			<content:encoded><![CDATA[<p>The M45 supercomputer provided by Yahoo opened its ports to its partners at Carnegie Mellon University, where the initiative should help boost research that benefits the broader Internet community.</p>
<p><span id="more-41804"></span><br />
<center><img border="0" align="center" src="http://images.ientrymail.com/webpronews/article_pics/sm_body/Yahoo_CarnegieMellon.jpg" alt="Yahoo, Carnegie Mellon Switch On Supercomputer" title="Yahoo, Carnegie Mellon Switch On Supercomputer" /></center></p>
<p>For those of you firing up the old faithful laptop for a morning of surfing, blogging, maybe a little development work, get a load of what some of the lucky geeks at <a href="http://www.cmu.edu">Carnegie Mellon University</a> got to play with this morning:</p>
<p><tt>The M45, Yahoo&rsquo;s supercomputing cluster, has approximately 4,000 processors, three terabytes of memory, 1.5 petabytes of disks, and a peak performance of more than 27 trillion calculations per second (27 teraflops), placing it among the top 50 fastest supercomputers in the world.</tt></p>
<p>Their ranking claim won&#8217;t be confirmed until the next <a href="http://www.top500.org/">Top500 Supercomputer</a> list comes out on Tuesday at this week&#8217;s <a href="http://sc07.supercomputing.org/index.php">SC07</a> conference in Reno, so it will be interesting to see how M45 measures against the best in the world. Yahoo&#8217;s M45 figures should put it in the top 30.</p>
<p>We chatted with Yahoo&#8217;s Ron Brachman, VP for worldwide research operations with the company. He&#8217;s also wearing the hat as head of academic relationships. Jay Kistler, VP for engineering system tools &amp; services, also talked with us ahead of this morning&#8217;s announcement.</p>
<p>Brachman said the M45 supercomputer came about from the opportunity for Yahoo and the university community to advance science and technology on an Internet scale. They have opted to focus on open source, developing solutions for large scale distributed computing.</p>
<p>Yahoo and Carnegie Mellon understand grid computing well. The M45 setup has been geared toward that understanding. It&#8217;s capable of partitioning large data sets thanks to the installation of <a href="http://lucene.apache.org/hadoop/">Hadoop. </a></p>
<p><center><a href="http://aj.600z.com/aj/41547/0/cc?z=1"><img width="336" height="55" border="0" src="http://aj.600z.com/aj/41547/0/vc?z=1&#038;dim=41554" alt="" /></a></center></p>
<p>Hadoop accomplishes this by implementing <a href="http://wiki.apache.org/lucene-hadoop/HadoopMapReduce">MapReduce</a> and <a href="http://research.yahoo.com/node/90">Pig</a> the latter which may be known to those who follow Yahoo&#8217;s research projects closely.</p>
<p>Kistler said they have been working on layering Pig over a Hadoop core. Pig&#8217;s runtime extensions for parallel computing are similar to SQL, but they are procedural rather than declarative.</p>
<p>In the M45 environment, the runtime maps statements down to where MapReduce can divide them into little blocks of work and run them across the supercomputing platform.</p>
<p>We wanted to understand better what the distributed development effort being enabled by M45 might be able to do for this level of supercomputing. Kistler rattled off a couple of achievements he would like to see happen, if developers can pull them off.</p>
<p>One would provide for the improvement of job scheduling across clusters; another the enhancement of monitoring and instrumentation of heterogeneous jobs, where it would be easier to find bottlenecks and faults, and correct them for better performance.</p>
<p>Compelling stuff for the folks who will really get into the tasty innards of supercomputing. The potential gains from the M45 go beyond the items on Kistler&#8217;s wish list.</p>
<p>Carnegie Mellon&#8217;s Randy Bryant, dean of the School of Computer Science, told us in a phone interview about such possibilities. Top of the list: generating statistics for language translation. It&#8217;s a demanding task due to the number of documents needed for mapping words from multiple languages.</p>
<p>Another potential gain would be with digital image editing. Bryant discussed this with the example of getting an ex-brother in law out of photos. Through the use of a massive digital image database, supercomputing could allow the editor to find the content of a photo minus the person to be edited out, and replace that person with the background that would be normally visible.</p>
<p>Semantics and language search support would benefit, and we think Yahoo will be interested in that. Bryant noted such a project would look at distinguishing linguistics, where the system would understand when a speaker means &quot;bare&quot; or &quot;bear&quot; from the context of the rest of a conversation.</p>
<p>Research takes time, but the M45 platform should substantially improve the total time needed for these projects to bear productive results. Some very lucky geek types started researching on this platform today.</p>
<p><small></small></p>
<p><a href="http://twitter.com/dutter/">follow me on Twitter</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/yahoo-carnegie-mellon-switch-on-supercomputer-2007-11/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using memcached
Database Caching 1/25 queries in 0.011 seconds using memcached
Object Caching 382/435 objects using memcached

Served from: webpronews.com @ 2012-02-12 18:08:02 -->
