<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>WebProNews &#187; UIMA</title>
	<atom:link href="http://www.webpronews.com/tag/uima/feed" rel="self" type="application/rss+xml" />
	<link>http://www.webpronews.com</link>
	<description>Breaking News in Tech, Search, Social, &#38; Business</description>
	<lastBuildDate>Sun, 12 Feb 2012 22:29:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Avatar Seeks Semantic Search</title>
		<link>http://www.webpronews.com/avatar-seeks-semantic-search-2007-07</link>
		<comments>http://www.webpronews.com/avatar-seeks-semantic-search-2007-07#comments</comments>
		<pubDate>Fri, 27 Jul 2007 17:40:26 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Avatar]]></category>
		<category><![CDATA[Email]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Powerset]]></category>
		<category><![CDATA[Search Engine]]></category>
		<category><![CDATA[Semantic]]></category>
		<category><![CDATA[UIMA]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=39415</guid>
		<description><![CDATA[Researchers at IBM Almaden have been developing a semantic search process that can delve into unstructured text to retrieve structured information.
]]></description>
			<content:encoded><![CDATA[<p>Researchers at IBM Almaden have been developing a semantic search process that can delve into unstructured text to retrieve structured information.<br />
<span id="more-39415"></span></p>
<table width="400" cellspacing="0" cellpadding="2" border="0">
<tr>
<td align="center"><img width="400" height="200" border="0" class="irImage" alt="Avatar Seeks Semantic Search" title="Avatar Seeks Semantic Search" src="http://images.ientrymail.com/webpronews/article_pics/avatar_seeks_semantic_search.jpg" /></td>
</tr>
<tr>
<td align="right" class="caption" style="padding-bottom: 10px; padding-left: 45px; padding-right: 45px;">Avatar Seeks Semantic Search</td>
</tr>
<tr>
<td align="center" class="caption" style="padding-bottom: 0px;"><img width="334" height="21" src="http://images.ientrymail.com/webpronews/salon/complete.gif" alt="" /></td>
</tr>
</table>
<p>While a lot of attention has been heaped upon <a href=http://www.powerset.com/>Powerset</a> and its almost-here natural language search, IBM has been working on a similar technology that may or may not be as close to public debut.</p>
<p>
IBM calls their effort <a href=http://www.almaden.ibm.com/cs/projects/avatar/>Avatar Semantic Search</a>. Right now it doesn&#8217;t even have the nice minimalist home page Powerset has for early peek signups, but since everyone&#8217;s done reading &#8216;Harry Potter and the Deathly Hallows&#8217;, a little text to read is a good thing.</p>
<p>
&#8220;Ongoing research in Avatar is at the cusp of a number of disciplines ranging from search and information retrieval to machine learning, information extraction, and probabilistic databases,&#8221; IBM announced on the project&#8217;s page. </p>
<p>
We&#8217;ve looked at earlier IBM efforts to pull information out of unstructured resources. Their <a href=http://www.webpronews.com/topnews/2005/12/19/ibms-uima-goes-from-search-to-concept>UIMA developments</a> now occupy a place in the freely available IBM Omnifind Yahoo Edition enterprise search product, for example.</p>
<p>
But UIMA is so 2005. While Powerset has drawn upon research performed by the Palo Alto Research Center, aka PARC, IBM reached out to the academic community to complement Avatar&#8217;s internal team.</p>
<p>
They have approached the semantic search issue in three ways. Developing an information extraction system will allow Avatar to plunge into mounds of raw text, and emerge with structured data based on rules-based annotators. </p>
<p>
IBM claimed this extraction system will permit unsophisticated users to build an annotator with Avatar and pull out the desired information from email, web pages, business reports, etc.</p>
<p>
Through semantic search, the researchers think they can interpret queries people make, and model the real intent behind a query. </p>
<p>
The real challenge comes from an effort they refer to as managing uncertainty and probabilistic databases. They&#8217;ve stepped deeply into theory here, well beyond any help <a href=http://en.wikipedia.org/wiki/Infinite_Improbability_Drive>Douglas Adams</a> can provide for me.</p>
<p>
IBM built momentum with UIMA starting well before I&#8217;d interviewed Marc Andrews about it in December 2005. It led to the co-branded, freely available Omnifind product I mentioned earlier, and I have to think Avatar may be on a similar track today.</p>
<p>
<small></small></p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/avatar-seeks-semantic-search-2007-07/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>IBM Sends UIMA To Apache, OASIS</title>
		<link>http://www.webpronews.com/ibm-sends-uima-to-apache-oasis-2006-11</link>
		<comments>http://www.webpronews.com/ibm-sends-uima-to-apache-oasis-2006-11#comments</comments>
		<pubDate>Thu, 16 Nov 2006 18:03:34 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Apache]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Oasis]]></category>
		<category><![CDATA[UIMA]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=32944</guid>
		<description><![CDATA[Two major open source contributions from IBM for its Unstructured Information Management Architecture (UIMA) have placed it with the Apache Software Foundation as an incubator project, and out to the OASIS specification development group to create a true standard around the UIMA framework.
]]></description>
			<content:encoded><![CDATA[<p>Two major open source contributions from IBM for its Unstructured Information Management Architecture (UIMA) have placed it with the Apache Software Foundation as an incubator project, and out to the OASIS specification development group to create a true standard around the UIMA framework.</p>
<table width="128" border="0" align="right">
<tr>
<td width="122" height="62"><a href="http://www.webproworld.com/viewtopic.php?t=69668"><img src="http://images.ientrymail.com/CommentImage-4.gif" width="130" height="60" border="0"></a></td>
</tr>
</table>
<p>It had been about a year since IBM&#8217;s Marc Andrews, director for strategy &#038; business development for content discovery, <a href=http://www.webpronews.com/topnews/topnews/wpn-60-20051219IBMsUIMAGoesFromSearchToConcept.html class=bluelink>chatted</a> with WebProNews about UIMA. I talked with him while he was enjoying the always-cheerful environs of a busy airport somewhere about the latest news about UIMA.</p>
<p>The technology behind UIMA offers the promise of going beyond the typical search capabilities seen in search engines like Google and Ask today. UIMA understands unstructured information, stored in a multitude of formats, by its concept rather than just as keywords to match.</p>
<p>In January 2006, <a href=http://www.webpronews.com/topnews/topnews/wpn-60-20060123IBMSourceForgesUIMA.html class=bluelink>IBM placed UIMA</a> on SourceForge, giving open source developers access to its framework. </p>
<p>The tech world and beyond have taken notice; two major projects, at the Mayo Clinic and the Memorial Sloan-Kettering Cancer Center, have been implementing UIMA-based solutions to aid in finding and facilitating deeper and more encompassing medical research.</p>
<p>Andrews said IBM has seen adoption of UIMA throughout the academic, government, and business worlds over the past year since we&#8217;ve spoken. </p>
<p>IBM&#8217;s open source efforts continued with their recent steps into that community. Apache has launched a project, <a href=http://incubator.apache.org/uima/ class=bluelink>Apache UIMA</a>, which should drive more community involvement with UIMA.</p>
<p>Ideally, Andrews would like to see what the free/open source software community can do with creating components that take advantage of UIMA to fulfill a given need. &#8220;Value comes from components that others can draw from,&#8221; he said.</p>
<p>The <a href=http://www.oasis-open.org/home/index.php class=bluelink>OASIS</a> efforts at developing a standard for UIMA are at a very early stage. Andrews said the official call for participation will come from the OASIS UIMA Technical Committee, comprised of heavy hitters like Carnegie Mellon University and the Army Information and Intelligence Warfare Directorate.</p>
<p>Those who would like to experiment with UIMA at Apache&#8217;s project site will find IBM has contributed UIMA&#8217;s version 2.0 source code to the incubator. Carnegie Mellon <a href=http://uima.lti.cs.cmu.edu/ class=bluelink>created</a> a UIMA Component Repository, and analytics tools from sources like the UK&#8217;s General Architecture for Text Engineering (<a href=http://gate.ac.uk class=bluelink>GATE</a>) and <a href=http://opennlp.sourceforge.net class=bluelink>OpenNLP</a> may be obtained freely. </p>
<p>&#8212;</p>
<p>Add to <a href="http://del.icio.us/post" onclick="window.open('http://del.icio.us/post?v=4&#038;partner=wpn&#038;noui&#038;jump=close&#038;url='+encodeURIComponent(location.href)+'&#038;title='+encodeURIComponent(document.title),'delicious','toolbar=no,width=700,height=400'); return false;" CLASS="printMailTop"><img src=http://images.ientrymail.com/webpronews/delicious-pic.png border=0> Del.icio.us</a> | <a href="javascript:void window.open('http://digg.com/submit?phase=2&#038;url='+encodeURIComponent(window.location.href)+'&#038;ei=UTF-8','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)"><img src=http://images.ientrymail.com/webpronews/digg-pic.png border=0> Digg</a>  | <a href="javascript:location.href='http://reddit.com/submit?url='+encodeURIComponent(location.href)+'&#038;title='+encodeURIComponent(document.title)"><img src=http://images.ientrymail.com/webpronews/reddit.png border=0>Reddit</a> | <a href="javascript:location.href='http://www.furl.net/storeIt.jsp?u='+encodeURIComponent(document.location.href)+'&#038;t='+encodeURIComponent(document.title)+' '"><img src=http://images.ientrymail.com/webpronews/furl-pic.png border=0> Furl</a></p>
<p>Bookmark WebProNews: <a href=http://www.webpronews.com><img src=http://images.ientrymail.com/webpronews/wpn-readit.jpg border=0></a> </p>
<p><script language=JavaScript src="http://aj.600z.com/aj/1095/0/vj?z=1&#038;dim=1088&#038;pos=15"></script></p>
<p>David Utter is a staff writer for WebProNews covering technology and business. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/ibm-sends-uima-to-apache-oasis-2006-11/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Creative Discovery Comes To Search</title>
		<link>http://www.webpronews.com/creative-discovery-comes-to-search-2006-09</link>
		<comments>http://www.webpronews.com/creative-discovery-comes-to-search-2006-09#comments</comments>
		<pubDate>Tue, 19 Sep 2006 14:22:32 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Concept]]></category>
		<category><![CDATA[Creative]]></category>
		<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Supercomputer]]></category>
		<category><![CDATA[UIMA]]></category>
		<category><![CDATA[Website]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=31563</guid>
		<description><![CDATA[It's probably entirely appropriate that if a supercomputer is going to be set up to strive for creative answers to tough questions, that hardware will consist of Apple Xserve G5s.
]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s probably entirely appropriate that if a supercomputer is going to be set up to strive for creative answers to tough questions, that hardware will consist of Apple Xserve G5s.</p>
<p>At Virginia Tech, the System X supercomputer consists of a <a href=http://www.apple.com/science/profiles/vatech2/ class=bluelink>cluster of 1,100</a> of those Apple Xserve G5 machines. </p>
<p>The PhysOrg <a href=http://www.physorg.com/news77811470.html class=bluelink>website</a> reported on how researchers at the school have developed a method of finding answers by combing through seemingly disparate events.</p>
<p>Those researchers call the search capability they&#8217;ve created, &#8220;Storytelling.&#8221; It is a lot more than keyword search, and sounds more like the conceptual search IBM has been working with its <a href=http://www.webpronews.com/topnews/topnews/wpn-60-20051219IBMsUIMAGoesFromSearchToConcept.html class=bluelink>UIMA project</a>. </p>
<p>The Storytelling process has been described as finding a &#8220;chain of concepts&#8221; between specified start and end points. </p>
<p>A researcher on the project described the process:</p>
<p><i>
<div style=margin-left:10px; margin-right:10px>&#8220;The stories are pieced together by analyzing large volumes of text or other data,&#8221; said Naren Ramakrishnan, associate professor of computer science at Virginia Tech. &#8220;Everyday, there are new research results reported in the literature and there are discoveries waiting to be made by exploring connections.&#8221; </p>
<p>&#8220;Our minds cannot correlate all available datasets efficiently and with any high degree of confidence without the aid of computational biology,&#8221; said Richard Helm, associate professor of biochemistry. &#8220;Attempting to find significant correlations within the ocean of online datasets is daunting. </p>
<p>&#8220;However, there may be experiments that have been published in the literature that look at particular subsets of a biological process. The storytelling algorithm links distant&#8217; objects by finding these closer connections and drawing them together in a storyline.&#8221;</p></div>
<p></i><br />
It&#8217;s pretty amazing technology at work, and it sounds like they can find answers to questions that are a little deeper than my last web search for <a href=http://www.ask.com/web?q=%22midnight+oil%22+lyrics&#038;qsrc=0&#038;o=0&#038;l=dir class=bluelink>Midnight Oil lyrics</a>. </p>
<p>&#8212;</p>
<p>Add to <a href="http://del.icio.us/post" onclick="window.open('http://del.icio.us/post?v=4&#038;partner=wpn&#038;noui&#038;jump=close&#038;url='+encodeURIComponent(location.href)+'&#038;title='+encodeURIComponent(document.title),'delicious','toolbar=no,width=700,height=400'); return false;" CLASS="printMailTop"><img src=http://images.ientrymail.com/webpronews/delicious-pic.png border=0> Del.icio.us</a> | <a href="javascript:void window.open('http://digg.com/submit?phase=2&#038;url='+encodeURIComponent(window.location.href)+'&#038;ei=UTF-8','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)"><img src=http://images.ientrymail.com/webpronews/digg-pic.png border=0> Digg</a>  | <a href="javascript:void window.open('http://myweb2.search.yahoo.com/myresults/bookmarklet?t='+encodeURIComponent(document.title)+'&#038;u='+encodeURIComponent(window.location.href)+'&#038;tag=Search,Concept,Supercomputer,Apple','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)"><img src=http://images.ientrymail.com/webpronews/yahoo-pic.png border=0> Yahoo! My Web</a> | <a href="javascript:location.href='http://www.furl.net/storeIt.jsp?u='+encodeURIComponent(document.location.href)+'&#038;t='+encodeURIComponent(document.title)+' '"><img src=http://images.ientrymail.com/webpronews/furl-pic.png border=0> Furl</a></p>
<p>Bookmark WebProNews: <a href=http://www.webpronews.com><img src=http://images.ientrymail.com/webpronews/wpn-readit.jpg border=0></a> </p>
<p><script language=JavaScript src="http://aj.600z.com/aj/1095/0/vj?z=1&#038;dim=1088&#038;pos=15"></script></p>
<p>David Utter is a staff writer for WebProNews covering technology and business. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/creative-discovery-comes-to-search-2006-09/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBM Preps Entry-Level Business Search</title>
		<link>http://www.webpronews.com/ibm-preps-entrylevel-business-search-2006-07</link>
		<comments>http://www.webpronews.com/ibm-preps-entrylevel-business-search-2006-07#comments</comments>
		<pubDate>Fri, 07 Jul 2006 19:30:33 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[business]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[UIMA]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=30197</guid>
		<description><![CDATA[While Big Blue already makes high-end enterprise search and content integration software, it found a need to make entry-level versions of those products available in the small to medium business markets.
]]></description>
			<content:encoded><![CDATA[<p>While Big Blue already makes high-end enterprise search and content integration software, it found a need to make entry-level versions of those products available in the small to medium business markets.</p>
<p>Departmental projects and small to mid-sized companies may need products to help them manage information across the business. Not all of them need a sizable install of IBM&#8217;s high-powered solutions to do search or content integration.</p>
<p><a href=http://ibm.com/software/data/discovery/launch.html class=bluelink>IBM</a> wants to embrace that market, a sensible move because while there are a lot of bigger companies in the world, there are plenty of smaller firms that a high-end offering simply does not match.</p>
<p>The IBM WebSphere Information Integrator OmniFind Starter Edition addresses that market for search products. It can analyze and index information across a business from internal portals, databases, and other sources.</p>
<p>Companies needing expanded capabilities in OmniFind Starter can plug-in the Unstructured Information Management Architecture (UIMA) framework. We have <a href=http://www.webpronews.com/topnews/topnews/wpn-60-20051219IBMsUIMAGoesFromSearchToConcept.html class=bluelink>discussed UIMA</a> before, and its ability to sift through concepts and not just keyword search.</p>
<p>Content management systems exist in many forms, beyond those used by online publishers. To address them and have information available beyond their confines, a business can use the WebSphere Information Integrator Content Starter Edition.</p>
<p>IBM said the Information Integrator works with distributed content &#8220;as if it were stored and managed in a single repository.&#8221; Companies that have considered shifting to a service-oriented architecture (SOA) will find the Information Integrator can extend &#8220;the information as a service framework to include unstructured information.&#8221;</p>
<p>Through the use of out-of-the-box connectors, or custom ones created with the toolkit that comes with Information Integrator, users can access multiple data repositories from a single interface built to use the Integrator&#8217;s behaviors. </p>
<p>&#8212;<br />
Tag: </p>
<p>Add to <a href="http://del.icio.us/post" onclick="window.open('http://del.icio.us/post?v=4&#038;noui&#038;jump=close&#038;url='+encodeURIComponent(location.href)+'&#038;title='+encodeURIComponent(document.title), 'delicious','toolbar=no,width=700,height=400'); return false;"><img src=http://images1.ientrymail.com/webpronews/delicious-pic.png border=0> Del.icio.us</a> | <a href="javascript:void window.open('http://digg.com/submit?phase=2&#038;url='+encodeURIComponent(window.location.href)+'&#038;ei=UTF-8','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)"><img src=http://images1.ientrymail.com/webpronews/digg-pic.png border=0> Digg</a>  | <a href="javascript:void window.open('http://myweb2.search.yahoo.com/myresults/bookmarklet?t='+encodeURIComponent(document.title)+'&#038;u='+encodeURIComponent(window.location.href)+'&#038;tag=IBM','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)"><img src=http://images1.ientrymail.com/webpronews/yahoo-pic.png border=0> Yahoo! My Web</a> | <a href="javascript:location.href='http://www.furl.net/storeIt.jsp?u='+encodeURIComponent(document.location.href)+'&#038;t='+encodeURIComponent(document.title)+' '"><img src=http://images1.ientrymail.com/webpronews/furl-pic.png border=0> Furl</a></p>
<p>Bookmark WebProNews: <a href=http://www.webpronews.com><img src=http://images.ientrymail.com/webpronews/wpn-readit.jpg border=0></a> </p>
<p><script language=JavaScript src="http://aj.600z.com/aj/1095/0/vj?z=1&#038;dim=1088&#038;pos=15"></script></p>
<p>David Utter is a staff writer for WebProNews covering technology and business. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/ibm-preps-entrylevel-business-search-2006-07/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microsoft May Use UIMA To Top Google</title>
		<link>http://www.webpronews.com/microsoft-may-use-uima-to-top-google-2006-03</link>
		<comments>http://www.webpronews.com/microsoft-may-use-uima-to-top-google-2006-03#comments</comments>
		<pubDate>Thu, 02 Mar 2006 19:08:43 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Reports]]></category>
		<category><![CDATA[UIMA]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=27325</guid>
		<description><![CDATA[A Microsoft Europe executive has provided some bulletin board material for Google, in claiming Microsoft will exceed Google in the search market in six months, based on being better able to retrieve specific information rather than just URLs.
]]></description>
			<content:encoded><![CDATA[<p>A Microsoft Europe executive has provided some bulletin board material for Google, in claiming Microsoft will exceed Google in the search market in six months, based on being better able to retrieve specific information rather than just URLs.</p>
<p>Upon seeing <a href=http://business.timesonline.co.uk/article/0,,9075-2064793,00.html class=bluelink>reports</a> like the one in the Times Online UK, we had to wonder if Microsoft president for <acronym title="Europe, Middle East, Africa" class=bluelink>EMEA</acronym> Neil Holloway was discussing a Microsoft integration of <acronym title="Unstructured Information Management Architecture" class=bluelink>UIMA</acronym> into its search capabilities. Holloway told conference attendees in Paris that Microsoft will unveil a new search engine in Britain and the US in six months before unleashing it on the rest of Europe.</p>
<p>And it&#8217;s going to make everyone forget about Google:</p>
<p><i>
<div style=margin-left:10px; margin-right:10px;>&#8220;What we&#8217;re saying is that in six months&#8217; time we&#8217;ll be more relevant in the U.S. market place than Google,&#8221; said Neil Holloway, Microsoft president for Europe, Middle East and Africa.</p>
<p>&#8220;The quality of our search and the relevance of our search from a solution perspective to the consumer will be more relevant,&#8221; he told the Reuters Global Technology, Media and Telecoms Summit.</p>
<p>But being good is not enough to win the hearts and minds of consumers already dedicated to another standard.</p></div>
<p></i><br />
The last point echoes the long-ago battle between Philips and Sony over videotape format standards. Although Sony&#8217;s Betamax won over fans with its superior audio, VHS became the standard through its broader acceptance.</p>
<p>Holloway noted that integration of that search will take place in some of Microsoft&#8217;s popular programs:</p>
<p><i>
<div style=margin-left:10px; margin-right:10px;>Microsoft will put its search engine into its widely used communications tools Windows Messenger and Hotmail.</p>
<p>&#8220;Integrating search into those other applications &#8230; makes it very seamless for people,&#8221; he said. Timing in Europe will be pegged to that in the United States.</p>
<p>&#8220;The UK will probably be at the same time, France maybe three months behind, Germany maybe three months behind. It&#8217;s not two years behind.&#8221;</p>
<p>He said that Microsoft&#8217;s goal &#8212; but not its initial offering &#8212; would go beyond finding URLs and instead focus in on the specific information sought by Internet users.</p>
<p>&#8220;Generally these days what you get back is URLs, and based upon research 50 percent of the time you do a search you don&#8217;t get the URL you&#8217;re looking for,&#8221; he said.</p></div>
<p></i><br />
The information focus sounds very familiar. IBM, <a href=http://www.webpronews.com/insiderreports/searchinsider/wpn-49-20060105IBMNotGoogleScaresMicrosoft.html class=bluelink>the company that scares Microsoft</a> more than Google, released its <acronym title="Unstructured Information Management Architecture" class=bluelink>UIMA</acronym> framework last year and made it freely available. </p>
<p>In an <a href=http://www.webpronews.com/topnews/topnews/wpn-60-20051219IBMsUIMAGoesFromSearchToConcept.html class=bluelink>interview with Marc Andrews</a>, director for strategy &#038; business development for content discovery at IBM, he disclosed that IBM was focusing on concept as the operative function in search. More from that article:</p>
<p><i>
<div style=margin-left:10px; margin-right:10px;>Andrews noted how search solutions don&#8217;t really go beyond web and file servers when it comes to spidering. &#8220;They&#8217;ve really ignored all of the enterprise knowledge that is being managed in their content management environment; that&#8217;s being stored in databases supporting their different applications, and potentially even in mainframes.&#8221; </p>
<p>Content discovery offers a more holistic view of data across the enterprise. &#8220;We&#8217;ve been focusing on enabling organizations to do a lot more than just search,&#8221; Andrews said. &#8220;One of the major limitations organizations have today is they&#8217;re limited to keyword-based search capabilities. That ends up falling short of most organizations&#8217; needs.&#8221;</p></div>
<p></i><br />
If Microsoft does come up with a better search, to achieve the goal Holloway stated, UIMA may be the mechanism they use to get there.</p>
<p>&#8212;<br />
<script language='javascript'> document.write("Email WebProNews <a href='mailto:news@ientry.com?subject="+encodeURIComponent(document.title)+"'>here</a>.")</script></p>
<p>Drag this <a href=http://www.webpronews.com><img src=http://images.ientrymail.com/webpronews/wpn-readit.jpg border=0></a> to your Bookmarks.</p>
<p>Add to <script language='javascript'> document.write("<a href='http://del.icio.us/post?url="+encodeURIComponent(document.location.href)+"&#038;title="+encodeURIComponent(document.title)+"'>Del.icio.us</a>")</script> | <a href="javascript:void window.open('http://digg.com/submit?phase=2&#038;url='+encodeURIComponent(window.location.href)+'&#038;ei=UTF-8','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)">DiggThis</a>  | <a href="javascript:void window.open('http://myweb2.search.yahoo.com/myresults/bookmarklet?t='+encodeURIComponent(document.title)+'&#038;u='+encodeURIComponent(window.location.href)+'&#038;ei=UTF-8','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)">Yahoo! My Web</a></p>
<p><script language=JavaScript src="http://aj.600z.com/aj/1095/0/vj?z=1&#038;dim=1088&#038;pos=15"></script></p>
<p>David Utter is a staff writer for WebProNews covering technology and business. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/microsoft-may-use-uima-to-top-google-2006-03/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBM SourceForges UIMA</title>
		<link>http://www.webpronews.com/ibm-sourceforges-uima-2006-01</link>
		<comments>http://www.webpronews.com/ibm-sourceforges-uima-2006-01#comments</comments>
		<pubDate>Mon, 23 Jan 2006 19:32:14 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Context]]></category>
		<category><![CDATA[Framework]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[SES]]></category>
		<category><![CDATA[UIMA]]></category>
		<category><![CDATA[Web]]></category>
		<category><![CDATA[Website]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=26177</guid>
		<description><![CDATA[Source code for IBM's Unstructured Information Management Architecture (UIMA) now has a home on SourceForge, as IBM invites developers everywhere to take a shot at the concept of knowledge discovery.
]]></description>
			<content:encoded><![CDATA[<p>Source code for IBM&#8217;s Unstructured Information Management Architecture (UIMA) now has a home on SourceForge, as IBM invites developers everywhere to take a shot at the concept of knowledge discovery.</p>
<p>IBM unveiled its source code release on SourceForge, the biggest open source development site, in a statement today. Developers can find the <a href=http://uima-framework.sourceforge.net/ class=bluelink>UIMA framework</a> at Sourceforge, and additional items like the IBM UIMA SDK, with additional facilities and components, can be downloaded for free from <a href=http://www.alphaworks.ibm.com/tech/uima class=bluelink>IBM&#8217;s website</a>.</p>
<p>The move demonstrated IBM&#8217;s confidence in the open source movement, and in its UIMA technology. In December 2005, IBM&#8217;s director for strategy &#038; business development for content discovery Marc Andrews <a href=http://www.webpronews.com/topnews/topnews/wpn-60-20051219IBMsUIMAGoesFromSearchToConcept.html class=bluelink>discussed UIMA within the context</a> of search, and noted how UIMA is &#8220;information aware&#8221; rather than just data or application aware.</p>
<p>Unstructured data is the main idea to IBM&#8217;s concept approach. Whether information exists in an email, a text file, or a piece of rich media, UIMA&#8217;s framework can enable the construction of applications to retrieve the information from the container. Later in 2006, IBM plans to make the UIMA project a &#8220;full open source community development model.&#8221;</p>
<p>UIMA first started picking up <a href=http://www.webpronews.com/news/ebusinessnews/wpn-45-20050228IBMsApproachToEnterpriseSearch.html class=bluelink>notice in February 2005</a>, and more formally disclosed at <a href=http://www.webpronews.com/insidesearch/insidesearch/wpn-56-20050808IBMOfferingNewSearchConcept.html class=bluelink>SES San Jose 2005 in August</a> of that year. At that time, IBM promised to unveil UIMA on SourceForge by the end of the year.</p>
<p>Several firms like Factiva and Cognos created UIMA compliant solutions, IBM noted in its statement. Ongoing UIMA development in the medical field at places like the Mayo Clinic and Memorial Sloan-Kettering holds promise, too. Mayo wants to extract information from about 20 million clinical notes, while Sloan-Kettering has an even more ambitious plan:</p>
<p><i>
<div style=margin-left:10px; margin-right:10px;>Memorial Sloan-Kettering Cancer Center is working with IBM to develop a Web accessible data warehouse that will conform to HIPAA requirements. This data warehouse will enable clinicians and researchers from Memorial Sloan-Kettering Cancer Center to efficiently use data facilitating research on a new cancer taxonomy. An important aspect of the data warehouse is the inclusion of searchable concepts from Memorial Sloan-Kettering Cancer Center&#8217;s text-based pathology reports. These concepts are automatically extracted by an IBM text analytics solution built on the UIMA framework.</div>
<p></i></p>
<p>&#8212;<br />
<script language='javascript'> document.write("Email WebProNews <a href='mailto:news@ientry.com?subject="+encodeURIComponent(document.title)+"'>here</a>.")</script></p>
<p>Drag this <a href=http://www.webpronews.com><img src=http://images.ientrymail.com/webpronews/wpn-readit.jpg border=0></a> to your Bookmarks.</p>
<p>Add to <script language='javascript'> document.write("<a href='http://del.icio.us/post?url="+encodeURIComponent(document.location.href)+"&#038;title="+encodeURIComponent(document.title)+"'>Del.icio.us</a>")</script> | <a href="javascript:void window.open('http://digg.com/submit?phase=2&#038;url='+encodeURIComponent(document.title)+'&#038;u='+encodeURIComponent(window.location.href)+'&#038;ei=UTF-8','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)">DiggThis</a> | <a href="javascript:void window.open('http://myweb2.search.yahoo.com/myresults/bookmarklet?t='+encodeURIComponent(document.title)+'&#038;u='+encodeURIComponent(window.location.href)+'&#038;ei=UTF-8','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)">Yahoo My Web</a></p>
<p><script language=JavaScript src="http://aj.600z.com/aj/1095/0/vj?z=1&#038;dim=1088&#038;pos=15"></script></p>
<p>David Utter is a staff writer for WebProNews covering technology and business. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/ibm-sourceforges-uima-2006-01/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBM&#8217;s UIMA Goes From Search To Concept</title>
		<link>http://www.webpronews.com/ibms-uima-goes-from-search-to-concept-2005-12</link>
		<comments>http://www.webpronews.com/ibms-uima-goes-from-search-to-concept-2005-12#comments</comments>
		<pubDate>Mon, 19 Dec 2005 16:50:02 +0000</pubDate>
		<dc:creator>WebProNews Staff</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Concept]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[UIMA]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=25248</guid>
		<description><![CDATA[The reason why people are better at answering questions than search engines is due to people understanding the concept behind a question; while search engines do well on context, IBM sees concepts as the next great advance in search technology.
]]></description>
			<content:encoded><![CDATA[<p>The reason why people are better at answering questions than search engines is due to people understanding the concept behind a question; while search engines do well on context, IBM sees concepts as the next great advance in search technology.</p>
<p>If you remember something about a particular movie, a quote or a scene, but not the movie itself, a search engine may be very helpful in making a connection. But if it&#8217;s something very obscure and a Google or a Yahoo can&#8217;t find it, it isn&#8217;t a real big deal.</p>
<p>What if it&#8217;s your physician trying to find out more about symptoms of an illness you possess? Does finding data related to those symptoms, locating information based on the symptoms, become a big deal for you?</p>
<p>Most likely the answer will be yes. The idea of &#8220;concept&#8221; as the operative function in search has become the place where IBM wants to take enterprise search. For IBM, it&#8217;s stopped being about keywords.</p>
<p>It&#8217;s about discovering the concepts in content.</p>
<p>Marc Andrews, director for strategy &#038; business development for content discovery, spoke to me about how discovery works, and it all starts with <a href=http://www.research.ibm.com/UIMA/ class=bluelink>UIMA</a>, Unstructured Information Management Architecture. </p>
<p>I asked him if UIMA was application aware as well as data aware, and he indicated it went beyond that. &#8220;It&#8217;s information aware, whatever format information comes in, wherever it&#8217;s stored across the enterprise, and it&#8217;s really going beyond the traditional search.&#8221;</p>
<p>Andrews noted how search solutions don&#8217;t really go beyond web and file servers when it comes to spidering. &#8220;They&#8217;ve really ignored all of the enterprise knowledge that is being managed in their content management environment; that&#8217;s being stored in databases supporting their different applications, and potentially even in mainframes.&#8221;</p>
<p>Content discovery offers a more holistic view of data across the enterprise. &#8220;We&#8217;ve been focusing on enabling organizations to do a lot more than just search,&#8221; Andrews said. &#8220;One of the major limitations organizations have today is they&#8217;re limited to keyword-based search capabilities. That ends up falling short of most organizations&#8217; needs.&#8221;</p>
<p>UIMA, which has been in development for about three years, could fulfill those needs. Andrews said development of the architecture began about three years ago. &#8220;It started out as a project in IBM Research because we had over 200 researchers working across 8 labs in six or seven different countries.</p>
<p>&#8220;They were all developing different types of text analytics. We needed to be able to have those different components interoperate with and build upon each other,&#8221; Andrews said. </p>
<p>By developing UIMA, IBM created an open framework that could accept those text analytic components as plug-ins, interpret the meaning of unstructured information, and identify concepts and facts that allow for the search for more than mentions of words.</p>
<p>Now let&#8217;s come back to the scenario where you visit the doctor with a variety of symptoms, a list that has even the most talented of clinicians shaking his or her head. It&#8217;s an appropriate topic since the earliest adopters of UIMA have been organizations like the Mayo Clinic and Sloan-Kettering Cancer Center. </p>
<p>&#8220;An expert knows to go in and search for ten different ways of describing the symptom. Your typical patient or even doctor refers to things in different ways. They want to be able to search for a symptom or search for any drugs that relate to the symptom and find all of that information,&#8221; Andrews said.</p>
<p>&#8220;They&#8217;re doing it for clinical trials research and drug research, to be able to identify concepts and facts. So those organizations are leveraging UIMA today to incorporate these types of analytics.</p>
<p>&#8220;Doctors and patients wanted to find out more about different clinical trials that are going on,&#8221; he continued. &#8220;These clinical trials are being managed by various organizations, information is scattered across NIH (National Institute of Health) and other different databases, and included clinical trials being conducted&#8221; by various pharmaceutical firms.</p>
<p>Medical centers and universities like Stanford, Carnegie-Mellon, and Columbia were running into search-related challenges. So was the organization that gave birth to the Internet, the Defense Advanced Research and Projects Agency (DARPA). </p>
<p>It was DARPA that sponsored the first working group for UIMA. From that working group, the first version of UIMA became available at the beginning of 2005. In August 2005, IBM announced its intent to make UIMA available as open source. Andrews said there have been about 3,000 downloads of the UIMA framework.</p>
<p>There&#8217;s some proof-of-concept work going on at some Fortune 500 companies with UIMA, but Andrews wasn&#8217;t in a position to discuss names yet. More information on that should be coming early in 2006.</p>
<p>Email the author <A HREF="&#109;&#97;&#105;&#108;&#116;&#111;&#58;&#100;&#117;&#116;&#116;&#101;&#114;&#64;&#105;&#101;&#110;&#116;&#114;&#121;&#46;&#99;&#111;&#109;">here</A>.</p>
<p>Add to <script language='javascript'> document.write("<a href='http://del.icio.us/post?url="+encodeURIComponent(document.location.href)+"&#038;title="+encodeURIComponent(document.title)+"'>Del.icio.us</a>")</script> | <a href="javascript:void window.open('http://myweb2.search.yahoo.com/myresults/bookmarklet?t='+encodeURIComponent(document.title)+'&#038;u='+encodeURIComponent(window.location.href)+'&#038;ei=UTF-8','popup','width=520px,height=420px,status=0,location=0,resizable=1,scrollbars=1,left=100,top=50',0)">Yahoo My Web</a></p>
<p><script language=JavaScript src="http://aj.600z.com/aj/1095/0/vj?z=1&#038;dim=1088&#038;pos=15"></script></p>
<p>David Utter is a staff writer for WebProNews covering technology and business. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/ibms-uima-goes-from-search-to-concept-2005-12/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBMs Approach To Enterprise Search</title>
		<link>http://www.webpronews.com/ibms-approach-to-enterprise-search-2005-02</link>
		<comments>http://www.webpronews.com/ibms-approach-to-enterprise-search-2005-02#comments</comments>
		<pubDate>Mon, 28 Feb 2005 19:51:13 +0000</pubDate>
		<dc:creator>Chris Richardson</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[enterprise]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[OmniFind]]></category>
		<category><![CDATA[UIMA]]></category>

		<guid isPermaLink="false">http://www.webpronews.com/?p=15332</guid>
		<description><![CDATA[Developing effective search tools for enterprise-level businesses is not the same as developing search for web-based documents.  Websites and other web-related documents contain an inherent structure due to the nature of web links.
]]></description>
			<content:encoded><![CDATA[<p>Developing effective search tools for enterprise-level businesses is not the same as developing search for web-based documents.  Websites and other web-related documents contain an inherent structure due to the nature of web links.</p>
<p>However, because corporate documents do not contain a natural structure, meaning they are more unique and un-related, indexing the large amounts of business documents can be an arduous task.  It&#8217;s this concept that drives Arthur Ciccolo, one of the chief developers of IBM&#8217;s Unstructured Information Management Architecture (UIMA) project.</p>
<p>While enterprise search may be considered a niche topic, many of the developments coming from UIMA, if applied to web-based search, could have incredible ramifications.  However, this is not IBM&#8217;s goal.  During a phone interview, Ciccolo stated in no uncertain terms that IBM&#8217;s goal for their search technology is the enterprise level, not web search.</p>
<p>To define UIMA and its function, <a href="http://www.research.ibm.com/UIMA/index.htm">IBM offers this</a>:</p>
<p><i>Unstructured information represents the largest, most current and fastest growing source of information available to businesses and governments  An Unstructured Information Management (UIM) application may be generally characterized as a software system that analyzes large volumes of unstructured information (text, audio, video, images, etc.) to discover, organize and deliver relevant knowledge to the client or application end-user.</i></p>
<p>In order to accomplish this task, Ciccolo and his team are putting their efforts developing different framework structures to perform text analysis, semantic comprehension, and natural language support.  By doing so, IBM&#8217;s UIMA utilities can better perform the tasks of indexing and comprehending the different types of enterprise business documents.</p>
<p>To understand why enterprise search can be such a complicated excursion, you must first understand the different types of unstructured data that has to be indexed.  With web-based documents, there is a much more narrow focus because document types are more limited (html, pdf, fla, etc.) and they usually contain links, which lends itself to easier indexing.  </p>
<p>These off-page attributes provide the structure and makes web indexing less complicated than the unstructured environment of business documents.  While web documents are more confined, enterprise documents run the gamut from word processing documents to video and sound files, and the off-page attributes that provide structure are absent.</p>
<p>The capabilities of UIMA, some of which are still in the developmental stages, attempt to address these concerns.  For instance, the focal point of UIM is to focus on text analytics and the semantics contained within.  By understanding the contents of the text being indexed, developing natural language search capabilities (&#8220;What is the formula for product X?&#8221;) is an attainable goal.  </p>
<p>However, to understand the difficulties involved in developing a natural language search feature that works, consider this:  the reason Microsoft will be releasing <a href="http://www.webpronews.com/news/ebusinessnews/wpn-45-20040831MicrosoftConfirmsNoDesktopSearchForLonghorn.html">Longhorn without the search feature</a> has to do with the developing a new file structure that supports natural search queries.</p>
<p>With UIMA, once a document is indexed, searching for it should be easier than it would if it was web indexed.  Ciccolo indicated that once an item is indexed, their technology automatically generates editable meta data, which makes discovery much simpler.  By integrating UIM technology with IBM&#8217;s Intranet search utility <a href="http://www-306.ibm.com/common/ssi/fcgi-bin/ssialias?subtype=ca&#038;infotype=an&#038;appname=iSource&#038;supplier=897&#038;letternum=ENUS205-002">WebSphere Information Integrator OmniFind</a>, they are able to provide an exciting era of enterprise-related search.</p>
<p>With regards to the current abilities of UIMA, the potential future developments are quite impressive, and if adopted, could cause huge ripples in the search technology status quo.  While the list of possible developments is fairly long, there are two possible developments that could have long-reaching ramifications.</p>
<p>The first area of interest has to do with video search.  Ciccolo and his team are developing video search methods that could revolutionize the whole concept.  Normally, most video indexing is done by spidering the closed-caption text contained within the film.  However, Ciccolo&#8217;s vision has to do with actually analyzing the picture to extract whatever relevant content is contained within.  This data would then be indexed and have meta data generated (which would be editable using IBM&#8217;s service), making retrieval methods even better.</p>
<p>The other area of interest has to do with providing the ability to perform trans-lingual queries.  What drives this development is the following concept:</p>
<p>- User A enters a query in English language<br />
- UIMA translates query into target language<br />
- UIMA then searches target language documents and,<br />
- Returns search result in whatever language initiated the query.  In this case, English</p>
<p>Arthur indicated that during the testing phases, he discovered search results were actually more relevant after the translation took place, meaning UIMA performs the translation after the query is entered.  If something similar to this was adopted by search as a whole, it could and would alter the entire landscape of possibilities.</p>
<p>Other features and ideas of interest include the ability for administration members to write their own search algorithms, which can be implemented on top of existing framework.  Speaking of admins, Ciccolo made sure to point out UIMA was developed with IT departments in mind.  The technology is purposely made to be easy to install and to tweak.  This makes adapting their search technology to fit your business much easier.</p>
<p>Another area that the team is focusing on is the medical industry.  The ability to catalog and differentiate between the mountains of journals and documents over-running medical institutions is completely welcomed.  Currently, IBM has an agreement with the Mayo Clinic to implement and test UIMA.  If the medical field as a whole would adopt such technology, finding patient information, journals on particular illnesses, and pharmaceutical information would be much easier to accomplish.  This in turn would undoubtedly improve the medical industry&#8217;s ability to treat and care for patients, as well as share information.</p>
<p>To understand the goal of the UIMA project, Ciccolo offers these thoughts, &#8220;IBM&#8217;s goal for UIMA is that it becomes widely accepted as a new class of middleware for analytics and that it enables the next generation of search: semantic search.&#8221;</p>
<p>For much more information about the project and other areas of IBM&#8217;s approach to enterprise search, please visit the following areas:</p>
<p><a href="http://www.research.ibm.com/UIMA/index.htm">UIMA Homepage</a><br />
<a href="http://www.research.ibm.com/journal/sj/433/broder.html">The research proposal</a><br />
<a href="http://www.research.ibm.com/journal/sj/433/brodeaut.html">About the authors</a><br />
<a href="http://www-306.ibm.com/common/ssi/fcgi-bin/ssialias?subtype=ca&#038;infotype=an&#038;appname=iSource&#038;supplier=897&#038;letternum=ENUS205-002">Information about OmniFind</a></p>
<p>While the subject can be tricky to navigate through, I would recommend reading the journals and documentations of Art Ciccolo and his team.</p>
<p>Chris Richardson is a search engine writer and editor for <a href="http://www.WebProNews.com">WebProNews</a>. Visit WebProNews for the <a href="http://www.WebProNews.com">latest search news</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.webpronews.com/ibms-approach-to-enterprise-search-2005-02/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using memcached
Database Caching 1/37 queries in 0.018 seconds using memcached
Object Caching 524/619 objects using memcached

Served from: webpronews.com @ 2012-02-12 17:42:07 -->
