iEntry 10th Anniversary RSS Newsletter Advertising
Join the WebProWorld Forum!
Text: Decrease Font Size Increase Font Size | Print Print Article | Share: Delicious Digg StumbleUpon Post to Twitter Post to Facebook
1 commentMonday, June 22, 2009

Google Makes Progress in Image Recognition

Research Paper Outlines Advancements in Computer Vision

Today at the Computer Vision and Pattern Recognition (CVPR) conference in Miami, Google released a new research paper that looks at building a web-scale landmark-recognition engine. This could lay the groundwork for some interesting advancements in image search.

"To be clear up front, this is a research paper, not a new Google product, but we still think it's cool," says Jay Yagnik, Head of Computer Vision Research.

The goal is to get computers to recognize landmarks (for example, the Eiffel Tower, the Lincoln Memorial, or the example Google shares - the Acropolis). This is no easy task when the engine has to rely on images of the landmarks, which are incredibly varied by angle, lighting, photo quality, etc.

Is that the Acropolis?

Google says it has managed to achieve 80% accuracy on over 50,000 landmarks. Google demonstrated how it did this with the Acropolis. Essentially, they began with an unnamed, untagged picture of it, entered the web address into the recognition engine, and the computer identified it as "Recognized Landmark: Acropolis, Athens, Greece."

To make a long and technical story short, Google generated a list of landmarks based on GPS-tagged photos from Picasa and Panoramio, and online tour guide webpages. Google then found "candidate images" for each landmark using those resources as well as Google Image Search. These images were "pruned using efficient image matching and unsupervised clustering techniques, as Yagnik explains. Then Google developed an indexing system for fast image recognition.

Landmark Recognition

Here's what the clustered recognition model looks like:

Clustering

"Here, we build a world-scale landmark recognition engine, which organizes, models and recognizes the landmarks on the scale of the entire planet Earth," concludes Google's research paper (PDF). "Constructing such an engine is, in essence, a multi-source and multi-modal data mining task."

There isn't really any mention of how such an engine could contribute to the improvement of Google Image Search in the research paper, but given that search, and "organizing the world's information" are what Google is essentially all about, it's not hard to imagine this technology being incorporated into how Google retrieves results for Image Search in the future.

About the author:
Chris Crum has been a part of the WebProNews team and the iEntry Network of B2B Publications since 2003. Twitter: @CCrum237

good

This really is a good advancement from Google in the field of image search. I hope it can lead to much better results for anyone searching images using google image search.

Publish A Comment

The content of this field is kept private and will not be shown publicly.
  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
CAPTCHA
This question is for testing whether you are a human visitor and to prevent automated spam submissions.
4 + 1 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.
SEARCH
Popular WPN Business Resources












Subscribe to WebProNews


Send me relevant info