Google Obtains Similarity Engine Patent
This week, Google was awarded a patent for technology designed to address duplicate content issues throughout the index. The patent, originally filed in December of 2001, is entitled “Methods and Apparatus for Estimating Similarity.”
|Does Google’s New Patent Doom Dupe Content?|
Duplicate content continues to be a thorn in the side of search users the world over. I can’t tell you how many times I come across multiple copies of the same article when I’m sifting through Google’s blog search. Scraper sites are as rampant as ever, and the duplicative dilemma when it comes to content only seems to be getting worse.
A recently approved patent, however, brings to light steps that Google is taking to deal with the duplicate content issue. The patent abstract reads as follows: The weight associated with each coordinate in the vector is multiplied by a predetermined hashing vector to generate a product vector, and the product vectors are summed. The similarity engine may then generate a compact representation of the object based on the summed product vector.
The weight associated with each coordinate in the vector is multiplied by a predetermined hashing vector to generate a product vector, and the product vectors are summed. The similarity engine may then generate a compact representation of the object based on the summed product vector.
I’m pretty savvy when it comes to techno-babble. I can explain what a Heisenberg Compensator is meant for when it comes to a Star Trek transporter relay, but all this talk of vectors and objects eludes me.
Luckily, there are people around like Bruce Clay, who can sift through the geek speak and get to the heart of the matter. This is Bruce’s summary of the patented technology’s impact:
Wow, it looks like SEOs had better take good notice of what this patent could mean to the indexing process.
As for me, I will just be happy if the sites that scrape WebProNews articles are unceremoniously booted from the index in similar fashion to the way that LSU dispatched Notre Dame in the Sugar Bowl.