iEntry 10th Anniversary RSS Newsletter Advertising
Visit Twellow.com
Text: Decrease Font Size Increase Font Size | Print Print Article | Share: Delicious Digg StumbleUpon Post to Twitter Post to Facebook
4 commentsThursday, November 13, 2008

PubCon: Getting Rid Of Duplicate Content

Tips from Google

The issue of duplicate content is something that all webmasters and site owners have to take into consideration and the PubCon session "Getting Rid of Duplicate Content Once and For All," addresses that challenge.

(Coverage of PubCon continues at WebProNews Videos.  Stay with WebProNews for continued coverage from the event this week.)

PubCon: Getting Rid Of Duplicate Content

Ben D' Angelo, Software Engineer, Google, spoke about duplicate content issues. There are multiple disjoint situations including multiple URLs pointing to the same page, different countries with the same language, and syndicated content across other sites.

To avoid such issues you should have one URL for one piece of content. The reason for this is users don't like duplicated results, it saves resources by having room to index other content, and it saves resources on your server.

Sources of duplicate content within your sites are multiple URLs pointing to the same page, www. Vs non www., session ids, URL parameters, and printable versions of your pages.

Google handles duplicate content in a number of ways. The general idea is to cluster pages and choose the best representative. Google uses different filters for different types of duplicate content. The goal is to serve on version of the content in the SERPs.

To prevent duplicate content there are a variety of things you can do.  For exact duplicates a 301 redirect is the best option.  For near duplicate content use noindex and robots.txt

For domains by country, different languages are not duplicate content. Use unique content specific to that country.  Use different TLDs and Webmaster tools for geo targeting.

 For URL parameters put data which does not effect the substance of the page in a cookie, not the URL.

When it comes to other sites include the original absolute URL in any syndicated content. Syndicate slightly different content. Manage your expectations if you use syndicated content, you will probably not outrank the original source.

Don't be too concerned about scrapers or proxies, they generally won't impact your rankings. If you are concerned you can file a DMCA or spam report with Google.

If you need other information you can visit Google Webmaster Central or the Google Webmaster Central Blog.
 

About the author:
Mike is a staff writer for WebProNews.

Wordpress not helping

Comment pagination in the latest version of wordpress isn't helping duplicte content - multiple pages with the same post but with just different comments at the bottom.

duplicate content

In regards to the duplicate content part of this blog post, I personally use the http://www.copygator.com website to find and stop duplicate content:

1. it's automated and brings me results instead of me searching for duplicated content. All i had to do was submit my feed and it started monitoring my feed showing me who's republished my articles on the web.

2. i get notified by email so it contacts me when it finds copies of my articles online.

3. i use their image badge feature to alert me directly on my website when my content is being lifted.

4. it's a free service as opposed the "per page" cost of copyscape/copysentry.

Publish A Comment

The content of this field is kept private and will not be shown publicly.
  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
CAPTCHA
This question is for testing whether you are a human visitor and to prevent automated spam submissions.
10 + 1 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.
SEARCH
Popular WPN Business Resources












Subscribe to WebProNews


Send me relevant info