tag:blogger.com,1999:blog-31345031852061809262023-11-15T07:33:28.373-08:00raisadadahttp://www.blogger.com/profile/01715062176149108081noreply@blogger.comBlogger1125tag:blogger.com,1999:blog-3134503185206180926.post-28187346232422151432013-08-19T07:41:00.000-07:002013-08-19T07:41:04.074-07:00WordPress: Wave Bye-Bye to Duplicate ContentWordPress: Wave Bye-Bye to Duplicate Content<br />
<br />
One of the many SEO obstacles facing bloggers is duplicate content.
It isn’t just about what you see (although, that is important too), but
about what you don’t see.<br />
Bloggers want to show their content in
more than one place: the homepage, categories, post page, etc. The
problem is, search engines couldn’t give a flying rat-monkey. All they
care about is the fact they are seeing the same content repeated in
multiple places. This will automatically result in either a penalty or
your pages being thrown into the supplemental index.<br />
<br />
<small>Photo by Bob Jagendorf</small><br />
<b>Step 1: Remove Duplication on archives, tags, and homepage</b><br />
To stop this from happening, you have several choices.<br />
<ol>
<li>You
can use the_excerpt() instead of the_content() on your index,
category, and tags pages. (You can customize the_excerpt by writing
within the “Excerpt” box under “Advanced Options” in the post edit
page.)</li>
<li>You can use <!–more–> to place excerpts on your
homepage. Although, this will be very time consuming if you have many
posts.</li>
</ol>
Of course, neither of those options help you if you
are wanting to show the post in more than one place. Instead, a more
likely answer is to use the home page excerpt plugin. This plugin
allows you to show a select amount of posts on your homepage and use
the_excerpt() for the rest.<br />
If you wish to use full posts on
other pages, such as tags or categories, you will have to do something
more drastic: noindex or exclude bots by robots.txt. Of course, I do
not recommend this approach. At least, not for both categories and
tags.<br />
Whichever you choose, this will help you on your way to becoming duplicate content free.<br />
<b>Step 2: Remove duplicate content due to canonicalization</b><br />
The most common duplicate content for bloggers has nothing to do with posts, but the way their server handles page requests.<br />
Can
you access your website at both www and non-www? What about with
www.example.com/index.html and www.example.com/? If so, then you may
already have problems with search engines.<br />
Every time a search
engine sees a copy of a page (even between non-www and www), it is seen
as duplicate content. As well, links pointing to one will not benefit
the other. An example of this is 10 links are pointing to
www.example.com, 5 to example.com. The linkjuice from each do not
benefit the other and only hurt themselves by telling search engines
they are two completely different websites, but with the exact same
content.<br />
<i>So how do we fix this?</i> By telling your server to redirect the URLs using a 301.<br />
<i>To redirect your index pages, place this in your .htaccess file (be sure to change example.com to your own domain):</i><br />
<IfModule mod_rewrite.c><br />
RewriteEngine On<br />
RewriteCond %{THE_REQUEST} ^.*/index.html<br />
RewriteRule ^(.*)index.html$ http://example.com/$1 [R=301,L]<br />
RewriteCond %{THE_REQUEST} ^.*/index.php<br />
RewriteRule ^(.*)index.php$ http://example.com/$1 [R=301,L]<br />
</IfModule><br />
<i>To redirect www to non-www use this:</i><br />
<IfModule mod_rewrite.c><br />
RewriteEngine On<br />
RewriteCond %{HTTP_HOST} ^www\.example\.com$ [NC]<br />
RewriteRule ^(.*)$ http://example.com/$1 [R=301,L]<br />
</IfModule><br />
<i>Or use this to redirect non-www to www:</i><br />
<IfModule mod_rewrite.c><br />
RewriteEngine On<br />
RewriteCond %{HTTP_HOST} ^example\.com$ [NC]<br />
RewriteRule ^(.*)$ http://www.example.com/$1 [R=301,L]<br />
</IfModule><br />
As well, be sure to change example.com to your domain name.<br />
If
you are wondering whether to use a www or non-www, it doesn’t matter.
Personally, I have used both on websites, but tend to favor non-www.adadahttp://www.blogger.com/profile/01715062176149108081noreply@blogger.com0