I came to this session to hear abouts splog and what Google and others were going to do about splog prevention (apparently nothing yet). I was also inerested how feeds affected organic listings as I’ve personally experienced some problems. What I got instead was a bunch of people developing business sites that needed redesign and/or clean up, certainly necessary, but not as interesting to me.
Jon Glick, Become.com
What is duplicate content a problem?
Google, Yahoo, Open Directory Project…
Confusing the Bot: Dynamic URLs
Confusing the Bot: 2 URLs
Don’t confuse the spider – chose one canonical domain and link all internal pages
301 redirects, your hero…
Yahoo! has transparency on whether your site is banned, check it out.
Shari Thurow, Grandtastic Designs
What is duplicate content?
The definition is unclear.
Search Engines do not want duplicate or near-duplicate content in their indices.
Duplicate content filters:
– content properties
– linkage properties
– content evolution
– host name resolution
– shingle comparison
siteexplorer.yahoo.com
example: 3 web pages, 3 unique URLs – robots.txt excludes the duplicate content or meta tag can do the same thing
Duplicate content is often copyright infringement.
copyscape.com