How to gain an audience, avoid other people from usurping your ideas, and avoid duplicate content filters.
Seeding articles, graphs, PDF documents, research, charts, images, videos and quotes to other websites is an excellent way to increase awareness of your website. You will be inserting your brand into a natural source of information and can reach users at a critical point in their decision making process.
When seeding your content the key is to get links back to your site from half a dozen sites. Without this quantity your efforts could be lost.
Doing Content Wrong
Find trending keywords
Get low paid writers to mashup content and rewrite words
Semi automated process – publications still need writers and editors like McDonalds needs cows
You may find articles written by Mr. Al Gorithm
Machines can’t differentiate between “time flies like an arrow” & “fruit flies like a banana” but some people can overlook this
Automated baseball sports writing program exists – developed by Northwestern University. It gets the facts but not meaning of games.
Doing Content Right
Create a great article
Find trusted, high quality, relevant sites to share it with
Ezines
Blogs
Niche websites
Small publications
Industry specific websites
Share with at least 5 sites and have them all link back to your website. If you only share content with 3 sites there is a chance they will outrank you for it.
If you syndicate your content on other sites, Google will always show the version we think is most appropriate for users in each given search, which may or may not be the version you’d prefer. However, it is helpful to ensure that each site on which your content is syndicated includes a link back to your original article. You can also ask those who use your syndicated material to use the noindex meta tag to prevent search engines from indexing their version of the content.
With this strategy you will never have to worry about other people stealing your content again, as you will always rank #1 for your work.
Misleading linkbait title aside, the real title of this video is “Can you benefit from content scraped from your website”. The premise is that if some content is copied automatically from your site, presumably this will include links back to you.
So when you prepare content make sure it includes a lot of intra-site links, and even links using your full URL. So instead of linking to /seo you should link to http://www.netpaths.net/seo to have links picked up on other sites. This is also important for your blog, which may likely be read in a 3rd party feed reader.
Im very impressed that Mr. Cutts took such a bold public stance on content scraping, and here are possible meanings:
Google is tired of people including them in lawsuits over content scraping. Google is effectively saying there are some (possibly negligble) value to content scraping, so please don’t make us defend ourselves while you sue your competitor.
Dulplicate content is such a big problem, Google is waving the white flag and giving up on trying to control blackhat spammers. They are effectively saying: go ahead and scrape, the legitimate sites will always win. Generally this is true, as the authoritative site being copied generally has a higher trust ranking than the copier.
Google is trying to appease newspapers (and the AP specifically) and educate them about the web. Any large news event will have thousands of articles written about it, with the majority of the articles being repurposed content. Google is telling the major newspapers and news bureaus that they should not be afraid of having their good content distributed, as they get links back as a reward for sharing and this will help them build their online authority.
Matt Cutts is the webspam god of Google, and what he says goes. He is the head PR person that interacts with search engine optimization firms In person he is a very kind and fair man, someone you would enjoy sitting down to have a beer with and discuss gadgets, code and algorithms. It is essential to listen to everything he says if you want good search engine rankings, and even more important to read between the lines to understand how the search engine functions.
Straight from Google webmaster central, the search engineers claim that they correctly identify duplicate content and scraped text 99% of the time.
Sven of the Search Quality Team post about the underlying concerns detailing how Google handles scraped content.
Duplicate content within your domain can be controlled & recommends that you require a link back from the syndicated content site. To make sure your site is identified as the original source, ensure a backlink is present.
Cross Domain / Scraped duplicate content – Ensure your site is following all Google guidelines. If scraped content ranks higher than your site, the problem may be a technical issue on your end. check that the content is not blocked by robots.txt, examine your sitemap file & check if the site has any errors or warning flags in Google Webmaster Central.
They post this rather un-assuring paragraph:
When encountering such duplicate content on different sites, we look at various signals to determine which site is the original one, which usually works very well. This also means that you shouldn’t be very concerned about seeing negative effects on your site’s presence on Google if you notice someone scraping your content.
So Google will correctly identify cross domain content problems with a 99% success rate, but the real problem is when the Google algorithm fails.
Graywolf & GWC have more about the search engine’s continues battle with an overflowing array of content.
Recent Comments