<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Merjis Internet Marketing Blog &#187; spamfighting</title>
	<atom:link href="http://blog.merjis.com/category/spamfighting/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.merjis.com</link>
	<description>Effective Internet Marketing Strategy and Tactics Through Test</description>
	<lastBuildDate>Thu, 12 Jan 2012 09:18:42 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Google Hates Undeclared Paid Backlinks</title>
		<link>http://blog.merjis.com/2011/10/01/google-hates-undeclared-backlinks/</link>
		<comments>http://blog.merjis.com/2011/10/01/google-hates-undeclared-backlinks/#comments</comments>
		<pubDate>Sat, 01 Oct 2011 14:58:46 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spamfighting]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=689</guid>
		<description><![CDATA[Tuesday, a client sends me an email asking if they should take advantage of an offer to buy a page on a directory on which they can create links. It&#8217;s accompanied by the email soliciting business, which includes a story about how a couple of businesses benefited, and the prices for pages at different tiers [...]]]></description>
			<content:encoded><![CDATA[<p>Tuesday, a client sends me an email asking if they should take advantage of an offer to buy a page on a directory on which they can create links. It&#8217;s accompanied by the email soliciting business, which includes a story about how a couple of businesses benefited, and the prices for pages at different tiers in the directory. Sounds reasonable, doesn&#8217;t it? Here&#8217;s a screen shot of the results page for the search &#8220;the best links&#8221;, which is the name of the domain that was selling links:</p>
<p><img src="http://blog.merjis.com/wp-content/uploads/2011/10/the-best-links-Google-Search.png" alt="Search for &quot;the best links&quot; on Tuesday" title="the best links - Google Search Tuesday" width="569" height="887" class="aligncenter size-full wp-image-690" /></p>
<p>But what was the number one listing on Tuesday and Wednesday has vanished by Friday:</p>
<p><img src="http://blog.merjis.com/wp-content/uploads/2011/10/the-best-links-Google-Search-2.png" alt="The best links have changed, markedly." title="the best links - Google Search-2" width="566" height="552" class="aligncenter size-full wp-image-691" /></p>
<p>What&#8217;s really remarkable is how thoroughly this site has vanished:</p>
<p><img src="http://blog.merjis.com/wp-content/uploads/2011/10/site_www.thebestlinks.com-Google-Search.png" alt="Site listing for the paid directory service" title="site:www.thebestlinks.com - Google Search" width="551" height="394" class="aligncenter size-full wp-image-692" /></p>
<p>It&#8217;s usual for a site to have the home page as the first page in the sitelinks list. For the home page to vanish&#8230; well, looks like Google hates this site. And I wouldn&#8217;t bet on any rank being passed.</p>
<p>Unsurprisingly, none of the Google penalty checking websites that I could find suggest that there&#8217;s a problem with this site. After all, it has results in the listings. Just very low ranked and none of the top pages (like the home page) show up. Sure sign of a problem of some sort!</p>
<p>So, what happened? Well, the site was probably reported to Google, possibly even before I saw the solicitation. And Google acted, quite quickly. </p>
<p>If you have a Google Webmaster Tools account, you can use <a href="https://www.google.com/webmasters/tools/paidlinks">Google&#8217;s Undeclared Paid Backlinks report form</a> to report sites that are using and offering undeclared paid backlinks. I don&#8217;t know what you do if you aren&#8217;t a registered webmaster &#8211; perhaps find one and ask them?</p>
<p>So, what happened to the companies mentioned in the story? I can&#8217;t find them, now, either. Certainly not on page one of results (I&#8217;ve used the non-personal search on our tool to <a href="http://merjis.com/local_google_search/" title="Google International Search">view Google organic and paid search as if in another country</a>)</p>
<p>Google really isn&#8217;t happy with undeclared paid links. At all.</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=689" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2011/10/01/google-hates-undeclared-backlinks/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Experimental Results: Blog Spammers Do Target Comments</title>
		<link>http://blog.merjis.com/2011/05/22/experimental-results-blog-spammers-target-comments/</link>
		<comments>http://blog.merjis.com/2011/05/22/experimental-results-blog-spammers-target-comments/#comments</comments>
		<pubDate>Sun, 22 May 2011 10:33:46 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[identity]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[spamfighting]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=451</guid>
		<description><![CDATA[I said that I&#8217;d be testing whether I had substantially reduced blog spamming on this blog&#8217;s articles, by changing the string that announced where to leave a comment or response to an article. The first thing to check is whether this blog, post-Caffeine and post-Panda, is still substantially where it used to be in search [...]]]></description>
			<content:encoded><![CDATA[<p>I said that I&#8217;d be testing whether I had <a href="http://blog.merjis.com/2010/04/08/blog-spammers-target-blogs/">substantially reduced blog spamming</a> on this blog&#8217;s articles, by changing the string that announced where to leave a comment or response to an article. The first thing to check is whether this blog, post-Caffeine and post-Panda, is still substantially where it used to be in search rankings. And, yes, we&#8217;re still showing up for the same activity based searches &#8211; questions about &#8220;gclid&#8221;, and niche questions about Google Analytics and AdWords Conversion Tracking, AdWords Geotargeting, etc. So the search engines are still viewing this blog in substantially the same way. </p>
<div id="attachment_584" class="wp-caption aligncenter" style="width: 610px"><a href="http://blog.merjis.com/wp-content/uploads/2011/05/Comments-‹-Merjis-Internet-Marketing-Blog-—-WordPress-1.png"><img src="http://blog.merjis.com/wp-content/uploads/2011/05/Comments-‹-Merjis-Internet-Marketing-Blog-—-WordPress-1.png" alt="I found your article from Altavista and it is eye-popping. Thank you for sharing such an incredible article." title="Comments ‹ Merjis Internet Marketing Blog — WordPress-1" width="600" height="55" class="size-full wp-image-584" /></a><p class="wp-caption-text">&quot;Flam&quot; - Flattery Spam example, praising the comment policy as if it was an article.</p></div>
<p>Next up, has the nature of spamming changed? I believe that it has. On this blog, and clients&#8217; blogs and generally across the internet, I&#8217;m seeing a lot more &#8220;flam&#8221; &#8211; flattering spam. &#8220;You&#8217;re article really helped me with this difficult topic, thanks, I&#8217;ll be linking to you and subscribing to your feed&#8221; &#8211; stuff like that. Contributes nothing but a feeling of pleasure&#8230; and sometimes even sneaks past Akismet, if the author has managed to lull the suspicions of bloggers. </p>
<p>Of course, the acid test is whether my attempts to deflect spammers to the Comment Policy page have been effective. </p>
<h2>Results</h2>
<p><div id="attachment_577" class="wp-caption alignright" style="width: 147px"><a href="http://blog.merjis.com/wp-content/uploads/2011/05/Comments-‹-Merjis-Internet-Marketing-Blog-—-WordPress.png"><img src="http://blog.merjis.com/wp-content/uploads/2011/05/Comments-‹-Merjis-Internet-Marketing-Blog-—-WordPress.png" alt="Spam list, showing most spam targets the Comment Policy now." title="Comments ‹ Merjis Internet Marketing Blog — WordPress" width="137" height="894" class="size-full wp-image-577" /></a><p class="wp-caption-text">This WordPress blog&#039;s Comment Policy attracts most spam, now.</p></div><br />
They have. About 90% of all spam being left here for the last year, is being left on the honeypot comment page. Usually telling me that I&#8217;ve handled a difficult topic well and that I should &#8220;write more on this topic&#8221;, the comments are dead giveaways that they didn&#8217;t read or understand the Comment Policy page. Non-contributive spam. Yay!</p>
<p>After seeing the trend towards &#8220;flam&#8221;, I&#8217;ve gone back throgh comments here (and on client sites) and made sure that every comment is substantive. That is, it contributes to the material in the posting and would be helpful to other readers. </p>
<p>That in turn, raises the question of what to do about those rare cases when someone says something useful, but then leaves a keyword loaded username. IMO, the best thing to do is to re-write the name as something more sane. After all, the comments here are no-followed, and it is, bluntly, rude to post under a keyword rather than your own name &#8211; not to mention being outside the guidelines for marketing laid down by the Advertising Standard Authority and described in the <a href="http://www.tsoshop.co.uk/bookstore.asp?FO=1160041&#038;ProductID=9780117064102&#038;Action=Book">Code of Advertising Practice</a>.</p>
<h2>Recommendations</h2>
<p>If you run a blog, there are three levels of protection to consider, basically depending on the volume of spam you get &#8211; which in turn is a reflection of your sites&#8217; popularity with search engines (more on that, in another article I have in progress).</p>
<p>We have a general recommendation which is that marketeers have to reduce the barriers to engagement. Philosophically, I&#8217;m opposed to the idea of making it more difficult for a user to add a comment to a blog, to defend from a spammer. It&#8217;s just wrong. The burden should be pushed to the spammer, not the user. So, wherever possible on any form, we avoid CAPTCHA (which is, itself, subject to easy attack from Mechanical Turk style systems &#8211; use automation to find the CAPTCHA and just push the images to paid-by-activity humans who&#8217;ve signed up for the service). </p>
<p>If your blog is not popular (up to tens of visitors per day), then basic tools like Akismet are absolutely fine &#8211; though too infrequently used on small blogs, IMO. You&#8217;ll probably get no more than a one or two &#8220;flams&#8221; per day &#8211; and the rule is &#8220;if the comment could have been applied to any article you&#8217;ve ever written, it is probably spam&#8221;.</p>
<p>If your blog is moderately popular &#8211; hundreds of visitors per day &#8211; then Akismet or other crowd-sourced or Bayesian detectors are probably not quite enough; consider using a honeypot to distract spammers to a page or posting where you know that the overwhelming majority of comments will be spam. This, too, should keep the count of suspect spammy comments that need attention down to a handful per day. </p>
<p>If your blog is seriously popular &#8211; well, you can probably afford to have a few annoyed users who can&#8217;t get through the defences. Very popular blogs might want to consider both a crowd-sourced and/or Bayesian and CAPTCHA solution of some sort. That&#8217;s not as extreme as some US magazines who have <a href="http://blog.merjis.com/2010/02/03/pc-world-online-subscriptions/">ludicrously tough comment defence</a>, added in pursuit of finding candidate new subscribers, rather than spam reduction, I suspect, and an illustration of the maxim about reducing the barriers&#8230; I cant be bothered to sign up, when they make it so difficult. </p>
<p>That &#8220;reducing the barriers&#8221; stuff is really important. Take a look at Bob Cialdini&#8217;s Influence &#8211; an old book, now, but still germane as it deals with fundamentals of human behaviour. Until you&#8217;ve given something of value to someone, they are unlikely to give you much of value. So asking for a full name and address and other contact information, just to drop a comment, is fabulously in excess of what most people will be comfortable to do. In the wake of the Sony hacking, users should also be becoming more aware that leaving details on publicly addressable sites is dangerous. And thats compounded by poor security practices from large organisations who confuse users&#8217; ability to detect safer and less safe places to submit details (see my earlier article on Google&#8217;s absurd inability to help users identify <a href="http://blog.merjis.com/2011/03/11/google-authentication-identity-confusion/">which are trustworthy Google URLs and which URLS can be abused by scammers</a>).</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=451" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2011/05/22/experimental-results-blog-spammers-target-comments/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google Sidewiki Made More Obvious &#8211; But Still a Tech Not A Benefit</title>
		<link>http://blog.merjis.com/2010/09/05/google-sidewiki-obvious-tech-benefit/</link>
		<comments>http://blog.merjis.com/2010/09/05/google-sidewiki-obvious-tech-benefit/#comments</comments>
		<pubDate>Sun, 05 Sep 2010 15:51:23 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[spamfighting]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=477</guid>
		<description><![CDATA[The Sidewiki, introduced almost a year ago, looked like an interesting tool and a way to allow users to interact with a site without on-site commentary enabled. Accessed via an unobtrusive button in the Chrome browser or via a Google Toolbar, it was all but completely invisible to most users, and rarely used. This new [...]]]></description>
			<content:encoded><![CDATA[<p>The Sidewiki, introduced almost a year ago, looked like an interesting tool and a way to allow users to interact with a site without on-site commentary enabled. Accessed via an unobtrusive button in the Chrome browser or via a Google Toolbar, it was all but completely invisible to most users, and rarely used. This <a href="http://googlecode.blogspot.com/2010/09/new-sidewiki-sidebar-web-element.html">new announcement</a> makes the annotation tool more useful, by allowing site owners to render the content. </p>
<p>I have my doubts that the Sidewiki sidebar will be rendered on anything other than informational sites with UGC &#8211; which, by and large, will already have a mechanism for users to contribute and so will have no need for a Sidewiki. The fear amongst business site owners will be that users with a grudge will control the conversation. There&#8217;s plenty of evidence that suggests that users with a good experience of a business are much less likely to tell others, than users with a bad experience &#8211; in a ratio of about in in 12 (users with a bad experience are much more likely to tell others). So even if your business manages to annoy only a small fraction of users, they&#8217;ll be a lot more visible than the small fraction who are ecstatic about your business.</p>
<p>Unless the Sidewiki enables a clearly valuable business interaction &#8211; one that doesn&#8217;t leave site owners worrying that Google is controlling the agenda, and one that doesn&#8217;t overemphasise the negative commentary &#8211; then Sidewiki will remain a sideline. I think the idea of UGC paralleling the site content is probably the right sort of thing to be thinking about. I suspect that the current implementation and interaction model aren&#8217;t enough to satisfy business owners to make the Sidewiki an integral part of the site&#8217;s experience. </p>
<p>I&#8217;d fear it being added to, for example, the NHS web site. The amount of quackery and bogus advice that could be offered would make me nervous about the value. </p>
<p>So, for now, Sidewiki continues to be a technology in search of a solution. It appears to come from a philosophy of user interaction that isn&#8217;t matched by the real world, that users will contribute overwhelmingly. That may be true for sites engineered to make that happen (Hub Pages, Wikipedia, etc). But sources of tested information (NHS, NIH) and of commercial information (product details, sales, etc) can suffer in the face of unqualified, biased or defamatory comment. That doesn&#8217;t mean that Sidewiki is always useless, but does mean that the philosophy needs to be exposed as having a benefit to the site owner, many of whom will see nothing but risk when they look at the tool. Another case of Google mistaking technology for product &#8211; as with Wave.</p>
<p>And the blog announcement itself doesn&#8217;t do a lot to encourage confidence. What&#8217;s the first comment? A series of spammy links. How long would it be before some twit starts dropping spammy Sidewiki comments on sites, in order to benefit from their PageRank? Given the rest of the mythology surrounding SEO, it won&#8217;t be long after Sidewiki comments are seen, that there&#8217;s be someone selling Sidewiki spamming as a service, off the back of user ignorance that the Sidewiki domain is not that of the website&#8230;<br />
<a href="http://blog.merjis.com/wp-content/uploads/2010/09/Google-Code-Blog_-New-Sidewiki-“Sidebar”-web-element1.png"><img src="http://blog.merjis.com/wp-content/uploads/2010/09/Google-Code-Blog_-New-Sidewiki-“Sidebar”-web-element1.png" alt="Google Code Blog, Now With Blog Spam" title="Google Code Blog_ New Sidewiki “Sidebar” web element" width="561" height="810" class="aligncenter size-full wp-image-480" /></a></p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=477" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/09/05/google-sidewiki-obvious-tech-benefit/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Blog Spammers Target Blogs</title>
		<link>http://blog.merjis.com/2010/04/08/blog-spammers-target-blogs/</link>
		<comments>http://blog.merjis.com/2010/04/08/blog-spammers-target-blogs/#comments</comments>
		<pubDate>Thu, 08 Apr 2010 07:04:01 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[spamfighting]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=343</guid>
		<description><![CDATA[Hardly news, but people who spam blogs, use search to look for them. So is there anything that can be done to reduce the attractiveness of a blog to spam? Have a look at this recent sample of keywords that lead to this blog, taken from Google Analytics. Spot anything? Yup, a large fraction of [...]]]></description>
			<content:encoded><![CDATA[<p>Hardly news, but people who spam blogs, use search to look for them. So is there anything that can be done to reduce the attractiveness of a blog to spam? Have a look at this recent sample of keywords that lead to this blog, taken from Google Analytics. Spot anything?<br />
<a href="http://blog.merjis.com/wp-content/uploads/2010/04/Keywords-Google-Analytics.jpg"><img src="http://blog.merjis.com/wp-content/uploads/2010/04/Keywords-Google-Analytics.jpg" alt="" title="Keywords - Google Analytics" width="449" height="730" class="alignnone size-full wp-image-342" /></a><br />
Yup, a large fraction of the searchers are looking for the default text that WordPress drops at the bottom of an article, inviting user responses. The keywords are, of course, also skewed by the content of the blog. If I hadn&#8217;t used footwear examples in some early articles, I wouldn&#8217;t have those keywords in the searches. </p>
<p>The searches are also amusing, because this <a href="http://blog.merjis.com/comment-policy/">blog has a comment policy</a>. Searches looking for text indicating that this is a blog that accepts user generated content, will tend to find that policy page. In turn, that means that I get a lot of flattering comments about the article&#8230; posted to the comment policy. Bit of a tell that the commenter isn&#8217;t really engaged. </p>
<p>What&#8217;s also interesting is where people are coming from. Mostly China, India, the US. </p>
<p>So, is there anything that can be done, apart from using Akismet and other despamming tools, to help reduce blogspam? Might be worth changing the text, or possibly replacing the text with a graphic, to indicate where a comment should be left. That&#8217;d reduce the search volume coming to the blog from people intent on spamming. However, that&#8217;s essentially a defence based on security through obscurity &#8211; I&#8217;m going to experiment with changing that text and see what it does to searches leading to this blog. :)</p>
<p>I will, of course, continue to use the comment policy page as a great detector of spam. All those people telling me how great the article is, submitting to the comment policy page, have completely signalled their lack of interest in contributing! Like all these spammy comments, mostly targeting a few choice pages:<br />
<a href="http://blog.merjis.com/wp-content/uploads/2010/04/Edit-Comments-‹-Merjis-Internet-Marketing-Blog-—-WordPress.gif"><img src="http://blog.merjis.com/wp-content/uploads/2010/04/Edit-Comments-‹-Merjis-Internet-Marketing-Blog-—-WordPress.gif" alt="List of articles that spammers target, showing the popularity of the comment policy" title="Edit Comments ‹ Merjis Internet Marketing Blog — WordPress" width="220" height="664" class="alignnone size-full wp-image-372" /></a><br />
That sample is moderately representative &#8211; about half the spam submitted, is submitted to the comment policy page.</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=343" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/04/08/blog-spammers-target-blogs/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>SEO: Click Through Rate and Bounce Rate</title>
		<link>http://blog.merjis.com/2010/03/26/seo-click-through-rate-and-bounce-rate/</link>
		<comments>http://blog.merjis.com/2010/03/26/seo-click-through-rate-and-bounce-rate/#comments</comments>
		<pubDate>Fri, 26 Mar 2010 21:16:53 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[adwords]]></category>
		<category><![CDATA[click fraud]]></category>
		<category><![CDATA[intent]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spamfighting]]></category>
		<category><![CDATA[usability]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=340</guid>
		<description><![CDATA[I&#8217;m going to take issue with Rand Fishkin of SEOmoz. I think his most recent White Board Friday video is just plain wrong. Normally, I have a lot of respect for what SEOmoz does, but I think the advice and implications are not just wrong, but dangerously wrong. How Does Google Rank Results I don&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m going to take issue with Rand Fishkin of SEOmoz. I think his most recent <a href="http://www.seomoz.org/blog/whiteboard-friday-influence-of-usage-data#ergabbj-threttuy">White Board Friday</a> video is just plain wrong. Normally, I have a lot of respect for what SEOmoz does, but I think the advice and implications are not just wrong, but dangerously wrong.</p>
<h2>How Does Google Rank Results</h2>
<p>I don&#8217;t know all the details. Rand doesn&#8217;t know all the details. Some guys at Google know a lot of the factors. Matt Cutts, Google&#8217;s head of the search quality team, claims over 200 factors go into ranking. </p>
<p><object width="580" height="360"><param name="movie" value="http://www.youtube.com/v/muSIzHurn4U&#038;hl=en_US&#038;fs=1&#038;rel=0&#038;border=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/muSIzHurn4U&#038;hl=en_US&#038;fs=1&#038;rel=0&#038;border=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="580" height="360"></embed></object></p>
<p>What we do know is that backlinks &#8211; credible links regarded by Google as likely for a search user to visit &#8211; are important. We know that anchor text is important. There&#8217;s some other factors that we know influence Google ranking.</p>
<h2>What *else* do we know?</h2>
<p>We (professional search engine optimisation people) know that on-page content is valuable. For low competition keywords &#8211; keywords where there aren&#8217;t a lot of links and anchor text, and hardly anyone searches &#8211; then page content is enough. Look at the example in the graphic below. There&#8217;s precisely one page on the internet, with that text for something that I can&#8217;t find on Google. When I wrote that, it was true; if you search now, you&#8217;ll find that page. Well, until some spoiler copies it elsewhere&#8230;</p>
<div><a href="http://skitch.com/jezchatfield/n59q8/jeremy-chatfield-google-profile"><img src="http://img.skitch.com/20100326-f3g482f6c6a8ew6js18eey13ry.jpg" alt="Jeremy Chatfield - Google Profile" width=600 /></a></div>
<p>However, try putting the word &#8220;bad credit loan&#8221; on a page on a new web site with some other relevant and unique content, valuable to a user, and see how high you rank for the term. You can wait. And wait. And wait. You&#8217;re not going to show up on the first page of results, just by having a great page alone. It&#8217;s not just the content, it&#8217;s the backlinks that make the difference. </p>
<p>So we now know, as a result of this test, that while Google does pay attention to on-page factors, they also pay attention to backlinks. And in competitive spaces, *effective* <a href="http://www.google.com/support/forum/p/Webmasters/thread?tid=1c377284e24be6db&#038;hl=en" title="Amusing thread about 'Best SEO Company Search Engine Placement'">backlinks count for more than the page content</a>. </p>
<p>The important message to understand from this is that different factors apply under different conditions. Content alone won&#8217;t put you on page one. Backlinks alone won&#8217;t keep you there.</p>
<h2>Click Through Rate and Bounce Rate</h2>
<p>So, at some scale, do CTR (Click Through Rate) and Bounce Rate make any difference? I believe they do, and this blog is a testament to that. Look at this screenshot.</p>
<div><a href="http://skitch.com/jezchatfield/n59t5/content-detail-google-analytics"><img src="http://img.skitch.com/20100326-qf7ibj264gmipd2fxg512jxd1m.jpg" alt="Content Detail: - Google Analytics" width=600 /></a></div>
<p>That&#8217;s a Google Analytics shot of the last 15 months activity for a specific page on the Merjis blog. It&#8217;s all about &#8220;<a href="http://blog.merjis.com/2007/07/16/click-fraud-google-adwords-and-gclid/">gclid</a>&#8221; &#8211; something you&#8217;ll probably care about if you do paid search and look in web server logfiles. </p>
<p>I&#8217;m using this blog as an example, because I&#8217;ve been using it for tests for years &#8211; I know how it works, and it isn&#8217;t confidential client data. I can reveal the usage, because I have my own reasons for running a blog, and few of them directly have anything to with making money.</p>
<p>Most other pages on this site get a profile like this other example:</p>
<div><a href="http://skitch.com/jezchatfield/n5917/content-detail-google-analytics"><img src="http://img.skitch.com/20100326-rs3mrt1rbg886putn3jitpmdnh.jpg" alt="Content Detail: - Google Analytics" width=600 /></a></div>
<p>This is pretty typical for a &#8220;newsy&#8221; blog article. Usage on the day that it is written, and a dribble thereafter. It then usually dries up after a few weeks, because the rank has decayed with time. </p>
<p>So why, with a higher bounce rate, does the older article do better than the newer article in rankings? If Bounce Rate is important, then surely the lower bounce rate in a newer article must mean that Google should drop the older article?</p>
<p>I suspect that Google doesn&#8217;t have a rigid number. They look at how well you do relative to other sites. And especially, they look to see whether search users search again for the same or very similar searches. Read that article on SideWiki, and it&#8217;s lightweight. No real information. No real recommendations. The long lived article on gclid has a much higher bounce rate <i>and longer reading time</i>. It&#8217;s the reading time that&#8217;s the clue. When you&#8217;ve read my article on gclid, you probably don&#8217;t want to read another article about gclid. It&#8217;s reasonably definitive.</p>
<p>Google sustains that old article in search results, despite its&#8217; great age, and despite a high bounce rate, because those users who do read it, value it. It&#8217;s there, because it helps Google to deliver a page of search results that users value more than *without* that article present. </p>
<h2>Uh &#8211; You Didn&#8217;t Mention CTR</h2>
<p>Again, I don&#8217;t think it is actually CTR that Google is looking for. It is user satisfaction. So a high CTR, caused by a misleading piece of copy, won&#8217;t help. You have to deliver what you offer. Again, I don&#8217;t think that Google is measuring conversion, either. But a high CTR message with a high conversion rate, meaning that users are highly satisfied &#8211; that&#8217;s what Google wants you to make. </p>
<p>You won&#8217;t be directly rewarded for high CTR &#8211; but you can measure it (especially if you also run PPC and can get the impression rate). You won&#8217;t be rewarded directly by Google for high conversion rates. But Google does appear to prefer sites that answer the question posed by the search query. And the proxy that can be used by Webmasters, who don&#8217;t have access to Google&#8217;s richer data, is their own performance, as CTR and Conversion Rate. Increase those, and you are more likely to increase position.</p>
<h2>Interaction of Factors</h2>
<p>If you have a good site, with highly relevant content, you tend to get more links. So disentangling backlinks, and the immeasurable relative user satisfaction, is difficult. Pretty much the only way that I know it can be done, is when you have web sites with accidental misbehaviours that create the right conditions for a test. The technical problems that create the conditions are rare &#8211; and recreating them in a real website is likely to decrease the performance. It&#8217;s unlikely that anyone will give you the opportunity to mess up their site, just to prove what works.</p>
<p>However, if you want to go about it&#8230; Here&#8217;s what I think you&#8217;ll need:</p>
<ul>
<li>A visibly horrible page, with a low conversion &#8211; as your starting point</li>
<li>Weak Title and Meta Description as a starting point</li>
<li>A lot of visitors per day &#8211; it takes a long time to demonstrate, otherwise</li>
<li>The ability to make sitewide link changes to the page under test</li>
<li>Good backlinks &#8211; you&#8217;ll want to know that you *could* rank well on page one</li>
</ul>
<p>Change the URL for your horrible page, sitewide. Wait for Google to find it and rank it again. Note the position. Watch the position fall over a period of a week or two (depending on visitor volume). Now improve the page, and switch the URL again and wait for Google to find and rank it. Then watch the rankings change and note which way they go. Now revert the page and switch URLs again, and this time change the Title and Meta Description. Now watch the ranking changes. Now fix up the page again and once more switch the URL and watch. </p>
<p>You should, IME, find that you achieve a higher long term position when you have a better title and description, and a higher converting page with a lower bounce rate. If you can explain why you *shouldn&#8217;t* get a higher position with a site that is better for users, I&#8217;d love to know the reasons. But don&#8217;t make your explanation involve &#8220;gaming&#8221; the system. </p>
<p>And, FWIW, I don&#8217;t believe that the Title and Description are important, as direct factors for SEO. You can rank perfectly well for keyword free pointless titles, and descriptions without keywords that are positively turgid and rambling. However, show the user that you are focused on solving their problem, and your CTR increases; and if you are focused on the user, you&#8217;ll probably have a reasonable landing page, which will engage and convert better. Google&#8217;s not going to reward you for a better snippet, directly, but for a better user experience. Your only measures though, will be what you can observe &#8211; CTR, Bounces, Conversions. If I could tell you to look at the &#8220;re-query rate&#8221;, I&#8217;d tell you to do so &#8211; instead, you&#8217;ll have to use the information you can get.</p>
<h2>Implications For SEO</h2>
<p>If a blog article can decay to little traffic in a few weeks, or sustain rankings for years, on the same blog, with the same blogging software, then the difference must be backlinks? Well, not substantially. Over the years, I&#8217;ve had more backlinks to newsy stories, but still this &#8220;gclid&#8221; article keeps ranking. And all the time, the other lighter weight articles just keep falling out of the listings. </p>
<p>There&#8217;s a few other similar articles on this blog that rank, and stay high for years and years and years. Non-competitive searches, but of long lasting traffic value. And the other sites that I&#8217;m competing with, for attention, are large forums. High weight. Much more frequently updated content. I&#8217;m *deliberately* not trying to place links for articles. Just letting what happens, happen &#8211; so I can understand why it happens. So there&#8217;s no contamination effects here with deliberate link placements. </p>
<p>What are the articles? They all tend to be like that gclid article. Something that is detailed, informative, and means that you can go away and do something. Useful articles, in other words. Harder to write than &#8220;straight news&#8221; articles, as you need unique content, written to address the audience. That&#8217;s part of my reason for writing &#8211; attempting to develop clearer communication.</p>
<p>The clear implication is, I think, that useful content matters. And how do we know it is useful? It&#8217;ll show up in search engine rankings, usablility data and other disturbingly hidden and arcane resource. Google will reward useful content with a better sustained rank &#8211; but won&#8217;t put you on page one just because you have a great article, unless you have some backlinks to create credibility. </p>
<h2>But How?</h2>
<p>Rand makes the point that data about use can be gamed. But so can backlinks. That&#8217;s the major part of undeclared paid backlinks, small world building, and other &#8220;black hat&#8221; techniques. We know that Google sees through most black hat techniques, given time. </p>
<p>We also know, or can find out about, Google&#8217;s interest in invalid impressions and invalid clicks. For example, invalid impressions are generated when search engine ranking tools are run &#8211; they reduce the effective CTR. Invalid clicks are generated when users double click, or are paid to click. Just as with paid search, these two types of invalid activity are measurable by Google. In fact, Google can measure a lot more than a webmaster can see. </p>
<p>We webmasters only get to see bounce rates and conversions. Google gets to look at whether users search again. Much more valuable. If you want to build the worlds&#8217; best search engine, then you want to feature the results that tell you that you&#8217;ve got a winning page &#8211; pages where users don&#8217;t need to search any more. Results that have users positively selecting that site again, when they see it in listings. Webmasters just don&#8217;t have that detail, directly. We just don&#8217;t know if the other guy answers better &#8211; unless we expend effort to learn our customers&#8217; minds and make sure we have the best answer.</p>
<h2>User Experience</h2>
<p><a href="http://www.google.com/corporate/tenthings.html">Google&#8217;s Ten Things</a> lists, first, &#8220;Focus on the user&#8221;. The results from this blog, and from other client activities that I&#8217;m not going to reveal in any detail, are fairly clear. Content that Google can measure as being liked by users, rank better and longer than content that is spammy, tedious and weak. The factors that lead to better rankings will include appropriate Titles and Descriptions and engaging content. It has to be, or rule 1 is broken.</p>
<p>We know that Google has experience of measuring impressions and data to look for invalid data. We know that Google is pretty good at it &#8211; or there&#8217;d be more click fraud problems with AdWords. So, if it can be done, and it is an important indication of quality, why wouldn&#8217;t Google use searchers behaviour to modify results, not just personally, but across the index?</p>
<p>Why can&#8217;t you improve the results when you click on your own listings? Because it is identical behaviour to the banned AdSense practice of clicking on adverts on your own site. Detectable. Invalid. Not counted. And for reasons that I don&#8217;t want to go into, I believe the same will be true of botnets and eLance and Mechanical Turk attacks. There will be a signature associated with them, that doesn&#8217;t match normal user behaviour. The signatures can be spotted and countered, by assigning the activities as invalid &#8211; just as it is in AdWords. Since AdWords continues to run without being infested with click fraud to unusable levels, we have a working system, on a global scale, that shows that user behaviour can be extracted from noisy fraudulent behaviour. </p>
<p>It isn&#8217;t perfect, true, but it separates AdWords from being a system that solely acts to transfer advertising funds to thieves, into a system that, more often than not delivers prospective buyers to an advertiser&#8217;s site. It isn&#8217;t perfect, but it works well enough. <b>AdWords only works because it identifies and categorises user behaviour.</b></p>
<p>User behaviour categorisation works in one system that Google has, worldwide, on a service with measurable economic value. Why wouldn&#8217;t it be usable in organic search results?</p>
<h2>Conclusions</h2>
<p>Failing to identify and understand user interests is an SEO mistake. These are reflected by (but are not completely explained by) CTR and Bounce Rates &#8211; because that&#8217;s about the best that Webmasters can get. Google doesn&#8217;t have to use those &#8211; they have better numbers that are more meaningful to user experience. But saying that &#8220;Google doesn&#8217;t use bounce rates&#8221; is not the same as saying &#8220;Google doesn&#8217;t take account of user behaviour&#8221;.</p>
<p>Unlike Rand, I believe that Google cares very deeply about the user experience, and that Google has very sophisticated technology, probably shared with the Google AdWords guys, to identify unusual search behaviours and exclude them from consideration. </p>
<p>Given enough data, probably gained from multivariate testing on all the different data centres, Google can identify whether users are more, or less, satisfied by different ordering in search results than a pure backlinks-plus-content model would give.</p>
<p>Small scale tests probably won&#8217;t show anything about user interaction &#8211; because the activity doesn&#8217;t have statistical significance or because the signature of strange search activity is too obvious. So, don&#8217;t try faking it &#8211; if you&#8217;ve read this far, you probably aren&#8217;t smart enough to outwit Google&#8217;s teams of click-fraud defence guys. They are really pretty good, as anyone with a rational assessment of AdWords click fraud levels will tell you. Not perfect, but good enough to make the effort of using AdWords worthwhile, rather than primarily a way of siphoning your advertising funds to fraudsters. :)</p>
<p>Why do I say &#8220;if you&#8217;ve read this far&#8221;? Because if you really knew how to hide click streams, you&#8217;d be doing it with AdSense. And you&#8217;d have stopped reading at that point &#8211; because you own the game already. If you can&#8217;t own that game, you can&#8217;t own the game of spoofing user behaviour in organic search &#8211; it is (not identical to, but close enough to) the same game. At the moment I don&#8217;t understand why you&#8217;d bother with SEO behavioural spoofing, if you&#8217;d gamed AdSense, because the revenue is a lot more direct&#8230; Maybe that&#8217;s why Rand hasn&#8217;t spoken with any black hatters that have cracked it? </p>
<p>And if Google can detect unusual impression and click data, then they can fulfil their primary mission, with respect to <b>modifying</b> organic rank based on real user data about preferences and satisfaction. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=340" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/03/26/seo-click-through-rate-and-bounce-rate/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Duplicate Content, And Blog Spammers</title>
		<link>http://blog.merjis.com/2010/02/28/duplicate-content-and-blog-spammers/</link>
		<comments>http://blog.merjis.com/2010/02/28/duplicate-content-and-blog-spammers/#comments</comments>
		<pubDate>Sun, 28 Feb 2010 11:39:32 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[bing]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[spamfighting]]></category>
		<category><![CDATA[trust]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=322</guid>
		<description><![CDATA[Around half the people I talk to, about Search Engine Optimisation, are terrified of duplicate content on their own websites. But 80% or more of the spam that I see, is massively duplicated. Why do real site owners, with valuable content but multiple paths to it, fear duplicate penalties from Google, but spammers who endlessly [...]]]></description>
			<content:encoded><![CDATA[<p>Around half the people I talk to, about Search Engine Optimisation, are terrified of duplicate content on their own websites. But 80% or more of the spam that I see, is massively duplicated. Why do real site owners, with valuable content but multiple paths to it, fear duplicate penalties from Google, but spammers who endlessly duplicate useless garbage, can fearlessly sell link building services to terrified site owners, on the basis of their ability to massively distribute spammy duplicate links?</p>
<p>I think the reason is that site owners who &#8220;invest&#8221; in spammy paid link purchasing rarely deeply understand what they are buying. And link spammers don&#8217;t really care whether what they do is effective, so long as there are people prepared to buy, and so long as Google and Bing mistake the links as being valid in the early days. Businesses usually evaluate the impact of an activity fairly early &#8211; so if they are told that the search engine impact will be most visible a few weeks or months after starting, then that&#8217;s when they&#8217;ll measure.  A spammy link buyer will keep buying for years, because the impact is positive at the time of the measurement &#8211; the value declines with time as Google and Bing detect the patterns of spamming. </p>
<p>The very worst link spammers will submit your site to places that are already known to the search engines as places for low quality links, and already offering no value. So the most value that you get from the service, is a list of places that won&#8217;t have any impact&#8230; Useful if you need to go back and clean up the spammy links, later.</p>
<p>Take, for example, this piece of spam, submitted to this blog:</p>
<p><img src="http://img.skitch.com/20100211-n1ft4f9gtgi3jwjjkh8abc8sh6.jpg" alt="Merjis Internet Marketing Blog â€º Edit Comments â€” WordPress" width=600 /></p>
<p>It looks appropriate. It&#8217;s about H1N1 and it has been submitted as a comment to an article about H1N1. But the URL given is for a product, even though the name offered is not a keyword. Is there any way to tell that it is spam? We could search for a key piece of text that seems unlikely to be in other comments. And here&#8217;s the traces that this is a piece of spam:</p>
<p><img src="http://img.skitch.com/20100301-rna9ykxw8ie9qhucyjgtmtk3ed.jpg" alt=""asian countries the Swine Flu did not spread rapidly" - Google Search"/></p>
<p>We can see the same author ID, with the same comment in many blogs &#8211; 314 blogs identified as carrying that precise piece of text, presumably with the same link to the &#8220;Fish Oil FAQ&#8221;. It is definitely spam. </p>
<p>In what way are Google and Bing so stupid that they can&#8217;t detect the same piece of writing in comments, when they can tell that a site has two or three paths that lead to the same product, wrapped in a templated page? It doesn&#8217;t add up that Google and Bing would penalise a site owner for multiple paths to a product that customers buy, but don&#8217;t penalise spammy links. So, do the search engine penalise spam?</p>
<h2> Why are pages containing spam reported in search results, if the content is treated as spam?</h2>
<p>Search engines are looking at the overall quality of the site and its&#8217; pages. Some spammy comments to a blog or a discussion forum won&#8217;t kill the pages&#8217; value. If users are finding the whole page is useful, then the whole page isn&#8217;t deranked &#8211; unless the web spam teams decide that the only reason for the page is to host, or be target of, spammy links. So you can find spammy postings on pages that have weight. A few spammy links on an otherwise useful page, won&#8217;t kill the page. That&#8217;s why we can still find spammy comments &#8211; they are a part of a page that is valuable. </p>
<p>That doesn&#8217;t mean that the reverse is true &#8211; spammy comments can be found, but it doesn&#8217;t mean that the links in the spam carry any weight. If they did carry weight, then we should find at least 314 sites are offering weight to the Fish Oil FAQ. So&#8230; where&#8217;s the site in the listings?</p>
<p>Interestingly, you can&#8217;t find the site named in the spammy posting. Yup. All that spamming and link dropping has had no useful effect at all &#8211; just try the <a href="http://www.google.com/search?as_lq=fishoilfaq.com&#038;btnG=Search">search for links for fishoilfaq.com</a>. <b>Which just goes to show that the technique is pointless &#8211; it doesn&#8217;t work.</b></p>
<div class="thumbnail"><a href="http://skitch.com/jezchatfield/na5pq/link-fishoilfaq.com-google-search"><img src="http://img.skitch.com/20100302-p1tsj6is3kc71umryiqfqrc64u.preview.jpg" alt="link:fishoilfaq.com - Google Search" /></a></div>
<h2>Back To Duplicate Content</h2>
<p><b>Google and Bing are tolerant of genuine duplication within a web site</b>. The large search engines even have a mechanism to help webmasters to signal that they are aware of duplication in their sites, and have a preferred path to that resource &#8211; the canonical link ref. A signal agreed to and used by the major search engines. It&#8217;s been so successful that search engines are now respecting the <a href="http://googlewebmastercentral.blogspot.com/2009/12/handling-legitimate-cross-domain.html">canonical link reference tag across domains</a> (with some limitations).</p>
<p>But, identical postings, across a range of blogs and discussion forums, with keyword laden author names? Somehow that pretty obvious technique is supposed to defeat the search engines with wicked cleverness? It does, for a while. Then the web spam teams notice, zero weight the spam, and decreases their trust in your business. And that&#8217;s why duplicated postings in user generated content don&#8217;t work &#8211; blog spamming is an ultimately sterile exercise. If you&#8217;re going to comment, comment because you are a part of the discussion. Be interesting enough, and people will write about you and what you&#8217;ve written, in their articles &#8211; just as I&#8217;ve written about Danny Sullivan, below. </p>
<p>This decay in the value of blog spam (and other types of undeclared paid links) is why we hear the repeated refrain from businesses that: </p>
<blockquote><p>&#8220;we hired an SEO agency for link building, it made an impact at first, but since we terminated the contract there&#8217;s been no impact on our business&#8221;.
</p></blockquote>
<p>Spammy link building has a positive effect on visitor volumes in the first few months or even years. But after a while, the search engines downgrade it, and decrease trust in the sites that gave you links (because those sites host undeclared paid links) and in your site (as a business that buys undeclared paid links). The activities have less and less impact with increasing time, and it is harder for your business to make headway once the search engines suspect that you are focusing on spamming as a link building strategy. You can even find that an entire chunk of your website is not being given any credibility for inbound links. </p>
<p>Blog spam has an unpleasant impact on the blogs it is dumped on, too. Read <a href="http://daggle.com/link-spammers-killed-wifes-web-site-1446">Danny Sullivan&#8217;s article about the way that blog spam affected a nascent site</a> that may been useful to a specific online segment. The site did have some spam defences in place, but doesn&#8217;t it seem just a tad nightmareish that a site offering some long lasting value is taken out of action through activities that ultimately have no or little value. The economic equation is imbalanced. When that happens, as Danny implies, there is time and space for ethics and morality to play a part. Law? I don&#8217;t hold out a lot of hope for that &#8211; all that a US based law would do is to drive the targets offshore, or to use anonymising proxies, etc. (See our ancient article about <a href="http://blog.merjis.com/2007/09/09/anatomy-of-a-web-spam-attack/">tracking the steps in a spamming effort</a>, apparently by some Ukranians).</p>
<h2>Summary</h2>
<p>Spend less time worrying about duplication on your own site &#8211; use the <a href="http://www.mattcutts.com/blog/canonical-link-tag/">canonical link reference</a> to help yourself and the search engines. Read <a href="http://googlewebmastercentral.blogspot.com/2009/02/specify-your-canonical.html">Google&#8217;s official description about the canonical link reference</a>, and how they have coordinated with Bing and Yahoo to understand the tag.</p>
<p>Spend more time worrying about what your linking strategy is telling Google and Bing &#8211; are you telling the search engines that your business will lie and deceive? Here&#8217;s <a href="http://www.bing.com/community/blogs/webmaster/archive/2010/02/05/eggs-bacon-spam-spam-and-spam-sem-101.aspx">Bings statement about spam and what they do in response to detecting spammy links</a>. You really want those outcomes? You really want to pay people to cause work for other site owners, that has no long term benefit and may have disastrous repercussions on your own site? And when you find that the search engines no longer trust you, then you&#8217;re going to face a higher bill to remove links &#8211; there&#8217;s automated link placement, but the technologies for link removal are largely manual, and hence more expensive. </p>
<p>It&#8217;s pretty easy to establish the practices that lead to long lasting, higher ranking web sites. Start engaging with your prospects and clients, or find another way to engage with an online audience &#8211; at this point in the search engine optimisation game, they don&#8217;t have to be the audience that you sell to!</p>
<p><b><a href="http://www.google.com/buzz/jezchatfield/SCUupYi4jkY/Duplicate-Content-And-Blog-Spammers">Follow this on Buzz</a></b>.</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=322" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/02/28/duplicate-content-and-blog-spammers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Search Engine Spam Reduction Through The DMCA</title>
		<link>http://blog.merjis.com/2010/02/17/search-engine-spam-reduction-through-the-dmca/</link>
		<comments>http://blog.merjis.com/2010/02/17/search-engine-spam-reduction-through-the-dmca/#comments</comments>
		<pubDate>Wed, 17 Feb 2010 09:11:00 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[spamfighting]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=326</guid>
		<description><![CDATA[Google will act to remove web pages that rank highly, that use copyrighted material without permission. One of the more popular tactics for a beginner Search Engine Optimiser, is to copy material that already ranks well. Apart from the ethical dimensions of copying someone elses&#8217; work, there&#8217;s this: The technique of copying already well ranking [...]]]></description>
			<content:encoded><![CDATA[<p>Google will act to remove web pages that rank highly, that use copyrighted material without permission. One of the more popular tactics for a beginner Search Engine Optimiser, is to copy material that already ranks well. Apart from the ethical dimensions of copying someone elses&#8217; work, there&#8217;s this:</p>
<div class="thumbnail"><a href="http://skitch.com/jezchatfield/niu8d/search-engine-optimisation-site-submission-google-search"><img src="http://img.skitch.com/20100216-d2cjuj892thwcn2kx89sp1du62.preview.jpg" alt="search engine optimisation site submission - Google Search" /></a></div>
<p>The technique of copying already well ranking content is partly a misunderstanding about why search engines rank anything. It&#8217;s actually possible to have a page rank highly, that Google hasn&#8217;t yet crawled. Examples are difficult, as they&#8217;d rely on exposing client details, but if you consider the two following outlines, you&#8217;ll understand a little more about why breaching copyright is both dangerous for your business, and a not very effective technique:</p>
<ul>
<li><a href="http://en.wikipedia.org/wiki/Google_bomb">Google bombing</a>, or link bombing, has ranked pages for keywords that aren&#8217;t on the page at all</li>
<li>Uncrawled pages can rank, showing just the anchor text and the link</li>
</ul>
<p>The big thing to walk away from this story is that copying copyrighted material isn&#8217;t a good idea, because it doesn&#8217;t entirely give you what you think you&#8217;re getting, and may get you in trouble. Better to go write some genuine material, than steal it.</p>
<h2>Rewriting Content</h2>
<p>An alternative is to use artificial intelligence to rewrite the copy. I stumbled into a rewritten article from the Merjis blog a few days ago:</p>
<div class="thumbnail"><a href="http://skitch.com/jezchatfield/niujw/discussion-forums-and-customer-service-merjis-internet-marketing-forums"><img src="http://img.skitch.com/20100216-pkiehchjr9x2f6346hfapyucks.preview.jpg" alt="Discussion Forums and Customer Service | Merjis Internet Marketing â€¦ | Forums" /></a></div>
<p>This example is probably using a simple technique of thesaural replacement, rather than any serious linguistic tools. The technique renders the text of the original article &#8211; the title is correctly preserved as &#8220;<a href="http://blog.merjis.com/2010/02/09/discussion-forums-and-customer-service/">Discussion Forums and Customer Service</a>&#8221; &#8211; as high entropy gibberish. These site scrapers went another step, though, and had a straight copy of the article, too.</p>
<p>Very much on the black hat side of SEO, I recommend clients to stay away from this sort of activity, not just because of the legal and search engine optimisation risks, but because most companies that have or should have customers, have more useful ways to interact with their customer and prospect audiences. Finding scraped content on a prospective clients&#8217; site is a huge red flag for the account. </p>
<p>If you have a real brand, it is much, much better to focus on what your customer and your prospect needs to know, and engage them properly, than run the risk of tripping search engine filters, bans or public exposure. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=326" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/02/17/search-engine-spam-reduction-through-the-dmca/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Search Engine Optimisation &#8211; Site Submission Solicitations</title>
		<link>http://blog.merjis.com/2010/02/16/search-engine-optimisation-site-submission-solicitations/</link>
		<comments>http://blog.merjis.com/2010/02/16/search-engine-optimisation-site-submission-solicitations/#comments</comments>
		<pubDate>Tue, 16 Feb 2010 14:42:54 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[spamfighting]]></category>
		<category><![CDATA[trust]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=325</guid>
		<description><![CDATA[The incubator that we work in, has a bunch of small businesses, with varying degrees of proficiency with the web. Every so often, we run a seminar for them on building web presence. One of the questions that is likely to come up at the end, will be about search engine submission. Our neighbouring businesses [...]]]></description>
			<content:encoded><![CDATA[<p>The incubator that we work in, has a bunch of small businesses, with varying degrees of proficiency with the web. Every so often, we run a seminar for them on building web presence. One of the questions that is likely to come up at the end, will be about search engine submission. Our neighbouring businesses get emails with offers like this:</p>
<div class="thumbnail"><a href="http://skitch.com/jezchatfield/ni12i/merjis-ltd-mail-guaranteed-search-engine-inclusion-jeremyc-merjis.com"><img src="http://img.skitch.com/20100216-r5c7ga9r9q3mjhgrc1haw132q4.preview.jpg" alt="Guaranteed Search Engine Inclusion" /></a></div>
<h2>History and the Current Era</h2>
<p>A long, long time ago, I submitted sites to search engines. I&#8217;m talking about 1994-1996. By 1997, the web was becoming decently interlinked. That linking of sites into a formed web, is what allows Google and the modern generation of search engines to work. The old stuff, like AltaVista, HotBot and Lycos, which in their early days relied a lot on submitted sites, has both evolved and disappeared in the face of the modern, link following search engine.</p>
<p>How do the modern search engines decide to rank your site? Strictly, no-one outside the search engines knows all the details. But observation, testing and reading what the search engines themselves claim, shows that search engines like sites with links to them &#8211; not just any links, but links that are formed for a good reason, that a user might feel was a good reason for visiting the link. I&#8217;m not going to discuss that much more in this article, just this whole idea of search engine submission. </p>
<p>The key points are that modern search engines, like Google and Bing, follow the links around the net, and use those links to find and rank sites. If your site is so poorly linked to, that it hasn&#8217;t been found, how high in the ranking will you appear, after submitting your site? The answer is &#8220;on competitive keywords, not very far &#8211; and on uncompetitive keywords, you may do quite well, but are those valuable enough to try to get?&#8221;</p>
<p>By and large, you are better off making sure you&#8217;ve joined relevant trade associations with member directories, joined LinkedIn and mentioned your site and formed relationships with business colleagues, or even joined appropriate discussion forums and linked to relevant and useful comment on your web site &#8211; these are all more valuable uses of your time and money than submitting your site to Google or Bing.</p>
<p>You do have to watch out for who&#8217;s linking to you. Some &#8220;neighbourhoods&#8221; can reduce the trust that Google extends to you. I can&#8217;t tell you what precise directories and link farms might be involved in automated submission, but I&#8217;d tend to be very suspicious about the value of those links. By and large, anything formed from automation, will also tend to have a lot of spammy content &#8211; and you don&#8217;t want that, pointing to your site. Spammy stuff won&#8217;t add link weight and trust. </p>
<p>As a small business looking to promote yourself on the web, be very cautious about following up on the offers you get in your email, and instead of spending time on those, go find someone in a discussion forum with a problem of the type that your business solves, and help them &#8211; especially if they have a problem that you can answer in a longer article, or a YouTube video, on your blog, explaining the options and how to use what you recommend. Takes about the same amount of time, costs about the same or less, and is likely to result in the search engines both visiting, and better ranking, your site than just &#8220;submitting&#8221;. And you may have helped someone who becomes a client, or at least thinks more favourably of you. </p>
<p>There&#8217;s a guy at Google, called &#8220;Matt Cutts&#8221;, who&#8217;s really helpful, especially to learners. Read a nice <a href="http://www.sempdx.org/blog/events/my-conversation-with-matt-cutts-at-smx-advanced/">interview with Matt Cutts, with a side salad on search engine submission</a>. And another, older message about the official looking requests for money in order to <a href="http://www.mattcutts.com/blog/seo-mistakes-read-the-fine-print/">have your site submitted</a>.</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=325" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/02/16/search-engine-optimisation-site-submission-solicitations/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Buzz spam &#8211; it&#8217;s started</title>
		<link>http://blog.merjis.com/2010/02/15/buzz-spam-its-started/</link>
		<comments>http://blog.merjis.com/2010/02/15/buzz-spam-its-started/#comments</comments>
		<pubDate>Mon, 15 Feb 2010 13:51:39 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[social media]]></category>
		<category><![CDATA[spamfighting]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=324</guid>
		<description><![CDATA[Yay! You knew it would happen&#8230;]]></description>
			<content:encoded><![CDATA[<p>Yay! You knew it would happen&#8230;</p>
<div class="thumbnail"><a href="http://skitch.com/jezchatfield/nicch/google-mail-buzz-7-jezchatfield-gmail.com"><img src="http://img.skitch.com/20100215-ed4pgqn3jhtd4e39j1xkha2ab7.preview.jpg" alt="Google Mail - Buzz" /></a></div>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=324" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/02/15/buzz-spam-its-started/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SEO: Remember relevance?</title>
		<link>http://blog.merjis.com/2010/02/05/seo-remember-relevance/</link>
		<comments>http://blog.merjis.com/2010/02/05/seo-remember-relevance/#comments</comments>
		<pubDate>Fri, 05 Feb 2010 08:27:53 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[spamfighting]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=319</guid>
		<description><![CDATA[Inspired by a great piece of spam this morning. This looks like a lengthy and individually crafted posting, offering a summary of what looks like the current situation on H1N1 in the USA. The problems? Well, it is targeting a US dentist, and the spam was contributed to an article about how Google has been [...]]]></description>
			<content:encoded><![CDATA[<p>Inspired by a great piece of spam this morning. This looks like a lengthy and individually crafted posting, offering a summary of what looks like the current situation on H1N1 in the USA. The problems? Well, it is targeting a US dentist, and the spam was contributed to an article about <a href="http://blog.merjis.com/2009/11/18/google-h1n1-swine-flu-tracking-in-the-uk/">how Google has been tracking H1N1</a> and speculating about why the UK stats weren&#8217;t present. It&#8217;s just not relevant, and it&#8217;s been dropped by someone plugging a business &#8211; see <a href="http://googlewebmastercentral.blogspot.com/2009/11/hard-facts-about-comment-spam.html">Google&#8217;s take on comment spam</a>.</p>
<p>Have a look:</p>
<blockquote><p>Dentist Lake Worth<br />
http:// www. dentallakeworth. com | randomizewee@gmail.com | 122.53.68.2</p>
<p>Symptoms of swine flu are similar to most influenza infections: fever (100F or greater), cough, nasal secretions, fatigue, and headache, with fatigue being reported in most infected individuals. Some patients also get nausea, vomiting, and diarrhea. In Mexico, many of the patients are young adults, which made some investigators speculate that a strong immune response may cause some collateral tissue damage. Some patients develop severe respiratory symptoms and need respiratory support (such as a ventilator to breathe for the patient). Patients can get pneumonia (bacterial secondary infection) if the viral infection persists, and some can develop seizures. Death often occurs from secondary bacterial infection of the lungs; appropriate antibiotics need to be used in these patients. The usual mortality (death) rate for typical influenza A is about 0.1%, while the 1918 â€œSpanish fluâ€ epidemic had an estimated mortality rate ranging from 2%-20%. Swine flu in Mexico (as of April 2009) has had about 160 deaths and about 2,500 confirmed cases, which would correspond to a mortality rate of about 6%, but these initial data have been revised and the mortality rate currently in Mexico is estimated to be much lower. By June 2009, the virus had reached 74 different countries on every continent except Antarctica, and by September 2009, the virus had been reported in most countries in the world. Fortunately, the mortality rate as of October 2009 has been low but higher than for the conventional flu (average conventional flu mortality rate is about 36,000 per year; projected novel H1N1 flu mortality rate is 90,000 per year in the U.S. as determined by the presidentâ€™s advisory committee).</p>
<p>From Google h1n1 (Swine Flu) tracking in the UK, 2010/02/05 at 6:49 AM
</p></blockquote>
<p>So, what makes this spam, rather than a valued contribution?</p>
<ul>
<li>It is not from a person, but from a targeted keyword</li>
<li>The supplied URL is appropriate to the keyword</li>
<li>The topic is appropriate to an element of the article, but not the meaning</li>
<li>The author is in the Philippines &#8211; not itself a definite marker, but contributes to the likelihood &#8211; but is wildly unlikely to be the targeted dentist</li>
<li>The submitter has identified the article based on search results, not on the content &#8211; that is, a previous search lead to finding the article, but the article was not read and understood and an appropriate response composed. Instead a well crafted and rare piece of text was composed and submitted &#8211; but was irrelevant to the thread</li>
<li>It has nothing to do with the relatively rare topic of dentistry; barely discussed online. Whereas Swine Flu gets a lot of high interest coverage&#8230; if you were targeting spam, you&#8217;d want high ranking, frequently searched topics.</li>
</ul>
<p>Google&#8217;s search results show the signs of this specific craftwork in many blogs, where the indications of spamminess have been overlooked. Some of the comments left by this spammer are even on-topic and thread-relevant. </p>
<p>You could consider this the dark side of creating frictionless interfaces. OTOH, amongst the more sophisticated SEO spammers, there&#8217;s software that signs up for places like PC World (creates fake email address registrants, completes the forms with fictitious data, submits the forms, reaps the emails and authenticates, and then adds spammy comments). Creating friction to exclude spammers can decrease interaction with real users &#8211; there is a balance of evil to strike; how much near spam will you let through, in order to make sure that you can engage with the right audience?</p>
<p>This is a moderately clever piece of spam &#8211; it evaded Akismet, for example. If I&#8217;d been writing about the US situation, then I might even have let the spam through, as a useful contribution to the thread. </p>
<p>Is it entirely evil, like some of the spam submitted here? No. But this is almost good enough to pass as a valid commentary in a social forum. If you spam, then that&#8217;s the kind of area you need to be in, to make it through to trusted forums. Serious, unique content that contributes to the dialogue. Good effort, but only an &#8216;F&#8217; &#8211; you were busted on relevance!</p>
<p>This article is part of series of short meditations on social marketing and customer interaction, following on from yesterdays &#8220;<a href="http://blog.merjis.com/2010/02/04/google-foot-gun-shoot/">Google. Foot. Gun. Shoot.</a>&#8221; and the previous article about soft paywalls &#8220;<a href="http://blog.merjis.com/2010/02/03/pc-world-online-subscriptions/">PC World &#8211; Online Subscriptions</a>&#8221;</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=319" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/02/05/seo-remember-relevance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

