<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Merjis Internet Marketing Blog &#187; SEO</title>
	<atom:link href="http://blog.merjis.com/category/seo/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.merjis.com</link>
	<description>Effective Internet Marketing Strategy and Tactics Through Test</description>
	<lastBuildDate>Thu, 12 Jan 2012 09:18:42 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Google Hates Undeclared Paid Backlinks</title>
		<link>http://blog.merjis.com/2011/10/01/google-hates-undeclared-backlinks/</link>
		<comments>http://blog.merjis.com/2011/10/01/google-hates-undeclared-backlinks/#comments</comments>
		<pubDate>Sat, 01 Oct 2011 14:58:46 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spamfighting]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=689</guid>
		<description><![CDATA[Tuesday, a client sends me an email asking if they should take advantage of an offer to buy a page on a directory on which they can create links. It&#8217;s accompanied by the email soliciting business, which includes a story about how a couple of businesses benefited, and the prices for pages at different tiers [...]]]></description>
			<content:encoded><![CDATA[<p>Tuesday, a client sends me an email asking if they should take advantage of an offer to buy a page on a directory on which they can create links. It&#8217;s accompanied by the email soliciting business, which includes a story about how a couple of businesses benefited, and the prices for pages at different tiers in the directory. Sounds reasonable, doesn&#8217;t it? Here&#8217;s a screen shot of the results page for the search &#8220;the best links&#8221;, which is the name of the domain that was selling links:</p>
<p><img src="http://blog.merjis.com/wp-content/uploads/2011/10/the-best-links-Google-Search.png" alt="Search for &quot;the best links&quot; on Tuesday" title="the best links - Google Search Tuesday" width="569" height="887" class="aligncenter size-full wp-image-690" /></p>
<p>But what was the number one listing on Tuesday and Wednesday has vanished by Friday:</p>
<p><img src="http://blog.merjis.com/wp-content/uploads/2011/10/the-best-links-Google-Search-2.png" alt="The best links have changed, markedly." title="the best links - Google Search-2" width="566" height="552" class="aligncenter size-full wp-image-691" /></p>
<p>What&#8217;s really remarkable is how thoroughly this site has vanished:</p>
<p><img src="http://blog.merjis.com/wp-content/uploads/2011/10/site_www.thebestlinks.com-Google-Search.png" alt="Site listing for the paid directory service" title="site:www.thebestlinks.com - Google Search" width="551" height="394" class="aligncenter size-full wp-image-692" /></p>
<p>It&#8217;s usual for a site to have the home page as the first page in the sitelinks list. For the home page to vanish&#8230; well, looks like Google hates this site. And I wouldn&#8217;t bet on any rank being passed.</p>
<p>Unsurprisingly, none of the Google penalty checking websites that I could find suggest that there&#8217;s a problem with this site. After all, it has results in the listings. Just very low ranked and none of the top pages (like the home page) show up. Sure sign of a problem of some sort!</p>
<p>So, what happened? Well, the site was probably reported to Google, possibly even before I saw the solicitation. And Google acted, quite quickly. </p>
<p>If you have a Google Webmaster Tools account, you can use <a href="https://www.google.com/webmasters/tools/paidlinks">Google&#8217;s Undeclared Paid Backlinks report form</a> to report sites that are using and offering undeclared paid backlinks. I don&#8217;t know what you do if you aren&#8217;t a registered webmaster &#8211; perhaps find one and ask them?</p>
<p>So, what happened to the companies mentioned in the story? I can&#8217;t find them, now, either. Certainly not on page one of results (I&#8217;ve used the non-personal search on our tool to <a href="http://merjis.com/local_google_search/" title="Google International Search">view Google organic and paid search as if in another country</a>)</p>
<p>Google really isn&#8217;t happy with undeclared paid links. At all.</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=689" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2011/10/01/google-hates-undeclared-backlinks/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google Goes Social With +1 for Websites?</title>
		<link>http://blog.merjis.com/2011/06/03/google-social-button/</link>
		<comments>http://blog.merjis.com/2011/06/03/google-social-button/#comments</comments>
		<pubDate>Fri, 03 Jun 2011 09:41:33 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[social media]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=586</guid>
		<description><![CDATA[Google&#8217;s new &#8220;plus one&#8221; button for web sites is an interesting development and potentially a game changer &#8211; if done well. I&#8217;ve seen a few commentators calling this an attempt by Google to go the Social Media route. And it is, at first blush, out of the Social Media canon. Facebook and Twitter both have [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://googleblog.blogspot.com/2011/06/1-button-for-websites-recommend-content.html">Google&#8217;s new &#8220;plus one&#8221; button for web sites</a> is an interesting development and potentially a game changer &#8211; if done well. I&#8217;ve seen a few commentators calling this an attempt by Google to go the Social Media route. And it is, at first blush, out of the Social Media canon. Facebook and Twitter both have their ways to generate page-specific responses, for example. Google has previously tried user-specific engagement with tools like <a href="http://www.google.com/sidewiki/">SideWiki</a>, the <a href="http://googleblog.blogspot.com/2008/11/searchwiki-make-search-your-own.html">Search Wiki</a> and <a href="http://googleblog.blogspot.com/2010/03/stars-make-search-more-personal.html">Stars</a>. Google uses <a href="http://www.google.com/support/toolbar/">Google Toolbars</a> to understand things about user interaction with actual web pages, as well as what the robots tell it. But that&#8217;s as far as it has got &#8211; the SideWiki and in-search commentary. <em>Until this week</em>.</p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2011/06/Official-Google-Blog_-The-+1-button-for-websites_-recommend-content-across-the-web.png"><img src="http://blog.merjis.com/wp-content/uploads/2011/06/Official-Google-Blog_-The-+1-button-for-websites_-recommend-content-across-the-web.png" alt="" title="The +1 button for websites_ recommend content across the web" width="78" height="50" class="alignleft size-full wp-image-593" /></a></p>
<h2>What is the Plus One Button for websites?</h2>
<p>It&#8217;s a small piece of Javascript to insert on web pages that are candidates for Google Search Results. On each page, you need to refer to the JavaScript file, and then invoke the script.</p>
<blockquote><p>&lt;script type=&#8221;text/javascript&#8221; src=&#8221;http://apis.google.com/js/plusone.js&#8221;&gt;&lt;/script&gt;</p>
<p>&lt;g:plusone&gt;&lt;/g:plusone&gt;
</p></blockquote>
<p>As you can see from the brevity of this code, it has no idea *who* has dropped the link, other than someone with authority to modify the site. This is in strong contrast to Facebook, where the ID is an integral part of adding &#8220;Like&#8221; buttons. The implication is that Google doesn&#8217;t care who drops the code &#8211; they care who clicks on the button it displays, and probably, they care more about which page was clicked. </p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2011/06/Screen-shot-2011-06-06-at-07.26.43.png.png"><img src="http://blog.merjis.com/wp-content/uploads/2011/06/Screen-shot-2011-06-06-at-07.26.43.png.png" alt="" title="Google Search Results showing +1 rather than Star" width="524" height="111" class="aligncenter size-full wp-image-620" /></a></p>
<p>Click on the icon that the code displays, and your ID is added to the list of users that are interested in this page &#8211; and the icon changes from white to blue, to indicate that you have clicked it. If you &#8220;Plus One&#8217;d&#8221; a page, then it gets marked in the search results, as shown above. Note that the tooltip is not changed &#8211; when you +1 a result, the tooltip still shows that you should click to recommend the page, but that will actually remove the recommendation. Probably a transient oversight at Google, rather than intentional. </p>
<p>Here&#8217;s this very article, towards the bottom of Google search results for &#8220;Google plus one for websites&#8221;, before anybody I know had &#8220;plus one&#8217;d&#8221; the page:</p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2011/06/google-plus-one-for-websites-Google-Search.png"><img src="http://blog.merjis.com/wp-content/uploads/2011/06/google-plus-one-for-websites-Google-Search.png" alt="Showing this article in search results, with a &quot;plus one&quot; annotation below the URI" title="google plus one for websites - Google Search" width="559" height="331" class="aligncenter size-full wp-image-607" /></a></p>
<p>You can see that I&#8217;ve +1&#8242;d the page. It shows up under the listing, below the URI, and it knows that I&#8217;ve voted &#8211; but gives no clue how many *others* have clicked. At the point that this was taken, there were a handful of +1s for the article that were at least an hour old. </p>
<p>It appears to take at least fifteen minutes for the &#8220;plus one&#8221; information to be updated in search results. I have to admit that I became bored checking &#8211; so somewhere after 30 minutes and before four hours, the search results had added a friend, who I&#8217;d ask to make a timestamp-noted entry. This is *much* slower than the update to the count of &#8220;plus ones&#8221; shown on the page &#8211; that count changed in seconds. I&#8217;ll guess that the delay before turning up in search results is to add the social graph data of who I know that has clicked, so my search results pages show data about people I know. I suspect that this process takes some time&#8230; so if you&#8217;ve just put up a post and emailed your mates to +1 it, don&#8217;t be surprised if you see on-page activity a long time (internet time) before the Search Results update to reflect that. </p>
<p>Amusingly, while I was checking to see who else that I know had used a plus one on an article about a plus one (interesting behavioural change &#8211; I went looking *much* further into results &#8211; will others look deeper to see if their friends have marked stuff, and will that be a transient behaviour until the SERPs behaviour becomes clearer?)&#8230; I found that the preview had updated to include the latest summary within the article and the clip showing the position. Here, recursively, is this article showing a clip of Google showing a preview of this article showing a clip of Google&#8217;s search results showing this article&#8230;</p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2011/06/google-plus-one-for-websites-Google-Search-1.png"><img src="http://blog.merjis.com/wp-content/uploads/2011/06/google-plus-one-for-websites-Google-Search-1.png" alt="Clip showing preview before updating social information." title="google plus one for websites - Google Search-1" width="417" height="136" class="aligncenter size-full wp-image-614" /></a></p>
<p>The implication is that Google snatches previews at the time of interest and marks them up, as a much faster process than the social data is updated. Now&#8230; Why is the social data apparently slowed so much, relative to the preview? I don&#8217;t know, and I&#8217;ll be thinking about that for a while. </p>
<h2>What Failed Before, and Will The Plus One Also Fail?</h2>
<p>Failures aren&#8217;t the worst thing that can happen. Failures tell you when you aren&#8217;t doing the right thing. So long as you don&#8217;t do something that kills the business, you can learn from failures. A <em>failure to learn</em> from the lessons you should be learning from a failure, now that is a real failing.</p>
<p>What has Google tried before, and why didn&#8217;t they work? And are there lessons from those attempts about what would make Google&#8217;s &#8220;Plus One&#8221; work, or be yet another fail marker?</p>
<h3>Google&#8217;s SideWiki</h3>
<p>Perhaps the most comprehensive attempt to solicit users&#8217; direct feedback on a page or a business, is the Google SideWiki. This is barely known about, even in search engine marketing circles, much less by the wider public. It lets any user add an annotation to a web page. If the web master knows about the SideWiki, they can respond&#8230; But I can&#8217;t recall ever talking to a webmaster who knew about SideWiki before I mentioned it, much less ordinary web users. </p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2011/06/Screen-shot-2011-06-06-at-07.30.38.png.png"><img src="http://blog.merjis.com/wp-content/uploads/2011/06/Screen-shot-2011-06-06-at-07.30.38.png.png" alt="Screenshot showing popup for Google Sidewiki on Chrome Browser" title="Google Chrome showing +1/Bookmark Star and SideWiki" width="600" height="200" class="aligncenter size-full wp-image-619" /></a></p>
<p>The problems for SideWiki are quite large. There&#8217;s no on-page presence (if you have the toolbar or Chrome, there&#8217;s a minor marker in the decoration frame for a web page, but nothing in the area that users focus on). The SideWiki is intended to be present even for sites that don&#8217;t want to participate &#8211; which is why it has such a peripheral vision presence. It&#8217;s a parallel web space so only the search geeks or the deeply aggrieved find their way on to it. Parallel page specific annotations are a rich source of information for Google, with a user indication of liking or not liking the content, and it can provide a parallel stream of keywords and some clues as to the importance of a site or page (e.g. the Apple home page has a lot more SideWiki comments than the Twitter home page &#8211; working out why that is meaningful is left as an exercise for the reader! Try using the SideWiki for this page to discuss this? Your attempt to use it may partially explain why it has been so little used and known.)</p>
<p>Considering <a href="http://www.google.com/about/corporate/company/tenthings.html">Google&#8217;s first rule in their Ten Things</a> (&#8220;Focus on the User and all else will follow&#8221;), I&#8217;d have to call SideWiki a brave, but doomed, effort. Collecting additional data on page value is a great idea. But doing it in a way that users aren&#8217;t aware of, and can&#8217;t possibly comprehend, without a marketing information campaign to support it, and proper tools to engage with it for site owners and regular users &#8211; well, it&#8217;s destined to be roadkill. SideWiki is, I suspect, a victim of Google&#8217;s Engineering Lead business philosophy &#8211; a fallacy supported by the success of their first effort in this space, Google&#8217;s technologically and philosophically profound switch in ranking the results in search engines. </p>
<p>The &#8220;Engineering Lead Marketing Fallacy&#8221; is that if you make a great product, people will find and use it &#8211; but the reality is that if you have something good, you need to educate, inform and persuade users that it is good; this may be a rich vein for another article on another day, though; just think about Sun, until I write that article. I&#8217;m deeply familiar with this fallacy, having been a techie myself, before my Damascene conversion. </p>
<h3>Google Stars</h3>
<p>In Google Search Results, you can &#8220;star&#8221; a listing. This means that it is a page that you like. Starred listings are preferentially popped up to the top of your results pages. Like SideWiki, that is great feedback for Google on pages that work well for users. Pages that are well liked should get lots of stars &#8211; an easy way for Google to automatically find the hot stuff and the useful stuff. They are directly related to Google&#8217;s Bookmarks service &#8211; and social bookmarking is one of the unsung other keys of Social Media (it used to be sung, back when delicious was new).</p>
<p>Why doesn&#8217;t &#8220;Google Stars&#8221; work? When I talk to users they haven&#8217;t noticed the little faint outline of a star, or couldn&#8217;t be bothered to work out what it might mean. This star is an unusual icon, faint, and unexplained. Only the most curious users work out what it is for. And Google does their best to make sure that users *won&#8217;t* engage directly with the Google Star. That may be a surprising statement, but you have to consider the user experience (UX) when considering why the Star doesn&#8217;t and can&#8217;t work very well to deliver user feedback. </p>
<p>What&#8217;s Google&#8217;s proud claim and ambition (number 3 in the ten things)? That they deliver the best page of search results so <em>you spend as little time as possible on Google</em>. This does not encourage users to spend time investigating the details of search results. Any system that Google develops that requires time to parse the page, infer meaning and interact with the results, is counter to that (so far) successful formula of reducing the time spent looking at Google&#8217;s own search results rather than looking at the content that you wanted to find. Because of this emphasis on shifting people off the search results page as quickly as possible, Google Stars will be a minor &#8220;also ran&#8221; in terms of soliciting feedback, but probably slightly more effective than the almost invisible SideWiki.</p>
<p>As a subset of this mechanism, is Google&#8217;s &#8220;spam marking&#8221; annotation. Same problem. I find out that the page is spammy when I&#8217;ve been to it. When I use search (rather than being an SEO and staring at search results for ages to inder why Google does what it does), my attention is not on cleaning up after Google&#8217;s spam cleaning efforts, but on getting to the page that helps me do whatever it is I&#8217;m doing. So the user focus is not, at the time of looking at the page, focused on the question &#8220;is this spammy?&#8221;; anything as onerous as a check box, is too onerous to work with and is inappropriate within the normal context of a user interaction with search. </p>
<p>Effectively using the Star or Spam Marking within Google Search Results means having to return to Google, perhaps rekeying the search, and the user doing so while intending to mark a result as being annoying. Too much effort for most people. You&#8217;d have to *really* hate a web page to take that much effort. Or remember that if you are using Chrome, you can use the faint star at the right hand edge of the input box (URL/search field) to Bookmark/+1/Star the page you are on. </p>
<h3>Google&#8217;s Search Wiki</h3>
<p><a href="http://en.wikipedia.org/wiki/Wiki">Wikis</a> are great, for certain classes of problem at least. The previous version of our company main site was written using a search engine optimised wiki we developed in 1994 and ran until a catastrophic incident in late 2010, but that&#8217;s a story for another time&#8230;. The point was that a Wiki collects lots of stuff, organise it and let users inside and outside the company work on it.</p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2011/06/SearchWiki-_-Features-Web-Search-Help.png"><img src="http://blog.merjis.com/wp-content/uploads/2011/06/SearchWiki-_-Features-Web-Search-Help.png" alt="" title="SearchWiki _ Features - Web Search Help" width="600" height="127" class="aligncenter size-full wp-image-591" /></a></p>
<p>Google used to use a Wiki in search results.  Google&#8217;s SearchWiki was effectively another parallel web &#8211; another Google system that had potentially one web page for every web page out there. This was before the success of Facebook, when MySpace was still a power online. SearchWiki looks like &#8220;Social&#8221;, smells a bit like &#8220;Social&#8221; &#8211; and it failed. There&#8217;s no sign of the user generated content attached to search listings any more. I think this experiment fell foul of the same problem that afflicts Google Stars &#8211; but even more so. The problem being that users on a search page don&#8217;t want to interact with the search page, they want to go to a useful web page. The Wiki compounded that problem by making the interaction into an essay. Users like simple UI choices &#8211; &#8220;do I like this page&#8221;, &#8220;do I like this company&#8221;, &#8220;do I never want to see this again&#8221;? Only a tiny fraction of users like writing essays. More prefer 140 character limits (c.f. Twitter) and more yet prefer just a &#8220;Like&#8221; button. </p>
<h2>Google Plus One</h2>
<p>There have been a few other attempts by Google to get users to interact about sites and web pages. They&#8217;ve all, to my knowledge, been variations of the above themes &#8211; hidden content in a parallel web or user engagement in search results pages. Note that user engagement does work when well done &#8211; Google&#8217;s own search results are a partially a consequence of user engagement with the content (title, snippet, URL and the actual site all play a part, as does the user interaction with all of those). The failure is not in the effort to engage users, but in the absence of the first rule (&#8220;Focus On The User&#8221;) in the interaction models &#8211; I believe these exercises have been focused on improving search results for all users, at the expense of focus on the individual user as contributor. Clicking a &#8220;Like&#8221; button, or a &#8220;Plus One&#8221; button, is a contribution &#8211; it&#8217;s just non-textual.</p>
<p>Google&#8217;s Plus One Button is different from all these previous efforts though. It&#8217;s on the web site, not the search results. It&#8217;s where users are focused when they are doing something. I think it is the first thing that Google has done, probably since they invented the whole idea of Page Rank, likely to actively engage with user opinion. This is a markedly different activity from any effort so far. But will it work?</p>
<p>So long as the interaction model is kept to a minimum (&#8220;I like this and would want to see it again&#8221; as a checkbox) and on the page that the user is considering, and if it turns up in search results (&#8220;you liked this page when you saw it&#8221;), it&#8217;s reasonably good. </p>
<p>The idea that I&#8217;ll know that people I know have marked a page &#8211; that&#8217;s good, too, and that puts it closer to the Social Media space. Knowing that someone I respect likes a page, helps me to decide that I want to visit that page too. This may have a profound effect on SEO &#8211; and given that user preferences are having increasing importance in ranking, who&#8217;d care to bet that this <em>won&#8217;t</em> also become yet another Google AdWords Extension &#8211; allowing you to advertise more visibly when you have a preferred page in the listings?</p>
<h2>What will stop &#8220;Plus One&#8221; from working? </h2>
<p>Several things can stop &#8220;Plus One&#8221; from working:</p>
<ul>
<li>A good distraction from FaceBook and LinkedIn or GroupOn. Distract people from paying attention to &#8220;Plus One&#8221; when it is new, so that webmasters focus on some other issue to deal with.</li>
<li>Google&#8217;s Engineering Fallacy &#8211; just because this is probably the best interaction model so far devised for improving search engine results, doesn&#8217;t mean that it will win. It needs marketing and communication efforts. It needs Matt Cutts to write about it and encourage webmasters to embed it. It needs the <a href="http://www.google.com/webmasters/+1/button/index.html">Webmaster Tools guys to make space for it</a> and report the results in WMT. It needs to actively let webmasters know when their sites are loved (and relatively unloved &#8211; like Google Analytics Benchmarking data). And most of all it needs to be social &#8211; I *care* when <a href="http://www.seobythesea.com/">Bill Slawski</a>, or <a href="http://www.seroundtable.com/archives/000050.html">RustyBrick (Barry Schwartz)</a>, or <a href="http://www.clinkswebservice.com/">Kim Clinks</a> like a web page. I don&#8217;t care when it&#8217;s some random Joe (unless there&#8217;s several thousand real random Joe&#8217;s, because that has its own message about a page).</li>
<li>It needs to be non-spammy. Google has mechanisms to understand whether real users are interacting; the votes should probably use the same kinds of invalid click detection systems used in AdWords and Search Results. However, reporting to those to webmasters may need the same kind of obfuscation that conceals exactly which clicks were invalid, in AdWords, to continue the security-through-obscurity approach used in click fraud detection mechanisms.</li>
</ul>
<p>At the moment, I don&#8217;t see how how I respond to a friend Plus-one-ing a page. So it is limited in terms of social media. I can like, or not. I can&#8217;t interact. Unlike Facebook, where I can comment on something that someone has found. That may yet prove to be the greatest weakness. Being able to see my friends liking something &#8211; that&#8217;s good. Being unable to respond and correspond, a core weakness.</p>
<p>And the name&#8230; I can easily grasp &#8220;liking&#8221; a page. But &#8220;Plus One-ing&#8221; it? Smells geeky, even nerdy. Hard to say. Hard to write. Hard to put into a grammatically correct page unless treated as an adjectival-noun thingie. </p>
<h2>Is there any sign that Google Gets Marketing Communications?</h2>
<p>A bit. But the most prominent usage of the &#8220;Plus 1&#8243; graphic on the announcement page is just a link to the giant graphic, not a way to plus one the plus one announcement&#8230; That&#8217;s hidden with the other social markers below the article &#8211; and not even first in the list. Placement itself means something. Come on, Google guys &#8211; you rank stuff by importance. If you put &#8220;plus 1&#8243; towards the end of the list of social markers, how important is it? Learn, for goodness&#8217; sake, from your experience.</p>
<p>I&#8217;ll probably update this section as I see signs of life-after-engineering. When I see some type of audience communication rather than simply an announcement.</p>
<h2>Summary</h2>
<p>Good effort, but not enough, yet, to be completely convincing. </p>
<p>Possibly worth getting your braver clients to sign up and see what happens. </p>
<p>More important for Google&#8217;s de-spamming of search results than for social interaction.</p>
<p>May fail, because it is only partially social &#8211; no response mechanism to friends. </p>
<p>Where&#8217;s the social list drawn from? Ah. Now that&#8217;s an interesting question&#8230; for later.</p>
<p>Potentially disruptive &#8211; but not clearly so.</p>
<p>Could be emulated, perhaps better, by Bing using Facebook&#8217;s social graph and their Like button data. How? Well, if Bing did this, then made a &#8220;plus 1&#8243; boost results for a while, &#8220;until the algorithm settles down&#8221;, then SEOs would want to get a Bing button everywhere and people to click on them &#8211; and that would probably help clean up Bing results, increase Bing usage, and is another reminder on web pages of Bing&#8217;s presence in some markets (still just an also-ran, here in the UK). So, disruptive but not exclusive&#8230; and open to a competitor taking it higher, further, faster.</p>
<p>So &#8211; what&#8217;s it like? See this&#8230;</p>
<p><script type="text/javascript" src="http://apis.google.com/js/plusone.js"></script></p>
<p><g:plusone></g:plusone></p>
<p>&#8230; and do, please, click it. :)</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=586" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2011/06/03/google-social-button/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SEO: Tell Me About Keyword Density Analysis, Again?</title>
		<link>http://blog.merjis.com/2010/09/27/seo-keyword-density-again/</link>
		<comments>http://blog.merjis.com/2010/09/27/seo-keyword-density-again/#comments</comments>
		<pubDate>Mon, 27 Sep 2010 12:35:15 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[keyword density]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=497</guid>
		<description><![CDATA[I&#8217;ve been recruiting some SEOs recently. One of my differentiator questions for the phone interview, has been to ask candidates if they know what &#8220;Keyword Density&#8221; is, and whether it is important to search engines. The most common answer that people give to the question on keyword density, is that it used to be important, [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been recruiting some SEOs recently. One of my differentiator questions for the phone interview, has been to ask candidates if they know what &#8220;Keyword Density&#8221; is, and whether it is important to search engines. </p>
<p>The most common answer that people give to the question on keyword density, is that it used to be important, but that the main value of keyword density analysis nowadays is to avoid keyword stuffing or keyword spamming. Pretty much everyone says that it is important to mention the keyword at least once, preferably with some other related instances, and ideally use semantic markup (or titles, descriptions and headers) to emphasise the keyword.</p>
<p>Not one person has said that you can rank for a keyword without using it on the page &#8211; a so-called <a href="http://en.wikipedia.org/wiki/Google_bomb">&#8220;Google Bombing&#8221; or &#8220;Link Bombing&#8221;</a> exercise. In a link bombing exercise, you use anchor text to target a page that doesn&#8217;t mention the keyword you are focusing on. It was a nice demonstration of the power of backlink anchor text, that you could rank a page against its&#8217; intentions. George Bush was targeted for &#8220;miserable failure&#8221;, for example.</p>
<p>Interestingly, after the attacks on George Bush&#8217;s reputation &#8211; but possibly not directly triggered by that, Google eventually implemented a fix to better synchronise page content and backlink anchor text. This fix tool appears to be something that is periodically run, so unexpected results will creep in from time, for up to a few months at a time. The fix that Google seems to have used, is that the targeted keyword should appear in whole or in part, on the targeted page. So George Bush&#8217;s CV at the Whitehouse appeared again for the search &#8220;miserable failure&#8221;, when a Whitehouse staffer referred to some Senate discussions as a &#8220;failure&#8221;. Close enough for Google to decide there was a match&#8230; until the tool was run again. </p>
<p>And it looks like it is getting time to run the tool again. Unless there&#8217;s something else going on, that is&#8230;. </p>
<ul>
<li>Date: 2007/07/16</li>
<li>Article: Google AdWords, Click Fraud and gclid</li>
</ul>
<p>This article about Google, AdWords, Click Fraud and gclid usually shows up fairly high in searches for &#8220;<a href="http://blog.merjis.com/2007/07/16/click-fraud-google-adwords-and-gclid/">gclid</a>&#8220;. Every so often, I check that search, not just to see whether the article is still ranking, but to see if anyone else has written anything interesting about that tracking parameter recently. What did I see yesterday, when I looked?  I was, FWIW, thinking about call tracking systems &#8211; not just vanity, honestly!</p>
<div id="attachment_496" class="wp-caption aligncenter" style="width: 600px"><a href="http://blog.merjis.com/wp-content/uploads/2010/09/gclid-Google-Search.png"><img src="http://blog.merjis.com/wp-content/uploads/2010/09/gclid-Google-Search.png" alt="search engine results for gclid showing a page about computer memory" title="gclid - Google Search" width="590" height="953" class="size-full wp-image-496" /></a><p class="wp-caption-text">first page results, UK, for gclid - not mentioned in two of these pages at all</p></div>
<p>OK, so how many times is &#8220;gclid&#8221; mentioned on that Crucial Memory page? Never. What about the Tax Credits page? Nope. Not there, either. </p>
<p>So, why were these pages regarded as important for users to know about, when they search for gclid? </p>
<p>Interesting puzzle, isn&#8217;t it? If you can solve it, let me know&#8230; I just might have a job for you!</p>
<p>Oh, and see if you can work out what the connection is between SEO By The Sea&#8217;s recent <a href="http://www.seobythesea.com/?p=4350">blog article about dates in snippets</a>, and this article. It&#8217;s a little test that I&#8217;m doing &#8211; but of what?</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=497" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/09/27/seo-keyword-density-again/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Google Sidewiki Made More Obvious &#8211; But Still a Tech Not A Benefit</title>
		<link>http://blog.merjis.com/2010/09/05/google-sidewiki-obvious-tech-benefit/</link>
		<comments>http://blog.merjis.com/2010/09/05/google-sidewiki-obvious-tech-benefit/#comments</comments>
		<pubDate>Sun, 05 Sep 2010 15:51:23 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[spamfighting]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=477</guid>
		<description><![CDATA[The Sidewiki, introduced almost a year ago, looked like an interesting tool and a way to allow users to interact with a site without on-site commentary enabled. Accessed via an unobtrusive button in the Chrome browser or via a Google Toolbar, it was all but completely invisible to most users, and rarely used. This new [...]]]></description>
			<content:encoded><![CDATA[<p>The Sidewiki, introduced almost a year ago, looked like an interesting tool and a way to allow users to interact with a site without on-site commentary enabled. Accessed via an unobtrusive button in the Chrome browser or via a Google Toolbar, it was all but completely invisible to most users, and rarely used. This <a href="http://googlecode.blogspot.com/2010/09/new-sidewiki-sidebar-web-element.html">new announcement</a> makes the annotation tool more useful, by allowing site owners to render the content. </p>
<p>I have my doubts that the Sidewiki sidebar will be rendered on anything other than informational sites with UGC &#8211; which, by and large, will already have a mechanism for users to contribute and so will have no need for a Sidewiki. The fear amongst business site owners will be that users with a grudge will control the conversation. There&#8217;s plenty of evidence that suggests that users with a good experience of a business are much less likely to tell others, than users with a bad experience &#8211; in a ratio of about in in 12 (users with a bad experience are much more likely to tell others). So even if your business manages to annoy only a small fraction of users, they&#8217;ll be a lot more visible than the small fraction who are ecstatic about your business.</p>
<p>Unless the Sidewiki enables a clearly valuable business interaction &#8211; one that doesn&#8217;t leave site owners worrying that Google is controlling the agenda, and one that doesn&#8217;t overemphasise the negative commentary &#8211; then Sidewiki will remain a sideline. I think the idea of UGC paralleling the site content is probably the right sort of thing to be thinking about. I suspect that the current implementation and interaction model aren&#8217;t enough to satisfy business owners to make the Sidewiki an integral part of the site&#8217;s experience. </p>
<p>I&#8217;d fear it being added to, for example, the NHS web site. The amount of quackery and bogus advice that could be offered would make me nervous about the value. </p>
<p>So, for now, Sidewiki continues to be a technology in search of a solution. It appears to come from a philosophy of user interaction that isn&#8217;t matched by the real world, that users will contribute overwhelmingly. That may be true for sites engineered to make that happen (Hub Pages, Wikipedia, etc). But sources of tested information (NHS, NIH) and of commercial information (product details, sales, etc) can suffer in the face of unqualified, biased or defamatory comment. That doesn&#8217;t mean that Sidewiki is always useless, but does mean that the philosophy needs to be exposed as having a benefit to the site owner, many of whom will see nothing but risk when they look at the tool. Another case of Google mistaking technology for product &#8211; as with Wave.</p>
<p>And the blog announcement itself doesn&#8217;t do a lot to encourage confidence. What&#8217;s the first comment? A series of spammy links. How long would it be before some twit starts dropping spammy Sidewiki comments on sites, in order to benefit from their PageRank? Given the rest of the mythology surrounding SEO, it won&#8217;t be long after Sidewiki comments are seen, that there&#8217;s be someone selling Sidewiki spamming as a service, off the back of user ignorance that the Sidewiki domain is not that of the website&#8230;<br />
<a href="http://blog.merjis.com/wp-content/uploads/2010/09/Google-Code-Blog_-New-Sidewiki-“Sidebar”-web-element1.png"><img src="http://blog.merjis.com/wp-content/uploads/2010/09/Google-Code-Blog_-New-Sidewiki-“Sidebar”-web-element1.png" alt="Google Code Blog, Now With Blog Spam" title="Google Code Blog_ New Sidewiki “Sidebar” web element" width="561" height="810" class="aligncenter size-full wp-image-480" /></a></p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=477" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/09/05/google-sidewiki-obvious-tech-benefit/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SEO Ranking Insights From Troubleshooting Web Site Problems</title>
		<link>http://blog.merjis.com/2010/08/11/seo-ranking-insights-from-troubleshooting-web-site-problems/</link>
		<comments>http://blog.merjis.com/2010/08/11/seo-ranking-insights-from-troubleshooting-web-site-problems/#comments</comments>
		<pubDate>Wed, 11 Aug 2010 08:09:19 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=461</guid>
		<description><![CDATA[I look forward to phone calls from companies with an organic ranking problem. Sometimes these companies have a strange set of conditions that would be impossible to ask a client to reproduce, because it would damage traffic to their site. However, fixing existing problems can yield insights into how fast Google does things, and what [...]]]></description>
			<content:encoded><![CDATA[<p>I look forward to phone calls from companies with an organic ranking problem. Sometimes these companies have a strange set of conditions that would be impossible to ask a client to reproduce, because it would damage traffic to their site. However, fixing existing problems can yield insights into how fast Google does things, and what things Google finds important &#8211; which lets you then know how long &#8220;standard&#8221; changes to a clients&#8217; site will take, and improves best practice experience. </p>
<p>The standard problems are somewhat less illuminating &#8211; banned sites form the majority of these, and the problems are usually associated with link spamming. Our experience of banned sites is usually a call from the new buyer of an established domain, who suddenly finds his site has been banned, as the previous owner boosted the site&#8217;s rank prior to sale with mass purchases of undeclared paid backlinks. Dealing with this is usually a matter of removing undeclared backlinks where possible and reworking the site&#8217;s paths to avoid URLs referenced by the spam, while designing and developing white hat link building strategies to avoid future problems. That resolution can be more complicated if the site has a mix of high quality backlinks and spammy backlinks to the same page&#8230;</p>
<p>The major class of problems that delivers serious insights are the things that web server developers do, and that CMS designers do. Microsoft&#8217;s IIS has been an especially rich source of insight. Previous articles on this blog have looked at <a href="http://blog.merjis.com/2008/08/20/seo-iis-case-folding-filenames-spiders-analytics-and-robotstxt/">ranking problems with case folding (case insensitive path names)</a> and with <a href="http://blog.merjis.com/2008/08/18/iis-cookieless-generates-spider-crawling-problems/">IIS&#8217;s cookieless mode user tracking</a>.  In both these cases, resolving the problems correctly showed Google&#8217;s characteristic times to correct the index after crawling. </p>
<p>More recently, we&#8217;ve been getting stuck into some problems with the ways that Content Management Systems handle paths and page contents. Again, much like the IIS technical exercises, these page-template and CMS-specific issues create signatures on the site, that when corrected, tend to be corrected across the entire site at the same time. With a reasonably popular site, it is then possible to predict Googlebot&#8217;s significant re-crawl of content, and the delay between that re-crawl and the increase ranking of existing content. </p>
<p>Of course, re-ranking pages allows another set of tests that one couldn&#8217;t normally do on a user site. Clients are usually focused on getting back into the listings. With suitable corrections, the site can not only get back, but start to rank for keywords that were previously untouched. This exposes a new audience segment to the sites&#8217; content. If the content is not addressed during the re-ranking exercise, then we sometimes get to see what happens when poor page content is exposed to the firehose of a major volume keyword &#8211; allowing the investigation of page content on user behaviour and the subsequent inferred response by Google when the content is judged inadequate by user behaviour.</p>
<p>Finally, one can occasionally find an example of a single off-site link that materially changes how the site ranks. This is usually discovered on small business sites, where some combination of offline connections or a lucky piece of PR manages to snag a high quality backlink to an inner page, with keyword loaded anchor text. The business owner may be puzzled by why one page ranks highly on a specific keyword, and the rest of the site is barely present. Identifying the link, or handful of links, involved and showing the business owner how much effect a high quality backlink can have, is usually an enjoyable experience.</p>
<p>These are valid ways to conduct scientific testing of ranking factors. Although it is much easier, and more predictable and repeatable to conduct tests on low search-volume keywords, Google&#8217;s behaviour is necessarily different on low volume and high volume searches. Much of SEO lore depends on testing on low search volume keywords. This gives undue weight to factors that are insignificant, or of low impact, on higher volume searches. It&#8217;s only technical failures on high volume sites that can yield the discovery of the importance of ranking factors for competitive terms. And since you can&#8217;t, in good conscience, go to the owner of a high ranking site and ask to sacrifice traffic for the sake of an experiment, technical failures on these sites tend to be the best way to collect the data &#8211; albeit sporadically.</p>
<p>There is an entire sub-set of scientific method associated with looking at rare examples of events that illuminate topics that are very difficult to experiment on. For example, almost anything involving a rare species of organism is non-destructive and observational, only. Much of astronomy depends on a rare event being observed, such as the impact of Shoemaker-Levy 9 on Jupiter. Most science classes teach the basic methods of science, which are easiest to demonstrate in class with repeatable experiments. However, this overemphasises the importance of reproducibility &#8211; observation of rare events is still a crucial scientific technique, albeit harder to properly explore in a science lesson in school. This is probably, also, one of the reasons that evolution is regarded with suspicion &#8211; the main investigative technique for evolution, as for astronomy, is observation, not reproducible test; when your understanding of science is driven from being able to reproduce tests, then observational science is downrated in importance and treated with suspicion.</p>
<p>In the SEO world, a reliance on reproducible testing will, I think, tend to overemphasise easily repeated techniques that work for low search volumes, at the expense of optimisation techniques that are more effective for highly competitive and higher search volume keywords (harder to repeatedly test).</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=461" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/08/11/seo-ranking-insights-from-troubleshooting-web-site-problems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SEO: Close Reading Of Search Results</title>
		<link>http://blog.merjis.com/2010/05/17/seo-close-reading-of-search-results/</link>
		<comments>http://blog.merjis.com/2010/05/17/seo-close-reading-of-search-results/#comments</comments>
		<pubDate>Mon, 17 May 2010 12:12:01 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spiders]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=400</guid>
		<description><![CDATA[Looking closely at Google&#8217;s search results can be informative &#8211; at least, if you take some inductive leaps, and apply knowledge learned in other activities. Take a look at the graphic, showing these early April 2010 search results for Matt Cutts&#8217; web site. Notice that the same articles appear several times, wIth slightly different URLs? [...]]]></description>
			<content:encoded><![CDATA[<p>Looking closely at Google&#8217;s search results can be informative &#8211; at least, if you take some inductive leaps, and apply knowledge learned in other activities. Take a look at the graphic, showing these early April 2010 search results for Matt Cutts&#8217; web site. Notice that the same articles appear several times, wIth slightly different URLs? Now look further down the results. See any more duplications? For example, that mini-review of the iPad. I can&#8217;t see that article listed anywhere else in the search results. </p>
<div id="attachment_401" class="wp-caption alignnone" style="width: 610px"><a href="http://blog.merjis.com/wp-content/uploads/2010/04/site_mattcutts.com-Google-Search.gif"><img src="http://blog.merjis.com/wp-content/uploads/2010/04/site_mattcutts.com-Google-Search.gif" alt="Search results for Matt Cutts site, for the last year, sorted by date not relevance" title="site_mattcutts.com - Google Search" width="600" height="545" class="size-full wp-image-401" /></a><p class="wp-caption-text">Notice that recent articles are sometimes shown twice or more, with different URLs</p></div>
<p>Why are the articles shown twice or more? Because they have different URLs. They are the same page content, but on different paths. Duplicates. Notice that Matt&#8217;s blog is not penalised for these duplicates &#8211; because they aren&#8217;t what Google considers to be duplicate content. They are the same content, reached by different paths, and in this case the different paths are tracking parameters for Google Analytics. It&#8217;d be more than a little annoying if Google handled &#8220;duplicate&#8221; pages, caused by using their own tracking parameters, as if they were different resources!</p>
<p>Why aren&#8217;t other articles on the blog shown twice or more times? What magic makes it that two day old articles get three showings, but articles five days old and older show just once?</p>
<p>Interesting, isn&#8217;t it?</p>
<p>Now, instead of sorting by date order, sort by relevance. Let&#8217;s look for those duplications again:<br />
<div id="attachment_402" class="wp-caption alignnone" style="width: 610px"><a href="http://blog.merjis.com/wp-content/uploads/2010/04/site_mattcutts.com-Google-Search-2.gif"><img src="http://blog.merjis.com/wp-content/uploads/2010/04/site_mattcutts.com-Google-Search-2.gif" alt="Search for the site of matt cutts, look at the last page of results to see the duplicates" title="site_mattcutts.com - Google Search-2" width="600" height="666" class="size-full wp-image-402" /></a><p class="wp-caption-text">The duplicate pages, with different URLs appear lowest for relevance.</p></div></p>
<p>So these duplicate pages of the last few days, appear as the lowest relevance pages in the last year of articles on Matt&#8217;s site. Yet they have the same text as the main article&#8230; or do they? Might there be something magic that the cache shows us?</p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2010/04/Things-to-do-in-Japan-and-Thailand.gif"><img src="http://blog.merjis.com/wp-content/uploads/2010/04/Things-to-do-in-Japan-and-Thailand-300x235.gif" alt="" title="Things to do in Japan and Thailand?" width="600" height="470" class="alignnone size-medium wp-image-403" /></a></p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2010/04/Things-to-do-in-Japan-and-Thailand-2.gif"><img src="http://blog.merjis.com/wp-content/uploads/2010/04/Things-to-do-in-Japan-and-Thailand-2-300x200.gif" alt="" title="Things to do in Japan and Thailand?-2" width="600" height="400" class="alignnone size-medium wp-image-406" /></a></p>
<p>The cached versions show us that they were taken up to a week previously &#8211; and reveal that these were found from Feedburner. <i>Google is taking different URLs, determined by the tracking tags, and including them in results.</i> The weight for the tagged pages is lower &#8211; Matt&#8217;s blog doesn&#8217;t refer to these URLs, and nor does anything else that Googlebot will crawl. So these additional pages rank lower, despite having the same content &#8211; the difference lies in the backlinks, and perhaps other factors. </p>
<p>But if there are no external backlinks to these articles, what makes them appear in results, at all, in any position? This, I think, is where we drag in another factor that seems to apply, especially to blogs. Trust. If a blog is well trusted by Google, then articles posted will tend to rank highly in search results, in as little as a few minutes for highly relevant searches (relevant, that is, to what Google thinks the article is about, which mostly means the title). Amongst the two hundred-or-so factors that Google is looking at, is whether they trust your results are likely to satisfy searchers. If your articles have enough hits from satisfied (long reading) users, you leap up the rankings within minutes of posting. Less trusted, lower weight postings won&#8217;t appear on page one, if they appear at all. That trust is extended to articles with no backlinks &#8211; these pages appear in results because they&#8217;re coming from a trusted site. </p>
<h3>Why do these duplicate pages disappear, and when, and what does that tell us?</h3>
<p>At some point in the last week, Google has removed the extra results for other page references with additional parameters. If we look again next week, we&#8217;ll see that the extra results for the current recent articles have also disappeared, but if Matt makes more posts, we may see those articles with extra tracking parameters for a brief period. Why do these &#8220;duplicate&#8221; pages disappear?</p>
<p>Well, one reason is that Google detests spam. A bad page of search results would be a page that contained different sites with the same article &#8211; because that wouldn&#8217;t reflect the diversity of opinion and solution on the web. Google would prefer to have ten different answers to the search query, than one answer on ten highly ranked sites. There&#8217;s a conscious effort at Google to compare articles. </p>
<p>How does Google identify that? Well, we can take some guesses by looking at the search results. Notice that bit that says &#8220;cached&#8221;? Taking several shots of a page (cached pages), over time, lets Google see that page content is evolving. User generated content accretes to a page &#8211; and that&#8217;s visible in successive cached snaphots. </p>
<p>Google can therefore see which parts of a page are static and which are likely to be UGC, <i>even without any effort to understand the page structure</i>. Matts&#8217; articles get a lot of commentary, so it&#8217;d be pretty easy for Google to determine that a common core of the page is unchanging &#8211; and identical. How easy? Well, there have been tools to perform that kind of textual analysis with computers since at least the 1970&#8242;s to my certain knowledge &#8211; I used them back then. More recent techniques have used data compression techniques to compare compression rates of samples, using the highly optimised algorithms for good data compression &#8211; and Google has smart people who probably do more complex stuff than that. </p>
<h2>Canonical Link References</h2>
<p>One more datum to collect! Canonical Link References &#8211; does Matt&#8217;s blog use the canonical link ref to make sure that Google knows the best URL for this article? </p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2010/04/Source-of-http___www.mattcutts.com_blog_site-speed_.gif"><img src="http://blog.merjis.com/wp-content/uploads/2010/04/Source-of-http___www.mattcutts.com_blog_site-speed_.gif" alt="" title="Source of http___www.mattcutts.com_blog_site-speed_" width="600" height="450" class="alignnone size-full wp-image-409" /></a></p>
<p>The answer is very much affirmative &#8211; Matt so much likes the canonical link reference, it is added twice in the header! Hmm &#8211; some problem with conflicting plugins, perhaps? I don&#8217;t think this duplication of the canonical link reference is intentional. Fortunately they are the same, or we&#8217;d get into some discussions about whether Google believes the first or second canonical link reference! </p>
<p>What *should* the canonical link reference do for Google, when we see the variant forms with tracking tags? It should tell Google that the preferred form is the one without the tracking tags. We should end up with just the preferred form showing in search results. And that&#8217;s what we see &#8211; but the canonical link reference isn&#8217;t the only way that Google looks for probably duplicated data. You can tell, if you look for other blogs that use FeedBurner, but that don&#8217;t use canonical link references. Those blogs still get deduplicated listings &#8211; showing that other components to deduplicate are still working, but still take days to do so. </p>
<h2>What does the transitory presence of these pages tell us? </h2>
<p>Thinking solely about what Matt&#8217;s results are showing us &#8211; not taking any evidence from other experiments and tests into account&#8230;</p>
<p>I think the presence of these pages in the search results says that Google is reading FeedBurner feeds for Matt&#8217;s blog, and getting tagged data, tagged for Google Analytics. I&#8217;m guessing that the tags are not set as ignored in Matt&#8217;s Webmaster Console, so Google sees the tagged pages as different. Because users are not linking to them, and the blog itself doesn&#8217;t refer to them, the usage eventually dies out &#8211; where &#8220;eventually&#8221; means less than a week. And the canonical link reference probably also helps. </p>
<p>Note that the alternate pages *are* present. Not highly ranked, but present. So&#8230; the content is the same, the server is the same, but the links to these pages are different; the untagged page is referenced within the blog (by any category or blog tag and the archive), and after a day or so, there are probably some links to these articles, all probably pointing to the untagged page. So that tells us that backlinks are important &#8211; or there&#8217;d be no reason why these tagged pages shouldn&#8217;t rank as highly &#8211; the user experience is likely to be the same, the content was (at the time of first snapshot/cache) the same. So if backlinks within Matt&#8217;s blog weren&#8217;t important, then the pages should rank with equal weight&#8230; and they clearly don&#8217;t. </p>
<p>This observation also says something important to us &#8211; if we watch our web server log files carefully! You may have read some of my other articles here about web server log file analysis for search engine optimisation &#8211; I think the web server log files tell us important things about how Google perceives our sites. If we see Googlebot requesting a tagged resource, then that tells us something about what resources we have out there, our state of canonicalisation, and Google&#8217;s speed of change. We want to know the speed of change, because when we&#8217;re trying to improve performance, we&#8217;ll want to see when we might expect to see results, at earliest. </p>
<p>We don&#8217;t, unfortunately, have access to Matt&#8217;s web server log files&#8230; but you have access to yours&#8230; What do they tell you, under similar conditions? I may return to this topic, as I think the research is pretty interesting for what it tells us about how Google goes about making decisions. </p>
<h2>Summary</h2>
<p>Google sees the same blog article under different URLs over time. Google collapses the references to a single page URL &#8211; probably using intrinsic information (page content) and supplied information (canonical link references), and backlinks, and the Webmaster Tools mechanism to instruct Google to ignore certain parameters. </p>
<p>URLs for alternate page presentations get into the results for trusted sites, even if these URLs have no weighty backlinks. Not highly ranked, but they do appear. They appear quickly and disappear within a few days. This suggests that whatever it is at Google that evaluates, takes a few days to do so. </p>
<p>I don&#8217;t think this points to content being the key, but to content with backlinks and user-preferred results, as being the most important. And the presence of these duplicates is a hint that a site is trusted. Not a great hint, as if you don&#8217;t use FeedBurner or another similar tagging resource, you won&#8217;t see this effect. </p>
<p>I&#8217;ve not considered the backlinks to Matts blog in detail&#8230; there are a lot and the signal is messy. I&#8217;ve left that analysis out of this article. Besides, discovering the timeline of backlinks is itself a pretty tricky exercise, unless you, as the experimenter, control them; that&#8217;s definitely not the case for Matt&#8217;s blog!</p>
<p>What&#8217;s especially interesting is that an article, without significant backlinks to it yet, can appear high in search results, but that this tends to be for the canonical representation of the page, even though Google is probably learning of the page via FeedBurner/pingomatic notification (IOW, likely to be tagged). So the first mechanism that notifies Google of the URL, is probably *NOT* the canonical form &#8211; yet the canonical form is listed highest, within minutes. That suggests some fast processes at Google for identifying and evaluating a page&#8217;s relevance, and then some slower processes to determine whether the page can be justified for continuing presence. </p>
<p>And that behaviour of high ranking without external justification, in turn, has some implications for the weight that will flow from a blog article&#8230; It is initially likely to be low, and if the article &#8220;sticks&#8221; in the results because it helps users, then it gains some kind of value. Otherwise, the weight of the page will remain low &#8211; though, strictly, understanding that evolution of the weight of the article requires looking at the impact of links from articles in a blog. Another day, perhaps ;)</p>
<p>I hope you found this close look at search results amusing, if not educational. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=400" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/05/17/seo-close-reading-of-search-results/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Non-news: Malformed URLs don&#8217;t pass Anchor Text.</title>
		<link>http://blog.merjis.com/2010/04/09/non-news-malformed-urls-dont-pass-anchor-text/</link>
		<comments>http://blog.merjis.com/2010/04/09/non-news-malformed-urls-dont-pass-anchor-text/#comments</comments>
		<pubDate>Fri, 09 Apr 2010 22:47:08 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[spiders]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=385</guid>
		<description><![CDATA[I&#8217;ve started another burst of postings about web server log file analysis and what it tells search engine optimisers about search engine spiders. Web spider behaviour often lies behind issues that I find on other blogs. For example, Dave Naylor has a couple of recent articles that are interesting. A good one to read is [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve started another burst of postings about web server log file analysis and what it tells search engine optimisers about search engine spiders. Web spider behaviour often lies behind issues that I find on other blogs. For example, Dave Naylor has a couple of recent articles that are interesting. A good one to read is about <a href="http://www.davidnaylor.co.uk/increasing-sales-with-google-analytics-motion-charts.html">using the &#8220;motion charts&#8221; in Google Analytics to find opportunities</a>. But there&#8217;s an odder one about Anchor Text. Some of that article is confirmation of stuff Matt Cutts has written about &#8211; the first link being the one that carries anchor text value, for example, or <a href="http://www.mattcutts.com/blog/pagerank-sculpting/">anchor text and nofollow</a>, or delayed echoes of <a href="http://www.seomoz.org/blog/results-of-google-experimentation-only-the-first-anchor-text-counts">Rand Fishkin&#8217;s recent article on Anchor Text</a>.</p>
<p>Apart from the validation of Matt Cutts statements, there&#8217;s one result that appears blindingly obvious. Malformed URLs don&#8217;t pass anchor text &#8211; and by implication, weight. In the context of the example in the article, adding a space to a URL in the anchor, destroys the value. Googlebot changes spaces (which aren&#8217;t valid characters in a URL) into &#8220;%20&#8243; symbols. In Dave Naylor&#8217;s article, that means that the Googlebot will do a DNS lookup for a domain that doesn&#8217;t and can&#8217;t exist &#8211; spaces are not allowed in domain names. If the URL in the anchor&#8217;s href had been a fully pathed URL, then a space would be added to the end and converted to a &#8220;%20&#8243;. </p>
<p>That full URL, with an appended &#8220;%20&#8243; won&#8217;t be found on the site. It should appear, at some point, in web server log files as a 404 for a Googlebot visit.  404&#8242;s don&#8217;t pass weight. So why the surprise that a malformed URL would fail? </p>
<p>I think the real point, not cleanly spelled out in the article, is that <b><i>web browsers don&#8217;t parse pages the way that search engine spiders parse pages</i></b>. A browser will cope with the embedded space. That ability of a browser to infer the useful thing to do, <a href="http://www.w3.org/Addressing/URL/url-spec.txt">doesn&#8217;t make the space into a valid character in URLs</a> &#8211; not without being escaped, anyway. And the consequence of appending the space, will be that a web spider makes a request for a resource that will usually be 404&#8242;ed, unless the administrator has used <a href="http://httpd.apache.org/docs/1.3/mod/mod_speling.html">Apache</a> <a href="http://httpd.apache.org/docs/2.0/mod/mod_speling.html">mod_speling</a> or an equivalent typo-correction tool (which should yield a 301 redirect to the correct resource).</p>
<p>Attempting to infer the SEO value of browser interpreted behaviour, without understanding Googlebot behaviour, will create confusing and misleading problems. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=385" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/04/09/non-news-malformed-urls-dont-pass-anchor-text/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Googlebot and Search Visitors</title>
		<link>http://blog.merjis.com/2010/04/08/googlebot-and-search-visitors/</link>
		<comments>http://blog.merjis.com/2010/04/08/googlebot-and-search-visitors/#comments</comments>
		<pubDate>Thu, 08 Apr 2010 14:28:25 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spiders]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=362</guid>
		<description><![CDATA[I&#8217;ve been interested in the behaviour of Googlebot, the robot that Google uses to crawl the web, for years. It&#8217;s a topic that seems largely unaddressed by search engine optimisers, yet the behaviour of Googlebot should be extremely important. After all, uncrawled sites tend to have problems with ranking many pages &#8211; the best you [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been interested in the behaviour of Googlebot, the robot that Google uses to crawl the web, for years. It&#8217;s a topic that seems largely unaddressed by search engine optimisers, yet the behaviour of Googlebot should be extremely important. After all, uncrawled sites tend to have problems with ranking many pages &#8211; the best you can get is to have pages ranked that other people are pointing to, which, for most businesses, tends to be just the home page. </p>
<p>I&#8217;ve fairly recently had discussions with a few web site managers who&#8217;d made what appears to me to be the most peculiar decision &#8211; to block Googlebot because of the traffic impact. This resonated with a previous short article that I&#8217;d posted, about a problem identified by a Google staffer who was running his own blog. He&#8217;d seen his <a href="http://blog.merjis.com/2010/02/25/google-hates-me-im-being-penalised-or-not/">blog dropped from search results</a> and was looking for why that might be happening. </p>
<p>There&#8217;s certainly a potential problem &#8211; low bandwidth sites may suffer if Googlebot consumes the available bandwidth. But if you don&#8217;t have Googlebot crawling, then how are you going to appear, anyway? </p>
<p>You could use the Webmaster Tools to request that Google slows the crawl for your site. This should still result in having the crawling and indexing, and minimal damage to the traffic. But just disabling the crawl, by using robots.txt to block all crawling, or to block crawling of large sections of the site that should have user interaction, is probably a mistake.</p>
<p>There is also the legitimate concern that Googlebot&#8217;s visits might be draining server resources at peak traffic periods. That&#8217;s moderately difficult for non-technical site owners to work out. Google Analytics (and the other JavaScript page bug based web analytics packages, such as CoreMetrics, Omniture, Webtrends, etc) measure user visits, not Googlebot and other bot visits. Verifying that Googlebot isn&#8217;t interfering with and slowing down visitors, is pretty much impossible to understand without going to web server log file analysis.</p>
<h2>Web Server Log File Analysis</h2>
<p>I like web server log files. There&#8217;s things I can find out from them, in a few hours, that I simply can&#8217;t find from Google Analytics, CoreMetrics and Omniture. Look at this graph, for example. I&#8217;ve taken web server log files from a UK-targeted business, and extracted Google-inspired visits and Googlebot visits, by hour. </p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2010/04/Googlebot-crawl-rate.gif"><img src="http://blog.merjis.com/wp-content/uploads/2010/04/Googlebot-crawl-rate.gif" alt="Graph shows that Googlebot is more active when visitors aren&#039;t present" title="Googlebot crawl rate" width="600" height="506" class="alignnone size-medium wp-image-363" /></a></p>
<p>The graph shows that Googlebot is busiest when users are less present. That is, when Google can see visitors coming to the site, the crawl volume is reduced. </p>
<p>This pattern of making Googlebot most active when the site visits are least active, seems to be the most common pattern that I can see in clients&#8217; web server logfiles. It makes a lot of sense for Google, too: </p>
<ul>
<li>Continuing visits by Googlebot allow them to check that the site is still working (preventing Google from delivering users to a 404&#8242;ed page)</li>
<li>Site performance under load can be monitored (helping Googlebot tune crawling rates, and verifying that users are getting responses from the site, mostly)</li>
</ul>
<h2>Summary</h2>
<p>Googlebot seems to be quite smart about when it visits sites. The more users that are being sent to a site in a given hour, the relatively lower rate that it crawls. So Googlebot should never get in the way of visitors, under normal conditions.</p>
<p>Simply disabling Googlebot looks like a weak way to go.</p>
<p>Following <a href="http://googlewebmastercentral.blogspot.com/2010/03/working-with-multi-regional-websites.html">suggestions from the Google Webmaster Blog</a>, if you have areas of the website that change at different speeds, you might want to validate multiple webmaster consoles for different sections of the site. That would allow setting different crawl rates. I&#8217;ve not tried this, yet&#8230; I don&#8217;t have a client for whom I want to restrict crawling speeds!</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=362" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/04/08/googlebot-and-search-visitors/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Blog Spammers Target Blogs</title>
		<link>http://blog.merjis.com/2010/04/08/blog-spammers-target-blogs/</link>
		<comments>http://blog.merjis.com/2010/04/08/blog-spammers-target-blogs/#comments</comments>
		<pubDate>Thu, 08 Apr 2010 07:04:01 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[spamfighting]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=343</guid>
		<description><![CDATA[Hardly news, but people who spam blogs, use search to look for them. So is there anything that can be done to reduce the attractiveness of a blog to spam? Have a look at this recent sample of keywords that lead to this blog, taken from Google Analytics. Spot anything? Yup, a large fraction of [...]]]></description>
			<content:encoded><![CDATA[<p>Hardly news, but people who spam blogs, use search to look for them. So is there anything that can be done to reduce the attractiveness of a blog to spam? Have a look at this recent sample of keywords that lead to this blog, taken from Google Analytics. Spot anything?<br />
<a href="http://blog.merjis.com/wp-content/uploads/2010/04/Keywords-Google-Analytics.jpg"><img src="http://blog.merjis.com/wp-content/uploads/2010/04/Keywords-Google-Analytics.jpg" alt="" title="Keywords - Google Analytics" width="449" height="730" class="alignnone size-full wp-image-342" /></a><br />
Yup, a large fraction of the searchers are looking for the default text that WordPress drops at the bottom of an article, inviting user responses. The keywords are, of course, also skewed by the content of the blog. If I hadn&#8217;t used footwear examples in some early articles, I wouldn&#8217;t have those keywords in the searches. </p>
<p>The searches are also amusing, because this <a href="http://blog.merjis.com/comment-policy/">blog has a comment policy</a>. Searches looking for text indicating that this is a blog that accepts user generated content, will tend to find that policy page. In turn, that means that I get a lot of flattering comments about the article&#8230; posted to the comment policy. Bit of a tell that the commenter isn&#8217;t really engaged. </p>
<p>What&#8217;s also interesting is where people are coming from. Mostly China, India, the US. </p>
<p>So, is there anything that can be done, apart from using Akismet and other despamming tools, to help reduce blogspam? Might be worth changing the text, or possibly replacing the text with a graphic, to indicate where a comment should be left. That&#8217;d reduce the search volume coming to the blog from people intent on spamming. However, that&#8217;s essentially a defence based on security through obscurity &#8211; I&#8217;m going to experiment with changing that text and see what it does to searches leading to this blog. :)</p>
<p>I will, of course, continue to use the comment policy page as a great detector of spam. All those people telling me how great the article is, submitting to the comment policy page, have completely signalled their lack of interest in contributing! Like all these spammy comments, mostly targeting a few choice pages:<br />
<a href="http://blog.merjis.com/wp-content/uploads/2010/04/Edit-Comments-‹-Merjis-Internet-Marketing-Blog-—-WordPress.gif"><img src="http://blog.merjis.com/wp-content/uploads/2010/04/Edit-Comments-‹-Merjis-Internet-Marketing-Blog-—-WordPress.gif" alt="List of articles that spammers target, showing the popularity of the comment policy" title="Edit Comments ‹ Merjis Internet Marketing Blog — WordPress" width="220" height="664" class="alignnone size-full wp-image-372" /></a><br />
That sample is moderately representative &#8211; about half the spam submitted, is submitted to the comment policy page.</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=343" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/04/08/blog-spammers-target-blogs/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>SEO: Click Through Rate and Bounce Rate</title>
		<link>http://blog.merjis.com/2010/03/26/seo-click-through-rate-and-bounce-rate/</link>
		<comments>http://blog.merjis.com/2010/03/26/seo-click-through-rate-and-bounce-rate/#comments</comments>
		<pubDate>Fri, 26 Mar 2010 21:16:53 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[adwords]]></category>
		<category><![CDATA[click fraud]]></category>
		<category><![CDATA[intent]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spamfighting]]></category>
		<category><![CDATA[usability]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=340</guid>
		<description><![CDATA[I&#8217;m going to take issue with Rand Fishkin of SEOmoz. I think his most recent White Board Friday video is just plain wrong. Normally, I have a lot of respect for what SEOmoz does, but I think the advice and implications are not just wrong, but dangerously wrong. How Does Google Rank Results I don&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m going to take issue with Rand Fishkin of SEOmoz. I think his most recent <a href="http://www.seomoz.org/blog/whiteboard-friday-influence-of-usage-data#ergabbj-threttuy">White Board Friday</a> video is just plain wrong. Normally, I have a lot of respect for what SEOmoz does, but I think the advice and implications are not just wrong, but dangerously wrong.</p>
<h2>How Does Google Rank Results</h2>
<p>I don&#8217;t know all the details. Rand doesn&#8217;t know all the details. Some guys at Google know a lot of the factors. Matt Cutts, Google&#8217;s head of the search quality team, claims over 200 factors go into ranking. </p>
<p><object width="580" height="360"><param name="movie" value="http://www.youtube.com/v/muSIzHurn4U&#038;hl=en_US&#038;fs=1&#038;rel=0&#038;border=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/muSIzHurn4U&#038;hl=en_US&#038;fs=1&#038;rel=0&#038;border=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="580" height="360"></embed></object></p>
<p>What we do know is that backlinks &#8211; credible links regarded by Google as likely for a search user to visit &#8211; are important. We know that anchor text is important. There&#8217;s some other factors that we know influence Google ranking.</p>
<h2>What *else* do we know?</h2>
<p>We (professional search engine optimisation people) know that on-page content is valuable. For low competition keywords &#8211; keywords where there aren&#8217;t a lot of links and anchor text, and hardly anyone searches &#8211; then page content is enough. Look at the example in the graphic below. There&#8217;s precisely one page on the internet, with that text for something that I can&#8217;t find on Google. When I wrote that, it was true; if you search now, you&#8217;ll find that page. Well, until some spoiler copies it elsewhere&#8230;</p>
<div><a href="http://skitch.com/jezchatfield/n59q8/jeremy-chatfield-google-profile"><img src="http://img.skitch.com/20100326-f3g482f6c6a8ew6js18eey13ry.jpg" alt="Jeremy Chatfield - Google Profile" width=600 /></a></div>
<p>However, try putting the word &#8220;bad credit loan&#8221; on a page on a new web site with some other relevant and unique content, valuable to a user, and see how high you rank for the term. You can wait. And wait. And wait. You&#8217;re not going to show up on the first page of results, just by having a great page alone. It&#8217;s not just the content, it&#8217;s the backlinks that make the difference. </p>
<p>So we now know, as a result of this test, that while Google does pay attention to on-page factors, they also pay attention to backlinks. And in competitive spaces, *effective* <a href="http://www.google.com/support/forum/p/Webmasters/thread?tid=1c377284e24be6db&#038;hl=en" title="Amusing thread about 'Best SEO Company Search Engine Placement'">backlinks count for more than the page content</a>. </p>
<p>The important message to understand from this is that different factors apply under different conditions. Content alone won&#8217;t put you on page one. Backlinks alone won&#8217;t keep you there.</p>
<h2>Click Through Rate and Bounce Rate</h2>
<p>So, at some scale, do CTR (Click Through Rate) and Bounce Rate make any difference? I believe they do, and this blog is a testament to that. Look at this screenshot.</p>
<div><a href="http://skitch.com/jezchatfield/n59t5/content-detail-google-analytics"><img src="http://img.skitch.com/20100326-qf7ibj264gmipd2fxg512jxd1m.jpg" alt="Content Detail: - Google Analytics" width=600 /></a></div>
<p>That&#8217;s a Google Analytics shot of the last 15 months activity for a specific page on the Merjis blog. It&#8217;s all about &#8220;<a href="http://blog.merjis.com/2007/07/16/click-fraud-google-adwords-and-gclid/">gclid</a>&#8221; &#8211; something you&#8217;ll probably care about if you do paid search and look in web server logfiles. </p>
<p>I&#8217;m using this blog as an example, because I&#8217;ve been using it for tests for years &#8211; I know how it works, and it isn&#8217;t confidential client data. I can reveal the usage, because I have my own reasons for running a blog, and few of them directly have anything to with making money.</p>
<p>Most other pages on this site get a profile like this other example:</p>
<div><a href="http://skitch.com/jezchatfield/n5917/content-detail-google-analytics"><img src="http://img.skitch.com/20100326-rs3mrt1rbg886putn3jitpmdnh.jpg" alt="Content Detail: - Google Analytics" width=600 /></a></div>
<p>This is pretty typical for a &#8220;newsy&#8221; blog article. Usage on the day that it is written, and a dribble thereafter. It then usually dries up after a few weeks, because the rank has decayed with time. </p>
<p>So why, with a higher bounce rate, does the older article do better than the newer article in rankings? If Bounce Rate is important, then surely the lower bounce rate in a newer article must mean that Google should drop the older article?</p>
<p>I suspect that Google doesn&#8217;t have a rigid number. They look at how well you do relative to other sites. And especially, they look to see whether search users search again for the same or very similar searches. Read that article on SideWiki, and it&#8217;s lightweight. No real information. No real recommendations. The long lived article on gclid has a much higher bounce rate <i>and longer reading time</i>. It&#8217;s the reading time that&#8217;s the clue. When you&#8217;ve read my article on gclid, you probably don&#8217;t want to read another article about gclid. It&#8217;s reasonably definitive.</p>
<p>Google sustains that old article in search results, despite its&#8217; great age, and despite a high bounce rate, because those users who do read it, value it. It&#8217;s there, because it helps Google to deliver a page of search results that users value more than *without* that article present. </p>
<h2>Uh &#8211; You Didn&#8217;t Mention CTR</h2>
<p>Again, I don&#8217;t think it is actually CTR that Google is looking for. It is user satisfaction. So a high CTR, caused by a misleading piece of copy, won&#8217;t help. You have to deliver what you offer. Again, I don&#8217;t think that Google is measuring conversion, either. But a high CTR message with a high conversion rate, meaning that users are highly satisfied &#8211; that&#8217;s what Google wants you to make. </p>
<p>You won&#8217;t be directly rewarded for high CTR &#8211; but you can measure it (especially if you also run PPC and can get the impression rate). You won&#8217;t be rewarded directly by Google for high conversion rates. But Google does appear to prefer sites that answer the question posed by the search query. And the proxy that can be used by Webmasters, who don&#8217;t have access to Google&#8217;s richer data, is their own performance, as CTR and Conversion Rate. Increase those, and you are more likely to increase position.</p>
<h2>Interaction of Factors</h2>
<p>If you have a good site, with highly relevant content, you tend to get more links. So disentangling backlinks, and the immeasurable relative user satisfaction, is difficult. Pretty much the only way that I know it can be done, is when you have web sites with accidental misbehaviours that create the right conditions for a test. The technical problems that create the conditions are rare &#8211; and recreating them in a real website is likely to decrease the performance. It&#8217;s unlikely that anyone will give you the opportunity to mess up their site, just to prove what works.</p>
<p>However, if you want to go about it&#8230; Here&#8217;s what I think you&#8217;ll need:</p>
<ul>
<li>A visibly horrible page, with a low conversion &#8211; as your starting point</li>
<li>Weak Title and Meta Description as a starting point</li>
<li>A lot of visitors per day &#8211; it takes a long time to demonstrate, otherwise</li>
<li>The ability to make sitewide link changes to the page under test</li>
<li>Good backlinks &#8211; you&#8217;ll want to know that you *could* rank well on page one</li>
</ul>
<p>Change the URL for your horrible page, sitewide. Wait for Google to find it and rank it again. Note the position. Watch the position fall over a period of a week or two (depending on visitor volume). Now improve the page, and switch the URL again and wait for Google to find and rank it. Then watch the rankings change and note which way they go. Now revert the page and switch URLs again, and this time change the Title and Meta Description. Now watch the ranking changes. Now fix up the page again and once more switch the URL and watch. </p>
<p>You should, IME, find that you achieve a higher long term position when you have a better title and description, and a higher converting page with a lower bounce rate. If you can explain why you *shouldn&#8217;t* get a higher position with a site that is better for users, I&#8217;d love to know the reasons. But don&#8217;t make your explanation involve &#8220;gaming&#8221; the system. </p>
<p>And, FWIW, I don&#8217;t believe that the Title and Description are important, as direct factors for SEO. You can rank perfectly well for keyword free pointless titles, and descriptions without keywords that are positively turgid and rambling. However, show the user that you are focused on solving their problem, and your CTR increases; and if you are focused on the user, you&#8217;ll probably have a reasonable landing page, which will engage and convert better. Google&#8217;s not going to reward you for a better snippet, directly, but for a better user experience. Your only measures though, will be what you can observe &#8211; CTR, Bounces, Conversions. If I could tell you to look at the &#8220;re-query rate&#8221;, I&#8217;d tell you to do so &#8211; instead, you&#8217;ll have to use the information you can get.</p>
<h2>Implications For SEO</h2>
<p>If a blog article can decay to little traffic in a few weeks, or sustain rankings for years, on the same blog, with the same blogging software, then the difference must be backlinks? Well, not substantially. Over the years, I&#8217;ve had more backlinks to newsy stories, but still this &#8220;gclid&#8221; article keeps ranking. And all the time, the other lighter weight articles just keep falling out of the listings. </p>
<p>There&#8217;s a few other similar articles on this blog that rank, and stay high for years and years and years. Non-competitive searches, but of long lasting traffic value. And the other sites that I&#8217;m competing with, for attention, are large forums. High weight. Much more frequently updated content. I&#8217;m *deliberately* not trying to place links for articles. Just letting what happens, happen &#8211; so I can understand why it happens. So there&#8217;s no contamination effects here with deliberate link placements. </p>
<p>What are the articles? They all tend to be like that gclid article. Something that is detailed, informative, and means that you can go away and do something. Useful articles, in other words. Harder to write than &#8220;straight news&#8221; articles, as you need unique content, written to address the audience. That&#8217;s part of my reason for writing &#8211; attempting to develop clearer communication.</p>
<p>The clear implication is, I think, that useful content matters. And how do we know it is useful? It&#8217;ll show up in search engine rankings, usablility data and other disturbingly hidden and arcane resource. Google will reward useful content with a better sustained rank &#8211; but won&#8217;t put you on page one just because you have a great article, unless you have some backlinks to create credibility. </p>
<h2>But How?</h2>
<p>Rand makes the point that data about use can be gamed. But so can backlinks. That&#8217;s the major part of undeclared paid backlinks, small world building, and other &#8220;black hat&#8221; techniques. We know that Google sees through most black hat techniques, given time. </p>
<p>We also know, or can find out about, Google&#8217;s interest in invalid impressions and invalid clicks. For example, invalid impressions are generated when search engine ranking tools are run &#8211; they reduce the effective CTR. Invalid clicks are generated when users double click, or are paid to click. Just as with paid search, these two types of invalid activity are measurable by Google. In fact, Google can measure a lot more than a webmaster can see. </p>
<p>We webmasters only get to see bounce rates and conversions. Google gets to look at whether users search again. Much more valuable. If you want to build the worlds&#8217; best search engine, then you want to feature the results that tell you that you&#8217;ve got a winning page &#8211; pages where users don&#8217;t need to search any more. Results that have users positively selecting that site again, when they see it in listings. Webmasters just don&#8217;t have that detail, directly. We just don&#8217;t know if the other guy answers better &#8211; unless we expend effort to learn our customers&#8217; minds and make sure we have the best answer.</p>
<h2>User Experience</h2>
<p><a href="http://www.google.com/corporate/tenthings.html">Google&#8217;s Ten Things</a> lists, first, &#8220;Focus on the user&#8221;. The results from this blog, and from other client activities that I&#8217;m not going to reveal in any detail, are fairly clear. Content that Google can measure as being liked by users, rank better and longer than content that is spammy, tedious and weak. The factors that lead to better rankings will include appropriate Titles and Descriptions and engaging content. It has to be, or rule 1 is broken.</p>
<p>We know that Google has experience of measuring impressions and data to look for invalid data. We know that Google is pretty good at it &#8211; or there&#8217;d be more click fraud problems with AdWords. So, if it can be done, and it is an important indication of quality, why wouldn&#8217;t Google use searchers behaviour to modify results, not just personally, but across the index?</p>
<p>Why can&#8217;t you improve the results when you click on your own listings? Because it is identical behaviour to the banned AdSense practice of clicking on adverts on your own site. Detectable. Invalid. Not counted. And for reasons that I don&#8217;t want to go into, I believe the same will be true of botnets and eLance and Mechanical Turk attacks. There will be a signature associated with them, that doesn&#8217;t match normal user behaviour. The signatures can be spotted and countered, by assigning the activities as invalid &#8211; just as it is in AdWords. Since AdWords continues to run without being infested with click fraud to unusable levels, we have a working system, on a global scale, that shows that user behaviour can be extracted from noisy fraudulent behaviour. </p>
<p>It isn&#8217;t perfect, true, but it separates AdWords from being a system that solely acts to transfer advertising funds to thieves, into a system that, more often than not delivers prospective buyers to an advertiser&#8217;s site. It isn&#8217;t perfect, but it works well enough. <b>AdWords only works because it identifies and categorises user behaviour.</b></p>
<p>User behaviour categorisation works in one system that Google has, worldwide, on a service with measurable economic value. Why wouldn&#8217;t it be usable in organic search results?</p>
<h2>Conclusions</h2>
<p>Failing to identify and understand user interests is an SEO mistake. These are reflected by (but are not completely explained by) CTR and Bounce Rates &#8211; because that&#8217;s about the best that Webmasters can get. Google doesn&#8217;t have to use those &#8211; they have better numbers that are more meaningful to user experience. But saying that &#8220;Google doesn&#8217;t use bounce rates&#8221; is not the same as saying &#8220;Google doesn&#8217;t take account of user behaviour&#8221;.</p>
<p>Unlike Rand, I believe that Google cares very deeply about the user experience, and that Google has very sophisticated technology, probably shared with the Google AdWords guys, to identify unusual search behaviours and exclude them from consideration. </p>
<p>Given enough data, probably gained from multivariate testing on all the different data centres, Google can identify whether users are more, or less, satisfied by different ordering in search results than a pure backlinks-plus-content model would give.</p>
<p>Small scale tests probably won&#8217;t show anything about user interaction &#8211; because the activity doesn&#8217;t have statistical significance or because the signature of strange search activity is too obvious. So, don&#8217;t try faking it &#8211; if you&#8217;ve read this far, you probably aren&#8217;t smart enough to outwit Google&#8217;s teams of click-fraud defence guys. They are really pretty good, as anyone with a rational assessment of AdWords click fraud levels will tell you. Not perfect, but good enough to make the effort of using AdWords worthwhile, rather than primarily a way of siphoning your advertising funds to fraudsters. :)</p>
<p>Why do I say &#8220;if you&#8217;ve read this far&#8221;? Because if you really knew how to hide click streams, you&#8217;d be doing it with AdSense. And you&#8217;d have stopped reading at that point &#8211; because you own the game already. If you can&#8217;t own that game, you can&#8217;t own the game of spoofing user behaviour in organic search &#8211; it is (not identical to, but close enough to) the same game. At the moment I don&#8217;t understand why you&#8217;d bother with SEO behavioural spoofing, if you&#8217;d gamed AdSense, because the revenue is a lot more direct&#8230; Maybe that&#8217;s why Rand hasn&#8217;t spoken with any black hatters that have cracked it? </p>
<p>And if Google can detect unusual impression and click data, then they can fulfil their primary mission, with respect to <b>modifying</b> organic rank based on real user data about preferences and satisfaction. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=340" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/03/26/seo-click-through-rate-and-bounce-rate/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>

