<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Merjis Internet Marketing Blog &#187; microsoft</title>
	<atom:link href="http://blog.merjis.com/category/microsoft/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.merjis.com</link>
	<description>Effective Internet Marketing Strategy and Tactics Through Test</description>
	<lastBuildDate>Thu, 12 Jan 2012 09:18:42 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>MSN adCenter User Community Vexation</title>
		<link>http://blog.merjis.com/2011/07/07/msn-adcenter-user-community-vexation/</link>
		<comments>http://blog.merjis.com/2011/07/07/msn-adcenter-user-community-vexation/#comments</comments>
		<pubDate>Thu, 07 Jul 2011 10:54:14 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[adCenter]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[microsoft]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=652</guid>
		<description><![CDATA[I&#8217;m going, someday, to write about a web site or business that I like&#8230; I promise. In the meantime, here&#8217;s another nutty bit of Microsoft activity. In September, the Bing and Yahoo search network will merge in the UK. While you can now do some activity on those networks and generate some business, because neither [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m going, someday, to write about a web site or business that I like&#8230; I promise. In the meantime, here&#8217;s another nutty bit of Microsoft activity. In September, the Bing and Yahoo search network will merge in the UK. While you can now do some activity on those networks and generate some business, because neither is more than about 1/6th of Google AdWords search traffic, it is hard to justify spending much time to optimise. With smaller auctions the price was generally lower, also meaning it was easy to just set up and run &#8211; no serious optimisation required. </p>
<p>When the networks merge, the combined total should probably be about 20-25% of the AdWords total. Now that&#8217;s of a size to start getting properly interested in optimisation, and one can justify some management time to organising and sorting it out. </p>
<p>Because we&#8217;re anticipating increasing client spend on the enlarged network, we&#8217;re looking at what techniques we can use to optimise. That&#8217;s what we do&#8230; optimise stuff, after digging into it. So I wanted to comment on something that I thought would be useful to do, in the larger network. I went to the Microsoft adCenter user discussion forum to contribute. And I got this, when I tried to submit a comment:</p>
<div id="attachment_654" class="wp-caption alignleft" style="width: 683px"><a href="http://blog.merjis.com/wp-content/uploads/2011/07/Reply-to_-Re_-Is-it-possible-to-make-ads-rotate-evenly-Microsoft-Advertising-Community.png"><img src="http://blog.merjis.com/wp-content/uploads/2011/07/Reply-to_-Re_-Is-it-possible-to-make-ads-rotate-evenly-Microsoft-Advertising-Community.png" alt="screenshot of adCenter forum showing a requirement to complete an invisible CAPTCHA form" title="Reply to_ Re_ Is it possible to make ads rotate evenly? - Microsoft Advertising Community" width="600" height="430" class="size-full wp-image-654" /></a><p class="wp-caption-text">Microsoft adCenter discussion forum requires MSIE, through non-display of CAPTCHA on non-MSIE browsers.</p></div>
<p>Microsoft is making use of their advertising platform pretty difficult. Firstly, I don&#8217;t like this whole &#8220;you must use MSIE to get the experience&#8221;. That&#8217;s so 1997. These days, there are browsers that offer very usable interfaces. Oh look, I&#8217;m using three of them now &#8211; Chrome, FireFox and Safari. It&#8217;s not as if Microsoft even said &#8220;MSIE9 is so far ahead that we&#8217;re making that the standard&#8221; (because that&#8217;d be a pretty stupid thing to say, anyway), because MSIE8 *can* be used &#8211; and that&#8217;s certainly no more advanced than any of the other major browsers.</p>
<p>Nope, Microsoft have made de-facto that to effectively use their web interface for their advertising system, you have to run their operating system and their browser. It&#8217;s so&#8230; retro non-chic. It&#8217;s a marketing approach out of some 1970&#8242;s vendor lockin agenda. </p>
<p>Anyway, the point of frustration was passed today, when I realised that not only do I have to use a Windows system to interact with the advertising platform, I also have to use a Windows system to interact with the community forum. It didn&#8217;t use to be that way. A year or so ago, I could post message. Now, though, I have to complete a CAPTCHA &#8211; which, regrettably, is not visible on Chrome, Safari or Firefox, and has no recommendation as to the plugin I need to adopt. Nope, if I use a non-Windows system with a non-MS browser, they just don&#8217;t care to hear from me. </p>
<p><a href="http://blog.merjis.com/wp-content/uploads/2011/07/Reply-to_-Re_-Is-it-possible-to-make-ads-rotate-evenly-Microsoft-Advertising-Community-1.png"><img src="http://blog.merjis.com/wp-content/uploads/2011/07/Reply-to_-Re_-Is-it-possible-to-make-ads-rotate-evenly-Microsoft-Advertising-Community-1.png" alt="" title="Reply to_ Re_ Is it possible to make ads rotate evenly? - Microsoft Advertising Community-1" width="600" height="558" class="alignleft size-full wp-image-653" /></a></p>
<p>With no CAPTCHA form visible, how do I submit the form with a CAPTCHA? That&#8217;s just a design and test fail.</p>
<p>And there&#8217;s another thing&#8230;</p>
<p>If I&#8217;ve just typed a comment to your blogging platform, couldn&#8217;t you had the courtesy to let me know that I needed to be running MSIE *before* I start writing? I mean you do *know* that your &#8220;community&#8221; now requires MSIE to be used before a comment can be added? In my case, that means usurping a Windows machine used for compatibility testing and adCenter operations, so I can contribute to a discussion. I don&#8217;t feel that generous to Microsoft. I have better things to do with my time than switch machines, in order to post a comment in a discussion thread. </p>
<p>But that&#8217;s not even the serious thing&#8230;</p>
<p>If I&#8217;ve just spent five minutes typing in a reply on the forum, Microsoft knows quite a bit about me. Potential details include stuff like the pacing of characters, the previous activity of the Microsoft authentication service by that ID, the IP address, possibly backtracking the IP address, and details of the packets to indicate the type of operating system I&#8217;m using, all those browser header details, and various other clues that I&#8217;m probably a human. So if you can&#8217;t tell from my logging in and all that typing, that I&#8217;m human &#8211; how the hell are you going to tell from a single click that my advert clicking is or is not fraudulent?</p>
<p>How about it Microsoft? Can you actually detect bad clicks at all, or do you just guess that some percentage are bad? The amount of spend we&#8217;ve done on behalf of clients, and the ROI, has never meant that it is worth checking the adCenter fraud detection system. I think I&#8217;m getting interested now. </p>
<p>And why can&#8217;t Microsoft make its discussion and feedback systems accessible to other browsers, like any other competent technical organisation? What makes Microsoft so technologically incapable that a *discussion* forum requires a technology limitation on the platform used for access? This is just so primitive. I am *so* glad that I decided my organisation would not be using Windows as a standard operating platform. This level of lockin is just tedious obstructivism. At least have the courtesy to tell me when I log in that &#8220;some features of this discussion, such as replying to comments, are not possible unless you use Microsoft Internet Explorer 8 or 9&#8243;? Then I won&#8217;t waste my time trying to engage. </p>
<p>Microsoft has not convinced me of the Windows Advantage. It has convinced me that they are technologically primitive, and that using Windows systems would be a technological backwater. That&#8217;s the complete reverse of what they probably thought they were doing when they made this design decision. The signal I get is &#8220;if you use Microsoft products, you&#8217;re locked in, unable to choose what is best for your business, and we don&#8217;t care at all&#8221;. That&#8217;s not a good message to people who aren&#8217;t using Windows now. It is not going to convince us that using WIndows is a good idea. Forcing users to do something is weaker for branding, than exciting them. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=652" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2011/07/07/msn-adcenter-user-community-vexation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Product Search, Ciao, Foundem and the right to make a buck in search results</title>
		<link>http://blog.merjis.com/2010/02/24/google-product-search-ciao-foundem-and-the-right-to-make-a-buck-in-search-results/</link>
		<comments>http://blog.merjis.com/2010/02/24/google-product-search-ciao-foundem-and-the-right-to-make-a-buck-in-search-results/#comments</comments>
		<pubDate>Wed, 24 Feb 2010 13:53:31 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[bing]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[microeconomics]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[trust]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/?p=328</guid>
		<description><![CDATA[The Telegraph gives some details about the cases being brought against Google in Europe, about product search results. Who&#8217;s complaining, and about what, makes for some interesting speculation. Let&#8217;s have a look at what Google Product Search is, and does, first&#8230; so we&#8217;ve got some context. Google Product Search It&#8217;s free. Anyone can sign up, [...]]]></description>
			<content:encoded><![CDATA[<p>The Telegraph gives some details about the <a href="http://www.telegraph.co.uk/technology/google/7301299/Google-under-investigation-for-alleged-breach-of-EU-competition-rules.html">cases being brought against Google in Europe</a>, about product search results. Who&#8217;s complaining, and about what, makes for some interesting speculation.</p>
<p>Let&#8217;s have a look at what Google Product Search is, and does, first&#8230; so we&#8217;ve got some context.</p>
<h2>Google Product Search</h2>
<p>It&#8217;s free. Anyone can sign up, and feed products. Well, almost anyone. Google has a restriction that there must be one supplier for a given product URL. So you can&#8217;t have lots of people all listing the same DVD on Amazon &#8211; Google doesn&#8217;t like spam, and multiple appearances of the same product from the same vendor at the same price, count as spam.</p>
<p>Within limits, then, Google Product Search is free and a wide range of businesses can use it. You just need to sign up for a <a href="www.google.com/merchants">Google Merchant Center</a> account, and publish a feed. A feed can be as simple as a properly formatted CSV file (exportable from a spreadsheet, once per month) or as complex as a machine-readable and real-time updated XML file synchronised with your product inventory and pricing. </p>
<p>Product Search results &#8211; well, they don&#8217;t appear in every search, and they don&#8217;t alway appear as number one in the search listings. That last item is especially important and tells us, the rest of the world, something about how Google is treating Google Product Search. </p>
<p>Why is it important that Google Product Search is not always number one for product related searches? Anyone involved with search engines knows that number of clicks to a listing is *very strongly* correlated with the position on the page. A listing in position 1 can easily get ten times, or more, as many clicks as exactly the same listing at the bottom of the page. So for Google to list Google Product Search at positions *other than* position 1, tells us that Google is giving up a lot of influence. And remember, Google isn&#8217;t making money (none directly, and only indirectly some revenue) from Product Search listings. </p>
<p>The implication of the varying position of Product Search results is that Google is not giving undue weight to their own product. The reverse implication &#8211; that Google is artificially reducing the ranking of competing product listing services &#8211; is not addressed by that observation. Just seeing that Google doesn&#8217;t inflate it&#8217;s own rank in all cases, is not enough to say that a different service has not been penalised. </p>
<h3>Are there any other vertical search engines in Google results?</h3>
<p>If Foundem has a case, it is that they are being excluded, artificially from results. You don&#8217;t just get listed highly in search results because you exist. Well, not unless the search is pretty rare. If you&#8217;re talking about the highly competitive spaces in which searches relate to things that people might buy, the competition is intense. To show there, you need a lot of &#8220;link love&#8221;. You need a lot of people to link to your site, with the right words, and you need the site to be well designed, and you need to have *credible* links. If you don&#8217;t have all of that, you don&#8217;t stand much chance. </p>
<p>So, let&#8217;s assume that Foundem, being fairly new, and not having made much of a media impact before today, doesn&#8217;t have a lot of link love. Can we support their argument by showing that Google doesn&#8217;t highly rank vertical search engines? </p>
<p>Annoyingly for Foundem, there are vertical search products that show. Businesses like Trovit, with their used car facet-based searches, letting you slip a slider along the price range that interests you, does show up. And they show up for place reviews. And other stuff&#8230; It is arguably a better user experience than Google Product Search for used cars. Trovit have page one appearances for some products. I can see their impact on clients&#8217; sites. </p>
<p><a href="http://googlepolicyeurope.blogspot.com/2010/02/committed-to-competing-fairly.html">Google *does* allow vertical search specialists to appear in search results</a>. That, for me, puts a dent in the argument that Google excludes vertical search specialists. If some do appear, and they collect clicks, in multiple categories of product, then Google is not completely removing them from listings. A dent, but not conclusive evidence. </p>
<h3>Do Ciao and Foundem have a case?</h3>
<p>I don&#8217;t think an appeal based on &#8220;we must be shown because we&#8217;re a competitor&#8221; has any real merit. I&#8217;m not a lawyer &#8211; so that&#8217;s not a commentary on whether the courts would find it to be true. But in terms of search engine results, if you aren&#8217;t sufficiently loved by users, to displace the other ten businesses that are shown, that&#8217;s not, IMO, a reason for a court to decide that you should be shown. Win the users&#8217; love&#8230; and you should be listed. </p>
<p>But, since the algorithms are trade secrets, and the data collected is known only to Google, for an outsider to claim that they have been penalised by Google, is pretty difficult to discern, or prove. </p>
<p>Unless, of course, you have the backing of another search engine that produces broadly similar results, in which your site appears more highly ranked and in which you have no evidence to believe that the results are influenced. In other words, if you were financed by Bing (the only other serious volume search engine left in the game), then you might have strong evidence that you don&#8217;t have spammy links, spammy comments and other factors that would cause de-weighting.</p>
<p>Ciao is owned by Bing. Might there be some evidence? Or is this just Bing (Microsoft) playing games with Google. FUD (Fear, Uncertainty and Doubt) have always been strong components of Microsoft marketing techniques. I certainly don&#8217;t have any confidence that Microsoft is doing this because they love EgalitÃ©, FraternitÃ©, LibertÃ© and the European Way Of Life.</p>
<p>Foundem &#8211; it&#8217;s a small, relatively new player, in a crowded market. I suspect their only problem is the classic one &#8211; not enough users love them to help their site to rank highly. But I haven&#8217;t investigated their site or backlinks. That&#8217;s just a first pass impression with no evidence. I&#8217;ll gleefully admit I could be wrong &#8211; though I&#8217;d be very surprised to be badly wrong.</p>
<p>But the possibility of Google reducing the rank of a disfavoured competitor, or slightly inflating the rank of a preferred information supplier, is something that gets me twitching. There&#8217;s no oversight for Google. We have only their word that they are using what they publicly claim to be fair. If, in the depth of a recession, California based Google were to be weighting US businesses slightly higher in the UK than they should be, it would direct between tens of millions and billions of spend to US companies and away from British companies. Same for France, and Germany. Is Google making sure that the US economy brings in a little more business than it *should* do? I have no idea. I can&#8217;t measure it. I can&#8217;t tell. It does worry me. I worked in the USA for about a decade, and I know how insular US businesses can be. The rest of the world has fewer investors, fewer buyers and works in funny languages and currencies; it is easy to fall into the model of thinking about what works best in the US economy and rolling that out, worldwide. </p>
<p>So there *may* be some basis for European businesses to wonder if Google is deweighting their business. It may be deweighted because it doesn&#8217;t fit a market model that Americans understand. It may be deweighted because an American business is, even unconsciously, working for the end of a recession in their domestic market. It may even be a conscious effort. That really, really worries me &#8211; because I don&#8217;t think even a national government outside the USA, could get the data to investigate whether Google is deliberately, or unintentionally, diverting economically significant activity, artificially, to the USA. </p>
<p>Do Ciao and Foundem have a case? I don&#8217;t know. I&#8217;m looking forward to finding out, if it is released, just what evidence might make them think they do. I hope it&#8217;s a lot more compelling than &#8220;we want our share of the Google billions&#8221;, &#8220;Microsoft is using me as a sock puppet to attack Google&#8221; or &#8220;Google&#8217;s Buzz is creating a weakness in perception we can exploit&#8221;. I fear there may be some interesting biases in Google&#8217;s system that do favour US businesses &#8211; simply because I can&#8217;t see any way to prevent there from being such a bias, or to detect one if it were present. And *that* has implications for national security &#8211; the protection of the realm. Interesting times.</p>
<h2>Updates</h2>
<p>2010-02-27 &#8211; <a href="http://searchengineland.com/admitting-role-in-google-anti-trust-complaints-microsoft-complains-of-google-lock-in-37009">SearchEngineLand article about Microsoft&#8217;s involvement</a>. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=328" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2010/02/24/google-product-search-ciao-foundem-and-the-right-to-make-a-buck-in-search-results/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>SEOMoz article on IIS</title>
		<link>http://blog.merjis.com/2008/09/26/seomoz-article-article-on-iis/</link>
		<comments>http://blog.merjis.com/2008/09/26/seomoz-article-article-on-iis/#comments</comments>
		<pubDate>Thu, 25 Sep 2008 23:03:53 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[microsoft]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/2008/09/26/seomoz-article-article-on-iis/</guid>
		<description><![CDATA[I blogged about some of the issues involving IIS and case folding over on SEOmoz. Some interesting suggestions. Best comment so far from &#8220;G-Force&#8221;, to embed some ASP code in each page that checks the file name, and 301&#8242;s to the &#8220;right&#8221; case variant. Heavy work &#8211; individual page edit for each unique page. Unless, [...]]]></description>
			<content:encoded><![CDATA[<p>I blogged about some of the issues involving <a href="http://www.seomoz.org/ugc/iis-case-folding-robots-and-results">IIS and case folding over on SEOmoz</a>. Some interesting suggestions.</p>
<p>Best comment so far from &#8220;G-Force&#8221;, to embed some ASP code in each page that checks the file name, and 301&#8242;s to the &#8220;right&#8221; case variant. Heavy work &#8211; individual page edit for each unique page. Unless, hmm, one uses one of the Unix tool kits. I could see using Perl to auto edit each file in a directory hierarchy.</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=224" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2008/09/26/seomoz-article-article-on-iis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SEO, IIS case folding filenames, Spiders, Analytics, and Robots.Txt</title>
		<link>http://blog.merjis.com/2008/08/20/seo-iis-case-folding-filenames-spiders-analytics-and-robotstxt/</link>
		<comments>http://blog.merjis.com/2008/08/20/seo-iis-case-folding-filenames-spiders-analytics-and-robotstxt/#comments</comments>
		<pubDate>Wed, 20 Aug 2008 15:32:41 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[microsoft]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spamfighting]]></category>
		<category><![CDATA[spiders]]></category>
		<category><![CDATA[web analytics]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/2008/08/20/seo-iis-case-folding-filenames-spiders-analytics-and-robotstxt/</guid>
		<description><![CDATA[AFAICS, the best way to administer IIS for SEO purposes, seems to be to run screaming from the room and hide under a desk until you are allowed to use Apache. So many of the default behaviours create difficulties for users or SEO. Yes, I&#8217;ve been continuing to dig into web analytics and IIS web [...]]]></description>
			<content:encoded><![CDATA[<p>AFAICS, the best way to administer IIS for SEO purposes, seems to be to run screaming from the room and hide under a desk until you are allowed to use Apache. So many of the default behaviours create difficulties for users or SEO. Yes, I&#8217;ve been continuing to dig into web analytics and IIS web server log files for a couple of clients.  I&#8217;ve now seen this problem, again &#8211; first noticed many, many years ago  (1999 or so, I think) for another company and <b>still</b> not solved by default:</p>
<h3>What is the authoritative name of the web page?</h3>
<p>Assume that your web site uses IIS and has a home page available as &#8220;/index.htm&#8221; You can then use the following as silent synonyms of the home page:</p>
<ul>
<li>/index.htm</li>
<li>/Index.htm</li>
<li>/INDEX.HTM</li>
<li>/InDeX.hTm</li>
</ul>
<h3>Why is finding the same file under multiple names a problem?</h3>
<p>Whether you use a web browser, or a search engine bot crawls your web site, each one of those URLs appears to be a different file, albeit with identical content. If you have optimised the content, then each is a candidate to   be the answer for search users. So the spiders do need to track each file name variant, and check them, to see what has changed. </p>
<p>In each case you get an immediate web server response of &#8220;200&#8243; &#8211; file found. Link love can be spent on a wide variety of paths that lead to the same place &#8211; but the spiders aren&#8217;t told that. There is another way to do this, which does not work, out of the box, on Linux and Apache, but is fairly easy to set up.</p>
<h3>Web Brand Is All About Experience</h3>
<p>If the file system does not fold case &#8211; that is, it treats upper and lower case letters as two distinct things &#8211; then a request for a file with mismatched case delivers a 404 &#8211; File Not Found. Now that&#8217;s a bad user interface experience. Brand is all about experience, so why punish your users with a 404 because they can&#8217;t remember what the capitalisation of your ProDuct (sic) is? </p>
<p>You need to find a way to deliver both the web page that the user wanted to get to, and also let the spiders know that there is one authoritative page &#8211; there may be a lot of different links to get there, but just one resource. </p>
<p>On Merjis.com we use a technique that helps the spider understand that we have one page, and that case changes are a link problem, not a server duplication. If you try to reach:</p>
<ul>
<li><a href="http://merjis.com/contact">http://merjis.com/contact</a></li>
<li><a href="http://merjis.com/Contact">http://merjis.com/Contact</a></li>
<li><a href="http://merjis.com/CoNtAcT">http://merjis.com/CoNtAcT</a></li>
<li><a href="http://merjis.com/CONTACT">http://merjis.com/CONTACT</a></li>
</ul>
<p>then you should get to the same page, the Merjis contact information page. In terms of the user experience, this is just the same user experience as on IIS. But we issue a redirect on all the non-standard forms of the page name and IIS doesn&#8217;t. Spiders can see that only one page for &#8220;contact&#8221; exists on the Merjis site, even if it is accessible through many different URLs. This cuts down redundant crawls, focuses link love on a single page, doesn&#8217;t lose references from typographically challenged links, gets users to the page they want whatever the case of the URL they type, and is generally A Good Thing.</p>
<h3>How To See A Redirect</h3>
<p>If you don&#8217;t use a tool like &#8220;wget&#8221; or one of the Firefox HTTP inspection tools, your only real clue to our redirection is that whatever you did type in the URL bar, is replaced by our chosen URL for the resource. Between your input and our response, we added a redirect. Spiders will see the redirection, and only index one page and can pour all the link love on that page.</p>
<p>That&#8217;s completely unlike IIS standard behaviour. The default behaviour is to fold uppercase and lowercase. That means you see the URL that you typed. There&#8217;s no information that the file is a single file known by many names. </p>
<p>Spiders can&#8217;t guess that a single file is a single file &#8211; they only know what they are told. They get told what links exist in sitemaps and by other link references across the web. If a spidered site has references to mixed case versions of names, then the spiders will tromp madly off to each alternative case version of exactly the same file.</p>
<h3>SEO Means Never Saying Sorry To Stupid Spiders</h3>
<p>I&#8217;m of the opinion that helping the spiders to find the right information, helps SEO. Sending spiders to a dozen spelling variations of a path, doesn&#8217;t boost rank, unless the spiders are clever. Even without that, sending spiders to crawl redundant pages, when they could more frequently crawl real content, is a waste of the attention from search engines. If spiders were clever, they wouldn&#8217;t crawl redundant paths to the same content, repeatedly, across the whole server&#8230; So give them a hand to get to the right single file that should be taking all the page rank. </p>
<p>This default case folding behaviour means that IIS again contributes to spamming search engine indexes. Not so bad, except that it causes yet another problem.</p>
<h3>My Web Analytics Don&#8217;t Fold Case</h3>
<p>When you are trying to analyse what is happening to users, just one miskeyed filename can result in the analytics giving you multiple paths to, for example, conversion. Typically the JavaScript page bug reports the filename that was accessed &#8211; using the same capitalisation that the web server delivered. Why? Because on a large fraction of the other web servers, case does matter and &#8220;/index&#8221; is a different file from &#8220;/INDEX&#8221;.</p>
<p>So on an IIS delivered file, the same page can be known by a wide range of names that mean the same thing. But analytics packages don&#8217;t (usually) fold case &#8211; so each reference to a different capitalisation adds another meaningless node to journey analysis. </p>
<p>[ N.B. See Chris's comment below. I should quantify the assertion that Analytics don't fold case - of course, if any of them *do*, that's another problem... ]</p>
<p><ins datetime="2008-08-21T23:50:46+00:00">The following deleted section is a bit rubbish. I failed to properly read and understand the robots.txt spec. I interpreted a line that meant all records in the file, to just mean the User-Agent line. robots.txt allows case folding &#8211; /MEMBERS and /members are identical according to the later spec; the earlier spec only clearly states that the User-Agent field ignores case &#8211; leaving the possibility of ignoring case or respecting it in a Disallow line. </p>
<p>However &#8211; I started this article because I found a clients private area on IIS had been crawled and indexed, despite being listed in robots.txt. I still need to describe that investigation &#8211; but this article is long enough.<br />
</ins></p>
<p><del datetime="2008-08-21T23:49:03+00:00">And that&#8217;s not all. Oh no&#8230;</p>
<h3>Google Spiders Content Disallowed In Robots.txt</h3>
<p>If you have parts of the site that you don&#8217;t want to be in the index, you can use <a href="http://www.robotstxt.org/robotstxt.html">robots.txt</a> to exclude those directories or applications.</p>
<p>Except&#8230; you can&#8217;t. Not sensibly, not with IIS out of the box.</p>
<p>Say that you have a directory called &#8220;/members&#8221; and you want only signed in members to see the content. You exclude the spiders with:</p>
<p><code>Disallow: /members</code></p>
<p>However&#8230; case folding&#8230; This directory is also accessible as &#8220;/MEMBERS&#8221; and that <a href="http://www.webmasterworld.com/google/3357250.htm">isn&#8217;t excluded</a> in this customers&#8217; Robots.txt. So your hidden content is now visible if just one link, somewhere on the internet, or even in your own site, uses a <a href="http://www.webmasterworld.com/forum93/75.htm">different capitalisation</a> from that which has been put in the Robots.txt file.</p>
<p>Is this Google and Yahoo!&#8217;s problem to resolve? IMO, not really. If you choose to use a server that makes the same content available under a range of paths, it is up to you to protect those paths, not for spider developers to guess that you may have shot yourself in the foot.</p>
<p>OTOH, the SEs do themselves no favours for reducing spam in the indexes, and decreasing crawl volumes and bandwidth usage, by failing to recognise that IIS can serve the same page under a multiplicity of case variations. That&#8217;s a different problem &#8211; but solving one would solve the other. If the Server can be detected as using case folding then using a case independent match of Robots.txt paths would be a useful extension. </p>
<p>I can even imagine adding a new directive to Robots.txt to express that pathnames do not respect case.</p>
<p></del></p>
<h3>Other Case Folding Systems</h3>
<p>Well, Apple OS X. It may be my favoured desktop OS, but it has a default FS that is caseless:</p>
<p><code><br />
$ echo boo > goose<br />
$ cat GOOSE<br />
boo<br />
$<br />
</code></p>
<p>I haven&#8217;t tested &#8211; I have no SEO clients with Mac servers &#8211; but I suspect that Mac servers with a default FS run the same problem of a futile and useless Robots.txt protection.</p>
<p>IIRC, OS/2 aka &#8220;Warp&#8221; was used for some years as a web server and it used a case folding FS &#8211; so if you run one of those ancestral systems, watch out.</p>
<h3>Webmasters</h3>
<p>Your defence? Well, make IIS respect case in queries and do a proper redirect to the actual file. </p>
<p><del datetime="2008-08-21T23:54:22+00:00">That way, spiders could properly use Robots.txt and your hidden content wouldn&#8217;t be accessible. Try asking Microsoft about that configuration. Heh. Here&#8217;s <a href="http://support.microsoft.com/kb/217103">Microsoft&#8217;s page about creating a Robots.txt</a> &#8211; note the discussion about case folding? Oh, there *is* no mention of case folding? Hmm. Well, I think that&#8217;s a lesson in its own right. </p>
<p>Failing any rational advice from Microsoft, you could do a combinatorial madness on Robots.txt:</p>
<p><code><br />
Disallow: /members<br />
Disallow: /Members<br />
Disallow: /mEmbers<br />
...<br />
Disallow: /MEmbers<br />
...<br />
Disallow: /MEMBERS<br />
</code><br />
and so on. It&#8217;s easy. Yeah. Right.</p>
<p>Have I mentioned that I think IIS adds problems, rather than removing them?<br />
</del></p>
<p>It is all so much more complex if you have cookieless mode enabled for your ASP applications. Because the path given to the robots doesn&#8217;t match the path that is denied. Deny &#8220;/secret&#8221; and you get a path that starts &#8220;/(&#8221; and goes on to &#8220;))/secret&#8221;. Combine that with case folding and there is no end to index spamming. </p>
<p>And, of course, Robots are the main thing that need to read and respect Robots.txt. </p>
<p>Given a modicum of sense, this whole area could be made a lot simpler for system admins and webmasters. If an IP address and user agent doesn&#8217;t accept cookies, and asks for &#8220;robots.txt&#8221;, it is probably a robot. Stop sending cookieless tracking paths to that IP and UA.</p>
<h3>Sticking In a Reverse Caching Proxy</h3>
<p>If you are a technological sophisticate, then you could insert Apache with a rewriter and mod_speling, to get the benefits of case matching and redirection to the single real file instance. You&#8217;ll possibly see a slight average speed up for users, as unchanged content is delivereddirect from the Apache cache. </p>
<p>How to set up and configure one of these cute web servers is beyond the scope of this article, though. </p>
<h3>Scale Of Problems</h3>
<p>It&#8217;s quite nasty for people using hosted IIS, who have no significant control. I have no doubt that the SEs do duplicate detection. Matt Cutts has written that in-site duplication isn&#8217;t too awful a penalty &#8211; probably because they keep spidering IIS sites. OTOH, places like WebMasterWorld have a fair number of webmaster stories about having two or more case variations of the same page in the search engine indexes, with radically different page rank &#8211; and that depending upon what has been spidered recently, the position of the site will change radically.  </p>
<p>There is a difference between ignoring duplicates and sending full credit for an inbound link to the &#8220;master&#8221; version of the duplicates. Some folk seem to think that there&#8217;s no great loss. I&#8217;m pretty conservative about this &#8211; why risk losing the benefits of inbound links, just because of case folding? </p>
<p>Case folding for private content is a problem. I&#8217;ve seen many complaints about Google revealing information concealed by Robots.txt, and I have strong suspicions that in most of these cases, the complainants were running IIS and had <del datetime="2008-08-21T23:54:22+00:00">an undetectably case-mismatched link reference somewhere on the site, or had</del> enabled Cookieless mode for ASP. </p>
<p>I can&#8217;t find any authoritative discussion about this issue, especially in Microsofts&#8217; Knowledge Base, hence the posting here. Of course it may be that I detest MS so much that I haven&#8217;t spent enough time poking around their resources. That&#8217;d be a quite valid criticism of me and this article. :)</p>
<h3>Other Solutions For Privacy</h3>
<p>Given that Robots.txt is effectively useless faced with both Cookieless mode and case folding, you really, really need to get to grips with your friendly <a href="http://searchengineland.com/070305-204850.php">page-specific metatags &#8220;NOINDEX&#8221;, &#8220;NOFOLLOW&#8221; and NOARCHIVE&#8221;</a>.</p>
<p>Any private content should at least have &#8220;NOINDEX&#8221;.</p>
<p>Arguably, terminal pages (action pages, etc) and private content, get &#8220;NOFOLLOW&#8221; directives.</p>
<h3>Summary</h3>
<p>IIS, by default, apparently opens up web sites to some <a href="http://blog.merjis.com/2007/09/17/google-bowling-and-identity/">Google Bowling</a> activities. You can even bowl yourself out, if you use case variations in your own URLs.</p>
<p>Carefully review what you do have; probably the simplest suggestion is that you stick to all lower case in every link. Watch out for inbound links that use mixed case &#8211; those can scupper your rank.</p>
<p>Use the meta tag for robots to set NOINDEX and possibly NOFOLLOW on your private content on case-folding IIS systems to prevent your data leaking into the search engines.</p>
<p>Best bet &#8211; dump IIS. OK. That&#8217;s a biased view, and a very selective view. But if you want to rank highly in search engines, setting up and running a LAMP (Linux Apache MySQL PHP) or similar system is easy and cheap. While it does have comparable problems (unbranded 404&#8242;s etc) the solutions are also cheap and easy to set up. If I&#8217;m really blunt, I wouldn&#8217;t actually use PHP, either &#8211; the script language design makes it too easy to embed SQL, IME. I&#8217;d steer for Python, Ruby On Rails or OCaml&#8230; probably ;)</p>
<p>It&#8217;s better than finding your private data leaked, or that you&#8217;ve been scuppered by a near unfindable typo.</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=211" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2008/08/20/seo-iis-case-folding-filenames-spiders-analytics-and-robotstxt/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>IIS Cookieless Generates Spider Crawling Problems</title>
		<link>http://blog.merjis.com/2008/08/18/iis-cookieless-generates-spider-crawling-problems/</link>
		<comments>http://blog.merjis.com/2008/08/18/iis-cookieless-generates-spider-crawling-problems/#comments</comments>
		<pubDate>Mon, 18 Aug 2008 08:13:16 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spamfighting]]></category>
		<category><![CDATA[spiders]]></category>
		<category><![CDATA[web analytics]]></category>
		<category><![CDATA[yahoo!]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/2008/08/18/iis-cookieless-generates-spider-crawling-problems/</guid>
		<description><![CDATA[Another case of Web Server Log File Analysis on IIS being disturbed by bots, having the potential for SEO naughtiness and spamming the search engines. The problem is created by IIS&#8217;s cookieless model. The idea appears to be to present a unique string in the path so you can track sessions without needing a cookie. [...]]]></description>
			<content:encoded><![CDATA[<p>Another case of Web Server Log File Analysis on IIS being disturbed by bots, having the potential for SEO naughtiness and spamming the search engines. The problem is created by IIS&#8217;s cookieless model. The idea appears to be to present a unique string in the path so you can <a href="http://weblogs.asp.net/paulomorgado/archive/2008/08/01/iis-asp-net-cookieless-support-not-working-as-expected.aspx">track sessions without needing a cookie</a>. </p>
<p>Clients that use IIS do seem to suffer from the strangest problems. I kept finding indexed pages, some ranking absurdly highly, with an path infix like (X(kjhgkjfkuyfku)). In other words, if I had a page like &#8220;http://merjis.com/login&#8221;, I&#8217;d find exactly the same content as &#8220;http://merjis.com/(X(stuff))/login, making it look as if I had lots of duplicate pages. Appended parameters as a session ID, I can understand. There&#8217;s even ways to cope with those. </p>
<p>The format of the infix was quite rigid &#8211; 23 characters starting and ending with a parenthesis, and embedding at least one more parenthetical group. I think the regex &#8220;\/\([A-Z]\([A-Za-z0-9]*\)\)\/&#8221; will match every example that I&#8217;ve seen. Oddly, you could navigate to them using a web browser and they worked, but even spidering the site and grepping the resulting mirror failed to show these strange paths in the HTML&#8230; so how did they get there?</p>
<h3>Cookieless Session Tracking</h3>
<p>This looks like an effort by Microsoft to allow tracking of people that don&#8217;t want to be tracked, or who might have, for example, an office based transparent proxy that blocks cookies. AFAICS, when IIS detects that a user doesn&#8217;t permit cookies, it starts sending unique paths. The use of the &#8220;referer_info&#8221; (sic) field allows tracking a single user across the site, looking at where they were and where they&#8217;ve gone to. The positive benefit is that users can be given access to stateful services (like logging in) without needing a cookie. It is, I think, effectively a session cookie, rather than a permanent cookie, as it can&#8217;t recognise you from previous sessions &#8211; your starting point will be the same as any other visitor. </p>
<p>This session tracking seems pretty barking to me. If a user prevents you from serving and saving a session cookie, and you then create a trackable session by using session keys in the URL, you have just violated the privacy that the user requested. If users try to use a use a resource that needs stateful dialogue &#8211; something that remembers whether you were logged in, for example &#8211; then I think that if the user disables cookies it is perfectly reasonable to tell them that this resource won&#8217;t work for any areas requiring an authenticated session, though the public areas may be freely roamed. If they want privacy in the session, then they can&#8217;t have access to anything that requires remembering from page view to page view, who they are. </p>
<h3>Cookies and Search Engine Spiders</h3>
<p>Robots don&#8217;t hold cookies. If faced with an IIS server configured to handle cookieless users, spiders end up being delivered with a false directory structure. Every new visit generates a new directory structure. You get excessive crawling and pages appear in the index as unique, when they are really duplicates caused by server behaviour. </p>
<p>Now, this too seems like a petty piece of madness. Given that some users want to maintain privacy to the extent that they do not even want session cookies, and this number is small on web servers that offer services involving a log in or other identification service, why would you make it more difficult for search engines to spider the site? I believe that you can get more users to a properly indexed site, than the number you&#8217;d lose from failing to handle uncookied users (unless you offer a specific service for the uncookied, of course). </p>
<h3>Robots.txt to the rescue</h3>
<p>Fortunately the workround for this problem seems small. In Robots.txt, add a line:</p>
<p><code>Disallow: *(<br />
</code></p>
<p>For all the spiders that I care about, that seems to prevent crawling the special tracking URL. The consequences of that&#8230; well, I&#8217;m not convinced it is entirely good. But it does stop silly URLs from being indexed after a single crawl.</p>
<p>Technically, this line says &#8220;disallow spidering for a URL starting with anything that has a &#8216;(&#8216; in it somewhere&#8221;. Although this client seems mostly to suffer from the infix immediately after the domain name, reading around the web suggests that the infix could be put at any directory level slash. Otherwise &#8220;Disallow: /(&#8221; would work and avoids the failure possibilities of the wildcard. </p>
<h3>Spidering Improvements?</h3>
<p>I can&#8217;t see why spiders should behave like this.</p>
<p>Having grabbed a server identity from the headers and behaviour, it should be possible to then strip out the session tracking from the path. I can&#8217;t currently think of a reason to *not* do this &#8211; unless you were really trying to sneakily discourage IIS administrators from using this tracking method. </p>
<p>I can think of anti-competitive ways to use the cookieless IIS behaviour. For example, find the user sitemap, point to it and let spiders follow the unique links to every path &#8211; then you can explode the crawl on each site by doubling it. And it accumulates &#8211; because every session appears to be valid for a long time, allowing repeat crawls and generating new unique paths with every reference. </p>
<p>You&#8217;d have thought that this explosion was worth trapping and stopping in the &#8220;fast crawl&#8221; discovery bots, so that the slower inspection spiders wouldn&#8217;t add these redundant pages to the indexes. The presence of these idiot links in link reports, and the multiple crawling of them by various bots, suggests that spiders are still stupid. </p>
<h3>Webmasters?</h3>
<p>Well, if you can&#8217;t avoid running IIS, take a good hard look at your web servers log files. Do you really have a useful volume of search from real users who are cookieless or have you just ramped up bandwidth so that bots can crawl redundant pages without adding any revenue? Is the majority usage of this feature an escalating collection of spiders? Might you get more users if link love wasn&#8217;t being directed to duplicate pages on redundant paths?</p>
<p>Put in the crawl reduction &#8220;Disallow: *(&#8221; and possibly a &#8220;Disallow: /(&#8221; lines to your Robots.txt and verify them with the Search Engine webmaster tools Robots.txt checkers. At least you&#8217;ll be focusing crawl on pages that should be indexed. </p>
<p>Ideally, turn off the cookieless mode. AFAICS, it is a breach of the privacy rights that users were trying to assert. If you offer a service that needs state, then when you detect that you can&#8217;t cookie, offer an apology that parts of the site are unusable. IMO, that&#8217;s perfectly acceptable for both users and designers and won&#8217;t lose any business that you couldn&#8217;t have gained. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=210" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2008/08/18/iis-cookieless-generates-spider-crawling-problems/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Microsoft and Yahoo &#8211; Read Him, Not Me.</title>
		<link>http://blog.merjis.com/2008/02/08/microsoft-and-yahoo-read-him-not-me/</link>
		<comments>http://blog.merjis.com/2008/02/08/microsoft-and-yahoo-read-him-not-me/#comments</comments>
		<pubDate>Fri, 08 Feb 2008 12:57:59 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[microsoft]]></category>
		<category><![CDATA[MSN]]></category>
		<category><![CDATA[yahoo!]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/2008/02/08/microsoft-and-yahoo-read-him-not-me/</guid>
		<description><![CDATA[I&#8217;ve just been pointed to a masterful article about the proposed Microsoft acquisition of Yahoo. You know, I thought I&#8217;d written in below, about how a service company (Yahoo) and a product company (MS) would have integration problems &#8211; an issue I&#8217;ve seen before in startups where competent managers in one type of operation failed [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve just been pointed to a <a href="http://www.roughlydrafted.com/2008/02/07/why-does-microsoft-really-want-yahoo/">masterful article about the proposed Microsoft acquisition of Yahoo</a>.</p>
<p>You know, I thought I&#8217;d written in <a href="http://blog.merjis.com/2008/02/01/microsoft-yahoo-opportunity/">below</a>, about how a service company (Yahoo) and a product company (MS) would have integration problems &#8211; an issue I&#8217;ve seen before in startups where competent managers in one type of operation failed to build a company in the other sphere. This article spells that out and supplies other great detail. </p>
<p>I&#8217;ll have to re-read the Roughly Drafted article and see if I can pick some holes or offer some amplifications, but the general thrust is compelling. Right now, I still do see an opportunity for advertisers. But for the companies involved, a mutual cooperation agreement for an approach to advertisers looks healthier than an acquisition. For search agencies, the fewer the number of outlets, the lower the costs of managing them all and the larger the addressed audience&#8230; The cost savings of managing the combined share of market through only one interface, MSN or Yahoo, could improve the attractiveness, especially for smaller accounts in Europe. As the Rough Draft article spells out, that&#8217;s probably not enough to make the business case. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=142" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2008/02/08/microsoft-and-yahoo-read-him-not-me/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microsoft + Yahoo = Opportunity</title>
		<link>http://blog.merjis.com/2008/02/01/microsoft-yahoo-opportunity/</link>
		<comments>http://blog.merjis.com/2008/02/01/microsoft-yahoo-opportunity/#comments</comments>
		<pubDate>Fri, 01 Feb 2008 20:57:45 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[content match]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[MSN]]></category>
		<category><![CDATA[yahoo!]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/2008/02/01/microsoft-yahoo-opportunity/</guid>
		<description><![CDATA[Microsoft and Yahoo will doubtless want to capitalise on their current successes. Yahoo&#8217;s increasing Display Ad business is interesting. If they can use stuff like the targeting piloted by Microsoft AdCenter, and improve the control and measurement systems, they can take away Google&#8217;s toys&#8230;. and advertisers. In the UK, Google has over 80% of search [...]]]></description>
			<content:encoded><![CDATA[<p>Microsoft and Yahoo will doubtless want to capitalise on their current successes. Yahoo&#8217;s <a href="http://blog.merjis.com/2008/01/28/google-doing-less-evil/">increasing Display Ad</a> business is interesting. If they can use stuff like the targeting piloted by Microsoft AdCenter, and improve the control and measurement systems, they can take away Google&#8217;s toys&#8230;. and advertisers. </p>
<p>In the UK, Google has over 80% of search share. But, as I&#8217;ve said before, the Content Network is a problem and it is often ignored by larger advertisers. Even worse, for Google, non-textual (display ads) are a small fraction of the Content Network. Google&#8217;s recent advice to advertisers will improve performance, helping increase advertising, but at the expense of higher costs to advertisers through building additional AdGroups with specialist techniques that may be uneconomic if recognised in accounting as a cost of sale &#8211; coupled with a lower click CTR and conversion rate, it&#8217;s usually a lethal combination for a small advertiser.</p>
<p>One of my biggest problems with advertisers and the Content Network, has been proving that the Content Network makes a difference. I have a feeling that Display Advert impressions make a difference to awareness. This may translate later into an increased CTR for paid search, or organic adverts, or affiliates, or even direct access to the site &#8211; but it is an invisible story unrevealed by attributable and measurable lead generation. This is a major problem for Google &#8211; they don&#8217;t like passing on information about users and have shown no indication of any willingness to pass on information about impressions (post impression tracking). Statistical treatments to prove that impression volumes have an effect, also need to take into account which sites are being used, and need large volumes of data to demonstrate statistical correlations. And it all gets much more complex when you consider latency (the time delay between beginning to think about buying something to actually getting around to it &#8211; can be measured in months or even years in parts of the travel industry). </p>
<p>Microsoft and Yahoo could steal advertiser hearts and establish a new standard in internet advertising by offering cookie tracking on impressions for display advertising. If I can show my clients that Display Adverts result in increased awareness, I can sell more advertising. I don&#8217;t think that MS and Yahoo would have the qualms that Google would have about this. As usual, I don&#8217;t care about identifying individuals &#8211; the goal of post impression tracking is to look for statistical correlations between adverts and response and offer ways to tune the delivery to users that are more likely to want to see the advert (win-win for user and advertiser). If I can deliver post impression tracking results and analysis, whew, I&#8217;ll be a much happier bunny. </p>
<p>The increased targeting options for Microsoft also play to segmentation concerns. I have huge doubts that Google&#8217;s content matching alone is enough to provide adequate segmentation. I&#8217;ve seen too much of &#8220;(named software) download&#8221; being matched to &#8220;(ripped music) download&#8221; and &#8220;(ripped video programme) download&#8221;, or other equivalent mismatching of content. Google&#8217;s pretty much unexplained &#8220;search behaviour&#8221; targeting, where they claim to improve performance through optimising based on recent searches&#8230; well, if I don&#8217;t understand it and can&#8217;t control it, I don&#8217;t trust it. I certainly don&#8217;t trust Google to do more than optimise their own revenue. </p>
<p>Microsofts targeting models also leave me a little concerned &#8211; but they are a step on the right path and frankly, without real information from Google about search history and targeting, I&#8217;d rather have the controls that MS offers, than the lack of control offered by Google. I can explain to clients what my errors are &#8211; but if Google is changing the game behind my back, I can&#8217;t justify changing response to clients. So a Microsoft model is much more interesting. </p>
<p>Much as I loathe Microsoft as a company (I&#8217;ve detested using Windows, for years), I do see real opportunities for Microsoft to work with Yahoo to improve targeting and measurement, and to steal advertising share from Google. Whether the world is better served with a Google near-monopoly of search advertising and a Microsoft near-monopoly of display advertising is a different question. But in the short term at least, this may require that Google treats advertisers with a little more respect, and that can&#8217;t be a bad thing from where I sit. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=137" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2008/02/01/microsoft-yahoo-opportunity/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Windows drives me nuts</title>
		<link>http://blog.merjis.com/2007/05/09/windows-drives-me-nuts/</link>
		<comments>http://blog.merjis.com/2007/05/09/windows-drives-me-nuts/#comments</comments>
		<pubDate>Wed, 09 May 2007 14:36:43 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[microsoft]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/2007/05/09/windows-drives-me-nuts/</guid>
		<description><![CDATA[I had a high priority task to get done, and took over an hour to log in. I have an active AntiVirus package for my Windows system. It detected nothing recently. Was I a rare victim of a Day Zero attack? Particularly vexatious is that the screen saver keeps running. As you are trying to [...]]]></description>
			<content:encoded><![CDATA[<p>I had a high priority task to get done, and took over an hour to log in. I have an active AntiVirus package for my Windows system. It detected nothing recently. Was I a rare victim of a Day Zero attack? </p>
<p>Particularly vexatious is that the screen saver keeps running. As you are trying to login, you can&#8217;t relax, secure in the knowledge that the next screen you see will be the desktop. You have to keep vigilant, and move the mouse, just a bit, every few minutes. It gets to be about as exciting as watching paint dry. Someone really needs to develop &#8220;Paint TV&#8221; that shows a series of images as it dries&#8230; Anyway, it spoils attention on any other serious projects, while you wait to login and evade the screen saver. </p>
<p>I wasted some time &#8211; which I won&#8217;t count, as it was my stupidity and ignorance &#8211; trying to run the AntiVirus package on all files on the system. After some time, it was estimating completion in between 6 hours and 205 days (or some obscenely large time). I killed the process, eventually, and started investigating the idea of reformatting the disk and re-installing. In doing so, I noticed an article that set me thinking&#8230;</p>
<p>After three hours, the Windows Task Manager finally came up, and showed &#8220;Spoolsv.exe&#8221; consuming 99% of the CPU, doing nothing visibly useful. OK, so this is a 2GHz laptop, 2GB RAM. Should be able to handle the kind of stuff I do (using MSIE7, usually, and some web analytics packages whose vendors are so perverted that the packages only work on MSIE). Four hours before I can even terminate the stupid program that is stopping me from working? Pathetic. </p>
<p>Searching for &#8220;spoolsv.exe&#8221; showed some promising links. The highest ranked result that was obviously about troubleshooting this problem was kinda OK. But the best answer was a lower ranked link. Here&#8217;s what to do if your system goes dog-slow as soon as it boots and you have a process called &#8220;<a href="http://www.computerhope.com/issues/ch000914.htm">spoolsv.exe</a>&#8221; hogging the CPU.  I&#8217;m putting in my link vote for this resource &#8211; easy to follow, and it has worked. </p>
<p>What caused the problem? No idea. Can&#8217;t find anyone describing how something as stupid as this survives in an operating system that is around twenty years old at this point. It&#8217;s seems like bad engineering and inattention to user difficulties. Which is why I loathe Windows. I detest badly designed stuff. I&#8217;m biased that way :) I use a Mac for a desk or laptop, or Linux as a server, when I can. </p>
<p>Having got the system back, I re-ran the AntiVirus and &#8220;Malicious Software Removal Tools&#8221; packages. No problems detected. Except that now the system won&#8217;t properly connect to WiFi. It keeps saying that it isn&#8217;t connected. Right clicking on the WiFi icon in the System Tray (lower right), lets me see that it has detected one wireless network (the right one) and that it is set to &#8220;Automatic Connection&#8221;. Clicking on the network is supposed to join it&#8230; and it ends up flashing a window that says &#8220;Connected!&#8221;. Then the &#8220;Connect&#8221; button changes to &#8220;Disconnect&#8221;, while the network connection status changes to&#8230;. &#8220;Not Connected&#8221;. </p>
<p>The Windows system will no longer connect to the WiFi that it used to use. I haven&#8217;t changed the WiFi settings and and all other equipment (a collection of Apple kit, a mobile phone using Microsoft&#8217;s PocketPC with WiFi, etc) is using the WiFi properly. Something has damaged or changed the machine&#8217;s WiFi, in some unknown way. Can&#8217;t see anything in system logs. More time spent trying to debug. Eventually I give up and run an ethernet cable from a Airport Extreme. More hours wasted trying to find out out why a subsystem that I have not consciously touched, no longer works, and creating a workround. </p>
<p>Windows just drives me nuts.</p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=104" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2007/05/09/windows-drives-me-nuts/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Microsoft AdCenter, CPC and Revenue</title>
		<link>http://blog.merjis.com/2007/04/05/microsoft-adcenter-cpc-and-revenue/</link>
		<comments>http://blog.merjis.com/2007/04/05/microsoft-adcenter-cpc-and-revenue/#comments</comments>
		<pubDate>Thu, 05 Apr 2007 08:38:42 +0000</pubDate>
		<dc:creator>Jeremy Chatfield</dc:creator>
				<category><![CDATA[content match]]></category>
		<category><![CDATA[microsoft]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/2007/04/05/microsoft-adcenter-cpc-and-revenue/</guid>
		<description><![CDATA[When you start your own advertising platform, what expectation of revenue should you have? What does this imply for possible strategies for Redmond? Can throwing cash at the problem make it go away? ]]></description>
			<content:encoded><![CDATA[<p>Some time ago John K posted, partially, about <a href="http://gotads.blogspot.com/2007/01/woeful-q4-for-microsoft.html">Microsoft AdCenter Revenues</a>. I think that the search revenues should be lower, after passing up on Overture. I think it is basic economics. What implications might this have for use of contextual advertising and publishers? </p>
<p>Let&#8217;s pretend that you are one of multiple players using an advertising platform. That platform is shared with other paid search outlets and has a significant volume of advertisers. You then create your own advertising platform which starts with no advertisers. What will happen? </p>
<p><span id="more-94"></span></p>
<p>Your cost per click from the few advertisers that you initially gain, will be lower. Even if you offer a great ROI, the number of competitors is not enough to allow the CPC to drift up, fast. So you could serve as many adverts, but you won&#8217;t get the revenue stream, until you&#8217;ve recruited more advertisers to drive up the competition and hence make the prices rise.</p>
<p>What brings in advertisers? A good ROI, for sure, but also users. Usage stats suggest that Live has a small but not negligible proportion of the market. Enough of a portion that advertisers might include them in the Pareto (80/20) rule to include as a target. But that&#8217;s a long way from a mandatory requirement, such as Google SERPs and AdWords. The advertiser volume is low, so ROI is good. User volume is low, meaning that the marginal cost of using the system is relatively high (if it takes the same management time to manage 10% of the business as to manage 60-80% of the business, what should you prioritise?).</p>
<p>That all says &#8220;low revenue&#8221; to me. If the advertising platform were rolled out to appear on more sites, then the user base would increase. Well, looky here &#8211; John&#8217;s already spotted <a href="http://gotads.blogspot.com/2007/03/adcenter-advertisers-watch-out-for.html">the next installment</a> in the thinking! </p>
<p>By offering more impressions, Microsoft can increase the user base. The low cost of adverts makes that attractive for advertisers. But the low revenue potential makes that a no-brainer no-way for the affiliate &#8211; why publish low revenue adverts from Microsoft when the higher competition on Google offers higher priced adverts? </p>
<p>So&#8230; can we expect that those people publishing Microsoft adverts will see a better share of revenue? How much share of revenue does it take to make a Microsoft advert as valuable as a Google advert? And given the difficulties that Google has had in targeting contextual match in 2006, do you expect advertisers to be happy with what Microsofts weaker search technology can do with contextual match? </p>
<p>I&#8217;m expecting that Microsoft will offer a greater share of the advert cost with searchers. That they won&#8217;t use Google-style &#8220;smart pricing&#8221; on contextual match and then that the contextual match will offer a weaker ROI. That&#8217;ll drive up revenues, but eat into profits. Given that Microsoft sees crocodiles anywhere it moves in the swamp, it&#8217;ll probably throw some cash to see if it can drive away competitors. Dunno that anyone has ever proven that throwing money at crocs makes the world safer, but it is a common tactic. </p>
<p>Is contextual match advertising revenue a problem that you can make go away by throwing cash at it? Probably not. The fundamental problems are that keyword search is relatively simple &#8211; users tell you what they want to know, and your problem is to crawl and index the existing pages and apply a bunch of filters to prevent spam sites from getting too high a rank. If you have to infer the conceptual meaning of a web page to deliver suitable adverts, then you are in a different domain of expertise. It could be that somehow Microsoft have jumped Google on this, but it seems&#8230; unlikely. So I&#8217;m expecting that advertisers will complain about the quality of site matching. I can think of an approach to beating Google, or at least matching them, but it relies on some wierd stats that are most popular in the old Soviet bloc and doesn&#8217;t rely on much of a conceptual matching system. But given that content-based publishing system is part of the Search Empire, that is an unlikely path&#8230; </p>
<p>So far, none of our clients have expressed any interest in testing or using Microsofts&#8217; contextual matching. They have enough difficulty with the more mature, and improved over the 2006 performance, Google contextual matching. And they&#8217;ve seen Microsoft Live Search&#8230; Even those that enthuse about the ROI offered on Microsoft AdCenter are still using Google as their default search engine. So we have no evidence that Microsofts&#8217; contextual search is strong or weak. But would you think that it might be strong? Care to gamble your next bonus on that? </p>
<p>This all suggests to me that publishers may be in for a good year, with Microsoft. If you have to burn money to grow, then buying new outlets for adverts has to be good news. It drives up usage, makes advertisers more competitive and that will ultimately drive up the revenue. But that&#8217;s a slow, uphill drive. Especially slow when the engine isn&#8217;t working properly and when you risk increasing advertiser defection through poor matching. Hard times for whoever is currently in the hot seat at Microsoft Live Search. </p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=94" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2007/04/05/microsoft-adcenter-cpc-and-revenue/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Windows, now and then</title>
		<link>http://blog.merjis.com/2006/11/10/windows-now-and-then/</link>
		<comments>http://blog.merjis.com/2006/11/10/windows-now-and-then/#comments</comments>
		<pubDate>Fri, 10 Nov 2006 10:52:38 +0000</pubDate>
		<dc:creator>rich</dc:creator>
				<category><![CDATA[microsoft]]></category>

		<guid isPermaLink="false">http://blog.merjis.com/2006/11/10/windows-now-and-then/</guid>
		<description><![CDATA[Now: 9 Nov 2006 [Jim] Allchin [Microsoft VP] Suggests Vista Won&#8217;t Need Antivirus During a telephone conference with reporters yesterday, outgoing Microsoft co-president Jim Allchin, while touting the new security features of Windows Vista, which was released to manufacturing yesterday, told a reporter that the system&#8217;s new lockdown features are so capable and thorough that [...]]]></description>
			<content:encoded><![CDATA[<h3>Now: 9 Nov 2006</h3>
<p>[Jim] Allchin [Microsoft VP] Suggests Vista Won&#8217;t Need Antivirus</p>
<blockquote><p>
During a telephone conference with reporters yesterday, outgoing Microsoft co-president Jim Allchin, while touting the new security features of Windows Vista, which was released to manufacturing yesterday, told a reporter that the system&#8217;s new lockdown features are so capable and thorough that he was comfortable with his own seven-year-old son using Vista without antivirus software installed.
</p></blockquote>
<p><a href="http://www.betanews.com/article/Allchin_Suggests_Vista_Wont_Need_Antivirus/1163104965">(source)</a></p>
<h3>Then: 6 Sep 2001</h3>
<p>Jim Allchin: &#8220;Microsoft stamps out XP buffer overflows&#8221;</p>
<blockquote><p>
Microsoft has said it has stamped out buffer overflows with the upcoming release of Windows XP. Jim Allchin, vice president, claimed the company has done a complete code review of its operating system and removed all buffers which could overflow.
</p></blockquote>
<p><a href="http://www.vnunet.com/networkitweek/news/2057931/microsoft-stamps-xp-buffer-overflows">(source)</a></p>
 <img src="http://blog.merjis.com/wp-content/plugins/wordpress-feed-statistics/feed-statistics.php?view=1&post_id=58" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://blog.merjis.com/2006/11/10/windows-now-and-then/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

