<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Robert Accettura&#039;s Fun With Wordage &#187; analytics</title>
	<atom:link href="http://robert.accettura.com/blog/tag/analytics/feed/" rel="self" type="application/rss+xml" />
	<link>http://robert.accettura.com</link>
	<description>Robert Accettura&#039;s Personal Blog on Web Development and Tech</description>
	<lastBuildDate>Thu, 09 Feb 2012 01:43:47 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<atom:link rel='hub' href='http://robert.accettura.com/?pushpress=hub'/>
<cloud domain='robert.accettura.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
		<item>
		<title>2012 Presidential Candidate Websites</title>
		<link>http://robert.accettura.com/blog/2011/09/01/2012-presidential-candidate-websites/</link>
		<comments>http://robert.accettura.com/blog/2011/09/01/2012-presidential-candidate-websites/#comments</comments>
		<pubDate>Fri, 02 Sep 2011 00:07:37 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Around The Web]]></category>
		<category><![CDATA[akamai]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[barack obama]]></category>
		<category><![CDATA[campaign 2012]]></category>
		<category><![CDATA[drupal]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[jquery]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[server]]></category>
		<category><![CDATA[wordpress]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=6044</guid>
		<description><![CDATA[Back in 2008 I did a special segment in my &#8220;Secrets In Websites&#8221; series for the 2008 Presidential Elections. It was quite popular (almost crashed the server). I decided to do it again, but slightly revised for 2012. My observations/conclusions/insights &#8230; <a href="http://robert.accettura.com/blog/2011/09/01/2012-presidential-candidate-websites/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Back in 2008 I did a special segment in my &#8220;Secrets In Websites&#8221; series for the <a href="http://robert.accettura.com/blog/2008/01/11/secrets-in-websites-ii/2/">2008 Presidential Elections</a>.  It was quite popular (almost crashed the server).  I decided to do it again, but slightly revised for 2012.</p>
<p><span id="more-6044"></span>My observations/conclusions/insights (if you can call it that) can be found after the raw data.</p>
<div style="background-color: #FFFFD1; border: 2px dashed #FFF100; padding: 8px 15px;">
<strong>Disclaimer:</strong><em> If you post a comment that&#8217;s beyond the technical scope of this post, it will be deleted</em>.  This isn&#8217;t a politics site, and I don&#8217;t have the patience or time for it.  My blog, my rules.  No exceptions.</p>
<p>This is just a list of data I collected as <a href="#datacollection">described</a> at the bottom of the page and empirical observations.  This site is <strong>not</strong> an endorsement for or against any candidate or party by myself or my employer.
</div>
<style type="text/css"> .dataTable th { text-align: left; } </style>
<h3>Backend</h3>
<table class="dataTable">
<tr class="tableHeader">
<th>Candidate</th>
<th>Server/OS</th>
<th>CMS</th>
<th>Host</th>
<th>CDN</th>
</tr>
<tr>
<th><a href="http://www.barackobama.com/">Barack Obama (D)</a></th>
<td>Apache on Unknown</td>
<td>Unknown</td>
<td><a href="http://www.level3.com/content">Level3 (CDN)</a></td>
<td>Google CDN, Level3 (footprint.net for assets.bostatic.com)</td>
</tr>
<tr>
<th><a href="http://www.michelebachmann.com/">Michele Bachmann (R)</a></th>
<td>Apache on Unknown</td>
<td>WordPress</td>
<td><a href="http://www.smartechcorp.net">Smartech</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://www.hermancain.com/">Herman Cain (R)</a></th>
<td>Apache on Unknown</td>
<td>WordPress</td>
<td><a href="http://www.godaddy.com">GoDaddy</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://www.newt.org/">Newt Gingrich (R)</a></th>
<td>Apache on Unknown</td>
<td>Drupal</td>
<td><a href="http://www.smartechcorp.net">Smartech</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://www.jon2012.com/">Jon Huntsman, Jr. (R)</a></th>
<td>Apache on Unknown</td>
<td>Drupal</td>
<td><a href="http://www.rackspace.com/cloud/">Rackspace Cloud</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://www.garyjohnson2012.com/">Gary Johnson (R)</a></th>
<td>Apache on Unknown</td>
<td>WordPress</td>
<td><a href="http://mediatemple.net/">Media Temple</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://fredkarger.com/">Fred Karger (R)</a></th>
<td>Apache on Unknown</td>
<td>Drupal</td>
<td><a href="http://www.slicehost.com">Slicehost</a> / <a href="http://www.rackspace.com/cloud">Rackspace Cloud</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://www.andymartinforpresident.com/">Andy Martin (R}</a></th>
<td>Apache on Unknown</td>
<td>Appears to be static files</td>
<td><a href="http://www.godaddy.com">GoDaddy</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://www.mccotter2012.com/">Thaddeus McCotter (R)</a></th>
<td>Apache on Unknown</td>
<td>WordPress</td>
<td><a href="http://www.rackspace.com">Rackspace</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://jimmymcmillan.org/">Jimmy McMillan (R)</a></th>
<td>Nginx / Varnish</td>
<td>Trellix Site Builder</td>
<td><a href="http://www.ipowerweb.com/">IPOWERWEB</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://www.roymoore2012.com/">Roy Moore (R)</a></th>
<td>Nginx / Varnish, Apache</td>
<td>Unknown</td>
<td><a href="http://www.ipowerweb.com/">IPOWERWEB</a></td>
<td>Cotendo</td>
</tr>
<tr>
<th><a href="http://www.ronpaul2012.com/">Ron Paul (R)</a></th>
<td>Apache on Unix</td>
<td>WordPress</td>
<td><a href="http://www.racksapce.com">Rackspace</a></td>
<td>Rackspace Cloud Files (Akamai)</td>
</tr>
<tr>
<th><a href="http://www.rickperry.org">Rick Perry (R)</a></th>
<td>Apache on Unknown</td>
<td>WordPress</td>
<td><a href="http://www.slicehost.com">Slicehost</a> / <a href="http://www.rackspace.com/cloud">Rackspace Cloud</a></td>
<td>Google CDN</td>
</tr>
<tr>
<th><a href="http://www.buddyroemer.com/">Buddy Roemer (R)</a></th>
<td>Apache on Unknown</td>
<td>WordPress</td>
<td><a href="http://www.smartechcorp.net">Smartech</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://www.mittromney.com">Mitt Romney (R)</a></th>
<td>Nginx, Varnish on Unknown</td>
<td>Drupal</td>
<td><a href="http://https://aws.amazon.com/ec2/">Amazon Cloud</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://www.ricksantorum.com">Rick Santorum (R)</a></th>
<td>Apache on Ubuntu</td>
<td>Drupal</td>
<td><a href="http://www.slicehost.com">Slicehost</a> / <a href="http://www.rackspace.com/cloud">Rackspace Cloud</a></td>
<td>No</td>
</tr>
<tr>
<th><a href="http://jonathontheimpalerforpresident2008.blogspot.com/">Jonathon Sharkey (R)</a></th>
<td>GSE (Google)</td>
<td>Blogspot</td>
<td>Google</td>
<td>Google Hosted</td>
</tr>
<tr>
<th><a href="http://www.timpawlenty.com/">Tim Pawlenty (R)</a></th>
<td>Varnish/Apache on Red Hat</td>
<td>Drupal</td>
<td><a href="http://www.rackspace.com/">Rackspace</a> via Freedom First PAC</td>
<td>No</td>
</tr>
</table>
<h3>Frontend</h3>
<table class="dataTable">
<tr class="tableHeader">
<th>Candidate</th>
<th>Markup</th>
<th># Validation Errors</th>
<th>Layout</th>
<th>Charset</th>
<th>HTTP Compression</th>
</tr>
<tr>
<th>Barack Obama (D)</th>
<td>HTML5</td>
<td>72 Errors, 4 warning(s)</td>
<td>CSS</td>
<td>UTF-8</td>
<td>Yes (~75%)</td>
</tr>
<tr>
<th>Michele Bachmann (R)</th>
<td>HTML5</td>
<td>27 Errors, 5 warning(s) </td>
<td>CSS</td>
<td>UTF-8</td>
<td>Yes (~65%)</td>
</tr>
<tr>
<th>Herman Cain (R)</th>
<td>HTML5</td>
<td>24 Errors, 2 warning(s)</td>
<td>CSS</td>
<td>UTF-8</td>
<td>No</td>
</tr>
<tr>
<th>Newt Gingrich (R)</th>
<td>XHTML 1.0</td>
<td>14 Errors, 3 warning(s) </td>
<td>CSS</td>
<td>UTF-8</td>
<td>No</td>
</tr>
<tr>
<th>Jon Huntsman, Jr. (R)</th>
<td>XHTML 1.0</td>
<td>364 Errors</td>
<td>CSS</td>
<td>UTF-8</td>
<td>Yes (~75%)</td>
</tr>
<tr>
<th>Gary Johnson (R)</th>
<td>HTML5</td>
<td>15 Errors, 1 warning(s)</td>
<td>CSS</td>
<td>UTF-8</td>
<td>No</td>
</tr>
<tr>
<th>Fred Karger (R)</th>
<td>XHTML 1.0</td>
<td>6 Errors, 6 warning(s)</td>
<td>CSS</td>
<td>UTF-8</td>
<td>Yes (~76%)</td>
</tr>
<tr>
<th>Andy Martin (R)</th>
<td>No Doctype</td>
<td>45 Errors, 2 warning(s) as HTML4 Trans</td>
<td>Table</td>
<td>windows-1252</td>
<td>No</td>
</tr>
<tr>
<th>Thaddeus McCotter (R)</th>
<td>HTML5</td>
<td>50 Errors, 2 warning(s) </td>
<td>CSS</td>
<td>UTF-8</td>
<td>No</td>
</tr>
<tr>
<th>Jimmy McMillan (R)</th>
<td>No Doctype</td>
<td>27 Errors, 10 warning(s) </td>
<td>Table</td>
<td>iso-8859-1</td>
<td>Yes (~88%)</td>
</tr>
<tr>
<th>Roy Moore (R)*</th>
<td>XHTML 1.0</td>
<td>3 Errors, 6 warning(s) </td>
<td>Flash, Tables</td>
<td>UTF-8</td>
<td>Yes (~65%)</td>
</tr>
<tr>
<th>Ron Paul (R)</th>
<td>XHTML 1.0 Strict</td>
<td>34 Errors, 19 warning(s) </td>
<td>CSS</td>
<td>UTF-8</td>
<td>No</td>
</tr>
<tr>
<th>Rick Perry (R)</th>
<td>HTML5</td>
<td>6 Errors, 3 warning(s)</td>
<td>CSS</td>
<td>UTF-8</td>
<td>Yes (~73%)</td>
</tr>
<tr>
<th>Buddy Roemer (R)</th>
<td>XHTML 1.0</td>
<td>23 Errors, 7 warning(s)</td>
<td>CSS</td>
<td>UTF-8</td>
<td>No</td>
</tr>
<tr>
<th>Mitt Romney (R)</th>
<td>XHTML 1.0 Strict</td>
<td>5 Errors, 4 warning(s) </td>
<td>CSS</td>
<td>UTF-8</td>
<td>Yes (~77%)</td>
</tr>
<tr>
<th>Rick Santorum (R)</th>
<td>XHTML 1.0 Strict</td>
<td>22 Errors</td>
<td>CSS</td>
<td>UTF-8</td>
<td>Yes (~74%)</td>
</tr>
<tr>
<th>Jonathon Sharkey (R)</th>
<td>XHTML 1.0 Strict</td>
<td>158 Errors, 199 warning(s) </td>
<td>CSS</td>
<td>UTF-8</td>
<td>No</td>
</tr>
<tr>
<th>Tim Pawlenty ®</th>
<td>XHTML 1.0 Strict</td>
<td>42 Errors</td>
<td>CSS</td>
<td>UTF-8</td>
<td>Yes (~83%)</td>
</tr>
</table>
<h3>Frontend (cont)</h3>
<table class="dataTable">
<tr class="tableHeader">
<th>Candidate</th>
<th>apple-touch-icon</th>
<th>Syndication Format</th>
<th>Framework/Libraries</th>
<th>Social Networks</th>
<th>Analytics</th>
<th>Misc.</th>
</tr>
<tr>
<th>Barack Obama (D)</th>
<td>Yes</td>
<td>RSS 2.0</td>
<td>jQuery</td>
<td>Facebook, Twitter, YouTube</td>
<td>Google Analytics</td>
<td>Chrome IE Frame, Viewport Meta tags</td>
</tr>
<tr>
<th>Michele Bachmann (R)</th>
<td>No</td>
<td>Feedburner/RSS2</td>
<td>jQuery,Yahoo Base CSS, SWFObject</td>
<td>Facebook, Twitter, YouTube, Flickr</td>
<td>Google Analytics</td>
<td>FB OpenGraph, lots of WordPress Plugins</td>
</tr>
<tr>
<th>Herman Cain (R)</th>
<td>No</td>
<td>No</td>
<td>jQuery</td>
<td>Facebook, TWitter</td>
<td>Google Analytics</td>
<td>ShareThis, Some WordPress Plugins</td>
</tr>
<tr>
<th>Newt Gingrich (R)</th>
<td>No</td>
<td>RSS</td>
<td>jQuery</td>
<td>Facebook, Twitter, YouTube</td>
<td>Google Analytics, Omniture</td>
<td>Has separate mobile site. Short domain.</td>
</tr>
<tr>
<th>Jon Huntsman, Jr. (R)</th>
<td>No</td>
<td>RSS 2.0</td>
<td>jQuery</td>
<td>Facebook, Twitter, YouTube, Vimeo</td>
<td>Google Analytics</td>
<td>ShareThis</td>
</tr>
<tr>
<th>Gary Johnson (R)</th>
<td>No</td>
<td>RSS 2.0</td>
<td>jQuery</td>
<td>Facebook, Twitter, Google+, Flickr, LinkedIn, YouTube</td>
<td>Google Analytics, KISSmetrics</td>
<td>Web Fonts</td>
</tr>
<tr>
<th>Fred Karger (R)</th>
<td>No</td>
<td>RSS</td>
<td>jQuery</td>
<td>Facebook, Twitter, Flickr, YouTube</td>
<td>Google Analytics</td>
<td>NetBoots Powered</td>
</tr>
<tr>
<th>Andy Martin (R)</th>
<td>No</td>
<td>Atom/RSS via Blogspot blog</td>
<td>No</td>
<td>Facebook, Twitter</td>
<td>-</td>
<td>Font tag</td>
</tr>
<tr>
<th>Thaddeus McCotter (R)</th>
<td>No</td>
<td>RSS 2.0</td>
<td>jQuery, jQuery UI</td>
<td>Facebook, Twitter, YouTube</td>
<td>Google Analytics, Chartbeat</td>
<td>Multilingual (xili-language powered)</td>
</tr>
<tr>
<th>Jimmy McMillan (R)</th>
<td>No</td>
<td>No</td>
<td>-</td>
<td>-</td>
<td>Hit Counter by Digits</td>
<td>The rent is too damn high</td>
</tr>
<tr>
<th>Roy Moore (R)</th>
<td>No</td>
<td>No</td>
<td>SwfObject</td>
<td>Facebook, YouTube</td>
<td>Google Analytics</td>
<td>iframed flash site. Likely to prevent spidering / caching content.</td>
</tr>
<tr>
<th>Ron Paul (R)</th>
<td>No</td>
<td>RSS 2.0</td>
<td>jQuery</td>
<td>Twitter, Facebook, YouTube</td>
<td>Google Analytics, Chartbeat</td>
<td>W3 Total Cache</td>
</tr>
<tr>
<th>Rick Perry (R)</th>
<td>Yes</td>
<td>RSS  2.0</td>
<td>html5 boilerplate/modernizr, jQuery</td>
<td>Facebook, Twitter</td>
<td>ChartBeat, Google Analytics</td>
<td><code>@media print</code></td>
</tr>
<tr>
<th>Buddy Roemer (R)</th>
<td>No</td>
<td>RSS (via FeedBurner)</td>
<td>jQuery</td>
<td>Facebook, Twitter, YouTube</td>
<td>Google Analytics</td>
<td>All in One SEO Pack</td>
</tr>
<tr>
<th>Mitt Romney (R)</th>
<td>Yes</td>
<td>RSS 2.0</td>
<td>jQuery, Typekit, Gigya</td>
<td>Facebook, Twitter, YouTube, Flickr</td>
<td>Google Analytics, Omniture, Lotame, NewRelic, Compete, Clickable</td>
<td>Analytics!</td>
</tr>
<tr>
<th>Rick Santorum (R)</th>
<td>No</td>
<td>RSS</td>
<td>jQuery, TypeKit</td>
<td>Facebook YouTube, Twitter, Flickr</td>
<td>Google Analytics</td>
<td>That Google bomb is still working wonders on his name</td>
</tr>
<tr>
<th>Jonathon Sharkey (R)</th>
<td>No</td>
<td>Atom/RSS 2.0</td>
<td>-</td>
<td>-</td>
<td>Google</td>
<td>Doesn&#8217;t own a .com?</td>
</tr>
<tr>
<th>Tim Pawlenty (R)</th>
<td>No</td>
<td>No</td>
<td>jQuery</td>
<td>Twitter, YouTube, Flickr, Facebook</td>
<td>GoSquared, ChartBeat, DoubleClick Floodlight, Google Analytics</td>
<td>Ended campaign</td>
</tr>
</table>
<h3>Observations</h3>
<p>I retooled this for 2012 based on how web development and the internet has changed as well as the data that&#8217;s available.  The most noteworthy thing is that EVERY campaign uses open source.  Perhaps it&#8217;s saving money in this economy?  Windows Server isn&#8217;t free after all.  Most use it extensively.  Regardless who wins, that candidate would be very hypocritical to support the (unlikely regardless) &#8220;<a href="http://www.slate.com/id/2130798/">open source is communism</a>&#8221; mantra.  If this isn&#8217;t proof that open source is as mainstream as ever in America, I don&#8217;t know what is.  Apache is a huge winner.  So is jQuery, WordPress, Drupal, even Nginx and Varnish made a showing (they weren&#8217;t even on the radar in 2008).</p>
<p>Lots of websites are using the HTML5 doctype now.  That doesn&#8217;t mean they are using HTML5, but many are moving in that direction.  Web Fonts were spotted.  Tables were very rare.</p>
<p>Shockingly, CDN usage and HTTP compression were pretty rare.  Given Google will host popular javascript frameworks (jQuery for example), if you can&#8217;t afford the CPU to gzip data you could let Google host it for free.  Lots of cloud hosting though.</p>
<p>Between popular CMS&#8217;s, and popular plugins/modules for those CMS&#8217;s, there&#8217;s little diversity in these sites this time around.  It was obvious last time, it&#8217;s much more obvious this time.  Mono-culture has set in regarding the technicals of these sites.</p>
<p>One thing that really stood out is the amount of analytics on each site.  This election is really a data war.  Knowing as much as possible about voters and the candidates base.  Clearly this is an escalation from 2008.  There is also a proliferation of real-time analytics usage this time around.  Data is everything in the business world, it&#8217;s becoming that way in campaign websites as well.</p>
<h3>Misc Notes</h3>
<p><strong id="datacollection">Data collection method:</strong> The data for this blog post was mostly done on the evening of August 30, 2011 and August 31, 2011 by myself. The character set was however Firefox 6.0 interprets the page. HTML validation was checked by submitting to the W3C validator. All other analysis was done by eye and using tools like cURL. Some things were a little bit of a judgment call, such as CSS layout. I didn’t generally penalize if a table was used, it depends how it was used, and the context. I viewed source on all of them, and spent some time looking around while collecting data. I didn’t view every page on every site, since that would drive me insane. The data is based on the homepage of the site however I did make a brief attempt to hunt for feeds since some only include a link on a &#8220;news&#8221; page. If I couldn&#8217;t find it quick enough, it doesn&#8217;t exist.</p>
<p><strong>Secrets In Websites III?:</strong> Yes there will likely be a third installment. I don&#8217;t know when, I don&#8217;t know what will be included.  I do have some ideas and notes. It takes time to put these together, and I&#8217;m not exactly drowning in free time these days.</p>
<p><small>* Roy Moore&#8217;s website is Flash in an iframe.  For purposes of this analysis I&#8217;m using the page containing the flash object.</small>
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=6044#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2011/09/6b39183.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2011/09/01/2012-presidential-candidate-websites/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Who Indexes Tweets</title>
		<link>http://robert.accettura.com/blog/2009/12/06/who-indexes-tweets/</link>
		<comments>http://robert.accettura.com/blog/2009/12/06/who-indexes-tweets/#comments</comments>
		<pubDate>Sun, 06 Dec 2009 20:49:41 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[baidu]]></category>
		<category><![CDATA[bing]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=3160</guid>
		<description><![CDATA[I was curious who is indexing the links that people tweet on Twitter. It&#8217;s obvious someone does since links get &#8216;clicks&#8217; almost immediately after submission. To do this presumably they are tapping into the xmpp firehose. Lets take a look: &#8230; <a href="http://robert.accettura.com/blog/2009/12/06/who-indexes-tweets/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I was curious who is indexing the links that people tweet on Twitter.  It&#8217;s obvious someone does since links get &#8216;clicks&#8217; almost immediately after submission.  To do this presumably they are tapping into the <a href="http://blog.twitter.com/2008/07/twitter-and-xmpp-drinking-from-fire.html">xmpp firehose</a>.</p>
<p>Lets take a look:</p>
<pre>66.xxx.xxx.xxx - - [06/Dec/2009:20:17:43 +0000] "GET /test HTTP/1.1" 301 20 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
</pre>
<p>I guess Google has a deal with Twitter.  Googlebot indexed just a few seconds after it was sent.  As far as I know nothing is actually announced.  This is the first evidence I know of a potential deal of some sort.  I&#8217;d be shocked if Google is scraping the site this quickly.</p>
<p><strong>Edit:</strong> Stephen Duncan pointed out in the comments that this was <a href="http://googleblog.blogspot.com/2009/10/rt-google-tweets-and-updates-and-search.html">announced in October</a>.  Totally forgot about that.</p>
<pre>208.xxx.xxx.xxx - - [06/Dec/2009:20:17:47 +0000] "GET /test HTTP/1.0" 301 - "-" "Mozilla/5.0 (compatible; Butterfly/1.0; +http://labs.topsy.com/butterfly.html) Gecko/2009032608 Firefox/3.0.8"
</pre>
<p>This is <a href="http://topsy.com/">Topsy</a>, a twitter search engine. Never saw this site before. Few tests and I actually kind of like the output.</p>
<pre>89.xxx.xxx.xxx - - [06/Dec/2009:20:17:58 +0000] "GET /test HTTP/1.1" 301 - "-" "Mozilla/5.0 (compatible; MSIE 6.0b; Windows NT 5.0) Gecko/2009011913 Firefox/3.0.6 TweetmemeBot"
</pre>
<p><a href="http://tweetmeme.com/">Tweetmeme</a> mines Twitter links and attempts to build a Digg-like index based on retweets rather than Diggs.</p>
<pre>75.xxx.xxx.xxx - - [06/Dec/2009:20:18:05 +0000] "GET /test HTTP/1.1" 301 - "-" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0)"
72.xxx.xxx.xxx - - [06/Dec/2009:20:20:25 +0000] "GET /test HTTP/1.1" 301 - "-" "Python-urllib/2.5"
</pre>
<p>Can&#8217;t identify these AWS hosted services.</p>
<pre>70.xxx.xxx.xxx - - [06/Dec/2009:20:20:53 +0000] "GET /test HTTP/1.1" 301 20 "-" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0)"
70.xxx.xxx.xxx - - [06/Dec/2009:20:24:23 +0000] "GET /test HTTP/1.1" 301 20 "-" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0)"
</pre>
<p>This is actually Microsoft.  Microsoft&#8217;s <a href="http://www.bing.com/Twitter">Bing</a> search engine indexes Twitter.  I&#8217;m not sure why they indexed twice in such close intervals that seems odd for this day and age.</p>
<p>Mining logs a little deeper it looks like when tweets meet certain criteria (such as retweeted) there are other bots that spider them.  It also looks like other search engines may be indexing at a slower rate (<a href="http://www.baidu.jp/">Baidu</a> for example).</p>
<p>There are several others from AWS and a few other dedicated providers.  These servers are obviously trying to keep a low profile, they don&#8217;t even have reverse DNS.</p>
<p>So there you go.  Just a matter of seconds after a link hits Twitter this all happens.</p>
<p>Here&#8217;s a few more from another Tweet that weren&#8217;t in the first set:</p>
<p><strong>Edit: More!:</strong></p>
<pre>75.xxx.xxx.xxx - - [06/Dec/2009:20:49:42 +0000] "GET /test HTTP/1.1" 301 - "-" "Mozilla/5.0 (compatible; Feedtrace-bot/0.2; bot@feedtrace.com)"
</pre>
<p><a href="http://www.feedtrace.com/">Feedtrace</a> is some sort of twitter mining service currently in beta.</p>
<pre>67.xxx.xxx.xxx - - [06/Dec/2009:20:49:45 +0000] "GET /test HTTP/1.0" 301 - "-" "Mozilla/5.0 (compatible; mxbot/1.0; +http://www.chainn.com/mxbot.html)"
</pre>
<p><a href="http://www.chainn.com/">Chainn</a> is a social data mining service with a few apps that make use of the data it collects.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=3160#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2009/12/9808ae3.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2009/12/06/who-indexes-tweets/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Yahoo! Web Analytics</title>
		<link>http://robert.accettura.com/blog/2008/10/08/yahoo-web-analytics/</link>
		<comments>http://robert.accettura.com/blog/2008/10/08/yahoo-web-analytics/#comments</comments>
		<pubDate>Thu, 09 Oct 2008 01:29:23 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Web Development]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[google analytics]]></category>
		<category><![CDATA[yahoo]]></category>
		<category><![CDATA[yahoo web analytics]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=2061</guid>
		<description><![CDATA[It went somewhat unnoticed, but Yahoo! today announced it&#8217;s Yahoo! Web Analytics package which is intended to compete with the wildly popular Google Analytics. I&#8217;ve spent quite a few hours in analytics packages over the years ranging from very amateurish &#8230; <a href="http://robert.accettura.com/blog/2008/10/08/yahoo-web-analytics/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>It went somewhat unnoticed, but Yahoo! today announced it&#8217;s <a href="http://web.analytics.yahoo.com/">Yahoo! Web Analytics</a> package which is intended to compete with the wildly popular Google Analytics.  I&#8217;ve spent quite a few hours in analytics packages over the years ranging from very amateurish to enterprise grade.  Google Analytics is a very good product but it does have limitations.  The biggest limitation is the lack of real-time reporting.  Google Analytics takes a few hours, making it for most people next-day service.  This isn&#8217;t a big deal for some, but if your in an environment where you need feedback on your content ASAP (a must for media sites), this is a huge deal.  Yahoo is promising to deliver &#8220;within minutes&#8221;:</p>
<blockquote cite="http://web.analytics.yahoo.com/"><p>
Get detailed reporting <strong>within minutes</strong> after an action occurs on your website. Quickly identify dips in key site metrics or monitor the performance of new content. Seeing the impact of website and marketing changes immediately makes it much easier to optimize them. Yahoo! Web Analytics also maintains historical data so you can go back at any time to review old data for new insight, or compare the present to the past without any changes to your page tags.
</p></blockquote>
<p>Interesting.  I wonder if this will light a fire under Google&#8217;s butt to deliver real-time analytics as well.  Urchin wasn&#8217;t really designed for real-time data.  Google&#8217;s obviously done a lot of work with it to build Google Analytics.  I wonder if that&#8217;s the next step for them.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=2061#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2008/10/52dbb06.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2008/10/08/yahoo-web-analytics/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Secrets In Websites II</title>
		<link>http://robert.accettura.com/blog/2008/01/11/secrets-in-websites-ii/</link>
		<comments>http://robert.accettura.com/blog/2008/01/11/secrets-in-websites-ii/#comments</comments>
		<pubDate>Sat, 12 Jan 2008 02:25:49 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Around The Web]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Mozilla]]></category>
		<category><![CDATA[akamai]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[barack obama]]></category>
		<category><![CDATA[bill richardson]]></category>
		<category><![CDATA[campaign 2008]]></category>
		<category><![CDATA[chase]]></category>
		<category><![CDATA[chris dodd]]></category>
		<category><![CDATA[css]]></category>
		<category><![CDATA[democrat]]></category>
		<category><![CDATA[dennis kucinich]]></category>
		<category><![CDATA[duncan hunter]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[firefox]]></category>
		<category><![CDATA[fred thompson]]></category>
		<category><![CDATA[global crossing]]></category>
		<category><![CDATA[hillary clinton]]></category>
		<category><![CDATA[html]]></category>
		<category><![CDATA[joe biden]]></category>
		<category><![CDATA[john edwards]]></category>
		<category><![CDATA[john mccain]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[mike gravel]]></category>
		<category><![CDATA[mike huckabee]]></category>
		<category><![CDATA[mitt romney]]></category>
		<category><![CDATA[mtv]]></category>
		<category><![CDATA[myspace]]></category>
		<category><![CDATA[popurls]]></category>
		<category><![CDATA[redhat]]></category>
		<category><![CDATA[republican]]></category>
		<category><![CDATA[ron paul]]></category>
		<category><![CDATA[rudy guliani]]></category>
		<category><![CDATA[secrets in websites]]></category>
		<category><![CDATA[stephen colbert]]></category>
		<category><![CDATA[tom tancredo]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[wordpress]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/archives/2008/01/11/secrets-in-websites-ii/</guid>
		<description><![CDATA[This post is a follow up to the first Secrets In Websites. For those who don&#8217;t remember the first time, I point out odd, interesting, funny things in other websites&#8217; code. Yes it takes some time to put a post &#8230; <a href="http://robert.accettura.com/blog/2008/01/11/secrets-in-websites-ii/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>This post is a follow up to the first <a href="http://robert.accettura.com/blog/2007/01/20/secrets-in-websites/">Secrets In Websites</a>.  For those who don&#8217;t remember the first time, I point out odd, interesting, funny things in other websites&#8217; code.  Yes it takes some time to put a post like this together, that&#8217;s why it&#8217;s just about a year since the last time.  Enough with the intro, read on for the code.</p>
<p><span id="more-1270"></span></p>
<h3>The Code</h3>
<h4>WordPress.com</h4>
<p>You&#8217;ll notice a comment sure to make any web developer laugh on WordPress.com&#8217;s <a href="http://wordpress.com/wp-login.php">login page</a></p>
<pre>

	&lt;link rel=&quot;stylesheet&quot; href=&quot;http://wordpress.com/wp-admin/wp-admin.css?version=MU&quot; type=&quot;text/css&quot; /&gt;
	&lt;!--[if IE]&gt;
		&lt;style type=&quot;text/css&quot;&gt;#login h1 a { margin-top: 35px; } #login #login_error { margin-bottom: 10px; }&lt;/style&gt;&lt; ![endif]--&gt;
	&lt;!-- Curse you, IE! --&gt;
</pre>
<p>The guys behind <a href="http://www.wordpress.com">WordPress</a> a <a href="http://www.webstandards.org/press/releases/20050601/">while back</a> took the site <a href="http://browsehappy.com/">BrowseHappy</a> under its wing.  WordPress has always been a strong believer in web standards, so this isn&#8217;t surprising (though still amusing).  Did you also know that the guys behind it (<a href="http://automattic.com/about/">Automattic</a>) don&#8217;t have job titles?  Unless you consider &#8220;Chief BBQ Taste Tester&#8221; to be a real job title.  <a href="http://www.photomatt.net">Matt</a>, I hope your job doesn&#8217;t kill you with a heart attack.</p>
<h4>Facebook</h4>
<p>The geniuses over at <a href="http://www.facebook.com">Facebook</a> feel the same and put the following on the top of their IE conditionally included <a href="http://static.ak.facebook.com/css/ie6.css?48:76473">stylesheets</a>:</p>
<pre>

/*  ------------------------------------------------------------------------
                    Facebook | IE/PC Hacks | getfirefox.com
    ------------------------------------------------------------------------  */
</pre>
<h4>popurls</h4>
<p>The ever so popular <a href="http://www.popurls.com">popurls</a> has the following comment in the header of the page.</p>
<pre>

&lt;!--

  __   __
 (  \,/  )
  \_ | _/  IN THE FUTURE EVERY URL WILL BE POPULAR FOR 1.5 SECONDS
  (_/ \_)                  - thomas and the wise popurls butterfly

--&gt;
</pre>
<h4>RedHat</h4>
<p><a href="http://www.redhat.com">RedHat</a> was one of the earlier corporate sites to redo itself into a standards based design.  They have great respect for those who came before them.  In their <a href="http://www.redhat.com/s/master.css">master css</a> file they have the following tribute as well as a little remark about Netscape 4.x:</p>
<pre>

/* 	redhat.com MASTER style sheet

	a tip of the red hat to Zeldman, Bowman, Meyer, Shea, Cederholm, Newhouse, Holzschlag,
	and many, many other css and web standards pioneers who have inspired us. 

	the css, layout and validation status of redhat.com is a work-in-progress. numerous
	web-building worker bees are working furiously to correct the bugs, minimize the hacks
	and validate the code. stay tuned. 

-------------------------------------------------------------------- created June 2004 */
@import url(&quot;global.css&quot;);
...
@import url(&quot;dig.css&quot;);

/* ---------------------------------------------------------------- ns4 styles - bah! */

table {
	border: 1px;
	}
...
</pre>
<h4>Panic Software</h4>
<p><a href="http://www.panic.com/">Panic Software</a> has a cool little piece of code for those who browse the product page for <a href="http://www.panic.com/coda/">Coda</a> (awesome product btw) with IE and don&#8217;t have at least version 6.0:</p>
<pre>

		&lt;!--[if lte IE 6]&gt;&lt;p id=&quot;iewarning&quot;&gt;&lt;img src=&quot;/extras/ripoff/images/ie-warning.gif&quot; alt=&quot;IE Warning&quot; title=&quot;We hear Firefox is nice!&quot; /&gt;&lt;/p&gt;&lt; ![endif]--&gt;
</pre>
<p>I hear it&#8217;s pretty nice too.</p>
<p>Panic also has a comment in the head of their homepage that reads:</p>
<pre>

&lt;!-- This homepage design is not long for this world. Enjoy it while you can! <img src='http://robert.accettura.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  --&gt;
</pre>
<h4>Twitter</h4>
<p><a href="http://twitter.com/">Twitter</a> (who redirects to drop the &#8216;www&#8217; btw) is a very popular service these days.  In their html they mark which server served up the data.  You&#8217;ll see it in the form:</p>
<pre>

  &lt;!-- served to you through a copper wire by bennu.twitter.com at 24 Nov 19:08 in 11 ms (d 0 / r 8). thank you, come again. --&gt;
</pre>
<p>Copper eh?  No fiber in your data center?  I won&#8217;t judge, as long as your bandwidth is plentiful.</p>
<h4>WordPress.com</h4>
<p>Here&#8217;s a bonus from <a href="http://wordpress.com/">WordPress</a>.  While many analytics programs use a 1px transparent &#8220;tracker gif&#8221; to manage statistics, WordPress did something a little different.  At the very bottom on the left hand side, you can see the face of WordPress analytics in all it&#8217;s tiny glory.</p>
<h4>Mozilla</h4>
<p>This technically applies to more than just Firefox.  You&#8217;d be surprised to see how many times <code>kungFuDeathGrip</code> is in the <a href="http://lxr.mozilla.org/mozilla/search?string=kungFuDeathGrip">code base</a>.</p>
<h4>Many Sites using Google Products/Services</h4>
<p>Many people have noticed strange Google tags on sites such as:</p>
<pre>

code
&lt;!--googleoff: index--&gt;
all
&lt;!--googleon: index--&gt;
over
</pre>
<p>This isn&#8217;t a &#8220;SEO&#8221; practice, despite some misconception on the web.  This is used by the Google Search Appliance, a product made by Google which many websites use to power their own search engines to tell the engine what to read and what to ignore.  It wouldn&#8217;t be practical for Google to use these &#8220;in the wild&#8221;.  The reason is that spammers could effectively hide an alternate website within those comments.  Google&#8217;s business is based largely on accurate search results.  Spammers have already tried to abuse the css property <code>display: none;</code>.  This would be even better.  You can find code like this on <a href="http://www.apple.com">Apple.com</a> among many other sites.</p>
<p>Webmasters <em>can</em> however optimize their side for AdSense using a technique <a href="http://google.com/support/adsense/bin/answer.py?answer=23168">recommended by Google</a>:</p>
<pre>

&lt;!-- google_ad_section_start --&gt;

&lt;!-- google_ad_section_end --&gt;
</pre>
<p>This tells Google to give weight to a certain part of your page when deciding what ad to display on the page.  This is good for cases where you feel other material on your page is influencing the ads and resulting in off-topic ads.</p>
<h3>Infrastructure/Platform</h3>
<h4>Microsoft</h4>
<p><a href="http://www.microsoft.com">Microsoft</a>&#8216;s offering against Linux and Apache is IIS on Windows.  Which one would expect they themselves use.  What they don&#8217;t tell you is that they also have used <a href="http://www.akamai.com/">Akamai</a> (with over <a href="http://www.akamai.com/html/about/facts_figures.html">25,000</a> servers), which uses <a href="http://news.zdnet.co.uk/software/0,1000000121,39115920,00.htm">Linux</a>.  They have used Akamai for many things like DNS, and caching files.  Rather than &#8220;Powered By Windows Server&#8221; maybe they should append &#8220;hiding behind Linux&#8221;.</p>
<h4>Myspace.com</h4>
<p><a href="http://www.myspace.com">Myspace.com</a> was previously Adobe/Macromedia&#8217;s model customer because it was written in <a href="http://www.adobe.com/products/coldfusion/">ColdFusion</a>, and said to be the biggest ColdFusion site on the net (and one of the biggest sites on the net).  Many think it still is, but it&#8217;s not.  While many url&#8217;s suggest it might be because they end in <code>.cfm</code> it&#8217;s actually running ASP.net and has been <a href="http://weblogs.asp.net/scottgu/archive/2006/03/25/Handling-1.5-Billion-Page-Views-Per-Day-Using-ASP.NET-2.0.aspx">since aprox, 2006</a>.  You can confirm this by viewing the headers on some of their pages.  You&#8217;ll see:</p>
<pre>
X-AspNet-Version: 2.0.50727
</pre>
<h4>MTV.com</h4>
<p><a href="http://www.mtv.com">MTV.com</a>&#8216;s site has search powered by a Google Search Appliance.  MTV is also owned by <a href="http://www.techcrunch.com/2007/03/13/5217/">Viacom who sued Google</a>, the parent company of YouTube.  The folks at MTV awesomely <a href="http://labsblog.mtv.com/category/site-relaunch/">admitted the irony during relaunch on their blog</a>.</p>
<h4>Global Crossing</h4>
<p>Tier 1 networking provider <a href="http://www.globalcrossing.com/">Global Crossing</a> really wants you to know how fast they are.  Doing a trace could turn up something like this:</p>
<pre>
  7    15 ms    13 ms    14 ms  COMCAST-IP-SERVICES-LLC.tengigabitethernet1-4.ar5.NYC1.gblx.net [64.208.222.58]
  8    14 ms    13 ms    13 ms  tengigabitethernet1-4.ar5.NYC1.gblx.net [64.208.222.57]
</pre>
<p>Yes that&#8217;s right, they use 10 GigE!  Just FYI.</p>
<h3>Goofy</h3>
<h4>Firefox 2.0</h4>
<p>In Firefox 2.0, go to &#8220;About Firefox&#8221; (under the help menu for Windows, under the Firefox menu for Mac), and click on credits.  You&#8217;ll notice <a href="http://www.comedycentral.com/shows/the_colbert_report/index.jhtml">Stephen Colbert</a>.  He wrote it single handedly, but added some other names because he&#8217;s a nice guy.  Bonus: I&#8217;m on the list too.  Above him because I&#8217;m <a href="http://robert.accettura.com/blog/2007/11/17/who-dropped-the-soap/">better</a> than him.  That&#8217;s right, I said it.</p>
<h3>Handy</h3>
<h4>Chase</h4>
<p>Chase for some reason puts it&#8217;s login form in plain text.  The submit url is https, but it doesn&#8217;t feel right.  They do have a SSL enabled login page, but for some reason they hide it.  Here it is for those interested:</p>
<p><a href="https://chaseonline.chase.com/online/home/sso_co_home.jsp">https://chaseonline.chase.com/online/home/sso_co_home.jsp</a></p>
<h4>Google</h4>
<p>For some reason, most of Google&#8217;s services are insecure by default.  By simply going to https, you can use SSL for added security.<br />
Gmail: <a href="https://mail.google.com">https://mail.google.com</a><br />
Google Calendar: <a href="https://www.google.com/calendar">https://www.google.com/calendar</a><br />
Google Reader: <a href="https://www.google.com/reader">https://www.google.com/reader</a></p>
<p>On the <a href="2/">next page</a> is the 2008 US Presidential Candidate Campaign sites&#8230;</p>
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/archives/2008/01/11/secrets-in-websites-ii/#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2008/01/c850371.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2008/01/11/secrets-in-websites-ii/feed/</wfw:commentRss>
		<slash:comments>69</slash:comments>
		</item>
		<item>
		<title>Tab Impact On Total Time Spent</title>
		<link>http://robert.accettura.com/blog/2007/12/26/tab-impact-on-total-time-spent/</link>
		<comments>http://robert.accettura.com/blog/2007/12/26/tab-impact-on-total-time-spent/#comments</comments>
		<pubDate>Thu, 27 Dec 2007 01:44:24 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Mozilla]]></category>
		<category><![CDATA[Web Development]]></category>
		<category><![CDATA[ajax]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[browser]]></category>
		<category><![CDATA[firefox]]></category>
		<category><![CDATA[ie7]]></category>
		<category><![CDATA[nielsen netratings]]></category>
		<category><![CDATA[opera]]></category>
		<category><![CDATA[page views]]></category>
		<category><![CDATA[safari]]></category>
		<category><![CDATA[Tabs]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/archives/2007/12/26/tab-impact-on-total-time-spent/</guid>
		<description><![CDATA[As everyone in the industry knows, Nielsen/NetRatings no longer relies on page views instead preferring total time spent. This makes sense since ajax applications can have 1 page view, but keep a user for an hour. Not to mention other &#8230; <a href="http://robert.accettura.com/blog/2007/12/26/tab-impact-on-total-time-spent/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>As everyone in the industry knows, Nielsen/NetRatings <a href="http://www.forbes.com/media/2007/07/10/internet-advertising-nielsen-biz-media-cx_lh_0710pageviews.html">no longer relies on page views</a> instead preferring total time spent.  This makes sense since ajax applications can have 1 page view, but keep a user for an hour.  Not to mention other things like video or Flash.  The use of time spent is likely much more accurate.  In my mind &#8220;time spent&#8221; is time actually spent on the site (I&#8217;m a literal guy).</p>
<p>This of course raises an interesting question.  How do tabs influence this metric?  Take the following situation as an example.  A user visits a home page, and opens a link in a new tab.  Then finds another link and opens it in a new [background] tab.  That&#8217;s 3 tabs in 1 visit (assume visit to be 30 minutes).</p>
<p>Before tabs, most browser sessions would look like this:<br />
<img src="http://robert.accettura.com/wp-content/uploads/2007/12/20071226_linear_pathing.png" alt="Linear Pathing" class="centered" /></p>
<p>There&#8217;s now an increasing number that will look like this (gray is a tab not in view):<br />
<img src="http://robert.accettura.com/wp-content/uploads/2007/12/20071226_tabbed_pathing.png" alt="Tabbed Pathing" class="centered" /></p>
<p>If we assume total time on the site is time between the first and last page, we potentially undercount the total time on sites that list information (for example Digg).  The total time to make those clicks could be &lt; 10 seconds, but the time spent reading those two page alone might be > 10 minutes.  Many tab power-users from what I&#8217;ve read around the web over the years essentially use them as a way to bookmark their &#8220;to read&#8221; list (including myself).  It also undercounts sites like Gmail which are ajax based (1 page) but can be used for several minutes.</p>
<p>If we use javascript to &#8220;ping&#8221; (call back by placing a tracker gif) the analytics service every x seconds to see if the page is still open, we potentially double count since a user can&#8217;t be in 3 tabs at once.  The clock would be counting 3 seconds for every 1 second the user is actually looking at the page.</p>
<p>This raises the question: are sites that are heavily used by Firefox, Safari, Opera and IE7 site underestimated or overestimated because of the way users browse the site?  How do you accurately tell how long a view is when a user can have multiple tabs?</p>
<p>Another example is someone who keeps their webmail open in a tab all afternoon for easy access.  They may only check it 1x measuring no more than 1 minute in actual attention.  But it&#8217;s open for 5 hrs.  What is the real time on the page?  You can measure my interaction (opening/closing mail).  But what if I&#8217;m reading an email for an hour (it&#8217;s a really complicated one)?  How does that compare to just leaving it open in the background?</p>
<p>This is really no different than using new windows, the difference being that most people seem to have found windows to be annoyance, while tabs are a &#8220;feature&#8221;.  The increase in usage and popularity in a time where visit length matters raises an interesting question.  How do you measure it?</p>
<p>One assumption is that it&#8217;s just a small percentage of the population, which is likely true.  The problem with this assumption is that it&#8217;s one subject to change as the browserscape matures and users learn about new features.  Another assumption is to just account for all time a page is open, even if it&#8217;s not visible.  The downside I see here is that it&#8217;s pretty inaccurate.  As a content producer I&#8217;d like to know if my content is used, or just loaded on a users computer.  If I were an advertiser I&#8217;d care even more.</p>
<p>I&#8217;m not sure how analytics firms approach this.  In a sense it&#8217;s similar to the &#8220;<a href="http://en.wikipedia.org/wiki/Web_analytics#The_hotel_problem">hotel problem</a>&#8220;.  Perhaps just something you need to decide upon and live with.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/archives/2007/12/26/tab-impact-on-total-time-spent/#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2007/12/83e8ef5.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2007/12/26/tab-impact-on-total-time-spent/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

