<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Robert Accettura&#039;s Fun With Wordage &#187; twitter</title>
	<atom:link href="http://robert.accettura.com/blog/tag/twitter/feed/" rel="self" type="application/rss+xml" />
	<link>http://robert.accettura.com</link>
	<description>Robert Accettura&#039;s Personal Blog on Web Development and Tech</description>
	<lastBuildDate>Thu, 09 Feb 2012 01:43:47 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<atom:link rel='hub' href='http://robert.accettura.com/?pushpress=hub'/>
<cloud domain='robert.accettura.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
		<item>
		<title>Web App Stores Via Twittter/Facebook</title>
		<link>http://robert.accettura.com/blog/2012/01/25/web-app-stores-via-twittterfacebook/</link>
		<comments>http://robert.accettura.com/blog/2012/01/25/web-app-stores-via-twittterfacebook/#comments</comments>
		<pubDate>Wed, 25 Jan 2012 20:59:48 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Web Development]]></category>
		<category><![CDATA[app store]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=7293</guid>
		<description><![CDATA[It seems likely to me that Facebook and Twitter will eventually be competing with Apple in terms of App stores. Facebook sort of already is with their extensive apps platform, however that&#8217;s just competing for developer attention. Twitter doesn&#8217;t really &#8230; <a href="http://robert.accettura.com/blog/2012/01/25/web-app-stores-via-twittterfacebook/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>It seems likely to me that Facebook and Twitter will eventually be competing with Apple in terms of App stores.  Facebook sort of already is with their extensive apps platform, however that&#8217;s just competing for developer attention.  Twitter doesn&#8217;t really have an equivalent today (developers mainly build clients and interact with data), but don&#8217;t underestimate their clout.</p>
<p>The reason I say this is that Facebook and Twitter have become identity gatekeepers on the net.  Already you can login to many sites via accounts with one of the two sites.  Creating the API&#8217;s to handle purchase/subscriptions and transparently handling the billing to effectively turning a HTML5 site into an &#8220;app&#8221; is the next logical step.  They could undercut Apple and still walk away with a handsome profit for not doing terribly much more than leveraging their size and reach.  These apps would work on any device with a web browser.  Desktop or mobile.</p>
<p>Given both sites need to diversify revenue streams (something Google never figured out), it seems only logical to make this step.  $0.99 for <a href="http://chrome.angrybirds.com/">Angry Birds</a> seems more than plausible.</p>
<p>And yes, there are offline abilities in a browser.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=7293#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2012/01/7d91786.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2012/01/25/web-app-stores-via-twittterfacebook/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Who Indexes Tweets</title>
		<link>http://robert.accettura.com/blog/2009/12/06/who-indexes-tweets/</link>
		<comments>http://robert.accettura.com/blog/2009/12/06/who-indexes-tweets/#comments</comments>
		<pubDate>Sun, 06 Dec 2009 20:49:41 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[baidu]]></category>
		<category><![CDATA[bing]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=3160</guid>
		<description><![CDATA[I was curious who is indexing the links that people tweet on Twitter. It&#8217;s obvious someone does since links get &#8216;clicks&#8217; almost immediately after submission. To do this presumably they are tapping into the xmpp firehose. Lets take a look: &#8230; <a href="http://robert.accettura.com/blog/2009/12/06/who-indexes-tweets/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I was curious who is indexing the links that people tweet on Twitter.  It&#8217;s obvious someone does since links get &#8216;clicks&#8217; almost immediately after submission.  To do this presumably they are tapping into the <a href="http://blog.twitter.com/2008/07/twitter-and-xmpp-drinking-from-fire.html">xmpp firehose</a>.</p>
<p>Lets take a look:</p>
<pre>66.xxx.xxx.xxx - - [06/Dec/2009:20:17:43 +0000] "GET /test HTTP/1.1" 301 20 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
</pre>
<p>I guess Google has a deal with Twitter.  Googlebot indexed just a few seconds after it was sent.  As far as I know nothing is actually announced.  This is the first evidence I know of a potential deal of some sort.  I&#8217;d be shocked if Google is scraping the site this quickly.</p>
<p><strong>Edit:</strong> Stephen Duncan pointed out in the comments that this was <a href="http://googleblog.blogspot.com/2009/10/rt-google-tweets-and-updates-and-search.html">announced in October</a>.  Totally forgot about that.</p>
<pre>208.xxx.xxx.xxx - - [06/Dec/2009:20:17:47 +0000] "GET /test HTTP/1.0" 301 - "-" "Mozilla/5.0 (compatible; Butterfly/1.0; +http://labs.topsy.com/butterfly.html) Gecko/2009032608 Firefox/3.0.8"
</pre>
<p>This is <a href="http://topsy.com/">Topsy</a>, a twitter search engine. Never saw this site before. Few tests and I actually kind of like the output.</p>
<pre>89.xxx.xxx.xxx - - [06/Dec/2009:20:17:58 +0000] "GET /test HTTP/1.1" 301 - "-" "Mozilla/5.0 (compatible; MSIE 6.0b; Windows NT 5.0) Gecko/2009011913 Firefox/3.0.6 TweetmemeBot"
</pre>
<p><a href="http://tweetmeme.com/">Tweetmeme</a> mines Twitter links and attempts to build a Digg-like index based on retweets rather than Diggs.</p>
<pre>75.xxx.xxx.xxx - - [06/Dec/2009:20:18:05 +0000] "GET /test HTTP/1.1" 301 - "-" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0)"
72.xxx.xxx.xxx - - [06/Dec/2009:20:20:25 +0000] "GET /test HTTP/1.1" 301 - "-" "Python-urllib/2.5"
</pre>
<p>Can&#8217;t identify these AWS hosted services.</p>
<pre>70.xxx.xxx.xxx - - [06/Dec/2009:20:20:53 +0000] "GET /test HTTP/1.1" 301 20 "-" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0)"
70.xxx.xxx.xxx - - [06/Dec/2009:20:24:23 +0000] "GET /test HTTP/1.1" 301 20 "-" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0)"
</pre>
<p>This is actually Microsoft.  Microsoft&#8217;s <a href="http://www.bing.com/Twitter">Bing</a> search engine indexes Twitter.  I&#8217;m not sure why they indexed twice in such close intervals that seems odd for this day and age.</p>
<p>Mining logs a little deeper it looks like when tweets meet certain criteria (such as retweeted) there are other bots that spider them.  It also looks like other search engines may be indexing at a slower rate (<a href="http://www.baidu.jp/">Baidu</a> for example).</p>
<p>There are several others from AWS and a few other dedicated providers.  These servers are obviously trying to keep a low profile, they don&#8217;t even have reverse DNS.</p>
<p>So there you go.  Just a matter of seconds after a link hits Twitter this all happens.</p>
<p>Here&#8217;s a few more from another Tweet that weren&#8217;t in the first set:</p>
<p><strong>Edit: More!:</strong></p>
<pre>75.xxx.xxx.xxx - - [06/Dec/2009:20:49:42 +0000] "GET /test HTTP/1.1" 301 - "-" "Mozilla/5.0 (compatible; Feedtrace-bot/0.2; bot@feedtrace.com)"
</pre>
<p><a href="http://www.feedtrace.com/">Feedtrace</a> is some sort of twitter mining service currently in beta.</p>
<pre>67.xxx.xxx.xxx - - [06/Dec/2009:20:49:45 +0000] "GET /test HTTP/1.0" 301 - "-" "Mozilla/5.0 (compatible; mxbot/1.0; +http://www.chainn.com/mxbot.html)"
</pre>
<p><a href="http://www.chainn.com/">Chainn</a> is a social data mining service with a few apps that make use of the data it collects.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=3160#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2009/12/9808ae3.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2009/12/06/who-indexes-tweets/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Capturing User Innovation</title>
		<link>http://robert.accettura.com/blog/2009/05/30/capturing-user-innovation/</link>
		<comments>http://robert.accettura.com/blog/2009/05/30/capturing-user-innovation/#comments</comments>
		<pubDate>Sun, 31 May 2009 02:01:22 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Mozilla]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[firefox]]></category>
		<category><![CDATA[innovation]]></category>
		<category><![CDATA[software-development]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[user experience]]></category>
		<category><![CDATA[user innovation]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=2714</guid>
		<description><![CDATA[Building a new product is always fun. You draft ideas, generate wireframes, mockups, prototypes, you build your app, you tweak it, you release it. In the case of software and web applications you also get to update it and make &#8230; <a href="http://robert.accettura.com/blog/2009/05/30/capturing-user-innovation/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Building a new product is always fun.  You draft ideas, generate wireframes, mockups, prototypes, you build your app, you tweak it, you release it.  In the case of software and web applications you also get to update it and make it better.  If it&#8217;s hardware, you work on a 2nd revision to be sold a year later to people who didn&#8217;t adopt early (jab at early adopters).  </p>
<p>One of the most interesting things is how users actually use the product you make, if they use it at all.  Do they use it a little or a lot?  Do they use it as intended?  Do they find things missing?  To robust for their taste?  Or do they just find uses and modifications that all the engineers involved never in a million years would have contemplated?</p>
<p><span id="more-2714"></span></p>
<h3>Historical User Innovation</h3>
<p>User innovation isn&#8217;t a new thing.  Our brains were designed to tinker and reinvent.  The <a href="http://en.wikipedia.org/wiki/Circle">wheel</a> is nothing more than a plane where all points are an equal distance from a given point (centre).  <a href="http://en.wikipedia.org/wiki/Fire">Fire</a> is the oxidation of a combustible material releasing heat, light, and various reaction products.  Both are natural logical things that mankind has come up with many clever &#8220;hacks&#8221; (todays term) for.  We&#8217;ve even combined this seemingly unrelated geometric wonder and potentially dangerous reaction to come up with the combustion engine powered motor coach (car).</p>
<p>The <a href="http://en.wikipedia.org/wiki/Wright_Flyer">Flier I</a>, the first plane the Wright Brothers famously flew in 1903 used a chain drive to power the propellers from the single centrally mounted motor.  The chain drive is bicycle technology the wright brothers knew well from bike repair and sale business.</p>
<p><a href="http://science.ksc.nasa.gov/history/apollo/apollo-13/apollo-13.html">Apollo 13</a> is the story of finding unintended uses for technology and to bring 3 astronauts home safely.  First they used the Lunar Module (LEM) as a makeshift &#8220;lifeboat&#8221;.  If that wasn&#8217;t clever enough they utilized plastic bags, cardboard, tape, and a hose from a space suit to rig some air filters from the command module to fit the LEM (you can find an amazing picture of that <a href="http://grin.hq.nasa.gov/ABSTRACTS/GPN-2002-000056.html">here</a>).</p>
<p>These are just a few more notable examples but you could find millions more, from the old &#8220;soldiers tricks&#8221; you&#8217;ve heard time and time again like soldiers in the past using condoms to keep water and debris out of their gun among <a href="http://www.scribd.com/doc/7441205/Best-of-the-Ranger-Digest-Soldiers-Tips-and-Tricks">others</a>.</p>
<h3>Historic Computing User Innovation</h3>
<p>Computing has long been about user innovation.  The most obvious and the least recognized is computer games.  Long before computers were ever intended to be used for recreational purposes <a href="http://en.wikipedia.org/wiki/First_video_game">people were figuring out how to do it</a>.  While it seems silly it eventually turned into a <a href="http://en.wikipedia.org/wiki/Video_game_industry#Game_industry_value_chain">multi-billion dollar industry</a>.  Not bad for a nerdy trick on a cathode ray tube.</p>
<p>Then you had the geniuses at the <a href="http://www.bambi.net/bob/homebrew.html">Homebrew Computer Club</a> and and the <a href="http://www.virtualaltair.com/virtualaltair.com/vac_altair_8800_mainframe.asp">Altair 8800</a> which helped usher in a new era of computers that people could manipulate to do whatever they wanted.  They were hackers, and they remembered that when they starting building computers for consumers.  Their machines were customizable with software that could be developed easily (at least relatively speaking).  Leading to the modern era of user innovation&#8230;</p>
<h3>Modern Computing User Innovation</h3>
<p>There are many examples of user innovation, but I&#8217;ll give just a few to make a point:</p>
<p>Some systems facilitate user innovation for example through extensions and plugins.  Just take a look at Firefox and it&#8217;s robust <a href="https://addons.mozilla.org/">addons collection</a>.  On occasion features have even become bundled into the browser, for example <a href="https://bugzilla.mozilla.org/show_bug.cgi?id=179656#c47">tab reordering</a>.  Other features first get prototyped as extensions (the reporter tool being a perfect example).  Users are invited to develop their own features and innovations and share them with others.  Clever, but somewhat technical.  Removing the technical barrier is being done as part of <a href="https://labs.mozilla.com/2009/05/introducing-jetpack-call-for-participation/">Jetpack</a>. </p>
<p>Another more subtle example is how people search often then they tend to browse.  As many people with a domain name know, a noteworthy portion of traffic will come from people who just Google your domain name rather than enter it in the URL bar.  It seems in at least some minds search and navigation have merged much more than in others minds.  I&#8217;ll touch more on this in an upcoming blog post.  For yet another example, leaving tabs open with stuff you want to read later.  I&#8217;ve even seen (and done myself) the practice of blank tabs as separators between different things I&#8217;m working on in 1 window.  There are extensions out there to help colorize to help organize as well.</p>
<p>One of my favorite stories is the Twitter <a href="http://twitter.pbworks.com/Hashtags">hash tag</a>.  Hash tags aren&#8217;t something that the Twitter developers created or even intended for.  They are something users just decided to do, and standardize on.  It&#8217;s democracy through chaos at it&#8217;s best.  Despite Twitter&#8217;s user generated data stream being <a href="http://blog.twitter.com/2008/07/twitter-and-xmpp-drinking-from-fire.html">compared to a fire hose</a> somehow people caught on and agreed on this standard.  Even cooler is how hash tags can spontaneously be created with seemingly little or no coordination.  Users solved one of Twitters largest problems (organizing data) completely on their own without any engineering effort.</p>
<p>Even hardware is becoming more open to being modified and manipulated by users.  Take for example the <a href="http://arduino.cc/">Arduino</a> (something I totally need to get and play with).</p>
<p>A standard Web 2.0 feature has been to provide an API and tons of feeds to allow developers to mashup and find new uses for your service.  Some have become insanely popular and helped websites, and others have fizzled at best.</p>
<p>There are many more examples of users innovating, but I will stop here.  Sites like <a href="http://hackaday.com/">Hack a Day</a> and <a href="http://lifehacker.com/">Lifehacker</a> do a good job documenting some of these.</p>
<h3>How To Capture User Innovation</h3>
<p>The big question is how do you capture this innovation and bake it into the product?  Some lucky products like Twitter seem to take a life of their own and organically evolve.  Others, notably non-web services like Firefox seem to take a bit more nurturing to find the diamonds, shine them up and bake them into the next version.  Others still can&#8217;t even manage that.</p>
<p>There seems to be two basic paths that an organization can take:</p>
<p><strong>Build the platform and let users take control</strong> &#8211; This is the Twitter method.  Works best for situations where technical knowledge required to shape the product is minimal.<br />
<strong>Build the platform and make it accessible for the technically inclined and willing</strong> &#8211; This is the API and feed route.  Let developers build cool things, and hope they will improve your offering in return.</p>
<p>Are there others?  </p>
<p>I think open source projects generally have an advantage do to their inherent open nature.  They fully intend for users to have control.  Those focused on content tend to have an advantage over those that focus on service.</p>
<p>It&#8217;s an interesting topic.  How can you take this organic user innovation and utilize it to either fix the underlying flaws to make a better product or bundle the innovation to enhance the experience for all users?  Just ask?  Usability studies?  More API&#8217;s?  Bribery?</p>
<p>User Innovation may be the best there is.  No engineer knows exactly how their product will be used and all the real life test cases.  Users however are the test case.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=2714#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2009/05/b628386.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2009/05/30/capturing-user-innovation/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Loren Brichter On Tweetie</title>
		<link>http://robert.accettura.com/blog/2009/04/25/loren-brichter-on-tweetie/</link>
		<comments>http://robert.accettura.com/blog/2009/04/25/loren-brichter-on-tweetie/#comments</comments>
		<pubDate>Sat, 25 Apr 2009 19:34:36 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[adobe air]]></category>
		<category><![CDATA[flash]]></category>
		<category><![CDATA[html5]]></category>
		<category><![CDATA[java]]></category>
		<category><![CDATA[loren brichter]]></category>
		<category><![CDATA[tweetie]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[UI]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=2655</guid>
		<description><![CDATA[Loren Brichter is the author of the popular Twitter application Tweetie, an iPhone only application until the Mac version was released on Monday. MacWorld has a great little interview with Loren. One thing I really admire is that Loren really &#8230; <a href="http://robert.accettura.com/blog/2009/04/25/loren-brichter-on-tweetie/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Loren Brichter is the author of the popular Twitter application <a href="http://www.atebits.com/">Tweetie</a>, an iPhone only application until the Mac version was released on Monday.  MacWorld has a great little <a href="http://www.macworld.com/article/140209/2009/04/tweetie_brichter_interview.html">interview</a> with Loren.  One thing I really admire is that Loren really understands how to build a good application.  Performance, ease of use, simplicity are all taken into account.  Not just features and toys.</p>
<p>I thought this particular nugget was the highlight though:</p>
<blockquote cite="http://www.macworld.com/article/140209/2009/04/tweetie_brichter_interview.html"><p>
..AIR apps are like modern day Java applets&#8230; sure, they run on every platform. But they also suck on every platform.
</p></blockquote>
<p>I&#8217;ve yet to find an Adobe AIR application I like even though several have great ideas behind them.  Even on Windows, where I presume AIR has the biggest market share they all look strange, the UI is garbage and the performance is abysmal.  On the Mac it gets even worse.  Creating a Mac theme won&#8217;t help as my expectations for a Mac UI are different than they are on Windows or Linux.  Java apps have the same issues.</p>
<p>I think this is why more and more &#8220;applications&#8221; are becoming web based.  If your going to feel awkward and unnatural to the user anyway, why even bother with the installation barrier?  Why not just be web based so you don&#8217;t have to download and install.  As awkward as they may be, those that add Adobe Flash tend to make the problem worse by adding more strange feeling UI to their application.  Adobe Flash does do good video, it&#8217;s a big reason YouTube became popular,  but it&#8217;s really no replacement for user interface.  Hopefully in 2017 when HTML5 is wrapping up we&#8217;ll have this problem solved.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=2655#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2009/04/e0688d1.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2009/04/25/loren-brichter-on-tweetie/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>User Generated Content Ownership</title>
		<link>http://robert.accettura.com/blog/2009/03/30/user-generated-content-ownership/</link>
		<comments>http://robert.accettura.com/blog/2009/03/30/user-generated-content-ownership/#comments</comments>
		<pubDate>Tue, 31 Mar 2009 02:08:13 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[flickr]]></category>
		<category><![CDATA[google docs]]></category>
		<category><![CDATA[intellectual property]]></category>
		<category><![CDATA[richard stallman]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[user-generated-content]]></category>
		<category><![CDATA[wordpress]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=2599</guid>
		<description><![CDATA[Since the creation of the &#60;form/&#62; elements people have been wondering about the ownership and copyright of content created online. From email and message boards in Web 1.0 to blogs and Twitter in Web 2.0 the same fundamental questions remain. &#8230; <a href="http://robert.accettura.com/blog/2009/03/30/user-generated-content-ownership/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Since the creation of the <code>&lt;form/&gt;</code> elements <a href="http://techdirt.com/articles/20090329/2229284297.shtml">people have been wondering</a> about the ownership and copyright of content created online.  From email and message boards in Web 1.0 to blogs and Twitter in Web 2.0 the same fundamental questions remain. </p>
<p>Lately, Twitter has been the focus.  Twitter is actually pretty clear about it&#8217;s claims to user generated content:</p>
<ol>
<li>We claim no intellectual property rights over the material you provide to the Twitter service. Your profile and materials uploaded remain yours. You can remove your profile at any time by deleting your account. This will also remove any text and images you have stored in the system.</li>
<li>We encourage users to contribute their creations to the public domain or consider progressive licensing terms.</li>
</ol>
<p>It&#8217;s pretty clear that Twitter is taking a hands off approach, but it doesn&#8217;t let users decide what they want.  I&#8217;m personally a fan of <a href="http://creativecommons.org/">Creative Commons</a> so my suggestion would be to let decide in their account settings how they wish to license and choose between CC licenses.  That of course makes retweeting complicated to put it nicely (it&#8217;s more like a minefield).  That&#8217;s likely the reason they avoid the licensing issue.  Sure you can put some sort of an icon next to the tweet to indicate the licensing, but what if someone retweets it?  Or modifies it ever so slightly?  Is it a new tweet?  How many characters must change for it to be a new one?  This is where it gets murky.</p>
<p>Yahoo owned <a href="http://www.flickr.com">Flickr</a> choose to solve this problem by letting users choose what copyright they want to impose, and include a Creative Commons option.  A very graceful solution though admittedly their situation is much simpler than Twitter&#8217;s since they don&#8217;t have to deal with complexities like retweeting which would make things very complicated.</p>
<p><a href="http://en.wordpress.com/tos/">WordPress.com</a> isn&#8217;t as clear in regards to it&#8217;s claims (or lack of) to copyright.  Though they are far from locking people in considering you can delete stuff at any time and download your entire blog and move it elsewhere.  <a href="http://www.ma.tt">Matt</a>&#8216;s been pretty open about giving users choice including the ability to leave WordPress.com.  There is of course room for improvement to clarify their stance on copyright ownership.  </p>
<p>Even Google has been criticized for copyright concerns regarding services like <a href="http://www.google.com/search?hl=en&#038;q=google+docs%2C+copyright">Google Docs</a>.</p>
<p>They could adopt the <a href="http://www.stallman.org/">Richard Stallman</a> stance to &#8220;intellectual property&#8221; (his airquotes), though that would alienate at least as many as it attracts.</p>
<p>While Twitter might be the hot topic today it&#8217;s hardly a problem exclusive to Twitter.  It&#8217;s an issue for virtually any site out there that accepts third party content.  It gets more complicated when content can be remixed and redistributed.</p>
<p>The reality is people should know what rights they are giving up by putting content on these or any other services, but people rarely do.  Perhaps a great Creative Commons project would be to create the same simplified icon/license system but for websites that allow users to submit content.  The licenses would indicate what the impacts of the Terms of Service jargon are in plain English.  It&#8217;s essentially the inverse of what they do now.  Label the service as well as the content.</p>
<p>So what&#8217;s the best solution?
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=2599#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2009/03/50abc3e.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2009/03/30/user-generated-content-ownership/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Amazon S3 Outage</title>
		<link>http://robert.accettura.com/blog/2008/07/20/amazon-s3-outage/</link>
		<comments>http://robert.accettura.com/blog/2008/07/20/amazon-s3-outage/#comments</comments>
		<pubDate>Mon, 21 Jul 2008 02:13:55 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Web Development]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[amazon-s3]]></category>
		<category><![CDATA[cdn]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[mirror image]]></category>
		<category><![CDATA[reliability]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[uptime]]></category>
		<category><![CDATA[varnish]]></category>
		<category><![CDATA[wordpress.com]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=1845</guid>
		<description><![CDATA[The buzz around the web today was the outage of Amazon&#8217;s S3. It shows what websites are &#8220;doing it right&#8221;, and who fails. This is a great follow up to my &#8220;Reliability On The Grid&#8221; post the other day. Amazon &#8230; <a href="http://robert.accettura.com/blog/2008/07/20/amazon-s3-outage/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>The buzz around the web today was the <a href="http://www.readwriteweb.com/archives/more_amazon_s3_downtime.php">outage of Amazon&#8217;s S3</a>.  It shows what websites are &#8220;doing it right&#8221;, and who fails.  This is a great follow up to my &#8220;<a href="http://robert.accettura.com/blog/2008/07/13/reliability-on-the-grid/">Reliability On The Grid</a>&#8221; post the other day.</p>
<p>Amazon S3 is cloud based computing.  Essentially when you send them a file using their REST or SOAP interface Amazon stores it on multiple nodes in their infrastructure.  This provides redundancy and security (in case a data center catches fire for example).  Because of this design it&#8217;s often though that cloud based computing is invincible to problems.  This is hardly the fact.  Just like any large system, it&#8217;s complicated and full of hazards.  It takes only a small software glitch, or an unaccounted for issue to cause the entire thing to grind to a halt.  More complexity = more things that can fail.</p>
<p>Amazon S3 is popular because it&#8217;s cheap and easy to scale.  It&#8217;s pay-per-use based on bandwidth, disk storage, and requests.  Because that allows for websites to grow without having to make a large infrastructure investment, it&#8217;s popular for &#8220;Web 2.0&#8243; companies trying to keep their budgets tight.  Notably sites like Twitter, WordPress.com, SmugMug and Amazon.com themselves all use Amazon S3 to host things like images.</p>
<p>Many sites, notably Twitter, and SmugMug didn&#8217;t have a good day today.  WordPress.com and Amazon.com operated like normal.  The obvious reason for this is WordPress.com and Amazon.com are much better in terms of infrastructure and design.</p>
<p>WordPress.com uses S3, but proxies that with <a href="http://varnish.projects.linpro.no/">Varnish</a>.  There&#8217;s a <a href="http://ma.tt/2007/10/s3-news/">brief description here</a>, and a <a href="http://blog.apokalyptik.com/2007/10/10/so-you-wanna-see-an-image/">more detailed breakdown here</a>.  According to <a href="http://barry.wordpress.com/2008/02/15/amazon-aws-outage/">Barry Abrahamson</a>, WordPress.com does 1500 image requests per second across and 80-100 are served through S3.  They have (slower) back up&#8217;s in house for when S3 is down and can failover if S3 has a problem.  This means they can leverage S3 to their advantage, but aren&#8217;t down because of S3.  Using Varnish allows them to keep the S3 bill down by using their own bandwidth (likely cheaper since they are a large site and can get better rates on bandwidth).  This  also and lets them have this have a good level of redundancy.  Awesome job.</p>
<p>Amazon.com uses S3 themselves.  If you look at images on the site, they are actually served from <code>g-ecx.images-amazon.com</code>.  Which is actually:</p>
<pre>
g-ecx.images-amazon.com. 38     IN      CNAME   ant.mii.instacontent.net.
</pre>
<p><code>instacontent.net</code> is actually part of <a href="http://www.mirror-image.com">Mirror Image</a>, a CDN.  This is essentially outsourcing what WordPress.com is doing in terms of caching.  It&#8217;s similar to Akamai&#8217;s services.  A CDN&#8217;s biggest advantage is lowering latency by using servers closer to the customer, which are generally going to feel faster.  The other benefit is that they cache content for when the origin is having problems.  Because Amazon has a layer on top of S3, they have an added level of protection and remained up and images loaded.</p>
<p>Twitter serves most images such as avatars right off of S3.  This means when S3 went down, there were thousands of dead images on their pages.  No caching, not even a <code>CNAME</code> in place.  Image hosting is the least of their concerns.  Keeping the service up and running is their #1 concern right now.  The service was still usable, just ugly.  Many users take advantage of third party clients anyway.</p>
<p>Using a CDN or having the infrastructure in house is obviously more expensive (it makes S3 more of a luxury than a cost savings measure), but it means your not depending on one third party for your uptime.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=1845#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2008/07/59bcda7.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2008/07/20/amazon-s3-outage/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Reliability On The Grid</title>
		<link>http://robert.accettura.com/blog/2008/07/13/reliability-on-the-grid/</link>
		<comments>http://robert.accettura.com/blog/2008/07/13/reliability-on-the-grid/#comments</comments>
		<pubDate>Mon, 14 Jul 2008 00:45:46 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Web Development]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[amazon-s3]]></category>
		<category><![CDATA[data portability]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[reliability]]></category>
		<category><![CDATA[Security]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[uptime]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=1815</guid>
		<description><![CDATA[There’s been a lot of discussion lately (in particular NYTimes, Data Center Knowledge) regarding both reliability of web applications which users are becoming more and more reliant on, as well as the security of such applications. It’s a pretty interesting &#8230; <a href="http://robert.accettura.com/blog/2008/07/13/reliability-on-the-grid/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>There’s been a lot of discussion lately (in particular <a href="http://www.nytimes.com/2008/07/06/technology/06outage.html?_r=3&#038;partner=rss&#038;oref=slogin&#038;oref=slogin">NYTimes</a>, <a href="http://www.datacenterknowledge.com/archives/2008/Jul/06/is_downtime_more_frequent_or_more_visible.html">Data Center Knowledge</a>) regarding both reliability of web applications which users are becoming more and more reliant on, as well as the security of such applications.  It’s a pretty interesting topic considering there are so many things that ultimately have an impact on these two metrics.  I call them metrics since that’s what they really are.</p>
<p><span id="more-1815"></span></p>
<h4>Defining uptime, security, and privacy</h4>
<p>For the intents of the discussion at hand, &#8220;uptime&#8221; is defined as the application being accessible and functional to the user.  Note putting a &#8220;fail whale&#8221;? image up so that the page loads doesn’t not qualify as functional.  For all intents and purposes the service is down.  One should also note that traffic goes through different routes to get to different users, hence a site can be down for one person, but up for millions of others.  The vast majority (95%+) should be able to use the service for it to really be considered &#8220;up&#8221;.</p>
<p>&#8220;Security&#8221; defined as the assurance that privacy, data integrity, and account access are restricted in accordance with typical site functionality and users understanding.  &#8220;Privacy&#8221; is defined as not allowing any unauthorized person or entity to manipulate, view, copy, handle, destroy, or know about the existence of data without explicit approval from the user.   </p>
<h4>Why applications fail</h4>
<p>Applications can fail for many reasons, but most can be lumped into a handful of categories.  At the highest level you have in-house and upstream reasons.  In-house can be defined as something you can control, for example software or servers you control, while upstream is typically a vendor or partner, for example ISP, colo facility, etc, which there is less control over (other than submitting a ticket).  Generally startups have more upstream services and bring more things in-house as time goes on.  For example, Facebook relies on colo facilities for their servers.  They now plan to build their own (more control, and hopefully will ensure lower costs as well).</p>
<p>On a lower level you break things down to hardware and software.  Hardware failures are inevitable.  Computers suck in a 24&#215;7 environment.  We deal with that since they are better than people, who still insist on sleep (lazy bastards).  Hard drives fry, motherboards fail, fans die resulting in &#8220;thermal events&#8221;?.  Generally it’s pretty easy to deal with this.  You can use RAID so 1 hard drive isn’t critical after all, moving parts are the most prone to failure.  You can also have more than one server powering a successful application.  If one dies, the load goes to other boxes running on the grid.  You can put them in different data centers so if there was a problem at one, your still up and running.  This obviously comes at a cost.  Services like Google App Engine, Amazon’s S3 and Amazon’s EC2 help lower the cost, but also result in hardware being handled by an upstream provider.  Amazon and Google are very redundant, but they too can and have failed.  </p>
<p>Software generally fails because it either wasn’t designed to scale, or it was hatefully put together to meet a deadline.  Startups are infamous for this as the business guys just want things done quick and cheap and don’t care about reliability until it’s too late (they will also deny this until the end of time).  All major software platforms can scale when done correctly.  Many people say Perl can’t scale, but it has for a decade, look at IMDB, Amazon and Slashdot among the many.  Even more claim PHP can’t scale, but Facebook and Yahoo seem to run fine.  Python (YouTube), Ruby-On-Rails (YellowPages.com, Hulu, 43things)  ASP.NET (MySpace and Microsoft) all seem workable in high traffic situations. It’s not what you use, but how you use it.  These run on Apache, IIS, Oracle, MySQL, among others.  The platform is rarely (if ever) the problem.  The implementation almost always is.</p>
<p>There’s also the possibility that everything is fine and dandy, but somewhere along the internet from the servers to some of your users there’s a problem.  ISP’s encounter tons of problems with people snagging their fiber and tearing a line, to DoS attacks and viruses reeking havoc.  When this happens close to the user, no sites are accessible, when this happens further away several sites may be inaccessible or slow.  Users often wrongly attribute this to a site or application being slow or down when that’s hardly the case.  Using a data center with good connectivity reduces these cases.  Having data centers distributed around the globe is even better, but often not economical.  The best a business can do is submit a ticket and wait.  If it’s frequent enough they can move somewhere else.</p>
<h4>Why security fails</h4>
<p>Security failure is almost too complex of a topic to discuss without holding a complete college course.  The most obvious answer is that someone is cleverer than the person in charge of security, and outwitted or outsmarted them.  It could be in the physical form (stealing a server or hard drive with data), or in the electronic form (Phishing, XSS, DoS).  It could be a &#8220;hacker&#8221;, or it could be an application failure that results in a security glitch.</p>
<p>Many websites take several measures to protect your privacy.  They require &#8220;strong&#8221; passwords, maybe even require you to change them.  For things like banking you may have &#8220;security questions&#8221; to answer.  Perhaps even a key fob to provide two factor authentication.  </p>
<p>Most security failure can be traced to stupidity.  For example using &#8220;password&#8221; for your password, or replying to an email asking for your password.  A poorly configured server can also be a vulnerability.  Then all you need is someone who wants to exploit that.  If the data is of any value, that person exists.  </p>
<h4>When businesses fail</h4>
<p>Hackers want your data, business want to keep it secure, but don’t want to spend too much time/effort on it since the formula is <code>time = money</code>.  There’s really not much more to explain here.  </p>
<p>Sometimes it’s not even the business you know your dealing with.  You may be working with company X, but they may use company Y, Z, A1, A2, A3, and A4 to actually provide their services.  Your data may be accessible by any or all of them.  </p>
<p>Then there’s the possibility of a business going out of business.  They may give you a chance to download your data and move it elsewhere or they may even do it for you.  They may also just shut down abruptly and disappear of the face of the earth.  Goodbye data.</p>
<h4>Take control of your data</h4>
<p>I may sound cynical for effectively saying applications fail, many people could potentially see your data, and there’s nothing you can do about it.  I&#8217;m not, I am a realist, and I know what goes on behind the scenes.  There actually is something you can do about it:  <em>Take control of your data.  Keep control of your data</em>.</p>
<ol>
<li>Know who has your data, what they might do to it, who they might share it with, and what they will do to protect it.  Companies (at least reputable ones) post privacy policies for a reason.  Check them out or Google for some info on that company.  The results may surprise you.  For example if you delete something from Google Docs it may take 3 weeks for it to actually be deleted on their servers.  This isn’t uncommon, but many people assume once you delete it, the company deletes it.  That’s not the case.</li>
<li>Think about accessibility.  What happens if that application has an outage?  What happens if your ISP has a problem?  Or your cable line got cut?  Using an online office suite is a great way to keep documents accessible from home or work, but not so great when you can’t access them.  Storing them on a USB drive may prove useful, at least as a backup.  If you&#8217;ve got a business, this is especially true.  You may also want to consider a 2nd way to get online should your ISP have problems (giving a wireless card and a laptop to certain employees may also have the perk of  allowing employees to be more mobile).</li>
<li>Decide the fate of your data.  I personally prefer to keep a copy of everything so if a company goes under, I still have my data.  I host my own blog, and my own photos.  I keep backups of all that too, in multiple locations.  I know I’ll be around as long as I care about keeping that data online.  I’m not going out of business.  If I am, I don’t care about that data anymore <img src='http://robert.accettura.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />  .  I always have my data.  You should too.</li>
</ol>
<h4>Keep control of your data</h4>
<p>Just because you’ve figured out how to protect your data, doesn’t mean you’re done.  You need to reevaluate yourself every time you start using something else, or change your usage patterns.  You don’t have to keep your data offline, just understand what putting it online really means.  Offline backups aren&#8217;t a bad idea.  Having backups on another service is also an option, but may be even more complicated.</p>
<p>This is somewhat more complicated in the case of things like social networks, but things like <a href="http://www.dataportability.org/">Data Portability</a> are slowly becoming a reality.</p>
<p>Google in general has been pretty good with leaving the options to take your data back.  Gmail lets you use IMAP to download all your mail, Google Reader lets you export an OPML feed, Google Docs lets you save all your docs to your computer.  It&#8217;s important to know what the services you rely on let you do with your data.  Don&#8217;t just assume you can easily get it out.  </p>
<h4>You’re responsible for your fate</h4>
<p>It’s easy to blame Google, Microsoft, Yahoo, or Twitter for your problems, but that’s really a poor excuse.  You’re responsible for the choices you make, and what you rely on.  If what you’re relying on isn’t giving you what you need, you need to find something else, or reevaluate if your putting your priorities in the right place.</p>
<p>I now present to you&#8230;</p>
<h4>Accettura&#8217;s Law Of Business Computing</h4>
<pre>
where
people = prone to frequent failures
technology = expensive, complex, frequent failure

business computing = people + technology = complex frequent failures that are costly in nature.
</pre>
<p>You can see how this works right?  Best way to avoid that cost?  Make sure your technology is redundant, and your people&#8217;s interaction is controlled to prevent failure from leaking into the technology.</p>
<p>This should be in Wikipedia and every Business and CompSci textbook.  That way everything that a student touches or thinks about in this industry is done with this in mind.  Build with the knowledge in mind that the <a href="http://www.flickr.com/photos/scriptingnews/2537265280/">fail whale</a> will just make you a relic before you even hit your prime.</p>
<p>That said, <em>get over Twitter being down and stop complaining.</em>
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=1815#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2008/07/f0bbac6.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2008/07/13/reliability-on-the-grid/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Only Hours Left</title>
		<link>http://robert.accettura.com/blog/2008/06/16/only-hours-left/</link>
		<comments>http://robert.accettura.com/blog/2008/06/16/only-hours-left/#comments</comments>
		<pubDate>Tue, 17 Jun 2008 02:08:56 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Mozilla]]></category>
		<category><![CDATA[firefox]]></category>
		<category><![CDATA[firefox 3]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=1794</guid>
		<description><![CDATA[Something tells me that at about 10:00 PST / 1:00 EST something big will be happening. I sure hope someone is interested .]]></description>
			<content:encoded><![CDATA[<p><a href="http://twitter.com/mozillafirefox/statuses/836404961">Something</a> tells me that at about 10:00 PST / 1:00 EST something big will be happening.</p>
<p>I sure hope someone is interested <img src='http://robert.accettura.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />  .
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=1794#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2008/06/59e0b26.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2008/06/16/only-hours-left/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Secrets In Websites II</title>
		<link>http://robert.accettura.com/blog/2008/01/11/secrets-in-websites-ii/</link>
		<comments>http://robert.accettura.com/blog/2008/01/11/secrets-in-websites-ii/#comments</comments>
		<pubDate>Sat, 12 Jan 2008 02:25:49 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Around The Web]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Mozilla]]></category>
		<category><![CDATA[akamai]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[barack obama]]></category>
		<category><![CDATA[bill richardson]]></category>
		<category><![CDATA[campaign 2008]]></category>
		<category><![CDATA[chase]]></category>
		<category><![CDATA[chris dodd]]></category>
		<category><![CDATA[css]]></category>
		<category><![CDATA[democrat]]></category>
		<category><![CDATA[dennis kucinich]]></category>
		<category><![CDATA[duncan hunter]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[firefox]]></category>
		<category><![CDATA[fred thompson]]></category>
		<category><![CDATA[global crossing]]></category>
		<category><![CDATA[hillary clinton]]></category>
		<category><![CDATA[html]]></category>
		<category><![CDATA[joe biden]]></category>
		<category><![CDATA[john edwards]]></category>
		<category><![CDATA[john mccain]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[mike gravel]]></category>
		<category><![CDATA[mike huckabee]]></category>
		<category><![CDATA[mitt romney]]></category>
		<category><![CDATA[mtv]]></category>
		<category><![CDATA[myspace]]></category>
		<category><![CDATA[popurls]]></category>
		<category><![CDATA[redhat]]></category>
		<category><![CDATA[republican]]></category>
		<category><![CDATA[ron paul]]></category>
		<category><![CDATA[rudy guliani]]></category>
		<category><![CDATA[secrets in websites]]></category>
		<category><![CDATA[stephen colbert]]></category>
		<category><![CDATA[tom tancredo]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[wordpress]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/archives/2008/01/11/secrets-in-websites-ii/</guid>
		<description><![CDATA[This post is a follow up to the first Secrets In Websites. For those who don&#8217;t remember the first time, I point out odd, interesting, funny things in other websites&#8217; code. Yes it takes some time to put a post &#8230; <a href="http://robert.accettura.com/blog/2008/01/11/secrets-in-websites-ii/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>This post is a follow up to the first <a href="http://robert.accettura.com/blog/2007/01/20/secrets-in-websites/">Secrets In Websites</a>.  For those who don&#8217;t remember the first time, I point out odd, interesting, funny things in other websites&#8217; code.  Yes it takes some time to put a post like this together, that&#8217;s why it&#8217;s just about a year since the last time.  Enough with the intro, read on for the code.</p>
<p><span id="more-1270"></span></p>
<h3>The Code</h3>
<h4>WordPress.com</h4>
<p>You&#8217;ll notice a comment sure to make any web developer laugh on WordPress.com&#8217;s <a href="http://wordpress.com/wp-login.php">login page</a></p>
<pre>

	&lt;link rel=&quot;stylesheet&quot; href=&quot;http://wordpress.com/wp-admin/wp-admin.css?version=MU&quot; type=&quot;text/css&quot; /&gt;
	&lt;!--[if IE]&gt;
		&lt;style type=&quot;text/css&quot;&gt;#login h1 a { margin-top: 35px; } #login #login_error { margin-bottom: 10px; }&lt;/style&gt;&lt; ![endif]--&gt;
	&lt;!-- Curse you, IE! --&gt;
</pre>
<p>The guys behind <a href="http://www.wordpress.com">WordPress</a> a <a href="http://www.webstandards.org/press/releases/20050601/">while back</a> took the site <a href="http://browsehappy.com/">BrowseHappy</a> under its wing.  WordPress has always been a strong believer in web standards, so this isn&#8217;t surprising (though still amusing).  Did you also know that the guys behind it (<a href="http://automattic.com/about/">Automattic</a>) don&#8217;t have job titles?  Unless you consider &#8220;Chief BBQ Taste Tester&#8221; to be a real job title.  <a href="http://www.photomatt.net">Matt</a>, I hope your job doesn&#8217;t kill you with a heart attack.</p>
<h4>Facebook</h4>
<p>The geniuses over at <a href="http://www.facebook.com">Facebook</a> feel the same and put the following on the top of their IE conditionally included <a href="http://static.ak.facebook.com/css/ie6.css?48:76473">stylesheets</a>:</p>
<pre>

/*  ------------------------------------------------------------------------
                    Facebook | IE/PC Hacks | getfirefox.com
    ------------------------------------------------------------------------  */
</pre>
<h4>popurls</h4>
<p>The ever so popular <a href="http://www.popurls.com">popurls</a> has the following comment in the header of the page.</p>
<pre>

&lt;!--

  __   __
 (  \,/  )
  \_ | _/  IN THE FUTURE EVERY URL WILL BE POPULAR FOR 1.5 SECONDS
  (_/ \_)                  - thomas and the wise popurls butterfly

--&gt;
</pre>
<h4>RedHat</h4>
<p><a href="http://www.redhat.com">RedHat</a> was one of the earlier corporate sites to redo itself into a standards based design.  They have great respect for those who came before them.  In their <a href="http://www.redhat.com/s/master.css">master css</a> file they have the following tribute as well as a little remark about Netscape 4.x:</p>
<pre>

/* 	redhat.com MASTER style sheet

	a tip of the red hat to Zeldman, Bowman, Meyer, Shea, Cederholm, Newhouse, Holzschlag,
	and many, many other css and web standards pioneers who have inspired us. 

	the css, layout and validation status of redhat.com is a work-in-progress. numerous
	web-building worker bees are working furiously to correct the bugs, minimize the hacks
	and validate the code. stay tuned. 

-------------------------------------------------------------------- created June 2004 */
@import url(&quot;global.css&quot;);
...
@import url(&quot;dig.css&quot;);

/* ---------------------------------------------------------------- ns4 styles - bah! */

table {
	border: 1px;
	}
...
</pre>
<h4>Panic Software</h4>
<p><a href="http://www.panic.com/">Panic Software</a> has a cool little piece of code for those who browse the product page for <a href="http://www.panic.com/coda/">Coda</a> (awesome product btw) with IE and don&#8217;t have at least version 6.0:</p>
<pre>

		&lt;!--[if lte IE 6]&gt;&lt;p id=&quot;iewarning&quot;&gt;&lt;img src=&quot;/extras/ripoff/images/ie-warning.gif&quot; alt=&quot;IE Warning&quot; title=&quot;We hear Firefox is nice!&quot; /&gt;&lt;/p&gt;&lt; ![endif]--&gt;
</pre>
<p>I hear it&#8217;s pretty nice too.</p>
<p>Panic also has a comment in the head of their homepage that reads:</p>
<pre>

&lt;!-- This homepage design is not long for this world. Enjoy it while you can! <img src='http://robert.accettura.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  --&gt;
</pre>
<h4>Twitter</h4>
<p><a href="http://twitter.com/">Twitter</a> (who redirects to drop the &#8216;www&#8217; btw) is a very popular service these days.  In their html they mark which server served up the data.  You&#8217;ll see it in the form:</p>
<pre>

  &lt;!-- served to you through a copper wire by bennu.twitter.com at 24 Nov 19:08 in 11 ms (d 0 / r 8). thank you, come again. --&gt;
</pre>
<p>Copper eh?  No fiber in your data center?  I won&#8217;t judge, as long as your bandwidth is plentiful.</p>
<h4>WordPress.com</h4>
<p>Here&#8217;s a bonus from <a href="http://wordpress.com/">WordPress</a>.  While many analytics programs use a 1px transparent &#8220;tracker gif&#8221; to manage statistics, WordPress did something a little different.  At the very bottom on the left hand side, you can see the face of WordPress analytics in all it&#8217;s tiny glory.</p>
<h4>Mozilla</h4>
<p>This technically applies to more than just Firefox.  You&#8217;d be surprised to see how many times <code>kungFuDeathGrip</code> is in the <a href="http://lxr.mozilla.org/mozilla/search?string=kungFuDeathGrip">code base</a>.</p>
<h4>Many Sites using Google Products/Services</h4>
<p>Many people have noticed strange Google tags on sites such as:</p>
<pre>

code
&lt;!--googleoff: index--&gt;
all
&lt;!--googleon: index--&gt;
over
</pre>
<p>This isn&#8217;t a &#8220;SEO&#8221; practice, despite some misconception on the web.  This is used by the Google Search Appliance, a product made by Google which many websites use to power their own search engines to tell the engine what to read and what to ignore.  It wouldn&#8217;t be practical for Google to use these &#8220;in the wild&#8221;.  The reason is that spammers could effectively hide an alternate website within those comments.  Google&#8217;s business is based largely on accurate search results.  Spammers have already tried to abuse the css property <code>display: none;</code>.  This would be even better.  You can find code like this on <a href="http://www.apple.com">Apple.com</a> among many other sites.</p>
<p>Webmasters <em>can</em> however optimize their side for AdSense using a technique <a href="http://google.com/support/adsense/bin/answer.py?answer=23168">recommended by Google</a>:</p>
<pre>

&lt;!-- google_ad_section_start --&gt;

&lt;!-- google_ad_section_end --&gt;
</pre>
<p>This tells Google to give weight to a certain part of your page when deciding what ad to display on the page.  This is good for cases where you feel other material on your page is influencing the ads and resulting in off-topic ads.</p>
<h3>Infrastructure/Platform</h3>
<h4>Microsoft</h4>
<p><a href="http://www.microsoft.com">Microsoft</a>&#8216;s offering against Linux and Apache is IIS on Windows.  Which one would expect they themselves use.  What they don&#8217;t tell you is that they also have used <a href="http://www.akamai.com/">Akamai</a> (with over <a href="http://www.akamai.com/html/about/facts_figures.html">25,000</a> servers), which uses <a href="http://news.zdnet.co.uk/software/0,1000000121,39115920,00.htm">Linux</a>.  They have used Akamai for many things like DNS, and caching files.  Rather than &#8220;Powered By Windows Server&#8221; maybe they should append &#8220;hiding behind Linux&#8221;.</p>
<h4>Myspace.com</h4>
<p><a href="http://www.myspace.com">Myspace.com</a> was previously Adobe/Macromedia&#8217;s model customer because it was written in <a href="http://www.adobe.com/products/coldfusion/">ColdFusion</a>, and said to be the biggest ColdFusion site on the net (and one of the biggest sites on the net).  Many think it still is, but it&#8217;s not.  While many url&#8217;s suggest it might be because they end in <code>.cfm</code> it&#8217;s actually running ASP.net and has been <a href="http://weblogs.asp.net/scottgu/archive/2006/03/25/Handling-1.5-Billion-Page-Views-Per-Day-Using-ASP.NET-2.0.aspx">since aprox, 2006</a>.  You can confirm this by viewing the headers on some of their pages.  You&#8217;ll see:</p>
<pre>
X-AspNet-Version: 2.0.50727
</pre>
<h4>MTV.com</h4>
<p><a href="http://www.mtv.com">MTV.com</a>&#8216;s site has search powered by a Google Search Appliance.  MTV is also owned by <a href="http://www.techcrunch.com/2007/03/13/5217/">Viacom who sued Google</a>, the parent company of YouTube.  The folks at MTV awesomely <a href="http://labsblog.mtv.com/category/site-relaunch/">admitted the irony during relaunch on their blog</a>.</p>
<h4>Global Crossing</h4>
<p>Tier 1 networking provider <a href="http://www.globalcrossing.com/">Global Crossing</a> really wants you to know how fast they are.  Doing a trace could turn up something like this:</p>
<pre>
  7    15 ms    13 ms    14 ms  COMCAST-IP-SERVICES-LLC.tengigabitethernet1-4.ar5.NYC1.gblx.net [64.208.222.58]
  8    14 ms    13 ms    13 ms  tengigabitethernet1-4.ar5.NYC1.gblx.net [64.208.222.57]
</pre>
<p>Yes that&#8217;s right, they use 10 GigE!  Just FYI.</p>
<h3>Goofy</h3>
<h4>Firefox 2.0</h4>
<p>In Firefox 2.0, go to &#8220;About Firefox&#8221; (under the help menu for Windows, under the Firefox menu for Mac), and click on credits.  You&#8217;ll notice <a href="http://www.comedycentral.com/shows/the_colbert_report/index.jhtml">Stephen Colbert</a>.  He wrote it single handedly, but added some other names because he&#8217;s a nice guy.  Bonus: I&#8217;m on the list too.  Above him because I&#8217;m <a href="http://robert.accettura.com/blog/2007/11/17/who-dropped-the-soap/">better</a> than him.  That&#8217;s right, I said it.</p>
<h3>Handy</h3>
<h4>Chase</h4>
<p>Chase for some reason puts it&#8217;s login form in plain text.  The submit url is https, but it doesn&#8217;t feel right.  They do have a SSL enabled login page, but for some reason they hide it.  Here it is for those interested:</p>
<p><a href="https://chaseonline.chase.com/online/home/sso_co_home.jsp">https://chaseonline.chase.com/online/home/sso_co_home.jsp</a></p>
<h4>Google</h4>
<p>For some reason, most of Google&#8217;s services are insecure by default.  By simply going to https, you can use SSL for added security.<br />
Gmail: <a href="https://mail.google.com">https://mail.google.com</a><br />
Google Calendar: <a href="https://www.google.com/calendar">https://www.google.com/calendar</a><br />
Google Reader: <a href="https://www.google.com/reader">https://www.google.com/reader</a></p>
<p>On the <a href="2/">next page</a> is the 2008 US Presidential Candidate Campaign sites&#8230;</p>
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/archives/2008/01/11/secrets-in-websites-ii/#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2008/01/c850371.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2008/01/11/secrets-in-websites-ii/feed/</wfw:commentRss>
		<slash:comments>69</slash:comments>
		</item>
	</channel>
</rss>

