<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Robert Accettura&#039;s Fun With Wordage &#187; datacenter</title>
	<atom:link href="http://robert.accettura.com/blog/tag/datacenter/feed/" rel="self" type="application/rss+xml" />
	<link>http://robert.accettura.com</link>
	<description>Robert Accettura&#039;s Personal Blog on Web Development and Tech</description>
	<lastBuildDate>Thu, 09 Feb 2012 01:43:47 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<atom:link rel='hub' href='http://robert.accettura.com/?pushpress=hub'/>
<cloud domain='robert.accettura.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
		<item>
		<title>Data Center Power Consumption</title>
		<link>http://robert.accettura.com/blog/2009/06/13/data-center-power-consumption/</link>
		<comments>http://robert.accettura.com/blog/2009/06/13/data-center-power-consumption/#comments</comments>
		<pubDate>Sun, 14 Jun 2009 03:25:59 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[energy consumption]]></category>
		<category><![CDATA[green computing]]></category>
		<category><![CDATA[server]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=2757</guid>
		<description><![CDATA[It&#8217;s hardly a secret that there is a serious demand for saving power in data centers. In a recent Times Magazine article: Data centers worldwide now consume more energy annually than Sweden. And the amount of energy required is growing, &#8230; <a href="http://robert.accettura.com/blog/2009/06/13/data-center-power-consumption/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s hardly a secret that there is a serious demand for saving power in data centers.  In a recent <a href="http://www.nytimes.com/2009/06/14/magazine/14search-t.html?_r=2&#038;ref=magazine&#038;pagewanted=all">Times Magazine article</a>:</p>
<blockquote cite="http://www.nytimes.com/2009/06/14/magazine/14search-t.html?_r=2&#038;ref=magazine&#038;pagewanted=all"><p>
Data centers worldwide now consume more energy annually than Sweden. And the amount of energy required is growing, says Jonathan Koomey, a scientist at Lawrence Berkeley National Laboratory. From 2000 to 2005, the aggregate electricity use by data centers doubled. The cloud, he calculates, consumes 1 to 2 percent of the world’s electricity.
</p></blockquote>
<p>To put that in a little more perspective, the 2009 census for Sweden puts the population at 9,263,872.  Sweden&#8217;s population is just slightly higher than New York City (8,274,527 in 2007) or the state of New Jersey (8,682,661 estimate in 2008).  Granted Sweden&#8217;s population density is 20.6/km<sup>2</sup> compared to New York City&#8217;s 10,482/km<sup>2</sup> or New Jersey&#8217;s 438/km<sup>2</sup>.  Population density is important since that says a lot about energy consumption.  Dense populations require less energy thanks to communal resources.  I still suspect the average Swede uses less electricity than the average American anyway.  All these numbers were pulled from Wikipedia. </p>
<p>The US Department of Energy does have data on <a href="http://www.eia.doe.gov/cneaf/electricity/esr/table5.html">power consumption</a> and <a href="http://www.eia.doe.gov/cneaf/electricity/epa/epat2p2.html">capacity</a> as well as <a href="http://www.eia.doe.gov/oiaf/aeo/index.html">forecasts on consumption and production</a>.  The obvious downside in the data is the reliance on coal, oil and gas which have environmental impacts as well as political impacts and costs (we know about the instabilities of the oil market).  This is why companies with lots of servers like Google are looking very carefully at power generation alternatives such as hydroelectric and solar.</p>
<p>We all benefit from data center efficiency.  Lower cost computing is a big advantage to startups and encourages more innovation by removing price barriers.  It&#8217;s also an advantage to the general public since the technology and tricks learned eventually trickle down to consumers.  We already are seeing more efficient power supplies, some even beating the original 80 PLUS certification.</p>
<p>Perhaps if we started tracking &#8220;performance per watt&#8221; in addition to &#8220;watts per square foot&#8221; we&#8217;d be looking at things from a more sustainable perspective.</p>
<p>Data center capacity and consumption is pretty interesting when you look at all the variables involved.  Growth, power costs, facility size, technology available, even foreign politics play a role in what it costs to operate.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=2757#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2009/06/d58e2f0.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2009/06/13/data-center-power-consumption/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Engineering Efficiency</title>
		<link>http://robert.accettura.com/blog/2009/04/11/engineering-efficiency/</link>
		<comments>http://robert.accettura.com/blog/2009/04/11/engineering-efficiency/#comments</comments>
		<pubDate>Sat, 11 Apr 2009 21:23:35 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[ec2]]></category>
		<category><![CDATA[ecommerce]]></category>
		<category><![CDATA[green]]></category>
		<category><![CDATA[packaging]]></category>
		<category><![CDATA[s3]]></category>
		<category><![CDATA[shipping]]></category>
		<category><![CDATA[ups]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=2623</guid>
		<description><![CDATA[Internet companies have the unique ability to scale quicker than any other industry on earth. Never before has a company been able to position itself from being nothing more than an idea to being in the living rooms of millions &#8230; <a href="http://robert.accettura.com/blog/2009/04/11/engineering-efficiency/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Internet companies have the unique ability to scale quicker than any other industry on earth.  Never before has a company been able to position itself from being nothing more than an idea to being in the living rooms of millions around the globe in a matter of hours.  While this introduces seemingly unlimited opportunities to grow it also allows for exponential waste if a company isn&#8217;t careful.  It&#8217;s interesting to see how they scale.  Scaling businesses in many ways isn&#8217;t very different than scaling servers and software stacks.</p>
<h3>The Classic Example: UPS</h3>
<p>Started in 1907 and adopting the name United Parcel Service in 1919 UPS has no real &#8220;high tech&#8221; background unless you include the Ford Model T.  That doesn&#8217;t mean it couldn&#8217;t become more efficient.  UPS has made a science of the delivery business.  For example it&#8217;s famous for it&#8217;s <a href="http://www.nytimes.com/2007/12/09/magazine/09left-handturn.html">&#8220;no left&#8221; policy</a>.  Simply put they found that avoiding left turns means less time waiting at lights which means less fuel is wasted.  The more efficient routing formerly done by humans now computerized saves them 3 million gallons of fuel in <a href="http://www.boston.com/news/local/articles/2008/07/10/right_turns_make_the_most_out_of_gas/">2007 alone</a>.  Lets do the math:</p>
<p>Assuming they run 100% diesel at an average cost of $2.87/gallon in 2007 [<a href="http://tonto.eia.doe.gov/dnav/pet/hist/mg_tt_usA.htm">doe</a>] multiplied by 3 million that&#8217;s $8.61 million dollars by trying to avoid left turns.</p>
<p>Not bad for a souped up mapping application.</p>
<p>By having their drivers do things like turning of the ignition while unbuckling their seat belt at the same time, and scanning for the doorbell while walking towards the door (it&#8217;s easier to see from a distance than up close) they can shave time off of their routes.</p>
<p>Then of course there&#8217;s package tracking.  While customers might like to know in what city their weight loss taps are sitting tracking systems help reduce loss and monitor package routing for optimal efficiency.</p>
<h3>Cutting Utility Bills: Google</h3>
<p>Being the largest search engine, a large ad network, email provider, analytics firm, mapping service, video site, and whatever else they do means Google needs a ton of servers.  Cramming servers into data centers and keeping them cool to prevent hardware failures is a complicated and expensive task.  Keeping the whole thing powered is also really expensive.  <a href="http://robert.accettura.com/blog/2009/04/01/googles-data-center-secrets/">Google has scrutinized server designs</a> to eliminate all waste possible.  This has resulted in Google having more horsepower at a lower cost than their competitors.  Having more capacity at a lower cost means Google can do more at a lower cost than their competitors.  I won&#8217;t discuss Google in too much detail since they did a great job themselves recently and I mentioned it the other day in another blog post: <a href="http://robert.accettura.com/blog/2009/04/01/googles-data-center-secrets/">Google&#8217;s Data Center Secrets</a>.</p>
<h3>Shipping Efficiency: Amazon</h3>
<p>Amazon&#8217;s long been improving efficiency by using data collection and analysis to encouraging their customers to spend more.  Their algorithms to recommend related products you might be interested in is one of the best out there.  Their ordering system is streamlined to prevent customers from bailing before completion.  Their products are SEO&#8217;d to appear on the top of Google searches.  That doesn&#8217;t mean Amazon can&#8217;t improve other parts of their business.</p>
<p>Amazon several months ago started a Frustration-Free Packaging program.  Here&#8217;s how they <a href="http://www.amazon.com/gp/help/customer/display.html?nodeId=200285450">describe it</a>:</p>
<blockquote cite="http://www.amazon.com/gp/help/customer/display.html?nodeId=200285450"><p>
The Frustration-Free Package (on the left) is recyclable and comes without excess packaging materials such as hard plastic clamshell casings, plastic bindings, and wire ties. It&#8217;s designed to be opened without the use of a box cutter or knife and will protect your product just as well as traditional packaging (on the right). Products with Frustration-Free Packaging can frequently be shipped in their own boxes, without an additional shipping box.
</p></blockquote>
<p>The key here is <q>&#8220;can frequently be shipped in their own boxes&#8221;</q>.  By shipping a box alone rather than packaging they can skip a step in their warehouses (and the packaging materials that go with packaging something for delivery).  This also lowers the weight as those extra boxes don&#8217;t weigh 0 oz.  The frustration free packaging is also the perfect shape for efficiently filling trucks and strong enough to not crush easily thus lowering returns due to damage.</p>
<p>Amazon now even has a <a href="http://www.amazon.com/packaging-feedback">feedback form</a> [login required] for users to share what they think of their package.  This has the added bonus of helping further reduce the <a href="http://consumerist.com/tag/stupid-shipping-gang/">inefficient shipping practices</a> so common right now.</p>
<p>Amazon&#8217;s also done a significant amount of work on their infrastructure to make their servers scale well using tech such as <a href="http://aws.amazon.com/ec2/">EC2</a> and <a href="http://aws.amazon.com/s3/">S3</a>.  By selling capacity to other companies they able to take advantage of economy of scale as well as diversify their business beyond just retail.  Of course they are <a href="http://www.datacenterknowledge.com/archives/2008/11/07/amazon-building-large-data-center-in-oregon/">planning their data centers</a> to have access to cheap power.</p>
<p>These aren&#8217;t haphazard attempts at increasing efficiency, these are well calculated engineered approaches to removing even the smallest inefficiencies with the knowledge of how they compound as operations scale.  Aren&#8217;t they clever?
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=2623#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2009/04/8e68c3c.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2009/04/11/engineering-efficiency/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Rackspace Acquisitions</title>
		<link>http://robert.accettura.com/blog/2008/10/22/rackspace-acquisitions/</link>
		<comments>http://robert.accettura.com/blog/2008/10/22/rackspace-acquisitions/#comments</comments>
		<pubDate>Thu, 23 Oct 2008 01:46:37 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[Web Development]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[hosting]]></category>
		<category><![CDATA[jungledisk]]></category>
		<category><![CDATA[rackspace]]></category>
		<category><![CDATA[server]]></category>
		<category><![CDATA[slicehost]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=2072</guid>
		<description><![CDATA[Despite the bad economy, Rackspace is acquiring startup JungleDisk and SliceHost. This is a very interesting step on their behalf. Buying JungleDisk makes sense since Rackspace wants to get into the cloud storage business. JungleDisk is one of the bigger &#8230; <a href="http://robert.accettura.com/blog/2008/10/22/rackspace-acquisitions/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Despite the bad economy, Rackspace is <a href="http://ir.rackspace.com/phoenix.zhtml?c=221673&#038;p=irol-newsArticle&#038;ID=1215812&#038;highlight=">acquiring startup JungleDisk and SliceHost</a>.  This is a very interesting step on their behalf.</p>
<p>Buying JungleDisk makes sense since Rackspace wants to get into the cloud storage business.  JungleDisk is one of the bigger Amazon S3 products out there.  By adding Rackspace support to the product they can quickly attempt to get into that market.  If they will succeed depends on their offering&#8217;s cost.  Their press release suggests $0.15/GB, but that doesn&#8217;t say if they will bill based on requests and bandwidth (which is where Amazon S3 gets expensive).  Also interesting is this little nugget:</p>
<blockquote cite="http://ir.rackspace.com/phoenix.zhtml?c=221673&#038;p=irol-newsArticle&#038;ID=1215812&#038;highlight="><p>
Also later this year, Limelight Networks will team with Rackspace to allow developers to easily distribute content to millions of end users around the world and bring scalable content delivery and application acceleration services to the masses.
</p></blockquote>
<p>This is competing with Amazon&#8217;s attempts at starting a CDN later this year.  It&#8217;s worth noting that these are both pretty primitive CDN&#8217;s since they require you to register objects before the CDN hosts them.  Modern CDN&#8217;s like Limelight and Akamai allow you setup a <code>CNAME</code> so that their CDN essentially acts as a middle layer between your origin servers and your users.  This requires no preregistering since the CDN can just check the origin for any asset requested.  Caching is configured via configuration files and via standard http headers.  I&#8217;m not sure how useful these CDN&#8217;s will be to most.  Registering objects and uploading to another platform is a giant pain as opposed to just setting up a transparent <code>CNAME</code>.  The difference is one requires development time, the other doesn&#8217;t.</p>
<p>Acquiring Slicehost makes sense since they apparently have technology that will be useful to Rackspace.  They are making a bet that startups in need of hosting on virtual machines (which is much more complicated to manage than typical shared hosting) will produce a decent market in the future.  With the economic downturn, at least in the short term this may not look like the most useful purchase.  In the long run this may pay off handsomely.  They have decent competition in that space and it&#8217;s quickly growing.  Rackspace&#8217;s size may help it weather a downturn better than others though.</p>
<p>They closed 5.18 +0.22 (4.44%) today, despite the DOW being -514.45, so I guess I&#8217;m not alone in my assessment.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=2072#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2008/10/07cb5f8.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2008/10/22/rackspace-acquisitions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Amazon S3 Outage</title>
		<link>http://robert.accettura.com/blog/2008/07/20/amazon-s3-outage/</link>
		<comments>http://robert.accettura.com/blog/2008/07/20/amazon-s3-outage/#comments</comments>
		<pubDate>Mon, 21 Jul 2008 02:13:55 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Web Development]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[amazon-s3]]></category>
		<category><![CDATA[cdn]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[mirror image]]></category>
		<category><![CDATA[reliability]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[uptime]]></category>
		<category><![CDATA[varnish]]></category>
		<category><![CDATA[wordpress.com]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=1845</guid>
		<description><![CDATA[The buzz around the web today was the outage of Amazon&#8217;s S3. It shows what websites are &#8220;doing it right&#8221;, and who fails. This is a great follow up to my &#8220;Reliability On The Grid&#8221; post the other day. Amazon &#8230; <a href="http://robert.accettura.com/blog/2008/07/20/amazon-s3-outage/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>The buzz around the web today was the <a href="http://www.readwriteweb.com/archives/more_amazon_s3_downtime.php">outage of Amazon&#8217;s S3</a>.  It shows what websites are &#8220;doing it right&#8221;, and who fails.  This is a great follow up to my &#8220;<a href="http://robert.accettura.com/blog/2008/07/13/reliability-on-the-grid/">Reliability On The Grid</a>&#8221; post the other day.</p>
<p>Amazon S3 is cloud based computing.  Essentially when you send them a file using their REST or SOAP interface Amazon stores it on multiple nodes in their infrastructure.  This provides redundancy and security (in case a data center catches fire for example).  Because of this design it&#8217;s often though that cloud based computing is invincible to problems.  This is hardly the fact.  Just like any large system, it&#8217;s complicated and full of hazards.  It takes only a small software glitch, or an unaccounted for issue to cause the entire thing to grind to a halt.  More complexity = more things that can fail.</p>
<p>Amazon S3 is popular because it&#8217;s cheap and easy to scale.  It&#8217;s pay-per-use based on bandwidth, disk storage, and requests.  Because that allows for websites to grow without having to make a large infrastructure investment, it&#8217;s popular for &#8220;Web 2.0&#8243; companies trying to keep their budgets tight.  Notably sites like Twitter, WordPress.com, SmugMug and Amazon.com themselves all use Amazon S3 to host things like images.</p>
<p>Many sites, notably Twitter, and SmugMug didn&#8217;t have a good day today.  WordPress.com and Amazon.com operated like normal.  The obvious reason for this is WordPress.com and Amazon.com are much better in terms of infrastructure and design.</p>
<p>WordPress.com uses S3, but proxies that with <a href="http://varnish.projects.linpro.no/">Varnish</a>.  There&#8217;s a <a href="http://ma.tt/2007/10/s3-news/">brief description here</a>, and a <a href="http://blog.apokalyptik.com/2007/10/10/so-you-wanna-see-an-image/">more detailed breakdown here</a>.  According to <a href="http://barry.wordpress.com/2008/02/15/amazon-aws-outage/">Barry Abrahamson</a>, WordPress.com does 1500 image requests per second across and 80-100 are served through S3.  They have (slower) back up&#8217;s in house for when S3 is down and can failover if S3 has a problem.  This means they can leverage S3 to their advantage, but aren&#8217;t down because of S3.  Using Varnish allows them to keep the S3 bill down by using their own bandwidth (likely cheaper since they are a large site and can get better rates on bandwidth).  This  also and lets them have this have a good level of redundancy.  Awesome job.</p>
<p>Amazon.com uses S3 themselves.  If you look at images on the site, they are actually served from <code>g-ecx.images-amazon.com</code>.  Which is actually:</p>
<pre>
g-ecx.images-amazon.com. 38     IN      CNAME   ant.mii.instacontent.net.
</pre>
<p><code>instacontent.net</code> is actually part of <a href="http://www.mirror-image.com">Mirror Image</a>, a CDN.  This is essentially outsourcing what WordPress.com is doing in terms of caching.  It&#8217;s similar to Akamai&#8217;s services.  A CDN&#8217;s biggest advantage is lowering latency by using servers closer to the customer, which are generally going to feel faster.  The other benefit is that they cache content for when the origin is having problems.  Because Amazon has a layer on top of S3, they have an added level of protection and remained up and images loaded.</p>
<p>Twitter serves most images such as avatars right off of S3.  This means when S3 went down, there were thousands of dead images on their pages.  No caching, not even a <code>CNAME</code> in place.  Image hosting is the least of their concerns.  Keeping the service up and running is their #1 concern right now.  The service was still usable, just ugly.  Many users take advantage of third party clients anyway.</p>
<p>Using a CDN or having the infrastructure in house is obviously more expensive (it makes S3 more of a luxury than a cost savings measure), but it means your not depending on one third party for your uptime.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=1845#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2008/07/59bcda7.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2008/07/20/amazon-s3-outage/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Reliability On The Grid</title>
		<link>http://robert.accettura.com/blog/2008/07/13/reliability-on-the-grid/</link>
		<comments>http://robert.accettura.com/blog/2008/07/13/reliability-on-the-grid/#comments</comments>
		<pubDate>Mon, 14 Jul 2008 00:45:46 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Web Development]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[amazon-s3]]></category>
		<category><![CDATA[data portability]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[reliability]]></category>
		<category><![CDATA[Security]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[uptime]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=1815</guid>
		<description><![CDATA[There’s been a lot of discussion lately (in particular NYTimes, Data Center Knowledge) regarding both reliability of web applications which users are becoming more and more reliant on, as well as the security of such applications. It’s a pretty interesting &#8230; <a href="http://robert.accettura.com/blog/2008/07/13/reliability-on-the-grid/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>There’s been a lot of discussion lately (in particular <a href="http://www.nytimes.com/2008/07/06/technology/06outage.html?_r=3&#038;partner=rss&#038;oref=slogin&#038;oref=slogin">NYTimes</a>, <a href="http://www.datacenterknowledge.com/archives/2008/Jul/06/is_downtime_more_frequent_or_more_visible.html">Data Center Knowledge</a>) regarding both reliability of web applications which users are becoming more and more reliant on, as well as the security of such applications.  It’s a pretty interesting topic considering there are so many things that ultimately have an impact on these two metrics.  I call them metrics since that’s what they really are.</p>
<p><span id="more-1815"></span></p>
<h4>Defining uptime, security, and privacy</h4>
<p>For the intents of the discussion at hand, &#8220;uptime&#8221; is defined as the application being accessible and functional to the user.  Note putting a &#8220;fail whale&#8221;? image up so that the page loads doesn’t not qualify as functional.  For all intents and purposes the service is down.  One should also note that traffic goes through different routes to get to different users, hence a site can be down for one person, but up for millions of others.  The vast majority (95%+) should be able to use the service for it to really be considered &#8220;up&#8221;.</p>
<p>&#8220;Security&#8221; defined as the assurance that privacy, data integrity, and account access are restricted in accordance with typical site functionality and users understanding.  &#8220;Privacy&#8221; is defined as not allowing any unauthorized person or entity to manipulate, view, copy, handle, destroy, or know about the existence of data without explicit approval from the user.   </p>
<h4>Why applications fail</h4>
<p>Applications can fail for many reasons, but most can be lumped into a handful of categories.  At the highest level you have in-house and upstream reasons.  In-house can be defined as something you can control, for example software or servers you control, while upstream is typically a vendor or partner, for example ISP, colo facility, etc, which there is less control over (other than submitting a ticket).  Generally startups have more upstream services and bring more things in-house as time goes on.  For example, Facebook relies on colo facilities for their servers.  They now plan to build their own (more control, and hopefully will ensure lower costs as well).</p>
<p>On a lower level you break things down to hardware and software.  Hardware failures are inevitable.  Computers suck in a 24&#215;7 environment.  We deal with that since they are better than people, who still insist on sleep (lazy bastards).  Hard drives fry, motherboards fail, fans die resulting in &#8220;thermal events&#8221;?.  Generally it’s pretty easy to deal with this.  You can use RAID so 1 hard drive isn’t critical after all, moving parts are the most prone to failure.  You can also have more than one server powering a successful application.  If one dies, the load goes to other boxes running on the grid.  You can put them in different data centers so if there was a problem at one, your still up and running.  This obviously comes at a cost.  Services like Google App Engine, Amazon’s S3 and Amazon’s EC2 help lower the cost, but also result in hardware being handled by an upstream provider.  Amazon and Google are very redundant, but they too can and have failed.  </p>
<p>Software generally fails because it either wasn’t designed to scale, or it was hatefully put together to meet a deadline.  Startups are infamous for this as the business guys just want things done quick and cheap and don’t care about reliability until it’s too late (they will also deny this until the end of time).  All major software platforms can scale when done correctly.  Many people say Perl can’t scale, but it has for a decade, look at IMDB, Amazon and Slashdot among the many.  Even more claim PHP can’t scale, but Facebook and Yahoo seem to run fine.  Python (YouTube), Ruby-On-Rails (YellowPages.com, Hulu, 43things)  ASP.NET (MySpace and Microsoft) all seem workable in high traffic situations. It’s not what you use, but how you use it.  These run on Apache, IIS, Oracle, MySQL, among others.  The platform is rarely (if ever) the problem.  The implementation almost always is.</p>
<p>There’s also the possibility that everything is fine and dandy, but somewhere along the internet from the servers to some of your users there’s a problem.  ISP’s encounter tons of problems with people snagging their fiber and tearing a line, to DoS attacks and viruses reeking havoc.  When this happens close to the user, no sites are accessible, when this happens further away several sites may be inaccessible or slow.  Users often wrongly attribute this to a site or application being slow or down when that’s hardly the case.  Using a data center with good connectivity reduces these cases.  Having data centers distributed around the globe is even better, but often not economical.  The best a business can do is submit a ticket and wait.  If it’s frequent enough they can move somewhere else.</p>
<h4>Why security fails</h4>
<p>Security failure is almost too complex of a topic to discuss without holding a complete college course.  The most obvious answer is that someone is cleverer than the person in charge of security, and outwitted or outsmarted them.  It could be in the physical form (stealing a server or hard drive with data), or in the electronic form (Phishing, XSS, DoS).  It could be a &#8220;hacker&#8221;, or it could be an application failure that results in a security glitch.</p>
<p>Many websites take several measures to protect your privacy.  They require &#8220;strong&#8221; passwords, maybe even require you to change them.  For things like banking you may have &#8220;security questions&#8221; to answer.  Perhaps even a key fob to provide two factor authentication.  </p>
<p>Most security failure can be traced to stupidity.  For example using &#8220;password&#8221; for your password, or replying to an email asking for your password.  A poorly configured server can also be a vulnerability.  Then all you need is someone who wants to exploit that.  If the data is of any value, that person exists.  </p>
<h4>When businesses fail</h4>
<p>Hackers want your data, business want to keep it secure, but don’t want to spend too much time/effort on it since the formula is <code>time = money</code>.  There’s really not much more to explain here.  </p>
<p>Sometimes it’s not even the business you know your dealing with.  You may be working with company X, but they may use company Y, Z, A1, A2, A3, and A4 to actually provide their services.  Your data may be accessible by any or all of them.  </p>
<p>Then there’s the possibility of a business going out of business.  They may give you a chance to download your data and move it elsewhere or they may even do it for you.  They may also just shut down abruptly and disappear of the face of the earth.  Goodbye data.</p>
<h4>Take control of your data</h4>
<p>I may sound cynical for effectively saying applications fail, many people could potentially see your data, and there’s nothing you can do about it.  I&#8217;m not, I am a realist, and I know what goes on behind the scenes.  There actually is something you can do about it:  <em>Take control of your data.  Keep control of your data</em>.</p>
<ol>
<li>Know who has your data, what they might do to it, who they might share it with, and what they will do to protect it.  Companies (at least reputable ones) post privacy policies for a reason.  Check them out or Google for some info on that company.  The results may surprise you.  For example if you delete something from Google Docs it may take 3 weeks for it to actually be deleted on their servers.  This isn’t uncommon, but many people assume once you delete it, the company deletes it.  That’s not the case.</li>
<li>Think about accessibility.  What happens if that application has an outage?  What happens if your ISP has a problem?  Or your cable line got cut?  Using an online office suite is a great way to keep documents accessible from home or work, but not so great when you can’t access them.  Storing them on a USB drive may prove useful, at least as a backup.  If you&#8217;ve got a business, this is especially true.  You may also want to consider a 2nd way to get online should your ISP have problems (giving a wireless card and a laptop to certain employees may also have the perk of  allowing employees to be more mobile).</li>
<li>Decide the fate of your data.  I personally prefer to keep a copy of everything so if a company goes under, I still have my data.  I host my own blog, and my own photos.  I keep backups of all that too, in multiple locations.  I know I’ll be around as long as I care about keeping that data online.  I’m not going out of business.  If I am, I don’t care about that data anymore <img src='http://robert.accettura.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />  .  I always have my data.  You should too.</li>
</ol>
<h4>Keep control of your data</h4>
<p>Just because you’ve figured out how to protect your data, doesn’t mean you’re done.  You need to reevaluate yourself every time you start using something else, or change your usage patterns.  You don’t have to keep your data offline, just understand what putting it online really means.  Offline backups aren&#8217;t a bad idea.  Having backups on another service is also an option, but may be even more complicated.</p>
<p>This is somewhat more complicated in the case of things like social networks, but things like <a href="http://www.dataportability.org/">Data Portability</a> are slowly becoming a reality.</p>
<p>Google in general has been pretty good with leaving the options to take your data back.  Gmail lets you use IMAP to download all your mail, Google Reader lets you export an OPML feed, Google Docs lets you save all your docs to your computer.  It&#8217;s important to know what the services you rely on let you do with your data.  Don&#8217;t just assume you can easily get it out.  </p>
<h4>You’re responsible for your fate</h4>
<p>It’s easy to blame Google, Microsoft, Yahoo, or Twitter for your problems, but that’s really a poor excuse.  You’re responsible for the choices you make, and what you rely on.  If what you’re relying on isn’t giving you what you need, you need to find something else, or reevaluate if your putting your priorities in the right place.</p>
<p>I now present to you&#8230;</p>
<h4>Accettura&#8217;s Law Of Business Computing</h4>
<pre>
where
people = prone to frequent failures
technology = expensive, complex, frequent failure

business computing = people + technology = complex frequent failures that are costly in nature.
</pre>
<p>You can see how this works right?  Best way to avoid that cost?  Make sure your technology is redundant, and your people&#8217;s interaction is controlled to prevent failure from leaking into the technology.</p>
<p>This should be in Wikipedia and every Business and CompSci textbook.  That way everything that a student touches or thinks about in this industry is done with this in mind.  Build with the knowledge in mind that the <a href="http://www.flickr.com/photos/scriptingnews/2537265280/">fail whale</a> will just make you a relic before you even hit your prime.</p>
<p>That said, <em>get over Twitter being down and stop complaining.</em>
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=1815#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2008/07/f0bbac6.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2008/07/13/reliability-on-the-grid/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Site Outage</title>
		<link>http://robert.accettura.com/blog/2008/06/10/site-outage/</link>
		<comments>http://robert.accettura.com/blog/2008/06/10/site-outage/#comments</comments>
		<pubDate>Tue, 10 Jun 2008 13:52:44 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[infrastructure]]></category>
		<category><![CDATA[server]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/?p=1779</guid>
		<description><![CDATA[This server will be moving to a new data center tonight (Tuesday sometime between 1 &#8211; 7 AM EST). If your feed reader reports that I&#8217;m down&#8230; that&#8217;s why. Edit: All done. Successfully moved to it&#8217;s new home.]]></description>
			<content:encoded><![CDATA[<p>This server will be moving to a new data center tonight (Tuesday sometime between 1 &#8211; 7 AM EST).  If your feed reader reports that I&#8217;m down&#8230; that&#8217;s why.</p>
<p><strong>Edit:</strong> All done.  Successfully moved to it&#8217;s new home.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/?p=1779#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2008/06/8c01a75.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2008/06/10/site-outage/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Site Outages</title>
		<link>http://robert.accettura.com/blog/2007/07/28/site-outages/</link>
		<comments>http://robert.accettura.com/blog/2007/07/28/site-outages/#comments</comments>
		<pubDate>Sun, 29 Jul 2007 01:21:22 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[hosting]]></category>
		<category><![CDATA[infrastructure]]></category>
		<category><![CDATA[server]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/archives/2007/07/28/site-outages/</guid>
		<description><![CDATA[This and other sites of mine will experience a few outages this weekend as the servers move to a new temporary data center. Update: Server is snug in it&#8217;s new home. Will be moving again to a permanent new data &#8230; <a href="http://robert.accettura.com/blog/2007/07/28/site-outages/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>This and other sites of mine will experience a few outages this weekend as the servers move to a new temporary data center.</p>
<p><strong>Update:</strong> Server is snug in it&#8217;s new home.  Will be moving again to a permanent new data center.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/archives/2007/07/28/site-outages/#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2007/07/fe70c36.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2007/07/28/site-outages/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Wikipedia Infrastructure</title>
		<link>http://robert.accettura.com/blog/2007/04/29/wikipedia-infrastructure/</link>
		<comments>http://robert.accettura.com/blog/2007/04/29/wikipedia-infrastructure/#comments</comments>
		<pubDate>Mon, 30 Apr 2007 02:32:41 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Web Development]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[lamp]]></category>
		<category><![CDATA[php]]></category>
		<category><![CDATA[server]]></category>
		<category><![CDATA[Wikipedia]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/archives/2007/04/29/wikipedia-infrastructure/</guid>
		<description><![CDATA[Here&#8217;s a great read on Wikipedia&#8217;s Infrastructure. Two excellent sets of slides. A lot can be done with a LAMP stack. The common theme: caching and careful optimization. There are some really impressive stats in there.]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s a great read on <a href="http://dammit.lt/2007/04/29/wikipedia-site-internals-etc-the-workbook/">Wikipedia&#8217;s Infrastructure</a>.  Two excellent sets of slides.  A lot can be done with a LAMP stack.  The common theme: <em>caching</em> and careful optimization.  There are some really impressive stats in there.
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/archives/2007/04/29/wikipedia-infrastructure/#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2007/04/9701a1c.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2007/04/29/wikipedia-infrastructure/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>FoxTorrent</title>
		<link>http://robert.accettura.com/blog/2007/04/27/foxtorrent/</link>
		<comments>http://robert.accettura.com/blog/2007/04/27/foxtorrent/#comments</comments>
		<pubDate>Sat, 28 Apr 2007 01:08:10 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[Mozilla]]></category>
		<category><![CDATA[akamai]]></category>
		<category><![CDATA[bittorrent]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[firefox]]></category>
		<category><![CDATA[foxtorrent]]></category>
		<category><![CDATA[server]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/archives/2007/04/27/foxtorrent/</guid>
		<description><![CDATA[I said back in 2004 that Firefox needs built in support for BitTorrent. My idea was it would be integrated into the download manager so that it was &#8220;just another protocol&#8221; and would be transparent to a typical user. I &#8230; <a href="http://robert.accettura.com/blog/2007/04/27/foxtorrent/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I said back in 2004 that <a href="http://robert.accettura.com/blog/2004/03/03/mozilla-needs-bittorrent/">Firefox needs built in support for BitTorrent</a>.  My idea was it would be integrated into the download manager so that it was &#8220;just another protocol&#8221; and would be transparent to a typical user.  I still stand by that.</p>
<p>Fast forward to 2007: <a href="http://www.foxtorrent.com/">FoxTorrent</a> is by <a href="http://www.akamai.com/redswoosh">RedSwoosh</a> (now owned by <a href="http://www.akamai.com/">Akamai</a>).  </p>
<p>I&#8217;d personally love to see something like this ship built in.  It&#8217;s a great feature.  BitTorrent is a great protocol for distributing large downloads without having to buy expensive infrastructure.  Akamai&#8217;s interest is proof of that.</p>
<p>FoxTorrent has a <a href="http://foxtorrent.wordpress.com/">blog</a> if you want to keep an eye on it.  FoxTorrent is MIT licensed as well.  It seems like a very interesting product.  I&#8217;ll have to dig into this and look at it a bit closer.</p>
<p><small>[Hat tip: <a href="http://www.techcrunch.com/2007/04/27/akamai-releases-foxtorrent-10-firefox-bittorrent-add-on/">TechCrunch</a>]</small>
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/archives/2007/04/27/foxtorrent/#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2007/04/602d130.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2007/04/27/foxtorrent/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Yahoo Goes Green</title>
		<link>http://robert.accettura.com/blog/2007/04/17/yahoo-goes-green/</link>
		<comments>http://robert.accettura.com/blog/2007/04/17/yahoo-goes-green/#comments</comments>
		<pubDate>Wed, 18 Apr 2007 02:10:44 +0000</pubDate>
		<dc:creator>Robert</dc:creator>
				<category><![CDATA[In The News]]></category>
		<category><![CDATA[Tech (General)]]></category>
		<category><![CDATA[datacenter]]></category>
		<category><![CDATA[electricity]]></category>
		<category><![CDATA[server]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://robert.accettura.com/archives/2007/04/17/yahoo-goes-green/</guid>
		<description><![CDATA[Yahoo is going carbon neutral. I&#8217;m curious how much is offset, and how much is reduction. Yahoo has a fairly large infrastructure. I wonder if they are using alternative power sources, or if they are going to plant a million &#8230; <a href="http://robert.accettura.com/blog/2007/04/17/yahoo-goes-green/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><a href="http://yodel.yahoo.com/2007/04/17/dont-even-leave-a-footprint/">Yahoo is going carbon neutral</a>.  I&#8217;m curious how much is offset, and how much is reduction.  Yahoo has a fairly large infrastructure.  I wonder if they are using alternative power sources, or if they are going to plant a million trees.  They do <a href="http://brand.yahoo.com/forgood/environment/carbon_neutral.html#how">mention</a>:</p>
<blockquote cite="http://brand.yahoo.com/forgood/environment/carbon_neutral.html#how"><p>
These projects could include a wind farm in India or a small-scale run of the river hydroelectric project in Brazil. We’re also looking to invest in emerging clean technologies.
</p></blockquote>
<p>Interesting.  I wonder if we will see things like carbon neutral VoIP, carbon neutral bandwidth, carbon neutral data centers / colocation / hosting?
<div id="rja_commentCountImage"><a href="http://robert.accettura.com/archives/2007/04/17/yahoo-goes-green/#comments"><img src="http://robert.accettura.com/wp-content/commentCount/2007/04/9683cc5.gif" alt="Comment Count" style="border:0;" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://robert.accettura.com/blog/2007/04/17/yahoo-goes-green/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

