Categories
Google Mozilla

Google Buys On2

Google today announced they are buying On2 Technologies. This is one of their more significant purchases despite the relatively low price tag of $106.5 million since it’s video technology and Google is the largest video source on the web right now.

On2 is really an unknown to most people but their product has an amazing reach thanks to Adobe Flash. VP6 notably was included in Flash 8 and really brought about the age of Flash video (think YouTube). On2 also has VP7 which is considered a H.264 competitor. VP3 was released as open source and lives on as OGG Theora.

Of course by buying On2 Google will not need to pay any licensing for it’s VP7 technology, they can then bundle it into Chrome, Android and Google Chrome OS (finally giving Linux decent web video support). They could also open source it similar to these platforms in hopes that it will gain ubiquity.

This does however leave me wondering if this pending On2 deal had any bearing on the decision to leave HTML 5 <video/> codec ambiguous. It’s noteworthy since Google is very involved in the HTML 5 efforts. As I mentioned last month licensing is really key. If VP7 were open sourced and it’s licensing were compatible to meet Apple and Mozilla’s needs (which could lead to inclusion in Safari and Firefox respectively), OGG Theora is potentially dead overnight. Given Google’s strategy so far of making technology open source in efforts to encourage adoption, I wouldn’t rule this out, though it would likely take a while to evaluate everything and make sure they legally have that option. Timeline could also come into play here. The web isn’t necessarily going to wait for Google. These reviews can potentially take a long time. No guarantee others will incorporate it either, though it’s a pretty good deal should licensing work.

It’s also interesting that now Microsoft has Windows Media Player, Apple has QuickTime, and Google has On2’s codec bundle. It’s not exactly a “player”, but considering it’s usage it’s just as important.

It’s going to be very interesting to see how this plays out. One thing that seems relatively certain is that Google just made web video more interesting.

Categories
Google

Google Chrome OS

The big news over the past 24 hours is the announcement of Google Chrome OS. Effectively Google Chrome OS is a stripped down Linux Kernel with just enough to boot Chrome/WebKit as it’s main UI. The exact UI paradigm hasn’t been reveled as of yet. Google claims:

Speed, simplicity and security are the key aspects of Google Chrome OS. We’re designing the OS to be fast and lightweight, to start-up and get you onto the web in a few seconds. The user interface is minimal to stay out of your way, and most of the user experience takes place on the web. And as we did for the Google Chrome browser, we are going back to the basics and completely redesigning the underlying security architecture of the OS so that users don’t have to deal with viruses, malware and security updates. It should just work.

It’s an interesting and somewhat bold statement.

Categories
Google Internet

Google Wave

Google Wave is a pretty impressive demo, and the fact that they are open sourcing most of it, documenting the protocol and enabling federation is a major win, but I’m hesitant to think it will replace email anytime soon, if ever.

John Gruber has a very interesting observation:

Communication systems that succeed are usually conceptually simple: the telegraph, the telephone, fax, email, IM, Twitter. So color me skeptical regarding Wave’s prospects.

A very valid point. Popular technical communication systems solve one communications problem. Attempts to solve more than one so far have failed. A good example is the video phone we were supposed to have in every home 20 years ago. Even today with cheap web cams, video and telephony is rare to combine and is seen as somewhat of a novelty.

Wave also has other limitations such as people who use Wave interacting with people who don’t. Most of the “wow” in Wave requires interacting with other Wave users. Pretty cool if everyone you communicate with is using Wave, but no so much if many/most of your contacts aren’t using Wave. How many people only communicate with others using the same mail provider? Google users never email Yahoo and Microsoft users?

Will Wave be adopted? For one thing it will change the business model of many email providers. Wave will be significantly more resource intensive than basic webmail and POP3 access (or IMAP for the rare few). One could argue spam has made Email somewhat resource intensive, but Email has more slack regarding expected latency since it’s not “real time”. Email is often given away with internet access, web hosting, or just as a freebie because providers know that email keeps users coming back and it’s extremely low cost to provide. It also retains users. For example lots of non-AOL users keep their AOL account just for the email.

Then there’s the issue of ownership. Group editing a wave sure sounds like fun, as it’s so wiki-like. All that collaboration is also a real boost for productivity, but it does have it’s downsides. Who owns that data? Obviously companies are going to be a bit concerned about this aspect. Email has the benefit of being rather concrete. Send, receive. Those are the only two functions supported. Replies are merely a copy of a previously received email with an appended response. Ad-hoc collaboration seems to create a new twist. The courts have also seen their share of email. Wave means new precedents and interpretation in the law. How many companies you think want to test that pool?

One thing the Google team said virtually nothing about was security.

Email was never designed to be secure. SMTP servers initially had no authentication anyone could send using any SMTP server. Auth was bolted on later on, and is still problematic (receive before send anyone?). Presumably since Wave is built on top of XMPP SSL will be we the encryption mechanism. But that’s only on the transport level between federated servers. What about end to end? Is an S/MIME like method supported? SSL to the user is a secure transport layer but doesn’t protect from interception by either server. Since it’s text you could use PGP and send a message, though you loose a lot of functionality and grace.

SPF is a hack for email origin verification. It works OK where it’s supported, but not everyone supports it from a provider or user perspective making it a pretty poor solution. Will Wave be utilizing EV-SSL? How about supporting verification from the actual user? S/MIME signature? Verifying identity is critical to being a successor to email. Both verifying the organization, and the user at the organization.

Lastly spam. How does Wave attempt to mitigate the spam problem? Sounds like one of the possibilities is a whitelist which doesn’t work in email, and is unlikely to work in a Wave. Unsolicited emails are good in many non-spammy situations. For example a friend emailing from a new address or another business discussing a partnership. Sure you can prompt each time to add to whitelist, but then the process itself becomes spam. Do you wish to add “buy-viagra-at-thebiggestviagrastoreintheworld.com@yahoo.com” to your whitelist? You get the picture. I’m sure there will be traditional filtering as well, but that still doesn’t solve the problem.

I think Wave has a chance, but it’s not a very high chance of success. There are a lot of barriers. Email is still the ultimate API for it’s ease of use and implementation. Email didn’t survive for so long because there was nobody willing to build something better. Email survived because it became the standard and worked in virtually all situations. It was simple enough for users and implementers alike.

I think it’s much more likely that concepts from Wave will end up elsewhere, rather than Wave replace email. Because of that, I’d call it a disruptive innovation.

Categories
Google Security Web Development

Phorm’s UserAgent

There’s a fair amount of controversy regarding Phorm a company who plans to target advertising by harvesting information via deep packet inspection. They are already in talks with several ISP’s. I’ll leave the debate over Phorm from a user perspective for someplace else.

They claim to offer ways to let websites opt out of their tracking but it’s a true double-edged sword as they don’t play nice with a standard robots.txt file. Take a look at what they are doing here:

The Webwise system observes the rules that a website sets for the Googlebot, Slurp (Yahoo! agent) and “*” (any robot) user agents. Where a website’s robots.txt file disallows any of these user agents, Webwise will not profile the relevant URL. As an example, the following robots.txt text will prevent profiling of all pages on a site:

Rather than use a unique user agent they are copying that of Google and Yahoo. The only way to block them via a robots.txt file is to tell one of the two largest search engines in the western world not to index your site. This seems fundamentally wrong.

There is an email address where you can provide a list of domains to exclude, but that requires intervention and updating a list of domains when you create a site. This obviously doesn’t scale.

Now I’m curious. Is piggybacking off of another companies user agent considered a trademark violation? From what I understand they aren’t broadcasting it, just honoring it. If I were Google or Yahoo I’d be pretty annoyed. Particularly Yahoo since there are websites who will just block Slurm given Google’s dominance in search. Yes there are many user-agent spoofing products out there (including wget and curl), but nobody to my knowledge is crawling web pages for a commercial purpose hiding behind another company name.

robots.txt is a somewhat flawed system as not all user agents even obey it (sadly) though it’s one of the only defenses without actual blocks that exist.

Categories
Google Internet

Engineering Efficiency

Internet companies have the unique ability to scale quicker than any other industry on earth. Never before has a company been able to position itself from being nothing more than an idea to being in the living rooms of millions around the globe in a matter of hours. While this introduces seemingly unlimited opportunities to grow it also allows for exponential waste if a company isn’t careful. It’s interesting to see how they scale. Scaling businesses in many ways isn’t very different than scaling servers and software stacks.

The Classic Example: UPS

Started in 1907 and adopting the name United Parcel Service in 1919 UPS has no real “high tech” background unless you include the Ford Model T. That doesn’t mean it couldn’t become more efficient. UPS has made a science of the delivery business. For example it’s famous for it’s “no left” policy. Simply put they found that avoiding left turns means less time waiting at lights which means less fuel is wasted. The more efficient routing formerly done by humans now computerized saves them 3 million gallons of fuel in 2007 alone. Lets do the math:

Assuming they run 100% diesel at an average cost of $2.87/gallon in 2007 [doe] multiplied by 3 million that’s $8.61 million dollars by trying to avoid left turns.

Not bad for a souped up mapping application.

By having their drivers do things like turning of the ignition while unbuckling their seat belt at the same time, and scanning for the doorbell while walking towards the door (it’s easier to see from a distance than up close) they can shave time off of their routes.

Then of course there’s package tracking. While customers might like to know in what city their weight loss taps are sitting tracking systems help reduce loss and monitor package routing for optimal efficiency.

Cutting Utility Bills: Google

Being the largest search engine, a large ad network, email provider, analytics firm, mapping service, video site, and whatever else they do means Google needs a ton of servers. Cramming servers into data centers and keeping them cool to prevent hardware failures is a complicated and expensive task. Keeping the whole thing powered is also really expensive. Google has scrutinized server designs to eliminate all waste possible. This has resulted in Google having more horsepower at a lower cost than their competitors. Having more capacity at a lower cost means Google can do more at a lower cost than their competitors. I won’t discuss Google in too much detail since they did a great job themselves recently and I mentioned it the other day in another blog post: Google’s Data Center Secrets.

Shipping Efficiency: Amazon

Amazon’s long been improving efficiency by using data collection and analysis to encouraging their customers to spend more. Their algorithms to recommend related products you might be interested in is one of the best out there. Their ordering system is streamlined to prevent customers from bailing before completion. Their products are SEO’d to appear on the top of Google searches. That doesn’t mean Amazon can’t improve other parts of their business.

Amazon several months ago started a Frustration-Free Packaging program. Here’s how they describe it:

The Frustration-Free Package (on the left) is recyclable and comes without excess packaging materials such as hard plastic clamshell casings, plastic bindings, and wire ties. It’s designed to be opened without the use of a box cutter or knife and will protect your product just as well as traditional packaging (on the right). Products with Frustration-Free Packaging can frequently be shipped in their own boxes, without an additional shipping box.

The key here is “can frequently be shipped in their own boxes”. By shipping a box alone rather than packaging they can skip a step in their warehouses (and the packaging materials that go with packaging something for delivery). This also lowers the weight as those extra boxes don’t weigh 0 oz. The frustration free packaging is also the perfect shape for efficiently filling trucks and strong enough to not crush easily thus lowering returns due to damage.

Amazon now even has a feedback form [login required] for users to share what they think of their package. This has the added bonus of helping further reduce the inefficient shipping practices so common right now.

Amazon’s also done a significant amount of work on their infrastructure to make their servers scale well using tech such as EC2 and S3. By selling capacity to other companies they able to take advantage of economy of scale as well as diversify their business beyond just retail. Of course they are planning their data centers to have access to cheap power.

These aren’t haphazard attempts at increasing efficiency, these are well calculated engineered approaches to removing even the smallest inefficiencies with the knowledge of how they compound as operations scale. Aren’t they clever?

Categories
Google Hardware

Google’s Data Center Secrets

Big news today is that Google “unveiled” (more like confirmed) some data center secrets:

Custom Servers

It has been known for years that Google has been building it’s own servers rather than buy from a vendor. They have defended this as their servers are more efficient and customized for their needs than they could ever buy. They cut out things like a video card which do nothing but add a point of failure and waste power. They put a battery on the server itself rather than have a UPS for the rack they found it to be more cheaper and more efficient. They also hang the power supply away from the rest of the system itself, presumably for cooling. This actually isn’t shocking since it’s been leaked several times before, though this is the first time that I’m aware of Google speaking publicly about their design in this much detail.

Container Data Centers

Apparently since 2005 Google has been using shipping containers as data centers. It’s been known for a long time Google was interested in the idea (as were other companies) but a first that they have actually been using them for a while. 1,160 servers per container utilizing 250 kilowatts of power = 780 watts per square foot. Very impressive.

I guess it’s only a matter of time before we see commercial servers, and perhaps even some desktops with power supplies that have their own batteries.

Update [4/11/2009 @ 5:00 PM EST]: Google has a blog post up including video of the summit.

Categories
Audio/Video Google

YouTube Recommends

YouTube Recommends

Thanks YouTube for that recommendation of assorted animals mating. I also enjoyed the horse sperm harvesting video.

Categories
Google Internet Networking

Measurement Lab

Google today unwrapped Measurement Lab (M-Lab) which can measure connection speed, run diagnostics and check if your ISP is blocking or throttling things via it’s blog.

In general it’s a good idea, but it’s nothing new. Sites like dslreports.com and SpeedTest.net have been measuring speed and providing diagnostics for years. The BitTorrent test however isn’t replicated by many yet.

One thing that isn’t addressed is how they will detect if an ISP is adjusting their routing to handle M-Lab servers specially. What stops an ISP from not throttling from one of Google’s “36 servers in 12 locations in the U.S. and Europe” but throttling all other data? Perhaps Vint Cerf and friends have a plan in mind but it seems to me this could be a cat and mouse game.

Categories
Google Internet

Technology Growing In Public Consciousness

Google Zeitgeist 2008 is out. As always it’s a fun to read because it’s a recap of 2008. It’s also gives some pretty good insight into 2009, in particular for tech since people tend to turn to Google to explain technology for them. I noticed some very interesting things:

What is…

  1. what is love
  2. what is life
  3. what is java
  4. what is sap
  5. what is RSS
  6. what is scientology
  7. what is autism
  8. what is lupus
  9. what is 3g
  10. what is art

What Is RSS

Number 5 “what is RSS” is really what’s interesting. There’s long been speculation if RSS will ever move beyond a more technical audience. It powers many things on the internet, and is possibly the most popular use of XML (I’ve got no data on that), but it’s never had tremendous adoption by mainstream users who still Google for websites and read news off the homepages.

Steve Rubel thinks it’s peaked at 11% citing Forrester research back in October. Personally I think it’s still got a way to go. It will grow, but slowly. The following reasons are why I think this will turn out to be true:

  • RSS and “feed” are terms that are just now entering mainstream public consciousness. People tend to see things a while before they care about them when it comes to technology. They prefer not to waste time on fads. What we see above is evidence of a transitional step in this process.
  • Modern browsers finally offer a smoother process to help users take advantage of RSS. For most of it’s life browsers simply showed raw XML on the screen, this was unusable for 99.5% of the population. Showing XML looked foreign and overwhelming. The newer interfaces, while they can still be improved upon, are much easier for users.
  • RSS readers are at their infancy. I’m an advocate of Google Reader since very early on. That said, I think there’s a lot that can be improved upon. I think they can become a bit more usable for mainstream users, in particular when it comes to bucketing into folders and sifting/sorting. It’s management of data rather than just displaying that needs work.
  • Greater need to manage time. Lack of time and information overload have long been growing problems for people. RSS actually helps here (unless your Steve Rubel and addicted) by reducing the amount of time you need to access and digest information. I keep tabs on a few hundred sites with minimal effort throughout the day, it’s essentially a constantly evolving newspaper for me. People need to reduce the time they spend monitoring things they care about. RSS is the leading candidate to help them in this task.

Because of the need, and the fact that RSS is a pretty good solution, despite the lack of good interfaces to intproduce it to users, I suspect there will still be growth as people overcome the barriers and take advantage of it. The only way that won’t happen is if there’s a more disruptive technology. Even if that doesn’t pan out, RSS will be with us for many years due to it’s pervasive use across the net.

What is 3G

I presume this is highly related to the iPhone 3G release. The term “iPhone” appears in the Zeitgeist for several countries but not the US. I suspect that’s because it’s new to those countries. What’s new to the US is the iPhone 3G, and that’s what people wanted more information on.

What is Java, What is SAP

Enterprise IT departments love the Google. Enough said.

Other thoughts

I was a little disappointed to not see “who is rick astley“. As well as “i can has cheezburger?”