Archive for the ‘Network Architecture’ Category

Participating in MPLS 2009

Saturday, October 24th, 2009

I’ve meant to post something about this before now, but I suppose it’s better late than never…

I’m traveling this weekend to Washington, DC in order to attend the MPLS 2009 conference. Specifically I will be participating in a panel hosted by Monique Morrow entitled Emerging Technologies and Business Architectural Impact. The topic of the panel is described as “Cloud Computing, P2P applications, Social Networking and Infrastructure Required to Scale” and also includes a handful of panelists from other interesting companies, both vendors and service providers.

The program for the technical sessions can be found at http://www.isocore.com/mpls2009/program/technical_sessions.htm.

If you’re in the DC area and want to meet up, please let me know. If you’re going to be attending MPLS 2009 then definitely stop by and say hello after the panel.

Donn Lee on Need for 100G and 1T Ethernet

Thursday, October 1st, 2009

Facebook friend Donn Lee presented on the need for 100Gbps and 1Tbps Ethernet at a recent Ethernet Alliance meeting. (found via Data Center Knowledge) You can watch the video here:

http://link.brightcove.com/services/player/bcpid1701276884?bclid=1622640422&bctid=40363249001

In addition to Donn’s presentation there are very good presentation videos from Google and Brocade that you can view at the above link. Bikash Koley from Google gave a great talk, and hints at some of the deep architectural problems posed by cloud services during his post-talk Q&A, so be sure to watch until the end. While watching Donn’s talk I was pleased to see that he can present real details as a Facebook employee, as opposed to censored presentations I’ve seen him give when he worked for Google. In particular it’s nice to see numbers on each axis of his graphs. ;)

In any case, his discussion centers around the port bandwidth needed for non-blocking Ethernet fabrics at high bisection bandwidths. His argument is that the bandwidth needs of Internet-scale properties like Facebook are outpacing the progress of the vendor and standards community. I especially liked one of his example network topologies, which illustrates how crazy things can get when you need lots of bandwidth throughout a datacenter:
fabric-chaos

This is a topic that I’ve been struggling with over the past couple years, as I’ve worried about building a non-blocking datacenter fabric that can support arbitrary workload distribution. And after all this time worrying, I’m not convinced this is the right strategy. Please don’t misunderstand; I’m certainly not convinced that it’s a wrong strategy. But, considering the technical and economic challenges, this approach seems pretty weak on its own. My opinion (until I’m convinced otherwise) is that more bandwidth and clever fabric topologies must be supplemented with some form of workload distribution management.

In enterprise or single-tenant environments this should be much easier than in a service provider cloud, because the relationship between each node can be determined and/or planned. But when anybody can throw any sort of workload at the infrastructure, creating opaque relationship topologies, what’s a network architect to do? That’s a real question–please send me pointers if you’re aware of existing work. Otherwise, I guess I should start wishing for the appearance of 1Tbps Ethernet in my future.

Cloud: Private vs Public, Internal vs External, Oh My!

Tuesday, September 15th, 2009

James Urquhart (Cisco marketer and author of The Wisdom of Clouds) has posted a video to the Cisco Data Center Networks blog entitled Clarifying Internal Cloud versus Private Cloud. In the video James stands by an unreadable whiteboard (new markers, please, James! ;) ) and discusses the difference between public vs. private clouds and internal vs. external clouds, briefly summarized as:

While internal and external clouds are based on the ownership of where the computing resources reside, two other cloud types – public and private – have more do to with the control point of the cloud applications and resources.

Sam Charrington (Appistry marketer and regular CloudCamp organizer) commented on James’ definition in a post on his blog:

While I agree that Private is not always equal to Internal, James’ [re]definition of increasingly accepted terminology just serves to muddy the waters by introducing the existence of unified control systems as a defining characteristic.

I have much respect for James and Sam as cloud thought-leaders, but I’m afraid I have to disagree with them both.

I’ve been hearing confused voices discuss the public/private cloud concepts for a couple years now, but I don’t think the industry has yet settled on a definition that captures the dimensions of a cloud environment completely. Oh, I agree that the conversation has become more subdued recently. But I suspect this is due to boredom with the topic rather than consensus.

But boredom be damned, James expands/clarifies the definition to cover a new dimension. His division of the cloud based on two attributes, ownership and point of control, leaves aside other dimensions that are critical to the cloud’s definition. I don’t have any disagreement with these attributes being central to a cloud instance. But I would add a few critical attributes to the list of dimensions that must be captured by the internal/external private/public definition.

To illustrate the point: My own boss*, Bryan Doerr (Savvis CTO), was recently summarized describing the “Private” aspect of a cloud instance similarly to how I suspect Sam would define it but with the on/off-site aspect that James is trying to accommodate in his internal/external definition:

In general, the phrase, “private cloud” has meant cloud-like resources inside the enterprise. Savvis is proposing that the “private cloud” be located off premises but include greater measures of security and quality of service than temporary workloads sent to the shared facilities, like Amazon’s EC2.

Bryan’s talk, which led to the summary quoted above, hints at the fundamental problem: that a cloud service has to accommodate all of the complexities of a traditional IT environment. There is no magic pixie-dust that makes an infrastructure cloud capable of side-stepping longstanding IT challenges and best-practices while scaling existing applications as-is in an efficient hardware environment.

Rather, cloud service providers design and package different IT elements into a solution, addressing the aforementioned challenges on the user’s behalf. It’s IT outsourcing taken one step closer to the inevitable conclusion of any technology life-cycle in a free-market economy: commoditization.

But we’re not there yet. We can say that this packaged solution is a “cloud” solution because of the high-level behaviors that it supports: on-demand provisioning, usage-based costing/billing, consolidated controls, etc. These behaviors are typically enabled by a collection of multi-tenant hardware and software platforms. But the specific platform details such as capabilities and features, implementation choices, etc, all lead to small differences in the solution.

These differences, in addition to being the gap between commoditization and our current position in the technology life-cycle, are what lead us to need terminology like internal/external and private/public. Maybe one day IT infrastructure will converge around a common set of attributes with limited and known values. (Not that I would predict such a thing anytime soon.) In the meantime we’re stuck with a terminology that is more complicated than cloud marketers and customers would like.

I’d propose that this terminology includes at a minimum the following attributes:

  • Platform Ownership – customer, provider, other
  • Financial Liability – customer, provider
  • Physical Location – customer premises, provider premises, other location
  • Point of Control – customer, provider, other
  • Point of Management – customer, provider, other
  • Network Connectivity – private, public, both
  • Security Policy Management – customer-driven, provider-driven
  • QoS Policy Management – customer-driven, provider-driven

Our job as cloud service providers is to package these attributes in the best way for our customer-base and to make transitions between different attribute states as seamless as possible. As an architect working on one such cloud platform, however, I’d sure kill for some of that aforementioned pixie-dust…

* – Note Well: This post is not endorsed or sponsored by my employer, Savvis, Inc, and has not been written, edited, reviewed, or approved by anybody except me (Benson Schliesser).

Internet Growth During the Recession

Tuesday, September 15th, 2009

Popular wisdom has it that during a recession people will cut back on all sorts of expenses, but they will still go out to the movies. The story goes that, supposedly, people need to relax and that movies provide a great temporary escape from reality. As I’ve always heard it, movie theater revenues actually go up during a recession.

Now, I don’t know for certain if the movie theater story is true. But current trends would seem to suggest that it is, according to articles like this one. Movie-going aside, I’ve seen multiple sources that would suggest a similar role for recessions in influencing Internet traffic. For instance, this article discusses a recent Telegeography study showing significant increases in international Internet traffic. The study focuses on just a subset of the infrastructure, so it is by no means definitive. But other sources (including private data) back up this view of Internet growth.

The question remaining, in my mind, is whether the increased growth is due to changes in the underlying network connectivity, business consolidation and/or outsourcing, consumer use (i.e. the “Recession Movie-going effect”), or something else entirely. If anybody has data to suggest one conclusion over another, please post a comment or send me a note with a pointer. I’d love to understand this better.

Cyan Optics and Economic Reality

Tuesday, September 15th, 2009

Om Malik has a post on GigaOm entitled The Return of the Optical Startup — Cyan Optics Launches discussing a new start-up in the optical space, Cyan Optics. As I read his post I had a thought, which I was happy to see he captured near the end of the article:

Many of these products have been talked about for years. In an ideal world, Cisco would have developed some of these technologies as next-generation Cerent products. When I talked to Hatfield last week, he pointed out that the carriers want to deliver more bandwidth and yet keep the costs of delivering that bandwidth low, otherwise the network economics breakdown. Cyan was started to address this quandary, he said.

I haven’t met with Cyan yet, though I may have exchanged emails with them sometime over the past year… My memory is so overloaded with cloud networking these days that I can’t recall. ;)

In any case, I’m glad to hear about their strategy for an optical multi-service platform. This is something that I’ve hoped for and expected for some time. I’m surprised that Cisco hasn’t built this already, given their integration of optical capabilities in router platforms like the CRS. With the right optical multi-service platform, technologies like GMPLS and GFP could enable more dynamic networking in the transport layer for both access and backbone. This is a vision that Cisco seems to have in their mind, and yet to me they seem blind to the need for multi-service L1 and L2 network elements.

Cyan’s management platform also looks impressive. This isn’t an area in which Cisco has been impressive, either. But take a look at Cyan’s multi-layer visualization from their website:

Cyan Multi-Layer Visualization

Maybe Cyan can help wake Cisco up to the reality of building physical networks. Perhaps they will be an interesting acquisition target for Cisco, if not a competitor. I’ve thought the same thing about previous start-up platforms (like the impressive Hammerhead Systems L2.5 MSE platform) and I turned out to be wrong then. But perhaps the recovering economy will cast a new light on the situation. After all, we’re seeing that there is demand for bandwidth even in a broad economic down-turn. But there is money to be made only if platform vendors enable service providers to build economical networks, recognizing that profit margins are already razor-thin for most.