
Scale-out datacenter architectures

Blog post by Paul Muller, HP Blogger

"If you were plowing a field, which would you rather use: Two strong oxen or 1024 chickens?". I'm the first to admit is what one of the more bizarre conversations with developer and long-time friend Chris Cogdon. If memory serves it was 1989 and he was paraphrasing Seymour Cray and the tradeoffs inherit in scale-up vs scale-out. Scale-up (aka the oxen) won that round but in 2012 I believe their chickens might come home to roost (pun very much intended) with real-world planning implications for IT leaders in terms of improved price/performance and energy efficiency but also application development and team skills. First, let me shed some light of Chris’ analogy in order to set some context for the next couple of posts that can all trace their roots back to this idea. 

Back when I was a lad (please, no abacus jokes), the doomsday crowd was predicting that we'd very soon be bumping up against the upper limits of physics, and that the progress described by Moore's "law" would soon grind to a halt (sounds familiar?). While we now know that didn't happen, at the time computer scientists and vendors were scurrying around trying to determine how they could harness microprocessors to create mainframe-style "grunt" by scaling workloads across multiple CPUs.

Chris' point, as a developer, was that each architecture represented a choice, one bounded by the type of work you wanted to accomplish (for those inclined to get their geek on, see this Wiki article). That simple choice then drove everything else: operating system, programming language and storage architecture.

There were two competing camps: Seymour's "two strong oxen" model, championed by the mainframe/midrange system types (powerful RISC and CISC CPUs well suited to a handful of big, batch tasks), versus the "1024 chickens" camp (thousands of lower-power CPUs doing a large number of simple tasks repeatedly). Enterprises largely chose the former and the academic/science community the latter, the most famous example being Danny Hillis' now-defunct Thinking Machines and its 65,536-processor Connection Machine, a beast beloved of biochemists and nuclear weapons researchers everywhere. Virtually no-one except their creators could program the 64K-CPU beasts, and they all but died out - almost...

The relevance to today's CIO is that engineers at ".com" companies such as Facebook, Google and their progenitors recognized that serving lots of users performing the same basic task (like searching, or updating friends on what they just had for breakfast) was architecturally similar to the use-case of a physicist simulating thousands of chemical reactions, and as a result they borrowed liberally from those architectural and programming-language toolkits. Toolkits which are now migrating into the enterprise.

Programming scaled-out architectures no longer requires a PhD. The great news is the arrival of modern development languages and architectures, in particular Erlang, along with Ruby (including Ruby on Rails), Struts, PHP and Python. Having been developed in, or derived from, the days of massively parallel systems, these languages are behind the rapid growth of scale-out applications capable of tapping the power of the 1024 chickens without requiring an army of resident computer scientists (see the sketch below). As with COBOL and Fortran before them, this doesn't spell the end of Visual BASIC, C and Java, but it does mean that CIOs should be assessing their future skills readiness with HR and their leadership team and preparing plans to hire or develop skills for both developers and operations staff.
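To make that concrete, here's a minimal sketch in Python (one of the languages above) of the basic scale-out pattern these languages encourage: the same small, independent task farmed out across a pool of cheap workers rather than run as one big sequential job. The simulate() function, its inputs and the worker count are illustrative placeholders of my own, not any particular product or framework:

    # The "1024 chickens" pattern: many small, independent tasks spread
    # across a pool of inexpensive workers instead of one big sequential job.
    from multiprocessing import Pool

    def simulate(reaction_id):
        # Stand-in for one small unit of work, e.g. one chemical-reaction
        # simulation or one user's search query.
        return reaction_id * reaction_id

    if __name__ == "__main__":
        # Eight "chickens" here; on a scale-out cluster this becomes
        # hundreds or thousands of nodes, but the program barely changes.
        with Pool(processes=8) as pool:
            results = pool.map(simulate, range(1024))  # 1024 independent tasks
        print(sum(results))

The point isn't the arithmetic; it's that the runtime handles distributing the work, which is exactly what makes the chickens approachable without a resident computer scientist.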

What’s true of software is also true of hardware. With Web 2.0 applications and cloud architectures now designed to exploit distributed, scale-out arrays of cheaper compute, network and storage, I believe CIOs should be using early 2012 to re-evaluate their datacenter plans and standards to ensure that they’re building on a modernized, converged architecture capable of accommodating both scale-up and scale-out models without requiring a rip-and-replace between the two. (Not one to plug product, I’ll break with policy here and recommend you look at HP’s 3PAR storage as a great proof-point of what “2.0” hardware architectures can bring to the table.)

Granted, most of these skills are in highest demand at “.com” startups everywhere - but is anyone in the enterprise having trouble ramping up their skills fast enough, or is everyone getting by with the IT equivalent of Latin (i.e., dead but stable languages)?

P.S. Both of these topics naturally lead us to the big-data and ARM-vs-Intel discussions, each of which I’ll cover in separate posts - stay tuned.


Discussion
Pearl Zhu 90 Points | Tue, 01/17/2012 - 17:40

Hi Paul, interesting blog exploring the computing trend of hardware/software convergence: the monolithic architecture and infrastructure may need to be modernized and modularized with mosaic-style cloud/mobile/social digital technology. From an EA perspective, we may need to quote Einstein again: "If I have one hour to do a piece of work, I may need to spend 50 minutes thinking through the problem and 10 minutes solving it." Thanks.

Paul Muller 119 Points | Tue, 01/17/2012 - 03:24
As an aside, Fast Company just posted a great article about coding for non-coders and the unnecessarily opaque nature of programming languages http://www.fastcodesign.com/1665735/why-arent-computer-programming-langu...
John Dodge 1439 Points | Tue, 01/17/2012 - 14:59

"That's not Greek. It's Klingon." LOL

Frankly, I find a story that uses the word "opaque" about difficult programming language interfaces to be guilty of the same sin as the programming languages themselves.   

Paul Muller 119 Points | Tue, 01/17/2012 - 17:38

Fair point!

John Dodge 1439 Points | Tue, 01/17/2012 - 18:06

BTW, I checked on Youtube and could not find any videos of 1,024 chickens pulling a plow.

Paul Muller 119 Points | Tue, 01/17/2012 - 23:18

You just *know* that now you've said it, one will be made! Here's hoping at least.

Shall I call PETA or shall you?

Out of curiosity, am I the only one who remembers Seymour's amazing (but doomed) machines?

John Dodge 1439 Points | Wed, 01/18/2012 - 00:15

Heck no. The revered Cray supercomputers used by large oil companies, the U.S. National Security Agency and other enterprises needing gobs of number crunching were made just outside the city my wife grew up in - Eau Claire, Wis. In fact, she interviewed him in Chippewa Falls, Wis., early in her 16 years as an editor at Computerworld. It was engineer Steve Chen whom Cray mentored and who also set up a supercomputer shop in Eau Claire.

Here's one for you: what company did Cray leave to set up Cray Supercomputer? And what unique and pioneering exec led that company? (w/o looking it up!)


Paul Muller 119 Points | Thu, 01/19/2012 - 00:46

Control Data for the first answer (I believe) and the second, wow, not sure - I would have guessed it was Cray himself who led it, but now I'm not sure...

I remember going to visit my first Cray (an X-MP). At the time it was being sold as a time-share (proto-cloud?), mainly processing seismic data for gas/mining.

The legend of Cray is astounding. Completely mad attention to detail; sadly, the chickens won....