For Server Makers, Itanium cpu’s still matter
March 2010
It’s been a long time coming, but earlier this week, Intel finally launched “Tukwila,” the latest iteration in its Itanium family of high-end microprocessors.
Coming on the same day as IBM introduced both its Power7 chip (see below) and the first of an associated line of servers, Tukwila didn’t garner as much attention as it might have otherwise. It’s also true that today’s Itanium is something of a specialty product. But that doesn’t make it irrelevant.
Tukwila will be the first Itanium to incorporate Intel’s serial processor communications link (QuickPath Interconnect, or QPI) and integrated memory controllers. These features boost performance considerably and are standard fare for the current generation of server microprocessors. They also mean that the Itanium 9300, as Tukwila is officially known, and the Intel’s upcoming Nehalem-EX Xeon (x86) processor can, in principle, be supported by the same system design.
In practice, this convergence was a more interesting selling point in the days when Intel envisioned a broader market for Itanium processors. Nonetheless, it will still let Intel and its manufacturing partners take advantage of Xeon design work and dollars for Itanium. The specifics of the chip aside, though, it’s not unreasonable to ask whether any of this matters. Given that both AMD and Intel’s high-end x86 processors get more capable by the year, why does anyone need Itanium?
Certainly, Itanium’s market position today is not the one envisioned by Intel and Hewlett-Packard, when they first started designing the processor in the mid-1990s. They had conceived of it as a 64-bit processor family running Windows and (perhaps) a united Unix that would emerge as the de facto standard, when the time came to move beyond the increasingly restrictive memory limits imposed by 32-bit processors.
The reasons why this didn’t happen are numerous, and it would take an extended discussion to give them their due. However, some of the big ones include an overly ambitious concept; delays coupled with bad timing; a focus on instruction-level parallelism, when the world would soon move to more of an emphasis on threads; and AMD’s introduction of 64-bit extensions for x86.
Today, by contrast, just one company, HP, accounts for about 85 percent of the market for Itanium processors, with the balance mostly going to several large Japanese computer system vendors. HP uses Itanium in its Integrity line, for which it mostly runs HP-UX (HP’s Unix) and NonStop (the descendent of Tandem’s fault-tolerant operating system) applications.
One company may not sound like much of a market but, in fact, vendor-specific processors were long the norm in the computer industry and only went by the wayside when x86 matured. And even today, IBM continues to aggressively roll out new Power processors, and Oracle says it plans to continue developing Sparc chips. Each of these cases is a bit different, but the basic point is that it’s not outlandish to imagine that a major vendor’s product line could support a unique microprocessor.
But why would HP want to, given that this is a company that also has a major x86 product line? In a word: software.
It is likely, perhaps even certain, that if HP could wave a magic wand, and have HP-UX and all its myriad applications run on Xeon tomorrow, it would do so. However, there is no such magic wand.
The closest to such a wand, dynamic binary translation (DBT), works in some limited contexts. IBM uses it for certain Linux applications on Power chips, and Apple used DBT to aid migrating from PowerPC to Intel chips. But, for the most part, IT shops won’t or can’t use it for the sort of critical applications that run on HP-UX today. Indeed, HP developed its own DBT technology, when it was initially moving applications to Itanium in the first place, under the name “Aries.” Few used it.
It took many painful years–the better part of the last decade–for HP and its software partners to re-establish HP-UX’s software catalog on Itanium when it migrated off PA-RISC. To start this process anew for Xeon is simply unthinkable.
And even if the features of Xeon have largely achieved parity with Itanium, the same isn’t generally true of the platforms as a whole. HP-UX is a mature commercial Unix operating system in the mold of AIX and Solaris. Linux and Windows gain in capability and robustness with each passing year, but they’re not yet at the same point. The contrast with NonStop is even more striking. This is, after all, a line of systems that powers about 75 of the 100 largest fund transfer networks around the world.
In short, Integrity brings a lot of money into HP, and it provides customers with capabilities that they can’t necessarily get on Xeon-based platforms. And, in any case, HP-UX customers can’t necessarily just pick up and move. Migrations take effort and money, and they have a degree of risk, even if the end state is ultimately a more desirable place to be.
In addition to introducing Tukwila, Intel provided additional detail about its successor, “Poulson.” Scheduled for about two years hence, it will skip a process generation and launch using 32-nanometer technology. This should bring it more in line with the then-contemporary Xeon processors than the 65nm Tukwila is, relative to today’s 45nm Nehalem. (The process generation is significant because it’s closely related to the amount of real estate on the chip and therefore to features such as the number of cores and the amount of cache.)
Without providing much in the way of details, Intel also indicated that Poulson will have other architectural enhancements that go beyond simply being a process shrink. “Kittson” will be the generation after that.
Plans can change, of course, and processors can slip. However, barring seismic changes, Intel sketched out a road map for something like a decade’s worth of Itanium processors. I don’t really expect these Itaniums to set a lot of performance records, but there’s no reason to think that they won’t be “in the ballpark.” It’s worth remembering that Sun sold lots of Sparc systems long after they had a “hot” microprocessor. The inertia in applications, skills, and general risk aversion in high-end servers is enormous.
Itanium doesn’t matter, when it comes to volume computing. It fought that battle and lost. But it remains an integral component in a major product line at a major systems vendor. And it remains a component that, in a world without magic wands, can’t be easily replaced.
Intel meets its match in IBM
February 2010
There are few, if any, microprocessor manufacturers equal to Intel. IBM, however, is a very large exception.
By the time Intel had introduced its latest processor for servers, the Itanium 9300, on Monday, IBM had already stolen Intel’s thunder with its new Power7 chip technology, announced earlier in the morning.
And rightfully so: the Power7 is impressive. It has eight cores, while Intel’s Itanium 9300 (PDF) has four. And each of the Power7’s cores is capable of four threads, or tasks, compared to Itanium’s two per core.
Although both companies are touting dozens of other features–for example, better thread performance and improved scaling of workloads–IBM is taking a lead in marquee features for the lucrative high-end server market.
“While Intel is talking about a 2x [two-times] performance boost per chip, IBM is talking about almost an 8x [eight-times],” said Nathan Brookwood, principal analyst at Insight64. “IBM has gone from two cores to eight cores per (chip). And each of the cores is roughly twice as fast as the [prior-generation] Power6,” according to Brookwood, adding that IBM was already ahead of Intel to begin with.
“Buyers who are sitting on the fence and have an application that could go either way (Itanium or Power7), may find that the Power7 offers a more attractive platform,” Brookwood said, acknowledging that Itanium or Intel’s upcoming eight-core Nehalem-EX processor would be a good choice for those seeking to use popular applications such as SQL Server that run on Windows and are not supported on the Power7.
IBM Blue Waters supercomputer
Power7 has another leg up on Itanium: it is already being used to construct what may be the fastest supercomputer in the world at the renowned National Center for Supercomputing Applications at the University of Illinois, a credible claim, considering the center’s history–and IBM’s, whose chips in the past have powered the world’s fastest supercomputers.
The Blue Waters project supercomputer, housed in a special building on the Urbana-Champaign campus, will theoretically be capable of achieving up to 10 petaflops, about 10 times as fast as the fastest IBM supercomputers today. (A petaflop is 1 quadrillion floating-point operations per second, a key indicator of supercomputer performance.)
One of the novel ways the University of Illinois is able to cram together the thousands of Power7 chips used in the Blue Waters supercomputer is by tapping a novel memory technology called eDRAM, or embedded dynamic access memory, for the level-3 cache memory.
Widely-used “static” RAM memory, or SRAM, is used as the on-chip memory in almost all processors today and can add as much as a billion transistors to high-end processors. IBM wanted to avoid these ballooning–and costly–chip counts, electing to use eDRAM, keeping the total number of transistors to 1.2 billion per processor. (Power7 uses SRAM in the smaller-capacity 32KB level-1 and 256KB level-2 cache; eDRAM is used in the large-capacity 32MB level-3 cache.)
The equivalent number of transistors using traditional SRAM would be well in excess of 2 billion, Bradley McCredie, an IBM fellow in the Systems and Technology Group, said in a CNET interview last year. (Two billion, in fact, is the chip count Intel is citing for the Itanium 9300.)
IBM has stated that the chip’s speed–between 3.5GHz and 4GHz–actually has a lower rating than the previous Power6 chip, which ran at 5GHz. “We have gotten performance from other spots, such as the dense eDRAM. We had to back off from the gigahertz in order to get eight of these cores onto the chip and not have it melt,” according to McCredie.
Intel, of course, is not standing still. The next-generation Itanium will be made on a cutting-edge 32-nanometer production process, and its upcoming Nehalem-EX server chip may go a long way toward leveling the playing field. But IBM has shown that it has the technological and manufacturing wherewithal to keep Intel on its toes, if not racing to keep up.