Petascale storage may trickle down to you

Research institute aims to improve storage in high-performance computer clusters

Disk dilemmas

Storage systems have the unfortunate quality of not scaling well. Here are some of the problems that PDSI researchers will try to solve:

-- Disk access times have not kept pace with disk capacity. In 1990, a computer could read an entire hard drive in under a minute. Now it takes three hours or so to read the largest disks. "It's only going to get worse, and it will take longer and longer to recover from a disk failure," Miller says.

-- As the number of disks in a system increases, so does the probability that one will fail in any given period of time. Right now, big systems at the national laboratories experience a failure once or twice a day, but with multipetabyte systems, that rate could increase to a failure every few minutes.

-- When a disk does fail, the remaining disks that must restore the affected data to another disk have to work even harder, increasing the chances that one of them will fail too.
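
The arithmetic behind these problems can be sketched in a few lines of Python. This is an illustrative sketch, not a PDSI model: the function names are hypothetical, the capacities, transfer rates and mean-time-between-failures (MTBF) figure are assumed values picked only to show the trend, and the failure model assumes disks fail independently at a constant rate.

```python
import math


def full_read_seconds(capacity_bytes, throughput_bytes_per_s):
    """Time to read an entire disk sequentially at a sustained rate."""
    return capacity_bytes / throughput_bytes_per_s


def mean_hours_between_failures(n_disks, mtbf_hours):
    """Average gap between failures anywhere in a system of n_disks.

    Assumes independent disks with a constant failure rate, so the
    system-wide failure rate is n_disks times the per-disk rate.
    """
    return mtbf_hours / n_disks


def prob_any_failure(n_disks, hours, mtbf_hours):
    """Probability that at least one of n_disks fails within `hours`."""
    return 1.0 - math.exp(-n_disks * hours / mtbf_hours)


if __name__ == "__main__":
    # Assumed, illustrative drive figures (not taken from the article).
    old = full_read_seconds(40e6, 1e6)      # a 40MB drive at 1MB/sec.
    new = full_read_seconds(750e9, 70e6)    # a 750GB drive at 70MB/sec.
    print(f"small old drive: {old:.0f} seconds to read in full")
    print(f"large new drive: {new / 3600:.1f} hours to read in full")

    # Assumed per-disk MTBF of 100,000 hours.
    for n in (1_000, 10_000, 100_000):
        gap = mean_hours_between_failures(n, 100_000)
        p_day = prob_any_failure(n, 24, 100_000)
        print(f"{n:>7} disks: a failure every {gap:.1f} hours on average, "
              f"P(at least one in a day) = {p_day:.3f}")
```

Under these assumptions the gap between failures shrinks in direct proportion to the number of disks, while the time needed to read (and therefore rebuild) a single disk grows as capacity outpaces transfer rate. Real drives do not fail independently, which is the point of the third problem above, so the sketch is if anything optimistic.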

Applications at the national labs -- for example, simulations of the aging of nuclear weapons -- can run for months. They generate huge amounts of data, in part because they periodically copy the contents of memory to disk as "checkpoints" in case a disk or processor fails. Researchers will look for faster checkpoint/restarting methods, better fault-tolerance technologies and more efficient file systems.
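
The checkpoint idea can be illustrated with a minimal sketch. It is a toy, not the labs' software: the function names, the JSON file format and the "simulation" (a running sum) are all assumptions made for illustration. The state is written to a temporary file and then renamed over the old checkpoint, so a crash in the middle of a write does not destroy the last good copy.

```python
import json
import os
import tempfile


def save_checkpoint(path, state):
    """Write state to a temp file, then atomically swap it into place."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "w") as f:
            json.dump(state, f)
            f.flush()
            os.fsync(f.fileno())  # push the data to disk before renaming
        os.replace(tmp, path)     # the old checkpoint survives until here
    except BaseException:
        if os.path.exists(tmp):
            os.remove(tmp)
        raise


def load_checkpoint(path):
    """Return the last saved state, or a fresh one if none exists."""
    if not os.path.exists(path):
        return {"step": 0, "total": 0}
    with open(path) as f:
        return json.load(f)


def run(path, n_steps, every, crash_at=None):
    """Toy long-running job: sum the step numbers, checkpointing as it goes.

    crash_at is a hypothetical hook that simulates a failure at that step.
    """
    state = load_checkpoint(path)
    while state["step"] < n_steps:
        if crash_at is not None and state["step"] == crash_at:
            raise RuntimeError("simulated failure")
        state["total"] += state["step"]
        state["step"] += 1
        if state["step"] % every == 0:
            save_checkpoint(path, state)
    save_checkpoint(path, state)
    return state
```

If a run started with run("ckpt.json", 10, 5, crash_at=7) fails, calling run("ckpt.json", 10, 5) again resumes from the checkpoint taken at step 5 rather than from the beginning. The trade-off is the one the researchers face at much larger scale: checkpointing more often loses less work after a failure but spends more time writing to disk.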

-- One promising approach that's now coming into use at the national labs is a technology called object storage, by which clients can access storage devices directly without going through a central file server. Object storage devices have processors attached to them so that lower-level functions, such as space management, can be handled by the devices themselves. And because data objects contain both data and metadata, it's possible to apply fine-grained, highly intelligent controls for security and other purposes. What's more, object-based storage systems tend to be much more scalable than traditional ones.

-- Researchers will also work on protocols and APIs, especially those related to Linux. They will help develop extensions to POSIX, the Portable Operating System Interface for Unix, to enable more effective use of file systems in highly parallel computer clusters. Researchers will also work with The Open Group and the Internet Engineering Task Force to make the Network File System protocols for file access more capable in highly parallel systems.

-- The PDSI will explore a number of emerging technologies, such as phase-change RAM, Miller says. PRAM, recently announced by Samsung Electronics, offers the speed of dynamic RAM with the nonvolatility of flash memory. Miller says it's the perfect place to put metadata because it can be accessed much more quickly than if it were on disk, thereby making object storage systems much faster.

-- Miller says PRAM might also be used to store indexes used by search engines, greatly accelerating them as well. That increased speed may prove to be of interest to businesses such as oil companies that have huge stores of private data but lack the enormous resources of a company like Google.

-- Few corporations will ever have systems the size of those at the national labs, with tens of thousands of disks, says Miller. But even desktop systems, which will have more and more disk drives over time, will experience some of the challenges the PDSI will address.

"I can't tell you yet which ones they will be," he says. "But problems at the high end have a nasty habit of trickling down to the low end."
