Facebook rolls out storage system to handle photo stores

Homegrown system, Facebook Haystack, built to handle multiplying files of user photos

Needing to better deal with 50 billion files worth of photos, engineers at Facebook are installing a new photo storage system they say is 50% faster than traditional systems.

The storage system, dubbed Haystack, has been under development in-house for the past couple of years and Facebook has been rolling it out in limited test versions to parts of the network for the past few months. The company expects to use Haystack to store all Facebook photos by next week, according to Bobby Johnson, director of engineering at Facebook.

And Jonathan Heiliger, vice president of technical operations, told Computerworld today that based on their tests, Haystack is more than 50% faster than traditional photo storage systems.

"In terms of cost, if it's twice as efficient, we can have 50% less hardware," said Johnson. "With 50 billion files on disk, the cost adds up. It's essentially give us some [financial] headroom."

Johnson and Heiliger said they began building the new storage system to better handle the growing number of photos Facebook has to store. Many of their 175 million and 200 million users share photos of everything from their pets to vacations, weddings to days at the beach. That means that users are posting and calling up their own photos, as well as their friends' and family members' photos, as well. Keeping those trains running efficiently and on time was a growing challenge.

Johnson noted that Facebook deals with 15 billion photos - not including all of the replications. Their user data grows by 500GB a day. And they have 50 million requests per second to their backend servers.

A spokesman for Facebook said more specifics about the system will be released in a few weeks.

Johnson, though, said the new system is so much faster than the previous one because of changes made to its setup. Haystack is tailored for small files that don't change very often, instead of for a small number of large files that are changing all the time. Traditional file directories also need file names and a lot of resource cost goes to just finding the files. With the new system, however, he said they don't have to deal with directory structures or file names. They use ID numbers instead of names. That mapping is very small.

Johnson said that so far the rollout of the new system has gone very smoothly.

Five-year-old Facebook's user base passed one-time leader MySpace last year, according to a recent report.

Facebook, once thought of as the up-and-coming social network, had almost 222 million unique visitors last month, while MySpace came in at 125 million, according to online researcher comScore Inc. That's a dramatic change since the Facebook-MySpace race for unique visitors was a near dead heat in April 2008.

The company is closing in on a big milestone -- 200 million users, executives said today.

More about: Billion, Facebook, Leader, Leader Computers, Milestone
References show all

Comments

Post new comment

The content of this field is kept private and will not be shown publicly.
Users posting comments agree to the Computerworld comments policy.
Login or register to link comments to your user profile, or you may also post a comment without being logged in.
Related Coverage
Related Whitepapers
Latest Stories
Community Comments
Tags: Facebook, online storage, storage
Whitepapers
All whitepapers
Sign up now to get free exclusive access to reports, research and invitation only events.
Featured Download
/downloads/product/58/seamonkey/

Seamonkey

Seamonkey includes an Internet browser, email and newsgroup client with an included web feed reader, HTML editor, IRC chat and web development tools. SeaMonkey will ...

Computerworld newsletter

Join the most dedicated community for IT managers, leaders and professionals in Australia