How did Facebook manipulate the Hive storage format to enable it to deal with a data warehouse that stores some 300 petabytes and takes in about 600 terabytes per day? RCFile (record-columnar file format) wasn’t enough, so enter ORCFile.
How is Facebook able to quickly process the sort of queries about users and their friends generated by features such as Graph Search, despite the fact that the relevant data may be stored on several different servers? Software Engineers Alessandro Presta and Alon Shalita offered an example of how the social network uses graph-processing system Apache Giraph to handle those tasks in a post on its engineering blog.
The newly announced second building at Facebook’s data center in Luleå, Sweden, will be the first of the social network’s data centers to be built using its new rapid deployment data center concept, which leans on modular and lean construction principles, much like those demonstrated by Swedish furniture giant Ikea.
With OCP Summit V taking place at the San Jose Convention Center in San Jose, Calif., Tuesday and Wednesday, Open Compute Project Chairman and President Frank Frankovsky, vice president of hardware design and supply chain operations at Facebook, discussed the group’s achievements over the past year in a post on the Open Compute Project blog.
Every operations staffer at one of Facebook’s data centers can manage at least 20,000 servers, with some handling as many as 26,000, Facebook Director of Data Center Operations Delfina Eberly said during her keynote speech Tuesday at the 7×24 Exchange 2013 Fall Conference, as reported by Data Center Knowledge.
Internet.org — the global partnership formed by Facebook, Ericsson, MediaTek, Nokia, Opera, Qualcomm, and Samsung, with the goal of connecting the two-thirds of the world’s population currently without Internet access — released a white paper Monday on the important role efficiency must play in achieving that goal.
Facebook uses so many resources just to save the 240 billion-plus photos that are on the social network. Now the company is utilizing cold storage at its Prineville, Ore., data center to make sure older photos can be as easily accessed as the ones users uploaded five minutes ago.
How does Facebook manage its ever-growing plethora of servers, configurations, administrative access policies, and the other tasks that go along with its volumes of data? Part of that answer now lies in Private Chef from cloud infrastructure automation provider Opscode.