Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How does Facebook store it's log data?

I recently started using Scribe, Facebooks solution for transferring and collecting log data from many different servers.

What I could not find is how Facebook stores the huge amounts of log data it gets (according to a presentation it was 25TB per day in 2009).

Has Facebook released any information on how they do it? Hadoop HDFS? Cassandra?

like image 730
theduke Avatar asked Sep 16 '25 18:09

theduke


1 Answers

They use Hive on top of Hadoop. Cassandra is used for their email/messaging, not logging. Some links:

https://developers.facebook.com/opensource/

http://highscalability.com/blog/2008/11/24/product-scribe-facebooks-scalable-logging-system.html

http://wiki.apache.org/hadoop/Hive

like image 135
Brent Baisley Avatar answered Sep 18 '25 06:09

Brent Baisley