Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

What databases do the World Wide Web's biggest sites run on? [closed]

People also ask

What databases do big companies use?

1. The Oracle. Oracle is the most widely used commercial relational database management system, built-in assembly languages such as C, C++, and Java.

What database do most websites use?

Common Database TypesMySQL – MySQL is probably the most common database on the web. It's free to use and open source, so a lot of projects have been built around it.

What database does Stackoverflow use?

Technology. Stack Overflow is written in C# using the ASP.NET MVC (Model–View–Controller) framework, and Microsoft SQL Server for the database and the Dapper object-relational mapper used for data access.


Facebook.com

  • MySQL with MyRocks. Used to store user info and social activities such as likes, comments, and shares.
  • Hive (Data warehouse for Hadoop, supports tables and a variant of SQL called hiveQL). Used for "simple summarization jobs, business intelligence and machine learning and many other applications"
  • Cassandra (Multi-dimensional, distributed key-value store). Currently used for Facebook's private messaging.

Currently running 610 (soon to be 1000) Hadoop nodes in a single cluster with Hive datastore. Both Hive and Cassandra have been open-sourced by Facebook.

Facebook stats:

  • More than 200 million active users
  • More than 100 million users log on to Facebook at least once each day
  • More than 30 million users update their statuses at least once each day
  • Average user has 120 friends on the site

Sources:

  • http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/
  • http://www.facebook.com/note.php?note_id=89508453919
  • http://www.facebook.com/press/info.php?statistics
  • http://hadoop.apache.org/hive/
  • http://wiki.apache.org/hadoop/Hive/Design
  • http://www.facebook.com/note.php?note_id=24413138919
  • https://code.facebook.com/posts/190251048047090/myrocks-a-space-and-write-optimized-mysql-database

Stack Overflow - SQL Server.

Jeff Atwood wrote a nice blog post on this

https://blog.stackoverflow.com/2008/09/what-was-stack-overflow-built-with/


LinkedIn.com

  • Oracle (Relational Database)
  • MySQL (Relational Database)

Databases replicated on multiple servers for high availability. Each specific Service uses its own domain-specific DB.

LinkedIn stats:

  • 22 million members
  • 4+ million unique visitors/month
  • 40 million page views/day
  • 2 million searches/day

Sources:

  • http://hurvitz.org/blog/2008/06/linkedin-architecture/

Flickr uses MySQL.

YouTube uses MySQL but they are moving to Google's BigTable.

Myspace uses SQL Server.

Wikipedia uses MySQL.


Microsoft.com

  • SQL Server (no surprise there)

Microsoft.com stats:

  • 250 million unique visits/month.
  • 70 million page views/day.
  • 15,000 connections/second.
  • Maintains an average of 35,000 concurrent connections to a total of 80 Web servers.

Sources:

  • http://technet.microsoft.com/en-us/mscomops/default.aspx

Yahoo.com

  • PostgreSQL (modified) - A client can connect to any of the nodes in the cluster (or a policy restricted subset). A query flows from the client to the server it chose to connect with. The SQL compiler on that node compiles and optimizes the query on that single node (no parallelism).

Yahoo.com stats:

  • 24 billion events a day
  • 2-petabyte, claims largest database (Mar 2008)

Source:

  • http://perspectives.mvdirona.com/2008/05/23/PetascaleSQLDBAtYahoo.aspx
  • http://www.computerworld.com/s/article/9087918/Size_matters_Yahoo_claims_2_petabyte_database_is_world_s_biggest_busiest