Many tables or rows, which one is more efficient in SQL?

I'm building a program that stores news headlines for companies, along with their timestamps, from various sources.

Let's say there are 1000 companies: Apple, Google, Microsoft, etc.

I can think of two options.

One table with numerous rows (the code below is just an example).

CREATE TABLE news
(
    news_id INT NOT NULL AUTO_INCREMENT PRIMARY KEY,
    company VARCHAR(10) NOT NULL,
    timestamp TIMESTAMP NOT NULL,
    source TEXT NOT NULL,
    content TEXT NOT NULL,
    ...
)

I could also make (company, timestamp) the primary key, with news_id as a unique key.

1000 Tables

CREATE TABLE news_apple -- and news_google, news_microsoft, news_... (x 1000)
(
    news_id INT NOT NULL AUTO_INCREMENT PRIMARY KEY,
    timestamp TIMESTAMP NOT NULL,
    source TEXT NOT NULL,
    content TEXT NOT NULL,
    ...
)

Most of the time, I will look up the news for one particular company. Let's say there are more than 10000 news items for each company. I wonder whether using a 'WHERE' clause in the first option would be slower than querying the second option.

Which one is more efficient in terms of performance and why?

333

asked Jan 22 '14 03:01

KimchiMan

1 Answer

Relational databases are designed to store many rows per table. There are a whole bunch of mechanisms to facilitate large tables, such as:

Indexes on any combination of fields to speed searches
Page caching so commonly used pages remain in memory
Vertical partitioning (columnar databases) to further speed requests
Advanced algorithms such as hash joins and group bys (MySQL long lagged other databases here; it only gained hash joins in version 8.0.18)
Use of multiple processors and disks to process queries
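
As a rough illustration of the first mechanism, here is a minimal sketch of the single-table design with a composite index on (company, timestamp). It uses Python's built-in sqlite3 module as a stand-in so that it runs anywhere; that is an assumption for illustration, not the MySQL setup from the question. The index name idx_company_ts and the sample rows are made up. The printed query plan should show the WHERE clause being answered through the index rather than by reading every row.

```python
import sqlite3

# In-memory SQLite database: a stand-in for MySQL, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE news (
        news_id   INTEGER PRIMARY KEY AUTOINCREMENT,
        company   TEXT NOT NULL,
        timestamp TEXT NOT NULL,
        source    TEXT NOT NULL,
        content   TEXT NOT NULL
    )
    """
)
# Hypothetical index name; it covers the "news for one company" lookup.
conn.execute("CREATE INDEX idx_company_ts ON news (company, timestamp)")

# Made-up sample data: 28 headlines for each of three companies.
rows = [
    (company, f"2014-01-{day:02d} 09:00:00", "example-source", f"headline {day}")
    for company in ("apple", "google", "microsoft")
    for day in range(1, 29)
]
conn.executemany(
    "INSERT INTO news (company, timestamp, source, content) VALUES (?, ?, ?, ?)",
    rows,
)

query = (
    "SELECT timestamp, content FROM news "
    "WHERE company = ? ORDER BY timestamp DESC LIMIT 5"
)

# Ask the planner how it would run the query; the last column is the detail text.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, ("apple",)).fetchall()
plan_text = " ".join(row[3] for row in plan)
latest = conn.execute(query, ("apple",)).fetchall()

print(plan_text)
print(latest[0])
```

The exact plan wording differs between SQLite versions and from MySQL's EXPLAIN output, but the idea carries over: with an index that leads with company, the database jumps straight to one company's rows instead of scanning the other 999 companies' data.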

There is one thing that is more difficult when putting data in a single table, and that is security. In fact, in some circumstances this is a primary concern and basically requires that the data go in separate tables. Those applications are few and far between.

To give an example of how bad storing data in multiple tables could be, imagine that in your system you have one record per company and you store it in a table. This record stores information about the company -- something like name, address, whatever. Call it 100 bytes of information.

In your schema there is a separate table for each "company", so that is one row per table. That record will reside on one data page. A data page could be 16 kbytes, so you are wasting about 15.9 kbytes to store this data. Storing 1000 such records occupies 16 Mbytes instead of about 7 pages' worth (112 kbytes). That can be a significant performance hit.
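
The arithmetic above can be checked with a short sketch. It uses the same figures (16 kbyte pages, 100-byte records, 1000 companies) and deliberately ignores per-page and per-row overhead, so the numbers are a lower bound rather than what any particular storage engine would report.

```python
import math

PAGE_SIZE = 16 * 1024   # bytes per data page (the 16 kbyte figure)
RECORD_SIZE = 100       # bytes per company record
NUM_COMPANIES = 1000

# One table per company: every table occupies at least one page of its own.
separate_tables_bytes = NUM_COMPANIES * PAGE_SIZE

# One shared table: records are packed together onto as few pages as possible.
shared_pages = math.ceil(NUM_COMPANIES * RECORD_SIZE / PAGE_SIZE)
shared_table_bytes = shared_pages * PAGE_SIZE

print(separate_tables_bytes // 1024, "kbytes across 1000 tables")
print(shared_pages, "pages,", shared_table_bytes // 1024, "kbytes in one table")
```

That is 16000 kbytes (about 16 Mbytes) versus 7 pages, or 112 kbytes, which matches the estimate in the answer.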

In addition, with multiple tables you are not taking into account the challenges of maintaining all the tables and ensuring the correctness of data in the different tables. Maintenance updates need to be applied to thousands of tables, instead of a handful.
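
To make the maintenance point concrete, here is a small sketch comparing a schema change and a cross-company query under both designs. It again uses Python's sqlite3 module as a stand-in for MySQL, three companies instead of 1000, and a hypothetical url column; all of these are assumptions for illustration only.

```python
import sqlite3

# Three stand-in companies; the question has about 1000.
companies = ["apple", "google", "microsoft"]
conn = sqlite3.connect(":memory:")

# Design A: one table per company. Table names come from a fixed, trusted
# list here; never interpolate user input into SQL like this.
for name in companies:
    conn.execute(
        f"CREATE TABLE news_{name} ("
        "news_id INTEGER PRIMARY KEY, timestamp TEXT NOT NULL, content TEXT NOT NULL)"
    )
    conn.execute(
        f"INSERT INTO news_{name} (timestamp, content) VALUES (?, ?)",
        ("2014-01-22 03:01:00", f"{name} headline"),
    )

# Design B: one shared table with a company column.
conn.execute(
    "CREATE TABLE news ("
    "news_id INTEGER PRIMARY KEY, company TEXT NOT NULL, "
    "timestamp TEXT NOT NULL, content TEXT NOT NULL)"
)
conn.executemany(
    "INSERT INTO news (company, timestamp, content) VALUES (?, ?, ?)",
    [(name, "2014-01-22 03:01:00", f"{name} headline") for name in companies],
)

# Schema change: adding the hypothetical `url` column takes one statement
# per table in design A, but a single statement in design B.
alter_many = [f"ALTER TABLE news_{name} ADD COLUMN url TEXT" for name in companies]
alter_one = ["ALTER TABLE news ADD COLUMN url TEXT"]
for statement in alter_many + alter_one:
    conn.execute(statement)

# Counting all headlines: design A has to stitch every table together.
union_sql = " UNION ALL ".join(f"SELECT content FROM news_{name}" for name in companies)
count_many = conn.execute(f"SELECT COUNT(*) FROM ({union_sql})").fetchone()[0]
count_one = conn.execute("SELECT COUNT(*) FROM news").fetchone()[0]

print(len(alter_many), "ALTER statements vs", len(alter_one))
print(count_many, count_one)
```

With 1000 companies the per-table list grows to 1000 ALTER statements and a 1000-branch UNION ALL, and any statement that fails partway leaves the tables out of sync with each other.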

120

answered Sep 21 '22 13:09

Gordon Linoff


Tags: sql, database, mysql, schema, relation
