What is the reason not to use select *?

Tags:

sql

People also ask

Why do we use SELECT * FROM in SQL?

An asterisk (" * ") can be used to specify that the query should return all columns of the queried tables. SELECT is the most complex statement in SQL, with optional keywords and clauses that include: The FROM clause, which indicates the table(s) to retrieve data from.

What does SELECT * mean?

SELECT tells the database to retrieve data. The asterisk (*) means "all columns", so SELECT * means "return every column". FROM names the table the data is selected from.

What is the use of SELECT * from table?

The SELECT statement is used to select or retrieve the data from one or more tables. You can use this statement to retrieve all the rows from a table in one go, as well as to retrieve only those rows that satisfy a certain condition or a combination of conditions.

Why using SELECT * in an SQL query is a bad practice?

Using SELECT * when you only need a couple of columns means a lot more data is transferred than you need. This adds processing on the database and increases latency in getting the data to the client.

The essence of the quote about not optimizing prematurely is to write simple, straightforward code first and then use a profiler to find the hot spots, which you can then optimize.

When you use select * you make it impossible to profile, so you are not writing clear and straightforward code and you are going against the spirit of the quote. select * is an anti-pattern.

So selecting columns is not a premature optimization. A few things off the top of my head:

- If you specify columns in a SQL statement, the SQL execution engine will raise an error if one of those columns is removed from the table and the query is executed.
- You can more easily search the code to find where a column is being used.
- You should always write queries to bring back the least amount of information.
- As others mention, if you use ordinal column access you should never use select *.
- If your SQL statement joins tables, select * gives you all columns from all tables in the join.
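To illustrate the first point, here is a minimal sketch using Python's built-in sqlite3 module. The users table and its columns are made up for the example, and the column removal is simulated by rebuilding the table so it also runs on older SQLite builds. Other engines word the error differently, but the idea should carry over.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, email TEXT)")
conn.execute("INSERT INTO users (name, email) VALUES ('Ann', 'ann@example.com')")

# Simulate a schema change that removes the email column
# (hypothetical migration: rebuild the table without it).
conn.execute("CREATE TABLE users_new (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO users_new SELECT id, name FROM users")
conn.execute("DROP TABLE users")
conn.execute("ALTER TABLE users_new RENAME TO users")

def run_explicit_query(conn):
    """Return the error text if the explicit column list no longer matches."""
    try:
        conn.execute("SELECT id, name, email FROM users").fetchall()
    except sqlite3.OperationalError as exc:
        return str(exc)
    return None

# SELECT * keeps working and silently returns fewer columns.
star_rows = conn.execute("SELECT * FROM users").fetchall()

# The explicit query fails loudly and names the missing column.
explicit_error = run_explicit_query(conn)
print(star_rows, explicit_error)
```

The explicit query stops at the database with an error that names the column, while the select * version passes the problem on to whatever code reads the rows.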

The corollary is that when you use select *:

- The columns used by the application are opaque.
- DBAs and their query profilers are unable to help with your application's poor performance.
- The code is more brittle when changes occur.
- Your database and network suffer because they are bringing back too much data (I/O).
- Database engine optimizations are minimal, as you're bringing back all data regardless (logical).
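As a rough sketch of how much extra a select * on a join brings back, the following uses Python's sqlite3 module with two hypothetical tables, customers and orders. The table names, columns and aliases are assumptions for the example only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, address TEXT);
CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL, notes TEXT);
INSERT INTO customers VALUES (1, 'Ann', '1 Main St');
INSERT INTO orders VALUES (10, 1, 25.0, 'gift wrap');
""")

# SELECT * on a join returns every column of both tables.
star = conn.execute(
    "SELECT * FROM orders JOIN customers ON customers.id = orders.customer_id"
)
star_columns = [col[0] for col in star.description]

# Naming the columns returns only what the caller asked for,
# and the aliases document what each value means.
explicit = conn.execute(
    "SELECT orders.id AS order_id, customers.name AS customer_name, "
    "orders.total AS order_total "
    "FROM orders JOIN customers ON customers.id = orders.customer_id"
)
explicit_columns = [col[0] for col in explicit.description]

print(len(star_columns), explicit_columns)
```

The select * result carries all seven columns of the two tables, including both id columns, while the explicit version carries three.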

Writing correct SQL is just as easy as writing select *. So the truly lazy person writes proper SQL, because they don't want to revisit the code and try to remember what they were doing when they wrote it. They don't want to explain every bit of code to the DBAs. They don't want to explain to their clients why the application runs like a dog.

If your code depends on the columns being in a specific order, your code will break when there are changes to the table. Also, you may be fetching too much from the table when you select *, especially if there is a binary field in the table.
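A small sketch of the column-order problem, again with Python's sqlite3 module. The users table and the helper first_row are made up for the example; the two CREATE TABLE statements stand in for the schema before and after somebody reorders the columns.

```python
import sqlite3

def first_row(create_sql):
    """Build a one-row users table and fetch it with and without SELECT *."""
    conn = sqlite3.connect(":memory:")
    conn.execute(create_sql)
    conn.execute(
        "INSERT INTO users (id, name, email) VALUES (1, 'Ann', 'ann@example.com')"
    )
    star = conn.execute("SELECT * FROM users").fetchone()
    explicit = conn.execute("SELECT id, name, email FROM users").fetchone()
    conn.close()
    return star, explicit

# Same data, but the second schema declares the columns in a different order.
old_star, old_explicit = first_row(
    "CREATE TABLE users (id INTEGER, name TEXT, email TEXT)"
)
new_star, new_explicit = first_row(
    "CREATE TABLE users (id INTEGER, email TEXT, name TEXT)"
)

# Code that reads row[1] as "the name" silently gets the email after the change.
print(old_star[1], new_star[1])
# The explicit column list returns the same tuple for both schemas.
print(old_explicit == new_explicit)
```

With the explicit column list, positional access stays tied to the query text rather than to the table definition.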

Just because you are using all the columns now, it doesn't mean someone else isn't going to add an extra column to the table.

It can also add overhead to execution plan caching, since the engine has to fetch the metadata about the table to know which columns are in *.

One major reason is that if you ever add or remove columns from your table, any query or procedure that makes a SELECT * call will now get more or fewer columns of data than expected.
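For example, a sketch with Python's sqlite3 module, where the users table and the added avatar column are hypothetical:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO users (name) VALUES ('Ann')")

def column_counts(conn):
    """Return how many columns each style of query yields per row."""
    star = conn.execute("SELECT * FROM users").fetchone()
    explicit = conn.execute("SELECT id, name FROM users").fetchone()
    return len(star), len(explicit)

before = column_counts(conn)

# Someone else adds a column later.
conn.execute("ALTER TABLE users ADD COLUMN avatar BLOB")

after = column_counts(conn)
print(before, after)
```

After the ALTER TABLE, the select * row has grown by one column while the explicit query is unchanged, so code such as `user_id, name = row` would only keep working with the explicit version.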

In a roundabout way you are breaking the modularity rule about using strict typing wherever possible. Explicit is almost universally better.
Even if you now need every column in the table, more could be added later, which will be pulled down every time you run the query and could hurt performance. It hurts performance because:
- you are pulling more data over the wire; and
- you might defeat the optimizer's ability to pull the data right out of the index (for queries on columns that are all part of an index) rather than doing a lookup in the table itself.
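The index point can be observed in SQLite with EXPLAIN QUERY PLAN. This is only a sketch: the users table and the index name are invented, the exact plan text varies between SQLite versions, and other engines report covering-index access in their own way.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, email TEXT, bio TEXT)"
)
# Hypothetical index that contains both columns the explicit query needs.
conn.execute("CREATE INDEX idx_users_name_email ON users (name, email)")

def plan_for(sql):
    """Return SQLite's human-readable plan description for a query."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    # The description is the last field of each plan row.
    return " | ".join(str(row[-1]) for row in rows)

# Every requested column lives in the index, so the table is never touched.
explicit_plan = plan_for("SELECT name, email FROM users WHERE name = 'Ann'")

# SELECT * also needs bio, so each index hit requires a lookup in the table.
star_plan = plan_for("SELECT * FROM users WHERE name = 'Ann'")

print(explicit_plan)
print(star_plan)
```

In SQLite the first plan should mention a covering index, and the second should show an ordinary index search followed by table lookups.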

When TO use select *

When you explicitly NEED every column in the table, as opposed to needing every column in the table THAT EXISTED AT THE TIME YOU WROTE THE QUERY. For example, if you were writing a DB management app that needed to display the entire contents of a table (whatever they happened to be), you might use that approach.
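A minimal sketch of that kind of tool, using Python's sqlite3 module. The function dump_table and the users table are hypothetical; the point is that the code reads the column names from the cursor instead of assuming them.

```python
import sqlite3

def dump_table(conn, table_name):
    """Return (column_names, rows) for every column the table has right now."""
    # Identifiers cannot be bound as parameters, so quote the name by hand.
    quoted = '"' + table_name.replace('"', '""') + '"'
    cursor = conn.execute("SELECT * FROM " + quoted)
    column_names = [col[0] for col in cursor.description]
    return column_names, cursor.fetchall()

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO users (name) VALUES ('Ann')")
headers_before, rows_before = dump_table(conn, "users")

# A later schema change is picked up without touching dump_table.
conn.execute("ALTER TABLE users ADD COLUMN email TEXT")
headers_after, rows_after = dump_table(conn, "users")

print(headers_before, headers_after)
```

Here select * is the right choice because the viewer makes no assumption about how many columns there are or what they are called.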

There are a few reasons:

- If the number of columns in a table changes and your application expects there to be a certain number...
- If the order of columns in a table changes and your application expects them to be in a certain order...
- Memory overhead. 8 unnecessary INTEGER columns would add 32 bytes of wasted memory. That doesn't sound like a lot, but this is for each row of each query, and INTEGER is one of the small column types. The extra columns are more likely to be VARCHAR or TEXT columns, which add up quicker.
- Network overhead. Related to memory overhead: if I issue 30,000 single-row queries and have 8 unnecessary INTEGER columns, I've wasted 960 kB of bandwidth. VARCHAR and TEXT columns are likely to be considerably larger.

Note: I chose INTEGER in the above example because it has a fixed size of 4 bytes.
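The arithmetic behind those figures, as a quick sketch. It assumes each query returns a single row and counts only the raw column payload, ignoring any protocol or driver overhead; the constant names are made up.

```python
INTEGER_BYTES = 4        # fixed size of an INTEGER column, as noted above
EXTRA_COLUMNS = 8        # unnecessary columns pulled in by select *
QUERIES = 30_000         # assumption: each query returns exactly one row

wasted_per_row = INTEGER_BYTES * EXTRA_COLUMNS   # bytes wasted per row
wasted_total = wasted_per_row * QUERIES          # bytes wasted overall
wasted_total_kb = wasted_total / 1000            # decimal kilobytes

print(wasted_per_row, wasted_total, wasted_total_kb)
```

Queries that return many rows multiply the waste by the row count.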
