
Is there a performance hit using decimal data types (MySQL / Postgres)

I understand how integer and floating point data types are stored, and I am guessing that the variable length of decimal data types means they are stored more like a string.

Does that imply a performance overhead when using decimal data types and searching against them?

asked Jan 02 '14 by wobbily_col

People also ask

Is decimal a data type in MySQL?

The DECIMAL and NUMERIC types store exact numeric data values. These types are used when it is important to preserve exact precision, for example with monetary data. In MySQL, NUMERIC is implemented as DECIMAL , so the following remarks about DECIMAL apply equally to NUMERIC .

What is the range of decimal in MySQL?

M, the precision (the total number of digits), has a range of 1 to 65. (Older versions of MySQL permitted a range of 1 to 254.) D is the number of digits to the right of the decimal point (the scale). It has a range of 0 to 30 and must be no larger than M.
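
For example, a minimal sketch of a DECIMAL(M, D) column declaration (the table and column names here are just illustrative):

    -- M = 10 total digits of precision, D = 2 digits after the decimal point
    CREATE TABLE prices (
        id     INT PRIMARY KEY,
        amount DECIMAL(10, 2)  -- stores values like 12345678.90 exactly
    );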


2 Answers

Pavel has it quite right; I'd just like to explain a little.

Presuming that you mean a performance impact as compared to floating point, or a fixed-point-offset integer (i.e. storing thousandths of a cent as an integer): yes, there is very much a performance impact. PostgreSQL, and by the sounds of things MySQL, store DECIMAL / NUMERIC in binary-coded decimal. This format is more compact than storing the digits as text, but it's still not very efficient to work with.
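
A minimal sketch of that fixed-point-offset alternative, assuming you track money in thousandths of a cent (the names are illustrative):

    -- fixed-point offset: thousandths of a cent stored as a plain integer
    CREATE TABLE ledger (
        id                BIGINT PRIMARY KEY,
        -- $12.34 = 1234 cents = 1234000 milli-cents
        amount_millicents BIGINT NOT NULL
    );
    -- convert back for display: amount_millicents / 100000.0 gives dollars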

If you're not doing many calculations in the database, the impact is limited to the greater storage space required for BCD as compared to integer or floating point, and thus the wider rows and slower scans, bigger indexes, etc. Comparison operations in b-tree index searches are also slower, but not enough to matter unless you're already CPU-bound for some other reason.
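
In PostgreSQL you can check the size difference directly (a sketch; the numeric size varies with the value's precision):

    -- compare the stored sizes of the same value in different types
    SELECT pg_column_size(12345.678::numeric) AS numeric_bytes,
           pg_column_size(12345.678::float8)  AS float8_bytes,
           pg_column_size(12345678::bigint)   AS bigint_bytes;
    -- float8 and bigint are a fixed 8 bytes; numeric grows with precision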

If you're doing lots of calculations with the DECIMAL / NUMERIC values in the database, then performance can really suffer. This is particularly noticeable, at least in PostgreSQL, because Pg can't use more than one CPU for any given query. If you're doing a huge bunch of division & multiplication, more complex maths, aggregation, etc on numerics you can start to find yourself CPU-bound in situations where you would never be when using a float or integer data type. This is particularly noticeable in OLAP-like (analytics) workloads, and in reporting or data transformation during loading or extraction (ETL).
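
A rough way to see this for yourself in PostgreSQL (a sketch using psql; absolute timings depend entirely on your hardware):

    -- generate 10 million rows and compare aggregation cost per type
    CREATE TEMP TABLE bench AS
    SELECT g::numeric          AS n_numeric,
           g::double precision AS n_double
    FROM generate_series(1, 10000000) AS g;

    \timing on                          -- psql meta-command
    SELECT sum(n_numeric) FROM bench;   -- typically several times slower...
    SELECT sum(n_double)  FROM bench;   -- ...than the double precision sum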

Despite the performance impact (which varies with workload from negligible to quite big), you should generally use numeric / decimal when it is the most appropriate type for your task - i.e. when very high-range values must be stored and/or rounding error isn't acceptable.

Occasionally it's worth the hassle of using a bigint and fixed-point offset, but that is clumsy and inflexible. Using floating point instead is very rarely the right answer due to all the challenges of working reliably with floating point values for things like currency.
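
The classic illustration of why floating point is risky for currency (PostgreSQL syntax):

    -- binary floating point cannot represent 0.1 or 0.2 exactly
    SELECT 0.1::float8  + 0.2::float8  AS float_sum,  -- 0.30000000000000004
           0.1::numeric + 0.2::numeric AS exact_sum;  -- 0.3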

(BTW, I'm quite excited that some new Intel CPUs, and IBM's Power 7 range of CPUs, include hardware support for IEEE 754 decimal floating point. If this ever becomes available in lower end CPUs it'll be a huge win for databases.)

answered Oct 05 '22 by Craig Ringer


The impact of the decimal type (numeric type in Postgres) depends on usage. For typical OLTP workloads the impact may not be significant; for OLAP it can be relatively high. In our application, aggregation over large columns of numeric values is several times slower than over double precision.

Although current CPUs are fast, the rule still holds: you should use numeric only when you need exact values or very large numbers. Otherwise, use the float or double precision types.

answered Oct 05 '22 by Pavel Stehule