Can Multiple Indexes Work Together?

Tags:

Suppose I have a database table with two fields, "foo" and "bar". Neither of them are unique, but each of them are indexed. However, rather than being indexed together, they each have a separate index.

Now suppose I perform a query such as SELECT * FROM sometable WHERE foo='hello' AND bar='world'; My table a huge number of rows for which foo is 'hello' and a small number of rows for which bar is 'world'.

So the most efficient thing for the database server to do under the hood is use the bar index to find all fields where bar is 'world', then return only those rows for which foo is 'hello'. This is O(n) where n is the number of rows where bar is 'world'.

However, I imagine it's possible that the process would happen in reverse, where the fo index was used and the results searched. This would be O(m) where m is the number of rows where foo is 'hello'.

So is Oracle smart enough to search efficiently here? What about other databases? Or is there some way I can tell it in my query to search in the proper order? Perhaps by putting bar='world' first in the WHERE clause?

535

asked Sep 29 '08 15:09

Eli Courtwright

1 Answers

Oracle will almost certainly use the most selective index to drive the query, and you can check that with the explain plan.

Furthermore, Oracle can combine the use of both indexes in a couple of ways -- it can convert btree indexes to bitmaps and perform a bitmap ANd operation on them, or it can perform a hash join on the rowid's returned by the two indexes.

One important consideration here might be any correlation between the values being queried. If foo='hello' accounts for 80% of values in the table and bar='world' accounts for 10%, then Oracle is going to estimate that the query will return 0.8*0.1= 8% of the table rows. However this may not be correct - the query may actually return 10% of the rwos or even 0% of the rows depending on how correlated the values are. Now, depending on the distribution of those rows throughout the table it may not be efficient to use an index to find them. You may still need to access (say) 70% or the table blocks to retrieve the required rows (google for "clustering factor"), in which case Oracle is going to perform a ful table scan if it gets the estimation correct.

In 11g you can collect multicolumn statistics to help with this situation I believe. In 9i and 10g you can use dynamic sampling to get a very good estimation of the number of rows to be retrieved.

To get the execution plan do this:

explain plan for
SELECT *
FROM   sometable
WHERE  foo='hello' AND bar='world'
/
select * from table(dbms_xplan.display)
/

Contrast that with:

explain plan for
SELECT /*+ dynamic_sampling(4) */
       *
FROM   sometable
WHERE  foo='hello' AND bar='world'
/
select * from table(dbms_xplan.display)
/

137

answered Oct 12 '22 19:10

David Aldridge

Related questions
                            
                                Cannot persist computed column - not deterministic
                            
                                Are PostgreSQL temporary tables already unlogged?
                            
                                SQL Select * from multiple tables
                            
                                Is it possible to perform joins across different databases using LINQ?
                            
                                How get the sum for every distinct value in another column?
                            
                                Optimal RAID setup for SQL server
                            
                                How do I update the members of a MySQL SET Type?
                            
                                Generic extension method : Type argument cannot be inferred from the usage
                            
                                Grouping tables within a MySQL database
                            
                                SQLite - Is it possible to insert a BLOB via insert statement?
                            
                                Lua script for Redis which sums the values of keys
                            
                                Pandas HDF5 as a Database
                            
                                Two nodes MongoDB replica set without arbiter
                            
                                Warning in ./libraries/plugin_interface.lib.php#551 count(): Parameter must be an array or an object that implements Countable
                            
                                How do I find out if an oracle database is set to autocommit?
                            
                                How to store TimeZoneInfo objects in a database?
                            
                                Python Redis connection should be closed on every request? (flask)
                            
                                Pros and cons of using MD5 Hash as the primary key vs. use a int identity as the primary key in SQL Server
                            
                                Database integration tests in Visual Studio Online
                            
                                Cannot initialize flask initdb (Flask Tutorial Step4)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Can Multiple Indexes Work Together?

Tags:

database

optimization

indexing

oracle

Eli Courtwright

People also ask

1 Answers

David Aldridge

Recent Activity

Donate For Us