I have a table structure similar the following: <pre class="prettyprint"><code>create table MAIL ( ID int, FROM varchar, SENT_DATE date ); create table MAIL_TO ( ID int, MAIL_ID int, NAME varchar ); </code></pre> and I need to run the following query: <pre class="prettyprint"><code>select m.ID from MAIL m inner join MAIL_TO t on t.MAIL_ID = m.ID where m.SENT_DATE between '07/01/2010' and '07/30/2010' and t.NAME = 'someone@example.com' </code></pre> Is there any way to design indexes such that both of the conditions can use an index? If I put an index on MAIL.SENT_DATE and an index on MAIL_TO.NAME, the database will choose to use either one of the indexes or the other, not both. After filtering by the first condition the database always has to do a full scan of the results for the second condition.

Oracle can use both indices. You just don't have the right two indices. Consider: if the query plan uses your index on <code>mail.sent_date</code> first, what does it get from <code>mail</code>? It gets all the <code>mail.id</code>s where <code>mail.sent_date</code> is within the range you gave in your <code>where</code> clause, yes? So it goes to <code>mail_to</code> with a list of <code>mail.id</code>s and the <code>mail.name</code> you gave in your <code>where</code> clause. At this point, Oracle decides that it's better to scan the table for matching <code>mail_to.mail_id</code>s rather than use the index on <code>mail_to.name</code>. Indices on varchars are always problematic, and Oracle really prefers full table scans. But if we give Oracle an index containing the columns it really wants to use, and depending on total table rows and statistics, we can get it to use it. This is the index: <pre class="prettyprint"><code> create index mail_to_pid_name on mail_to( mail_id, name ) ; </code></pre> This works where an index just on <code>name</code> doesn't, because Oracle's not looking just for a name, but for a <code>mail_id</code> and a <code>name</code>. Conversely, if the cost-based analyzer determines it's cheaper to go to table <code>mail_to</code> first, and uses your index on <code>mail_to.name</code>, what doe sit get? A bunch of <code>mail_to_.mail_id</code>s to look up in <code>mail</code>. It needs to find rows with those ids and certain sent_dates, so: <pre class="prettyprint"><code> create index mail_id_sentdate on mail( sent_date, id ) ; </code></pre> Note that in this case I've put <code>sent_date</code> first in the index, and <code>id</code> second. (This is more an intuitive thing.) Again, the take home point is this: in creating indices, you have to consider not just the columns in your <code>where</code> clause, but also the columns in your join conditions. <hr> Update jthg: yes, it always depends on how the data is distributed. And on how many rows are in the table: if very many, Oracle will do a table scan and hash join, if very few it will do a table scan. You might reverse the order of either of the two indices. By putting sent_date first in the second index, we eliminate most needs for an index solely on <code>sent_date</code>.

Equivalent of a composite index across multiple tables?

Tags:

sql

oracle

I have a table structure similar the following:

create table MAIL (
  ID        int,
  FROM      varchar,
  SENT_DATE date
);

create table MAIL_TO (
  ID      int,
  MAIL_ID int,
  NAME      varchar
);

and I need to run the following query:

select m.ID 
from MAIL m 
  inner join MAIL_TO t on t.MAIL_ID = m.ID
where m.SENT_DATE between '07/01/2010' and '07/30/2010'
  and t.NAME = '[email protected]'

Is there any way to design indexes such that both of the conditions can use an index? If I put an index on MAIL.SENT_DATE and an index on MAIL_TO.NAME, the database will choose to use either one of the indexes or the other, not both. After filtering by the first condition the database always has to do a full scan of the results for the second condition.

724

asked Jul 30 '10 17:07

jthg

2 Answers

Oracle can use both indices. You just don't have the right two indices.

Consider: if the query plan uses your index on mail.sent_date first, what does it get from mail? It gets all the mail.ids where mail.sent_date is within the range you gave in your where clause, yes?

So it goes to mail_to with a list of mail.ids and the mail.name you gave in your where clause. At this point, Oracle decides that it's better to scan the table for matching mail_to.mail_ids rather than use the index on mail_to.name.

Indices on varchars are always problematic, and Oracle really prefers full table scans. But if we give Oracle an index containing the columns it really wants to use, and depending on total table rows and statistics, we can get it to use it. This is the index:

 create index mail_to_pid_name on mail_to( mail_id, name ) ;

This works where an index just on name doesn't, because Oracle's not looking just for a name, but for a mail_id and a name.

Conversely, if the cost-based analyzer determines it's cheaper to go to table mail_to first, and uses your index on mail_to.name, what doe sit get? A bunch of mail_to_.mail_ids to look up in mail. It needs to find rows with those ids and certain sent_dates, so:

 create index mail_id_sentdate on mail( sent_date, id ) ;

Note that in this case I've put sent_date first in the index, and id second. (This is more an intuitive thing.)

Again, the take home point is this: in creating indices, you have to consider not just the columns in your where clause, but also the columns in your join conditions.

Update

jthg: yes, it always depends on how the data is distributed. And on how many rows are in the table: if very many, Oracle will do a table scan and hash join, if very few it will do a table scan. You might reverse the order of either of the two indices. By putting sent_date first in the second index, we eliminate most needs for an index solely on sent_date.

answered Sep 19 '22 18:09

tpdi

A materialized view would allow you to index the values, assuming the stringent materialized view criteria is met.

answered Sep 22 '22 18:09

OMG Ponies

Related questions
                            
                                SQLAlchemy: Inserting the results of a query into another table
                            
                                Best practice for a "comment" table in a relational database [closed]
                            
                                Simulating CTE recursion in C#
                            
                                Data normalization and writing queries
                            
                                Simple "SELECT" with variable but without "INTO"
                            
                                Slow query ordering by a column in a joined table
                            
                                Rails: SQLite3::CantOpenException: unable to open database file
                            
                                Why does SELECT 'a'='b'='c' return 1 in MYSQL?
                            
                                DB agnostic SQL for CURRENT_TIMESTAMP
                            
                                SQLAlchemy: update from_select
                            
                                The Network Adapter could not establish the connection in SQL developer
                            
                                How to join three tables in Rails?
                            
                                Derived type in PostgreSQL
                            
                                How to implement Like-condition in SparkSQL?
                            
                                Use Option (Recompile) in an Inline Table Valued Function
                            
                                syntax error when using row_number in sqlite3
                            
                                Using Columns in a RegExp in MySQL
                            
                                Suspended status in SQL Activity Monitor
                            
                                Can someone give me some basic XSS and sql injection scripts? (not what it seems)
                            
                                How can I fix this "SQL Statement ignored" error?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With