Is an index not similar to a dictionary? If you have the key, you can immediately access it? Apparently indexes are sometimes stored as B-Trees... why is that?

Dictionaries are not implicitly sorted, <code>B-Tree</code>s are. A <code>B-Tree</code> index can be used for ranged access, like this: <pre class="prettyprint"><code>WHERE col1 BETWEEN value1 AND value2 </code></pre> or ordering, like this: <pre class="prettyprint"><code>ORDER BY col1 </code></pre> You cannot immediately access a page in a <code>B-Tree</code> index: you need to traverse the child pages whose number increases logarithmically. Some databases support dictionary-type indexes as well, namely, <code>HASH</code> indexes, in which case the search time is constant. But such indexes cannot be used for ranged access or ordering.

Why does searching an index have logarithmic complexity?

3 Answers

Dictionaries are not implicitly sorted, B-Trees are.

A B-Tree index can be used for ranged access, like this:

WHERE col1 BETWEEN value1 AND value2

or ordering, like this:

ORDER BY col1

You cannot immediately access a page in a B-Tree index: you need to traverse the child pages whose number increases logarithmically.

Some databases support dictionary-type indexes as well, namely, HASH indexes, in which case the search time is constant. But such indexes cannot be used for ranged access or ordering.

174

answered Sep 28 '22 09:09

Quassnoi

Database Indices are usually (almost always) stored as B-Trees. And all balanced tree structures have O(log n) complexity for searching.

'Dictionary' is an 'Abstract Data Type' (ADT), ie it is a functional description that does not designate an implementation. Some dictionaries could use a Hashtable for O(1) lookup, others could use a tree and achieve O(log n).

The main reason a DB uses B-trees (over any other kind of tree) is that B-trees are self-balancing and are very 'shallow' (requiring little disk I/O)

answered Sep 28 '22 08:09

Henk Holterman

One of the only data structures you can access immediately with a key is a vector, which for a massive amount of data, becomes inefficient when you start inserting and removing elements. It also needs contiguous memory allocation.

A hash can be efficient but needs more space and will end up with collisions.

A B tree has a good balance between search performance and space.

answered Sep 28 '22 07:09

Andres

Related questions
                            
                                ROW_NUMBER() OVER () with order by in H2
                            
                                Python: Real-Time Streaming Data [closed]
                            
                                How does SQL join work?
                            
                                The difference between a 'view' and 'base' relation
                            
                                How do you read floating numbers (REAL) from a sqlite database on the iphone?
                            
                                Zend Framework: Populating DB data to a Zend Form dropdown element
                            
                                When/why should I start using a database?
                            
                                Saving Password with Md5
                            
                                core data database is empty test
                            
                                MySQL select records 1 hour ago or fresher on datetime column
                            
                                PHP Should I store image paths in a database?
                            
                                SQL Query to Select the 'Next' record (similar to First or Top N)
                            
                                Select subset of rows using Row_Number()
                            
                                How to best store a timestamp or date in a database?
                            
                                Good practice to open/close connections in an asp.net application?
                            
                                Is it okay to use NSUserDefaults instead of Database?
                            
                                Determine which user deleted a SQL Server database?
                            
                                To INSERT INTO a database uniquely by PostgreSQL
                            
                                Exclude Statement in SQL
                            
                                Is there a better place to store large amounts of unused data than a the database?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does searching an index have logarithmic complexity?

Tags:

database

Lieven Cardoen

People also ask

3 Answers

Quassnoi

Henk Holterman

Andres

Recent Activity

Donate For Us