search for before and after values in a long sorted list

Tags:

python

What would be the fastest way to search for a number (e.g. 12.31) in a long sorted list and get the values just before and after my "search" value when the exact value isn't found (e.g. 11.12 and 13.03 in the list below)?
Many thanks in advance.

long_list = [10.11, 11.12, 13.03, 14.2 .. 12345.67]

asked Jul 08 '11 18:07

DGT

2 Answers

The fastest approach is probably to use Python's built-in support, namely the bisect module. Below I use a dictionary to check in O(1) whether a value is in the list; if it is not, bisect is used to find the nearest values smaller and larger than the sought value.

#!/usr/bin/env python

import bisect

def find_lt(a, x):
    'Find rightmost value less than x'
    i = bisect.bisect_left(a, x)
    if i:
        return a[i-1]
    raise ValueError

def find_gt(a, x):
    'Find leftmost value greater than x'
    i = bisect.bisect_right(a, x)
    if i != len(a):
        return a[i]
    raise ValueError

# First create a test list (49996 items); D mirrors R for O(1) membership tests
i = 1.0
R = [1.0]
D = {1.0: True}
while i < 10000:
    i += 0.2
    i = round(i, 2)
    D[i] = True
    R.append(i)

# Locate a value, in this case 100.3, which is not in the list
x = 100.3
if x in D:
    print("found", x)
else:
    print(find_lt(R, x))
    print(find_gt(R, x))

Output for x=100.3:

100.2
100.4
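
The dictionary is not strictly required, since bisect can supply both neighbours on its own. Here is a minimal sketch of that variant, run on the short list from the question; the function name neighbours and the choice to return None at the ends of the list are assumptions made for illustration.

```python
import bisect

def neighbours(a, x):
    """Return (before, after) for x in the sorted list a.

    before is the largest value < x and after is the smallest value > x;
    either is None when no such value exists.
    """
    lo = bisect.bisect_left(a, x)    # first index with a[lo] >= x
    hi = bisect.bisect_right(a, x)   # first index with a[hi] > x
    before = a[lo - 1] if lo > 0 else None
    after = a[hi] if hi < len(a) else None
    return before, after

long_list = [10.11, 11.12, 13.03, 14.2]
print(neighbours(long_list, 12.31))  # (11.12, 13.03)
print(neighbours(long_list, 11.12))  # exact hit: (10.11, 13.03)
print(neighbours(long_list, 5.0))    # (None, 10.11)
```

When lo and hi differ, x itself is present in the list, so the same two calls also answer the membership question in O(log n).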

answered Nov 25 '22 09:11

Fredrik Pihl

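Exponential search (also known as galloping search) can outperform plain binary search on a very long list, particularly when the target lies near the start: it takes O(log i) comparisons, where i is the target's position, rather than O(log n). The idea is to probe forward from position 0 in steps of increasing (typically doubling) size until the sought value is passed; at that point a binary search is performed on the range between the last two probes. If the element is not found, the search ends at the position where it would be inserted, so the elements on either side are its closest neighbours.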

Have a look at Basic Techniques for information retrieval. It provides pseudo-code for the algorithm and compares its complexity with that of binary search.
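
As a rough illustration of the galloping idea described in this answer (my own sketch, not the pseudo-code from the reference), the probe index can be doubled until the sought value is passed, after which bisect finishes the job inside the last bracket. The names gallop_left and closest_pair are hypothetical, and the list is assumed to be sorted in ascending order.

```python
import bisect

def gallop_left(a, x):
    """Return the same index as bisect.bisect_left(a, x), found by galloping."""
    n = len(a)
    bound = 1
    # Double the probe index until a[bound] >= x or we run off the end.
    while bound < n and a[bound] < x:
        bound *= 2
    # The insertion point now lies between the last two probes.
    return bisect.bisect_left(a, x, bound // 2, min(bound, n))

def closest_pair(a, x):
    """Return (largest value < x, smallest value >= x); None where missing."""
    i = gallop_left(a, x)
    before = a[i - 1] if i > 0 else None
    at_or_after = a[i] if i < len(a) else None
    return before, at_or_after

long_list = [10.11, 11.12, 13.03, 14.2]
print(closest_pair(long_list, 12.31))  # (11.12, 13.03)
```

If the second element of the result equals x, the value was found exactly; otherwise the pair brackets the missing value.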

answered Nov 25 '22 10:11

Manuel Salvadores
