How to search a list of tuples in Python

Tags:

So I have a list of tuples such as this:

[(1,"juca"),(22,"james"),(53,"xuxa"),(44,"delicia")]

I want this list for a tuple whose number value is equal to something.

So that if I do search(53) it will return the index value of 2

Is there an easy way to do this?

322

asked May 26 '10 22:05

2 Answers

[i for i, v in enumerate(L) if v[0] == 53]

answered Oct 25 '22 15:10

tl;dr

A generator expression is probably the most performant and simple solution to your problem:

l = [(1,"juca"),(22,"james"),(53,"xuxa"),(44,"delicia")]  result = next((i for i, v in enumerate(l) if v[0] == 53), None) # 2

Explanation

There are several answers that provide a simple solution to this question with list comprehensions. While these answers are perfectly correct, they are not optimal. Depending on your use case, there may be significant benefits to making a few simple modifications.

The main problem I see with using a list comprehension for this use case is that the entire list will be processed, although you only want to find 1 element.

Python provides a simple construct which is ideal here. It is called the generator expression. Here is an example:

# Our input list, same as before l = [(1,"juca"),(22,"james"),(53,"xuxa"),(44,"delicia")]  # Call next on our generator expression. next((i for i, v in enumerate(l) if v[0] == 53), None)

We can expect this method to perform basically the same as list comprehensions in our trivial example, but what if we're working with a larger data set? That's where the advantage of using the generator method comes into play. Rather than constructing a new list, we'll use your existing list as our iterable, and use next() to get the first item from our generator.

Lets look at how these methods perform differently on some larger data sets. These are large lists, made of 10000000 + 1 elements, with our target at the beginning (best) or end (worst). We can verify that both of these lists will perform equally using the following list comprehension:

List comprehensions

"Worst case"

worst_case = ([(False, 'F')] * 10000000) + [(True, 'T')] print [i for i, v in enumerate(worst_case) if v[0] is True]  # [10000000] #          2 function calls in 3.885 seconds # #    Ordered by: standard name # #    ncalls  tottime  percall  cumtime  percall filename:lineno(function) #         1    3.885    3.885    3.885    3.885 so_lc.py:1(<module>) #         1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects}

"Best case"

best_case = [(True, 'T')] + ([(False, 'F')] * 10000000) print [i for i, v in enumerate(best_case) if v[0] is True]  # [0] #          2 function calls in 3.864 seconds # #    Ordered by: standard name # #    ncalls  tottime  percall  cumtime  percall filename:lineno(function) #         1    3.864    3.864    3.864    3.864 so_lc.py:1(<module>) #         1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects}

Generator expressions

Here's my hypothesis for generators: we'll see that generators will significantly perform better in the best case, but similarly in the worst case. This performance gain is mostly due to the fact that the generator is evaluated lazily, meaning it will only compute what is required to yield a value.

Worst case

# 10000000 #          5 function calls in 1.733 seconds # #    Ordered by: standard name # #    ncalls  tottime  percall  cumtime  percall filename:lineno(function) #         2    1.455    0.727    1.455    0.727 so_lc.py:10(<genexpr>) #         1    0.278    0.278    1.733    1.733 so_lc.py:9(<module>) #         1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects} #         1    0.000    0.000    1.455    1.455 {next}

Best case

best_case  = [(True, 'T')] + ([(False, 'F')] * 10000000) print next((i for i, v in enumerate(best_case) if v[0] == True), None)  # 0 #          5 function calls in 0.316 seconds # #    Ordered by: standard name # #    ncalls  tottime  percall  cumtime  percall filename:lineno(function) #         1    0.316    0.316    0.316    0.316 so_lc.py:6(<module>) #         2    0.000    0.000    0.000    0.000 so_lc.py:7(<genexpr>) #         1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects} #         1    0.000    0.000    0.000    0.000 {next}

WHAT?! The best case blows away the list comprehensions, but I wasn't expecting the our worst case to outperform the list comprehensions to such an extent. How is that? Frankly, I could only speculate without further research.

Take all of this with a grain of salt, I have not run any robust profiling here, just some very basic testing. This should be sufficient to appreciate that a generator expression is more performant for this type of list searching.

Note that this is all basic, built-in python. We don't need to import anything or use any libraries.

I first saw this technique for searching in the Udacity cs212 course with Peter Norvig.

answered Oct 25 '22 15:10

Jon Surrell

Related questions
                            
                                H14 error in heroku - "no web processes running"
                            
                                Handle JSON Decode Error when nothing returned
                            
                                How to check type of files without extensions? [duplicate]
                            
                                Test if all elements of a python list are False
                            
                                `ipython` tab autocomplete does not work on imported module
                            
                                save a pandas.Series histogram plot to file
                            
                                Shift elements in a numpy array
                            
                                Make requests using Python over Tor
                            
                                Double precision floating values in Python?
                            
                                Best way to do enum in Sqlalchemy?
                            
                                How to print out a dictionary nicely in Python?
                            
                                python-pandas and databases like mysql
                            
                                UserWarning: Could not import the lzma module. Your installed Python is incomplete
                            
                                Setting Different Bar color in matplotlib Python [duplicate]
                            
                                pip throws TypeError: parse() got an unexpected keyword argument 'transport_encoding' when trying to install new packages
                            
                                Removing python module installed in develop mode
                            
                                Execute Python script via crontab
                            
                                Is it possible to run python SimpleHTTPServer on localhost only?
                            
                                Python in-memory zip library
                            
                                Interprocess communication in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to search a list of tuples in Python

Tags:

python

list

search

tuples

hdx

People also ask