Intermediate variable in a list comprehension for simultaneous filtering and transformation

Tags: python, list, list-comprehension, mapping, filtering

I have a list of vectors (in Python) that I want to normalize, while at the same time removing the vectors that originally had small norms.

The input list is, e.g.

a = [(1,1),(1,2),(2,2),(3,4)]

And I need the output to be (x*n, y*n) with n = (x**2+y**2)**-0.5

If I just needed the norms, for example, that would be easy with a list comprehension:

an = [ (x**2+y**2)**0.5 for x,y in a ]

It would also be easy to store just a normalized x, for example, but what I want is a temporary variable "n" that I can use in two calculations and then throw away.

I can't just use a lambda function either, because I also need n to filter the list. So what is the best way?

Right now I am using this nested comprehension (a generator expression inside a list comprehension):

a = [(1,1),(1,2),(2,2),(3,4)]

[(x*n,y*n) for (n,x,y) in (( (x**2.+y**2.)**-0.5 ,x,y) for x,y in a) if n < 0.4]

# Out[14]: 
# [(0.70710678118654757, 0.70710678118654757),
#  (0.60000000000000009, 0.80000000000000004)]

The inner generator expression produces tuples with an extra value (n), and the outer comprehension then uses that value for both the calculation and the filtering. Is this really the best way? Are there any terrible inefficiencies I should be aware of?
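To make the two stages easier to see, the inner expression can be materialized on its own. This is only an illustrative sketch: the name with_inv_norm is made up here, and building the intermediate list is done purely for inspection (the original one-liner never stores it).

```python
a = [(1, 1), (1, 2), (2, 2), (3, 4)]

# Inner stage: attach the reciprocal norm n to every vector.
# (with_inv_norm is a hypothetical name used only for this illustration.)
with_inv_norm = [((x**2. + y**2.)**-0.5, x, y) for x, y in a]

# Outer stage: filter on n, then scale each component by n.
result = [(x*n, y*n) for n, x, y in with_inv_norm if n < 0.4]

# Show which vectors pass the test n < 0.4, i.e. norm > 2.5.
for n, x, y in with_inv_norm:
    print((x, y), "n = %.3f" % n, "kept" if n < 0.4 else "dropped")
```

Because n is the reciprocal of the norm, the two short vectors are the ones dropped.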

408

asked Nov 04 '10 14:11

dividebyzero

2 Answers

"Is this really the best way?"

Well, it works efficiently, and if you really, really want to write one-liners then it's the best you can do.

On the other hand, a short generator function does the same thing much more clearly:

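def normfilter(vecs, min_norm):
    for x,y in vecs:
        n = (x**2.+y**2.)**-0.5   # n is the reciprocal of the norm
        if n*min_norm < 1:        # same as norm > min_norm
            yield (x*n,y*n)

vectors = [(1,1),(1,2),(2,2),(3,4)]
normalized = list(normfilter(vectors, 2.5))

Btw, watch the direction of the test: n is the reciprocal of the norm, so your n < 0.4 keeps the vectors whose norm is greater than 2.5 and drops the short ones, as you describe. Passing the threshold as a norm, as above, makes that harder to get backwards.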
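One case the function does not cover is the zero vector, where 0.0**-0.5 raises ZeroDivisionError. A possible variant, sketched under the assumption that the threshold is a minimum norm and is non-negative, tests the norm before dividing. The name normalize_long is hypothetical, and math.hypot is swapped in for the explicit power expression.

```python
import math

def normalize_long(vecs, min_norm):
    """Yield unit vectors for the inputs whose norm exceeds min_norm.

    Assumes min_norm >= 0, so a zero vector never reaches the division.
    """
    for x, y in vecs:
        norm = math.hypot(x, y)      # sqrt(x*x + y*y)
        if norm > min_norm:
            yield (x / norm, y / norm)

vectors = [(0, 0), (1, 1), (1, 2), (2, 2), (3, 4)]
normalized = list(normalize_long(vectors, 2.5))
print(normalized)
```

The trade-off is a division per component instead of a multiplication, which is unlikely to matter for lists of this size.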

198

answered Sep 29 '22 11:09

Jochen Ritzel

Starting with Python 3.8 and the introduction of assignment expressions (PEP 572, the := operator), it's possible to bind a local variable inside a list comprehension and so avoid evaluating the same expression several times.

In our case, we can name the result of (x**2.+y**2.)**-.5 as n while using it to filter the list (keeping the items where n is less than 0.4), and then reuse n to produce the mapped value:

# vectors = [(1, 1), (1, 2), (2, 2), (3, 4)]
[(x*n, y*n) for x, y in vectors if (n := (x**2.+y**2.)**-.5) < .4]
# [(0.7071067811865476, 0.7071067811865476), (0.6000000000000001, 0.8)]
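Two details may be worth checking before relying on this. Per PEP 572, the name bound with := lives in the scope that contains the comprehension, so n stays defined afterwards. On interpreters older than 3.8, a similar effect can be had by iterating over a one-element list. The sketch below compares the two; the variable names walrus, single and m are made up for the illustration.

```python
vectors = [(1, 1), (1, 2), (2, 2), (3, 4)]

# Python 3.8+: bind n inside the filter, reuse it in the output expression.
walrus = [(x*n, y*n) for x, y in vectors if (n := (x**2. + y**2.)**-.5) < .4]

# Unlike x and y, n is bound in the enclosing scope and keeps the value
# computed for the last vector, (3, 4).
print(n)

# Pre-3.8 alternative: "for m in [expr]" binds m once per (x, y) pair.
single = [(x*m, y*m) for x, y in vectors for m in [(x**2. + y**2.)**-.5] if m < .4]

print(walrus == single)
```

CPython 3.9 and later optimize the one-element-list idiom, so on those versions the choice between the two forms is mostly a matter of readability.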

answered Sep 29 '22 09:09

Xavier Guihot
