What does this: s[s[1:] == s[:-1]] do in numpy?

Tags:

python

numpy

I've been looking for a way to efficiently check for duplicates in a numpy array and stumbled upon a question that contained an answer using this code.

What does this line mean in numpy?

s[s[1:] == s[:-1]]

I would like to understand the code before applying it. I looked in the NumPy docs but had trouble finding this information.

asked Jun 14 '15 15:06

AturSams

1 Answer

The slices [1:] and [:-1] mean all but the first and all but the last elements of the array:

>>> import numpy as np
>>> s = np.array((1, 2, 2, 3))  # four element array
>>> s[1:]
array([2, 2, 3])  # last three elements
>>> s[:-1]
array([1, 2, 2])  # first three elements

Therefore the comparison produces a boolean array comparing each element s[x] with its "neighbour" s[x+1]; the result is one element shorter than the original array (as the last element has no neighbour):

>>> s[1:] == s[:-1]
array([False,  True, False], dtype=bool)

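and using that array to index the original array gets you the elements where the comparison is True, i.e. the elements that are the same as their neighbour:

>>> s[s[1:] == s[:-1]]
array([2])

The output above is from an older NumPy release that accepted a mask one element shorter than the array. Recent releases raise an IndexError because the boolean index does not match the array's length; there, apply the mask to a slice of the same length, e.g. s[:-1][s[1:] == s[:-1]].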

Note that this only identifies adjacent duplicate values, so the array must be sorted first (e.g. with np.sort) to find duplicates anywhere in it.
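As a rough sketch of how the neighbour comparison can be turned into a general duplicate check, the snippet below sorts the array first and applies the mask to s[:-1], so that the mask and the indexed array have the same length. The helper names adjacent_duplicates and duplicates are hypothetical, chosen only for this illustration; they are not part of NumPy.

```python
import numpy as np

def adjacent_duplicates(s):
    """Return the elements of s that equal their right-hand neighbour.

    Hypothetical helper, not a NumPy function.
    """
    s = np.asarray(s)
    mask = s[1:] == s[:-1]   # boolean array of length len(s) - 1
    return s[:-1][mask]      # index a slice whose length matches the mask

def duplicates(a):
    """Return each value that occurs more than once in a.

    Hypothetical helper, not a NumPy function.
    """
    s = np.sort(a, axis=None)                 # sorting makes equal values adjacent
    return np.unique(adjacent_duplicates(s))  # collapse runs such as 3, 3, 3

s = np.array((1, 2, 2, 3))
print(adjacent_duplicates(s))                     # [2]
print(duplicates(np.array((3, 1, 2, 3, 1, 3))))   # [1 3]
```

Sorting costs O(n log n), but it lets the comparison find equal values that were not next to each other in the original array.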

answered Oct 02 '22 00:10

jonrsharpe
