I'm looking for a way to reliably identify whether a numpy object is a view. Related questions have come up many times before (here, here, here), and people have offered some solutions, but all of them seem to have problems: <ul> <li>The test currently used in <code>pandas</code> is to call something a view if <code>my_array.base is not None</code>. This seems to always catch views, but it also produces many false positives (cases where it reports a view even though the object isn't one). </li> <li> <code>numpy.may_share_memory()</code> checks a specific pair of arrays, but can't answer the question for a single array in general <ul> <li>(@RobertKurn says this was the best tool as of 2012; has anything changed?)</li> </ul> </li> <li> <code>flags['OWNDATA']</code> is reported (third comment on the first answer) to fail in some cases. </li> </ul> (The reason for my interest is that I'm working on implementing copy-on-write for pandas, and a conservative indicator leads to over-copying.)
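The three indicators listed in the question can be compared side by side. This is a minimal sketch, not taken from the original post: `describe` is a hypothetical helper, and the arrays are made up for illustration. It shows one way `base is not None` can look conservative, namely when the base is a temporary that nothing else refers to.

```python
import numpy as np


def describe(arr):
    """Hypothetical helper: collect the two per-array indicators."""
    return {
        "has_base": arr.base is not None,
        "owns_data": bool(arr.flags["OWNDATA"]),
    }


original = np.arange(12)               # freshly allocated array
sliced = original[2:8]                 # a view onto `original`
reshaped = np.arange(6).reshape(2, 3)  # view of a temporary nobody else holds

info_original = describe(original)  # no base, owns its data
info_sliced = describe(sliced)      # has a base, does not own its data
info_reshaped = describe(reshaped)  # also has a base, although no other
                                    # name refers to the underlying buffer

# may_share_memory only answers the question for one specific pair of arrays.
shares = np.may_share_memory(original, sliced)       # True
no_share = np.may_share_memory(original, reshaped)   # False
```

Note that `reshaped` is indistinguishable from `sliced` by either per-array indicator, which is the kind of case where a copy-on-write scheme would copy more often than strictly necessary.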

Depending on your use case, <code>flags['OWNDATA']</code> will do the job. There is in fact no problem in the example you linked: the flag does not fail there, it does exactly what it is supposed to do. According to http://docs.scipy.org/doc/numpy-1.10.0/reference/generated/numpy.require.html, the flag is meant to "ensure an array that owns its own data". The "counterexample" uses this code: <pre class="prettyprint"><code>print(b.flags['OWNDATA'])  # False -- apparently this is a view
e = np.ravel(b[:, 2])
print(e.flags['OWNDATA'])  # True -- apparently this is a new numpy object
</code></pre> But <code>True</code> is the expected result in the second case. It follows from the definition of <code>ravel</code> (from http://docs.scipy.org/doc/numpy-1.10.1/reference/generated/numpy.ravel.html): <blockquote> Return a contiguous flattened array. A 1-D array, containing the elements of the input, is returned. A copy is made only if needed. </blockquote> Here a copy is needed (the column <code>b[:, 2]</code> is presumably not contiguous in memory), so a copy is made. The variable <code>e</code> therefore really does own its data. It is not a "view of b", "a reference to b", or "an alias to a part of b". It is a genuinely new array that contains a copy of some elements of <code>b</code>. So I think it is impossible to detect that kind of relationship without tracking the entire origin of the data. I believe you should be able to build your program with that flag.
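The behaviour of <code>ravel</code> described above can be checked directly. This is a minimal sketch under an assumption: the array `b` here is a made-up stand-in (a C-ordered 2-D view), since the original `b` from the linked post is not shown.

```python
import numpy as np

# Assumed stand-in for `b`: reshape returns a view, so b does not own its data.
b = np.arange(12).reshape(3, 4)
b_owns = bool(b.flags["OWNDATA"])            # False

# A column of a C-ordered array is strided, so ravel must copy it
# to return something contiguous.
e = np.ravel(b[:, 2])
e_owns = bool(e.flags["OWNDATA"])            # True
e_shares = np.shares_memory(e, b)            # False

# Writing to the copy leaves b untouched.
e[0] = -1
b_unchanged = b[0, 2] == 2                   # True

# When the input is already contiguous, no copy is needed and ravel
# returns a view instead.
flat = np.ravel(b)
flat_owns = bool(flat.flags["OWNDATA"])      # False
flat_shares = np.shares_memory(flat, b)      # True
```

In both cases the flag reports whether the array holds its own buffer, which matches the reading of `OWNDATA` given in the answer.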

numpy: Reliable (non-conservative) indicator if numpy array is view

1 Answer


171

answered Oct 03 '22 07:10

Alexis Clarembeau


Tags:

python

arrays

pandas

numpy

nick_eu
