Numpy empty list type inference

Tags:

numpy

Why is the empty list [] being inferred as float type when using np.append?

import numpy as np

np.append([1,2,3], [0])
# output: array([1, 2, 3, 0]), dtype = np.int64

np.append([1,2,3], [])
# output: array([1., 2., 3.]), dtype = np.float64

This persists even when arr is np.array([1,2,3], dtype=np.int32).

It's not possible to specify a dtype for append, so I am just curious why this happens. NumPy's concatenate does the same thing, but when I try to specify the dtype I get an error:

np.concatenate([[1,2,3], []], dtype=np.int64)

Error:

TypeError: Cannot cast array data from dtype('float64') to dtype('int64') according to the rule 'same_kind'

But finally if I set the unsafe casting rule it works:

np.concatenate([[1,2,3], []], dtype=np.int64, casting='unsafe')

Why is [] considered a float?

asked May 28 '21 22:05

1 Answer

np.append follows well-defined semantic rules, like any NumPy binary operation. It first converts the input operands to NumPy arrays if they are not arrays already (typically with np.array), then applies the semantic rules to find the type of the resulting array and to check that the operation is valid, and only then performs the actual operation (here, the concatenation). According to the documentation, the array type returned by np.array is "determined as the minimum type required to hold the objects in the sequence". When the list is empty, as in your case, there are no objects to inspect, so the default type numpy.float64 is used, as stated in the documentation of np.empty. Combining an integer array with a float64 array then yields float64. This arbitrary choice was made long ago and has not been changed since, in order not to break old code. Note that not all NumPy developers seem to agree with the current choice, so it remains a matter of debate. For more information, you can read this open issue.
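The two steps described above can be checked directly. This is only a minimal sketch: np.result_type is used here to illustrate the promotion step, and it is an assumption that this mirrors what np.append does internally rather than a statement about its implementation.

```python
import numpy as np

# Step 1: an empty list is converted to a float64 array by default.
empty = np.array([])
print(empty.dtype)

# Step 2: promoting an integer dtype together with float64 gives float64.
ints = np.array([1, 2, 3])
promoted = np.result_type(ints.dtype, empty.dtype)
print(promoted)

# The appended result therefore ends up as float64 as well.
result = np.append(ints, [])
print(result.dtype)
```

All three prints should show float64, whatever the platform's default integer type is.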

The rule of thumb is either to use existing NumPy arrays or to perform an explicit conversion to a NumPy array using np.array with a fixed dtype parameter (as described in the above comments).
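As a sketch of that rule of thumb, the empty operand can be given an explicit dtype before it is appended. The variable names below are illustrative only.

```python
import numpy as np

arr = np.array([1, 2, 3], dtype=np.int32)

# Give the empty operand an explicit dtype so that no float64 array
# takes part in the type promotion.
no_values = np.array([], dtype=arr.dtype)

appended = np.append(arr, no_values)
print(appended.dtype)

# The same idea works with np.concatenate: both inputs share one dtype.
joined = np.concatenate([arr, no_values])
print(joined.dtype)
```

Because both operands are already int32, the result stays int32 and no casting argument is needed.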

answered Nov 18 '22 13:11

Jérôme Richard
