Difference between list(numpy_array) and numpy_array.tolist()

Tags:

python

arrays

list

numpy

What is the difference between applying list() to a numpy array and calling tolist() on it?

I checked the types of both outputs, and both show that the result is a list; however, the outputs don't look exactly the same. Is that because list() is not a numpy-specific method (i.e. it can be applied to any sequence) while tolist() is numpy-specific, and in this case they just happen to return the same thing?

Input:

import numpy

points = numpy.random.random((5, 2))
print "Points type: " + str(type(points))

Output:

Points type: <type 'numpy.ndarray'>

Input:

points_list = list(points)
print points_list
print "Points_list type: " + str(type(points_list))

Output:

[array([ 0.15920058,  0.60861985]), array([ 0.77414769,  0.15181626]), array([ 0.99826806,  0.96183059]), array([ 0.61830768,  0.20023207]), array([ 0.28422605,  0.94669097])]
Points_list type: <type 'list'>

Input:

points_list_alt = points.tolist()
print points_list_alt
print "Points_list_alt type: " + str(type(points_list_alt))

Output:

[[0.15920057939342847, 0.6086198537462152], [0.7741476852713319, 0.15181626186774055], [0.9982680580550761, 0.9618305944859845], [0.6183076760274226, 0.20023206937408744], [0.28422604852159594, 0.9466909685812506]]

Points_list_alt type: <type 'list'>

812

asked Jan 11 '15 18:01

atoregozh

2 Answers

Your example already shows the difference; consider the following 2D array:

>>> import numpy as np
>>> a = np.arange(4).reshape(2, 2)
>>> a
array([[0, 1],
       [2, 3]])
>>> a.tolist()
[[0, 1], [2, 3]] # nested vanilla lists
>>> list(a)
[array([0, 1]), array([2, 3])] # list of arrays

tolist handles the full conversion to nested vanilla lists (i.e. list of list of int), whereas list just iterates over the first dimension of the array, creating a list of arrays (list of np.ndarray of np.int64). Although both are lists:

>>> type(list(a))
<type 'list'>
>>> type(a.tolist())
<type 'list'>

the elements of each list have a different type:

>>> type(list(a)[0])
<type 'numpy.ndarray'>
>>> type(a.tolist()[0])
<type 'list'>

The other difference, as you note, is that list will work on any iterable, whereas tolist can only be called on objects that specifically implement that method.
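The difference in element types has practical consequences. As a minimal sketch (written for Python 3, unlike the Python 2 sessions above, and using the standard json module purely as an illustration that the original answer does not mention), the nested lists from tolist() serialize cleanly, while the list of arrays from list() does not:

```python
import json

import numpy as np

a = np.arange(4).reshape(2, 2)

as_list = list(a)       # one level only: a list of 1-D arrays
as_nested = a.tolist()  # every level: nested lists of Python ints

# The innermost elements differ: NumPy scalars vs. built-in ints.
assert isinstance(as_list[0], np.ndarray)
assert isinstance(as_list[0][0], np.integer)
assert type(as_nested[0][0]) is int

# json only knows built-in types, so only the tolist() result works.
encoded = json.dumps(as_nested)

try:
    json.dumps(as_list)
    list_is_serializable = True
except TypeError:
    # ndarray objects are not JSON serializable
    list_is_serializable = False
```

So when the data has to leave NumPy entirely (JSON, plain-Python APIs), tolist() is usually the safer choice, while list() is enough if you only want to loop over the rows as arrays.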

173

answered Oct 12 '22 20:10

jonrsharpe

.tolist() converts all of the values recursively to Python built-in types (nested lists of Python scalars), whereas list creates a Python list from an iterable. Since iterating over a 2D numpy array yields its rows as 1D arrays, list(...) creates a list of arrays.

You can think of list as a function that looks like this:

# Not the actual implementation, just for demo purposes
def list(iterable):
    newlist = []
    for obj in iter(iterable):
        newlist.append(obj)
    return newlist
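By the same token, tolist() can be pictured as the recursive counterpart of that loop. The helper below, to_nested_list, is a hypothetical name made up for this sketch (Python 3); it is not how NumPy implements tolist(), only a model of the behaviour:

```python
import numpy as np


def to_nested_list(value):
    """Hypothetical model of ndarray.tolist(); not NumPy's implementation."""
    if isinstance(value, np.ndarray):
        if value.ndim == 0:
            # A 0-d array cannot be iterated; unwrap it to a Python scalar.
            return value.item()
        # Recurse into each sub-array along the first dimension.
        return [to_nested_list(sub) for sub in value]
    if isinstance(value, np.generic):
        # NumPy scalar (e.g. np.int64) -> built-in Python scalar.
        return value.item()
    return value


a = np.arange(8).reshape(2, 2, 2)
nested = to_nested_list(a)
shallow = list(a)  # stops after the first dimension
```

Here nested matches a.tolist() all the way down, whereas shallow is a two-element list whose items are still 2x2 arrays.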

answered Oct 12 '22 20:10

Anthony Sottile
