Set Pandas column values to an array

Tags:

I have the following problem: I have a dataframe like this one:

   col1   col2   col3
0   2       5      4
1   4       3      5
2   6       2      7

Now I have an array for example a = [5,5,5] and i want to insert this array in col3 but only in specific rows (let's say 0 and 2) and obtain something like that:

   col1   col2   col3
0   2       5    [5,5,5]
1   4       3      5
2   6       2    [5,5,5]

The problem is that when I try to do:

 zip_df.at[[0,2],'col3'] = a

I receive the following error ValueError: Must have equal len keys and value when setting with an ndarray. How can I solve this problem?

212

asked Dec 01 '18 23:12

Marco Miglionico

1 Answers

What you're attempting is not recommended.¹ Pandas is not designed to hold list in series. Having said this, you can define a series explicitly and assign via update or loc. Note at is used to get or set a single value only, not multiple values as in your case.

a = [5, 5, 5]
indices = [0, 2]

df['col3'].update(pd.Series([a]*len(indices), index=indices))

# alternative:
# df.loc[indices, 'col3'] = pd.Series([a]*len(indices), index=indices)

print(df)

   col1  col2       col3
0     2     5  [5, 5, 5]
1     4     3          5
2     6     2  [5, 5, 5]

¹ For more information (source):

Don't do this. Pandas was never designed to hold lists in series / columns. You can concoct expensive workarounds, but these are not recommended.

The main reason holding lists in series is not recommended is you lose the vectorised functionality which goes with using NumPy arrays held in contiguous memory blocks. Your series will be of object dtype, which represents a sequence of pointers, much like list. You will lose benefits in terms of memory and performance, as well as access to optimized Pandas methods.

See also What are the advantages of NumPy over regular Python lists? The arguments in favour of Pandas are the same as for NumPy.

137

answered Sep 16 '22 22:09

jpp

Related questions
                            
                                How do i click an element using selenium from a long drop down list?
                            
                                Why does math.isclose() fail to detect minor differences between very large values?
                            
                                Pass command line arguments to test modules
                            
                                pip failling to install for Python 3.7 on MacOs
                            
                                deploying the Tensorflow model in Python
                            
                                Pandas: Group by bi-monthly date field
                            
                                Pandas - Replace other columns in row with 0 if a specific column has a value of 1
                            
                                django.core.exceptions.SuspiciousFileOperation: The joined path is located outside of the base path component
                            
                                My for loop isn't removing items in my array based on condition? Python [duplicate]
                            
                                Python Marshmallow: Dict validation Error
                            
                                PyTorch gradient differs from manually calculated gradient
                            
                                Why cannot python PIL show two images in one program
                            
                                Why do I receive an AttributeError even though import, spelling and file location is correct?
                            
                                Scrapy - Use feed exporter for a particular spider (and not others) in a project
                            
                                Python redirect (with delay)
                            
                                Is it possible to split the training DataLoader (and dataset) into training and validation datasets?
                            
                                how to update scan Cython code in Theano?
                            
                                ML Engine Runtime version and Python version not supported
                            
                                Django - Admin - on form change
                            
                                Python: How to create and use a custom logger in python use logging module?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Set Pandas column values to an array

Tags:

python

arrays

pandas

dataframe

series

Marco Miglionico

People also ask

1 Answers

jpp

Recent Activity

Donate For Us