Apologize if this has been asked before, somehow I am not able to find the answer to this. Let's say I have two lists of values: <pre class="prettyprint"><code>rows = [0,1,2] cols = [0,2,3] </code></pre> that represents indexes of rows and columns respectively. The two lists combined signified sort of coordinates in the matrix, i.e (0,0), (1,2), (2,3). I would like to use those coordinates to change specific cells of the <code>dataframe</code> without using a loop. In numpy, this is trivial: <pre class="prettyprint"><code>data = np.ones((4,4)) data[rows, cols] = np.nan array([[nan, 1., 1., 1.], [ 1., 1., nan, 1.], [ 1., 1., 1., nan], [ 1., 1., 1., 1.]]) </code></pre> But in pandas, it seems I am stuck with a loop: <pre class="prettyprint"><code>df = pd.DataFrame(np.ones((4,4))) for _r, _c in zip(rows, cols): df.iat[_r, _c] = np.nan </code></pre> Is there a way to use to vectors that lists coordinate-like index to directly modify cells in pandas? <hr> Please note that the answer is not to use iloc instead, this selects the intersection of entire rows and columns.

Very simple! Exploit the fact that pandas is built on top of <code>numpy</code> and use <code>DataFrame.values</code> <pre class="prettyprint"><code>df.values[rows, cols] = np.nan </code></pre> Output: <pre class="prettyprint"><code> 0 1 2 3 0 NaN 1.0 1.0 1.0 1 1.0 1.0 NaN 1.0 2 1.0 1.0 1.0 NaN 3 1.0 1.0 1.0 1.0 </code></pre>

How to change dataframe cells values with "coordinate-like" indexes stored in two lists/vectors/series?

Tags:

python

pandas

Apologize if this has been asked before, somehow I am not able to find the answer to this.

Let's say I have two lists of values:

rows = [0,1,2]
cols = [0,2,3]

that represents indexes of rows and columns respectively. The two lists combined signified sort of coordinates in the matrix, i.e (0,0), (1,2), (2,3).

I would like to use those coordinates to change specific cells of the dataframe without using a loop.

In numpy, this is trivial:

data = np.ones((4,4))
data[rows, cols] = np.nan

array([[nan,  1.,  1.,  1.],
      [ 1.,  1., nan,  1.],
      [ 1.,  1.,  1., nan],
      [ 1.,  1.,  1.,  1.]])

But in pandas, it seems I am stuck with a loop:

df = pd.DataFrame(np.ones((4,4)))
for _r, _c in zip(rows, cols): 
    df.iat[_r, _c] = np.nan

Is there a way to use to vectors that lists coordinate-like index to directly modify cells in pandas?

Please note that the answer is not to use iloc instead, this selects the intersection of entire rows and columns.

932

asked Aug 21 '18 13:08

toto_tico

1 Answers

Very simple! Exploit the fact that pandas is built on top of numpy and use DataFrame.values

df.values[rows, cols] = np.nan

Output:

     0    1    2    3
0  NaN  1.0  1.0  1.0
1  1.0  1.0  NaN  1.0
2  1.0  1.0  1.0  NaN
3  1.0  1.0  1.0  1.0

196

answered Oct 24 '22 18:10

Yuca

Related questions
                            
                                Keras Word2Vec implementation
                            
                                Pandas: How to add column to multiindexed dataframe?
                            
                                Faster alternative to iterrows
                            
                                Sum matrix elements group by indices in Python
                            
                                Python mypy unable to infer type from union return types
                            
                                Is there a way to take screenshot of a window in pyqt5 or qt5?
                            
                                How to correctly use mask_zero=True for Keras Embedding with pre-trained weights?
                            
                                Python sklearn's labelencoder with categorical bins
                            
                                TensorFlow - Stop training when losses reach a defined value
                            
                                Python ctypes to return an array of function pointers
                            
                                How to use Plotly/Dash (Python) completely offline?
                            
                                How to create string with line breaks in Python? [duplicate]
                            
                                Line plot with multiple lines pandas
                            
                                Keras dot/Dot layer behavior on 3D tensors
                            
                                Using Gaussian Mixture for 1D array in python sklearn
                            
                                win32gui MoveWindow() not aligned with left edge of screen
                            
                                Can I use JetBrains MPS in a web application?
                            
                                Elasticsearch query date range does not work
                            
                                Time Series prediction with multiple features in the input data
                            
                                Gunicorn worker doesn't deflate memory after request

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With