The problem I have is that adding a row to a DataFrame changes the dtype of its columns:
>>> from pandas import DataFrame
>>> df = DataFrame({'a' : range(10)}, dtype='i4')
>>> df
   a
0  0
1  1
2  2
3  3
4  4
5  5
6  6
7  7
8  8
9  9
[10 rows x 1 columns]
I explicitly specified the dtype to be int32 (i.e., 'i4'), as can be seen:
>>> df.dtypes
a int32
dtype: object
However, adding a row changes dtype to float64:
>>> df.loc[10] = 99
>>> df
     a
0    0
1    1
2    2
3    3
4    4
5    5
6    6
7    7
8    8
9    9
10  99
[11 rows x 1 columns]
>>> df.dtypes
a float64
dtype: object
I've tried specifying the dtype of the value that I add:
>>> import numpy as np
>>> df = DataFrame({'a' : np.arange(10, dtype=np.int32)})
>>> df.dtypes
a int32
dtype: object
>>> df.loc[10] = np.int32(0)
>>> df.dtypes
a float64
dtype: object
But that does not work either. Is there any solution, without using functions that return new objects?
Enlargement is done in two stages: a NaN is placed in that column first, then the value is assigned, which is why the dtype is coerced. I'll put it on the bug/enhancement list; it's a bit non-trivial.
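One way to work around the coercion after the fact is to cast the column back to its original dtype. This is a sketch, not the library's official fix: `astype` returns a new Series, but the DataFrame object itself is reused, only the column is replaced. The exact coercion behaviour during enlargement may differ between pandas versions.

```python
import numpy as np
from pandas import DataFrame

df = DataFrame({'a': np.arange(10, dtype=np.int32)})

# Enlarging via .loc may coerce the column to float64 (a NaN is
# placed in the new row first, then the value is assigned).
df.loc[10] = np.int32(99)

# Cast the column back; the DataFrame object stays the same,
# only the column's backing Series is replaced.
df['a'] = df['a'].astype('i4')
print(df.dtypes)
```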
Here's a workaround, using append (note the extra Series import):
In [13]: from pandas import Series

In [14]: df.append(Series(99, [10], dtype='i4').to_frame('a'))
Out[14]:
     a
0    0
1    1
2    2
3    3
4    4
5    5
6    6
7    7
8    8
9    9
10  99
[11 rows x 1 columns]
In [15]: df.append(Series(99,[10],dtype='i4').to_frame('a')).dtypes
Out[15]:
a int32
dtype: object
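In recent pandas versions, where DataFrame.append has been removed (it was deprecated in 1.4 and dropped in 2.0), the same workaround can be written with pd.concat. This is a sketch of the equivalent: because the new row is built with the matching dtype, no NaN is ever introduced and int32 is preserved.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'a': np.arange(10, dtype=np.int32)})

# Build the new row with the matching dtype, then concatenate;
# concat preserves int32 because no NaN is ever introduced.
new_row = pd.DataFrame({'a': np.array([99], dtype=np.int32)}, index=[10])
df = pd.concat([df, new_row])
print(df.dtypes)   # a    int32
```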
Here's the issue tracking the bug/enhancement to do this automagically: https://github.com/pydata/pandas/issues/6485