Stop Pandas from converting int to float

Tags:

I have a DataFrame with two columns: a column of int and a column of str.

I understand that if I insert NaN into the int column, Pandas will convert all the int into float because there is no NaN value for an int.
However, when I insert None into the str column, Pandas converts all my int to float as well. This doesn't make sense to me - why does the value I put in column 2 affect column 1?

Here's a simple working example):

import pandas as pd
df = pd.DataFrame()
df["int"] = pd.Series([], dtype=int)
df["str"] = pd.Series([], dtype=str)

df.loc[0] = [0, "zero"]
print(df)
print()

df.loc[1] = [1, None]
print(df)

The output is:

   int   str
0    0  zero

   int   str
0  0.0  zero
1  1.0   NaN

Is there any way to make the output the following:

   int   str
0    0  zero

   int   str
0    0  zero
1    1   NaN

without recasting the first column to int.

I prefer using int instead of float because the actual data in that column are integers. If there's not workaround, I'll just use float though.
I prefer not having to recast because in my actual code, I don't
store the actual dtype.
I also need the data inserted row-by-row.

439

asked Oct 26 '16 00:10

user2570465

2 Answers

If you set dtype=object, your series will be able to contain arbitrary data types:

df["int"] = pd.Series([], dtype=object)
df["str"] = pd.Series([], dtype=str)
df.loc[0] = [0, "zero"]
print(df)
print()
df.loc[1] = [1, None]
print(df)

   int   str
0    0  zero
1  NaN   NaN

  int   str
0   0  zero
1   1  None

101

answered Oct 12 '22 06:10

maxymoo

As of pandas 1.0.0 I believe you have another option, which is to first use convert_dtypes. This converts the dataframe columns to dtypes that support pd.NA, avoiding the issues with NaN/None.

...

df = df.convert_dtypes()
df.loc[1] = [1, None]
print(df)

#   int   str
# 0   0  zero
# 1   1  NaN

answered Oct 12 '22 07:10

totalhack

Related questions
                            
                                Is there any nosql flat file database just as sqlite? [closed]
                            
                                Do overridden methods inherit decorators in python?
                            
                                Genetic Algorithms and multi-objectives optimization on PYTHON : libraries/tools to use? [closed]
                            
                                Django Selective Dumpdata
                            
                                Please explain "Task was destroyed but it is pending!"
                            
                                How can I find the full path to a font from its display name on a Mac?
                            
                                A comparison between fastparquet and pyarrow?
                            
                                Best way to encode tuples with json
                            
                                Matplotlib savefig with a legend outside the plot
                            
                                Celery Worker Database Connection Pooling
                            
                                If x is list, why does x += "ha" work, while x = x + "ha" throws an exception?
                            
                                What does "Symbol not found / Expected in: flat namespace" actually mean?
                            
                                Python 3 dictionary with known keys typing
                            
                                Cross-platform desktop notifier in Python
                            
                                Convert JSON to SQLite in Python - How to map json keys to database columns properly?
                            
                                How to export figures to files from IPython Notebook
                            
                                Visual Studio Code: run Python file with arguments
                            
                                Python: Semantic similarity score for Strings [duplicate]
                            
                                Pyodbc - "Data source name not found, and no default driver specified"
                            
                                Why would running scheduled tasks with Celery be preferable over crontab?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Stop Pandas from converting int to float

Tags:

python

type-conversion

pandas

type-inference

user2570465

People also ask

2 Answers

maxymoo

totalhack

Recent Activity

Donate For Us