Convert string with NaNs to int in pandas

Tags:

I have a pandas dataframe, all the values are strings. Some are 'None's, and the rest are integers but in string format, such as '123456'. How can I convert all 'None's to np.nan, and others to integers, like, 123456.

df = {'col1': ['1', 'None'], 'col2': ['None', '123']}

Convert df to:

df = {'col1': [1, NaN], 'col2': [NaN, 123]}

999

asked Apr 09 '19 02:04

Ting Wang

2 Answers

Use the below code:

print(df.replace('None', np.nan).astype(float))

Output:

   col1   col2
0   1.0    NaN
1   NaN  123.0

You have to use replace.

P.S. if df is a dictionary, convert it first:

df = pd.DataFrame(df)

165

answered Sep 29 '22 01:09

U12-Forward

You can convert your columns to Nullable Integer type (new in 0.24+):

d = {'col1': ['1', 'None'], 'col2': ['None', '123']}
res = pd.DataFrame({
    k: pd.to_numeric(v, errors='coerce') for k, v in d.items()}, dtype='Int32')
res

   col1  col2
0     1   NaN
1   NaN   123

With this solution, numeric data is converted to integers (but missing data remains as NaN):

res.to_dict()
# {'col1': [1, nan], 'col2': [nan, 123]}

On older versions, convert to object when initialising the DataFrame:

res = pd.DataFrame({
    k: pd.to_numeric(v, errors='coerce') for k, v in d.items()}, dtype=object)
res

  col1 col2
0    1  NaN
1  NaN  123

It is different from the nullable types solution above—only the representation changes, not the actual data.

res.to_dict()
#  {'col1': [1.0, nan], 'col2': [nan, 123.0]}

answered Sep 28 '22 23:09

cs95

Related questions
                            
                                Append list to pandas DataFrame as new row with index
                            
                                How to convert a python script in a local conda env into systemd service in Linux?
                            
                                Why am I receive AlreadyExistsError?
                            
                                LabelEncoder that keeps missing values as 'NaN'
                            
                                How to generate both server and client certificates under root CA
                            
                                Where can I find numpy.where() source code? [duplicate]
                            
                                Python type-hint friendly type that constrains possible values
                            
                                Why is `json.dump()` not ending the line with `\n`?
                            
                                Python: logging comments printed to console before other outputs
                            
                                Wrong current working directory when running python code and jupyter extension in vscode
                            
                                Find elements in a list of which all elements in another list are factors, using a list comprehension
                            
                                Homebrew pyenv install error dyld: Library not loaded: /usr/local/opt/readline/lib/libreadline.7.dylib
                            
                                Python pytest does not show assertion differences
                            
                                /usr/lib/x86_64-linux-gnu/libstdc++.so.6: version `GLIBCXX_3.4.21' not found required by TensorFlow
                            
                                How to run flask_migrate in Docker
                            
                                Pytest - testing parser Error : Unrecognised arguments
                            
                                Pandas groupby give any non nan values
                            
                                How to train a neural network model with bert embeddings instead of static embeddings like glove/fasttext?
                            
                                how to avoid using _siftup or _siftdown in heapq
                            
                                redis installation using conda not working ModuleNotFoundError No module named 'redis'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Convert string with NaNs to int in pandas

Tags:

python

pandas

Ting Wang

People also ask

2 Answers

U12-Forward

cs95

Recent Activity

Donate For Us