<code>shift</code> converts my column from integer to float. It turns out that <code>np.nan</code> is float only. Is there any ways to keep shifted column as integer? <pre class="prettyprint"><code>df = pd.DataFrame({"a":range(5)}) df['b'] = df['a'].shift(1) df['a'] # 0 0 # 1 1 # 2 2 # 3 3 # 4 4 # Name: a, dtype: int64 df['b'] # 0 NaN # 1 0 # 2 1 # 3 2 # 4 3 # Name: b, dtype: float64 </code></pre>

Solution for pandas under 0.24: Problem is you get <code>NaN</code> value what is <code>float</code>, so <code>int</code> is converted to <code>float</code> - see na type promotions. One possible solution is convert <code>NaN</code> values to some value like <code>0</code> and then is possible convert to <code>int</code>: <pre class="prettyprint"><code>df = pd.DataFrame({"a":range(5)}) df['b'] = df['a'].shift(1).fillna(0).astype(int) print (df) a b 0 0 0 1 1 0 2 2 1 3 3 2 4 4 3 </code></pre> Solution for pandas 0.24+ - check <code>Series.shift</code>: <blockquote> fill_value object, optional The scalar value to use for newly introduced missing values. the default depends on the dtype of self. For numeric data, np.nan is used. For datetime, timedelta, or period data, etc. NaT is used. For extension dtypes, self.dtype.na_value is used. Changed in version 0.24.0. </blockquote> <pre class="prettyprint"><code>df['b'] = df['a'].shift(fill_value=0) </code></pre>

pandas shift converts my column from integer to float.

Tags:

python

pandas

numpy

shift converts my column from integer to float. It turns out that np.nan is float only. Is there any ways to keep shifted column as integer?

df = pd.DataFrame({"a":range(5)})
df['b'] = df['a'].shift(1)

df['a']
# 0    0
# 1    1
# 2    2
# 3    3
# 4    4
# Name: a, dtype: int64

df['b']

# 0   NaN
# 1     0
# 2     1
# 3     2
# 4     3
# Name: b, dtype: float64

796

asked Jan 26 '17 09:01

user3226167

1 Answers

Solution for pandas under 0.24:

Problem is you get NaN value what is float, so int is converted to float - see na type promotions.

One possible solution is convert NaN values to some value like 0 and then is possible convert to int:

df = pd.DataFrame({"a":range(5)})
df['b'] = df['a'].shift(1).fillna(0).astype(int)
print (df)
   a  b
0  0  0
1  1  0
2  2  1
3  3  2
4  4  3

Solution for pandas 0.24+ - check Series.shift:

fill_value object, optional
The scalar value to use for newly introduced missing values. the default depends on the dtype of self. For numeric data, np.nan is used. For datetime, timedelta, or period data, etc. NaT is used. For extension dtypes, self.dtype.na_value is used.

Changed in version 0.24.0.

df['b'] = df['a'].shift(fill_value=0)

146

answered Oct 17 '22 02:10

jezrael

Related questions
                            
                                python how to trim trailing spaces in csv DictReader keys
                            
                                add leading zeros to a list of numbers in Python
                            
                                How can I add nothing to the list in list comprehension?
                            
                                Python requests base64 image
                            
                                Export BigQuery Data to CSV without using Google Cloud Storage
                            
                                "Can only join an iterable" python error
                            
                                xlsxwriter and LibreOffice not showing formula's result
                            
                                Python @patch not working
                            
                                How to do "(df1 & not df2)" dataframe merge in pandas?
                            
                                How to horizontally center a widget using grid()?
                            
                                How to share object from fixture to all tests using pytest?
                            
                                call a setter from __init__ in Python
                            
                                Kurtosis on groupby of pandas dataframe doesn't work
                            
                                Spurious newlines added in Django management commands
                            
                                Why is flask's jsonify method slow?
                            
                                Most efficient way to search in list of dicts
                            
                                Pyplot errorbar keeps connecting my points with lines?
                            
                                How to operate logic operation of all columns of a 2D numpy array
                            
                                Why can you loop through an implicit tuple in a for loop, but not a comprehension in Python?
                            
                                Divide Dataframe by a series sharing index

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With