perform operation opposite to pandas ffill

Tags:

Let's say I have the following DataFrame:

df = pd.DataFrame({'player': ['LBJ', 'LBJ', 'LBJ', 'Kyrie', 'Kyrie', 'LBJ', 'LBJ'],
                   'points': [25, 32, 26, 21, 29, 21, 35]})

How can I perform the operation opposite of ffill so I can get the following DataFrame:

df = pd.DataFrame({'player': ['LBJ', np.nan, np.nan, 'Kyrie', np.nan, 'LBJ', np.nan],
                   'points': [25, 32, 26, 21, 29, 21, 35]})

That is, I want to fill directly repeated values with NaN.

Here's what I have so far but I'm hoping there's a built-in pandas method or a better approach:

for i, (index, row) in enumerate(df.iterrows()):
    if i == 0:
        continue
    go_back = 1
    while True:
        past_player = df.ix[i-go_back, 'player']
        if pd.isnull(past_player):
            go_back += 1
            continue
        if row['player'] == past_player:
            df.set_value(index, 'player', value=np.nan)
        break

478

asked Sep 28 '17 22:09

Johnny Metz

2 Answers

ffinv = lambda s: s.mask(s == s.shift())
df.assign(player=ffinv(df.player))

  player  points
0    LBJ      25
1    NaN      32
2    NaN      26
3  Kyrie      21
4    NaN      29
5    LBJ      21
6    NaN      35

176

answered Oct 19 '22 12:10

piRSquared

Probably not the most efficient solution but working would be to use itertools.groupby and itertools.chain:

>>> df['player'] = list(itertools.chain.from_iterable([key] + [float('nan')]*(len(list(val))-1) 
                        for key, val in itertools.groupby(df['player'].tolist())))
>>> df
  player  points
0    LBJ      25
1    NaN      32
2    NaN      26
3  Kyrie      21
4    NaN      29
5    LBJ      21
6    NaN      35

More specifically this illustrates how it works:

for key, val in itertools.groupby(df['player']):
    print([key] + [float('nan')]*(len(list(val))-1))

giving:

['LBJ', nan, nan]
['Kyrie', nan]
['LBJ', nan]

which is then "chained" together.

answered Oct 19 '22 12:10

MSeifert

Related questions
                            
                                Django case insensitive "distinct" query
                            
                                Why does locale.getpreferredencoding() return 'ANSI_X3.4-1968' instead of 'UTF-8'?
                            
                                Zorder specification in matplotlib patch collections?
                            
                                PyCharm: How to document :rtype: for function that returns generator
                            
                                How to create minor ticks for polar plot matplotlib
                            
                                Descending order using heapq
                            
                                Encode CSV file for Sendgrid's Email API
                            
                                AttributeError:'Tensor' object has no attribute '_keras_history'
                            
                                Python error cannot do a non empty take from an empty axes
                            
                                What exactly does the Pandas random_state do?
                            
                                3darray training/testing TensorFlow RNN LSTM
                            
                                In PostgreSQL, where does plpython(3)u output from `print` go?
                            
                                Dask: nunique method on Dataframe groupBy
                            
                                valid UUID is not a valid UUID
                            
                                Dispatching keypresses to embedded Pygame
                            
                                Web Scraping with Python in combination with asyncio
                            
                                list in Python3.6 [duplicate]
                            
                                Automatically find optimal image threshold value from density of histogram plot
                            
                                __add__ two class objects
                            
                                Selenium python web driver doesnt close

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

perform operation opposite to pandas ffill

Tags:

python

pandas

dataframe

Johnny Metz

People also ask

2 Answers

piRSquared

MSeifert

Recent Activity

Donate For Us