repeating the rows of a data frame

Tags:

I'm trying repeat the rows of a dataframe. Here's my original data:

pd.DataFrame([
        {'col1': 1, 'col2': 11, 'col3': [1, 2] },
        {'col1': 2, 'col2': 22, 'col3': [1, 2, 3] },
        {'col1': 3, 'col2': 33, 'col3': [1] },
        {'col1': 4, 'col2': 44, 'col3': [1, 2, 3, 4] },
    ])

which gives me

   col1  col2          col3
0     1    11        [1, 2]
1     2    22     [1, 2, 3]
2     3    33           [1]
3     4    44  [1, 2, 3, 4]

I'd like to repeat the rows depending on the length of the array in col3 i.e. I'd like to get a dataframe like this one.

   col1  col2
0     1    11
1     1    11
2     2    22
3     2    22
4     2    22
5     3    33
6     4    44
7     4    44
8     4    44
9     4    44

What's a good way accomplishing this?

930

asked Sep 16 '18 08:09

zinyosrim

1 Answers

You can also use reindex and index.repeat

df = df.reindex(df.index.repeat(df.col3.apply(len)))

df = df.reset_index(drop=True).drop("col3", axis=1)
# To reset index and drop col3 

# Output:

   col1  col2
0   1     11
1   1     11
2   2     22
3   2     22
4   2     22
5   3     33
6   4     44
7   4     44
8   4     44
9   4     44

108

answered Oct 06 '22 00:10

Abhi

Related questions
                            
                                Python: Create structured numpy structured array from two columns in a DataFrame
                            
                                command 'cc' failed with exit status 1 on OSX High Sierra
                            
                                Can I pip install python3.6?
                            
                                Django - ManyRelatedManager object is not iterable when returning Object
                            
                                Resampling a signal with scipy.signal.resample
                            
                                How to make Django sessionId cookie as secure
                            
                                What is the Python equivalent of CPP reinterpret_cast
                            
                                ImportError: cannot import name 'get_default_renderer'
                            
                                Django Rest Framework: HTTP 401 Unauthorized error
                            
                                PonyORM - multiple model files
                            
                                Python 2 Max Function
                            
                                How can I limit regression output between 0 to 1 in keras
                            
                                pyenv-virtualenv: `3.6.4' is not installed in pyenv
                            
                                Performance comparison Static Typing Python 3.6+ vs Cython
                            
                                Message "Exception ignored" when dealing pandas.datetime type
                            
                                How to use He initialization in TensorFlow
                            
                                AWS Rekognition detect label Invalid image encoding error
                            
                                Django: filter queryset by multiple ID
                            
                                Python: Pyppeteer clicking on pop up window
                            
                                Merging multiple bands together through gdal...correctly

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

repeating the rows of a data frame

Tags:

python

pandas

python-3.6

zinyosrim

People also ask

1 Answers

Abhi

Recent Activity

Donate For Us