How can I replicate rows in Pandas?

Tags:

My pandas dataframe looks like this:

   Person  ID   ZipCode   Gender
0  12345   882  38182     Female
1  32917   271  88172     Male
2  18273   552  90291     Female

I want to replicate every row 3 times like:

   Person  ID   ZipCode   Gender
0  12345   882  38182     Female
0  12345   882  38182     Female
0  12345   882  38182     Female
1  32917   271  88172     Male
1  32917   271  88172     Male
1  32917   271  88172     Male
2  18273   552  90291     Female
2  18273   552  90291     Female
2  18273   552  90291     Female

And of course, reset the index so it is:

0
1
2
...

I tried solutions such as:

pd.concat([df[:5]]*3, ignore_index=True)

And:

df.reindex(np.repeat(df.index.values, df['ID']), method='ffill')

But none of them worked.

864

asked Jun 10 '18 22:06

5 Answers

Use `np.repeat`:

Version 1:

Try using np.repeat:

newdf = pd.DataFrame(np.repeat(df.values, 3, axis=0)) newdf.columns = df.columns print(newdf)

The above code will output:

  Person   ID ZipCode  Gender 0  12345  882   38182  Female 1  12345  882   38182  Female 2  12345  882   38182  Female 3  32917  271   88172    Male 4  32917  271   88172    Male 5  32917  271   88172    Male 6  18273  552   90291  Female 7  18273  552   90291  Female 8  18273  552   90291  Female

np.repeat repeats the values of df, 3 times.

Then we add the columns with assigning new_df.columns = df.columns.

Version 2:

You could also assign the column names in the first line, like below:

newdf = pd.DataFrame(np.repeat(df.values, 3, axis=0), columns=df.columns) print(newdf)

The above code will also output:

  Person   ID ZipCode  Gender 0  12345  882   38182  Female 1  12345  882   38182  Female 2  12345  882   38182  Female 3  32917  271   88172    Male 4  32917  271   88172    Male 5  32917  271   88172    Male 6  18273  552   90291  Female 7  18273  552   90291  Female 8  18273  552   90291  Female

136

answered Sep 22 '22 23:09

U12-Forward

These will repeat the indices and preserve the columns as op demonstrated

`iloc` version 1

df.iloc[np.arange(len(df)).repeat(3)]

`iloc` version 2

df.iloc[np.arange(len(df) * 3) // 3]

answered Sep 24 '22 23:09

IMCoins

You can try the following code:

df = df.iloc[df.index.repeat(3),:].reset_index()

df.index.repeat(3) will create a list where each index value will be repeated 3 times and df.iloc[df.index.repeat(3),:] will help generate a dataframe with the rows as exactly returned by this list.

answered Sep 25 '22 23:09

mahesha sahoo

Related questions
                            
                                HSV to RGB Color Conversion
                            
                                Python lightweight database wrapper for SQLite
                            
                                How to add an image in Tkinter?
                            
                                How to write Pandas dataframe to sqlite with Index
                            
                                How can I check if a Pandas dataframe's index is sorted
                            
                                Python parse CSV ignoring comma with double-quotes
                            
                                How to find first non-zero value in every column of a numpy array?
                            
                                Concise way to getattr() and use it if not None in Python
                            
                                Download and decompress gzipped file in memory?
                            
                                bbox_to_anchor and loc in matplotlib
                            
                                What is the order of evaluation in python when using pop(), list[-1] and +=?
                            
                                How to convert a boto3 Dynamo DB item to a regular dictionary in Python?
                            
                                running code if try statements were successful in python
                            
                                assign operator to variable in python?
                            
                                changing the values of the diagonal of a matrix in numpy
                            
                                how to post multiple value with same key in python requests?
                            
                                How do I reverse a part (slice) of a list in Python?
                            
                                Flask-WTF - validate_on_submit() is never executed
                            
                                How to keep multiple independent celery queues?
                            
                                Un-persisting all dataframes in (py)spark

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I replicate rows in Pandas?

Tags:

python

pandas

dataframe

repeat

DasVisual

People also ask

5 Answers

Use `np.repeat`:

Version 1:

Version 2:

U12-Forward

`iloc` version 1

`iloc` version 2

piRSquared

BENY

IMCoins

mahesha sahoo

Recent Activity

Donate For Us

How can I replicate rows in Pandas?

Tags:

python

pandas

dataframe

repeat

DasVisual

People also ask

5 Answers

Use np.repeat:

Version 1:

Version 2:

U12-Forward

iloc version 1

iloc version 2

piRSquared

BENY

IMCoins

mahesha sahoo

Related questions

Recent Activity

Donate For Us

Use `np.repeat`:

`iloc` version 1

`iloc` version 2