Is there an efficient way to merge two sorted dataframes in pandas, maintaining sortedness?

Tags: python, pandas, numpy

If I have two dataframes (or series) that are already sorted on compatible keys, I'd like to be able to cheaply merge them together and maintain sortedness. I can't see a way to do that other than via concat() followed by an explicit sort:

import pandas as pd

a = pd.DataFrame([0,1,2,3], index=[1,2,3,5], columns=['x'])
b = pd.DataFrame([4,5,6,7], index=[0,1,4,6], columns=['x'])
print(pd.concat([a,b]))
print(pd.concat([a,b]).sort_index())  # DataFrame.sort() was removed in later pandas

   x
1  0
2  1
3  2
5  3
0  4
1  5
4  6
6  7

   x
0  4
1  0
1  5
2  1
3  2
4  6
5  3
6  7

It looks like there has been a bit of related discussion with numpy arrays, suggesting an 'interleave' method, but I haven't found a good answer.

asked May 01 '13 12:05

patricksurry

1 Answer

If we restrict the problem to a and b each having a single column, I would take this route: an outer join on the index, then stack the two resulting columns into one Series.

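# Outer join on the index: keys come back sorted, columns become x_x and x_y.
s = a.merge(b, how='outer', left_index=True, right_index=True)
# Stack both columns into one Series (newer pandas may need .dropna() to discard the NaN padding).
s.stack().reset_index(level=1, drop=True)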
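
For frames with more than one column, the 'interleave' idea mentioned in the question can be sketched with numpy's searchsorted. This is only a rough sketch under some assumptions: both indexes are already sorted ascending, both frames have the same columns, and merge_sorted is a hypothetical helper name, not something pandas provides. On equal keys it places rows of a before rows of b.

```python
import numpy as np
import pandas as pd


def merge_sorted(a, b):
    """Interleave two frames whose indexes are already sorted ascending.

    Hypothetical helper, not a pandas API. On equal keys, rows of `a`
    are placed before rows of `b`.
    """
    ia = a.index.to_numpy()
    ib = b.index.to_numpy()
    # Final position of each row of `a`: its own rank plus the number of
    # keys in `b` that are strictly smaller.
    pos_a = np.arange(len(ia)) + np.searchsorted(ib, ia, side="left")
    # Final position of each row of `b`: its own rank plus the number of
    # keys in `a` that are smaller or equal.
    pos_b = np.arange(len(ib)) + np.searchsorted(ia, ib, side="right")
    # order[k] is the row of concat([a, b]) that belongs at position k.
    order = np.empty(len(ia) + len(ib), dtype=np.intp)
    order[pos_a] = np.arange(len(ia))
    order[pos_b] = len(ia) + np.arange(len(ib))
    return pd.concat([a, b]).iloc[order]


a = pd.DataFrame([0, 1, 2, 3], index=[1, 2, 3, 5], columns=["x"])
b = pd.DataFrame([4, 5, 6, 7], index=[0, 1, 4, 6], columns=["x"])
merged = merge_sorted(a, b)
print(merged)
```

With the a and b from the question this should give the same frame as the concat-then-sort version. The searchsorted calls are binary searches rather than a true linear merge, so whether this actually beats a plain sort on your data is something to time rather than assume.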

answered Nov 12 '22 01:11

Zeugma
