Fastest way to merge pandas dataframe on ranges

Tags:

I have a dataframe A

    ip_address
0   13
1   5
2   20
3   11
.. ........

and another dataframe B

    lowerbound_ip_address   upperbound_ip_address           country
0    0                       10                             Australia
1    11                      20                             China

based on this I need to add a column in A such that

ip_address  country
13          China
5           Australia

I have an idea that I should write define a function and then call map on each row of A. But how would I search through each row of B for this. Is there a better way to do this.

271

asked Sep 12 '17 14:09

John Constantine

Video Answer

1 Answers

Use pd.IntervalIndex

In [2503]: s = pd.IntervalIndex.from_arrays(dfb.lowerbound_ip_address,
                                            dfb.upperbound_ip_address, 'both')

In [2504]: dfa.assign(country=dfb.set_index(s).loc[dfa.ip_address].country.values)
Out[2504]:
   ip_address    country
0          13      China
1           5  Australia
2          20      China
3          11      China

Details

In [2505]: s
Out[2505]:
IntervalIndex([[0, 10], [11, 20]]
              closed='both',
              dtype='interval[int64]')

In [2507]: dfb.set_index(s)
Out[2507]:
          lowerbound_ip_address  upperbound_ip_address    country
[0, 10]                       0                     10  Australia
[11, 20]                     11                     20      China

In [2506]: dfb.set_index(s).loc[dfa.ip_address]
Out[2506]:
          lowerbound_ip_address  upperbound_ip_address    country
[11, 20]                     11                     20      China
[0, 10]                       0                     10  Australia
[11, 20]                     11                     20      China
[11, 20]                     11                     20      China

Setup

In [2508]: dfa
Out[2508]:
   ip_address
0          13
1           5
2          20
3          11

In [2509]: dfb
Out[2509]:
   lowerbound_ip_address  upperbound_ip_address    country
0                      0                     10  Australia
1                     11                     20      China

176

answered Oct 27 '22 10:10

Zero

Related questions
                            
                                Count the number of unique characters in a string Python using only for loops and ifel operations
                            
                                Python in vs ==. Which to Use in this case?
                            
                                How can I make a Post Request on Python with urllib3?
                            
                                How to fully disassemble Python source
                            
                                adding custom permission through django-admin, while server is running
                            
                                Replace keys in a dictionary
                            
                                Python conditional one or the other but not both
                            
                                Getting "django.core.exceptions.ImproperlyConfigured: GEOS is required and has not been detected." although GEOS is installed
                            
                                gyp ERR! stack Error: `C:\Program Files (x86)\MSBuild\12.0\bin\msbuild.exe` failed with exit code: 1
                            
                                Understanding python's lstrip method on strings [duplicate]
                            
                                tkinter Treeview widget inserting data
                            
                                Scrapy getting href out of div
                            
                                Can I dump blank instead of null in yaml/pyyaml?
                            
                                How to web scrape followers from Instagram web browser?
                            
                                How do i declare more than one extra-index-url in pip.conf
                            
                                how does 2d kernel density estimation in python (sklearn) work?
                            
                                pandas to_json returns a string not a json object
                            
                                PyQt - Connect QAction to function
                            
                                Check if single element is contained in Numpy Array
                            
                                TypeError: expected string or bytes-like object – with Python/NLTK word_tokenize

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Fastest way to merge pandas dataframe on ranges

Tags:

python

pandas

dataframe

numpy

John Constantine

People also ask

Video Answer

1 Answers

Zero

Recent Activity

Donate For Us