How can I parse a host:port pair in Python

Tags:

python

Suppose I have a string of the of the format host:port, where :port is optional. How can I reliably extract the two components?

The host can be any of:

A hostname (localhost, www.google.com)
An IPv4 literal (1.2.3.4)
An IPv6 literal ([aaaa:bbbb::cccc]).

In other words, this is the standard format used across the internet (such as in URIs: complete grammar at https://www.rfc-editor.org/rfc/rfc3986#section-3.2, excluding the "User Information" component).

So, some possible inputs, and desired outputs:

'localhost' -> ('localhost', None)
'my-example.com:1234' -> ('my-example.com', 1234)
'1.2.3.4' -> ('1.2.3.4', None)
'[0abc:1def::1234]' -> ('[0abc:1def::1234]', None)

321

asked Oct 22 '17 16:10

richvdh

2 Answers

Well, this is Python, with batteries included. You have mention that the format is the standard one used in URIs, so how about urllib.parse?

import urllib.parse

def parse_hostport(hp):
    # urlparse() and urlsplit() insists on absolute URLs starting with "//"
    result = urllib.parse.urlsplit('//' + hp)
    return result.hostname, result.port

This should handle any valid host:port you can throw at it.

178

answered Oct 30 '22 09:10

twisteroid ambassador

Came up with a dead simple regexp that seems to work in most cases:

def get_host_pair(value):
    return re.search(r'^(.*?)(?::(\d+))?$', value).groups()

get_host_pair('localhost')
get_host_pair('localhost:80')
get_host_pair('[::1]')
get_host_pair('[::1]:8080')

It probably doesn't work when the base input is invalid however

answered Oct 30 '22 10:10

Romuald Brunet

Related questions
                            
                                Django F expression on datetime objects
                            
                                Keras: real amount of GPU memory used
                            
                                Python3 ImportError: No module named '_tkinter' [duplicate]
                            
                                "Unable to get Filesystem for path" error when training neural network on google cloud
                            
                                Scrapy on a schedule
                            
                                Python: requests can't login to a website
                            
                                Ignoring NaN in a dataframe
                            
                                Creating a dictionary for each word in a file and counting the frequency of words that follow it
                            
                                How to wait for object to change state
                            
                                loading an image from cifar-10 dataset
                            
                                Input shape and Conv1d in Keras
                            
                                Why does computational time decrease when removing unnecessary items from a list in Python
                            
                                Google cloud vision not accepting base64 encoded images python
                            
                                How to get length of query result SqlAlchemy
                            
                                How to keep column MultiIndex values when merging pandas DataFrames
                            
                                Use os.listdir to show directories only [duplicate]
                            
                                Matplotlib reads jpg into int8 and png into normalized float
                            
                                Using a colormap for matplotlib line plots
                            
                                PYQT - nesting widgets and layouts in multiple levels
                            
                                How to remove the multiindex from GroupBy.apply()?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With