Python - Split url into its components

Tags:

I have a huge list of urls that are all like this:

http://www.example.com/site/section1/VAR1/VAR2

Where VAR1 and VAR2 are the dynamic elements of the url. What I want to do is to extract from this url string only the VAR1. I've tried to use urlparse but the output look like this:

ParseResult(scheme='http', netloc='www.example.com', path='/site/section1/VAR1/VAR2', params='', query='', fragment='')

521

asked Jul 01 '15 19:07

Hyperion

1 Answers

You can remember this in general. Different sections of the url can be obtained using urlparse. Here you can obtain the path by urlparse(url).path and then obtain the desired variable by split() function

>>> from urlparse import urlparse
>>> url = 'http://www.example.com/site/section1/VAR1/VAR2' 
>>> urlparse(url)
ParseResult(scheme='http', netloc='www.example.com', path='/site/section1/VAR1/VAR2', params='', query='', fragment='')
>>> urlparse(url).path
'/site/section1/VAR1/VAR2'
>>> urlparse(url).path.split('/')[-2]
'VAR1'

175

answered Sep 19 '22 17:09

Naman Sogani

Related questions
                            
                                Sending a password over SSH or SCP with subprocess.Popen
                            
                                Generate correlated data in Python (3.3)
                            
                                How to join all the lines together in a text file in python?
                            
                                Python installation in Mac OS X virtual environment that includes a framework that I can include into Xcode?
                            
                                how to use a Python function with keyword "self" in arguments
                            
                                Installing win32gui python module [duplicate]
                            
                                Is a countvectorizer the same as tfidfvectorizer with use_idf=false?
                            
                                Embedding Python3 in Qt 5
                            
                                Calculating cumulative minimum with numpy arrays
                            
                                How to properly escape strings when manually building SQL queries in SQLAlchemy?
                            
                                Determine what project id my App Engine code is running on
                            
                                how to set autocommit = 1 in a sqlalchemy.engine.Connection
                            
                                'str' does not support the buffer interface Python3 from Python2
                            
                                Error: 'utf8' codec can't decode byte 0x80 in position 0: invalid start byte
                            
                                Calculate correlation between all columns of a DataFrame and all columns of another DataFrame?
                            
                                python scikit error - no module named sklearn
                            
                                Pandas Efficient VWAP Calculation
                            
                                urllib.request module fails to install in my system
                            
                                how to redirect to a external 404 page python flask
                            
                                Sort a list to form the largest possible number

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python - Split url into its components

Tags:

python

regex

urlparse

Hyperion

People also ask

1 Answers

Naman Sogani

Recent Activity

Donate For Us