Removing HTTP and WWW from URL python

Tags:

python

url

url1='www.google.com'
url2='http://www.google.com'
url3='http://google.com'
url4='www.google'
url5='http://www.google.com/images'
url6='https://www.youtube.com/watch?v=6RB89BOxaYY

How to strip http(s) and www from url in Python?

202

asked Nov 17 '16 08:11

guri

2 Answers

A more elegant solution would be using urlparse:

from urllib.parse import urlparse

def get_hostname(url, uri_type='both'):
    """Get the host name from the url"""
    parsed_uri = urlparse(url)
    if uri_type == 'both':
        return '{uri.scheme}://{uri.netloc}/'.format(uri=parsed_uri)
    elif uri_type == 'netloc_only':
        return '{uri.netloc}'.format(uri=parsed_uri)

The first option includes https or http, depending on the link, and the second part netloc includes what you were looking for.

answered Sep 20 '22 14:09

JohnAndrews

You can use the string method replace:

url = 'http://www.google.com/images'
url = url.replace("http://www.","")

or you can use regular expressions:

import re

url = re.compile(r"https?://(www\.)?")
url = url.sub('', 'http://www.google.com/images').strip().strip('/')

answered Sep 18 '22 14:09

Januka samaranyake

Related questions
                            
                                Sqlalchemy, raw query and parameters
                            
                                Deploying Django to AWS - WSGIPath refers to a file that does not exist
                            
                                Django Rest Framework nested serializer not showing related data
                            
                                WeasyPrint page size wrong. (8.27in x 11.69 in)
                            
                                Unrecognized commands in bash are captured by the python interpreter [closed]
                            
                                How do you add a simple counter column that increases by one in each row to a Pandas DataFrame?
                            
                                Too many if statements
                            
                                How to label y-axis when using a secondary y-axis?
                            
                                Confusing behaviour of Pandas crosstab() function with dataframe containing NaN values
                            
                                Do all callables have __name__?
                            
                                When would you use reduce() instead of sum()?
                            
                                Django 1.9 Installation SyntaxError: invalid syntax [duplicate]
                            
                                Insert list of lists into single column of pandas df
                            
                                How to generate a word frequency histogram, where bars are ordered according to their height
                            
                                How to "render" HTML with PyQt5's QWebEngineView
                            
                                Throttling in Bokeh application
                            
                                Simplest example for streaming audio with Alexa
                            
                                Have Pandas column containing lists, how to pivot unique list elements to columns?
                            
                                Add one more StructField to schema
                            
                                MultiIndex Slicing requires the index to be fully lexsorted

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With