How to extract meta description from URLs using Python?

I want to extract the title and description from the following website:

http://www.virginaustralia.com/au/en/bookings/flights/make-a-booking/

with the following snippet of source code:

<title>Book a Virgin Australia Flight | Virgin Australia
</title>
<meta name="keywords" content="" />
<meta name="description" content="Search for and book Virgin Australia and partner flights to Australian and international destinations." />

I want the title and the content of the description meta tag.

I tried Goose, but it does not extract them correctly. Here is my code:

website_title = [g.extract(url).title for url in clean_url_data]

and

website_meta_description = [g.extract(url).meta_description for url in clean_url_data]

The result is empty.

asked Jun 24 '16 09:06

Technologic27

1 Answer

BeautifulSoup is a good fit for this.

For the page in the question, the following code extracts the content of the "description" meta tag:

import requests
from bs4 import BeautifulSoup

url = 'http://www.virginaustralia.com/au/en/bookings/flights/make-a-booking/'
response = requests.get(url)
# Name the parser explicitly so bs4 does not have to guess one.
soup = BeautifulSoup(response.text, 'html.parser')

# Collect every <meta> tag, then keep the content of the one named "description".
metas = soup.find_all('meta')

print([meta.get('content') for meta in metas if meta.get('name') == 'description'])

Output:

['Search for and book Virgin Australia and partner flights to Australian and international destinations.']
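
The question also asks for the page title, which the code above does not print. As a rough sketch of getting both values in one pass, the example below uses only the standard library's html.parser instead of BeautifulSoup (a swapped-in technique, not the approach above). The class name TitleAndDescriptionParser, the function extract_title_and_description, and the sample_html string are made up for illustration; the sample is built from the snippet in the question so the example runs without a network request.

```python
from html.parser import HTMLParser


class TitleAndDescriptionParser(HTMLParser):
    """Collects the <title> text and the content of <meta name="description">."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self._title_parts = []
        self.description = None

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            attr_map = dict(attrs)
            # Attribute values can be None, so fall back to an empty string.
            name = (attr_map.get("name") or "").lower()
            if name == "description" and self.description is None:
                self.description = attr_map.get("content")

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Title text may arrive in several chunks, so accumulate them.
        if self._in_title:
            self._title_parts.append(data)

    @property
    def title(self):
        text = "".join(self._title_parts).strip()
        return text or None


def extract_title_and_description(html):
    """Return (title, description); either value is None when the tag is missing."""
    parser = TitleAndDescriptionParser()
    parser.feed(html)
    parser.close()
    return parser.title, parser.description


# Hypothetical sample built from the snippet shown in the question.
sample_html = """<html><head>
<title>Book a Virgin Australia Flight | Virgin Australia
</title>
<meta name="keywords" content="" />
<meta name="description" content="Search for and book Virgin Australia and partner flights to Australian and international destinations." />
</head><body></body></html>"""

title, description = extract_title_and_description(sample_html)
print(title)
print(description)
```

For a live page, the HTML string passed to extract_title_and_description could be response.text from the requests call above. Returning None for a missing tag makes it easier to tell "tag not present" apart from an empty string when looping over many URLs.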

answered Oct 17 '22 05:10

linpingta


Tags: python, url, meta-tags, extract, goose
