Python 3 - Getting some strings from a HTTPrequest response

Question

I'm having a hard time extracting data from a httprequest response.

Can somebody help me? Here's a part of my code:

import requests

r = requests.get('https://www.example.com', verify=True)
keyword = r.text.find('loginfield')
print (keyword)

>>> 42136

42136 value basically means that string 'loginfield' exists on the response.text. But how do I extract specific strings from it?

Like for example I want to extract these exact strings:

<title>Some title here</title>

or this one:

<div id='bla...' #continues extracting of strings until it stops where I want it to stop extracting.

Anybody got an idea on how should I approach this problem?

TerryA · Accepted Answer

You can use BeautifulSoup to parse HTML and get tags. Here's an example piece of code:

import requests
from bs4 import BeautifulSoup as BS
r = requests.get('https://www.example.com', verify=True)
soup = BS(r.text)
print(soup.find('title').text)

Should print:

Some title here

But depends on if it's the first title or not

Python 3 - Getting some strings from a HTTPrequest response

Tags:

python

string

python-3.x

text-parsing

httprequest

Arthur Codova

1 Answers

TerryA

Recent Activity

Donate For Us

Python 3 - Getting some strings from a HTTPrequest response

Tags:

python

string

python-3.x

text-parsing

httprequest

Arthur Codova

1 Answers

TerryA

Related questions

Recent Activity

Donate For Us