Python 3: using requests does not get the full content of a web page

Tags:

I am testing using the requests module to get the content of a webpage. But when I look at the content I see that it does not get the full content of the page.

Here is my code:

import requests
from bs4 import BeautifulSoup

url = "https://shop.nordstrom.com/c/womens-dresses-shop?origin=topnav&cm_sp=Top%20Navigation-_-Women-_-Dresses&offset=11&page=3&top=72"
page = requests.get(url)

soup = BeautifulSoup(page.content, 'html.parser')
print(soup.prettify())

Also on the chrome web-browser if I look at the page source I do not see the full content.

Is there a way to get the full content of the example page that I have provided?

899

asked Dec 09 '17 16:12

TJ1

1 Answers

The page is rendered with JavaScript making more requests to fetch additional data. You can fetch the complete page with selenium.

from bs4 import BeautifulSoup
from selenium import webdriver
driver = webdriver.Chrome()
url = "https://shop.nordstrom.com/c/womens-dresses-shop?origin=topnav&cm_sp=Top%20Navigation-_-Women-_-Dresses&offset=11&page=3&top=72"
driver.get(url)
soup = BeautifulSoup(driver.page_source, 'html.parser')
driver.quit()
print(soup.prettify())

For other solutions see my answer to Scraping Google Finance (BeautifulSoup)

172

answered Sep 23 '22 13:09

Dan-Dev

Related questions
                            
                                sklearn classifier get ValueError: bad input shape
                            
                                Set size of matplotlib figure with 3d subplots
                            
                                Why do people default owner parameter to None in __get__?
                            
                                Pandas DataFrame - Combining one column's values with same index into list
                            
                                Saving a cross-validation trained model in Scikit
                            
                                python requests upload large file with additional data
                            
                                Jupyter notebook does not print logs to the output cell
                            
                                How int() object uses "==" operator without __eq__() method in python2?
                            
                                What is the default variable initializer in Tensorflow?
                            
                                Cannot convert string to float in pandas (ValueError)
                            
                                How to document multiple return values using reStructuredText in Python 2?
                            
                                How am I supposed to register a package to PyPI?
                            
                                value error in python statsmodels.tsa.seasonal
                            
                                create a new dataframe from selecting specific rows from existing dataframe python
                            
                                Why Python hasn't true constants? Is it not dangerous?
                            
                                How to share in memory resources between Flask methods when deploying with Gunicorn
                            
                                get_document_topics and get_term_topics in gensim
                            
                                Key <variable_name> not found in checkpoint Tensorflow
                            
                                find duplicate rows in a pandas dataframe
                            
                                seasonal decompose in python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python 3: using requests does not get the full content of a web page

Tags:

python

python-requests

web-scraping

TJ1

People also ask

1 Answers

Dan-Dev

Recent Activity

Donate For Us