scrape websites with infinite scrolling

1 Answers

You can use selenium to scrap the infinite scrolling website like twitter or facebook.

Step 1 : Install Selenium using pip

pip install selenium

Step 2 : use the code below to automate infinite scroll and extract the source code

from selenium import webdriver from selenium.webdriver.common.by import By from selenium.webdriver.common.keys import Keys from selenium.webdriver.support.ui import Select from selenium.webdriver.support.ui import WebDriverWait from selenium.common.exceptions import TimeoutException from selenium.webdriver.support import expected_conditions as EC from selenium.common.exceptions import NoSuchElementException from selenium.common.exceptions import NoAlertPresentException import sys  import unittest, time, re  class Sel(unittest.TestCase):     def setUp(self):         self.driver = webdriver.Firefox()         self.driver.implicitly_wait(30)         self.base_url = "https://twitter.com"         self.verificationErrors = []         self.accept_next_alert = True     def test_sel(self):         driver = self.driver         delay = 3         driver.get(self.base_url + "/search?q=stckoverflow&src=typd")         driver.find_element_by_link_text("All").click()         for i in range(1,100):             self.driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")             time.sleep(4)         html_source = driver.page_source         data = html_source.encode('utf-8')   if __name__ == "__main__":     unittest.main()

Step 3 : Print the data if required.

120

answered Oct 05 '22 17:10

Pawan Kumar

Related questions
                            
                                Django - get HTML output into a variable
                            
                                Does PyGame do 3d?
                            
                                link several Popen commands with pipes
                            
                                cProfile for Python does not recognize Function name
                            
                                How to insert blank line using reStructuredText / Sphinx [duplicate]
                            
                                Update method in Python dictionary
                            
                                numpy, how do I find total rows in a 2D array and total column in a 1D array
                            
                                What's the correct way to set up Django translation?
                            
                                Django rest framework override page_size in ViewSet
                            
                                Purpose of return self python
                            
                                How can I extract the nth row of a pandas data frame as a pandas data frame?
                            
                                Pandas groupby multiple fields then diff
                            
                                Convert ipynb notebook to HTML in Google Colab
                            
                                Use variable in Pandas query
                            
                                Why does keras model predict slower after compile?
                            
                                Python : Assert that variable is instance method?
                            
                                Regular expressions in SQLalchemy queries?
                            
                                Removing Trailing Zeros in Python [duplicate]
                            
                                python built-in function to do matrix reduction
                            
                                Python 2.7 Combine abc.abstractmethod and classmethod

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

scrape websites with infinite scrolling

Tags:

python

screen-scraping

scraper

add-semi-colons

People also ask

1 Answers

Pawan Kumar

Recent Activity

Donate For Us