Firefox with Selenium (Headless)

Install Firefox, xvfb, selenium

echo "deb http://packages.linuxmint.com debian import" >> /etc/apt/sources.list && apt-get update
apt-get install firefox xvfb python-dev python-pip
pip install pyvirtualdisplay selenium

selenium_scrape.py

from pyvirtualdisplay import Display
import time
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.common.exceptions import TimeoutException

display = Display(visible=0, size=(800, 600))
display.start()

def init_driver():
    driver = webdriver.Firefox()
    driver.wait = WebDriverWait(driver, 5)
    return driver

def lookup(driver, query):
    driver.get("http://www.google.com")
    try:
    box = driver.wait.until(EC.presence_of_element_located(
        (By.NAME, "q")))
    button = driver.wait.until(EC.element_to_be_clickable(
        (By.NAME, "btnK")))
    box.send_keys(query)
button.click()
    except TimeoutException:
        print("Box or Button not found in google.com")

if __name__ == "__main__":
    driver = init_driver()
    lookup(driver, "Selenium")
    time.sleep(5)
    driver.quit()

display.stop()

Error

  File "selenium_scrape.py", line 20
    box = driver.wait.until(EC.presence_of_element_located(
      ^
IndentationError: expected an indented block

335

asked Feb 13 '16 16:02

clarkk

1 Answers

The difference is that you cannot use a packaged Chrome browser; you need a special driver... chromedriver.

Get the current latest version here: Chromedriver

Now you have 2 options, either to move the downloaded chromedriver so it is always accessible (option 1), or to define in your script how to access it.

Option 1: move it into path

Then move it so it is accessible when you use webdriver.Chrome():

sudo mv /path/to/download/chromedriver /usr/bin

Also set it to be allowed to be executed:

chmod a+x /usr/binchromedriver

Option 2: do not move it into path

Or you can define a path

import os
chr = "/Users/you/Downloads/chromedriver"
os.environ["webdriver.chrome.driver"] = chr
driver = webdriver.Chrome(chromedriver)

194

answered Oct 01 '22 02:10

PascalVKooten

Related questions
                            
                                Neural network generating incorrect results that are around the average of outputs
                            
                                How to reshape a vector to TensorFlow's filters?
                            
                                get subsection of df based on multiple conditions
                            
                                SQLAlchemy events are not working
                            
                                subclass str, and make new method with same effect as +=
                            
                                Python inheritance old style type in a new style class
                            
                                Python string.format() : formatting nans as 'some text'?
                            
                                Is there a complete list of key event names used by turtle-graphics?
                            
                                Having trouble removing headers when using pd.read_csv
                            
                                Instantiating object automatically adds to SQLAlchemy Session. Why?
                            
                                Numpy 3D array transposed when indexed in single step vs two steps
                            
                                How to use full-text search in sqlite3 database in django?
                            
                                How do I add buttons that are dynamically created in pure python to a kivy layout that is Written in Kivy Language?
                            
                                Python Beautifulsoup Find_all except
                            
                                JSON object must be str, not 'bytes'
                            
                                Can I cause lazy evaluation of an expression?
                            
                                Edit database outside Django ORM
                            
                                What significance test is used for spearmanr in SciPy?
                            
                                PyQt5: Gtk-CRITICAL **: IA__gtk_widget_style_get: assertion 'GTK_IS_WIDGET (widget)' failed
                            
                                Django count grouping by year/month without extra

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Firefox with Selenium (Headless)

Tags:

python

linux

selenium