Parse BeautifulSoup element into Selenium

I want to get the source code of a website using Selenium, find a particular element using BeautifulSoup, and then pass it back to Selenium as a selenium.webdriver.remote.webelement.WebElement object. Like so:

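driver.get("http://www.google.com")
soup = BeautifulSoup(driver.page_source, "html.parser")
element = soup.find(title="Search")

# Pseudocode for the conversion I am after; Selenium has no such constructor
element = Selenium.webelement(element)
element.click()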

How can I achieve this?


asked Jun 22 '16 23:06

Darth Ludius

2 Answers

A general solution that worked for me is to compute the XPath of the bs4 element, then use that to find the element in Selenium:

xpath = xpath_soup(soup_element)
# find_element_by_xpath was removed in Selenium 4.3; use find_element instead
selenium_element = driver.find_element(by='xpath', value=xpath)

...

def xpath_soup(element):
    """
    Generate xpath of soup element
    :param element: bs4 text or node
    :return: xpath as string
    """
    components = []
    # A text node has no tag name, so start from its enclosing tag
    child = element if element.name else element.parent
    for parent in child.parents:  # each parent is a bs4.element.Tag
        # Direct children of parent that share the child's tag name
        siblings = parent.find_all(child.name, recursive=False)
        # Compare by identity: == on bs4 tags compares content, so two
        # identical siblings would otherwise resolve to the same position
        xpath_index = next(i for i, s in enumerate(siblings, 1) if s is child)
        components.append(child.name if xpath_index == 1 else '%s[%d]' % (child.name, xpath_index))
        child = parent
    components.reverse()
    return '/%s' % '/'.join(components)

answered Oct 19 '22 01:10

Rob Hawkins

from selenium import webdriver
from selenium.webdriver.common.keys import Keys
from bs4 import BeautifulSoup

driver = webdriver.Chrome()
driver.get("http://www.google.com")
soup = BeautifulSoup(driver.page_source, 'html.parser')
search_soup_element = soup.find(title="Search")
input_element = soup.select('input.gsfi.lst-d-f')[0]

search_box = driver.find_element(by='name', value=input_element.attrs['name'])
search_box.send_keys('Hello World!')
search_box.send_keys(Keys.RETURN)

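This pretty much works: find the element in the soup, read one of its attributes (here, name), and use that attribute to locate the same element through the driver. I can see reasons for working with both WebDriver and BeautifulSoup, though this particular example doesn't necessarily need both.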
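The same idea can be generalized a little. Below is a minimal sketch, not part of bs4 or Selenium, that turns a tag name and an attribute dict (such as a bs4 element's name and attrs) into an XPath string. The helper names xpath_literal and xpath_from_attrs, and the default choice of id, name and title as identifying attributes, are assumptions made for illustration. List-valued attributes such as class (which bs4 returns as a list) are skipped.

```python
def xpath_literal(value):
    """Quote a string so it can be embedded in an XPath 1.0 expression."""
    if "'" not in value:
        return "'%s'" % value
    if '"' not in value:
        return '"%s"' % value
    # Both quote characters occur: stitch the pieces together with concat()
    parts = value.split("'")
    return "concat(%s)" % ", \"'\", ".join("'%s'" % p for p in parts)


def xpath_from_attrs(tag_name, attrs, keys=("id", "name", "title")):
    """Build an XPath like //input[@name='q'] from a tag name and attribute dict.

    Only string-valued attributes listed in `keys` are used.
    """
    tests = [
        "@%s=%s" % (key, xpath_literal(attrs[key]))
        for key in keys
        if isinstance(attrs.get(key), str)
    ]
    if not tests:
        raise ValueError("none of %r present on <%s>" % (keys, tag_name))
    return "//%s[%s]" % (tag_name, " and ".join(tests))


# Hypothetical attribute dict, shaped like bs4's element.attrs
print(xpath_from_attrs("input", {"name": "q", "title": "Search", "class": ["gsfi"]}))
# → //input[@name='q' and @title='Search']
```

With the objects from the answer above, this would be called as xpath_from_attrs(input_element.name, input_element.attrs), and the result passed to driver.find_element(by='xpath', value=...). Keep in mind that the soup is a snapshot of driver.page_source, so if the page changes after the snapshot, the locator may no longer match.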

answered Oct 19 '22 01:10

Brian A

Tags:

python

html

beautifulsoup

selenium
