Finding html element with class using lxml

Tags:

I've searched everywhere and what I most found was doc.xpath('//element[@class="classname"]'), but this does not work no matter what I try.

code I'm using

import lxml.html

def check():
    data = urlopen('url').read();
    return str(data);

doc = lxml.html.document_fromstring(check())
el = doc.xpath("//div[@class='test']")
print(el)

It simply prints an empty list.

Edit: How odd. I used google as a test page and it works fine there, but it doesn't work on the page I was using (youtube)

Here's the exact code I'm using.

import lxml.html
from urllib.request import urlopen
import sys

def check():
    data = urlopen('http://www.youtube.com/user/TopGear').read(); #TopGear as a test
    return data.decode('utf-8', 'ignore');


doc = lxml.html.document_fromstring(check())
el = doc.xpath("//div[@class='channel']")
print(el)

523

asked Nov 22 '11 12:11

Vexx

2 Answers

The TopGear page that you use for testing doesn't have any <div class="channel"> elements. But this works (for example):

el = doc.xpath("//div[@class='channel-title-container']")

Or this:

el = doc.xpath("//div[@class='a yb xr']")

To find <div> elements with a class attribute that contains the string channel, you could use

el = doc.xpath("//div[contains(@class, 'channel')]")

121

answered Sep 22 '22 16:09

mzjn

You can use lxml.cssselect to simplify class and id request: http://lxml.de/dev/cssselect.html

answered Sep 23 '22 16:09

dmzkrsk

Related questions
                            
                                Method to tell if ancestor is a Class or Module in Ruby?
                            
                                How do I extend the Django "login" form?
                            
                                Can anybody give a good example of what to use generic classes for?
                            
                                Why Does Binding to a Struct Not Work?
                            
                                How to initialize an array in C++ objects
                            
                                Type Hinting for objects of type that's being defined [duplicate]
                            
                                Is it a syntax error in C++ to end a function inside a class definition with a };?
                            
                                When structures are better than classes? [duplicate]
                            
                                Change python mro at runtime
                            
                                Win32 C++ Create a Window and Procedure Within a Class
                            
                                Typed Array should be recycled after use with #recycle()
                            
                                Is it possible to dynamically inherit from a class that is only known at runtime in python?
                            
                                php static property
                            
                                How to list all Variables of a class in swift
                            
                                How to find CPU-intensive class in Java?
                            
                                How to get type of class without initiating object?
                            
                                PHP - catchall method in a class
                            
                                How does the c++ sizeof operator calculate size?
                            
                                line 60, in make_tuple return tuple(l) TypeError: iter() returned non-iterator of type 'Vector'
                            
                                C++ difference between virtual = 0; and empty function

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Finding html element with class using lxml

Tags:

python-3.x

class

lxml

Vexx

People also ask

2 Answers

mzjn

dmzkrsk

Recent Activity

Donate For Us