Filtering out content with style display:none in an XPath expression

Tags:

xpath

I'm trying to parse with lxml in python and this is my output

<td>
    <span style="display:inline">text1</span>
    <span style="display:none">text2</span>
    <span>text3</span>
    text4
</td>

Thought I was smart enough to use the following

tree = tr.xpath("//*[contains(@style,'inline')]/text()")

But then I thought I would only see text1. What I want is to see text3 and text4 too so that the output will be

['text1', 'text3', 'text4']

Can anyone send me to the right direction of doing it?

209

asked Jun 05 '12 15:06

1 Answers

Explicitly exclude anything with display:none:

tree = tr.xpath("//*[not(contains(@style,'display:none'))]/text()")

That said -- this is only a distant approximation of what a browser would actually do; you'd want to be driving an actual browser (as with Selenium, embedding APIs, or the like) if you required strictly accurate results.

190

answered Sep 20 '22 15:09

Charles Duffy

Related questions
                            
                                How to get the ROOT node name from SQL Server
                            
                                Default XML namespace, JDOM, and XPath
                            
                                How to use for each group in XSL
                            
                                Xpath - How to get all the attribute names and values of an element
                            
                                JAXB XJC - XPath evaluation results in empty target node?
                            
                                XSLT: How to convert XML Node to String
                            
                                selenium.common.exceptions.ElementClickInterceptedException: Message: element click intercepted: Element is not clickable with Selenium and Python
                            
                                How can I quickly check if a xpath is valid in IE?
                            
                                How do you run an xPath query in IE11?
                            
                                Scraping Youtube comments in R
                            
                                Find a JSON property name that starts with something using JSON Path
                            
                                XPath/XSLT nested predicates: how to get the context of outer predicate?
                            
                                Nested for-each loops, accessing outer element with variable from the inner loop
                            
                                xsltproc doesn't recognize XSLT 2.0
                            
                                XML element has namespace, my XPATH does not work
                            
                                Correct XPath query to fetch div inner text
                            
                                Recommended way to locate parent element in Protractor
                            
                                Sending XPath a variable from Java
                            
                                How to select nodes that has X as descendant using xpath
                            
                                How can I find a certain element that comes right after another element with Capybara?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Filtering out content with style display:none in an XPath expression

Tags:

xpath

Clubmate

People also ask

1 Answers

Charles Duffy

Recent Activity

Donate For Us