How to get path of an element in lxml?

Tags: python, xpath, lxml

I'm searching an HTML document using XPath with lxml in Python. How can I get the path to a certain element? Here's the equivalent in Ruby with Nokogiri:

    page.xpath('//text()').each do |textnode|
      path = textnode.path
      puts path
    end

This prints, for example, '/html/body/div/div[1]/div[1]/p/text()[1]', and this is the string I want to get in Python.

asked Oct 16 '09 10:10

Fluffy

1 Answer

Use the getpath() method of an ElementTree object. It takes an element and returns an XPath expression that locates that element.

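    from lxml import etree

    root = etree.fromstring('''
    <foo><bar>Data</bar><bar><baz>data</baz>
    <baz>data</baz></bar></foo>
    ''')

    # getpath() lives on the ElementTree, so wrap the root element first.
    tree = etree.ElementTree(root)
    # iter() walks the root and all of its descendants in document order.
    for e in root.iter():
        print(tree.getpath(e))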

This prints:

    /foo
    /foo/bar[1]
    /foo/bar[2]
    /foo/bar[2]/baz[1]
    /foo/bar[2]/baz[2]

answered Sep 22 '22 01:09

nosklo
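
The question asks about text nodes in an HTML document, while getpath() only accepts elements. The following is a minimal sketch of one way to bridge that gap, not an lxml feature: text_node_path is a hypothetical helper name, and it assumes the strings come from xpath('//text()') with lxml's default "smart strings", which expose getparent() and is_tail. The output is meant to approximate the Nokogiri string, not to match it in every edge case.

```python
from lxml import html


def text_node_path(text):
    """Return an XPath-like path for a text result of xpath('//text()').

    Hypothetical helper: relies on lxml "smart strings" (the default),
    which remember the element they were read from.
    """
    parent = text.getparent()
    # Text stored as an element's tail is a child of that element's parent.
    container = parent.getparent() if text.is_tail else parent
    base = container.getroottree().getpath(container)

    # All text nodes that are direct children of the container, in order.
    siblings = container.xpath('text()')
    if len(siblings) == 1:
        # Like getpath(), omit the index when there is nothing to disambiguate.
        return base + '/text()'
    for position, candidate in enumerate(siblings, start=1):
        if candidate.getparent() is parent and candidate.is_tail == text.is_tail:
            return f'{base}/text()[{position}]'
    raise ValueError('text node not found under its container')


doc = html.fromstring(
    '<html><body><div><p>Hello <b>big</b> world</p></div></body></html>'
)
paths = [text_node_path(t) for t in doc.xpath('//text()')]
for path in paths:
    print(path)
```

For elements returned by an XPath query, no helper is needed: element.getroottree().getpath(element) gives the path directly.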
