How to retrieve all child nodes in a single query using lxml & XPATH

Tags:

This is my xml data

<location>
   <city>
      <name> New York</name>
      <type>non-capital</type>
   </city>

   <city>
        <name> London</name>
        <type>capital</type>
   </city>
</location>

using lxml & python

from lxml import etree as ET

parser = ET.XMLParser(recover=True)

tree = ET.fromstring(xml_data,parser)
print(tree.xpath('//city//name/text() | //city//type/text()'))

The above code works but i'd like an nested-array description as [['New York','non-capital'],['London','capital']]

What would be the accurate xpath query/combination of queries/loops to get the above?

617

asked Mar 27 '15 05:03

wolfgang

2 Answers

This is one possible way :

.......
result = []
for city in tree.xpath('//city'):
    result.append([city.find('name').text, city.find('type').text])

print(result)
# output :
#[[' New York', 'non-capital'], [' London', 'capital']]

110

answered Oct 03 '22 00:10

har07

List comprehension solution:

xml_data='''<location>
   <city>
      <name> New York</name>
      <type>non-capital</type>
   </city>
   <city>
        <name> London</name>
        <type>capital</type>
   </city>
</location>'''

from lxml import etree as ET

parser = ET.XMLParser(recover=True)

tree = ET.fromstring(xml_data,parser)
print(tree.xpath('//city'))


cities = [[c.text for c in n if c.tail] for n in tree.xpath('//city')]

Results in:

[[' New York', 'non-capital'], [' London', 'capital']]

answered Oct 03 '22 00:10

Marcin

Related questions
                            
                                Determining a homogeneous affine transformation matrix from six points in 3D using Python
                            
                                Python Tornado render static directory
                            
                                Django nested Transaction.atomic
                            
                                PySpark distinct().count() on a csv file
                            
                                numpy einsum to get axes permutation
                            
                                Removing completely isolated cells from Python array?
                            
                                Revert Ubuntu 14.04 to default python after uninstalling anaconda
                            
                                Correct way to manage redis connections in django
                            
                                RethinkDB losing data after restarting server
                            
                                Python struct.Struct.size returning unexpected value
                            
                                Fancy indexing with assignment for numpy array
                            
                                How to store environment variables in a Python Flask app?
                            
                                How to check if there is a missing argument
                            
                                ascii codec cant decode byte 0xe9
                            
                                Calculate moments (mean, variance) of distribution in python
                            
                                How to install multiple ipython 3.0 kernels (python 2.7, python 3.4, etc...) with anaconda under linux?
                            
                                numpy/scipy build adjacency matrix from weighted edgelist
                            
                                Adding edge weights and scaling drawn edge lengths in graph_tool
                            
                                With Flask or Quart NameError: global name 'g' is not defined
                            
                                Python/tox Install a dependency as editable

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to retrieve all child nodes in a single query using lxml & XPATH

Tags:

python

xml

xpath

lxml

wolfgang

People also ask

2 Answers

har07

Marcin

Recent Activity

Donate For Us