Reading PASCAL VOC annotations in python

Tags:

I have annotations in xml files such as this one, which follows the PASCAL VOC convention:

<annotation>
<folder>training</folder>
<filename>chanel1.jpg</filename>
<source>
<database>synthetic initialization</database>
<annotation>PASCAL VOC2007</annotation>
<image>synthetic</image>
<flickrid>none</flickrid>
</source>
<owner>
<flickrid>none</flickrid>
<name>none</name>
</owner>
<size>
<width>640</width>
<height>427</height>
<depth>3</depth>
</size>
<segmented>0</segmented>
<object>
<name>chanel</name>
<pose>Unspecified</pose>
<truncated>0</truncated>
<difficult>0</difficult>
<bndbox>
<xmin>344</xmin>
<ymin>10</ymin>
<xmax>422</xmax>
<ymax>83</ymax>
</bndbox>
</object>
<object>
<name>chanel</name>
<pose>Unspecified</pose>
<truncated>0</truncated>
<difficult>0</difficult>
<bndbox>
<xmin>355</xmin>
<ymin>165</ymin>
<xmax>443</xmax>
<ymax>206</ymax>
</bndbox>
</object>
</annotation>

What is the cleanest way of retrieving for example the fields filename and bndbox in Python?

I was trying to ElementTree, which seems to be the official Python solution, but I can't make it work.

My code so far:

from xml.etree import ElementTree as ET
tree = ET.parse("data/all/annotations/" + file)
fn = tree.find('filename').text
boxes = tree.findall('bndbox')

this produces

fn == 'chanel1.jpg'
boxes == []

So it succesfully extracts the filename field, but not the bndbox'es.

243

asked Nov 15 '18 10:11

Jsevillamol

1 Answers

That's a quite easy solution for your problem:

This will return your box coordinates in a nested list [xmin, ymin, xmax, ymax] and the filename Once I struggled with bndbox tags which where mixed up (ymin, xmin,...) or any other strange combinations, so this code read the tags not only the position.

Finally I updated the code. Thanks to craq and Pritesh Gohil, you were absolutely right.

Hope it helps...

import xml.etree.ElementTree as ET


def read_content(xml_file: str):

    tree = ET.parse(xml_file)
    root = tree.getroot()

    list_with_all_boxes = []

    for boxes in root.iter('object'):

        filename = root.find('filename').text

        ymin, xmin, ymax, xmax = None, None, None, None

        ymin = int(boxes.find("bndbox/ymin").text)
        xmin = int(boxes.find("bndbox/xmin").text)
        ymax = int(boxes.find("bndbox/ymax").text)
        xmax = int(boxes.find("bndbox/xmax").text)

        list_with_single_boxes = [xmin, ymin, xmax, ymax]
        list_with_all_boxes.append(list_with_single_boxes)

    return filename, list_with_all_boxes

name, boxes = read_content("file.xml")

answered Sep 19 '22 12:09

pix_1

Related questions
                            
                                Pythonic way of write if open is successful
                            
                                Tensorflow: Word2vec CBOW model
                            
                                Sqlalchemy in_ subquery
                            
                                How to calculate the midpoint of several geolocations in python
                            
                                Sorting the list of dictionaries in descending order of a particular key [duplicate]
                            
                                Change the height of a Seaborn heatmap colorbar
                            
                                Is Python uuid.uuid4 strong enough for password reset links?
                            
                                Inorder Binary Tree Traversal (using Python)
                            
                                Assign values to different index positions in Numpy array
                            
                                Failed to install wsgiref on Python 3
                            
                                Merging Overlapping Intervals
                            
                                using mpatches.Patch for a custom legend
                            
                                Kivy error, [CRITICAL] [Text ] unable to find any valuable text provider (python 3.6.1) (windows 10)
                            
                                How to compute precision,recall and f1 score of an imbalanced dataset for K fold cross validation?
                            
                                StopIteration: generator_output = next(output_generator)
                            
                                gcloud ml-engine local predict RuntimeError: Bad magic number in .pyc file
                            
                                Check for string in "response.content" raising "TypeError: a bytes-like object is required, not 'str'"
                            
                                NotADirectoryError: [WinError 267] The directory name is invalid error while invoking Firefox through Selenium Python
                            
                                Change color of specific ticks at plot with matplotlib
                            
                                MySQLClient instal error: "raise Exception("Wrong MySQL configuration: maybe https://bugs.mysql.com/bug.php?id"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Reading PASCAL VOC annotations in python

Tags:

python

python-3.x

xml

Jsevillamol

People also ask

1 Answers

pix_1

Recent Activity

Donate For Us