saving an 'lxml.etree._ElementTree' object

Tags:

I've spent the last couple of days getting to grips with the basics of lxml; in particular using lxml.html to parse websites and create an ElementTree of the content. Ideally, I want to save the returned ElementTree so that I can load it up and experiment with it, without having to parse the website every time I modify my script. I assumed that pickling would be the way to go, however I'm now beginning to wonder. Although I am able to retrieve an ElementTree object after pickling...

type(myObject)

returns

<class 'lxml.etree._ElementTree'>

the object itself appears to be 'empty', since none of the subsequent method/attribute calls I make on it yield any output.

My guess is that pickling isn't appropriate here, but can anyone suggest an alternative?

(In case it matters, the above is happening in: python3.2, lxml 2.3.2, snow-leopard))

264

asked Nov 25 '11 21:11

Paul Patterson

1 Answers

lxml is a C library - libxml to be precise - and the object probably don't support python pickling or any other kind of serialization - except serializing them to XML.

So you'll either have to keep them in memory, or re-parse the XML fragments you need, I assume.

165

answered Oct 02 '22 20:10

Has QUIT--Anony-Mousse

Related questions
                            
                                Setuptools platform specific dependencies
                            
                                Why does list.append() return None? [duplicate]
                            
                                Is possible to mapping view with class using mapper in SqlAlchemy?
                            
                                Python Change Master/Application Volume
                            
                                How to get the numbers of data rows from sqlite table in python
                            
                                how to retrieve the selected row of a QTableView?
                            
                                How to format cell with datetime object of the form 'yyyy-mm-dd hh:mm:ss' in Excel using openpyxl
                            
                                python reading in multi-column tsv file with row numbers
                            
                                Pandas 'DataFrame' object has no attribute 'unique'
                            
                                Does python os.fork uses the same python interpreter?
                            
                                Python context manager that measures time
                            
                                How to Hash Django user password in Django Rest Framework?
                            
                                TypeError: concat() got multiple values for argument 'axis'
                            
                                mysql data base connection inside Sam local
                            
                                How to iterate over two dataloaders simultaneously using pytorch?
                            
                                case_when function from R to Python
                            
                                Understanding and evaluating template matching methods
                            
                                Python: Why can't I iterate over a list? Is my exception class borked?
                            
                                Ordered Sets Python 2.7
                            
                                Compress whitespaces in string [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

saving an 'lxml.etree._ElementTree' object

Tags:

python

pickle

lxml

Paul Patterson

People also ask

1 Answers

Has QUIT--Anony-Mousse

Recent Activity

Donate For Us