Recommended way to generate XHTML documents with lxml

`lxml.builder.E`

Example from http://lxml.de/tutorial.html#the-e-factory:

>>> from lxml.builder import E

>>> def CLASS(*args): # class is a reserved word in Python
...     return {"class":' '.join(args)}

>>> html = page = (
...   E.html(       # create an Element called "html"
...     E.head(
...       E.title("This is a sample document")
...     ),
...     E.body(
...       E.h1("Hello!", CLASS("title")),
...       E.p("This is a paragraph with ", E.b("bold"), " text in it!"),
...       E.p("This is another paragraph, with a", "\n      ",
...         E.a("link", href="http://www.python.org"), "."),
...       E.p("Here are some reserved characters: <spam&egg>."),
...       etree.XML("<p>And finally an embedded XHTML fragment.</p>"),
...     )
...   )
... )

`lxml.html.builder`

Example from http://lxml.de/lxmlhtml.html#creating-html-with-the-e-factory:

>>> from lxml.html import builder as E
>>> from lxml.html import usedoctest
>>> html = E.HTML(
...   E.HEAD(
...     E.LINK(rel="stylesheet", href="great.css", type="text/css"),
...     E.TITLE("Best Page Ever")
...   ),
...   E.BODY(
...     E.H1(E.CLASS("heading"), "Top News"),
...     E.P("World News only on this page", style="font-size: 200%"),
...     "Ah, and here's some more text, by the way.",
...     lxml.html.fromstring("<p>... and this is a parsed fragment ...</p>")
...   )
... )

316

asked Jun 03 '12 23:06

Mechanical snail

1 Answers

Mixing ElementMaker and E from lxml.builder does the trick for me:

from lxml import etree
from lxml.builder import ElementMaker,E

M=ElementMaker(namespace=None,
               nsmap={None: "http://www.w3.org/1999/xhtml"})
html = M.html(E.head(E.title("Test page")),
              E.body(E.p("Hello world")))
result = etree.tostring(html,
                        xml_declaration=True,
                        doctype='<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd">',
                        encoding='utf-8',
                        standalone=False,
                        with_tail=False,
                        method='xml',
                        pretty_print=True)
print result

The result is

<?xml version='1.0' encoding='utf-8' standalone='no'?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <title>Test page</title>
  </head>
  <body>
    <p>Hello world</p>
  </body>
</html>

answered Oct 26 '22 14:10

Pedro A. Aranda

Related questions
                            
                                Python: add a parent class to a class after initial evaluation
                            
                                Simple RTMP Python client
                            
                                How to label axes in Mayavi using LaTeX math symbols?
                            
                                Converting timezone-aware date string to UTC and back in Python
                            
                                How can I fix Vim's line breaking behavior for long lines in Python?
                            
                                python GUI compared to Swing?
                            
                                How to catch error 1062 "duplicate entry" independent from used database/engine?
                            
                                Browserless access to LinkedIn with Python
                            
                                Python's urllib2.urlopen() hanging with local connection to a Java Restlet server
                            
                                Preventing imports in Python
                            
                                Avoiding multiple references to the same object in Django ORM
                            
                                Utility for releasing packages to PyPi?
                            
                                Matplotlib, legend with multiple different markers with one label
                            
                                Foolproof cross-platform process kill daemon
                            
                                Role-based authorization mechanism for a GAE app
                            
                                Django unique=True except for blank values
                            
                                Automatic XSD validation
                            
                                python argparse to handle arbitrary numeric options (like HEAD(1))
                            
                                wxPython 2.9 on Mac Os X
                            
                                How to use Qxt libraries on PyQt?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Recommended way to generate XHTML documents with lxml

Tags:

python

xml

xhtml

lxml