Convert tabbed text to html unordered list?

Tags:

2 Answers

Try this (works on your test case):

import itertools
def listify(filepath):
    depth = 0
    print "<ul>"*(depth+1)
    for line in open(filepath):
        line = line.rstrip()
        newDepth = sum(1 for i in itertools.takewhile(lambda c: c=='\t', line))
        if newDepth > depth:
            print "<ul>"*(newDepth-depth)
        elif depth > newDepth:
            print "</ul>"*(depth-newDepth)
        print "<li>%s</li>" %(line.strip())
        depth = newDepth
    print "</ul>"*(depth+1)

Hope this helps

148

answered Sep 28 '22 00:09

tokenize module understands your input format: lines contain a valid Python identifiers, the indentation level of the statements is significant. ElementTree module allows you to manipulate tree structures in memory so it might be more flexable to separate a tree creation from a rendering it as html:

from tokenize import NAME, INDENT, DEDENT, ENDMARKER, NEWLINE, generate_tokens
from xml.etree import ElementTree as etree

def parse(file, TreeBuilder=etree.TreeBuilder):
    tb = TreeBuilder()
    tb.start('ul', {})
    for type_, text, start, end, line in generate_tokens(file.readline):
        if type_ == NAME: # convert name to <li> item
            tb.start('li', {})
            tb.data(text)
            tb.end('li')
        elif type_ == NEWLINE:
            continue
        elif type_ == INDENT: # start <ul>
            tb.start('ul', {})
        elif type_ == DEDENT: # end </ul>
            tb.end('ul')
        elif type_ == ENDMARKER: # done
            tb.end('ul') # end parent list
            break
        else: # unexpected token
            assert 0, (type_, text, start, end, line)
    return tb.close() # return root element

Any class that provides .start(), .end(), .data(), .close() methods can be used as a TreeBuilder e.g., you could just write html on the fly instead of building a tree.

To parse stdin and write html to stdout you could use ElementTree.write():

import sys

etree.ElementTree(parse(sys.stdin)).write(sys.stdout, method='html')

Output:

<ul><li>A</li><ul><li>B</li><li>C</li><ul><li>D</li><li>E</li></ul></ul></ul>

You can use any file, not just sys.stdin/sys.stdout.

Note: To write to stdout on Python 3 use sys.stdout.buffer or encoding="unicode" due to bytes/Unicode distinction.

answered Sep 28 '22 00:09

jfs

Related questions
                            
                                Dynamic list that automatically expands
                            
                                How to get hours difference from UTC to given timezone?
                            
                                How to access ODB files in Python 2.7
                            
                                interprocess C# python real time
                            
                                How to get a build a form with repeated elements well
                            
                                Python logging extremely slow on Linux server... but fast on Linux development VM?
                            
                                How can I fill a matplotlib grid?
                            
                                How does Pyramid's add_static_view work?
                            
                                Flask-WTForms can't find WTForms in my project directory
                            
                                Installing Python binary modules to a custom location in Windows
                            
                                Reference class variable in a comprehension of another class variable
                            
                                Inserting pyodbc.Binary data (BLOB) into SQL Server image column
                            
                                How to run Python script with one icon click?
                            
                                Simple remote process monitoring with Python
                            
                                Ignore exceptions thrown and caught inside a library
                            
                                Adding a badge to an icon in Python on Windows/OSX/Linux
                            
                                Display details of importer
                            
                                Why does my ttk.Treeview click handler return the wrong item on tree.focus()?
                            
                                Conditional pip install requirements on Heroku for Django app
                            
                                Mako escaping issue within Pyramid

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Convert tabbed text to html unordered list?

Tags:

python

html

Elip

People also ask

2 Answers

inspectorG4dget

jfs

Recent Activity

Donate For Us