I have a lot of RST (reStructuredText) files converted to HTML using docutils. Using a small developed JS app (a basic iframe with HTML5 contenteditable attribute), non-tech people can easily edit the generated HTML page. The goal would be apply the modification done in HTML back into the original RST file.
I'm wondering if there is a way to train an algorithm with the ground-truth of original_RST -> outputed_HTML conversion so we can later reverse outputed_HTML+minor modification back to RST ? What kind of algorithm will it be ? Which way should I start digging ?
To simplify the problem, instead of RST, I can also use the internal XML AST representation of docutils as document input (RST to XML-AST: http://docutils.sourceforge.net/tools/quicktest.py) and later, fix the xml2rst+xslt project (http://www.merten-home.de/FreeSoftware/xml2rst/)
Any suggestion is welcome
From HTML to XML free allows loading the Website URL which has tables converting to XML. Click on the URL button, Enter URL and Submit. Parsing HTML into XML supports loading the HTML File to transform to XML. Click on the Upload button and select File.
Try Pandoc
If you need to convert files from one markup format into another, pandoc is your swiss-army knife.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With