Preserve Line Breaks in Pandoc Markdown -> LaTeX Conversion

Tags:

pandoc

I want to convert the following *.md into proper LaTeX *.tex.

Lorem *ipsum* something.
Does anyone know lorem by heart?

That would *sad* because there's always Google.

Expected Behavior / Resulting LaTeX from Pandoc

Lorem \emph{ipsum} something.
Does anyone know lorem by heart?

That would \emph{sad} because there's always Google.

Observed Behavior / Resulting LaTeX from Pandoc

Lorem \emph{ipsum} something. Does anyone know lorem by heart?

That would \emph{sad} because there's always Google.

Why do I care?

1. I'm transitioning a bigger git repo from Markdown to LaTeX, and I want a clean diff and history.
2. I actually like my LaTeX with one sentence per line, even though it does not matter for the typesetting.

How can I get Pandoc to do this?

P.S.: I am aware of the hard_line_breaks extension, but that only adds \\ between the first two lines and does not actually preserve my line breaks.

220

asked Sep 26 '14 19:09

maxheld

3 Answers

Update

Since pandoc 1.16, this is possible:

pandoc --wrap=preserve
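
For the Markdown to LaTeX case from the question, a full invocation might look like the sketch below. It is a minimal Python wrapper, assuming pandoc 1.16 or later is on the PATH. The helper names build_pandoc_command and convert and the file names are hypothetical; only --wrap=preserve, -f, -t and -o are pandoc options.

```python
import subprocess


def build_pandoc_command(source, target):
    """Return the argv list for a Markdown -> LaTeX run that keeps source line breaks."""
    return [
        "pandoc",
        "--wrap=preserve",  # keep the line breaks of the source (pandoc >= 1.16)
        "-f", "markdown",
        "-t", "latex",
        "-o", target,
        source,
    ]


def convert(source, target):
    """Run pandoc; raises CalledProcessError if pandoc exits with an error."""
    subprocess.run(build_pandoc_command(source, target), check=True)
```

Calling convert("chapter.md", "chapter.tex") (hypothetical file names) should then write LaTeX whose line breaks follow those of the Markdown source.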

Old answer

Since Pandoc converts the Markdown to an AST-like internal representation, your non-semantic linebreaks are lost. So what you're looking for is not possible without some custom scripting (like using --no-wrap and then processing the output by inserting a line-break wherever there is a dot followed by a space).
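
The scripting step mentioned above could be sketched roughly as follows, in Python. This is a naive illustration of the dot-followed-by-space rule and not part of pandoc; the function name is hypothetical, and the input is assumed to be the unwrapped LaTeX produced with --no-wrap.

```python
import re


def one_sentence_per_line(latex):
    """Break the line after every '.' that is followed by a single space."""
    # Naive rule from the answer: a dot plus a space marks a sentence end.
    # It also splits after abbreviations such as "e.g. ", so review the result.
    return re.sub(r"\. ", ".\n", latex)
```

Because the rule is purely textual, it will misfire on abbreviations and on dots inside LaTeX commands, so the output is worth checking before committing it.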

However, you can use the --columns NUMBER option to specify the line width in characters. So you won't get one sentence per line, but lines wrapped at NUMBER characters.

123

answered Oct 14 '22 18:10

mb21

A much simpler solution would be to add two spaces after "...something.". This will add a manual line break (the method is mentioned in the Pandoc Manual).

answered Oct 14 '22 18:10

René

I figured out another way to address this problem – which is to not change the original *.mds (under version control), but to simply read them in and to have them "pandoced" when building the PDF.

Here's how:

Some markdown.md in project root:

Happy one-sentence-per-line **markdown** stuff.
And another line – makes for clear git diffs!

And some latexify.tex in project root:

\documentclass{article}
\begin{document}

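% Run pandoc during compilation to regenerate tmp.tex from markdown.md
\immediate\write18{pandoc markdown.md -t latex -o tmp.tex}
% Pull the generated LaTeX into the document
\input{tmp.tex}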

\end{document}

This works just dandy if you have some Markdown components in a LaTeX project, e.g. GitHub READMEs.

It requires no special package, but the document must be compiled with shell escape enabled (for example, pdflatex -shell-escape latexify.tex).

answered Oct 14 '22 19:10

maxheld
