Is it possible to write documentation in R using non-ASCII characters (such as å, ä, ö) using roxygen2? I'm asking because I am writing an package with internal functions in Swedish. I have use the following code using roxygen to write documentation: <pre class="prettyprint"><code>#' @param data data frame där variablen finns #' @param x variabeln, måste vara en av typen character </code></pre> This results in the non-ASCII characters being distorted. I can change the .Rd files manually but I'd rather not.

I solved this problem by putting <pre class="prettyprint"><code>##' @encoding UTF-8 </code></pre> in the roxygen2 documentation comment and then typing <pre class="prettyprint"><code>options(encoding = "UTF-8") </code></pre> in the R console before roxygenizing. For future sessions, it is helpful to add the line <pre class="prettyprint"><code>options(encoding = "UTF-8") </code></pre> in the <code>R/etc/Rprofile.site</code> file.

On Windows, encoding sucks in R, and is very complicated - and those developing packages don't always consider it as a real issue (see roxygen or devtools). What worked for me: <ul> <li> if you have data in your package with non-ASCII labels, e.g. a colorvector c(rød = "#C30000", blå = "#00A9E0"), you have to escape the names/values in code: <pre class="prettyprint"><code>c(r\u00f8d = "#C30000", bl\u00e5 = "#00A9E0") </code></pre> </li> <li>in the documentation (if you use roxygenize or devtools::document()) you have to place @encoding UTF-8 before EVERY function description but then use regular keyboard. </li> </ul> If you have two functions in the same file (e.g. "palette" and "saturation" in a design package for your organisation), you have to place the tag in every description block, not just once. Example: <pre class="prettyprint"><code> #' @encoding UTF-8 #' datastruktur for å definere firmapalett med æøå dummypalett <- structure(.Data = c("#c30000", "#00A9E0"), names = c("r\u00f8d", "bl\u00e5")) #' @encoding UTF-8 #' neste funksjon som er beskrevet med æøåäö </code></pre> For good measure, I placed Language: nob in the DESCRIPTION file and changed the encoding tag in Rprofile to "UTF-8".

Is it possible to write package documentation using non-ASCII characters with roxygen2?

Tags:

r

roxygen2

r-package

Is it possible to write documentation in R using non-ASCII characters (such as å, ä, ö) using roxygen2? I'm asking because I am writing an package with internal functions in Swedish.

I have use the following code using roxygen to write documentation:

#' @param data data frame där variablen finns
#' @param x variabeln, måste vara en av typen character

This results in the non-ASCII characters being distorted. I can change the .Rd files manually but I'd rather not.

664

asked May 08 '17 14:05

FilipW

3 Answers

I solved this problem by putting

##' @encoding UTF-8

in the roxygen2 documentation comment and then typing

options(encoding = "UTF-8")

in the R console before roxygenizing. For future sessions, it is helpful to add the line

options(encoding = "UTF-8")

in the R/etc/Rprofile.site file.

answered Oct 22 '22 11:10

César Asensio

On Windows, encoding sucks in R, and is very complicated - and those developing packages don't always consider it as a real issue (see roxygen or devtools). What worked for me:

if you have data in your package with non-ASCII labels, e.g. a colorvector c(rød = "#C30000", blå = "#00A9E0"), you have to escape the names/values in code:
```
c(r\u00f8d = "#C30000", bl\u00e5 = "#00A9E0")
```
in the documentation (if you use roxygenize or devtools::document()) you have to place @encoding UTF-8 before EVERY function description but then use regular keyboard.

If you have two functions in the same file (e.g. "palette" and "saturation" in a design package for your organisation), you have to place the tag in every description block, not just once.

Example:

    #' @encoding UTF-8
    #' datastruktur for å definere firmapalett med æøå
    dummypalett <- structure(.Data = c("#c30000", "#00A9E0"),
                   names = c("r\u00f8d", "bl\u00e5"))

    #' @encoding UTF-8
    #' neste funksjon som er beskrevet med æøåäö

For good measure, I placed Language: nob in the DESCRIPTION file and changed the encoding tag in Rprofile to "UTF-8".

answered Oct 22 '22 11:10

Espen Rosenquist

Non-ASCII characters are tricky to use with R (https://cran.r-project.org/doc/manuals/r-release/R-exts.html#Package-subdirectories).

Only ASCII characters (and the control characters tab, formfeed, LF and CR) should be used in code files. Other characters are accepted in comments13, but then the comments may not be readable in e.g. a UTF-8 locale. Non-ASCII characters in object names will normally14 fail when the package is installed. Any byte will be allowed in a quoted character string but \uxxxx escapes should be used for non-ASCII characters. However, non-ASCII character strings may not be usable in some locales and may display incorrectly in others.

For documentation you have to add the tag @encoding UTF-8 to your roxygen2 code.

You can check whether \uxxxx escapes have been successfully employed by the tag using the following.

path <- "path to Rd file"
tools::checkRd(path)

answered Oct 22 '22 12:10

Crops

Related questions
                            
                                add x=y line to scatterplot
                            
                                install.packages R on Ubuntu 12.04 downloads but does not install packages
                            
                                How is J() function implemented in data.table?
                            
                                Make list of vectors by joining pair-corresponding elements of 2 vectors efficiently in R
                            
                                embedFonts complains about “Unknown device: pswrite”
                            
                                Split keep repeated delimiter
                            
                                Assigning values in a sequence depending on previous row value in R
                            
                                Embed Youtube Video in R Markdown
                            
                                How to suppress output in RStudio?
                            
                                ggmap route finding - doesn't stay on roads
                            
                                How do I build a reactive dataframe in R / Shiny?
                            
                                ggplot: gradient scale to diverge on specific break
                            
                                checkboxGroupInput - set minimum and maximum number of selections - ticks
                            
                                How to detect sentence boundaries with OpenNLP and stringi?
                            
                                How to read tab separated file into data.table using fread?
                            
                                R: draw a line between two points in ggplot
                            
                                Multiple inheritance for R6 classes
                            
                                Tidy data.frame with repeated column names
                            
                                Source code from Rmd file within another Rmd
                            
                                Get name of dataframe passed through pipe in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With