Disable Exponential Notation when printing with fwrite r

Tags:

r

data.table

fwrite

I have run into an issue where even when I disable exponential notation, fwrite prints the number in exponential notation. An example:

library(data.table)
options(scipen = 999)
testint = c(500000)

When I print to the console, R behaves and does not use exponential notation:

print(testint)
[1] 500000
print(list(testint))
[[1]]
[1] 500000

But when I do:

fwrite(list(testint), "output")

The content of the file is 5e+05. I suspect this issue may specifically be with fwrite, as when I do:

write(testint, "output1")

The content of the output file is 500000.

Is there any way to prevent fwrite from doing this? I could switch to using write, but there is a massive speed difference between them and I am writing a lot of data, so there would be a significant performance impact that I would like to avoid if possible. Thanks!

Edit: if anyone is interested, there is an existing open GitHub issue here that I found after I asked the question!

asked May 02 '18 20:05

Walker in the City

2 Answers

If you look at the source code of the fwrite() function, you will see that it passes your values straight to an internal C function:

> fwrite
function (x, file = "", append = FALSE, quote = "auto", sep = ",",
    sep2 = c("", "|", ""), eol = if (.Platform$OS.type == "windows") "\r\n" else "\n",
    na = "", dec = ".", row.names = FALSE, col.names = TRUE,
    qmethod = c("double", "escape"), logicalAsInt = FALSE, dateTimeAs = c("ISO",
        "squash", "epoch", "write.csv"), buffMB = 8, nThread = getDTthreads(),
    showProgress = getOption("datatable.showProgress"), verbose = getOption("datatable.verbose"))
{
...
    .Call(Cwritefile, x, file, sep, sep2, eol, na, dec, quote,
        qmethod == "escape", append, row.names, col.names, logicalAsInt,
        dateTimeAs, buffMB, nThread, showProgress, verbose)
    invisible()
}

If you look at the source code of the C function that is called (https://github.com/Rdatatable/data.table/blob/master/src/fwrite.c), you will notice that it does not check any options set in R and uses scientific notation for large enough values. You could change this source as you like, build your own dynamic library, and call it from R. The other option would be to use one of the standard R writing functions (though I suspect you prefer the performance of the data.table functions).
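A third route that avoids rebuilding the library is sketched below. It rests on an assumption that is not part of the answer above: data.table releases newer than the one discussed here give fwrite() a `scipen` argument. The sketch checks for that argument at run time, and otherwise falls back to formatting the numbers as fixed-notation strings with base R's format() before handing them to fwrite(). The names `testint`, `out` and `result` are only illustrative.

```r
library(data.table)

testint <- c(500000)                 # a double, as in the question
out <- tempfile(fileext = ".csv")

# Assumption: newer data.table versions accept `scipen` in fwrite().
# Detect it at run time instead of relying on a version number.
if ("scipen" %in% names(formals(fwrite))) {
  # A large bias towards fixed notation, similar in spirit to options(scipen = ...)
  fwrite(list(testint), out, col.names = FALSE, scipen = 50)
} else {
  # Fallback: convert to fixed-notation strings first; fwrite() then
  # writes the characters as they are.
  fixed <- format(testint, scientific = FALSE, trim = TRUE)
  fwrite(list(fixed), out, col.names = FALSE)
}

result <- readLines(out)
print(result)
```

The fallback converts the column to character, which costs time and memory on large data, so it is probably best kept to the columns that actually need it.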

answered Oct 16 '22 06:10

Katia

Would this be an acceptable workaround? (It would end up rounding to whatever number of decimal places is set by the digit after the period.)

fwrite(list(sprintf("%9.2f", testint)))
500000.00

The response on the issue page you cited suggested using bit64::as.integer64 from the bit64 package, but ordinary as.integer seems to work here:

fwrite(list(as.integer(testint)))
500000
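To check the as.integer workaround end to end, here is a minimal sketch that writes to a temporary file and reads the text back. The temporary file and the variable names `out` and `written` are illustrative, and col.names = FALSE is passed explicitly so the file holds only the value. Keep in mind that as.integer() is limited to .Machine$integer.max (2147483647) and returns NA with a warning beyond that, which is presumably why bit64 was suggested for larger values.

```r
library(data.table)

testint <- c(500000)                 # stored as a double, as in the question
out <- tempfile(fileext = ".csv")

# Integer columns are written as plain digits, so no exponent can appear.
# Only safe while every value fits within .Machine$integer.max.
stopifnot(all(abs(testint) <= .Machine$integer.max))
fwrite(list(as.integer(testint)), out, col.names = FALSE)

written <- readLines(out)
print(written)
# [1] "500000"
```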

answered Oct 16 '22 06:10

IRTFM
