Truncating to Word type

Tags:

haskell

The following code truncates a number of type Double to one in the type Word16 (although I suspect any other word type behaves similarly, I had to choose one for the example).

truncate1 :: Double -> Word16
truncate1 = fromIntegral . (truncate :: Double -> Int)

As you can read, I first truncate it to Int and only then I cast it to Word16. I benchmarked this function agains a direct truncation:

truncate2 :: Double -> Word16
truncate2 = truncate

Surprisingly to me, the first version (going thru the Int type first) performed much better. Or the second much worse. According to the criterion output:

benchmarking truncate/truncate1
mean: 25.42399 ns, lb -47.40484 ps, ub 67.87578 ns, ci 0.950
std dev: 145.5661 ns, lb 84.90195 ns, ub 244.2057 ns, ci 0.950
found 197 outliers among 100 samples (197.0%)
  97 (97.0%) low severe
  100 (100.0%) high severe
variance introduced by outliers: 99.000%
variance is severely inflated by outliers

benchmarking truncate/truncate2
mean: 781.0604 ns, lb 509.3264 ns, ub 1.086767 us, ci 0.950
std dev: 1.436660 us, lb 1.218997 us, ub 1.592479 us, ci 0.950
found 177 outliers among 100 samples (177.0%)
  77 (77.0%) low severe
  100 (100.0%) high severe
variance introduced by outliers: 98.995%
variance is severely inflated by outliers

To be honest, I just started using Criterion, so I'm not an expert using it, but I understand that 25.42399 ns is a shorter execution time than 781.0604 ns. I suspect that some specialization is playing a role here. Is it truncate2 too slow? Being the case, can truncate be improved? Furthermore, anybody knows an even faster way? I feel like doing something wrong casting to a type I don't really use.

Thanks in advance.

I am compiling with GHC-7.4.2, optimizations enabled (-O2).

369

asked Feb 11 '13 20:02

Daniel Díaz

1 Answers

First, note that the module GHC.Word includes the following RULE pragma:

"truncate/Double->Word16"
    forall x. truncate (x :: Double) = (fromIntegral :: Int -> Word16) (truncate x)

This is a simple rewrite rule to perform precisely the optimization your truncate1 provides. So we have a few questions to consider:

Why is this an optimization at all?

Because the default implementation of truncate is generic, to support any Integral instance. The speed difference you see is the cost of that generality; in the specific case of truncating one primitive type to another, there are much faster methods available.

So it seems that truncate1 is benefiting from a specialized form, while truncate2 is not.

Why is `truncate1` faster?

In GHC.Float, where the RealFrac instance for Double is defined, we have the following RULE pragma:

"truncate/Double->Int"              truncate = double2Int

Where double2Int is the optimized form we want. Compare this to the RULE mentioned earlier--apparently there's no similar primitive operation specifically for converting Double to Word16.

Why doesn't `truncate2` get rewritten as well?

Quoth the GHC User's Guide:

GHC currently uses a very simple, syntactic, matching algorithm for matching a rule LHS with an expression. It seeks a substitution which makes the LHS and expression syntactically equal modulo alpha conversion. The pattern (rule), but not the expression, is eta-expanded if necessary.

Expressions being matched are not eta-expanded, which is to say that a rule matching on forall x. foo x will match in bar y = foo y but not in bar = foo.

Since your definitions are all written point-free, the RULE for Double -> Int matches, but the RULE for Double -> Word16 does not.

178

answered Sep 19 '22 15:09

C. A. McCann

Related questions
                            
                                Making a grid-like data type in haskell
                            
                                Write a parallel array Haskell expression once, run on CPUs & GPUs with repa and accelerate
                            
                                Can exceptions sneak between restore and mask?
                            
                                Is there a fast algorithm to determine the godel number of a term of a context free language?
                            
                                Append for type-level numbered lists with TypeLits
                            
                                Is it possible to get the Kind of a Type Constructor in Haskell?
                            
                                difference between munch and many (satisfy p)?
                            
                                Is it possible to write fmap for this data type involving a type family?
                            
                                How do you write rewrite rules for typeclass methods?
                            
                                What are the state-of-art methods for solving functional equations?
                            
                                Why is full-laziness a default optimization?
                            
                                When is it useful to use "strict wildcard" in Haskell, and what does it do?
                            
                                In GHCi, what is the difference between ':module' and 'import'? [duplicate]
                            
                                How can I use parMap with a monadic function?
                            
                                Which haskell array implementation to use? AKA what are the pros and cons of each
                            
                                Write a Maximum Monoid using Maybe in Haskell
                            
                                Is GHC's auto-derived `Eq` instance really *O(N)*?
                            
                                Trying to implement Data.Either
                            
                                Sleeping until the start of the next minute
                            
                                Declaring instances of parameterized type synonyms

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Truncating to Word type

Tags:

haskell

Daniel Díaz

People also ask

1 Answers

Why is this an optimization at all?

Why is `truncate1` faster?

Why doesn't `truncate2` get rewritten as well?

C. A. McCann

Recent Activity

Donate For Us

Truncating to Word type

Tags:

haskell

Daniel Díaz

People also ask

1 Answers

Why is this an optimization at all?

Why is truncate1 faster?

Why doesn't truncate2 get rewritten as well?

C. A. McCann

Related questions

Recent Activity

Donate For Us

Why is `truncate1` faster?

Why doesn't `truncate2` get rewritten as well?