Efficiency of equality in Haskell

Tags:

haskell

I've got a function that takes data and either returns the same data or a slightly modified version.

I want to have my program do one thing if it changed or another thing if it did not change.

Previously I was returning a pair (Bool,Object) and using fst to check if it changed. Lately it occurred to me that I could simplify the code by just returning the object and checking equality using ==.

But then I realized that Haskell doesn't differentiate between deep equality checking and "object identity" (i.e., pointer equality). So how can I know whether using == is going to be efficient or not? Should I avoid it for efficiency reasons, or are there cases where I can depend on the compiler figuring out that it doesn't need to do a deep equality check?

Normally I wouldn't be too worried about efficiency while writing an initial program, but this affects the interface to my module, so I want to get it right before writing too much code, and it doesn't seem worth making the program much less efficient just to simplify a small piece of code. Moreover, I'd like to get a better idea of what kind of optimizations I can depend on GHC to help me with.

530

asked Dec 29 '09 18:12

Steve

1 Answer

It's always a bad idea to rely on uncertain compiler optimizations to provide such an important performance guarantee as constant-time equality vs linear-time deep equality. You're much better off with a new type that encapsulates a value plus information about whether the value is new. Depending on your application this can be either

data Changed a = Changed a | Unchanged a

or

data Changed a = Changed a | Unchanged
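To make the first variant concrete, here is a minimal sketch of how the asker's function could report a change through the constructor instead of through ==. The names value, hasChanged and clampNonNegative are hypothetical and chosen only for illustration.

```haskell
-- The first variant from the answer: the payload is carried either way.
data Changed a = Changed a | Unchanged a
  deriving (Show)

-- | Extract the payload regardless of the tag (hypothetical helper).
value :: Changed a -> a
value (Changed x)   = x
value (Unchanged x) = x

-- | Constant-time check: it inspects only the constructor and never
-- compares the payloads (hypothetical helper).
hasChanged :: Changed a -> Bool
hasChanged (Changed _)   = True
hasChanged (Unchanged _) = False

-- | Illustrative transformation: replace negative entries with zero and
-- record whether anything was actually altered.
clampNonNegative :: [Int] -> Changed [Int]
clampNonNegative xs
  | any (< 0) xs = Changed (map (max 0) xs)
  | otherwise    = Unchanged xs
```

The caller then pattern matches on the result (or calls hasChanged) rather than comparing the old and new lists element by element.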

We actually use a similar type inside the Glasgow Haskell Compiler so we can keep running the optimizer until the code stops changing. We also run iterative dataflow analysis until the results stop changing.
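The "keep running until nothing changes" pattern can be sketched with the second variant. This is not GHC's actual code; fixpoint and halveIfEven are hypothetical names, and the loop assumes the step function eventually reports Unchanged.

```haskell
-- The second variant from the answer: no payload when nothing changed.
data Changed a = Changed a | Unchanged

-- | Apply a rewriting step repeatedly until it reports Unchanged.
-- Assumption: the step eventually stops changing its input; otherwise
-- this function does not terminate.
fixpoint :: (a -> Changed a) -> a -> a
fixpoint step x = case step x of
  Changed x' -> fixpoint step x'
  Unchanged  -> x

-- | Illustrative step: halve a nonzero even number, leave anything else alone.
halveIfEven :: Int -> Changed Int
halveIfEven n
  | n /= 0 && even n = Changed (n `div` 2)
  | otherwise        = Unchanged
```

For example, fixpoint halveIfEven 40 steps through 20 and 10 and stops at 5, and each termination check costs one constructor match rather than a comparison of two values.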

We found it useful to make this type a monad so that we can write some simple higher-order functions using do notation, but it's not necessary—just a convenience.
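One way such a monad could look for the first variant is sketched below. This is an assumption about the general shape, not the instance GHC uses: the result is tagged Changed as soon as any step in the chain reports a change. The helpers value, dropNegative and clampPair are hypothetical.

```haskell
data Changed a = Changed a | Unchanged a
  deriving (Show, Eq)

-- | Extract the payload regardless of the tag (hypothetical helper).
value :: Changed a -> a
value (Changed x)   = x
value (Unchanged x) = x

instance Functor Changed where
  fmap f (Changed x)   = Changed (f x)
  fmap f (Unchanged x) = Unchanged (f x)

instance Applicative Changed where
  pure = Unchanged
  Changed f   <*> c = Changed (f (value c))
  Unchanged f <*> c = fmap f c

instance Monad Changed where
  -- Nothing has changed so far: the continuation decides the tag.
  Unchanged x >>= k = k x
  -- Something already changed: the result stays Changed.
  Changed x   >>= k = Changed (value (k x))

-- | Illustrative step: replace a negative number with zero.
dropNegative :: Int -> Changed Int
dropNegative n
  | n < 0     = Changed 0
  | otherwise = Unchanged n

-- | Combine two steps with do notation; the pair is Changed if either
-- component was.
clampPair :: (Int, Int) -> Changed (Int, Int)
clampPair (a, b) = do
  a' <- dropNegative a
  b' <- dropNegative b
  pure (a', b')
```

With these definitions, clampPair (1, -2) evaluates to Changed (1,0) and clampPair (1, 2) to Unchanged (1,2), so the change flag is threaded through without any explicit bookkeeping.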

Summary: If you want constant-time checking, code it yourself—don't rely on a possible compiler optimization which might not be there—or which might change in the next release.

165

answered Nov 10 '22 11:11

Norman Ramsey
