Pattern matching vs record syntax function for data type field extraction

Tags:

haskell

Given an example data type with record syntax:

data VmInfo = VmInfo {infoVid   :: String
                     ,infoIndex :: Int
                     ,infoPid   :: Int
                     ,infoExe   :: String
                     } deriving (Show)

and a monadic function (for example vmInfo :: String -> IO VmInfo, since it is used with <- below) that generates and returns the above data structure given the VM name as a string.

I can see two methods to extract the individual parts of the VmInfo data type.

(VmInfo vid _ _ _) <- vmInfo vm

Which is just a pattern match. And ...

vid <- infoVid <$> vmInfo vm

using the accessor functions that the compiler generates from record syntax.

The question is simple: which is the preferred method?

In terms of typing they are about the same, so I am interested in speed and correctness/best practice.

I assume the pattern matching would be faster, but then what is the point of record syntax?

249

asked Apr 24 '12 03:04

r.sendecky

1 Answer

These aren't semantically equivalent.

Let's look at the first example:

(VmInfo vid _ _ _) <- vmInfo vm

This performs a pattern match in the binding operation, which has two consequences. The first is that the constructor of the result of the vmInfo vm action is evaluated. This means that if vmInfo ended with a line like return undefined, the exception thrown by evaluating undefined would happen at this pattern match, not at a later use of vid. The second is that if the pattern match is refuted (the pattern does not match the value), the monad's fail method will be called with the pattern match error text. That's not possible in this case, because VmInfo has only one constructor, but it is generally possible when pattern matching a constructor in a bind.
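To illustrate the refutable case, here is a minimal sketch. It assumes a hypothetical two-constructor type VmState, which is not part of the question, and uses the Maybe monad, where fail returns Nothing:

```haskell
-- Hypothetical type with two constructors, so a constructor pattern can be refuted.
data VmState = Running Int | Stopped

-- The pattern `Running pid` is refutable: when the bound value is `Stopped`,
-- the do-block calls `fail`, which for Maybe produces Nothing.
runningPid :: Maybe VmState -> Maybe Int
runningPid action = do
  Running pid <- action
  return pid
```

With these definitions, runningPid (Just Stopped) takes the fail path rather than raising a pattern match error.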

Now, on to the next example:

vid <- infoVid <$> vmInfo vm

By the definition of <$>, this will be entirely lazy in the value returned by the action (not the effects). If vmInfo ended with return undefined, you wouldn't get the exception from evaluating undefined until you did something that used the value of vid. Additionally, if infoVid had the ability to throw any exceptions, they wouldn't happen until the use of vid at the earliest.
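The difference between the two binds can be observed with a small sketch in IO. The action brokenVmInfo and the helper names are hypothetical, chosen only to stand in for a vmInfo that ends with return undefined:

```haskell
import Control.Exception (SomeException, try)
import Data.Either (isLeft, isRight)

data VmInfo = VmInfo
  { infoVid   :: String
  , infoIndex :: Int
  , infoPid   :: Int
  , infoExe   :: String
  } deriving (Show)

-- Hypothetical action: the effect succeeds, but the returned value is bottom.
brokenVmInfo :: String -> IO VmInfo
brokenVmInfo _ = return undefined

-- Pattern-match bind: the constructor is forced at the bind itself.
bindWithPattern :: String -> IO String
bindWithPattern vm = do
  (VmInfo _vid _ _ _) <- brokenVmInfo vm
  return "bind completed"

-- Accessor bind: nothing is forced, because the bound value is never used.
bindWithAccessor :: String -> IO String
bindWithAccessor vm = do
  _vid <- infoVid <$> brokenVmInfo vm
  return "bind completed"

-- Runs both versions and reports (pattern bind threw, accessor bind completed).
demo :: IO (Bool, Bool)
demo = do
  viaPattern  <- try (bindWithPattern "vm0")  :: IO (Either SomeException String)
  viaAccessor <- try (bindWithAccessor "vm0") :: IO (Either SomeException String)
  return (isLeft viaPattern, isRight viaAccessor)
```

Here the pattern version throws while the action is running, even though vid is never used, whereas the accessor version finishes normally.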

Interestingly enough, these differences are only present in the scope of a monadic bind. If vmInfo were pure and you were binding the name vid inside a let expression or a where clause, the two forms would generate identical code, because pattern bindings in let and where are lazy.
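A minimal sketch of the pure case, with hypothetical function names. It only shows that both bindings are equally lazy; it does not inspect the generated code:

```haskell
data VmInfo = VmInfo
  { infoVid   :: String
  , infoIndex :: Int
  , infoPid   :: Int
  , infoExe   :: String
  } deriving (Show)

-- Pattern binding in a where clause: lazy, so `info` is only forced
-- when `vid` is actually demanded.
viaPattern :: VmInfo -> (String, String)
viaPattern info = ("label", vid)
  where VmInfo vid _ _ _ = info

-- Accessor binding: also only forces `info` when `vid` is demanded.
viaAccessor :: VmInfo -> (String, String)
viaAccessor info = ("label", vid)
  where vid = infoVid info
```

Taking fst of either result works even when the argument is undefined, since neither version looks at the record until vid is needed.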

In that case, which one you would want to use is entirely up to you. Both are idiomatic Haskell. People generally pick whichever looks better in the context they're working in.

The main reasons people use accessor functions are brevity, when the record has so many fields that a pattern match would be huge, and the fact that they are actual functions: they can be passed to any higher-order function their type fits into. You can't pass around pattern matches as a distinct construct.
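As a small sketch of the higher-order use, the accessors can be handed directly to map and sortOn. The helper names allVids and sortByPid are made up for this example:

```haskell
import Data.List (sortOn)

data VmInfo = VmInfo
  { infoVid   :: String
  , infoIndex :: Int
  , infoPid   :: Int
  , infoExe   :: String
  } deriving (Show)

-- The accessor is an ordinary function, so it can be mapped over a list.
allVids :: [VmInfo] -> [String]
allVids = map infoVid

-- It can also serve as the sort key.
sortByPid :: [VmInfo] -> [VmInfo]
sortByPid = sortOn infoPid
```

Doing the same with pattern matching would need an explicit lambda such as \(VmInfo vid _ _ _) -> vid at each use site.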

146

answered Nov 15 '22 08:11

Carl
