Haskell: Confusion with own data types. Record syntax and unique fields

Tags: haskell

I just uncovered this confusion and would like a confirmation that it is what it is. Unless, of course, I am just missing something.

Say, I have these data declarations:

data VmInfo = VmInfo {name, index, id :: String} deriving (Show)
data HostInfo = HostInfo {name, index, id :: String} deriving (Show)

vm = VmInfo "vm1" "01" "74653"
host = HostInfo "host1" "02" "98732"

What I always thought and what seems to be so natural and logical is this:

vmName = vm.name
hostName = host.name

But this, obviously, does not work. I got this.

Questions

So my questions are.

When I create a data type with record syntax, do I have to make sure that all the fields have unique names? If yes - why?
Is there a clean way, something similar to a "scope resolution operator" like :: or ., so that Haskell can tell which data type the name (or any other non-unique field) belongs to and return the correct result?
What is the correct way to deal with this if I have several declarations with the same field names?

As a side note.

In general, I need to return data types similar to the above example. At first I returned them as tuples (which seemed to me the correct way at the time). But tuples are hard to work with as it is impossible to extract individual parts of a complex type as easily as with lists using "!!". So next I thought of dictionaries/hashes. When I tried using dictionaries, I wondered what the point of having my own data types was. While playing with and learning about data types, I ran into the fact that led me to the above question. So it looks like it is easier for me to use dictionaries instead of my own data types, as I can use the same fields for different objects.

Can you please elaborate on this and tell me how it is done in the real world?

asked Feb 20 '12 00:02

r.sendecky

2 Answers

Haskell record syntax is a bit of a hack, but each field name emerges as a function, and that function has to have a unique type. So you can share record-field names among constructors of a single datatype but not among distinct datatypes.

"What is the correct way to deal with this if I have several declarations with the same field names?"

You can't. You have to use distinct field names. If you want an overloaded name to select from a record, you can try using a type class. But basically, field names in Haskell don't work the way they do in, say, C or Pascal. Calling it "record syntax" might have been a mistake.
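As a minimal sketch of the "distinct field names" advice, the two declarations from the question could be written with a per-type prefix. The prefixed names (vmName, hostName, and so on) are an assumed naming convention chosen for illustration, not something the question prescribes.

```haskell
module Main where

-- Hypothetical prefixed field names: every selector is an ordinary
-- top-level function, so each name must be unique within the module.
data VmInfo = VmInfo
  { vmName  :: String
  , vmIndex :: String
  , vmId    :: String
  } deriving (Show)

data HostInfo = HostInfo
  { hostName  :: String
  , hostIndex :: String
  , hostId    :: String
  } deriving (Show)

vm :: VmInfo
vm = VmInfo "vm1" "01" "74653"

host :: HostInfo
host = HostInfo "host1" "02" "98732"

-- Selectors are applied like any other function: vmName :: VmInfo -> String
main :: IO ()
main = mapM_ putStrLn [vmName vm, hostName host]
```

Because vmName and hostName are separate functions with separate types, the compiler always knows which record is being read.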

"But tuples are hard to work with as it is impossible to extract individual parts of a complex type"

Actually, this can be quite easy using pattern matching. Example:

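-- Assumes a record with a numeric field, e.g. data VmInfo = VmInfo { vmId :: Int, vmName :: String }
smallId :: VmInfo -> Bool
smallId (VmInfo { vmId = n }) = n < 10   -- bind the vmId field to n, ignore the other fields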
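The same idea applies to plain tuples, which is what the quoted complaint was about. The following is only a sketch: the (name, index, id) layout and the helper names tupleName and tupleId are assumptions made for illustration.

```haskell
module Main where

-- Hypothetical tuple layout: (name, index, id).
type VmTuple = (String, String, String)

-- Pattern matching binds the component we care about; _ ignores the rest.
tupleName :: VmTuple -> String
tupleName (n, _, _) = n

tupleId :: VmTuple -> String
tupleId (_, _, i) = i

main :: IO ()
main = do
  let vm = ("vm1", "01", "74653")
  putStrLn (tupleName vm)
  putStrLn (tupleId vm)
```

No indexing operator is needed: the shape of the tuple is written on the left-hand side of the definition and the compiler checks it.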

As to how this is done in the "real world", Haskell programmers tend to rely heavily on knowing what type each field is at compile time. If you want the type of a field to vary, a Haskell programmer introduces a type parameter to carry the varying information. Example:

data VmInfo a = VmInfo { vmId :: Int, vmName :: String, vmInfo :: a }

Now you can have VmInfo String, VmInfo Dictionary, VmInfo Node, or whatever you want.
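A short sketch of how such a parameterised record might be used. The payload types (a String note and a list of tags) and the sample values are hypothetical; note that a function touching only vmId has to be given the type VmInfo a -> Bool once the parameter is added.

```haskell
module Main where

data VmInfo a = VmInfo { vmId :: Int, vmName :: String, vmInfo :: a }
  deriving (Show)

-- Works for any payload type, because it only inspects vmId.
smallId :: VmInfo a -> Bool
smallId (VmInfo { vmId = n }) = n < 10

-- Hypothetical payloads: a free-form note and a list of tags.
withNote :: VmInfo String
withNote = VmInfo { vmId = 7, vmName = "vm1", vmInfo = "staging box" }

withTags :: VmInfo [String]
withTags = VmInfo { vmId = 42, vmName = "vm2", vmInfo = ["prod", "db"] }

main :: IO ()
main = do
  print (smallId withNote)
  print (smallId withTags)
  print (length (vmInfo withTags))
```

The type of vmInfo is fixed at compile time for each value, so withNote and withTags cannot be mixed up by accident.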

Summary: each field name must belong to a unique type, and experienced Haskell programmers work with the static type system instead of trying to work around it. And you definitely want to learn about pattern matching.
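The rule summarized above describes standard Haskell. As a hedged aside, GHC also offers language extensions that relax it: DuplicateRecordFields (GHC 8.0 and later) allows the same field name in several types of one module, and OverloadedRecordDot (GHC 9.2 and later) enables the vm.name syntax from the question. The sketch below assumes GHC 9.2 or newer; the field ident is a hypothetical rename of id, chosen here to stay clear of Prelude's id.

```haskell
{-# LANGUAGE DuplicateRecordFields #-}
{-# LANGUAGE OverloadedRecordDot #-}
module Main where

-- Both records reuse the same field names (requires DuplicateRecordFields).
data VmInfo = VmInfo { name, index, ident :: String } deriving (Show)
data HostInfo = HostInfo { name, index, ident :: String } deriving (Show)

vm :: VmInfo
vm = VmInfo "vm1" "01" "74653"

host :: HostInfo
host = HostInfo "host1" "02" "98732"

-- Dot access (requires OverloadedRecordDot); the record's type selects the field.
vmName :: String
vmName = vm.name

hostName :: String
hostName = host.name

main :: IO ()
main = do
  putStrLn vmName
  putStrLn hostName
```

These are compiler extensions rather than part of the language standard, so whether to rely on them is a project-level decision; the prefix and type class approaches work without them.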

answered Oct 26 '22 19:10

Norman Ramsey

There are more reasons why this doesn't work: lowercase type names and data constructors, and OO-language-style member access with ".". In Haskell, those member access functions are actually free functions, i.e. vmName = name vm rather than vmName = vm.name; that is why they can't have the same names in different data types.

If you really want functions that can operate on both VmInfo and HostInfo objects, you need a type class, such as

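-- Note: id shares its name with Prelude's id; hide one of them
-- (e.g. import Prelude hiding (id)) before calling it unqualified.
class MachineInfo m where
  name :: m -> String
  index :: m -> String    -- why String anyway? Shouldn't this be an Int?
  id :: m -> String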

and make instances

instance MachineInfo VmInfo where
  name (VmInfo vmName _ _) = vmName
  index (VmInfo _ vmIndex _) = vmIndex
  ...
instance MachineInfo HostInfo where
  ...

Then name machine will work if machine is a VmInfo as well as if it's a HostInfo.
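For reference, here is one way the elided pieces could be filled in so that the whole thing compiles. It is a sketch under stated assumptions: the data types are declared with positional fields so that no record selectors clash with the class methods, machineId is a hypothetical method name used instead of id to avoid Prelude's id, and describe is an illustrative helper.

```haskell
module Main where

-- Positional constructors, so no selector functions clash with the class methods.
data VmInfo = VmInfo String String String deriving (Show)
data HostInfo = HostInfo String String String deriving (Show)

-- machineId is a hypothetical name chosen to avoid Prelude's id.
class MachineInfo m where
  name      :: m -> String
  index     :: m -> String
  machineId :: m -> String

instance MachineInfo VmInfo where
  name      (VmInfo n _ _) = n
  index     (VmInfo _ i _) = i
  machineId (VmInfo _ _ k) = k

instance MachineInfo HostInfo where
  name      (HostInfo n _ _) = n
  index     (HostInfo _ i _) = i
  machineId (HostInfo _ _ k) = k

-- One function that accepts any MachineInfo instance.
describe :: MachineInfo m => m -> String
describe m = name m ++ " (#" ++ index m ++ ")"

main :: IO ()
main = do
  putStrLn (describe (VmInfo "vm1" "01" "74653"))
  putStrLn (describe (HostInfo "host1" "02" "98732"))
```

The cost of this approach is one instance per type; the benefit is that functions like describe are written once against the class.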

answered Oct 26 '22 18:10

leftaroundabout
