I'm reading Learn You a Haskell and I'm wondering why so many things are acting like a list, and nothing in the Prelude is using the native facility of type classes to set this up: <blockquote> "The bytestring version of : is called cons It takes a byte and a bytestring and puts the byte at the beginning. It's lazy though, so it will make a new chunk even if the first chunk in the bytestring isn't full. That's why it's better to use the strict version of cons, cons' if you're going to be inserting a lot of bytes at the beginning of a bytestring." </blockquote> Why isn't there a TypeClass listable or something that offers the <code>:</code> function to unify <code>Data.ByteString</code>, <code>Data.List</code>, <code>Data.ByteString.Lazy</code>, etc? Is there a reason for this, or is this just an element of legacy Haskell? Using <code>:</code> as an example is kind of an understatement, also from LYAH: <blockquote> Otherwise, the bytestring modules have a load of functions that are analogous to those in Data.List, including, but not limited to, head, tail, init, null, length, map, reverse, foldl, foldr, concat, takeWhile, filter, etc. </blockquote>

The main problem with such a class is that even if it existed it would only offer a superficial similarity. The asymptotics of the same algorithm built using different structures would vary tremendously. In the case of strict bytestrings building them up with cons is terrible, because you wind up copying the entire string every time you add another Char. This O(1) operation on a list turns it into an O(n) operation on a Bytestring. This leads to O(n^2) behavior when you implement the first algorithm that might come to mind, map, whereas building up a list or Data.Sequence.Seq with cons is linear time and it can be implemented in O(n) for bytestrings or vectors as well with a little bit of thought. It turns out the utility of such a class in light of this is more superficial than actual. I'm not saying that a good design can't be found, but such a design would be difficult to use and to optimize for and likely a usable version of the design would not wind up being Haskell 98. I've eked out portions of this design space in my keys package, which provides a lot of functions for indexing into containers, etc, but I've deliberately avoided providing a list-like API a.) because it has been done before to little success and b.) because of the asymptotic concerns above. tl;dr You typically want to implement algorithms very differently when the asymptotics of the underlying operations change.

In Haskell, why isn't there a TypeClass for things that can act like lists?

Tags:

list

haskell

typeclass

I'm reading Learn You a Haskell and I'm wondering why so many things are acting like a list, and nothing in the Prelude is using the native facility of type classes to set this up:

"The bytestring version of : is called cons It takes a byte and a bytestring and puts the byte at the beginning. It's lazy though, so it will make a new chunk even if the first chunk in the bytestring isn't full. That's why it's better to use the strict version of cons, cons' if you're going to be inserting a lot of bytes at the beginning of a bytestring."

Why isn't there a TypeClass listable or something that offers the : function to unify Data.ByteString, Data.List, Data.ByteString.Lazy, etc? Is there a reason for this, or is this just an element of legacy Haskell? Using : as an example is kind of an understatement, also from LYAH:

Otherwise, the bytestring modules have a load of functions that are analogous to those in Data.List, including, but not limited to, head, tail, init, null, length, map, reverse, foldl, foldr, concat, takeWhile, filter, etc.

580

asked Sep 02 '10 03:09

NO WAR WITH RUSSIA

2 Answers

The ListLike package seems to provide what you're looking for. I've never understood why it isn't more popular.

ListLike aside, one reason this isn't implemented in the Prelude is because it's not possible to do so well without invoking some language extensions (multi-param type classes and fundeps or associated types). There are three sorts of containers to consider:

Containers that don't care about their elements at all (e.g. [])
Containers which are only implemented for specific elements (e.g. bytestrings)
Containers which are polymorphic over elements but require a context (e.g. Data.Vector.Storable, which will hold any type with a storable instance).

Here's a very basic ListLike-style class without using any extensions:

class Listable container where   head :: container a -> a  instance Listable [] where   head (x:xs) = x  instance Listable ByteString where --compiler error, wrong kind  instance Listable SV.Vector where   head v = SV.head    --compiler error, can't deduce context (Storable a)

Here container has kind *->*. This won't work for bytestrings because they don't allow an arbitrary type; they have kind *. It also won't work for a Data.Vector.Storable vector, because the class doesn't include the context (the Storable constraint).

You can fix this problem by either changing your class definition to

class ListableMPTC container elem | container -> elem where

class ListableAT container where   type Elem container :: *

Now container has kind *; it's a fully-applied type constructor. That is, your instances look like

instance ListableMPTC [a] a where

but you're no longer Haskell98.

That's why even a simple Listable-type interface is non-trivial; it gets a bit harder when you have different collection semantics to account for (e.g. queues). The other really big challenge is mutable-vs.-immutable data. So far every attempt I've seen (except one) punts on that issue by creating a mutable interface and an immutable one. The one interface I know which did unify the two was mind-bending, invoked a bunch of extensions, and had quite poor performance.

Addendum: bytestrings

Totally conjecture on my part, but I think we're stuck with bytestrings as a product of evolution. That is, they were the first solution to low performance I/O operations, and it made sense to use Ptr Word8s for interfacing with IO system calls. Operations on pointers require Storable, and most likely the necessary extensions (as described above) to make polymorphism work weren't available then. Now it's difficult to overcome their momentum. A similar container with polymorphism is certainly possible, the storablevector package implements this, but it's not anywhere near as popular.

Could bytestrings be polymorphic without any restrictions on the elements? I think the closest Haskell has to this is the Array type. This isn't nearly as good as a bytestring for low-level IO because data needs to be unpacked from the pointer into the array's internal format. Also the data is boxed, which adds significant space overhead. If you want unboxed storage (less space) and efficient interfacing with C, pointers are the way to go. Once you have a Ptr, you need Storable, and then you need to include the element type in the type class, so then you're left with requiring extensions.

That being said, I think that with the appropriate extensions available this is essentially a solved problem for any single container implementation (modulo mutable/immutable APIs). The harder part now is coming up with a sensible set of classes that are usable for many different types of structures (lists, arrays, queues, etc.) and is flexible enough to be useful. I personally would expect this to be relatively straightforward, but I could be wrong.

answered Oct 08 '22 18:10

John L

The main problem with such a class is that even if it existed it would only offer a superficial similarity.

The asymptotics of the same algorithm built using different structures would vary tremendously.

In the case of strict bytestrings building them up with cons is terrible, because you wind up copying the entire string every time you add another Char. This O(1) operation on a list turns it into an O(n) operation on a Bytestring.

This leads to O(n^2) behavior when you implement the first algorithm that might come to mind, map, whereas building up a list or Data.Sequence.Seq with cons is linear time and it can be implemented in O(n) for bytestrings or vectors as well with a little bit of thought.

It turns out the utility of such a class in light of this is more superficial than actual.

I'm not saying that a good design can't be found, but such a design would be difficult to use and to optimize for and likely a usable version of the design would not wind up being Haskell 98.

I've eked out portions of this design space in my keys package, which provides a lot of functions for indexing into containers, etc, but I've deliberately avoided providing a list-like API a.) because it has been done before to little success and b.) because of the asymptotic concerns above.

tl;dr You typically want to implement algorithms very differently when the asymptotics of the underlying operations change.

answered Oct 08 '22 19:10

Edward Kmett

Related questions
                            
                                Scala - convert List of Lists into a single List: List[List[A]] to List[A]
                            
                                Remove all empty elements from string array
                            
                                What is the order of evaluation in python when using pop(), list[-1] and +=?
                            
                                Scala, repeat a finite list infinitely
                            
                                User interface for reordering a list items
                            
                                populate a dictionary using linq
                            
                                Converting a list of tuples into a dict
                            
                                Unpack list to variables
                            
                                Retrieve the two highest item from a list containing 100,000 integers
                            
                                lisp filter out results from list not matching predicate
                            
                                Python find first instance of non zero number in list
                            
                                Creating a list of dictionaries results in a list of copies of the same dictionary
                            
                                Process a list with a loop, taking 100 elements each time and automatically less than 100 at the end of the list
                            
                                How to insert the contents of one list into another
                            
                                Using return vs. not using return in the list monad
                            
                                How do I sort a zipped list in Python?
                            
                                Getting a list of all possible data type values in Haskell
                            
                                Avoid Adding duplicate elements to a List C#
                            
                                How to get list elements by index in elixir
                            
                                How to return a subset of a list that matches a condition [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With