Workaround for Scala RDD not being covariant

I'm trying to write a function to operate on RDD[Seq[String]] objects, e.g.:

def foo(rdd: RDD[Seq[String]]) = { println("hi") }

This function cannot be called on objects of type RDD[Array[String]]:

val testRdd : RDD[Array[String]] = sc.textFile("somefile").map(_.split("\\|", -1))
foo(testRdd)

->
error: type mismatch;
found   : org.apache.spark.rdd.RDD[Array[String]]
required: org.apache.spark.rdd.RDD[Seq[String]]

I guess that's because RDD isn't covariant.

I've tried a bunch of definitions of foo to get around this. Only one of them has compiled:

def foo2[T[String] <: Seq[String]](rdd: RDD[T[String]]) = { println("hi") }

But it's still broken:

foo2(testRdd)


->
<console>:101: error: inferred type arguments [Array] do not conform to method foo2's type
parameter bounds [T[String] <: Seq[String]]
          foo2(testRdd)
          ^
<console>:101: error: type mismatch;
found   : org.apache.spark.rdd.RDD[Array[String]]
required: org.apache.spark.rdd.RDD[T[String]]

Any idea how I can work around this? This is all taking place in the Spark shell.

891

asked May 22 '14 16:05

user3666020

1 Answer

For this you can use a view bound.

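Array is not a Seq, so even a covariant RDD would not accept an RDD[Array[String]] where an RDD[Seq[String]] is required. An Array can, however, be viewed as a Seq.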

def foo[T <% Seq[String]](rdd: RDD[T]) = ???

The <% says that T can be viewed as a Seq[String], so whenever you call a Seq[String] method on a T, the T is first converted to a Seq[String].
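
As a rough sketch of what the view bound expands to, assuming Scala 2.x: `T <% Seq[String]` is shorthand for an extra implicit parameter of type T => Seq[String]. View bounds were later deprecated, so writing the parameter out by hand is the more durable spelling. The names `ViewBoundSketch`, `Box` and `fieldCounts` are hypothetical; `Box` is only an invariant stand-in for RDD so the snippet runs without Spark.

```scala
object ViewBoundSketch {
  // Hypothetical stand-in for RDD: invariant in T, like org.apache.spark.rdd.RDD.
  final class Box[T](val items: Vector[T])

  // Equivalent in meaning to `def fieldCounts[T <% Seq[String]](box: Box[T])`,
  // with the view written out as an explicit implicit parameter.
  def fieldCounts[T](box: Box[T])(implicit asSeq: T => Seq[String]): Vector[Int] =
    box.items.map(row => asSeq(row).length)

  // Same splitting logic as in the question, applied to in-memory strings.
  val rows: Box[Array[String]] =
    new Box(Vector("a|b|c", "d|e").map(_.split("\\|", -1)))

  // Compiles because an implicit Array => Seq conversion is found in scala.Predef.
  val counts: Vector[Int] = fieldCounts(rows)
}
```

With a real RDD the signature would presumably read `def foo[T](rdd: RDD[T])(implicit asSeq: T => Seq[String])`, and `foo(testRdd)` would then type-check the same way.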

For Array[A] to be viewed as Seq[A], there needs to be an implicit conversion from Array to Seq in scope. As Ionuț G. Stan said, such a conversion exists in scala.Predef.
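
The conversion can be seen without Spark. The following is a minimal sketch, assuming Scala 2.x; `ArrayViewSketch` and its fields are hypothetical names used only for illustration.

```scala
object ArrayViewSketch {
  // Same split call as in the question; the limit -1 keeps trailing empty fields.
  val fields: Array[String] = "a|b||".split("\\|", -1)

  // Array[String] is not a subtype of Seq[String]. This assignment compiles
  // only because scala.Predef supplies an implicit Array => Seq conversion.
  val viewed: Seq[String] = fields

  // The explicit alternative, which does not rely on an implicit view.
  val explicit: Seq[String] = fields.toSeq
}
```

If you would rather not depend on the implicit view at all, the explicit form should also work with the original foo, at the cost of an extra map step: `foo(testRdd.map(_.toSeq))`. As far as I know, on Scala 2.13 the Predef conversion to the default (immutable) Seq copies the array and is itself deprecated, so the explicit `.toSeq` may be the quieter option there.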

105

answered Oct 08 '22 03:10

ggovan

Tags: types, scala, apache-spark, covariance
