Am new to spark. I have two RDD's and want to generate resulted RDD on them as below. <pre class="prettyprint"><code>val rdd1 = Array(1, 2) val rdd2 = Array(a, b, c) val resultRDD = [(1,a), (1,b), (1,c), (2,a), (2,b), (2,c)] </code></pre> Can anyone help me on what transformations or actions I need to use to generate resultRDD like above. FYI, I am writing in scala. EDIT Thanks. spark cartesian works for me as below. <pre class="prettyprint"><code> val data = Array('a', 'b') val rdd1 = sc.parallelize(data) val data2 = Array(1, 2, 3) val rdd2 = sc.parallelize(data2) rdd1.cartesian(rdd2).foreach(println) </code></pre>

<pre class="prettyprint"><code>def cartesian[U](other: RDD[U])(implicit arg0: ClassTag[U]): RDD[(T, U)] </code></pre> <blockquote> Return the Cartesian product of this RDD and another one, that is, the RDD of all pairs of elements (a, b) where a is in this and b is in other. </blockquote> Doc here

How to get the product of two RDDs?

Tags:

scala

apache-spark

Am new to spark. I have two RDD's and want to generate resulted RDD on them as below.

val rdd1 =  Array(1, 2)
val rdd2 =  Array(a, b, c)

val resultRDD = [(1,a), (1,b), (1,c), (2,a), (2,b), (2,c)]

Can anyone help me on what transformations or actions I need to use to generate resultRDD like above. FYI, I am writing in scala.

EDIT

Thanks. spark cartesian works for me as below.

    val data = Array('a', 'b')
    val rdd1 = sc.parallelize(data)

    val data2 = Array(1, 2, 3)
    val rdd2 = sc.parallelize(data2)

    rdd1.cartesian(rdd2).foreach(println)

250

asked Nov 13 '14 06:11

Pand005

1 Answers

def cartesian[U](other: RDD[U])(implicit arg0: ClassTag[U]): RDD[(T, U)]

Return the Cartesian product of this RDD and another one, that is, the RDD of all pairs of elements (a, b) where a is in this and b is in other.

Doc here

110

answered Sep 19 '22 00:09

The Archetypal Paul

Related questions
                            
                                foreach and Enumeration
                            
                                Why don't Scala case class fields reflect as public?
                            
                                Is there a Java version of Clojure's or Scala's persistent immutable vector?
                            
                                garbage collect objects after lazy values have been calculated
                            
                                How to list all fields with a custom annotation using Scala's reflection at runtime?
                            
                                Play framework WS set cookie
                            
                                "Dead letters encountered" as soon as actors are placed into router
                            
                                Read JSON Tree structure in Scala Play Framework
                            
                                Create a generic Json serialization function
                            
                                How to return None in Scala
                            
                                Catching unhandled errors in Scala futures
                            
                                What are the benefits of Reader monad?
                            
                                sbt not working on amazon ec2 micro instance
                            
                                What does the _ parameter signify in this context?
                            
                                Make method actually inline
                            
                                Failing maven-build when Gatling-test has too high fail-percentage
                            
                                Difference between isInstance and isInstanceOf
                            
                                How to define a list of functions of the same arity in Scala?
                            
                                abandon calling `get` on Option and generate compile error
                            
                                Can't use a negative number in named parameters in Scala

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With