 

Change Iterable[(String, Double)] of an RDD to Array or List

I have an org.apache.spark.rdd.RDD[(String, (Double, Double), Iterable[(String, Double)])], but working with the Iterable directly seems awkward. Is there any way I can convert it to an Array[(String, Double)]?

asked Aug 10 '15 by Kevin Zakka

People also ask

How do you convert an RDD to a string in PySpark?

Try x = all_coord_iso_rdd.take(4). Then print(type(x)); you'll see that it is a list (of tuples). Then just convert it to a string.

Which function is used to pipe each partition of the RDD through a shell command?

The pipe operation: rdd.pipe passes each partition of the RDD through a shell command, e.g. a Perl or bash script. Elements of the RDD are written to the command's stdin, and the lines it prints are returned as an RDD of strings.
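For illustration, a minimal sketch of RDD.pipe in Scala (assuming a SparkContext named sc and a POSIX tr available on the workers):

val words = sc.parallelize(Seq("spark", "rdd", "pipe"))
// each element is written to the command's stdin (one per line);
// each line the command prints becomes an element of the resulting RDD[String]
val upper = words.pipe(Seq("tr", "a-z", "A-Z"))
upper.collect()  // Array(SPARK, RDD, PIPE)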


1 Answer

You can simply use Iterable.toArray:

rdd.map{case (x, y, iter) => (x, y, iter.toArray)}

or Iterable.toList:

rdd.map{case (x, y, iter) => (x, y, iter.toList)}
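For context, here is a minimal end-to-end sketch, assuming a SparkContext named sc and made-up sample data; the groupBy step is just one hypothetical way such an RDD could have been built:

import org.apache.spark.rdd.RDD

// hypothetical sample rows: (id, (x, y), neighbour, distance)
val rows = sc.parallelize(Seq(
  ("a", (1.0, 2.0), "b", 0.5),
  ("a", (1.0, 2.0), "c", 1.5),
  ("b", (3.0, 4.0), "a", 0.5)
))

// one way an RDD[(String, (Double, Double), Iterable[(String, Double)])] can arise
val grouped: RDD[(String, (Double, Double), Iterable[(String, Double)])] =
  rows.groupBy { case (id, xy, _, _) => (id, xy) }
      .map { case ((id, xy), group) =>
        (id, xy, group.map { case (_, _, n, d) => (n, d) })
      }

// materialise the Iterable as an Array (or use toList) so it is easier to work with
val withArrays: RDD[(String, (Double, Double), Array[(String, Double)])] =
  grouped.map { case (id, xy, iter) => (id, xy, iter.toArray) }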
answered Oct 31 '22 by zero323