Spark: unpersist RDDs for which I have lost the reference

How can I unpersist RDDs that were generated in an MLlib model and for which I don't have a reference?

I know that in PySpark you can unpersist all DataFrames with sqlContext.clearCache(). Is there something similar for RDDs in the Scala API? Furthermore, is there a way to unpersist only some RDDs without having to unpersist all of them?
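(For reference, the same cache-clearing call exists in the Scala API, although it only covers cached DataFrames and tables, not RDDs persisted directly. A minimal sketch, assuming a SparkSession named spark:)

import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("clear-cache-example").getOrCreate()

// Clears every cached table/DataFrame, the Scala counterpart of
// sqlContext.clearCache() in PySpark. RDDs persisted directly are
// not affected, which is exactly the gap the question is about.
spark.sqlContext.clearCache()
// or equivalently: spark.catalog.clearCache()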

germanium asked Feb 06 '17 16:02

People also ask

What does Unpersist do in PySpark?

Marks the DataFrame as non-persistent, and removes all blocks for it from memory and disk.
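In the Scala API the call looks the same; a minimal sketch (the sample data and column names are invented for illustration):

import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("unpersist-example").getOrCreate()
import spark.implicits._

val df = Seq((1, "a"), (2, "b")).toDF("id", "value")
df.cache()      // mark the DataFrame for caching
df.count()      // materialize the cache
df.unpersist()  // drop its blocks from memory and disk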

What is Unpersist?

(transitive, computing) To remove from permanent storage; to make temporary again.

How do I remove RDD from PySpark?

A call to gc.collect() also usually works. Almost. You should remove the last reference to it (i.e. del thisRDD), and then, if you really need the RDD to be unpersisted immediately, call gc.collect().

How many ways can you create RDD in Spark?

There are two ways to create RDDs: parallelizing an existing collection in your driver program, or referencing a dataset in an external storage system, such as a shared filesystem, HDFS, HBase, or any data source offering a Hadoop InputFormat.
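Both approaches look like this in the Scala API (the HDFS path is only a placeholder):

import org.apache.spark.{SparkConf, SparkContext}

val sc = new SparkContext(new SparkConf().setAppName("rdd-creation-example"))

// 1. Parallelize an existing collection in the driver program
val fromCollection = sc.parallelize(Seq(1, 2, 3, 4, 5))

// 2. Reference a dataset in external storage (placeholder path)
val fromFile = sc.textFile("hdfs:///data/input.txt")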


1 Answer

You can call

val rdds = sparkContext.getPersistentRDDs // result is Map[Int, RDD[_]]

and then filter the values to get the ones you want (1):

rdds.filter { case (_, rdd) => filterLogic(rdd) }.foreach { case (_, rdd) => rdd.unpersist() }

(1) - written by hand, without a compiler - sorry if there are any errors, but there shouldn't be ;)
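Put together, a self-contained sketch of this approach (the setName-based filter is just one possible filterLogic, and the RDD names are made up for illustration):

import org.apache.spark.{SparkConf, SparkContext}

val sc = new SparkContext(new SparkConf().setAppName("selective-unpersist").setMaster("local[*]"))

// Simulate RDDs cached somewhere out of reach, e.g. inside an MLlib model
sc.parallelize(1 to 100).setName("model_intermediate").cache().count()
sc.parallelize(1 to 100).setName("keep_me").cache().count()

// Look up every persisted RDD by its id, no external reference needed
val persisted = sc.getPersistentRDDs // Map[Int, RDD[_]]

// Unpersist only the ones matching some condition, here a name prefix
persisted
  .filter { case (_, rdd) => Option(rdd.name).exists(_.startsWith("model_")) }
  .foreach { case (_, rdd) => rdd.unpersist() }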

T. Gawęda answered Oct 10 '22 23:10