How do I filter rows based on whether a column value is in a Set of Strings in a Spark DataFrame

Is there a more elegant way of filtering based on values in a Set[String]?

import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.functions.udf

def myFilter(actions: Set[String], myDF: DataFrame): DataFrame = {
  // UDF that checks whether a row's action value is in the given set
  val containsAction = udf((action: String) => actions.contains(action))
  // 'action is the Symbol syntax for a column; needs sqlContext.implicits._ in scope
  myDF.filter(containsAction('action))
}
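
For context, the helper is invoked like this (the set values here are illustrative):

val filtered = myFilter(Set("action1", "action2", "action3"), myDF)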

In SQL you can do

select * from myTable where action in ('action1', 'action2', 'action3')
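
That SQL runs in Spark too once the DataFrame is registered as a table; a minimal sketch, assuming a SQLContext named sqlContext and an illustrative table name:

myDF.registerTempTable("myTable")
val filtered = sqlContext.sql(
  "select * from myTable where action in ('action1', 'action2', 'action3')")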
asked Jul 14 '15 by zzztimbo



1 Answer

How about this:

myDF.filter("action in (1,2)")

OR

import org.apache.spark.sql.functions.lit
// $"action" needs import sqlContext.implicits._ in scope
myDF.where($"action".in(Seq("action1", "action2").map(lit(_)): _*))

OR

import org.apache.spark.sql.functions.lit
myDF.where($"action".in(Seq(lit("action1"), lit("action2")): _*))

Additional support is being added in 1.5 to make this cleaner: Column.isin will accept the raw values directly, so the lit wrapping goes away.
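
On 1.5 or later that looks like the following sketch; the values mirror the question, and the last line assumes the asker's actions: Set[String] is in scope:

myDF.where($"action".isin("action1", "action2", "action3"))

// or splat the whole set in one go
myDF.where($"action".isin(actions.toSeq: _*))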

answered Sep 20 '22 by Justin Pihony