I'm trying to filter a Spark DataFrame using a list in Java.
java.util.List<Long> selected = ....;
DataFrame result = df.filter(df.col("something").isin(????));
The problem is that the isin(...) method accepts a Scala Seq or varargs. Passing in JavaConversions.asScalaBuffer(selected) doesn't work either.
Any ideas?
In Spark, isin() is the DataFrame counterpart of the SQL IN operator: it is a function of the Column class that returns true when the value of the expression is contained in the supplied list of values. To express IS NOT IN, negate the result of isin() with the NOT operator.
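As an illustration, a minimal sketch of both forms in the Java API could look like the following, assuming df is the DataFrame from the question and the values are hypothetical:

import static org.apache.spark.sql.functions.col;
import static org.apache.spark.sql.functions.not;

// SQL IN: keep rows whose "something" value is one of the given values.
df.filter(col("something").isin(1L, 2L, 3L));

// SQL NOT IN: negate the isin() result with not().
df.filter(not(col("something").isin(1L, 2L, 3L)));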
Use the stream() method to turn the list into an array for the varargs overload:

df.filter(col("something").isin(selected.stream().toArray(Long[]::new)));
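Putting it together, a runnable sketch might look like the one below. The class name, column name, and sample values are illustrative, and it assumes the Spark 2.x SparkSession/Dataset API rather than the older DataFrame class from the question:

import java.util.Arrays;
import java.util.List;
import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SparkSession;
import static org.apache.spark.sql.functions.col;

public class IsInExample {
    public static void main(String[] args) {
        SparkSession spark = SparkSession.builder()
                .master("local[*]")
                .appName("isin-example")
                .getOrCreate();

        // Hypothetical DataFrame with a single long column named "something".
        Dataset<Row> df = spark.range(0, 10).toDF("something");

        // The values to keep, held in a java.util.List as in the question.
        List<Long> selected = Arrays.asList(1L, 3L, 5L);

        // Convert the list to a Long[] so it is expanded into the isin(Object...) varargs.
        Dataset<Row> result = df.filter(col("something").isin(selected.stream().toArray(Long[]::new)));
        result.show();

        // Equivalent alternative: List.toArray() returns an Object[], which isin(Object...) accepts as-is.
        df.filter(col("something").isin(selected.toArray())).show();

        spark.stop();
    }
}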