 

Filter only non-empty arrays in a Spark DataFrame [duplicate]

How can I filter only the non-empty arrays?

    import org.apache.spark.sql.types.ArrayType

    // Collect the names of all array-typed columns in the schema
    val arrayFields = secondDF.schema.filter(st => st.dataType.isInstanceOf[ArrayType])
    val names = arrayFields.map(_.name)

Or with this code:

val DF1 = DF
  .select(col("key"), explode(col("objectiveAttachment")).as("collection"))
  .select(col("collection.*"), col("key"))

 |-- objectiveAttachment: array (nullable = true)
 |    |-- element: string (containsNull = true)

I get this error

 org.apache.spark.sql.AnalysisException: Can only star expand struct data types. Attribute: ArrayBuffer(collection);

Any help is appreciated.

A kram asked Apr 01 '19 18:04

People also ask

How do I filter NOT NULL values in Spark DataFrame?

In Spark, the filter() or where() functions of DataFrame can be used to filter out rows with NULL values by checking IS NULL or isNull. This removes all rows with null values in the state column and returns a new DataFrame.
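For example, a minimal sketch in Scala (the DataFrame df and the state column are assumptions taken from the excerpt above):

    import org.apache.spark.sql.functions.col

    // Keep only rows where "state" is not null; the two forms are equivalent
    val nonNull1 = df.filter(col("state").isNotNull)
    val nonNull2 = df.where("state IS NOT NULL")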

How do you use NOT NULL in PySpark?

Solution: To find the non-null values of PySpark DataFrame columns, use the isNotNull() function, for example df.name.isNotNull(); similarly, for non-NaN values use ~isnan(df.name).
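The excerpt refers to PySpark; an equivalent check in Scala, matching the rest of this page, might look like the sketch below (df and the numeric column amount are hypothetical):

    import org.apache.spark.sql.functions.{col, isnan}

    // Keep rows where "amount" is neither null nor NaN (isnan applies to numeric columns)
    val clean = df.filter(col("amount").isNotNull && !isnan(col("amount")))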

What is the difference between filter and where in a Spark DataFrame?

The Spark where() function filters rows from a DataFrame or Dataset based on one or more conditions or a SQL expression. where() can be used instead of filter() by users coming from a SQL background; both functions operate exactly the same way.
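A quick illustration, assuming a DataFrame df with a key column; all three lines produce the same result:

    import org.apache.spark.sql.functions.col

    val a = df.filter(col("key") === "foo")   // Column expression with filter()
    val b = df.where(col("key") === "foo")    // same expression with where()
    val c = df.where("key = 'foo'")           // SQL-string form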

IS NOT NULL in Spark?

The isNotNull method returns true if the column does not contain a null value, and false otherwise. The isin method returns true if the column's value is contained in a list of arguments, and false otherwise. You will use the isNull, isNotNull, and isin methods constantly when writing Spark code.
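For instance, a small sketch combining isNotNull and isin (the column name and values are assumptions):

    import org.apache.spark.sql.functions.col

    // Keep non-null rows whose "key" is one of the listed values
    val subset = df.filter(col("key").isNotNull && col("key").isin("a", "b", "c"))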


2 Answers

Use the size function:

import org.apache.spark.sql.functions._

secondDF.filter(size($"objectiveAttachment") > 0)
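To see how this fits the question's pipeline, here is a minimal sketch with made-up sample data, assuming a SparkSession named spark is in scope:

    import org.apache.spark.sql.functions.{explode, size}
    import spark.implicits._   // for $ and toDF

    // Made-up rows matching the question's schema
    val secondDF = Seq(
      ("k1", Seq("a", "b")),
      ("k2", Seq.empty[String])
    ).toDF("key", "objectiveAttachment")

    secondDF
      .filter(size($"objectiveAttachment") > 0)   // drop rows with empty arrays
      .select($"key", explode($"objectiveAttachment").as("collection"))
      .show()
    // Only k1's rows remain, one row per array element.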
Henrique Florencio answered Oct 03 '22 04:10


Try the size() function from org.apache.spark.sql.functions._. Note that the filter has to run before the explode/select, while objectiveAttachment is still a column, and that col("collection.*") cannot be star-expanded here because the array elements are plain strings (that is what caused the error in the question):

    import org.apache.spark.sql.functions._

    val df1 = df.filter(size($"objectiveAttachment") > 0)   // keep only non-empty arrays
      .select(col("key"), explode(col("objectiveAttachment")).as("collection"))
deo answered Oct 03 '22 04:10