I have a dataframe with the following structure: <pre class="prettyprint"><code> |-- data: struct (nullable = true) | |-- id: long (nullable = true) | |-- keyNote: struct (nullable = true) | | |-- key: string (nullable = true) | | |-- note: string (nullable = true) | |-- details: map (nullable = true) | | |-- key: string | | |-- value: string (valueContainsNull = true) </code></pre> How it is possible to flatten the structure and create a new dataframe: <pre class="prettyprint"><code> |-- id: long (nullable = true) |-- keyNote: struct (nullable = true) | |-- key: string (nullable = true) | |-- note: string (nullable = true) |-- details: map (nullable = true) | |-- key: string | |-- value: string (valueContainsNull = true) </code></pre> Is there something like explode, but for structs?

This should work in Spark 1.6 or later: <pre class="prettyprint"><code>df.select(df.col("data.*")) </code></pre> or <pre class="prettyprint"><code>df.select(df.col("data.id"), df.col("data.keyNote"), df.col("data.details")) </code></pre>

How to flatten a struct in a Spark dataframe?

Tags:

java

apache-spark

apache-spark-sql

pyspark

I have a dataframe with the following structure:

 |-- data: struct (nullable = true)
 |    |-- id: long (nullable = true)
 |    |-- keyNote: struct (nullable = true)
 |    |    |-- key: string (nullable = true)
 |    |    |-- note: string (nullable = true)
 |    |-- details: map (nullable = true)
 |    |    |-- key: string
 |    |    |-- value: string (valueContainsNull = true)

How it is possible to flatten the structure and create a new dataframe:

     |-- id: long (nullable = true)
     |-- keyNote: struct (nullable = true)
     |    |-- key: string (nullable = true)
     |    |-- note: string (nullable = true)
     |-- details: map (nullable = true)
     |    |-- key: string
     |    |-- value: string (valueContainsNull = true)

Is there something like explode, but for structs?

860

asked Sep 30 '22 14:09

djWann

1 Answers

This should work in Spark 1.6 or later:

df.select(df.col("data.*"))

df.select(df.col("data.id"), df.col("data.keyNote"), df.col("data.details"))

119

answered Oct 19 '22 01:10

2 revsuser6022341

Related questions
                            
                                ExecutorService vs ThreadPoolExecutor using LinkedBlockingQueue
                            
                                ResourcesCompat.getDrawable() vs AppCompatResources.getDrawable()
                            
                                java.net.SocketTimeoutException: timeout
                            
                                What is the difference between != and =! in Java? [duplicate]
                            
                                Subclipse and JavaHL installation headache
                            
                                How to change Runnable to lambda expression in Java with IntelliJ shortcut
                            
                                Whole text file to a String in Java
                            
                                java.util.Date - Deleting three months from a date?
                            
                                Stream of boolean values, is any true?
                            
                                How to determine if a number is positive or negative?
                            
                                Hibernate: Create Mysql InnoDB tables instead of MyISAM
                            
                                Java Enums and Switch Statements - the default case?
                            
                                Calculating the angle between the line defined by two points
                            
                                Which String method: "contains" or "indexOf > -1"?
                            
                                Intellij generate javadoc for methods and classes
                            
                                Running java with JAVA_OPTS env variable has no effect
                            
                                java.lang.RuntimeException: Uncompilable source code - what can cause this?
                            
                                Create new package in IntelliJ
                            
                                Concat VS Merge operator
                            
                                Eclipse javadoc background color is black

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With