
Transpose DataFrame Without Aggregation in Spark with scala

I looked at a number of different solutions online, but could not find what I am trying to achieve. Please help me with this.

I am using Apache Spark 2.1.0 with Scala. Below is my dataframe:


+-----------+-------+
|COLUMN_NAME| VALUE |
+-----------+-------+
|col1       | val1  |
|col2       | val2  |
|col3       | val3  |
|col4       | val4  |
|col5       | val5  |
+-----------+-------+

I want this to be transposed to the following:


+-----+-------+-----+------+-----+
|col1 | col2  |col3 | col4 |col5 |
+-----+-------+-----+------+-----+
|val1 | val2  |val3 | val4 |val5 |
+-----+-------+-----+------+-----+
Maruti K asked Mar 20 '18

People also ask

How do I convert rows to columns in Scala Spark?

Spark SQL provides a pivot() function to rotate the data from one column into multiple columns (transpose row to column). It is an aggregation where one of the grouping columns values is transposed into individual columns with distinct data.

How do you transpose columns to rows in PySpark DataFrame?

Spark pivot() function is used to pivot/rotate the data from one DataFrame/Dataset column into multiple columns (transform row to column) and unpivot is used to transform it back (transform columns to rows).
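As a minimal sketch of the round trip described above (the session name, sample data, and column names here are illustrative assumptions, not from the question):

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions._

val spark = SparkSession.builder().master("local[*]").appName("pivot-unpivot").getOrCreate()
import spark.implicits._

// A long-format DataFrame: one row per (product, quarter) pair
val sales = Seq(
  ("apples", "Q1", 10), ("apples", "Q2", 20),
  ("pears",  "Q1", 15), ("pears",  "Q2", 25)
).toDF("product", "quarter", "amount")

// pivot: rotate the distinct quarter values into columns (rows -> columns)
val wide = sales.groupBy("product").pivot("quarter").sum("amount")

// unpivot: the SQL stack() generator turns the quarter columns back into rows
val long = wide.selectExpr("product",
  "stack(2, 'Q1', Q1, 'Q2', Q2) as (quarter, amount)")
```

Note that `pivot` without an explicit value list triggers an extra job to discover the distinct values; passing them explicitly, e.g. `pivot("quarter", Seq("Q1", "Q2"))`, avoids that scan.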

What does AGG do in Scala?

agg. (Scala-specific) Compute aggregates by specifying a map from column name to aggregate methods. The resulting DataFrame will also contain the grouping columns. The available aggregate methods are avg , max , min , sum , count .
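The Scala-specific map form can be sketched as follows (the session name and sample data are assumptions for illustration):

```scala
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().master("local[*]").appName("agg-example").getOrCreate()
import spark.implicits._

val df = Seq(("a", 1), ("a", 3), ("b", 5)).toDF("key", "num")

// Map from column name to aggregate method; the grouping column "key" is kept
val agged = df.groupBy("key").agg(Map("num" -> "max"))
// agged has columns: key, max(num)
```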


1 Answer

If your dataframe is small enough, as in the question, you can collect COLUMN_NAME to form the schema and collect VALUE to form the row, then create a new dataframe:

import org.apache.spark.sql.functions._
import org.apache.spark.sql.types.{StructType, StructField, StringType}
import org.apache.spark.sql.Row
//create the schema from the COLUMN_NAME values of the existing dataframe
val schema = StructType(df.select(collect_list("COLUMN_NAME")).first().getAs[Seq[String]](0).map(x => StructField(x, StringType)))
//create a single-row RDD[Row] from the collected VALUE column
val values = sc.parallelize(Seq(Row.fromSeq(df.select(collect_list("VALUE")).first().getAs[Seq[String]](0))))
//create the new dataframe
sqlContext.createDataFrame(values, schema).show(false)

which should give you

+----+----+----+----+----+
|col1|col2|col3|col4|col5|
+----+----+----+----+----+
|val1|val2|val3|val4|val5|
+----+----+----+----+----+
Ramesh Maharjan answered Oct 08 '22