How to change a column position in a Spark DataFrame?

Tags: dataframe, scala, apache-spark, apache-spark-sql

Is it possible to change the position of a column in a DataFrame, that is, to change the schema?

Specifically, I have a DataFrame with columns [field1, field2, field3] and I would like to get [field1, field3, field2].

I can't share any code. Imagine a DataFrame with one hundred columns: after some joins and transformations, some of these columns are out of position relative to the schema of the destination table.

How can I move one or several columns, i.e. how can I change the schema?

asked Jun 29 '16 15:06

obiwan kenobi

1 Answer

You can get the column names, reorder them however you want, and then use select on the original DataFrame to get a new one with this new order:

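    import org.apache.spark.sql.DataFrame

    // Current column order of the DataFrame
    val columns: Array[String] = dataFrame.columns
    val reorderedColumnNames: Array[String] = ??? // do the reordering you want
    // select(String, String*) returns a new DataFrame with the columns in the given order
    val result: DataFrame = dataFrame.select(reorderedColumnNames.head, reorderedColumnNames.tail: _*)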
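
As an illustration of the reordering step that the `???` placeholder leaves open, here is a minimal sketch in plain Scala that works only on the array of column names. The object `ColumnReorder` and its helpers `moveAfter` and `alignTo` are hypothetical names introduced for this example, not part of Spark; the sketch also assumes column names are unique and compared case-sensitively.

```scala
// Hypothetical helpers (not part of Spark): they only rearrange column names.
object ColumnReorder {

  /** Returns `columns` with `name` moved to sit directly after `anchor`. */
  def moveAfter(columns: Array[String], name: String, anchor: String): Array[String] = {
    require(columns.contains(name), s"unknown column: $name")
    require(columns.contains(anchor), s"unknown column: $anchor")
    require(name != anchor, "name and anchor must differ")
    // Drop the column being moved, then split right after the anchor.
    val rest = columns.filterNot(_ == name)
    val (before, after) = rest.splitAt(rest.indexOf(anchor) + 1)
    (before :+ name) ++ after
  }

  /** Returns the names in `target` order; fails if the two sets of names differ. */
  def alignTo(columns: Array[String], target: Seq[String]): Array[String] = {
    require(columns.toSet == target.toSet, "column sets differ")
    target.toArray
  }
}

// Assumed usage with the DataFrame from the snippet above:
// val reordered = ColumnReorder.moveAfter(dataFrame.columns, "field2", "field3")
// val result = dataFrame.select(reordered.head, reordered.tail: _*)
```

For the destination-table case in the question, `alignTo` could be given the column names of the target schema, so that a missing or unexpected column is reported before the `select` runs.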

answered Sep 28 '22 11:09

Tzach Zohar
