How to get all the column names in a Spark DataFrame into a Seq variable.
Input Data & Schema
val dataset1 = Seq(("66", "a", "4"), ("67", "a", "0"), ("70", "b", "4"), ("71", "d", "4")).toDF("KEY1", "KEY2", "ID")
dataset1.printSchema()
root
|-- KEY1: string (nullable = true)
|-- KEY2: string (nullable = true)
|-- ID: string (nullable = true)
I need to store all the column names in a variable using Scala. I have tried the following, but it's not working.
val selectColumns = dataset1.schema.fields.toSeq
selectColumns: Seq[org.apache.spark.sql.types.StructField] = WrappedArray(StructField(KEY1,StringType,true),StructField(KEY2,StringType,true),StructField(ID,StringType,true))
Expected output:
val selectColumns = Seq(
col("KEY1"),
col("KEY2"),
col("ID")
)
selectColumns: Seq[org.apache.spark.sql.Column] = List(KEY1, KEY2, ID)
You can get all the columns of a Spark DataFrame by using df.columns, which returns the column names as an Array[String].
To convert the values of a Spark DataFrame column to a List, first select() the column you want, then use the map() transformation to convert each Row to a String, and finally collect() the data to the driver, which returns an Array[String].
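For example, here is a minimal sketch of that select/map/collect pattern against the dataset1 DataFrame above (the variable name key1Values is just illustrative; map on a DataFrame needs the implicit Encoders from spark.implicits._):
import spark.implicits._
// Select the KEY1 column, convert each Row to a String,
// and collect the values back to the driver.
val key1Values: Array[String] = dataset1.select("KEY1").map(_.getString(0)).collect()
// key1Values: Array[String] = Array(66, 67, 70, 71)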
In PySpark, the select() function is used to select a single column, multiple columns, columns by index, all columns from a list, or nested columns from a DataFrame. select() is a transformation, so it returns a new DataFrame containing the selected columns.
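The Scala API works the same way; as a quick sketch, select() accepts either column names or Column objects (the col function comes from org.apache.spark.sql.functions):
import org.apache.spark.sql.functions.col
val byName = dataset1.select("KEY1", "KEY2")           // select by column name
val byColumn = dataset1.select(col("KEY1"), col("ID")) // select by Column object
Both calls return a new DataFrame with only the selected columns.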
You can use the following command:
val selectColumns = dataset1.columns.toSeq
scala> val dataset1 = Seq(("66", "a", "4"), ("67", "a", "0"), ("70", "b", "4"), ("71", "d", "4")).toDF("KEY1", "KEY2", "ID")
dataset1: org.apache.spark.sql.DataFrame = [KEY1: string, KEY2: string ... 1 more field]
scala> val selectColumns = dataset1.columns.toSeq
selectColumns: Seq[String] = WrappedArray(KEY1, KEY2, ID)
If you need a Seq[org.apache.spark.sql.Column] instead, as in the expected output, map each column name to a Column (this requires importing the col function):
import org.apache.spark.sql.functions.col
val selectColumns = dataset1.columns.toList.map(col(_))
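As a usage sketch, assuming the goal is to pass these columns back into a query, the resulting Seq[Column] can be expanded into select() with : _*:
val result = dataset1.select(selectColumns: _*)
result.show()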