I want to add a new map-type column to a DataFrame, like this:
|-- cMap: map (nullable = true)
| |-- key: string
| |-- value: string (valueContainsNull = true)
I tried the code:
df.withColumn("cMap", lit(null).cast(MapType)).printSchema
The error is:
<console>:132: error: overloaded method value cast with alternatives:
(to: String)org.apache.spark.sql.Column <and>
(to: org.apache.spark.sql.types.DataType)org.apache.spark.sql.Column
cannot be applied to (org.apache.spark.sql.types.MapType.type)
Is there another way to cast the new column to Map or MapType? Thanks
Using Spark DataTypes: we can create a map type with the createMapType() function on the DataTypes class. This method takes two arguments, keyType and valueType, and both must be types that extend DataType.
In PySpark, MapType is the data type used to represent a Python dictionary (dict) of key-value pairs. A MapType object comprises three fields: keyType (a DataType), valueType (a DataType), and valueContainsNull (a boolean).
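For reference, the asker's error occurs because cast was handed the MapType companion object rather than a constructed instance. A minimal Scala sketch of both construction routes described above, reusing the asker's df:

import org.apache.spark.sql.types.{DataTypes, MapType, StringType}
import org.apache.spark.sql.functions.lit

// Build an explicit MapType instance; valueContainsNull defaults to true
val mapSchema = MapType(StringType, StringType, valueContainsNull = true)
// Equivalent, via the DataTypes factory class:
// val mapSchema = DataTypes.createMapType(StringType, StringType)

df.withColumn("cMap", lit(null).cast(mapSchema)).printSchema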
In PySpark, the withColumn() function is a widely used DataFrame transformation: it can change a column's value, convert the data type of an existing column, create a new column, and more, as sketched below.
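A short Scala sketch of those withColumn() uses, assuming a hypothetical df with a numeric amount column:

import org.apache.spark.sql.functions.col

// Derive a new column from an existing one
val taxed = df.withColumn("amountWithTax", col("amount") * 1.08)
// Convert the data type of an existing column in place
val asString = df.withColumn("amount", col("amount").cast("string"))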
Solution: PySpark provides a create_map() function that takes a list of columns as arguments and returns a MapType column, so we can use it to convert a DataFrame struct column to a map. struct is a StructType column, while MapType stores dictionary-style key-value pairs.
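In Scala, the counterpart of PySpark's create_map() is the map() function in org.apache.spark.sql.functions. A sketch of the struct-to-map conversion, assuming a hypothetical struct column named properties with string fields eye and hair:

import org.apache.spark.sql.functions.{col, lit, map}

// Flatten each struct field into an explicit key/value pair
val converted = df.withColumn(
  "propertiesMap",
  map(
    lit("eye"), col("properties.eye"),
    lit("hair"), col("properties.hair")
  )
).drop("properties")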
I had the same problem; I finally found a solution:
df.withColumn("cMap", typedLit(Map.empty[String, String]))
From the ScalaDocs for typedLit:
The difference between this function and [[lit]] is that this function can handle parameterized scala types e.g.: List, Seq and Map.
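To illustrate, a small sketch of values typedLit accepts that lit cannot handle (column names hypothetical):

import org.apache.spark.sql.functions.typedLit

df.withColumn("cMap", typedLit(Map("key" -> "value")))  // map<string,string>
df.withColumn("cSeq", typedLit(Seq(1, 2, 3)))           // array<int>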