How to concatenate a string to a column in Spark?

Tags: concatenation, scala, apache-spark, apache-spark-sql

I have the following data and would like to add a text prefix to one of its columns:

Input dataframe:

sk            id       
2306220722    117738

Current code:

df.withColumn("Remarks", concat_ws("MCA", col("ID")))

Expected output:

sk           id      Remarks  
2306220722   117738  MCA 117738

I would like to prefix the id column with "MCA" and add the resulting string to the Remarks column.

270

asked Feb 07 '18 02:02

Rjj

1 Answer

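Use the concat function together with lit. lit takes a literal value (a string, a double, etc.) and produces a column containing only that value. The original attempt fails because the first argument of concat_ws is the separator placed between the remaining values; with a single column there is nothing to separate, so "MCA" never appears in the result.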

import org.apache.spark.sql.functions.{col, concat, lit}

val df2 = df.withColumn("Remarks", concat(lit("MCA "), col("id")))

Using the example dataframe in the question and running df2.show() gives

+----------+------+----------+
|        sk|    id|   Remarks|
+----------+------+----------+
|2306220722|117738|MCA 117738|
+----------+------+----------+
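
For comparison, here is a minimal self-contained sketch that rebuilds the one-row dataframe from the question and produces the Remarks column both with concat and with concat_ws. The object name PrefixExample, the helper names withConcat and withConcatWs, the local SparkSession settings, and the column types (Long for sk, Int for id) are assumptions made for illustration; the explicit cast to string is there only so the example does not rely on implicit type coercion.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{col, concat, concat_ws, lit}

object PrefixExample {
  // Hypothetical helper: concat joins its arguments with no separator,
  // so the space has to be part of the literal.
  def withConcat(df: DataFrame): DataFrame =
    df.withColumn("Remarks", concat(lit("MCA "), col("id").cast("string")))

  // Hypothetical helper: concat_ws takes the separator first, then the values to join.
  def withConcatWs(df: DataFrame): DataFrame =
    df.withColumn("Remarks", concat_ws(" ", lit("MCA"), col("id").cast("string")))

  def main(args: Array[String]): Unit = {
    // Local session for illustration only; app name and master are assumptions.
    val spark = SparkSession.builder()
      .appName("prefix-example")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Rebuild the single row from the question (sk does not fit in an Int).
    val df = Seq((2306220722L, 117738)).toDF("sk", "id")

    withConcat(df).show()
    withConcatWs(df).show()

    spark.stop()
  }
}
```

The two variants differ in how they treat nulls: concat returns null if any argument is null, while concat_ws skips null values, so pick the one whose behavior suits your data.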

114

answered Oct 05 '22 15:10

Shaido
