In SQL, I can easily update a column value using UPDATE. For example, I have a table (student) like:
student_id  grade  new_student_id
123         B      234
555         A      null
UPDATE Student
SET student_id = new_student_id
WHERE new_student_id IS NOT NULL
How can I do the same in Spark using Spark SQL (PySpark)?
The withColumn() function of the DataFrame is used to update the value of a column. withColumn() takes two arguments: the name of the column you want to update, and the expression producing the new value.
Spark SQL now also supports UPDATE, DELETE, and similar data-modification operations, provided the underlying table is stored in Delta Lake format.
In PySpark, to add a new column with a constant value to a DataFrame, use the lit() function (from pyspark.sql.functions import lit). lit() takes a constant value and returns a Column; to add a NULL / None value, use lit(None).
To replace values inside an existing string column, regexp_replace is one of the easiest methods: for example, it can replace every occurrence of "a" with a zero. For plain character-for-character substitution where no regular expression is needed, the translate function is recommended. Neither approach renames the column or changes its data type.
While working with Spark DataFrames, many common operations may return null values in some of the records. Later operations can then fail when they encounter those null/empty values, so we often have to replace them before continuing to process the DataFrame.
You can use withColumn to overwrite the existing new_student_id column, keeping its original value when it is not null and otherwise substituting the value from the student_id column:
from pyspark.sql.functions import col, when

# Create sample data
students = spark.createDataFrame(
    [(123, 'B', 234), (555, 'A', None)],
    ['student_id', 'grade', 'new_student_id'])

# Use student_id when new_student_id is not populated
cleaned = students.withColumn(
    "new_student_id",
    when(col("new_student_id").isNull(), col("student_id"))
    .otherwise(col("new_student_id")))

cleaned.show()
Using your sample data as input:
+----------+-----+--------------+
|student_id|grade|new_student_id|
+----------+-----+--------------+
| 123| B| 234|
| 555| A| null|
+----------+-----+--------------+
the output data looks as follows:
+----------+-----+--------------+
|student_id|grade|new_student_id|
+----------+-----+--------------+
| 123| B| 234|
| 555| A| 555|
+----------+-----+--------------+