How to rename a column in Databricks

Tags: databricks, delta-lake

How do you rename a column in Databricks?

The following does not work:

ALTER TABLE mySchema.myTable CHANGE COLUMN old_name new_name INT

It returns the error:

ALTER TABLE CHANGE COLUMN is not supported for changing column 'old_name' with type 'IntegerType (nullable = true)' to 'new_name' with type 'IntegerType (nullable = true)';

If it makes a difference, this table is using Delta Lake, and it is NOT partitioned or z-ordered by this "old_name" column.

asked Dec 26 '19 17:12

David Maddox

1 Answer

By default, you can't rename a column or change its data type in place on a Delta table in Databricks; you can only add new columns, reorder them, or add column comments. To rename a column, rewrite the table with the renamed column and the overwriteSchema option set.
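
One exception, offered as a sketch rather than a definitive recipe: on Databricks Runtime 10.2 and above, a Delta table with column mapping enabled accepts ALTER TABLE ... RENAME COLUMN. The helper below only builds the two SQL statements involved. `rename_column_statements` is a hypothetical name, and the table and column names are the ones from the question. Enabling column mapping upgrades the table protocol, which older readers may not be able to handle, so check that before running anything.

```python
def rename_column_statements(table, old_name, new_name):
    """Build the SQL for a metadata-only rename via Delta column mapping.

    Assumption: names are plain identifiers; no quoting or escaping is done.
    """
    # Step 1: enable column mapping (this upgrades the table protocol).
    enable_mapping = (
        f"ALTER TABLE {table} SET TBLPROPERTIES ("
        "'delta.minReaderVersion' = '2', "
        "'delta.minWriterVersion' = '5', "
        "'delta.columnMapping.mode' = 'name')"
    )
    # Step 2: rename the column without rewriting the data files.
    rename = f"ALTER TABLE {table} RENAME COLUMN {old_name} TO {new_name}"
    return [enable_mapping, rename]


# In a Databricks notebook, where `spark` is predefined, this could be run as:
# for statement in rename_column_statements("mySchema.myTable", "old_name", "new_name"):
#     spark.sql(statement)
```

If upgrading the table protocol is not an option, the rewrite approach still applies.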

Take this example from the documentation:

(spark.read.table(...)
  .withColumnRenamed("date", "date_created")
  .write
  .mode("overwrite")
  .option("overwriteSchema", "true")
  .saveAsTable(...)
)

answered Oct 19 '22 02:10

LeandroHumb
