How to append to a csv file using df.write.csv in pyspark?

Tags:

I'm trying to append data to my csv file using df.write.csv. This is what I did after following spark document http://spark.apache.org/docs/2.0.1/api/python/pyspark.sql.html#pyspark.sql.DataFrameWriter:

from pyspark.sql import DataFrameWriter
.....
df1 = sqlContext.createDataFrame(query1)
df1.write.csv("/opt/Output/sqlcsvA.csv", append) #also tried 'mode=append'

Executing the above code gives me error:

NameError: name 'append' not defined

Without append, error:

The path already exists.

483

asked Dec 19 '16 07:12

kavya

1 Answers

df.write.save(path='csv', format='csv', mode='append', sep='\t')

answered Sep 25 '22 22:09

Zhang Tong

Related questions
                            
                                Advantage of setting name to RDD
                            
                                Copy schema from one dataframe to another dataframe
                            
                                In Apache Spark 2.0.0, is it possible to fetch a query from an external database (rather than grab the whole table)?
                            
                                check if a row value is null in spark dataframe
                            
                                Replace all ":" with "_" in Spark dataframe [duplicate]
                            
                                Querying json object in dataframe using Pyspark
                            
                                Scala & Spark: Cast multiple columns at once
                            
                                How to parse CSV file with UTF-8 encoding?
                            
                                Spark on YARN + Secured hbase
                            
                                How to use --num-executors option with spark-submit?
                            
                                How to Generate Parquet File Using Pure Java (Including Date & Decimal Types) And Upload to S3 [Windows] (No HDFS)
                            
                                Pyspark 'NoneType' object has no attribute '_jvm' error
                            
                                DataFrame object has no attribute 'col'
                            
                                Pandas scalar UDF failing, IllegalArgumentException
                            
                                Storing a Graph in Spark Graphx with HDFS
                            
                                Apache Spark Exception in thread "main" java.lang.NoClassDefFoundError: scala/collection/GenTraversableOnce$class
                            
                                How can I change spark ui port?
                            
                                Spark ALS predictAll returns empty
                            
                                withColumn not allowing me to use max() function to generate a new column
                            
                                how to join two DataFrame and replace one column conditionally in spark

How to append to a csv file using df.write.csv in pyspark?

Tags:

apache-spark

pyspark

kavya

People also ask

1 Answers

Zhang Tong

Recent Activity

Donate For Us