Saving dataframe records in a tab delimited file

Question

How can I save the records of a DataFrame to a tab-delimited output file? The DataFrame looks like this:

>>> csvDf.show(2,False)

|1  |Eldon Base for stackable storage shelf, platinum  |Muhammed MacIntyre|3  |-213.25|38.94 |35   |Nunavut|Storage & Organization   |0.8 |
|2  |1.7 Cubic Foot Compact "Cube" Office Refrigerators|Barry French      |293|457.81 |208.16|68.02|Nunavut|Appliances               |0.58|

Alper t. Turker · Accepted Answer

Just pass the delimiter option to the writer:

csvDf.write.option("delimiter", "\t").csv(output_path)

In Spark 1.6, use the spark-csv package (check its README for detailed instructions) with the same option:

csvDf.write.option("delimiter", "\t").format("com.databricks.spark.csv").save(output_path)

dripp · Answer

In Spark 2.4.3 it is:

(csvDf
    .write
    .option("sep", "\t")
    .option("encoding", "UTF-8")
    .csv(targetFilePath))
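
As a rough sketch of how the writer options above can be bundled together, the helper below wraps them in one function. The name `write_tsv` and its `header` parameter are hypothetical and not part of Spark; only `option`, `mode`, and `csv` are DataFrameWriter calls. It assumes Spark 2.x or later and that overwriting an existing output directory is acceptable.

```python
def write_tsv(df, output_path, header=True):
    """Write a Spark DataFrame as tab-delimited text (hypothetical helper)."""
    (
        df.write
        .option("sep", "\t")       # "delimiter" is accepted as an alias for "sep"
        .option("header", header)  # emit column names as the first line
        .mode("overwrite")         # assumption: replacing existing output is fine
        .csv(output_path)
    )
    return output_path
```

Calling `write_tsv(csvDf, output_path)` produces a directory of part files at `output_path` rather than a single file, which is how Spark writers normally behave.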

Suraj · Answer

This worked for me:

csvDf.rdd.map(lambda x: '\t'.join(str(c) for c in x)).coalesce(1).saveAsTextFile('/output/csv/6.csv')
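
If you take the RDD route, the row formatting may be easier to maintain as a named function. The sketch below is one possible version: `row_to_tsv_line` and its `null_value` parameter are hypothetical names, and replacing embedded tabs and newlines with spaces is an assumption (quoting the field would be another option).

```python
def row_to_tsv_line(row, null_value=""):
    """Join one row's fields with tabs, converting every field to text."""
    fields = []
    for value in row:
        text = null_value if value is None else str(value)
        # A tab or newline inside a field would break the column layout,
        # so replace it with a space (assumption; see note above).
        fields.append(text.replace("\t", " ").replace("\n", " "))
    return "\t".join(fields)
```

It can then be used in place of the lambda, for example `csvDf.rdd.map(row_to_tsv_line).coalesce(1).saveAsTextFile('/output/csv/6.csv')`. Unlike the writer-based answers, this approach does not emit a header line.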

Tags: apache-spark, pyspark
