How to add a unique id column to a DataFrame, Apache Spark, Scala

Question

I have a DataFrame, that i want to join with another Dataframe, and then group by original rows, but the original rows do not have a unique id. How can i add a unique id or otherwise accomplish that goal.

Tawkir · Accepted Answer

You can use monotonically_increasing_id

import org.apache.spark.sql.functions._
val unique_df = original_df.withColumn("UniqueID", monotonically_increasing_id)

How to add a unique id column to a DataFrame, Apache Spark, Scala

Tags:

scala

apache-spark

apache-spark-sql

qonf

1 Answers

Tawkir

Recent Activity

Donate For Us

How to add a unique id column to a DataFrame, Apache Spark, Scala

Tags:

scala

apache-spark

apache-spark-sql

qonf

1 Answers

Tawkir

Related questions

Recent Activity

Donate For Us