How do I convert column of unix epoch to Date in Apache spark DataFrame using Java?

I have a JSON data file which contains a property [creationDate] that is a Unix epoch in "long" number type. The Apache Spark DataFrame schema looks like this:

root 
 |-- creationDate: long (nullable = true) 
 |-- id: long (nullable = true) 
 |-- postTypeId: long (nullable = true)
 |-- tags: array (nullable = true)
 |    |-- element: string (containsNull = true)
 |-- title: string (nullable = true)
 |-- viewCount: long (nullable = true)

I would like to do a groupBy on "creationDate_Year", which needs to be derived from "creationDate".

What's the easiest way to do this kind of conversion in a DataFrame using Java?

asked Jan 06 '16 by ErhWen Kuo

People also ask

How do I change my epoch time to date on Spark?

The from_unixtime() SQL function converts (casts) an epoch time to a timestamp string; it takes the epoch time as its first argument and a time format string as its second. To get the current time, unix_timestamp(), which returns the current timestamp as epoch seconds (a Long), can be passed as the first argument.
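For example, a minimal Java sketch using the Spark 1.x-era API seen elsewhere on this page; the DataFrame df and its epoch-seconds column "epochSec" are hypothetical names used only for illustration:

import static org.apache.spark.sql.functions.*;

// from_unixtime() expects epoch seconds and returns a formatted timestamp string.
DataFrame withTs = df.withColumn("createdAt", from_unixtime(col("epochSec"), "yyyy-MM-dd HH:mm:ss"));

// unix_timestamp() with no arguments yields the current time as epoch seconds.
DataFrame withNow = df.withColumn("now", from_unixtime(unix_timestamp()));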

How do I change my epoch value to date?

To convert from an epoch to a human-readable date in plain Java:

String date = new java.text.SimpleDateFormat("MM/dd/yyyy HH:mm:ss").format(new java.util.Date(epoch * 1000));

Here the epoch is in seconds, so it is multiplied by 1000 because java.util.Date expects milliseconds; remove the * 1000 if the value is already in milliseconds. (The Delphi equivalent is myString := DateTimeToStr(UnixToDateTime(Epoch));, where Epoch is a signed integer.)

How do I change the datatype of a column in spark data frame?

To change a Spark SQL DataFrame column from one data type to another, use the cast() function of the Column class; it works inside withColumn(), select(), selectExpr(), and SQL expressions.
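For instance, a small sketch in Java; the DataFrame df and its string column "viewCount" are assumed purely for illustration:

import static org.apache.spark.sql.functions.col;

// cast() changes the column type; here a string column is converted to long.
DataFrame casted = df.withColumn("viewCount", col("viewCount").cast("long"));
// The same expression works inside select(): df.select(col("viewCount").cast("long"));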


2 Answers

After checking the Spark DataFrame API and SQL functions, I came up with the snippet below:

import static org.apache.spark.sql.functions.from_unixtime;

DataFrame df = sqlContext.read().json("MY_JSON_DATA_FILE");

DataFrame df_DateConverted = df.withColumn("creationDt", from_unixtime(df.col("creationDate").divide(1000)));

The "creationDate" column is divided by 1000 because the time units differ: the original "creationDate" is a Unix epoch in milliseconds, whereas Spark SQL's from_unixtime expects a Unix epoch in seconds.
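As a follow-up (not part of the original answer), here is a sketch of how the year could then be extracted for the groupBy mentioned in the question, assuming the same df_DateConverted from above:

import static org.apache.spark.sql.functions.*;

// year() reads the year out of the converted timestamp,
// which can then drive the groupBy on "creationDate_Year".
DataFrame byYear = df_DateConverted
    .withColumn("creationDate_Year", year(col("creationDt")))
    .groupBy("creationDate_Year")
    .count();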

answered by ErhWen Kuo


In PySpark, the same conversion from Unix epoch milliseconds to a DataFrame timestamp looks like this:

from pyspark.sql.functions import from_unixtime

df.select(from_unixtime((df.my_date_column.cast('bigint') / 1000)).cast('timestamp').alias('my_date_column'))
answered by Ray Metz