How to decompress the hadoop reduce output file end with snappy?

Question

Our hadoop cluster using snappy as default codec. Hadoop job reduce output file name is like part-r-00000.snappy. JSnappy fails to decompress the file bcz JSnappy requires the file start with SNZ. The reduce output file start with some bytes 0 somehow.

How could I decompress the file?

arviarya · Accepted Answer

Use "Hadoop fs -text" to read this file and pipe it to txt file. ex:

hadoop fs -text part-r-00001.snappy > /tmp/mydatafile.txt

How to decompress the hadoop reduce output file end with snappy?

Tags:

hadoop

snappy

DeepNightTwo

1 Answers

arviarya

Recent Activity

Donate For Us

How to decompress the hadoop reduce output file end with snappy?

Tags:

hadoop

snappy

DeepNightTwo

1 Answers

arviarya

Related questions

Recent Activity

Donate For Us