Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

GlusterFS or Ceph as backend for Hadoop

hadoop ceph glusterfs

Spark + Scala transformations, immutability & memory consumption overheads

scala hadoop apache-spark

Difference between 'distcp' and 'distcp -update'?

hadoop mapreduce hdfs

Filter a string on the basis of a word

hadoop apache-pig

How can I concatenate two files in hadoop into one using Hadoop FS shell?

shell hadoop concatenation

What does CPU Time for a Hadoop Job signify?

hadoop timing benchmarking

How to pull data from Mainframe to Hadoop

hadoop mainframe

Failed to set permissions of path: \tmp

hadoop

Apache hive MSCK REPAIR TABLE new partition not added

How to save Spark RDD to local filesystem

Will Spark SQL completely replace Apache Impala or Apache Hive?

CASE statements in Hive

sql hadoop hive case hiveql

how to handle millions of smaller s3 files with apache spark

Where is Sort used in MapReduce phase and why?

hadoop mapreduce

hadoop's datanode is not starting

hadoop installation

How to convert a Date String from UTC to Specific TimeZone in HIVE?

Hive 1.1.0 Alter table partition type from int to string

hadoop hive partitioning ddl

Cannot connect to hive using beeline, user root cannot impersonate anonymous

hadoop hive beeline

Efficient and scalable storage for JSON data with NoSQL databases

Hadoop dfs replicate

hadoop hdfs