Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop-partitioning

How to add an hard disk to hadoop

Sqoop import : composite primary key and textual primary key

Specify minimum number of generated files from Hive insert

How does the HDFS Client knows the block size while writing?

Spark Partitionby doesn't scale as expected

Windowing function in Hive

Hadoop - Produce multiple values for a single key

Hive Partition recovery

How to check specific partition data from Spark partitions in Pyspark

HDINSIGHT hive, MSCK REPAIR TABLE table_name throwing error

hive hadoop-partitioning

Hadoop fs -du-h sorting by size for M, G, T, P, E, Z, Y

Can I cluster by/bucket a table created via "CREATE TABLE AS SELECT....." in Hive?

Spark: can you include partition columns in output files?

How the data is split in Hadoop

In Apache Spark, why does RDD.union not preserve the partitioner?