Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

Executing Sqoops using Oozie

hadoop sqoop oozie

scala filename too long

scala hadoop scalding

HBase regions automatic splitting using hbase.hregion.max.filesize

hadoop split hbase region

Passing parameter to sqoop job

hadoop hive sqoop

Find out actual disk usage in HDFS

hadoop hdfs bigdata diskspace

get size of parquet file in HDFS for repartition with Spark in Scala

Presto and hive partition discovery

hadoop amazon-s3 hive presto

How Hadoop -getmerge works?

Reading Avro file gives AvroTypeException: missing required field error (even though the new field is declared null in schema)

java hadoop avro

Relationship between Hive and Hadoop MapReduce?

hadoop hive mapreduce hdfs

Spark: grouping rows in array by key

scala hadoop apache-spark

Authentication for Spark standalone cluster

Unable to run yarn during hadoop installation

hadoop hdfs hadoop-yarn

How do I fix "File could only be replicated to 0 nodes instead of minReplication (=1)."?

Does throwing an exception in an EvalFunc pig UDF skip just that line, or stop completely?

hadoop apache-pig

ERROR: org.apache.hadoop.hbase.MasterNotRunningException: null+hbase+hadoop

hadoop hbase

Ubuntu cluster management

Why do we need to set the output key/value class explicitly in the Hadoop program?

class input hadoop

Hadoop MapReduce intermediate output

logging hadoop mapreduce

Can Hadoop distribute tasks and code base?

hadoop distributed hdfs