Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

Optimal block size in HDFS - Can large block sizes hurt

hadoop hdfs

Running Hadoop Job Remotely

hadoop

Delta/Incremental Load in Hive

hadoop hive sqoop hiveql

What does AM in "AM Container" mean in Hadoop

hadoop mapreduce hbase

Hadoop: Unable to load native-hadoop library for your platform

java hadoop

Configured the HA Cluster with Hive-2.0.1(Derby Support) shows redundant database names?

hadoop hive derby

Amazon EMR: running Custom Jar with input and output from S3

How a Spark executor runs multiple tasks?

Connecting to Hive using python's Jaydebeapi

python hadoop hive

Hive query too slow and failed

hadoop hive hadoop-yarn tez

JSON object spans multiple lines, How to split input in Hadoop

java json hadoop elephantbird

In Hadoop Map-Reduce, does any class see the whole list of keys after sorting and before partitioning?

Chaining Multi-Reducers in a Hadoop MapReduce job

java hadoop mapreduce

R+Hadoop: How to read CSV file from HDFS and execute mapreduce?

r hadoop rhadoop

Processing images using hadoop

hadoop/yarn and task parallelization on non-hdfs filesystems

hadoop hdfs hpc hadoop2

Error on running multiple Workflow in OOZIE-4.1.0

JAVA_HOME error with upgrade to Spark 1.3.0

java scala hadoop apache-spark

How wordCount mapReduce jobs, run on hadoop yarn cluster with apache tez?

Is it possible to read and write Parquet using Java without a dependency on Hadoop and HDFS?