Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

spark: SAXParseException while writing to parquet on s3

How does back pressure property work in Spark Streaming?

YARN: Containers and JVM

Spark Shell with Yarn - Error: Yarn application has already ended! It might have been killed or unable to launch application master

Spring-Batch for a massive nightly / hourly Hive / MySQL data processing

Problem starting tasktracker in hadoop under windows

java windows hadoop mapreduce

Running Hadoop MapReduce, is it possible to call external executables outside of HDFS

hadoop mapreduce hdfs

How to pull data in the Map/Reduce functions?

hadoop mapreduce pull

Installing PIG on single node

hadoop apache-pig

What is mean by implementing a advanced job control framework to help chain multiple Map-Reduce jobs?

hadoop mapreduce oozie

Distributing Data Nodes Across Multiple Data Centers

Missing Hive Execution Jar: /usr/local/hadoop/hive/lib/hive-exec-*.jar

pig to hadoop issue: Server IPC version 7 cannot communicate with client version 4

hadoop apache-pig

Impala cannot find com.mysql.jdbc.Driver

hadoop hive cloudera impala

MapReduce job in headless environment fails N times due to AM Container exception from container-launch

java macos hadoop headless

JVM crashes with no frame specified, only "timer expired, abort"

How to insert data into Parquet table in Hive

hadoop hive parquet

hdfs log file is too huge

hadoop hdfs

Cannot validate serde : org.openx.data.jsonserde.jsonserde

java json hadoop hive

Resources/Documentation on how does the failover process work for the Spark Driver (and its YARN Container) in yarn-cluster mode