Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

modify hive query to force more than 1 reducer

sql hadoop hive bigdata

How to save numpy array from PySpark worker to HDFS or shared file system?

Get list of files from hdfs (hadoop) directory using python script

YARN REST API - Spark job submission

[Simba][ImpalaJDBCDriver](500051) ERROR processing query/statement

hadoop jdbc cloudera impala

hadoop user file permissions

Writing file to HDFS using Java

java hadoop apache-spark

Monitor a cluster of nodes

Running EMR example, getting 301 Error

how to create a symlink on a hdfs cluster?

hadoop hdfs symlink

Hive, Beeline: Peer indicated failure: GSS initiate failed

hadoop hive

org.datanucleus.exceptions.NucleusUserException: Error : Could not find API definition for name "JDO"

Physical memory usage keeps increasing for Spark application on YARN

Spark-submit how to set the user.name

hadoop apache-spark hadoop2

Running tensorflow with file on HDFS (cannot find libhdfs.so)

python hadoop tensorflow

Hadoop streaming job using Mxnet failing in AWS Emr

Hive: Unable to insert data in table with 100 or more partition columns Error: in column "PART_NAME" that has maximum length of 767

hadoop hive cloudera

java.io.InvalidClassException: org.apache.spark.internal.io.HadoopMapReduceCommitProtocol; local class incompatible

Ingest log files from edge nodes to Hadoop

Java Read Parquet File to JSON Output