Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop

Hadoop Namenode Metadata - fsimage and edit logs

memory hadoop metadata

What is an efficient way of running a logistic regression for large data sets (200 million by 2 variables)?

python r matlab hadoop stata

Map Reduce Slot Definition

read json key-values with hive/sql and spark

Export HDFS file with custom delimiter into Mysql via Sqoop

mysql hadoop hdfs sqoop

How can I use Oozie workflow configuration property in the workflow itself?

hadoop hive oozie

Remove directory level when transferring from HDFS to S3 using S3DistCp

Why HDFS not preferred with applications that require low latency?

hadoop apache-spark hdfs hawq

java.lang.VerifyError with Hadoop

java hadoop

Hadoop on Windows. YARN fails to start with java.lang.UnsatisfiedLinkError

hadoop hadoop-yarn

Hdfs file timestamp

datetime hadoop hdfs

How to append ORC file

java hadoop hive orc

YARN shell command to get number of containers and vcores used by running applications

hadoop hadoop-yarn

Unable to connect to HIVE2 via JAVA

java hadoop jdbc hive hiveql

Accessing hadoop from remote machine

java hadoop

Using Spark for sequential row-by-row processing without map and reduce

hadoop apache-spark pyspark

How to use Mahout in a Windows environment?

windows cygwin hadoop mahout

Java or Python distributed compute job (on a student budget)?

java python nlp hadoop nltk

Can I get invidually sorted Mapper outputs from Hadoop when using zero Reducers?

hadoop mapreduce

Viewing the number of blocks for a file in hadoop

hadoop hdfs