Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in hadoop-streaming

Processing images using hadoop

Pass directories not files to hadoop-streaming?

hadoop hadoop-streaming

What is the difference between Rack-local map tasks and Data-local map tasks?

Python hadoop streaming : Setting a job name

How to get the name of input file in MRjob

How to use a file in a hadoop streaming job using python?

How to set the precise max number of concurrently running tasks per node in Hadoop 2.4.0 on Elastic MapReduce

How to read hadoop sequential file?

Using python efficiently to calculate hamming distances [closed]

Hadoop: job runs okay on smaller set of data but fails with large dataset

Amazon MapReduce best practices for logs analysis

Hadoop is not showing my job in the job tracker even though it is running

Hadoop streaming - remove trailing tab from reducer output

hadoop hadoop-streaming

Are there any distributed machine learning libraries for using Python with Hadoop? [closed]

Hadoop Java Error : Exception in thread "main" java.lang.NoClassDefFoundError: WordCount (wrong name: org/myorg/WordCount)

How do I pass a parameter to a python Hadoop streaming job?

stateful and stateless streaming processing

Hadoop streaming with C# and Mono : IdentityMapper being used incorrectly

c# mono hadoop-streaming

How to import a custom module in a MapReduce job?