hadoop-streaming tutorials

How to get s3distcp to merge with newlines

Nov 02, 2022

How to do Mapper testing using MRUnit Test?

Sep 25, 2022

java eclipse hadoop hadoop-streaming mrunit

os.environ['mapreduce_map_input_file'] doesn't work

Mar 26, 2022

mapreduce hadoop-streaming

Python Hadoop streaming on windows, Script not a valid Win32 application

Apr 15, 2021

python windows hadoop mapreduce hadoop-streaming

Load snappy-compressed files into Elastic MapReduce

Mar 19, 2022

hadoop amazon-web-services compression hadoop-streaming emr

Exception while connecting to mongodb in spark

Apr 23, 2022

mongodb exception hadoop apache-spark hadoop-streaming

Pivot table with Apache Pig

Jun 06, 2018

apache-pig hadoop-streaming

Sorting by value in Hadoop from a file

Nov 08, 2022

java hadoop hadoop-streaming

How to resolve java.lang.RuntimeException: PipeMapRed.waitOutputThreads(): subprocess failed with code 2?

Jun 15, 2022

hadoop nltk hadoop-streaming

EMR How to join files into one?

Apr 03, 2022

amazon-s3 amazon-web-services hadoop-streaming amazon-emr emr

How to decide when to use a Map-Side Join or Reduce-Side while writing an MR code in java?

Jun 19, 2022

hadoop mapreduce hadoop-streaming

Hadoop Configuration Error

Oct 01, 2022

java hadoop hadoop-streaming

Hadoop Throws ClassCastException for the keytype of java.nio.ByteBuffer

Feb 26, 2015

hadoop mapreduce bytebuffer hadoop-streaming

Running the Python Code on Hadoop Failed

Aug 26, 2022

python hadoop-streaming

Can I force my reducers (copy phase) to start only when all mappers are completed

Apr 10, 2019

configuration hadoop mapreduce hadoop-streaming

Amazon Elastic MapReduce - SIGTERM

Nov 15, 2022

python hadoop-streaming elastic-map-reduce amazon-emr

Python MapReduce Hadoop Streaming Job that requires multiple input files?

May 17, 2022

python hadoop mapreduce hadoop-streaming

Hive FAILED: ParseException line 2:0 cannot recognize input near ''macaddress'' 'CHAR' '(' in column specification

Sep 09, 2019

hadoop hive hadoop-streaming

hadoop, python, subprocess failed with code 127

Jun 15, 2022

python hadoop mapreduce cloudera hadoop-streaming

POC for Hadoop in real time scenario

Jan 29, 2020

hadoop real-time bigdata hadoop-streaming

New posts in hadoop-streaming