Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in elastic-map-reduce

How to set the precise max number of concurrently running tasks per node in Hadoop 2.4.0 on Elastic MapReduce

Python client support for running Hive on top of Amazon EMR

AWS EMR and Spark 1.0.0

Setting hadoop parameters with boto?

Amazon Elastic Map Reduce - Creating a job flow

parallel generation of random forests using scikit-learn

Elastic Map Reduce: difference between CANCEL_AND_WAIT and CONTINUE?

Broken Pipe Error causes streaming Elastic MapReduce job on AWS to fail

Configuring external data source for Elastic MapReduce

Are there any distributed machine learning libraries for using Python with Hadoop? [closed]

Loading data with Hive, S3, EMR, and Recover Partitions

Re-use Amazon Elastic MapReduce instance

How can I wait for completion of an Elastic MapReduce job flow in a Java application?

Get a yarn configuration from commandline

Drop all partitions from a hive table?

hive elastic-map-reduce

Scheduling A Job on AWS EC2

Slow Performance with Apache Spark Gradient Boosted Tree training runs

Deleting file/folder from Hadoop

Backup AWS Dynamodb to S3