Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in amazon-emr

AWS EMR perform "bootstrap" script on all the already running machines in cluster

Amazon Elastic MapReduce - mass insert from S3 to DynamoDB is incredibly slow

Saving dataframe to local file system results in empty results

apache-spark amazon-emr

Spark on Amazon EMR: "Timeout waiting for connection from pool"

apache-spark amazon-emr

AWS Glue pricing against AWS EMR

How to handle fields enclosed within quotes(CSV) in importing data from S3 into DynamoDB using EMR/Hive

Amazon Emr - What is the need of Task nodes when we have Core nodes?

hadoop hadoop2 amazon-emr

S3 SlowDown error in Spark on EMR

How to tune spark job on EMR to write huge data quickly on S3

Amazon EC2 On-Demand Workers for Short Tasks

Spark 2.0 deprecates 'DirectParquetOutputCommitter', how to live without it?

Any Scala SDK or interface for AWS?

Can we consider AWS Glue as a replacement for EMR?

Does Hive have something equivalent to DUAL?

hadoop hive amazon-emr

'Operation timed out' error on trying to ssh in to the Amazon EMR Spark Cluster

apache-spark ssh amazon-emr

How to configure high performance BLAS/LAPACK for Breeze on Amazon EMR, EC2

Extremely slow S3 write times from EMR/ Spark

AWS VPC identify private and public subnet

How do you make a HIVE table out of JSON data?

json hadoop hive amazon-emr emr

Application report for application_ (state: ACCEPTED) never ends for Spark Submit (with Spark 1.2.0 on YARN)