Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in amazon-emr

AWS Glue pricing against AWS EMR

How to handle fields enclosed within quotes(CSV) in importing data from S3 into DynamoDB using EMR/Hive

Amazon Emr - What is the need of Task nodes when we have Core nodes?

hadoop hadoop2 amazon-emr

S3 SlowDown error in Spark on EMR

How to tune spark job on EMR to write huge data quickly on S3

Amazon EC2 On-Demand Workers for Short Tasks

Spark 2.0 deprecates 'DirectParquetOutputCommitter', how to live without it?

Any Scala SDK or interface for AWS?

Can we consider AWS Glue as a replacement for EMR?

Does Hive have something equivalent to DUAL?

hadoop hive amazon-emr

'Operation timed out' error on trying to ssh in to the Amazon EMR Spark Cluster

apache-spark ssh amazon-emr

How to configure high performance BLAS/LAPACK for Breeze on Amazon EMR, EC2

Extremely slow S3 write times from EMR/ Spark

AWS VPC identify private and public subnet

How do you make a HIVE table out of JSON data?

json hadoop hive amazon-emr emr

Application report for application_ (state: ACCEPTED) never ends for Spark Submit (with Spark 1.2.0 on YARN)

"Container killed by YARN for exceeding memory limits. 10.4 GB of 10.4 GB physical memory used" on an EMR cluster with 75GB of memory