Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How do I submit a Spark jar to a EMR cluster?

I have used the online AWS console to launch my cluster along with Apache Spark. I have a fat jar based on my Spark app and I have uploaded it to a S3 Bucket. When I try to send it as a Step with a Custom Jar, the process fails. Any pointers would be greatly appreciated.

like image 358
dataker Avatar asked Mar 04 '15 16:03

dataker


1 Answers

Use EMR bootstrap to install Spark, and submit the job as described in the documentation: https://github.com/awslabs/emr-bootstrap-actions/blob/master/spark/examples/spark-submit-via-step.md

like image 158
Dan Osipov Avatar answered Sep 28 '22 08:09

Dan Osipov