Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to set spark.driver.memory for Spark/Zeppelin on EMR

When using EMR (with Spark, Zeppelin), changing spark.driver.memory in Zeppelin Spark interpreter settings won't work.

I wonder what is the best and quickest way to set Spark driver memory when using EMR web interface (not aws CLI) to create clusters?

Is Bootstrap action could be a solution? If yes, can you please provide an example of how the bootstrap action file should look like?

like image 240
Rami Avatar asked Nov 28 '17 12:11

Rami


1 Answers

You can always try to add the following configuration on job flow/cluster creation :

[
    {
        "Classification": "spark-defaults",
        "Properties": {
            "spark.driver.memory": "12G"
        }
    }
]

You can do this most of the configurations whether for spark-default, hadoop core-site, etc.

I hope this helps !

like image 157
eliasah Avatar answered Oct 20 '22 19:10

eliasah