The situation is the following:
If I manually intervene and kill the application, the disk space is cleaned up, and I can then restart the application manually and it runs fine.
I wish I could tell the automated retry to clean up the disk first. Alternatively, I suppose it could count that used disk space as part of the new allocation, since it belongs to the application anyway.
I'll happily take any solution you can offer; I don't know much about YARN. It's an Apache Spark application started with spark-submit in yarn-client mode. The files that fill up the disk are the shuffle spill files.
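For reference, a launch in that mode looks roughly like the following (a minimal sketch; the class name and jar path are placeholders, not taken from the question):

```bash
# Minimal sketch of a yarn-client launch.
# com.example.MyApp and myapp.jar are hypothetical placeholders.
# On older Spark versions the equivalent spelling was: --master yarn-client
spark-submit \
  --master yarn \
  --deploy-mode client \
  --class com.example.MyApp \
  myapp.jar
```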
So here's what happens:
The shuffle spill files are written to the executors' YARN local directories, and they won't be deleted on JVM exit when the external shuffle service is in use. Those directories can be cleaned up by setting `yarn.nodemanager.delete.debug-delay-sec=0`, so that the NodeManager deletes an application's local files as soon as the application finishes. Otherwise there is an unresolved YARN bug that can leave the files behind.
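That setting lives in yarn-site.xml on each NodeManager host; a sketch (the NodeManager typically needs a restart for the change to take effect):

```xml
<!-- yarn-site.xml on every NodeManager host.
     With a value of 0, the NodeManager's DeletionService removes an
     application's localized files as soon as the application finishes;
     larger values delay deletion (useful only for debugging). -->
<property>
  <name>yarn.nodemanager.delete.debug-delay-sec</name>
  <value>0</value>
</property>
```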