
Why does Spark job fail with "too many open files"?

Tags:

apache-spark

I get "too many open files" during the shuffle phase of my Spark job. Why is my job opening so many files? What steps can I take to try to make my job succeed.

asked Sep 07 '14 by samthebest

1 Answer

This has been answered on the Spark user list:

The best way is definitely just to increase the ulimit if possible; this is sort of an assumption we make in Spark, that clusters will be able to move it around.

You might be able to hack around this by decreasing the number of reducers [or cores used by each node], but this could have some performance implications for your job.

In general, if a node in your cluster has C assigned cores and you run a job with X reducers, then Spark will open C*X files in parallel and start writing. Shuffle consolidation will help decrease the total number of files created, but the number of file handles open at any time doesn't change, so it won't help the ulimit problem.

-Patrick Wendell
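
For a concrete sense of scale: with C = 8 cores per node and X = 1000 reducers, that is up to 8,000 shuffle files open per node during the shuffle write. Raising the ulimit on each worker (via the OS's limits configuration) is the preferred fix, as the answer says. If that isn't possible, a minimal sketch of the in-job mitigations mentioned above (fewer reducers, plus shuffle consolidation) might look like this with the Spark 1.x API of that era; the input/output paths and the partition count of 64 are placeholders for illustration, not values from the question:

    import org.apache.spark.{SparkConf, SparkContext}

    val conf = new SparkConf()
      .setAppName("shuffle-file-handle-demo")
      // Spark 1.x hash-shuffle setting: consolidate shuffle output files per core.
      // This cuts the total number of files written, though, as noted above,
      // not the number of handles open at once.
      .set("spark.shuffle.consolidateFiles", "true")

    val sc = new SparkContext(conf)

    // Hypothetical input path.
    val lines = sc.textFile("hdfs:///data/input")

    // Fewer reducers means fewer shuffle files open per core (C * X).
    // Here X is explicitly set to 64 as an example instead of the default.
    val counts = lines
      .flatMap(_.split("\\s+"))
      .map(word => (word, 1))
      .reduceByKey(_ + _, 64)

    // Hypothetical output path.
    counts.saveAsTextFile("hdfs:///data/output")
    sc.stop()

Note that consolidation only reduces the total number of files written; as the answer explains, it does not reduce the number of handles open at any one time, so lowering the reducer count (or the cores used per node) is what actually addresses the ulimit error.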

answered Sep 23 '22 by samthebest