Difference between failed tasks and killed tasks

Question

In the JobTracker web UI, I see a column called "Failed/Killed Task Attempts".

I would like to know the distinction between the two. My guess is that "failed" attempts are tasks that ultimately failed after some retries (so no recovery was possible?), while "killed" attempts are tasks that were killed (due to a timeout and so on) but might be retried?

David Gruzman · Accepted Answer

Hadoop can kill a task on its own initiative for several reasons:
a) The task does not report progress within the timeout (10 minutes by default).
b) The FairScheduler or CapacityScheduler needs the slot for another pool (FairScheduler) or queue (CapacityScheduler).
c) Speculative execution makes the task's result unnecessary, because another attempt of the same task has already completed elsewhere.
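
To make reason (a) concrete, here is a minimal sketch of the progress-timeout rule. It is a hypothetical model, not Hadoop's source code: the class `TaskTimeoutSketch` and the method `exceededProgressTimeout` are made up for illustration, and only the 10-minute default comes from the answer above. In Hadoop 1.x that value is set by the `mapred.task.timeout` property (600000 ms).

```java
import java.util.concurrent.TimeUnit;

// Hypothetical model for illustration only; these names do not exist in Hadoop.
final class TaskTimeoutSketch {
    // 10 minutes, the default timeout mentioned in the answer.
    static final long DEFAULT_TIMEOUT_MS = TimeUnit.MINUTES.toMillis(10);

    // True when more than timeoutMs has passed since the attempt last reported progress.
    static boolean exceededProgressTimeout(long lastProgressMs, long nowMs, long timeoutMs) {
        return nowMs - lastProgressMs > timeoutMs;
    }

    public static void main(String[] args) {
        long lastProgress = 0L;
        // 9 minutes of silence: still within the timeout.
        System.out.println(exceededProgressTimeout(
                lastProgress, TimeUnit.MINUTES.toMillis(9), DEFAULT_TIMEOUT_MS));
        // 11 minutes of silence: the attempt would be terminated.
        System.out.println(exceededProgressTimeout(
                lastProgress, TimeUnit.MINUTES.toMillis(11), DEFAULT_TIMEOUT_MS));
    }
}
```

A long-running task can avoid this by reporting progress regularly, so that the time of its last report keeps moving forward.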
