Bank conflict in parallel reduction using interleaved addressing method

Question

I was reading the presentation on Optimizing Parallel Reduction in CUDA by Mark Harris. Here is a slide I have problem in:

enter image description here

It says there is bank conflict problem in this method. But why? All threads are accessing two consecutive memory cell which are in different banks. Neither of them accesses a specific memory cell concurrently.

2 revs · Accepted Answer

This presentation dates from the very early days of CUDA, and applies to first generation hardware.

That hardware had shared memory arranged in 8 32 bit banks. Because every eighth entry in the shared array resides in the same bank, there are bank conflicts at a number of levels of that reduction tree.

This problem was addressed in newer hardware, where the number of banks was expanded to 32, meaning that this sort of bank conflict cannot occur.

Bank conflict in parallel reduction using interleaved addressing method

Tags:

parallel-processing

cuda

gpu

reduction

Majid Azimi

1 Answers

2 revs

Recent Activity

Donate For Us

Bank conflict in parallel reduction using interleaved addressing method

Tags:

parallel-processing

cuda

gpu

reduction

Majid Azimi

1 Answers

2 revs

Related questions

Recent Activity

Donate For Us