I know that devices before the Fermi architecture had 8 SPs in a single multiprocessor. Is the count the same in the Fermi architecture?
A CUDA core is loosely analogous to a CPU core. Individual CUDA cores are much simpler than CPU cores, but there are far more of them: a standard gaming CPU comes with up to 16 cores, while a GPU can easily have hundreds of CUDA cores.
CUDA cores are parallel processors. Just as your CPU might be a dual- or quad-core device, NVIDIA GPUs host several hundred or several thousand cores. The cores are responsible for processing all the data that is fed into and out of the GPU, including the game graphics calculations whose results are displayed to the end user.
A CPU typically consists of four to eight CPU cores, while a GPU consists of hundreds of smaller cores. Together, they operate to crunch through the data in the application. This massively parallel architecture is what gives the GPU its high compute performance.
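The answer depends on the Compute Capability property of the CUDA device. The numbers are: 8 CUDA cores per multiprocessor for compute capability 1.x (pre-Fermi), 32 for compute capability 2.0, and 48 for compute capability 2.1 (both Fermi).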
See appendix G of the CUDA C Programming Guide.
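A minimal host-side sketch of how the compute capability maps to a core count, covering only the 1.x and 2.x devices discussed here. The function names `cores_per_sm` and `total_cores` are made up for illustration and are not part of the CUDA API. On a real device, the inputs would come from the `major`, `minor`, and `multiProcessorCount` fields of `cudaDeviceProp`, filled in by `cudaGetDeviceProperties`.

```cpp
// Illustrative lookup, not CUDA API: cores per multiprocessor (SM) for
// compute capability 1.x and 2.x. Returns 0 for anything not covered here.
inline int cores_per_sm(int cc_major, int cc_minor) {
    if (cc_major == 1) return 8;                    // pre-Fermi (1.0 to 1.3)
    if (cc_major == 2 && cc_minor == 0) return 32;  // Fermi, compute capability 2.0
    if (cc_major == 2 && cc_minor == 1) return 48;  // Fermi, compute capability 2.1
    return 0;                                       // outside the scope of this sketch
}

// Total cores on the device = number of multiprocessors * cores per multiprocessor.
inline int total_cores(int sm_count, int cc_major, int cc_minor) {
    return sm_count * cores_per_sm(cc_major, cc_minor);
}
```

For example, `total_cores(15, 2, 0)` evaluates to 480 for a compute capability 2.0 device with 15 multiprocessors. Newer architectures use different per-multiprocessor counts, so the table would need extending for them.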