How does CUDA handle multiple updates to memory address?
I have written a CUDA kernel in which each thread makes an update to a
particular memory address (with int size). Some threads might want to
update this address simultaneously.
How does CUDA handle this? Does the operation become atomic? Does this
increase the latency of my application in any way? If so, how?
Thank you very much!
No comments:
Post a Comment