mirror of
https://github.com/davisking/dlib.git
synced 2024-11-01 10:14:53 +08:00
Fixed spelling error in comment
This commit is contained in:
parent
2a0b461926
commit
776678a37c
@ -261,7 +261,7 @@ namespace dlib
|
||||
CUDA's y thread index (e.g. threadIdx.y) instead of the x index.
|
||||
Therefore, if you launch a cuda kernel with a statement like:
|
||||
dim3 blocks(10,1);
|
||||
dim3 threads(32,32); // You need to have x any not equal to 1 to get parallelism over both loops.
|
||||
dim3 threads(32,32); // You need to have x and y not equal to 1 to get parallelism over both loops.
|
||||
add_arrays<<<blocks,threads>>>(a,b,out,nr,nc);
|
||||
You can perform a nested 2D parallel for loop rather than doing just a
|
||||
1D for loop.
|
||||
|
Loading…
Reference in New Issue
Block a user