mirror of https://github.com/davisking/dlib.git
Problem: CUDA error having `code: 716, reason: misaligned address` (#2796)
Why does this happen: `float` data should be aligned to 4 bytes [1], but this is not guaranteed when the truth data is `uint16_t` and is in the buffer before the weights data Solution: re-arrange the buffer so that the weights data (type `float`) comes before the truth data (type `uint16_t`), essentially making it so that the `float` data is correctly aligned to 4 bytes [1] https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#device-memory-accessespull/2798/head
parent
decdef12f5
commit
b6418e349e
Loading…
Reference in new issue