Describe the issue
This line assumes that there are at least 256 threads per thread block:
if (tid < 256) H[tid] = 0;
https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/core/providers/cuda/math/topk_impl.cuh#L275
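This may be true today, but may not be in the future. With fewer than 256 threads per block, the entries H[blockDim.x] through H[255] would never be zeroed.
A block-stride loop like this would be more future-proof: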
for (int x_i = tid; x_i < 256; x_i += blockDim.x) {
  H[x_i] = 0;
}
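To make the difference concrete without needing a GPU, here is a minimal host-side C++ sketch that replays both initialization patterns for a simulated block of `block_dim` threads. It is not code from onnxruntime: the helper names (`zero_with_guard`, `zero_with_stride`, `count_uninitialized`), the constant `kBins`, and the `-1` sentinel are all hypothetical, and the sequential loop over `tid` only stands in for the threads of one block.

```cpp
#include <array>
#include <cassert>

constexpr int kBins = 256;  // histogram size from the issue (hypothetical name)

// Replays `if (tid < 256) H[tid] = 0;` for every simulated thread.
// Each thread clears at most one bin, so bins >= block_dim stay untouched.
std::array<int, kBins> zero_with_guard(int block_dim) {
  assert(block_dim > 0);
  std::array<int, kBins> H;
  H.fill(-1);  // sentinel meaning "never initialized"
  for (int tid = 0; tid < block_dim; ++tid) {
    if (tid < kBins) H[tid] = 0;
  }
  return H;
}

// Replays the proposed block-stride loop for every simulated thread.
// Thread `tid` clears bins tid, tid + block_dim, tid + 2 * block_dim, ...
std::array<int, kBins> zero_with_stride(int block_dim) {
  assert(block_dim > 0);
  std::array<int, kBins> H;
  H.fill(-1);  // sentinel meaning "never initialized"
  for (int tid = 0; tid < block_dim; ++tid) {
    for (int x_i = tid; x_i < kBins; x_i += block_dim) {
      H[x_i] = 0;
    }
  }
  return H;
}

// Counts bins that still hold the sentinel after initialization.
int count_uninitialized(const std::array<int, kBins>& H) {
  int n = 0;
  for (int v : H) {
    if (v != 0) ++n;
  }
  return n;
}
```

Under these assumptions, `count_uninitialized(zero_with_guard(128))` reports 128 bins left unset, while `count_uninitialized(zero_with_stride(128))` reports none. With 256 or more threads the two patterns behave the same, which is why the problem cannot be seen with the current launch configuration.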
To reproduce
code inspection
Urgency
Not urgent. In fact, this bug may never surface; it depends on future NVIDIA hardware architecture changes.
Platform
Linux
OS Version
Ubuntu 22.04
ONNX Runtime Installation
Built from Source
ONNX Runtime Version or Commit ID
15cb2f5
ONNX Runtime API
C++
Architecture
X64
Execution Provider
CUDA
Execution Provider Library Version
No response