topk assumes GridDim::maxThreadsPerBlock >= 256 #22079

kurquhar · 2024-09-12T17:36:26Z

Describe the issue

This line assumes that there are at least 256 thread per thread block:
if (tid < 256) H[tid] = 0;
https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/core/providers/cuda/math/topk_impl.cuh#L275

This may be true today, but may not be in the future.

Something like this would be more future proof:
for (int x_i = tid; x_i < 256; x_i += blockDim.x) {
H[x_i] = 0;
}

To reproduce

code inspection

Urgency

Not urgent. In fact, this bug may never surface; depends on nvidia hw architecture changes going fwd.

Platform

Linux

OS Version

Ubuntu 22.04

ONNX Runtime Installation

Built from Source

ONNX Runtime Version or Commit ID

15cb2f5

ONNX Runtime API

C++

Architecture

X64

Execution Provider

CUDA

Execution Provider Library Version

No response

The text was updated successfully, but these errors were encountered:

yuslepukhin · 2024-09-13T00:54:28Z

Contributions are welcome

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

topk assumes GridDim::maxThreadsPerBlock >= 256 #22079

topk assumes GridDim::maxThreadsPerBlock >= 256 #22079

kurquhar commented Sep 12, 2024

yuslepukhin commented Sep 13, 2024

topk assumes GridDim::maxThreadsPerBlock >= 256 #22079

topk assumes GridDim::maxThreadsPerBlock >= 256 #22079

Comments

kurquhar commented Sep 12, 2024

Describe the issue

To reproduce

Urgency

Platform

OS Version

ONNX Runtime Installation

ONNX Runtime Version or Commit ID

ONNX Runtime API

Architecture

Execution Provider

Execution Provider Library Version

yuslepukhin commented Sep 13, 2024