Update CUDA versions for CI #6539
base: master
@shiyu1994 Hi! May I kindly ask you to update the NVIDIA drivers on the host machine where the CUDA CI jobs are executed? That will allow us to run tests against the most recent CUDA version.
Refer to #6520 for the context of this PR. Some related external links: |
Based on https://docs.nvidia.com/datacenter/tesla/drivers/index.html#cuda-drivers, I think we want R535 (the latest long-term support release). |
Agreed. Based on my personal experience, the R530 driver doesn't support CUDA 12.5. |
Gentle ping @shiyu1994 for a fresh NVIDIA driver installation. |
Can confirm that R535 is enough to run containers with CUDA 12.5.
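For anyone who wants to check a host before the CI image is bumped, here is a minimal sketch in Python. It only compares the driver branch reported by `nvidia-smi` against a minimum branch; the default of 535 comes from this thread, not from an official compatibility matrix, and the helper names (`driver_branch`, `meets_minimum_branch`, `query_driver_versions`) are hypothetical.

```python
import subprocess


def driver_branch(version: str) -> int:
    """Return the major branch of a driver version string, e.g. '535.161.08' -> 535."""
    return int(version.strip().split(".")[0])


def meets_minimum_branch(version: str, minimum_branch: int = 535) -> bool:
    """Check the driver branch against a minimum.

    The default of 535 is an assumption taken from this discussion (R535),
    not a value read from NVIDIA's documentation.
    """
    return driver_branch(version) >= minimum_branch


def query_driver_versions() -> list[str]:
    """Ask nvidia-smi for the driver version of each visible GPU.

    Requires nvidia-smi on PATH; raises CalledProcessError if the call fails.
    """
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
        check=True,
        capture_output=True,
        text=True,
    )
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]
```

On the CI host this could be used as `all(meets_minimum_branch(v) for v in query_driver_versions())`. A branch comparison alone says nothing about the card type, so it is only a first sanity check.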
|
I'll try to contact @shiyu1994 in the maintainer Slack. |
@jameslamb Did you succeed? 👼 |
No, I haven't been able to reach @shiyu1994 in the last 2 months. @shiyu1994 since I do see you're active here (#6623), could you please help us with this? I sent another message in the maintainer private chat as well on a separate topic. |
Just learned that the CUDA Forward Compatibility feature is available only for server cards (e.g. Tesla A100) and not for consumer ones (e.g. RTX 4090).
For example, on a consumer RTX 4090 card with the R535 driver you'll get |
Sorry, I cannot log in to my Slack account, since it is registered with a @qq.com email. I will update the CUDA version of the CI agent. |
Thank you!! |
Fixed #6520.