Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Currently cuda version only supports training on a single GPU #6045

Open
HenrySun98 opened this issue Aug 17, 2023 · 6 comments
Open

Currently cuda version only supports training on a single GPU #6045

HenrySun98 opened this issue Aug 17, 2023 · 6 comments
Labels

Comments

@HenrySun98
Copy link

HenrySun98 commented Aug 17, 2023

Description

I build LightGBM CUDA implementation, and set GPU parameter "device" : 'cuda' and "num_gpu" : 4
But I got a Fatal Error: Currently cuda version only supports training on a single GPU

In the official doc, it says multiple GPUs is supported in CUDA version.

My LightGBM version is : 4.0.0.99

Tasks

No tasks being tracked yet.
@HenrySun98
Copy link
Author

Also, GPU parameter gpu_id accepts int value, how can I determine which GPUs used in the task?
Suppose I have 8 GPUs in the server.

@jameslamb
Copy link
Collaborator

In the official doc, it says multiple GPUs is supported in CUDA version

Please share a link to what you're referring to with this statement.


@shiyu1994 could you answer the other parts of this report?

@HenrySun98
Copy link
Author

@jameslamb

https://lightgbm.readthedocs.io/en/latest/Parameters.html#num_gpu

I found this link for num_gpu parameter.

@HenrySun98
Copy link
Author

@jameslamb @shiyu1994 expect your replies, many thx

@bstockton
Copy link

@jameslamb Is there any update on this? This seems like a very direct conflict with the documentation and implemented code that is causing a lot of confusion.

@eadeoye-Arcadia
Copy link

Same please address this potential conflict in documentation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants