Multi-GPU training cannot be used because NCCL is not supported on Windows. #3794

Open
1 task done
nothingws opened this issue Sep 3, 2024 · 3 comments
Labels
question Further information is requested

Comments

@nothingws

Search before asking

  • I have searched the question and found no related answer.

Please ask your question

Does PaddleSeg still not support multi-GPU training on Windows 10? Testing produces the following error:
Traceback (most recent call last):
  File "tools\train.py", line 201, in <module>
    main(args)
  File "tools\train.py", line 176, in main
    train(
  File "d:\pycharmprojects\paddleseg-release-2.8\paddleseg\core\train.py", line 140, in train
    paddle.distributed.fleet.init(is_collective=True)
  File "C:\ProgramData\Anaconda3\lib\site-packages\paddle\distributed\fleet\fleet.py", line 287, in init
    paddle.distributed.init_parallel_env()
  File "C:\ProgramData\Anaconda3\lib\site-packages\paddle\distributed\parallel.py", line 1101, in init_parallel_env
    pg = _new_process_group_impl(
  File "C:\ProgramData\Anaconda3\lib\site-packages\paddle\distributed\collective.py", line 157, in _new_process_group_impl
    pg = core.ProcessGroupNCCL.create(
AttributeError: module 'paddle.base.libpaddle' has no attribute 'ProcessGroupNCCL'
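
The last frame above shows that the installed build does not expose `ProcessGroupNCCL`. The following is a minimal sketch of a pre-flight check that could be run before launching collective training. It assumes the module path `paddle.base.libpaddle` taken from the traceback. The helper names `has_nccl_process_group`, `load_paddle_core`, and `can_use_collective_training` are hypothetical and are not part of Paddle or PaddleSeg.

```python
import importlib
import platform


def has_nccl_process_group(core_module) -> bool:
    """Return True if the given module exposes a ProcessGroupNCCL attribute."""
    return hasattr(core_module, "ProcessGroupNCCL")


def load_paddle_core():
    """Import the module named in the traceback; return None if it is unavailable.

    Assumption: the module path is the one printed in the AttributeError above.
    """
    try:
        return importlib.import_module("paddle.base.libpaddle")
    except ImportError:
        return None


def can_use_collective_training() -> bool:
    """True only if Paddle imports and its core module exposes ProcessGroupNCCL."""
    core = load_paddle_core()
    return core is not None and has_nccl_process_group(core)


if __name__ == "__main__":
    print("OS:", platform.system())
    print("ProcessGroupNCCL available:", can_use_collective_training())
```

If the check reports `False`, the same `AttributeError` is likely on that installation, so single-GPU training may be the only option there.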

@nothingws nothingws added the question Further information is requested label Sep 3, 2024
@zhangyubo0722
Collaborator

Try switching to the develop branch of PaddleSeg. Also, which version of Paddle are you using?

@nothingws
Author

> Try switching to the develop branch of PaddleSeg. Also, which version of Paddle are you using?

This version: paddlepaddle-gpu 2.6.1.post116
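
For reporting the installed version in a thread like this one, the following sketch reads it from package metadata without importing Paddle. It assumes the distribution is installed under one of the pip names `paddlepaddle-gpu` (as in the reply above) or `paddlepaddle`. The function name `installed_paddle_version` is hypothetical.

```python
from importlib import metadata
from typing import Optional

# Assumed pip distribution names; the first one is the name given in the reply above.
CANDIDATE_DISTRIBUTIONS = ("paddlepaddle-gpu", "paddlepaddle")


def installed_paddle_version() -> Optional[str]:
    """Return the version of the first candidate distribution found, else None."""
    for name in CANDIDATE_DISTRIBUTIONS:
        try:
            return metadata.version(name)
        except metadata.PackageNotFoundError:
            continue
    return None


if __name__ == "__main__":
    print("Installed Paddle version:", installed_paddle_version())
```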

@zhangyubo0722
Collaborator

OK, please try the develop branch first.
