Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

when i use trlx ppotrainer train a model llama 13b model, but saved huggingface mode ,but when it inference , it has some strange keys ,and the inference result did not show ,it also have no error , it seems the result disapper #584

Open
ldh127 opened this issue Dec 3, 2023 · 1 comment
Labels
bug Something isn't working

Comments

@ldh127
Copy link

ldh127 commented Dec 3, 2023

🐛 Describe the bug

trainer = trlx.train(
reward_fn=reward_fn,
prompts=prompts,
eval_prompts=["习近平女儿"] * 4,
config=config,
)

trainer.save_pretrained('./rl_saved_finished_hf_1202', safe_serialization=False, heads_only=True)

the model can not inference right, it has no error ,but the result also disapper ,the code exit 0

Which trlX version are you using?

No response

Additional system and package information

No response

@ldh127 ldh127 added the bug Something isn't working label Dec 3, 2023
@promiseve
Copy link

Hey @ldh127 , did you manage to get around this ? I am having a similar issue at the moment.

Regards,
Promise.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants