Non-streaming Conformer model with pruned_rnnt_loss always emits the first non-blank characters on the very first frames. #1666

Open
guoyifan97 opened this issue Jun 24, 2024 · 1 comment

Comments

@guoyifan97

I trained two offline, reworked Conformer models on my own Chinese data, following pruned_transducer_stateless5: one with pruned_rnnt_loss and one with the standard RNN-T loss (warp-rnnt==0.7.0). With Conformer + pruned_rnnt_loss, I still see the issue where the timestamp of the first word is always zero. The Conformer trained with the standard RNN-T loss does not show this behavior.
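For reference, the symptom can be checked with a minimal sketch like the one below. It assumes the decoder returns, for each non-blank token, the index of the encoder output frame on which it was emitted, and that the encoder subsamples by a factor of 4 over 10 ms input frames. Those two values and both function names are hypothetical illustrations, not taken from the recipe.

```python
from typing import List


def frames_to_seconds(
    frame_indices: List[int],
    subsampling_factor: int = 4,   # assumed encoder subsampling factor
    frame_shift_ms: float = 10.0,  # assumed feature frame shift in milliseconds
) -> List[float]:
    """Convert encoder-frame indices of emitted tokens to seconds."""
    return [
        round(i * subsampling_factor * frame_shift_ms / 1000.0, 3)
        for i in frame_indices
    ]


def emits_on_first_frame(frame_indices: List[int]) -> bool:
    """Return True if the first non-blank token was emitted on encoder frame 0."""
    return len(frame_indices) > 0 and frame_indices[0] == 0
```

Running `emits_on_first_frame` over the decoded frame indices of every utterance in a test set gives the fraction of utterances affected, which makes the two models easy to compare.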

I have read the comments in:
#1347
#942
#923
k2-fsa/sherpa#52

But I still don't know how to avoid this problem. Is there a way to solve it?
Thanks!

@guoyifan97 changed the title from "Non-streaming Conformer model with pruned_rnnt_loss always emits non-blank characters on the very first frames." to "Non-streaming Conformer model with pruned_rnnt_loss always emits the first non-blank characters on the very first frames." on Jun 24, 2024
@guoyifan97
Author

I also found that combining the ordinary RNN-T loss with the k2 simple loss shows the same issue. But why?
