Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Models being saved in temporary directory #65

Open
LucaSainteCroix opened this issue Sep 16, 2021 · 0 comments
Open

Models being saved in temporary directory #65

LucaSainteCroix opened this issue Sep 16, 2021 · 0 comments

Comments

@LucaSainteCroix
Copy link

Hello,

I have an issue when using crfsuite that appears to only exist on Linux (or at least I never found the files on Windows). I have a program to extract accounting data from invoices that is hosted on an ec2 machine running on Amazon Linux 2. I load a CRF model using joblib and make a prediction, and apparently everytime my program is called, the model is saved as a file called model********.crfsuite in the /tmp directory. The problem is that I eventually end up with hundreds of files that take up all the space of my ec2 machine. (See Screenshot)

Screenshot_5

This problem also happens locally on my Ubuntu running with WSL2, also in /tmp.

The only relevant text I found was in the documentation of the CRF class:
Screenshot_6
I don't use this parameter, I only load my trained model using joblib.

Does anyone have an idea on how I can get it to stop saving those files infinitely?

Or alternatively, is it safe to use a cron script to delete those files? (I would prefer a cleaner solution)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant