Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Intermittent "snappy: corrupt input" with plain CSVs #2157

Open
tmtmtmtm opened this issue Sep 21, 2024 · 4 comments
Open

Intermittent "snappy: corrupt input" with plain CSVs #2157

tmtmtmtm opened this issue Sep 21, 2024 · 4 comments

Comments

@tmtmtmtm
Copy link
Contributor

From time to time (somewhere on the order of 1% of runs), with no pattern I can discern, I get a io error: snappy: corrupt input (expected stream header but got unexpected chunk type byte 112) error when reading plain CSV files. I can never replicate this: on a subsequent run of the same command everything works fine. It usually happens somewhere in the middle of a chain of piped commands, so I also can't tell which of the commands is blowing up, or if there's any pattern to it.

I have been holding off on reporting in the hope that I could pin it down at least a little more, but I've been unable to do so. I have no .sz files anywhere, so I'm assuming that either qsv is running a check for snappy-ness somewhere (in which case perhaps there's a way I could explicitly turn that off?), or one of the sub-commands is producing a temporary snappy file. But I don't really understand what's going on well enough so hopefully there's enough info here for someone else to pick up a useful clue.

(I'm currently on qsv 0.134.0-mimalloc-apply;fetch;foreach;geocode;Luau 0.640;to;polars-0.42.0-fe04390;self_update-8-8;12.80 GiB-677.75 MiB-0 B-16.00 GiB (aarch64-apple-darwin compiled with Rust 1.81) prebuilt but it's been happening with other recent versions too)

@jqnatividad
Copy link
Owner

That's interesting @tmtmtmtm , can you also run qsv --envlist to see if there are any applicable environment variables?

@tmtmtmtm
Copy link
Contributor Author

> qsv --envlist
No qsv-relevant environment variables set.

@jqnatividad
Copy link
Owner

Can you also set QSV_LOG_LEVEL=debug?

export QSV_LOG_LEVEL=debug

and check if it logs any snappy encoding/decoding ops in the qsv_rCURRENT.log file?

@tmtmtmtm
Copy link
Contributor Author

no mention of snappy encoding or decoding anywhere in the log, even on a run that fails. Only reference was the io error: snappy: corrupt input (expected stream header but got unexpected chunk type byte 105) to normal output.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants