-
-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PDS Q19 failing since Polars 1.7.0 #18710
Comments
We're on it. |
The rest runs ok? The issue is our new parquet prefiltering strategy. Choosing a different strategy for |
q6 and q12 also fail with similar errors, |
Does this fixes them? #18714 |
now it's just hanging indefinitely for me |
can confirm this still happens the code i'm running is just https://github.com/pola-rs/polars-benchmark/blob/main/queries/polars/q19.py |
Culrprit reverted. I am going to add benchmark runs to CI :') |
+1, observing this with a very simple scan & collect. Here's some more debugging info: sep = pl.scan_parquet(parquet.path("SEP")).filter(pl.col("date") < day).sort("date")
dates = sep.select("date").unique().tail(200)
prices = sep.select("date", "ticker", "closeadj", "open", "low", "high", "close")
last = dates.join(prices, on="date").collect()
# *** polars.exceptions.ColumnNotFoundError: "closeadj" not found Then I went to reduce it down for this issue, and observed something else: it's complaining about columns that are in the table on the disk, but weren't selected: sep = pl.scan_parquet(parquet.path("SEP")).filter(pl.col("date") < day).sort("date")
dates = sep.select("date").unique().tail(200)
prices = sep.select("date", "closeadj", "open")
last = dates.join(prices, on="date").collect()
# *** polars.exceptions.ColumnNotFoundError: "high" not found Or sep = pl.scan_parquet(parquet.path("SEP")).filter(pl.col("date") < day).sort("date")
dates = sep.select("date").unique().tail(200)
prices = sep.select("date", "closeadj")
last = dates.join(prices, on="date").collect()
# *** polars.exceptions.ColumnNotFoundError: "open" not found Will test & report back here when a fix is released. 1.6.0 is OK |
Confirming fix in 1.7.1 |
Checks
Reproducible example
Log output
Issue description
Since 1.7.0, q19 has started failing
spotted in the Narwhals CI https://github.com/narwhals-dev/narwhals/actions/runs/10823260522/job/30028496951?pr=951
Expected behavior
Installed versions
The text was updated successfully, but these errors were encountered: