You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of Polars.
Reproducible example
import polars as pl
import pandas as pd
data = pl.datetime_range(
start=pl.lit("2024-08-19T08:00:00", dtype=pl.Datetime(time_unit="ns")),
end=pl.lit("2024-08-19T16:00:00", dtype=pl.Datetime(time_unit="ns")),
interval="100us",
eager=True,
)
df = pl.DataFrame(data, schema={"datetime": pl.Datetime(time_unit="ns")})
def create_calculation_plan(
df: pl.DataFrame, end_dt: pd.Timestamp, offset_s: int
) -> pl.LazyFrame:
plan = (
df.lazy()
.filter(
pl.col("datetime")
<= pl.lit(
end_dt - pd.Timedelta(seconds=offset_s),
dtype=pl.Datetime(time_unit="ns"),
)
)
.unique()
.last()
)
return plan
calculation_plans = [
create_calculation_plan(df, pd.Timestamp("2024-08-19T16:00:00"), offset)
for offset in range(3)
]
pl.collect_all(calculation_plans)
Log output
No response
Issue description
The code is finishing successfully when run locally on Windows with approx. 8GB memory available. It saturates memory (100% RAM utillisation) during the calculations. It finishes successfully even when run in two separate processes simultaneously.
However when run locally using Docker the container is instantly OOM killed (memory limits not set, all available memory could be used). Also similar code was run succesfully on local Windows and docker container on kubernetes with more RAM available (12GB). The container on kubernetes was also OOM killed.
I think that the problem could be connected with cgroups limits. However issue regarding those was completed #15797
Expected behavior
The code finishes successfuly when executed in Docker container.
Checks
Reproducible example
Log output
No response
Issue description
The code is finishing successfully when run locally on Windows with approx. 8GB memory available. It saturates memory (100% RAM utillisation) during the calculations. It finishes successfully even when run in two separate processes simultaneously.
However when run locally using Docker the container is instantly OOM killed (memory limits not set, all available memory could be used). Also similar code was run succesfully on local Windows and docker container on kubernetes with more RAM available (12GB). The container on kubernetes was also OOM killed.
I think that the problem could be connected with cgroups limits. However issue regarding those was completed #15797
Expected behavior
The code finishes successfuly when executed in Docker container.
Installed versions
The text was updated successfully, but these errors were encountered: