Doris 3.0.8 on Kubernetes: File cache memory not released quickly, causing MEM_ALLOC_FAILED and query starvation #62130
Unanswered
shashankk90
asked this question in
A - General / Q&A
Replies: 1 comment
-
|
we think we need be.INFO and master fe.log if you can reproduce this issue. btw, you can join our Discord(https://discord.gg/JemTbMKj ), we are always online at Discord. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Description
I am running Apache Doris 3.0.8 on Kubernetes using the operator, and encountering an issue where backend (BE) pods stop executing new queries due to memory pressure caused by OS file cache.
During query execution, Doris loads data into the file cache. However, this cache is not released quickly enough, leading to high memory usage at the OS level. As a result, Doris refuses to execute new queries even though the internal memory tracker shows low usage.
This appears related to how the OS file cache is accounted for in Kubernetes environments (possibly similar to: kubernetes/kubernetes#43916).
Error Message
[MEM_ALLOC_FAILED] Create Expr failed because [E11] Allocator sys memory check failed:
Cannot alloc:64, consuming tracker:<Load#Id=cc827453e9894a89-94be3f0322fd9413>,
peak used 0, current used 0, exec node:<>,
process memory used 18.58 GB exceed limit 198.00 GB
or sys available memory 3.55 GB less than low water mark 12.00 GB.
Environment
Observed Behavior
sys available memorydrops below low water markMEM_ALLOC_FAILEDExpected Behavior
Questions
sys available memorybe used in containerized environments, or can this check be relaxed?Beta Was this translation helpful? Give feedback.
All reactions