Memory leak while submitting Spark apps using Ozone #7320
-
Beta Was this translation helpful? Give feedback.
Replies: 7 comments 14 replies
-
|
May I know where does the memory issue happen? Client-side (S3G / Ozone client) or server-side (OM, SCM, DNs)? Might be related, but 1.4.1 has a memory issue due to regression Ratis version 3.1.0 that causes memory issue. If possible, you can try to downgrade to 1.4.0 first and see if the memory issue improves. We probably will have another RC that upgrades the Ratis version to 3.1.1 that revert the problematic patches. cc: @xichen01 |
Beta Was this translation helpful? Give feedback.
-
|
Cleanup happens on
Are number of |
Beta Was this translation helpful? Give feedback.
-
|
Looks similar to the issue I described in https://issues.apache.org/jira/browse/HDDS-11505. In this case, the application didn't properly call close() If we are suspecting issues in Ozone client itself, HDDS-11543 fixed almost all of the leaks originating from Ozone client. |
Beta Was this translation helpful? Give feedback.
-
|
@aekom87 Do you have any recent plans to create a PR to fix this issue to master branch? |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
|
Is the problem effectively resolved |
Beta Was this translation helpful? Give feedback.
-
|
Hi @aekom87, thanks for pointing out this issue and for the fix (PR #7382). Since the original question has been answered, I am closing this discussion. Please feel free to open a new discussion to target the ratis JMX growth. @smileyboy2019, I think the original problem has been fixed by #7382 |
Beta Was this translation helpful? Give feedback.


Thanks @aekom87 for the details, very helpful.
You are right.
RootedOzFsis leaking this internalRootedOzoneFileSysteminstance.By contrast,
S3Acloses its own internal instance infinalize():