etcd memory usage spikes to unsustainable levels and OOMs #14362
Comments
The OOM is a known issue; the workarounds, AFAIK, are:
With respect to the WAL files, this looks like a problem: the maximum number of WAL files is 5, but there are many more than that, which means the old WAL files failed to be purged. I checked the log file you attached but did not see anything useful, probably because what you attached isn't the complete log. Please try to reproduce the issue and attach the complete log if possible.
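To put a number on how far the directory is over the limit, a small read-only check like the one below may help. It is only a sketch: the function names and the `max_wals` parameter are made up for illustration, the default of 5 is taken from the limit mentioned above, and the demo file names are invented placeholders rather than real etcd output. It counts files and never deletes anything.

```python
import os
import tempfile


def list_wal_files(wal_dir):
    """Return the sorted names of files in wal_dir that end in .wal."""
    return sorted(name for name in os.listdir(wal_dir) if name.endswith(".wal"))


def excess_wal_count(wal_dir, max_wals=5):
    """Return how many .wal files exceed max_wals (0 if within the limit)."""
    return max(0, len(list_wal_files(wal_dir)) - max_wals)


# Demo on a throwaway directory; the file names are illustrative only.
with tempfile.TemporaryDirectory() as demo_dir:
    for seq in range(8):
        name = f"{seq:016x}-{seq * 1000:016x}.wal"
        open(os.path.join(demo_dir, name), "wb").close()
    # A non-.wal file that should be ignored by the count.
    open(os.path.join(demo_dir, "0.tmp"), "wb").close()
    demo_excess = excess_wal_count(demo_dir)
    print(demo_excess)  # 3 (8 .wal files minus the limit of 5)
```

Running `excess_wal_count` against the real WAL directory and posting the result alongside the complete log would show how many files were not purged.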
Can I just delete the WAL files?
Manually deleting the WAL files isn't recommended, because the remaining WAL files may then no longer match the snap files. Please try to reproduce the issue and provide the complete log. If you are interested, please also try to figure out why etcd failed to purge the WAL files automatically.
When I try getting debug info:
When I try to get the logs:
I'm trying to get more info, but etcd hogs memory and I can't even use my system properly.
The huge memory usage might be caused by the db file size. What's the size of the db file, which is located in
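If the usual tools are hard to run while the system is under memory pressure, a tiny script can report the file size. This is a minimal sketch: `file_size_mib` is a hypothetical helper, and the path has to be supplied by the caller because it depends on the configured data directory. The demo measures a temporary file rather than a real etcd db file.

```python
import os
import tempfile


def file_size_mib(path):
    """Return the size of the file at path in MiB."""
    return os.path.getsize(path) / (1024 * 1024)


# Demo: write a 2 MiB temporary file and measure it.
with tempfile.NamedTemporaryFile(delete=False) as fh:
    fh.write(b"\0" * (2 * 1024 * 1024))
    demo_path = fh.name

demo_size = file_size_mib(demo_path)
os.remove(demo_path)
print(demo_size)  # 2.0
```

Calling `file_size_mib` on the db file while the container is stopped gives a number to compare against the ~7.6 GB memory limit.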
I just added more logging to help debug why etcd fails to purge the WAL files.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 21 days if no further activity occurs. Thank you for your contributions.
What happened?
Whenever I run my etcd container, its memory usage slowly climbs until it hits the Docker limit on my machine (~7.6 GB), and then it OOMs. I've investigated and believe this is because etcd has created a large number of WAL files, well above the limit (5), and it breaks down when it has to process them.
I've attached etcd docker logs.
etcdlogs.txt
What did you expect to happen?
etcd would run normally.
How can we reproduce it (as minimally and precisely as possible)?
I had inserted a lot of data, and now when I launch etcd it seems that it simply cannot handle the saved .wal files.
Anything else we need to know?
ls -has on the etcd wal directory:
Etcd version (please run commands below)
etcd v3.5.0 from docker
Etcd configuration (command line flags or environment variables)
Etcd debug information (please run commands below, feel free to obfuscate the IP address or FQDN in the output)
I cannot run these commands because etcd hangs and crashes.
Relevant log output
No response