Bridges - Too Many Files Open, Consecutive Sessions Create #1361
Comments
What versions are you running, and on what resource? Thanks, Andre
I can reproduce this now. Tracking this down will unfortunately take a while...
@andre-merzky FYI, I also get this error when I submit jobs to Comet.
Yeah, that is somewhat independent of the target resource.
So, here is the catch: Python's logging module does not really allow us to reclaim log handles. The documentation says:
The last statement kind of makes this unusable for your specific use case: the logging system will be used in the next session. So, we are collecting log handles along the way. I tried to manually close handles, but that does not seem to work. I assume the module keeps private handles.
This is a bloody mess. I'll keep on it, but if that logging problem does not go away, we either have to rethink our generous use of logging handles (ugh), or write our own logging module (UGH), or live with that limitation (gah!).
We also collected socket handles, but that part is fixed now, so you should be able to get more sessions in an application than before. The exact number depends on system-specific settings, so your mileage may vary. You may want to set
Let's discuss on Monday whether this is worth holding up the release, because either way I don't see a quick fix. FWIW, I have not yet merged the partial fix into devel.
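For readers who want to see what manually closing log handles looks like with the standard `logging` module, here is a minimal sketch. It assumes each session attaches its own `FileHandler` to a named logger; `open_session_logger` and `release_handlers` are hypothetical helper names, not part of RP. It only releases handlers attached directly to the given logger, so it may not cover whatever handles are still being held in the case described above.

```python
import logging
import os
import tempfile


def open_session_logger(name, log_dir):
    """Create a logger with one FileHandler, as a stand-in for a session's log setup."""
    logger = logging.getLogger(name)
    handler = logging.FileHandler(os.path.join(log_dir, name + ".log"))
    logger.addHandler(handler)
    logger.warning("session %s started", name)
    return logger


def release_handlers(logger):
    """Detach and close every handler on `logger`; return how many were released."""
    released = 0
    # Iterate over a copy, because removeHandler() mutates logger.handlers.
    for handler in list(logger.handlers):
        logger.removeHandler(handler)
        handler.close()  # closes the underlying file for a FileHandler
        released += 1
    return released


if __name__ == "__main__":
    log_dir = tempfile.mkdtemp()
    for i in range(3):
        session_logger = open_session_logger("session_%d" % i, log_dir)
        print(session_logger.name, "released", release_handlers(session_logger))
```

Note that `logging.getLogger(name)` keeps the logger object itself registered for the lifetime of the process; only the handlers, and the files they hold open, are released here.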
A partial fix for this has now been merged into devel. The problem is not completely solved - but the resource leakage has been reduced to a level where one can create about 60 sessions per application. I'll open a new ticket as a reminder that this needs more work (see #1387).
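Since the number of sessions that fit into one application depends on the per-process limit on open files, it can help to inspect that limit from within the application. The sketch below uses Python's standard `resource` module (Unix only); `fd_limits` and `raise_soft_limit` are hypothetical helper names, and whether raising the soft limit is permitted or sufficient depends on the system.

```python
import resource


def fd_limits():
    """Return the (soft, hard) limits on open file descriptors for this process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def raise_soft_limit(target):
    """Raise the soft limit toward `target`, never above the hard limit.

    Returns the soft limit that is in effect afterwards.
    """
    soft, hard = fd_limits()
    if soft == resource.RLIM_INFINITY:
        return soft  # already unlimited, nothing to raise
    if hard != resource.RLIM_INFINITY:
        target = min(target, hard)
    if target > soft:
        # Some systems reject values above a kernel-defined maximum.
        resource.setrlimit(resource.RLIMIT_NOFILE, (target, hard))
    return fd_limits()[0]


if __name__ == "__main__":
    print("open file limits (soft, hard):", fd_limits())
```

Raising the limit only postpones the error when handles are leaking; it does not remove the leak.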
I am automating my experiment runs, where each run executes some number of Synapse emulations (CUs). The first few runs complete successfully (with some failed CUs, but that's another ticket). However, subsequent runs give the following errors, which seem like variants of the problem.
There are no other errors from log files that can be found. The error only occurs after RP mentions the database used in the terminal, namely
database : [mongodb:// ....... ]
When I restart my experiments, the same situation occurs; the first few runs execute correctly, but all subsequent runs encounter the same error.
I will add a 5-minute delay between runs to see whether the problem is caused by running too many sessions too quickly.