DVM startup failed #1237
Comments
Excellent - thanks!
Any indication of whether this is just a Titan blip, in which case the ticket is for more logging (next release), or whether it is repeatable? Certainly Titan was working OK during the RC1 testing.
No idea right now, really, but I am looking into this today.
@andre-merzky any update on this one?
Sorry that it took me so long to see this, but the DVM was started twice in the There is of course the possibility that this masked different errors, since we saw different error modes than DVM startup - but those should get separate tickets now.
This is not the correct fix. See 1968501.
Note that we didn't backport this as it was said that all users of titan_lib would be using the split_module branch anyway.
Right, that makes sense. Well, that patch applies with only a minor conflict, so I am impartial to backporting or not...
So this ticket should be looked at in the context of the next release to (a) check that the actual dual-startup problem is resolved, and (b) consider whether additional tracing around DVM startup should be made default.
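To make points (a) and (b) concrete, here is a minimal sketch of a start-once guard with startup and health tracing. It is not the project's actual code: the `DVMLauncher` class, its method names, and the logger name are all hypothetical, and the command line of the helper process is whatever the caller passes in.

```python
import logging
import subprocess
import sys
import threading

log = logging.getLogger("dvm_launcher")  # hypothetical logger name


class DVMLauncher:
    """Hypothetical wrapper that starts a DVM-like helper process at most once."""

    def __init__(self, command):
        self._command = list(command)  # command line of the helper (assumed)
        self._proc = None
        self._lock = threading.Lock()
        self.start_count = 0           # number of real process launches

    def start(self):
        """Launch the process; return False if one is already running."""
        with self._lock:
            if self._proc is not None and self._proc.poll() is None:
                log.warning("DVM already running (pid %d), ignoring second start",
                            self._proc.pid)
                return False
            log.info("starting DVM: %s", self._command)
            self._proc = subprocess.Popen(self._command)
            self.start_count += 1
            log.info("DVM started (pid %d)", self._proc.pid)
            return True

    def is_healthy(self):
        """Report whether the process exists and has not exited."""
        alive = self._proc is not None and self._proc.poll() is None
        log.debug("DVM health check: alive=%s", alive)
        return alive

    def stop(self):
        """Terminate the process if it is still running."""
        with self._lock:
            if self._proc is not None and self._proc.poll() is None:
                self._proc.terminate()
                self._proc.wait()
            log.info("DVM stopped")
```

With something like this, a second `start()` call is logged and ignored instead of silently launching a second instance, and `is_healthy()` gives a place to hang the status logging asked for in the original report. It only checks that the process is alive, not that it is responsive.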
#1277 is currently being tested in this context.
We seem to have a working configuration on titan by now, including a fix to the double-startup problem. Is anybody opposed to closing this ticket?
I would say that this issue only applied to the old situation.
Thanks Mark.
This is during test of v0.45.RC2 on ornl.titan, example 00:
Note that titan seems unusually slow right now, so I am not sure if the machine is healthy. What I am missing so far is any indication of DVM status and health; we need some logging there...