-
Notifications
You must be signed in to change notification settings - Fork 134
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Batchspawner spawning / keep-alive is instable #174
Comments
Can you provide more info - this isn't enough to debug (these logs
show how it starts, not how it stops).
In particular, what kind of batch job are you running and what caused
the batch job to terminate? Did the batch job exceed some time or
memory limit? (check batch system and job logs) Did the hub itself
lose contact and trigger batchspawner to cancel it? (check hub logs)
This could be by the jupyter job ceasing to function before the batch
job terminates, causing the periodic hub status check to fail.
The spawn callback PR is unrelated and would only affect the
starting.
|
Thank you for your answer @rkdarst.
Jupyterhub logs around the time of failure:
|
Ah, I think this might be caused by #171:
|
This will be fixed by #179. |
Closed by #187 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hi, I often end up in the following situation:
+ which jupyterhub-singleuser /opt/anaconda/3-2019.10/bin/jupyterhub-singleuser + batchspawner-singleuser jupyterhub-singleuser --ip=0.0.0.0 --NotebookApp.default_url=/lab [I 2020-03-29 18:18:43.199 SingleUserNotebookApp manager:48] [nb_conda_kernels] enabled, 84 kernels found [I 2020-03-29 18:18:44.331 SingleUserNotebookApp extension:157] JupyterLab extension loaded from /opt/anaconda/3-2019.10/lib/python3.7/site-packages/jupyterlab [I 2020-03-29 18:18:44.331 SingleUserNotebookApp extension:158] JupyterLab application directory is /opt/anaconda/3-2019.10/share/jupyter/lab [I 2020-03-29 18:18:44.784 SingleUserNotebookApp __init__:31] [Jupytext Server Extension] Deriving a JupytextContentsManager from LargeFileManager [I 2020-03-29 18:18:44.788 SingleUserNotebookApp singleuser:561] Starting jupyterhub-singleuser server version 1.1.0 [I 2020-03-29 18:18:44.800 SingleUserNotebookApp notebookapp:1924] Serving notebooks from local directory: /data/nasif12/home_if12/the_user [I 2020-03-29 18:18:44.800 SingleUserNotebookApp notebookapp:1924] The Jupyter Notebook is running at: [I 2020-03-29 18:18:44.800 SingleUserNotebookApp notebookapp:1924] http://lab-desk12:38375/jupyter/user/the_user/ [I 2020-03-29 18:18:44.800 SingleUserNotebookApp notebookapp:1925] Use Control-C to stop this server and shut down all kernels (twice to skip confirmation). [I 2020-03-29 18:18:44.820 SingleUserNotebookApp singleuser:542] Updating Hub with activity every 300 seconds
Also, after ~ 24h the
jupyterlab-singleuser
worker looses connection to the JupyterHub.However, depending on the configuration I would like to have the jupyter instances running for more than one day.
Does somebody know why the connection between JupyterHub and the worker is so instable?
Could jupyterhub/jupyterhub#2727 be the solution?
The text was updated successfully, but these errors were encountered: