Skip to content

Random timeouts for GraphQL getKubeNamespace/getKubeObject #5079

@stollero

Description

@stollero

What happened:
I create a new experiment and want to select the namespace and app label.
But the dropdown does not list a namespace or if the namespace is available, the app labels.

Watching the network I can see that the graphql query ends with 504 gateway timeout after a minute.
Reloading the page helps sometimes.

What you expected to happen:
The drop down for namespace and app label selection always works.

Where can this issue be corrected? (optional)

How to reproduce it (as minimally and precisely as possible):
If I run just the graphql query getKubeObject or getKubeNamespace with a standalone client I can reproduce the issue.
Running the query multiple times it might return nothing (6 out of 10 requests), the correct data (2 out of 10 requests) or runs into the timeout (2 out of 10 requests).

I tried other Queries, e.g. to list the experiments and these work every time.
Only the query that pulls information from the kubernetes cluster runs into timeouts.

Anything else we need to know?:
Cluster currently has 109 namespaces, but kubeobject also fails for namespaces only having a single app label.

Maybe related issues:

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions