Introduce grace period for failed connections in the tree service #1456

Closed
Tracked by #1322
fyrchik opened this issue May 30, 2022 · 2 comments · Fixed by #1488
@fyrchik
Contributor

fyrchik commented May 30, 2022

If a connection to the node fails, we should wait some time before trying to connect again, at least in the replicator.

Originally caught at #1445:

  1. Connect to the TLS node in dev-env with grpc.Insecure.
  2. The dial fails with a timeout.
  3. Latency of client operations increases to 2 seconds (the dial timeout) because all workers are busy connecting to the failed node and the channels eventually fill up.

Another solution would be to have a dedicated goroutine for each client, but this requires some refactoring and is somewhat related to #1447.

@carpawell
Member

Was it solved in #1488? Or do we not close the issue until the feature branch is merged?

@fyrchik
Contributor Author

fyrchik commented Jun 21, 2022

Closed via #1488

@alexchetaev alexchetaev added the U3 Regular label Sep 1, 2022
aprasolova pushed a commit to aprasolova/neofs-node that referenced this issue Oct 19, 2022
…ter failure

In case node is down or failing for some reason, we can expect `Dial` to
fail. In case we actively try to replicate and `Dial` always takes 2
seconds, replication-related channels quickly become full. That affects
latency of all other write operations.

Signed-off-by: Evgenii Stratonikov <evgeniy@nspcc.ru>