Skip to content
This repository has been archived by the owner on Nov 6, 2020. It is now read-only.

Warp sync stalls, probably because the peer isn't sending anymore data #11526

Open
dvdplm opened this issue Feb 27, 2020 · 0 comments
Open

Warp sync stalls, probably because the peer isn't sending anymore data #11526

dvdplm opened this issue Feb 27, 2020 · 0 comments
Labels
A3-stale 🍃 Pull request did not receive any updates in a long time. No review needed at this stage. Close it. F3-annoyance 💩 The client behaves within expectations, however this “expected behaviour” itself is at issue. M4-core ⛓ Core client code / Rust. P5-sometimesoon 🌲 Issue is worth doing soon. Q5-substantial 📓 Can be fixed by a developer with decent experience.

Comments

@dvdplm
Copy link
Collaborator

dvdplm commented Feb 27, 2020

Sometimes warp sync stalls. The likely cause is that the warp sync peer is not sending any data and no progress is made. It can look like this:

2020-02-27 10:18:07  IO Worker #1 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:18:12  IO Worker #0 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:18:17  IO Worker #3 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:18:22  IO Worker #0 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:18:27  IO Worker #3 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:18:32  IO Worker #1 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:18:37  IO Worker #2 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:18:42  IO Worker #1 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:18:47  IO Worker #3 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:18:52  IO Worker #1 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:18:57  IO Worker #0 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:02  IO Worker #3 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:07  IO Worker #0 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:12  IO Worker #2 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:17  IO Worker #2 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:22  IO Worker #3 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:27  IO Worker #1 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:32  IO Worker #3 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:37  IO Worker #2 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:42  IO Worker #1 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:47  IO Worker #3 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:52  IO Worker #2 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:19:57  IO Worker #0 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:20:02  IO Worker #3 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:20:07  IO Worker #0 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:20:12  IO Worker #2 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:20:17  IO Worker #0 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:20:22  IO Worker #3 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:20:27  IO Worker #0 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs
2020-02-27 10:20:32  IO Worker #1 INFO import  Syncing snapshot 509/4349        #0   22/25 peers   920 bytes chain 3 MiB db 0 bytes queue 53 KiB sync  RPC:  0 conn,    0 req/s,    0 µs

TODO:

  1. Validate the hypothesis
  2. Add heuristics to detect the stall and try to find a different peer with the same snapshot and continue from there, and/or restart the search for a snapshot peer.
@vorot93 vorot93 added F3-annoyance 💩 The client behaves within expectations, however this “expected behaviour” itself is at issue. M4-core ⛓ Core client code / Rust. P5-sometimesoon 🌲 Issue is worth doing soon. Q5-substantial 📓 Can be fixed by a developer with decent experience. labels Apr 15, 2020
@adria0 adria0 added the A3-stale 🍃 Pull request did not receive any updates in a long time. No review needed at this stage. Close it. label Jul 27, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
A3-stale 🍃 Pull request did not receive any updates in a long time. No review needed at this stage. Close it. F3-annoyance 💩 The client behaves within expectations, however this “expected behaviour” itself is at issue. M4-core ⛓ Core client code / Rust. P5-sometimesoon 🌲 Issue is worth doing soon. Q5-substantial 📓 Can be fixed by a developer with decent experience.
Projects
None yet
Development

No branches or pull requests

3 participants