Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tendermint light client is failing due to context timeout when state syncing #11

Open
3 tasks
daniel1302 opened this issue Nov 11, 2024 · 0 comments
Open
3 tasks
Labels
bug Something isn't working

Comments

@daniel1302
Copy link
Collaborator

The snapshot-testing has checks if the RPC server on remote host is healthy, but it is not restrictive enough. It checks only GET method on host given in the config. Sometimes entire state sync is failing with the following error:

2024-11-11T22:39:15.918+0100	ERROR	core	node/node.go:203	problem starting blockchain	{"error": "failed to start state sync: failed to set up light client state provider: post failed: Post \"http://<some-server>:26657\": context deadline exceeded"}

We should add checks against post method similar that state sync reactor in the tendermint is doing, and disable servers that are not healthy enough.

  • Find timeout used in the statesync engine of the tendermint
  • Check what post data is sent to the remote rpc server and use similar in the server health check.
  • Remove all unhealthy servers from the statesync.rpc_servers in the tendermint config.
@daniel1302 daniel1302 added the bug Something isn't working label Nov 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant