Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add data-test.sh #26

Draft
wants to merge 15 commits into
base: main
Choose a base branch
from
Draft

Add data-test.sh #26

wants to merge 15 commits into from

Conversation

dostrelith678
Copy link
Member

@dostrelith678 dostrelith678 commented Apr 6, 2022

Description

Adds data-tests integration for HAproxy to run advanced test for servers in topology of monitoring nodes. First determines a healthy node to use for comparison from a list of trusted nodes, then tests each server in the topology against it.
If the tested node does not pass tests it is marked for maintenance (MAINT) until next run when it could pass. If a MAINT node passes the data tests, it is brought back to READY state.

To-do:

  • Add cron job setup for running these tests automatically (1 hour)

Where should the reviewer start?

Motivation and context

Which issue it fixes?

How has this been tested?

Copy link
Contributor

@rdlrt rdlrt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added few comments for discussion

tests/data-test.sh Outdated Show resolved Hide resolved
tests/data-test.sh Outdated Show resolved Hide resolved
tests/data-test.sh Show resolved Hide resolved
@rdlrt rdlrt linked an issue Apr 7, 2022 that may be closed by this pull request
@dostrelith678
Copy link
Member Author

One thing I am considering is to save the responses from ${HEALTHY_INSTANCE} to use them for comparison instead of re-querying for each server in test topology. Given a lot of servers though, the testing could take a while and responses could change over the course of the script runtime.

@dostrelith678
Copy link
Member Author

dostrelith678 commented Jun 23, 2022

TO-DO: Make sure tests check offline data (asset registry cache / pool_offline_data).

@rdlrt rdlrt added this to the Koios vNext milestone Jun 23, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Todo
Development

Successfully merging this pull request may close these issues.

Data-test monitoring integration
2 participants