Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add monitoring terraform module #9

Merged
merged 4 commits into from
Jul 29, 2024
Merged

Conversation

nvtaveras
Copy link
Contributor

@nvtaveras nvtaveras commented Jul 29, 2024

This adds a monitoring module that sets up the monitoring for the watchdog function within GCP. It includes:

  • A log based metric that counts the number of health check events received by the function
  • An alert policy that triggers whenever an event hasn't been emitted for more than 6 hours
  • A notification channel where the alerts above are sent (victorops)

The only "manual" part is the creation of the webhook url which is a one time thing done in victorops and has to be configured as a variable in terraform.

I tested this by temporarily setting the aggregation time of the metric to 5 minutes and disabling the quicknode notification. The alert was correctly sent to victorops and it auto resolved after re-enabling the quicknode notifications.

@nvtaveras nvtaveras merged commit 243a7c7 into main Jul 29, 2024
1 check passed
@nvtaveras nvtaveras deleted the feat/infraMonitoring branch July 29, 2024 23:44
chapati23 pushed a commit that referenced this pull request Aug 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant