docs: Observability of Envoy Gateway Component (not Envoy) #3404

Closed
nezdolik opened this issue May 16, 2024 · 6 comments · Fixed by #3446
Labels
area/observability Observability related issues documentation Improvements or additions to documentation

Comments

@nezdolik
Member

Observability of Envoy Gateway Component (not Envoy):

To get full observability of the EG system and make it easier to troubleshoot various failure modes, are there EG-native metrics (not Envoy proxy metrics) that give insight into EG subcomponents, such as:

  • Total/failed to translate resources (Resource translator)
  • Total/failed to translate xDS resources (xDS translator)
  • Total/failed to translate Infra resources (Infra translator)

Introducing such metrics (if they are not yet supported) would make it easier for EG operators to troubleshoot various failure scenarios and would increase confidence in production deployments.

I could not find any mention of EG-native metrics in the user docs.

@shawnh2 shawnh2 added area/observability Observability related issues and removed triage labels May 16, 2024
@shawnh2
Contributor

shawnh2 commented May 16, 2024

Hi @nezdolik

EG metrics are still a work in progress, tracked by #2092. For now, the latest version already supports some metrics.

For how to use EG metrics, see: https://gateway.envoyproxy.io/latest/tasks/observability/gateway-observability/

Total/failed to translate resources (Resource translator)
Total/failed to translate xDS resources (xDS translator)
Total/failed to translate Infra resources (Infra translator)

These correspond to the following counters:

watchableSubscribedTotal = metrics.NewCounter("watchable_subscribed_total", "Total number of subscribed watchable.")
watchableSubscribedErrorsTotal = metrics.NewCounter("watchable_subscribed_errors_total", "Total number of subscribed watchable errors.")

These are reported with a different runner label for each component, for example:

[image: the counters above with different runner labels]
  • the gateway-api runner is for the Resource translator
  • the xds-translator runner is for the xDS translator
  • the infrastructure runner is for the Infra translator
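
As a rough illustration of how the two counters and the runner label fit together, here is a minimal Go sketch that reads Prometheus text-format samples and sums the counters per runner. It assumes the counters are exposed under exactly the names shown above, which may not match what a real deployment emits. The sample input, the `runnerStats` type, and the `parseRunnerStats` and `errorRatio` helpers are all hypothetical and not part of Envoy Gateway.

```go
package main

import (
	"bufio"
	"fmt"
	"regexp"
	"strconv"
	"strings"
)

// sampleLine matches one text-format sample that carries a runner label,
// e.g. watchable_subscribed_total{runner="gateway-api"} 12
var sampleLine = regexp.MustCompile(`^(\w+)\{[^}]*runner="([^"]+)"[^}]*\}\s+(\S+)`)

// runnerStats holds the two counters for a single runner (hypothetical type).
type runnerStats struct {
	Total  float64
	Errors float64
}

// errorRatio returns Errors/Total, or 0 when nothing has been counted yet.
func (s runnerStats) errorRatio() float64 {
	if s.Total == 0 {
		return 0
	}
	return s.Errors / s.Total
}

// parseRunnerStats scans text-format metrics and sums the two counters per runner.
// Lines that do not match, and metrics with other names, are ignored.
func parseRunnerStats(text string) map[string]runnerStats {
	stats := map[string]runnerStats{}
	sc := bufio.NewScanner(strings.NewReader(text))
	for sc.Scan() {
		m := sampleLine.FindStringSubmatch(strings.TrimSpace(sc.Text()))
		if m == nil {
			continue
		}
		v, err := strconv.ParseFloat(m[3], 64)
		if err != nil {
			continue
		}
		s := stats[m[2]]
		switch m[1] {
		case "watchable_subscribed_total":
			s.Total += v
		case "watchable_subscribed_errors_total":
			s.Errors += v
		default:
			continue
		}
		stats[m[2]] = s
	}
	return stats
}

func main() {
	// Made-up sample values, for illustration only.
	sample := `# TYPE watchable_subscribed_total counter
watchable_subscribed_total{runner="gateway-api"} 10
watchable_subscribed_errors_total{runner="gateway-api"} 2
watchable_subscribed_total{runner="xds-translator"} 8
watchable_subscribed_total{runner="infrastructure"} 4
`
	for runner, s := range parseRunnerStats(sample) {
		fmt.Printf("%s: total=%v errors=%v ratio=%.2f\n", runner, s.Total, s.Errors, s.errorRatio())
	}
}
```

In practice the same per-runner breakdown would more likely come from a query in the monitoring backend; the sketch only shows that the runner label is what separates the three translators.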

@arkodg arkodg added the documentation Improvements or additions to documentation label May 16, 2024
@arkodg
Contributor

arkodg commented May 16, 2024

Converting this into a docs issue, because these metrics exist but are not documented.

@arkodg arkodg changed the title Observability of Envoy Gateway Component (not Envoy) docs: Observability of Envoy Gateway Component (not Envoy) May 16, 2024
@arkodg arkodg added this to the v1.1.0-rc1 milestone May 16, 2024
@arkodg
Contributor

arkodg commented May 16, 2024

@Xunzhuo @zirain @shawnh2 is there a tool or a set of tags to easily convert metrics into docs (.md) that we can publish on the docs website?

@shawnh2
Contributor

shawnh2 commented May 17, 2024

Did some research and couldn't find any tool like that.

Since the metrics don't change very often, maybe we could write the doc ourselves.

@arkodg
Contributor

arkodg commented May 17, 2024

Sure @shawnh2, let's start with that.

@shawnh2
Contributor

shawnh2 commented May 20, 2024

Will file a doc after #3393 lands.
