api queue message_stats returns wrong numbers occasionally #10067
You do not mention how you define "correct", but the management UI uses a data source that is updated every 5 seconds by default, and rates are computed using a sliding window, which means that every now and then they can differ between metrics. In addition, while a metric is being collected or served, other things continue to happen in parallel.
This "absurdly high" number is likely a side effect of the sliding window calculation.
Relying on point-in-time rates for making decisions is not a great idea because they can change quickly. Using multiple measurements in a row and removing outliers may work for monitoring.
The recommendation (for many different reasons) is to disable metrics in the management UI and use Prometheus for monitoring. The management UI is fine as a way of getting started and during development, but it has a number of limitations that external monitoring systems do not have.
We have never been able to reproduce the "absurdly high" value, and I don't remember seeing it recently. The rate is reported "correctly" on the next calculation.
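The suggestion to take several measurements in a row and remove outliers could look roughly like the sketch below. It is only an illustration, not something RabbitMQ ships: the function names `filter_outliers` and `robust_rate` are hypothetical, the sample values are made up, and the median/MAD cutoff of 3.5 is an assumed, commonly used default. The samples are assumed to be consecutive values of a rate field such as `message_stats.publish_details.rate` polled from the HTTP API.

```python
from statistics import median


def filter_outliers(samples, threshold=3.5):
    """Drop samples whose modified z-score (median/MAD based) exceeds threshold.

    Hypothetical helper; the 3.5 cutoff is an assumption, not a RabbitMQ setting.
    """
    samples = list(samples)
    if len(samples) < 3:
        # Too few points to judge what an outlier is; keep everything.
        return samples
    med = median(samples)
    # Median absolute deviation: a spread estimate that a single spike cannot distort.
    mad = median(abs(x - med) for x in samples)
    if mad == 0:
        # At least half of the samples equal the median; keep only those values.
        return [x for x in samples if x == med]
    return [x for x in samples if 0.6745 * abs(x - med) / mad <= threshold]


def robust_rate(samples):
    """Return the median of the samples that survive outlier filtering."""
    return median(filter_outliers(samples))


# Made-up example: five consecutive polls, one of them "absurdly high".
polled_rates = [120.0, 118.5, 121.2, 9_500_000.0, 119.8]
print(filter_outliers(polled_rates))
print(robust_rate(polled_rates))
```

With this approach a single anomalous reading is discarded, and since the rate is reported "correctly" on the next calculation, the remaining samples should reflect the actual throughput. How many samples to keep and how often to poll are judgment calls that depend on the workload.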