Add a new service pgss_dealloc #331

anayrat · 2022-09-14T10:12:31Z

Hello!
This conversation on hackers reminded me that pg_stat_statements deallocs should be monitored.

I suggest adding such service.

Cheers

anayrat · 2022-09-14T13:21:47Z

If the number of deallocs per second is too low, we can change it to the number of deallocs per millisecond.

rjuju

I'm a bit dubious about this service. I agree that this is something you should keep your eyes on, but I don't think it would play very well as a check here.

The main problem is that if you schedule the check too frequently it will be a bit useless. For instance, if you schedule it every 5 minutes, how do you distinguish "there was 1 deallocation once and then none" from "there is 1 deallocation every 5 minutes" from this service's point of view? The only way to know if there's really a problem is either:

- the service is constantly raising a problem
- the frequency of the service moving from ok to problem is high

But if you're in the first case it's likely that the global performance will immediately drop by a huge factor, so it's unlikely that you won't notice there's a problem. And the second isn't a good way to spot a problem.

The fact that you only return (and handle thresholds as) a rate and not also the raw number probably exacerbates this problem.

rjuju · 2022-09-16T04:32:34Z

check_pgactivity

+    -exitval => 127
+    ) if @hosts != 1;
+
+    is_compat $hosts[0], 'check_pg_stat_statements_dealloc', $PG_VERSION_140 or exit 1;


Note that having postgres 14 doesn't mean that you updated the pg_stat_statements extension to get the needed field.

Indeed, I will add a test to check pgss' version.

I added several tests:

- pg_stat_statements version must be 1.9 or higher
- pg_stat_statements has been created in the target database
- pg_stat_statements has been loaded in shared_preload_libraries
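
The three prerequisites listed above could be sketched as follows. This is only an illustration in Python, not the Perl code from the PR: the function and constant names are hypothetical, and the two catalog queries are given as one plausible way to collect the inputs. It also assumes `shared_preload_libraries` is a plain comma-separated list.

```python
from typing import List, Optional, Tuple

# Standard PostgreSQL queries a check could use to collect the inputs below.
EXTVERSION_SQL = "SELECT extversion FROM pg_extension WHERE extname = 'pg_stat_statements'"
PRELOAD_SQL = "SHOW shared_preload_libraries"

# Minimum extension version required by the check, as stated in the thread.
MIN_PGSS_VERSION = (1, 9)


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn '1.10' into (1, 10) so that 1.10 compares as newer than 1.9."""
    return tuple(int(part) for part in text.strip().split("."))


def pgss_prerequisite_problems(extversion: Optional[str], preload_libraries: str) -> List[str]:
    """Return the list of unmet prerequisites (an empty list means all good).

    extversion is the result of EXTVERSION_SQL, or None when the query
    returns no row; preload_libraries is the result of PRELOAD_SQL.
    """
    problems = []
    if extversion is None:
        problems.append("pg_stat_statements is not created in the target database")
    elif parse_version(extversion) < MIN_PGSS_VERSION:
        problems.append("pg_stat_statements %s is older than 1.9" % extversion)

    # Assumption: entries are separated by commas, optionally quoted or path-prefixed.
    loaded = [
        name.strip().strip("'\"").rsplit("/", 1)[-1]
        for name in preload_libraries.split(",")
        if name.strip()
    ]
    if "pg_stat_statements" not in loaded:
        problems.append("pg_stat_statements is not in shared_preload_libraries")
    return problems
```

Comparing versions as integer tuples rather than strings or floats matters here, since a string comparison would rank "1.10" below "1.9".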

anayrat · 2022-09-16T08:26:06Z

Yeah, I shouldn't report the rate as perfdata. I will change it to a counter. But for the threshold, I don't see any other way.
We can't use a raw value as a threshold. For example, if you have 1 dealloc every minute, once you reach 100 (if that is your threshold), the check will stay critical even if there are no more deallocations.

My idea is to first graph the dealloc rate. For example, it might show a mean of 100 deallocs per 5 minutes. Then you add a threshold at 500. If you reach this threshold, your workload has changed, and you should understand why the dealloc rate increased before hitting a production issue.
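
To make the argument above concrete, here is a small Python sketch comparing a threshold on the raw cumulative counter with a threshold on the change between two consecutive samples. The sample values and function names are made up for illustration; they are not taken from the PR.

```python
from typing import List


def raw_alerts(samples: List[int], threshold: int) -> List[bool]:
    """Alert whenever the cumulative counter itself is at or above the threshold."""
    return [value >= threshold for value in samples]


def interval_alerts(samples: List[int], threshold: int) -> List[bool]:
    """Alert when the increase between two consecutive samples reaches the threshold."""
    return [
        current - previous >= threshold
        for previous, current in zip(samples, samples[1:])
    ]


# Hypothetical cumulative dealloc counts, one per check run: a slow trickle
# that stops, followed by a sudden burst on the last run.
samples = [0, 40, 80, 120, 120, 120, 700]

print(raw_alerts(samples, 100))       # stays True once 100 is crossed
print(interval_alerts(samples, 500))  # True only for the burst
```

With these numbers the raw threshold keeps alerting after deallocations have stopped, while the per-interval threshold only fires when the workload actually changes.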

ioguix · 2023-11-29T11:40:42Z

Hi guys,

Instead of working on rates, the threshold can apply to the delta since the last call. I think this is what @anayrat explains in his last message, isn't it?

So do you plan to keep working on it, @anayrat?

anayrat · 2024-01-11T11:49:31Z

Hello,
Yes, I can use the delta since the last call as a threshold.
I will work on this.
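
A minimal sketch of a "delta since last call" check, in Python rather than the project's Perl. Everything here is an assumption made for illustration: the JSON state file, the function names, the handling of a counter reset, and the output format (which loosely follows the Nagios plugin convention of a `c` suffix for counters).

```python
import json
import os
from typing import Optional, Tuple

OK, WARNING, CRITICAL = 0, 1, 2  # conventional Nagios plugin exit codes


def dealloc_delta(previous: Optional[int], current: int) -> int:
    """Number of deallocations since the previous check run."""
    if previous is None:
        return 0  # first run: nothing to compare against yet
    if current < previous:
        return current  # assumption: the counter was reset in between
    return current - previous


def evaluate(delta: int, warning: int, critical: int) -> int:
    """Map the delta to a status code using the two thresholds."""
    if delta >= critical:
        return CRITICAL
    if delta >= warning:
        return WARNING
    return OK


def run_check(state_path: str, current: int, warning: int, critical: int) -> Tuple[int, str]:
    """Compare the current counter with the stored one, then store the new value."""
    previous = None
    if os.path.exists(state_path):
        with open(state_path) as handle:
            previous = json.load(handle)["dealloc"]

    delta = dealloc_delta(previous, current)

    with open(state_path, "w") as handle:
        json.dump({"dealloc": current}, handle)

    # Thresholds apply to the delta; the raw counter is exported as perfdata.
    message = "dealloc_delta=%d | dealloc=%dc" % (delta, current)
    return evaluate(delta, warning, critical), message
```

The thresholds apply to the delta while the raw counter is still exported as perfdata, so the graphing tool can derive a rate on its own.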

anayrat · 2024-01-11T15:06:40Z

Hello,
I added more checks, replaced the rate with the dealloc delta and rephrased the service description.
If you want to test, run pgbench with the following script:

\set id random(1,1000000)
set client.id = :id

And enable pg_stat_statements.track_utility with a low pg_stat_statements.max

anayrat added the enhancement label Sep 14, 2022

rjuju reviewed Sep 16, 2022


anayrat force-pushed the pgss-dealloc branch from 523bd0f to 16a7ca6 Compare January 11, 2024 15:00

Add a new service pgss_dealloc

ae75caf

anayrat force-pushed the pgss-dealloc branch from 16a7ca6 to ae75caf Compare January 11, 2024 15:01
