Anomaly detection - peak/trough detection and caching of matrix profle #1081

ram-senth · 2024-08-28T18:40:31Z

This PR has multiple changes:

Trough and peak detection using interquartile range
Caching of matrix profile
Changes to the detect end point to support historic time steps separated from current time steps of a time series

…data

jennmueng · 2024-08-29T22:35:51Z

src/seer/anomaly_detection/anomaly_detection.py

+    @inject
+    @sentry_sdk.trace
+    def _combo_detect(
+        self, ts_with_history: TimeSeriesWithHistory, config: AnomalyDetectionConfig


Missing a = injected for DI?

No, the config param is part of request payload.

I do need to remove the @inject above.

aayush-se · 2024-09-03T15:13:59Z

src/seer/anomaly_detection/anomaly_detection.py

+        transaction_name = (
+            "Stream AD for alert"
            if isinstance(request.context, AlertInSeer)
-            else self._batch_detect(request.context)
+            else (
+                "Stream AD for timeseries with history"
+                if isinstance(request.context, TimeSeriesWithHistory)
+                else "Batch AD for timeseries"
+            )


nit: nested ternary makes this section somewhat hard to follow. Could this be done using a switch or an if/else?

Sure, will change it.

trillville · 2024-09-03T17:32:10Z

tests/seer/anomaly_detection/test_accessors.py

will there be a follow up PR for the peak/trough logic?

Yes I will tweak the peak/trough logic some more based on your experiments in a follow-up PR. Wanted to get this out as frontend work is dependent on the API change.

trillville · 2024-09-03T17:38:25Z

src/seer/anomaly_detection/anomaly_detection.py

-        return timeseries
+        batch_detector = MPBatchAnomalyDetector()
+        anomalies = batch_detector.detect(convert_external_ts_to_internal(timeseries), config)
+        return timeseries, anomalies

    @inject
    @sentry_sdk.trace
    def _online_detect(


i think it would be good to add some brief docstrings to these to clarify the context in which each is used

trillville · 2024-09-03T17:40:22Z

src/seer/anomaly_detection/anomaly_detection.py

+        else:
+            transaction_name = "Batch AD for timeseries"
+
+        with sentry_sdk.start_transaction(op="task", name=transaction_name):


any particular reason why we want a separate transaction here?

This one endpoint caters to three different types of anomaly detection - stateless batch, stateless online and stateful alert based. Each serve a different use case and will have different performance characteristics. So I think it is worth tracking them as different transactions with separate names. Any potential problem with this approach?

trillville · 2024-09-03T17:44:59Z

src/seer/anomaly_detection/detectors/mp_scorers.py

        if np.isnan(mp_dist):
            return "none"
        if mp_dist < threshold_lower:
            return "none"
        if mp_dist < threshold_upper:
            return "anomaly_lower_confidence"
        return "anomaly_higher_confidence"
+
+    def _adjust_flag_for_vicinity(
+        self, flag: AnomalyFlags, ts_value: float, context: npt.NDArray[np.float64]


this logic is pretty difficult to follow without a docstring - specifically what is context and how should i think about it? (i think i know the answer from talking to you but will probably be confused looking at this in 2 months)

trillville

code LGTM - do we feel confident about the peak/trough logic?

ram-senth · 2024-09-03T18:36:13Z

code LGTM - do we feel confident about the peak/trough logic?

The logic that is going in is incrementally better than what we had previously. It does need more tweaking. I will be following up on it immediately and have an updated version out before the internal release.

sentry-io · 2024-09-04T17:12:39Z

Suspect Issues

This pull request was deployed and Sentry observed the following issues:

‼️ IntegrityError: (psycopg.errors.NotNullViolation) column "anomaly_type" of relation "dynamic_alert_time_series" c... 00a7fb4f4911_migration_py in upgrade View Issue
‼️ Exception: Search for optimal window failed. app.store_data_endpoint View Issue

_{Did you find this useful? React with a 👍 or 👎}

ram-senth added 4 commits August 28, 2024 16:50

feat(dynamic alert thresholds): Trough and peak detection

ccfa950

Minor tweak to peak-trough detection logic

fbd6b78

Not using IQR improves performance

fcee886

feat(dynamic alert thresholds): Caching of matrix profile

0a702e7

ram-senth force-pushed the anomaly_detection/ram/support_config_flags branch from f393fef to d773ddb Compare August 29, 2024 05:41

feat(dynamic threshold alerts) Support separated history and current …

b02b6a6

…data

ram-senth force-pushed the anomaly_detection/ram/support_config_flags branch 6 times, most recently from 37507c2 to 17276d6 Compare August 29, 2024 22:10

jennmueng reviewed Aug 29, 2024

View reviewed changes

aayush-se reviewed Sep 3, 2024

View reviewed changes

ram-senth force-pushed the anomaly_detection/ram/support_config_flags branch from 17276d6 to aa84a44 Compare September 3, 2024 17:14

trillville reviewed Sep 3, 2024

View reviewed changes

trillville approved these changes Sep 3, 2024

View reviewed changes

Loading matrix profile from database

b72f2ca

ram-senth force-pushed the anomaly_detection/ram/support_config_flags branch from aa84a44 to b72f2ca Compare September 3, 2024 17:54

Fix error from flask_migrate complaining about multiple heads

3fb276f

corps and others added 4 commits September 3, 2024 11:57

Add debugging output

63c3737

Merge branch 'main' into anomaly_detection/ram/support_config_flags

b61b676

Fix the migration divergence

36f0129

Fix mypy error

3c25960

ram-senth merged commit 7a3ee44 into main Sep 3, 2024
11 checks passed

ram-senth deleted the anomaly_detection/ram/support_config_flags branch September 3, 2024 19:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Anomaly detection - peak/trough detection and caching of matrix profle #1081

Anomaly detection - peak/trough detection and caching of matrix profle #1081

ram-senth commented Aug 28, 2024 •

edited

Loading

jennmueng Aug 29, 2024

ram-senth Aug 30, 2024

ram-senth Aug 30, 2024

ram-senth Sep 3, 2024

aayush-se Sep 3, 2024

ram-senth Sep 3, 2024

ram-senth Sep 3, 2024

trillville Sep 3, 2024

ram-senth Sep 3, 2024

trillville Sep 3, 2024

ram-senth Sep 3, 2024

trillville Sep 3, 2024

ram-senth Sep 3, 2024

trillville Sep 3, 2024

ram-senth Sep 3, 2024

trillville left a comment

ram-senth commented Sep 3, 2024

sentry-io bot commented Sep 4, 2024 •

edited

Loading

Anomaly detection - peak/trough detection and caching of matrix profle #1081

Anomaly detection - peak/trough detection and caching of matrix profle #1081

Conversation

ram-senth commented Aug 28, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

trillville left a comment

Choose a reason for hiding this comment

ram-senth commented Sep 3, 2024

sentry-io bot commented Sep 4, 2024 • edited Loading

Suspect Issues

ram-senth commented Aug 28, 2024 •

edited

Loading

sentry-io bot commented Sep 4, 2024 •

edited

Loading