Add per-operation timing to segment_current_trips using ect.Timer #990

TeachMeTW · 2024-11-01T22:31:30Z

Wrapped each significant operation within the segment_current_trips function with ect.Timer context managers.
Named each timer using the pattern ecwp.PipelineStages.TRIP_SEGMENTATION.name + "/operation" for consistent identification.
After each timed block, recorded the elapsed time by calling esds.store_pipeline_time with the appropriate parameters.
Ensured that only timing-related code was added without altering existing logic, error handling, or formatting.

This enhancement enables granular performance monitoring of the trip segmentation process, allowing for better identification of potential bottlenecks and optimization opportunities.

TeachMeTW · 2024-11-01T22:33:29Z

@shankari Please review;

added a timer to the "ad-hoc" pipeline step ✅
add timers to parts of the trip segmentation, with suffixes ✅
tested with a user intake, updates show when querying ✅

shankari · 2024-11-02T02:38:37Z

@TeachMeTW Thank you for the clean PR!

I don't think that the trip segmentations stats are fine-grained enough.
Did you look at the logs/stats from the "tested with a user intake, updates show when querying"?
My expectation is that the create_places_and_trips timer will account for the bulk of the overall time for the step (e.g. if the step takes 100ms, create_places_and_trips will account for 95ms). And you don't have any timers within create_places_and_trips, so you won't know where we are spending the time, and what we should try to optimize.
Further, I don't think we need timers for simple if/then statements - e.g. the below

        elif len(segmentation_points) == 0:
            with ect.Timer(ecwp.PipelineStages.TRIP_SEGMENTATION.name + "/early_return_no_segmentation") as timer_early_return_no_segmentation:
                # no new segments, no need to keep looking at these again
                logging.debug("len(segmentation_points) == 0, early return")
                epq.mark_segmentation_done(user_id, None)
            esds.store_pipeline_time(
                user_id,
                ecwp.PipelineStages.TRIP_SEGMENTATION.name + "/early_return_no_segmentation",
                time.time(),
                timer_early_return_no_segmentation.elapsed
            )

There are literally two statements within the timer in the code snippet above - a log statement and a mark_segmentation_done, neither of which are expected to be heavyweight.

I would:

look at the logs from running the single user intake,
correlate that to the trip segmentation codebase, and
add timers for the sections of code (within create_places_and_trips) that are taking the most time.

as a nice-to-have, can you also split commit into two:

easy: one for adding instrumentation to the _get_and_store_range?
more complex: one for adding instrumentation to trip_segmentation?

You can even create two PRs so I can merge the easy one while reviewing the more complex one.
Feel free to close this PR and create two new ones if that makes it easier!

shankari · 2024-11-02T23:28:28Z

@TeachMeTW is there an ETA for addressing my comments?
I see that you have split the commit, but I still don't see the "easy: one for adding instrumentation to the _get_and_store_range?" PR and you haven't addressed my comment around "I don't think that the trip segmentations stats are fine-grained enough." in this PR.

In particular, "easy: one for adding instrumentation to the _get_and_store_range?" is the main change that we want to get into production soon.

TeachMeTW · 2024-11-03T00:28:07Z

@TeachMeTW is there an ETA for addressing my comments? I see that you have split the commit, but I still don't see the "easy: one for adding instrumentation to the _get_and_store_range?" PR and you haven't addressed my comment around "I don't think that the trip segmentations stats are fine-grained enough." in this PR.

In particular, "easy: one for adding instrumentation to the _get_and_store_range?" is the main change that we want to get into production soon.

It is almost finished but I need to test it; is there a uuid in ca_ebike that has trips that are segmentable? How do I search for that? The tests I've done stops at:

    if len(loc_df) == 0:
        # no new segments, no need to keep looking at these again
        logging.debug("len(loc_df) == 0, early return")
        epq.mark_segmentation_done(user_id, None)
        return

because there are no new segments hence I cannot see if the fine grained timers are being called.

shankari · 2024-11-03T00:37:26Z

you can reset the pipeline using bin/reset_pipeline.py

TeachMeTW · 2024-11-03T03:46:48Z

@shankari I have added more timings, specifically for create_places_and_trips

TeachMeTW · 2024-11-03T20:48:03Z

@shankari ready for feedback; is this in depth timers enough; I also only noticed 3 things that take the most times; filter methods, create_places_and_trips, single/multi filter

emission/pipeline/intake_stage.py

shankari

@TeachMeTW no, this doesn't actually address my comments.

And you don't have any timers within create_places_and_trips, so you won't know where we are spending the time, and what we should try to optimize.

I miswrote; I would actually expect most of the time to be in segment_into_trips. I think that is what you found as well.

From #990 (comment)

I also only noticed 3 things that take the most times; filter methods, create_places_and_trips, single/multi filter

what are those times?

I would expect to see fine-grained instrumentation in segment_into_trips as well.

Note also that this has broken several tests.

As I said earlier, it looks like this will take some time to implement, so please move
#990 (comment)
to a separate PR

emission/analysis/intake/segmentation/trip_segmentation.py

TeachMeTW · 2024-11-04T20:49:35Z

@shankari Addressed comments

shankari · 2024-11-05T18:18:21Z

@TeachMeTW tests are still failing

#990 (review)

Note also that this has broken several tests.

I am not going to review unless the tests are passing.

shankari · 2024-11-05T18:51:44Z

@JGreenlee since this is no longer the weekend, can you review this before it comes to me?

TeachMeTW · 2024-11-05T20:17:14Z

@JGreenlee Please review, tests are passing

JGreenlee

This looks pretty clean! We should be able to assess bottlenecks in segmentation; I think this will allow for even more granular instrumentation than I had in mind.

emission/analysis/intake/segmentation/trip_segmentation.py

TeachMeTW · 2024-11-05T23:40:56Z

@shankari @JGreenlee and I discovered a confusing behavior with the filters. I tested with both an ios and android user -- but the main issue stems from the fact that both only use the distance filter, while the time filter is left unused. What are your thoughts? We are both stumped in regards to this. @JGreenlee can add more to this thread if I have not covered everything.

shankari · 2024-11-05T23:58:19Z

I tested with both an ios and android user -- but the main issue stems from the fact that both only use the distance filter, while the time filter is left unused.

I would like to see proof of this. Please use <details> to avoid a wall of text and strip out any identifying information before posting

JGreenlee · 2024-11-06T00:22:16Z

There is no bug. We were testing against 2 iOS users when we thought it was 1 Android and 1 iOS user.
I was correct that the reason @TeachMeTW wasn't seeing the time filter be used is because he was testing a user on one platform and needed to test the other platform. However, I mixed it up and thought android -> distance and ios -> time, when it is actually the opposite.
After choosing an Android user, @TeachMeTW was indeed able to see readings from the time filter.

TeachMeTW · 2024-11-06T18:15:03Z

@JGreenlee Please review the two changes. I do not believe the failing ubuntu test has anything to do with the changes I made.

...ion/analysis/intake/segmentation/trip_segmentation_methods/dwell_segmentation_dist_filter.py

JGreenlee · 2024-11-07T21:47:19Z

Noted that tests are passing but workflow fails due to e-mission/e-mission-docs#1097

fixed dist name Added name to time

TeachMeTW · 2024-11-08T18:18:42Z

@JGreenlee please re review

JGreenlee

I think this is fine to merge.
Some of these Timer blocks might still be overkill, but it's not easy to tell without digging into every function called. We can always remove some later after assessing on prod and seeing what is and isn't a bottleneck.
At this stage, better too granular than not granular enough

shankari

I agree with @JGreenlee that more fine-grained is better than less, but I also want to point out that there is a cost to storing a lot of stats in terms of database usage. This is particularly true for the pipeline stats which are generated ~ every hour, as opposed to the admin dashboard stats, which are only generated when the admin user logs in to the dashboard.

I can deploy this to staging, but we should use the staging results to strip out the bottom 89-90% of stats before we move to production (even the limited 3 environment production). We can add them back if we resolve all the issues with the top 10-20% of readings!

In particular, I do not anticipate that any of the stats below 👇 will be relevant. They are literally just creating python objects.

shankari · 2024-11-09T00:41:01Z

emission/analysis/intake/segmentation/trip_segmentation.py

+        ts = esta.TimeSeries.get_time_series(user_id)
+    esds.store_pipeline_time(user_id, ecwp.PipelineStages.TRIP_SEGMENTATION.name + "/get_time_series", time.time(), t_get_time_series.elapsed)


creating a python object

shankari · 2024-11-09T00:42:15Z

emission/analysis/intake/segmentation/trip_segmentation.py

+    with ect.Timer() as t_get_time_range:
+        time_query = epq.get_time_range_for_segmentation(user_id)
+    esds.store_pipeline_time(user_id, ecwp.PipelineStages.TRIP_SEGMENTATION.name + "/get_time_range_for_segmentation", time.time(), t_get_time_range.elapsed)


simple database query

shankari · 2024-11-09T00:42:39Z

emission/analysis/intake/segmentation/trip_segmentation.py

+        dstfsm = dstf.DwellSegmentationTimeFilter(time_threshold=5 * 60,  # 5 mins
+                                                 point_threshold=9,
+                                                 distance_threshold=100)  # 100 m
+    esds.store_pipeline_time(user_id, ecwp.PipelineStages.TRIP_SEGMENTATION.name + "/create_time_filter", time.time(), t_create_time_filter.elapsed)
+


created python object

shankari · 2024-11-09T00:42:52Z

emission/analysis/intake/segmentation/trip_segmentation.py

+    with ect.Timer() as t_create_dist_filter:
+        dsdfsm = dsdf.DwellSegmentationDistFilter(time_threshold=10 * 60,  # 10 mins
+                                                 point_threshold=9,
+                                                 distance_threshold=50)  # 50 m
+    esds.store_pipeline_time(user_id, ecwp.PipelineStages.TRIP_SEGMENTATION.name + "/create_dist_filter", time.time(), t_create_dist_filter.elapsed)


created python object

TeachMeTW force-pushed the Add-Lower-Level-Timings branch 2 times, most recently from 013d099 to b8f5523 Compare November 2, 2024 03:14

TeachMeTW force-pushed the Add-Lower-Level-Timings branch from a236c62 to fe21f7b Compare November 3, 2024 03:44

shankari requested changes Nov 4, 2024

View reviewed changes

emission/pipeline/intake_stage.py Outdated Show resolved Hide resolved

TeachMeTW mentioned this pull request Nov 4, 2024

added instrumentation to the _get_and_store_range #991

Merged

TeachMeTW force-pushed the Add-Lower-Level-Timings branch from 899c0cb to fe21f7b Compare November 4, 2024 06:35

shankari requested changes Nov 4, 2024

View reviewed changes

emission/analysis/intake/segmentation/trip_segmentation.py Outdated Show resolved Hide resolved

emission/analysis/intake/segmentation/trip_segmentation.py Outdated Show resolved Hide resolved

TeachMeTW force-pushed the Add-Lower-Level-Timings branch from fe21f7b to 6d798c6 Compare November 4, 2024 20:48

TeachMeTW closed this Nov 5, 2024

TeachMeTW force-pushed the Add-Lower-Level-Timings branch from 6d798c6 to c8e8080 Compare November 5, 2024 18:26

TeachMeTW reopened this Nov 5, 2024

TeachMeTW closed this Nov 5, 2024

TeachMeTW force-pushed the Add-Lower-Level-Timings branch from 5c916ed to c8e8080 Compare November 5, 2024 18:49

Segment_Current_Trips

7b5ef4f

TeachMeTW reopened this Nov 5, 2024

JGreenlee reviewed Nov 5, 2024

View reviewed changes

emission/analysis/intake/segmentation/trip_segmentation.py Show resolved Hide resolved

emission/analysis/intake/segmentation/trip_segmentation.py Outdated Show resolved Hide resolved

TeachMeTW force-pushed the Add-Lower-Level-Timings branch from 09c4fdc to f4f6d19 Compare November 5, 2024 21:03

TeachMeTW force-pushed the Add-Lower-Level-Timings branch 3 times, most recently from cc2a352 to 7b5ef4f Compare November 6, 2024 03:35

Time Filter

0382abf

JGreenlee reviewed Nov 7, 2024

View reviewed changes

...ion/analysis/intake/segmentation/trip_segmentation_methods/dwell_segmentation_dist_filter.py Outdated Show resolved Hide resolved

TeachMeTW force-pushed the Add-Lower-Level-Timings branch from 09b3800 to c48a655 Compare November 8, 2024 00:04

Dist Filter

1a5a0d9

fixed dist name Added name to time

TeachMeTW force-pushed the Add-Lower-Level-Timings branch from c48a655 to 1a5a0d9 Compare November 8, 2024 00:06

JGreenlee approved these changes Nov 8, 2024

View reviewed changes

shankari approved these changes Nov 9, 2024

View reviewed changes

shankari merged commit 3900d3c into e-mission:master Nov 9, 2024
4 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add per-operation timing to segment_current_trips using ect.Timer #990

Add per-operation timing to segment_current_trips using ect.Timer #990

TeachMeTW commented Nov 1, 2024

TeachMeTW commented Nov 1, 2024

shankari commented Nov 2, 2024 •

edited

Loading

shankari commented Nov 2, 2024

TeachMeTW commented Nov 3, 2024

shankari commented Nov 3, 2024

TeachMeTW commented Nov 3, 2024

TeachMeTW commented Nov 3, 2024

shankari left a comment

TeachMeTW commented Nov 4, 2024

shankari commented Nov 5, 2024

shankari commented Nov 5, 2024

TeachMeTW commented Nov 5, 2024

JGreenlee left a comment

TeachMeTW commented Nov 5, 2024

shankari commented Nov 5, 2024 •

edited

Loading

JGreenlee commented Nov 6, 2024

TeachMeTW commented Nov 6, 2024

JGreenlee commented Nov 7, 2024

TeachMeTW commented Nov 8, 2024

JGreenlee left a comment

shankari left a comment

shankari Nov 9, 2024

shankari Nov 9, 2024

shankari Nov 9, 2024

shankari Nov 9, 2024

		ts = esta.TimeSeries.get_time_series(user_id)
		esds.store_pipeline_time(user_id, ecwp.PipelineStages.TRIP_SEGMENTATION.name + "/get_time_series", time.time(), t_get_time_series.elapsed)

Add per-operation timing to segment_current_trips using ect.Timer #990

Add per-operation timing to segment_current_trips using ect.Timer #990

Conversation

TeachMeTW commented Nov 1, 2024

TeachMeTW commented Nov 1, 2024

shankari commented Nov 2, 2024 • edited Loading

shankari commented Nov 2, 2024

TeachMeTW commented Nov 3, 2024

shankari commented Nov 3, 2024

TeachMeTW commented Nov 3, 2024

TeachMeTW commented Nov 3, 2024

shankari left a comment

Choose a reason for hiding this comment

TeachMeTW commented Nov 4, 2024

shankari commented Nov 5, 2024

shankari commented Nov 5, 2024

TeachMeTW commented Nov 5, 2024

JGreenlee left a comment

Choose a reason for hiding this comment

TeachMeTW commented Nov 5, 2024

shankari commented Nov 5, 2024 • edited Loading

JGreenlee commented Nov 6, 2024

TeachMeTW commented Nov 6, 2024

JGreenlee commented Nov 7, 2024

TeachMeTW commented Nov 8, 2024

JGreenlee left a comment

Choose a reason for hiding this comment

shankari left a comment

Choose a reason for hiding this comment

shankari Nov 9, 2024

Choose a reason for hiding this comment

shankari Nov 9, 2024

Choose a reason for hiding this comment

shankari Nov 9, 2024

Choose a reason for hiding this comment

shankari Nov 9, 2024

Choose a reason for hiding this comment

shankari commented Nov 2, 2024 •

edited

Loading

shankari commented Nov 5, 2024 •

edited

Loading