Compute system latencies and visualize #157

Open
scottveirs opened this issue Aug 8, 2024 · 1 comment
Labels
2024-hackathon: Goals or issues for the 2024 annual Microsoft hackathon
moderator portal: Issues pertaining to ModeratorPortal at https://aifororcas.azurewebsites.net
notification system: Issues relating to the notification system

Comments

@scottveirs
Member

A potential Key Performance Indicator (KPI) for the OrcaHello system is the time elapsed between the first high-confidence (>50%) AI detection and the notification of real-time end users.

Problem
We don't track OrcaHello system performance, so we have only a rough, intuitive answer to this question:

What is the latency of each step in the ML pipeline, including the humans in the loop?

Background

Here's an initial articulation of the steps where latency could be measured, visualized, and eventually reduced over time:

  1. Time between audio data acquisition at a hydrophone node and moderator notification output. This would be governed by each of:
    a) the duration of each live-streamed audio segment (currently 10 seconds)
    b) the duration of an OrcaHello candidate (currently a 60-sec concatenation of ~6 Orcasound HLS segments)
    c) the amount of time it takes to run the model on all ~2.5 sec sections of the 60-sec candidate
    d) the time it takes to compute mean confidence and a spectrogram for the candidate
    e) the time it takes to issue a notification to a moderator
  2. The delay between when the moderator notification is sent and a moderator validates the candidate
  3. The delay between moderator validation and notification of end-users (currently dictated by the SendGrid integration, but potentially sped up and made cheaper by integration with the general Orcasound notification system)
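
The three steps above could be measured per candidate roughly as follows. This is only a sketch: the `CandidateTimestamps` class and all of its field names are hypothetical (nothing here reflects the actual OrcaHello schema), and it assumes a timestamp is recorded at each step boundary.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class CandidateTimestamps:
    # Hypothetical fields, one per step boundary listed above.
    audio_start: datetime          # start of the first HLS segment in the candidate
    moderator_notified: datetime   # end of step 1
    moderator_validated: datetime  # end of step 2
    end_users_notified: datetime   # end of step 3


def stage_latencies(ts: CandidateTimestamps) -> dict[str, timedelta]:
    """Return the latency of each step plus the end-to-end total."""
    return {
        "acquisition_to_moderator_notification": ts.moderator_notified - ts.audio_start,
        "moderator_response": ts.moderator_validated - ts.moderator_notified,
        "validation_to_end_user_notification": ts.end_users_notified - ts.moderator_validated,
        "total": ts.end_users_notified - ts.audio_start,
    }


# Made-up timestamps, for illustration only.
t0 = datetime(2024, 8, 8, 18, 0, 0)
example = CandidateTimestamps(
    audio_start=t0,
    moderator_notified=t0 + timedelta(seconds=75),
    moderator_validated=t0 + timedelta(seconds=375),
    end_users_notified=t0 + timedelta(seconds=400),
)
for name, delta in stage_latencies(example).items():
    print(f"{name}: {delta.total_seconds():.0f} s")
```

Measured from the start of the audio, step 1 cannot be shorter than the 60-second candidate duration, so sub-steps (a) and (b) set a floor that only shorter candidates could lower.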

For example, a simple subtraction of the two date-times displayed in the moderator portal would be an easy initial metric to display:
[Screenshot: the two date-times displayed in the moderator portal]
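
A minimal sketch of that subtraction, assuming both date-times are available as ISO 8601 strings with a UTC offset (the portal's actual storage format may differ). The function name and the example values are hypothetical.

```python
from datetime import datetime


def moderation_delay_seconds(detected_at: str, moderated_at: str) -> float:
    """Seconds between the detection time and the moderation time."""
    detected = datetime.fromisoformat(detected_at)
    moderated = datetime.fromisoformat(moderated_at)
    return (moderated - detected).total_seconds()


# Made-up values, for illustration only.
delay = moderation_delay_seconds(
    "2024-08-08T18:10:00+00:00",
    "2024-08-08T18:24:37+00:00",
)
print(f"{delay:.0f} s")  # 14 min 37 s -> 877 s
```

Keeping the offset in both strings matters: subtracting an offset-aware datetime from a naive one raises a `TypeError` in Python rather than silently producing a wrong delay.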

Proposed solutions:

  1. Compute some measures of latency and display them in the OrcaHello Dashboard (for the default or requested time period). It might be interesting to plot the metrics for all candidates, and then a breakdown by true positives and false positives. This could be used internally (for authenticated users) or even publicly to promote a friendly competition between moderators. (Of the moderator beta-testers, who is fastest to respond? Who moderates more at night than during the day? How does the team do during holidays vs. not, i.e. when maybe all 3 are distracted with family/travel?)
  2. Compute latency metrics within Azure, then aggregate them with similar measures, e.g. human detection latency within the Orcasound live-listening web app, and then visualize the results in a way that tracks and incentivizes human+machine system performance. One place to do this might be the network status dashboard (drafted in early 2024 by @dthaler) or a similar "high-level" dashboard that has been discussed in past hackathons, see "Add heartbeat/monitoring dashboard for inference system" #88 ...
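
The breakdowns suggested in solution 1 (all candidates, true vs. false positives, per moderator) could be sketched as a grouped median. The record keys `latency_s`, `true_positive`, and `moderator` are hypothetical, and the records themselves are made up for illustration.

```python
from collections import defaultdict
from statistics import median


def median_latency_by(records: list[dict], key: str) -> dict:
    """Group records by `key` and return the median latency (seconds) per group."""
    groups = defaultdict(list)
    for record in records:
        groups[record[key]].append(record["latency_s"])
    return {group: median(values) for group, values in groups.items()}


# Made-up records, for illustration only.
records = [
    {"moderator": "A", "true_positive": True, "latency_s": 100},
    {"moderator": "B", "true_positive": True, "latency_s": 300},
    {"moderator": "A", "true_positive": False, "latency_s": 50},
]

print(median(r["latency_s"] for r in records))      # all candidates
print(median_latency_by(records, "true_positive"))  # true vs. false positives
print(median_latency_by(records, "moderator"))      # per moderator
```

The median is used here rather than the mean because a single overnight response could otherwise dominate the metric; that choice is a suggestion, not something the dashboard currently does.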
@scottveirs added the moderator portal, notification system, and 2024-hackathon labels on Aug 8, 2024
@scottveirs self-assigned this on Aug 8, 2024
@scottveirs
Member Author

For fun, and only for true positives from OrcaHello and/or human listeners, I've been computing some of these metrics in the shared Orcasound event spreadsheet. It may offer some additional latency metric ideas (some of which would require parsing email send/receive times):

[Screenshot: latency metrics computed in the shared Orcasound event spreadsheet]

Projects
Status: Todo