
Vision camera v3 #1121

Merged: jtklein merged 92 commits into main from vision-camera-3 on Mar 26, 2024

Conversation

jtklein (Collaborator) commented on Feb 9, 2024

Now that support for tap-to-focus on Android has landed in react-native-vision-camera, this is another attempt to switch from vision-camera v2 to v3.

@jtklein jtklein marked this pull request as ready for review March 13, 2024 09:27
@jtklein jtklein marked this pull request as draft March 13, 2024 13:43
@jtklein jtklein marked this pull request as ready for review March 14, 2024 13:07
@jtklein jtklein requested a review from kueda March 14, 2024 18:18
jtklein (Collaborator, Author) commented on Mar 14, 2024

@kueda I have tested this on debug and release builds, and from what I can tell it works as it should and is now finally a full replacement for vision-camera v2 for all the features we used. I think the code is mostly fine and I would merge it in. However, I have made so many attempts with this library that I mainly need a fresh pair of eyes, so it would be fab if you could test the camera once again with these changes and review the code.
On iOS the experience should stay the same; from what I have tested, the average time it takes to process a frame is the same as with vision-camera v2.
On Android, this PR adds around 400–500 ms to the average time it takes to process one frame. This might be a blocker, or we could merge for iOS and keep iterating on Android. There are some tools available for v3 that are not available for v2 that we could use to speed up processing. For example, I tried replacing the resizing code in our vision-plugin with https://github.com/mrousavy/vision-camera-resize-plugin, and that did bring the average time back down.
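For context, here is a minimal sketch of how the resize plugin could slot into a v3 frame processor. This is not the code in this PR: `classifyFrame` is a hypothetical stand-in for our vision-plugin call, and the target size and pixel format are placeholder values based on the resize plugin's documented options.

```ts
// Sketch only: assumes vision-camera v3's useFrameProcessor hook and the
// useResizePlugin hook documented by vision-camera-resize-plugin.
import { useFrameProcessor } from "react-native-vision-camera";
import { useResizePlugin } from "vision-camera-resize-plugin";

// Hypothetical stand-in for our vision-plugin's classifier call.
declare function classifyFrame(pixels: unknown): { taxonId: number; score: number }[];

function useClassifierFrameProcessor() {
  const { resize } = useResizePlugin();

  return useFrameProcessor(
    (frame) => {
      "worklet";
      // Downscale natively before running the model; the target size and
      // pixel format here are placeholders, not production values.
      const resized = resize(frame, {
        scale: { width: 299, height: 299 },
        pixelFormat: "rgb",
        dataType: "uint8",
      });
      const predictions = classifyFrame(resized);
      // ...hand predictions back to JS, e.g. via a worklets-core shared value.
    },
    [resize]
  );
}
```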

kueda (Member) commented on Mar 14, 2024

I'll give it a test this afternoon, @jtklein. Any idea where the slowdown in Android is coming from? Is it a problem with vision-camera itself, or perhaps with our frame processor?

kueda (Member) left a comment

Mostly this seems to work fine, with one blocking and one non-blocking issue.

Blocking: a prediction in the ARCamera seems to linger on screen for at least 5 seconds after moving away from that subject, even when you use the debug tool to change numStoredResults to 1. So it feels like the default behavior here has changed, and that numStoredResults is not having an effect.

Non-blocking: on Android (Pixel 8, Android 14), if I walk around for a few minutes with the ARCamera open identifying some plants, I get this crash fairly consistently: [screenshot: Screenshot_20240314-134113]. I don't think this needs to block for MVP since it's only on Android, but it is fairly annoying for an Android user like me.

jtklein (Collaborator, Author) commented on Mar 25, 2024

@kueda there was indeed a bug with numStoredResults. I have updated the branch: if you set numStoredResults to 1, you should now always see the last frame's result. So if you move from one object to the next, the result should change pretty quickly (depending on the speed of frame processing, though).
I have also added the age of the last result to the screen in debug mode. For example, if you have 4 results stored and you move from a species with a high prediction score to one with a lower score, you will see the age of the result ticking up until the new species becomes the best result. When the second species' score is lower, it will still take quite some time for the result to change, depending on the phone's speed.
A prediction in the ARCamera might still linger on screen for longer than 5 seconds after moving away from that subject. I chatted with Alex a bit about that, and one mitigation I have now implemented is a linear weighting of the frames' results, so older frames' results are weighted less than newer ones (see the sketch below). I feel this improves the lingering when you move from one subject to the next, although it does not help when moving from a subject to no subject.
However, the long lingering of a prediction should no longer happen if you set numStoredResults to 1.
Also, changing numStoredResults to a lower number should now be possible while keeping the ARCamera open.

All in all, I think I have addressed the blockers fully; let me know what you think.
I did not look at the non-blocking issue.
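Roughly, the linear weighting idea is something like this. This is a simplified sketch, not the actual implementation in this PR; `Prediction` and `combineStoredResults` are made-up names for illustration:

```ts
// Simplified sketch of the linear-weighting idea, not the actual implementation.
type Prediction = { taxonId: number; score: number };

// storedResults holds the per-frame predictions of the last numStoredResults
// frames, oldest first.
function combineStoredResults(storedResults: Prediction[][]): Prediction[] {
  const combined = new Map<number, number>();
  const n = storedResults.length;
  storedResults.forEach((framePredictions, i) => {
    // Linear weight: the oldest frame counts 1/n, the newest counts n/n = 1.
    const weight = (i + 1) / n;
    framePredictions.forEach(({ taxonId, score }) => {
      combined.set(taxonId, (combined.get(taxonId) ?? 0) + weight * score);
    });
  });
  // Highest combined score wins.
  return [...combined.entries()]
    .map(([taxonId, score]) => ({ taxonId, score }))
    .sort((a, b) => b.score - a.score);
}
```

With numStoredResults set to 1, the window only contains the latest frame, which is why the result then changes immediately.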

kueda (Member) left a comment

Good job finding and patching that missing method in react-native-worklets-core! I can confirm that "num stored results" is working, and the age of results thing is useful (though it might benefit from debug styling so everyone knows it's just a debugging feature; doesn't need to block).

I'm still experiencing the Android crash after ~5+ minutes of continuous usage, so that might be worth an issue after you merge.

@jtklein jtklein merged commit 0e0a656 into main Mar 26, 2024
12 of 14 checks passed
@jtklein jtklein deleted the vision-camera-3 branch March 29, 2024 17:35
@jtklein jtklein restored the vision-camera-3 branch March 29, 2024 17:35
@jtklein jtklein deleted the vision-camera-3 branch June 18, 2024 11:46