Added batching using DB. #724

piohei · 2024-05-07T06:43:22Z

This is a first step to introduce batches and transactions table. This is not HA solution yet - just the first step towards it.

Motivation

Make small changes instead of one big.

Solution

Now processing batches is changed into 2 steps. First one is to create a batch (select identities, leaf indexes, etc.). Then the batch is saved in database. Second step is reading data from database (batch) and executes it. Proper batch execution includes storing transaction id in database.

As an alternative solution we could save in database a batch with a proves (no need to calculate it later). It will allow second step (transaction creation/execution) to not use any tree at all. It will require more memory used in database (it can be cleared from time to time as we don't need all the batches forever) but will make HA solution much easier.

PR Checklist

Added Tests
Added Documentation

piohei · 2024-05-07T07:30:52Z

src/identity_tree.rs

@@ -263,6 +263,9 @@ impl BasicTreeOps for TreeVersionData<lazy_merkle_tree::Derived> {
    fn update(&mut self, leaf_index: usize, element: Hash) {
        let updated_tree = self.tree.update(leaf_index, &element);

+        if self.tree.root() == updated_tree.root() {


I added it here because of recreating tree on startup. It is hard to easily select from database proper groups of commitments. There are three options:

As done change the code to not apply updates that don't change the tree at all. Why? Because then we are peeking changes to create a batch that are not changing anything and we can't put such a batch in database (unique/fk constraints).

Instead of doing it here we can peek the "no-changing" changes and just not save them in database.

Add additional internal status just for this "batched but not processed yet" situation.

Do more queries on startup but may use much more memory as we are loading almost all commitments. :/

piohei · 2024-05-07T07:31:02Z

src/identity_tree.rs

@@ -280,7 +283,25 @@ impl BasicTreeOps for TreeVersionData<lazy_merkle_tree::Derived> {
    fn apply_diffs(&mut self, mut diffs: Vec<AppliedTreeUpdate>) {
        let last = diffs.last().cloned();

-        self.metadata.diff.append(&mut diffs);
+        if !diffs.is_empty() {


Same as above.

piohei · 2024-05-07T07:32:22Z

src/identity_tree.rs

-    latest:    TreeVersion<Latest>,
+    mined:              TreeVersion<Canonical>,
+    processed:          TreeVersion<Intermediate>,
+    processed_batching: TreeVersion<Intermediate>,


This additional tree is required to generate proofs. We need a tree in a state when root is same as prev_root in batch. We can use batching tree as it is updated to create new batches (with proper leaf_indexes).

Dzejkop · 2024-05-07T11:32:06Z

src/database/mod.rs

+                batches.id as id,
+                batches.next_root as next_root,
+                batches.prev_root as prev_root,
+                batches.created_at as created_at,


AFAIK sqlx handles fully qualified names, so you can leave this (and others) as batches.created_at

I'm sure I have done it due to created_at conflict but will reassure myself. :)

Dzejkop · 2024-05-07T11:55:44Z

src/database/mod.rs

+        let batches = tx.get_all_batches_after(batch_entry.id).await?;
+        let mut result = vec![];
+        for batch in batches {
+            match batch.batch_type {


Can't we have the same code for both cases? In case for deletion we replace *batch.commitments.0.get(i).unwrap() with Hash::ZERO but *batch.commitments.0.get(i).unwrap() should be zero anyway, no?

This reverts commit f4fcc03.

piohei force-pushed the piohei/ha_step_1 branch from 81adc4d to cfff3ef Compare May 7, 2024 06:57

piohei commented May 7, 2024

View reviewed changes

piohei marked this pull request as ready for review May 7, 2024 07:32

piohei requested a review from a team as a code owner May 7, 2024 07:32

Dzejkop reviewed May 7, 2024

View reviewed changes

piohei force-pushed the piohei/ha_step_1 branch 2 times, most recently from 0c61b0c to b38c59f Compare May 9, 2024 05:04

piohei changed the title ~~Added batching using DB. Additional tree added to have proper state on startup.~~ Added batching using DB. May 9, 2024

Added batching using DB.

ad256dc

piohei force-pushed the piohei/ha_step_1 branch from b38c59f to ad256dc Compare May 9, 2024 07:10

Dzejkop approved these changes May 9, 2024

View reviewed changes

Dzejkop merged commit f4fcc03 into worldcoin:main May 9, 2024
3 checks passed

Dzejkop added a commit that referenced this pull request May 14, 2024

Revert "Added batching using DB. (#724)"

7a3a8d5

This reverts commit f4fcc03.

Dzejkop mentioned this pull request May 14, 2024

Revert "Added batching using DB." #729

Merged

Dzejkop added a commit that referenced this pull request May 14, 2024

Revert "Added batching using DB. (#724)" (#729)

20b9efd

This reverts commit f4fcc03.

piohei deleted the piohei/ha_step_1 branch June 24, 2024 11:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added batching using DB. #724

Added batching using DB. #724

piohei commented May 7, 2024

piohei May 7, 2024

piohei May 7, 2024

piohei May 7, 2024

Dzejkop May 7, 2024

piohei May 7, 2024

Dzejkop May 7, 2024

Added batching using DB. #724

Added batching using DB. #724

Conversation

piohei commented May 7, 2024

Motivation

Solution

PR Checklist

piohei May 7, 2024

Choose a reason for hiding this comment

piohei May 7, 2024

Choose a reason for hiding this comment

piohei May 7, 2024

Choose a reason for hiding this comment

Dzejkop May 7, 2024

Choose a reason for hiding this comment

piohei May 7, 2024

Choose a reason for hiding this comment

Dzejkop May 7, 2024

Choose a reason for hiding this comment