Improve the Partition module performance #2

DanGould · 2024-09-28T17:33:17Z

Suggestions for Improvement

@nothingmuch has suggested the following improvements in the pull request #1:

The Filter implementations rely on Bloom filters to skip checking negatives. However, the filters are arguably sized inappropriately, specifying a false positive rate for the size of the set, not its corresponding sumset, which may be substantially larger. This is particularly problematic with Bloom filters, as opposed to say cuckoo or quotient filters which are both resizable, and their FPR does not asymptotically approach 100% when close to capacity.
An LFU cache would handle positive matches, avoiding expensive computations (e.g., is_subset_sum).
If adjusting the bit vector-based powerset enumeration to use a grey code instead of 2's complement (i.e., normal integer incrementing), the sums could be adjusted by subtracting or adding one element at a time, allowing the previous sum value to be reused more easily. This might make an LRU cache a better fit than LFU for positive cases.
Filter.contains is invoked repeatedly on a fixed value (left set of SumFilteredPartitionIterator).
The trait object for Filter seems unnecessary, preventing monomorphisation and any optimizations that would enable, and adding indirection in fairly hot code paths.

The text was updated successfully, but these errors were encountered:

nothingmuch · 2024-09-28T18:43:18Z

FWIW the reason I didn't open an issue is (and i need to sleep on this i have zombie brain after long flight) that i think a better approach would be to refactor the iterators so that they have a simpler structure first, then think about performance improvements

DanGould added the enhancement New feature or request label Sep 28, 2024

nothingmuch mentioned this issue Oct 16, 2024

sub-transaction metrics: enumeration, approximation and optimization payjoin/rust-payjoin#366

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve the Partition module performance #2

Improve the Partition module performance #2

DanGould commented Sep 28, 2024

nothingmuch commented Sep 28, 2024

Improve the Partition module performance #2

Improve the Partition module performance #2

Comments

DanGould commented Sep 28, 2024

Suggestions for Improvement

nothingmuch commented Sep 28, 2024