Encode JSON encode on write rather than read #257

robacourt · 2024-08-06T19:50:16Z

JSON encoding on write rather than read to reduce memory footprint. For example the memory use of an initial sync of a 35MB/200k row table can be reduced from 50MB to 25MB on the first initial sync (which includes the encoding and the writing to storage) and to 6MB on subsequent initial syncs (where we just read from storage and there's no encoding).

This change also allows some simplifications, so I have included them as refactoring in this PR:

Logic to do with the structure and how to create Log Items has been consolidated to a new module LogItems
The prepared_change intermediate state has been removed. The transformation Changes > list(prepared_change) > LogItems now is simply Changes > LogItems which makes the code more readable ( prepared_change was a 5 tuple rather than a nicely labeled map) and easier to reason about (since there's one less data structure to worry about)
The snapshot row intermediate state has a few less references, ideally I'd like to get encapsulate it into a single module, but for now it's mainly Shapes.Querying that creates it and LogItems.from_snapshot_row/4 that reads it.
Some logic duplicated in InMemoryStorage and CubDbStorage has been consolidated.

I've kept the refactoring in a separate commit to the functional change to aid with reviewing this PR.

KyleAMathews · 2024-08-06T20:09:45Z

Reminder to move this PR to electric — I'm going to close all the PRs tomorrow 🙏

robacourt force-pushed the rob/json-encode-on-write branch from 9522e83 to 15189e4 Compare August 6, 2024 19:53

robacourt marked this pull request as draft August 6, 2024 19:55

robacourt added 2 commits August 7, 2024 08:43

Encode on write

cec6940

Extract LogItem code to remove duplication

46d9554

robacourt force-pushed the rob/json-encode-on-write branch from 15189e4 to 46d9554 Compare August 7, 2024 07:43

robacourt closed this Aug 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Encode JSON encode on write rather than read #257

Encode JSON encode on write rather than read #257

robacourt commented Aug 6, 2024 •

edited

Loading

KyleAMathews commented Aug 6, 2024

Encode JSON encode on write rather than read #257

Encode JSON encode on write rather than read #257

Conversation

robacourt commented Aug 6, 2024 • edited Loading

KyleAMathews commented Aug 6, 2024

robacourt commented Aug 6, 2024 •

edited

Loading