Add `ScaleLimitsAnalyzer` using bluejay to function-runner #351

mssalemi · 2024-09-05T12:09:02Z

#gsd:37875

Addresses:

https://github.com/Shopify/script-service/issues/7502

Description

Current scaling implementation: https://github.com/Shopify/shopify/pull/518219/files#diff-bef9ccfcca30dfc391c6ad9d0a3e0739bf8a2ab27e659473894e4e62d43bad08

We currently scale our function run limits based on the scaleLimits directive, which takes a rate argument and is applied to cartLines. We scale by the length of the field: we multiply the directive rate found on the field's definition by the length of that field (either its array length or its string length) to determine how much to scale our default limits by.
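As a rough illustration of the rate-times-length rule described above, here is a minimal sketch. The `InputValue` enum and both function names are hypothetical stand-ins, not the types used in function-runner, and treating non-list, non-string values as length 1 and measuring strings in bytes are assumptions of this sketch.

```rust
/// Hypothetical stand-in for a field's value in the input JSON.
/// This is NOT the runner's actual type; it only models the two
/// cases the description mentions (arrays and strings).
#[derive(Debug, Clone, PartialEq)]
pub enum InputValue {
    Array(Vec<InputValue>),
    String(String),
    Other,
}

/// Length used for scaling: element count for arrays, byte length for strings.
/// Assumption: any other value counts as length 1.
pub fn scaling_length(value: &InputValue) -> usize {
    match value {
        InputValue::Array(items) => items.len(),
        InputValue::String(s) => s.len(),
        InputValue::Other => 1,
    }
}

/// Scale increment contributed by one field:
/// the directive's rate multiplied by the field's length.
pub fn scale_increment(rate: f64, value: &InputValue) -> f64 {
    rate * scaling_length(value) as f64
}
```

In this sketch, a rate of 0.25 on a list of 8 cart lines yields an increment of 2.0.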

This PR adds an analyzer that determines the maximum scale factor from all fields that carry the scale limits directive. We chose this implementation because we want the ability to add scale limits directives to different fields in the future. Currently we only scale on cart lines.

Notes on Visitor implementation:

The ScaleLimitsAnalyzer is used to dynamically adjust resource limits for function executions based on data provided in GraphQL queries. It does this by analyzing the query and associated input JSON, calculating a scaling factor based on the lengths of fields specified in the query that have an associated @scaleLimits directive.

Avoiding Double Counting
A key feature of the ScaleLimitsAnalyzer is its ability to avoid double counting fields when calculating the scale factor. This is crucial for ensuring that the scaling adjustments are accurate and reflect the true resource needs of the function execution. Here’s how double counting is avoided:

Path Tracking: The analyzer uses a path_stack to keep track of the current path within the query during traversal. This helps in uniquely identifying each field's location within the query structure.

Unique Path Identification: Each field's path, combined with its index in case of arrays, is used to create a unique identifier (PathWithIndex). This ensures that each instance of a field, even if the field name is the same, is treated uniquely based on its context and position in the input data.

Rate Aggregation: When calculating the scale factor, the analyzer aggregates rates using these unique identifiers. If a field appears multiple times in different parts of the query or within nested structures, each occurrence is treated independently based on its unique path and context.

Maximal Increment Strategy: For fields that appear multiple times in the same context (e.g., duplicated fields in a query), the analyzer only considers the maximum scale increment calculated for these fields. This approach ensures that the scale factor reflects the maximum resource requirement for repeated fields without cumulatively adding the increments.
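The bookkeeping described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in src/scale_limits_analyzer.rs: the `IncrementTracker` type, its method names, and the exact shape of `PathWithIndex` are hypothetical, and taking the maximum across all entries at the end follows the description's "maximum scale factor" wording rather than a confirmed implementation detail.

```rust
use std::collections::HashMap;

/// Hypothetical key: the field's path within the query plus the index of
/// the enclosing array element (None when the field is not inside a list).
#[derive(Debug, Clone, PartialEq, Eq, Hash)]
pub struct PathWithIndex {
    pub path: Vec<String>,
    pub index: Option<usize>,
}

/// Hypothetical tracker that mimics the path stack and per-key increments.
#[derive(Default)]
pub struct IncrementTracker {
    path_stack: Vec<String>,
    increments: HashMap<PathWithIndex, f64>,
}

impl IncrementTracker {
    /// Push a field name when traversal enters it.
    pub fn enter_field(&mut self, name: &str) {
        self.path_stack.push(name.to_string());
    }

    /// Pop the field name when traversal leaves it.
    pub fn leave_field(&mut self) {
        self.path_stack.pop();
    }

    /// Record an increment for the field at the current path.
    /// If the same path and index were already seen (for example, a field
    /// selected twice in the query), keep only the larger increment.
    pub fn record(&mut self, index: Option<usize>, increment: f64) {
        let key = PathWithIndex {
            path: self.path_stack.clone(),
            index,
        };
        self.increments
            .entry(key)
            .and_modify(|current| *current = current.max(increment))
            .or_insert(increment);
    }

    /// Number of distinct (path, index) entries recorded so far.
    pub fn distinct_entries(&self) -> usize {
        self.increments.len()
    }

    /// Largest increment over all entries (0.0 when nothing was recorded).
    pub fn max_increment(&self) -> f64 {
        self.increments.values().copied().fold(0.0, f64::max)
    }
}
```

Recording the same path twice leaves a single entry holding the larger increment, so a duplicated field selection does not inflate the result, while a field at a different path gets its own entry.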

Tophat

Can be seen via the tests; it will be implemented in the next PR.

mssalemi · 2024-09-05T12:29:13Z

Cargo.toml

@@ -34,6 +34,9 @@ rust-embed = "8.5.0"
 rmp-serde = "1.3"
 is-terminal = "0.4.13"
 wasmprof = "0.7.0"
+bluejay-parser = { git = "https://github.com/Shopify/bluejay.git", rev = "c7e7c2bfb73c7b4869aa8569c15cd3c4eb48b8bf", features = ["format-errors"] }


Will update after new version of bluejay is released.

src/bluejay_schema_analyzer.rs

src/scale_limits_analyzer.rs

adampetro

🚀

Cargo.toml

jacobsteves

Awesome. Is #349 still an issue with your new double counting strategy? I think so?

mssalemi · 2024-09-06T15:23:48Z

PathWithIndex

No, I believe we can close that (will verify). I added a PathWithIndex in the scale limits analyzer that we can use to uniquely identify fields with the same name and avoid the overcounting :)

I added a test for it in the next PR.

adds bluejay code

f622d82

mssalemi force-pushed the ms.create-bluejay-analyzer branch from 7b99814 to f622d82 September 5, 2024 12:23

mssalemi commented Sep 5, 2024

mssalemi changed the title ~~Adds bluejay analyzer~~ Add ScaleLimitsAnalyzer using bluejay to function-runner Sep 5, 2024

mssalemi mentioned this pull request Sep 5, 2024

[ShopifyVM] Add schema analyzer using blue to function-runner #343

Closed

mssalemi requested review from andrewhassan and adampetro September 5, 2024 13:11

mssalemi marked this pull request as ready for review September 5, 2024 13:12

adampetro reviewed Sep 5, 2024

src/bluejay_schema_analyzer.rs (outdated, resolved)

src/bluejay_schema_analyzer.rs (resolved)

src/scale_limits_analyzer.rs (outdated, resolved)

mssalemi added 2 commits September 5, 2024 12:24

code review updates

7901d98

update bluejay version

4d8384c

adampetro approved these changes Sep 5, 2024

Cargo.toml (resolved)

mssalemi requested a review from jacobsteves September 5, 2024 16:53

jacobsteves approved these changes Sep 6, 2024

mssalemi merged commit 6241b8e into main Sep 6, 2024
5 checks passed

This was referenced Sep 10, 2024

[ShopifyVM] avoid duplicated fields names double counting scale factor in new schema analyzer #349

Closed

Implement Dynamic Resource Limits Adjustment in Function Runner Using ScaleLimitsAnalyzer #352

Merged
