Merge pull request #228 from nats-io/request-many

ADR-47 Request Many
nats-io · Oct 10, 2024 · 182511d
2 parents 2c9cc9d + db17f7f
commit 182511d

Showing 3 changed files with 138 additions and 2 deletions.
diff --git a/.readme.templ b/.readme.templ
@@ -26,6 +26,6 @@ We want to move away from using these to document individual minor decisions, mo
 
 ## Template
 
-Please see the [template](adr-template.md). The template body is a guideline. Feel free to add sections as you feel appropriate. Look at the other ADRs for examples. However the initial Table of metadata and header format is required to match.
+Please see the [template](adr-template.md). The template body is a guideline. Feel free to add sections as you feel appropriate. Look at the other ADRs for examples. However, the initial Table of metadata and header format is required to match.
 
 After editing / adding a ADR please run `go run main.go > README.md` to update the embedded index. This will also validate the header part of your ADR.
diff --git a/README.md b/README.md
@@ -35,6 +35,7 @@ This repository captures Architecture, Design Specifications and Feature Guidanc
 |[ADR-37](adr/ADR-37.md)|jetstream, client, spec|JetStream Simplification|
 |[ADR-40](adr/ADR-40.md)|client, server, spec|NATS Connection|
 |[ADR-43](adr/ADR-43.md)|jetstream, client, server|JetStream Per-Message TTL|
+|[ADR-47](adr/ADR-47.md)|client, spec, orbit|Request Many|
 
 ## Jetstream
 
@@ -83,6 +84,12 @@ This repository captures Architecture, Design Specifications and Feature Guidanc
 |[ADR-3](adr/ADR-3.md)|observability, server|NATS Service Latency Distributed Tracing Interoperability|
 |[ADR-41](adr/ADR-41.md)|observability, server|NATS Message Path Tracing|
 
+## Orbit
+
+|Index|Tags|Description|
+|-----|----|-----------|
+|[ADR-47](adr/ADR-47.md)|client, spec, orbit|Request Many|
+
 ## Security
 
 |Index|Tags|Description|
@@ -128,6 +135,7 @@ This repository captures Architecture, Design Specifications and Feature Guidanc
 |[ADR-32](adr/ADR-32.md)|client, spec|Service API|
 |[ADR-37](adr/ADR-37.md)|jetstream, client, spec|JetStream Simplification|
 |[ADR-40](adr/ADR-40.md)|client, server, spec|NATS Connection|
+|[ADR-47](adr/ADR-47.md)|client, spec, orbit|Request Many|
 
 ## Deprecated
 
@@ -147,6 +155,6 @@ We want to move away from using these to document individual minor decisions, mo
 
 ## Template
 
-Please see the [template](adr-template.md). The template body is a guideline. Feel free to add sections as you feel appropriate. Look at the other ADRs for examples. However the initial Table of metadata and header format is required to match.
+Please see the [template](adr-template.md). The template body is a guideline. Feel free to add sections as you feel appropriate. Look at the other ADRs for examples. However, the initial Table of metadata and header format is required to match.
 
 After editing / adding a ADR please run `go run main.go > README.md` to update the embedded index. This will also validate the header part of your ADR.
diff --git a/adr/ADR-47.md b/adr/ADR-47.md
@@ -0,0 +1,128 @@
+# Request Many
+
+| Metadata | Value                      |
+|----------|----------------------------|
+| Date     | 2024-09-26                 |
+| Author   | @aricart, @scottf, @Jarema |
+| Status   | Partially Implemented      |
+| Tags     | client,spec,orbit          |
+
+| Revision | Date       | Author    | Info                    |
+|----------|------------|-----------|-------------------------|
+| 1        | 2024-09-26 | @scottf   | Document Initial Design |
+
+## Problem Statement
+Have the client support receiving multiple replies from a single request, instead of limiting the client to the first reply,
+and support patterns like scatter-gather and sentinel.
+
+## Basic Design
+
+The user can provide some configuration controlling how and how long to wait for messages.
+The client handles the requests and subscriptions and provides the messages to the user.
+
+* The client doesn't assume success or failure - only that it might receive messages.
+* The various configuration options are there to manage and short-circuit the length of the wait,
+and to give the user the ability to stop the processing directly.
+* Request Many is not a recoverable operation, but it could be wrapped in a retry pattern.
+* The client should communicate status whenever possible, for instance if it gets a 503 No Responders.
+
+## Config
+
+### Total timeout
+
+The maximum amount of time to wait for responses. When this time expires, the request is complete.
+The wait for the first message always uses the total timeout, since at least one message must arrive within the total time.
+
+* Always used
+* Defaults to the connection or system request timeout.
+
+### Stall timer
+
+The amount of time to wait for each message other than the first (subsequent waits).
+If this timeout is reached, the request is considered "stalled" and is complete.
+
+* Optional
+* A value less than 1, or greater than or equal to the total timeout, behaves the same as if not supplied.
+* Defaults to not supplied.
+* When supplied, subsequent waits are the lesser of the stall time or the calculated remaining time. 
+This allows the total timeout to be honored and for the stall to not extend the loop past the total timeout.
+
+### Max messages
+
+The maximum number of messages to wait for. 
+* Optional
+* If this number of messages is received, the request is complete.
+* If this number is supplied and total timeout is not set, total timeout defaults to the connection or system timeout.
+
+### Sentinel
+
+While processing the messages, the user should be able to indicate that they no longer want to receive messages.
+* Optional
+* Language specific implementation
+* If sentinel is supplied and total timeout is not set, total timeout defaults to the connection or system timeout.
+
+## Notes
+
+### Message Handling
+
+Each client must determine how to give messages to the user.
+* They could all be collected and given at once.
+* They could be put in an iterator, queue, channel, etc.
+* A callback could be made.
+
+### End of Data
+
+The developer should notify the user when the request has stopped processing, unless the receiving mechanism
+is one where termination is obvious, such as a list or an iterator. A queue or a callback, for instance, should get a termination message.
+Implementation is language specific, based on control flow.
+
+### Status Messages / Server Errors
+
+If a status (like a 503) or an error comes in place of a user message, this is terminal.
+This is probably useful information for the user and can be conveyed as part of the end of data.
+
+#### Callback timing
+
+If callbacks are made in a blocking fashion,
+the client must account for the time it takes the user to process each message
+and not count that time against the timeouts.
+
+### Sentinel
+
+If the client supports a sentinel with a callback/predicate that accepts the message and returns a boolean, 
+a return of true would mean continue to process and false would mean stop processing.
+
+If possible, the client should support the "standard sentinel", which is a message with a null/nil or empty payload.
+
+### Cancelling
+
+A client can offer other ways for the user to cancel the request. This is a pathway in addition to the sentinel,
+letting the user cancel the entire request-many at any time.
+
+## Disconnection
+
+It's possible that a connectivity issue prevents messages from reaching the requester.
+It might be difficult to differentiate the resulting timeout from a total or stall timeout.
+If it is possible to know the difference, this could be conveyed as part of the end of data.
+
+## Strategies
+It's acceptable to make "strategies" via enum / api / helpers / builders / whatever.
+Strategies are just pre-canned configurations, for example:
+
+**Timeout or Wait** - this is the default strategy where only the total timeout is used.
+
+**Stall** - the stall defaults to the lesser of 1/10th of the total wait time (if provided) or the default connection timeout.
+
+**Max Responses** - accepts a max response number and uses the default timeout.
+
+### Subscription Management
+Since the client is in charge of the subscription, it should always unsubscribe upon completion of the request handling instead of leaving it up to the server to time it out.
+
+#### Max Responses Optimization
+On requests that specify max responses, and when not using mux inboxes, the client can unsubscribe with a count immediately after subscribing.
+Theoretically this unsub could be processed after a reply has come in and out of the server, so you still must check the count manually.
+
+#### Mux Inbox
+If possible, the implementation can offer the use of the mux inbox. 
+Consider that the implementation of managing the subscription will differ from a non-mux inbox, 
+for instance not closing the subscription and not implementing a max response optimization.
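
The completion rules in the ADR's Config section (total timeout, stall timer, max messages, sentinel) can be sketched as a single receive loop. This is a minimal illustration, not any NATS client's API: `RequestManyOptions`, `next_wait`, `collect` and `standard_sentinel` are hypothetical names, a `queue.Queue` stands in for the reply subscription, timeouts are assumed to be in seconds, and dropping the sentinel message instead of delivering it is an assumption the ADR does not settle.

```python
import queue
import time
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class RequestManyOptions:
    """Hypothetical options object mirroring the ADR's Config section."""
    total_timeout: float                                # seconds; always used
    stall: Optional[float] = None                       # seconds; optional
    max_messages: Optional[int] = None                  # optional
    sentinel: Optional[Callable[[bytes], bool]] = None  # True = continue, False = stop


def standard_sentinel(msg: bytes) -> bool:
    """The "standard sentinel": an empty payload means stop."""
    return len(msg) > 0


def next_wait(opts: RequestManyOptions, received: int, remaining: float) -> float:
    """Return how long to wait for the next message, in seconds."""
    stall = opts.stall
    # The first wait always uses the total timeout, and an out-of-range
    # stall value behaves as if it was not supplied.
    if received == 0 or stall is None or stall <= 0 or stall >= opts.total_timeout:
        return remaining
    # Subsequent waits: the lesser of the stall time and the remaining time.
    return min(stall, remaining)


def collect(replies: "queue.Queue[bytes]", opts: RequestManyOptions):
    """Drain replies until a completion rule fires; return (messages, reason)."""
    messages = []
    deadline = time.monotonic() + opts.total_timeout
    while True:
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            return messages, "timeout"
        wait = next_wait(opts, len(messages), remaining)
        try:
            msg = replies.get(timeout=wait)
        except queue.Empty:
            # A wait shorter than the remaining time can only be a stall wait.
            return messages, ("stall" if wait < remaining else "timeout")
        if opts.sentinel is not None and not opts.sentinel(msg):
            return messages, "sentinel"  # assumption: sentinel message is dropped
        messages.append(msg)
        if opts.max_messages and len(messages) >= opts.max_messages:
            return messages, "max_messages"


if __name__ == "__main__":
    q = queue.Queue()
    for payload in (b"a", b"b", b""):
        q.put(payload)
    opts = RequestManyOptions(total_timeout=1.0, stall=0.1, sentinel=standard_sentinel)
    print(collect(q, opts))  # ([b'a', b'b'], 'sentinel')
```

Returning a reason string alongside the messages is one possible way to convey the "end of data" status described in the Notes. A real client would also unsubscribe on completion and handle status messages such as a 503, which this sketch leaves out.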