Skip to content

Commit

Permalink
Merge pull request #228 from nats-io/request-many
Browse files Browse the repository at this point in the history
ADR-47 Request Many
  • Loading branch information
scottf authored Oct 10, 2024
2 parents 2c9cc9d + db17f7f commit 182511d
Show file tree
Hide file tree
Showing 3 changed files with 138 additions and 2 deletions.
2 changes: 1 addition & 1 deletion .readme.templ
Original file line number Diff line number Diff line change
Expand Up @@ -26,6 +26,6 @@ We want to move away from using these to document individual minor decisions, mo

## Template

Please see the [template](adr-template.md). The template body is a guideline. Feel free to add sections as you feel appropriate. Look at the other ADRs for examples. However the initial Table of metadata and header format is required to match.
Please see the [template](adr-template.md). The template body is a guideline. Feel free to add sections as you feel appropriate. Look at the other ADRs for examples. However, the initial Table of metadata and header format is required to match.

After editing / adding a ADR please run `go run main.go > README.md` to update the embedded index. This will also validate the header part of your ADR.
10 changes: 9 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -35,6 +35,7 @@ This repository captures Architecture, Design Specifications and Feature Guidanc
|[ADR-37](adr/ADR-37.md)|jetstream, client, spec|JetStream Simplification|
|[ADR-40](adr/ADR-40.md)|client, server, spec|NATS Connection|
|[ADR-43](adr/ADR-43.md)|jetstream, client, server|JetStream Per-Message TTL|
|[ADR-47](adr/ADR-47.md)|client, spec, orbit|Request Many|

## Jetstream

Expand Down Expand Up @@ -83,6 +84,12 @@ This repository captures Architecture, Design Specifications and Feature Guidanc
|[ADR-3](adr/ADR-3.md)|observability, server|NATS Service Latency Distributed Tracing Interoperability|
|[ADR-41](adr/ADR-41.md)|observability, server|NATS Message Path Tracing|

## Orbit

|Index|Tags|Description|
|-----|----|-----------|
|[ADR-47](adr/ADR-47.md)|client, spec, orbit|Request Many|

## Security

|Index|Tags|Description|
Expand Down Expand Up @@ -128,6 +135,7 @@ This repository captures Architecture, Design Specifications and Feature Guidanc
|[ADR-32](adr/ADR-32.md)|client, spec|Service API|
|[ADR-37](adr/ADR-37.md)|jetstream, client, spec|JetStream Simplification|
|[ADR-40](adr/ADR-40.md)|client, server, spec|NATS Connection|
|[ADR-47](adr/ADR-47.md)|client, spec, orbit|Request Many|

## Deprecated

Expand All @@ -147,6 +155,6 @@ We want to move away from using these to document individual minor decisions, mo

## Template

Please see the [template](adr-template.md). The template body is a guideline. Feel free to add sections as you feel appropriate. Look at the other ADRs for examples. However the initial Table of metadata and header format is required to match.
Please see the [template](adr-template.md). The template body is a guideline. Feel free to add sections as you feel appropriate. Look at the other ADRs for examples. However, the initial Table of metadata and header format is required to match.

After editing / adding a ADR please run `go run main.go > README.md` to update the embedded index. This will also validate the header part of your ADR.
128 changes: 128 additions & 0 deletions adr/ADR-47.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,128 @@
# Request Many

| Metadata | Value |
|----------|----------------------------|
| Date | 2024-09-26 |
| Author | @aricart, @scottf, @Jarema |
| Status | Partially Implemented |
| Tags | client,spec,orbit |

| Revision | Date | Author | Info |
|----------|------------|-----------|-------------------------|
| 1 | 2024-09-26 | @scottf | Document Initial Design |

## Problem Statement
Have the client support receiving multiple replies from a single request, instead of limiting the client to the first reply,
and support patterns like scatter-gather and sentinel.

## Basic Design

The user can provide some configuration controlling how and how long to wait for messages.
The client handles the requests and subscriptions and provides the messages to the user.

* The client doesn't assume success or failure - only that it might receive messages.
* The various configuration options are there to manage and short circuit the length of the wait,
and provide the user the ability to directly stop the processing.
* Request Many is not a recoverable operation, but it could be wrapped in a retry pattern.
* The client should communicate status whenever possible, for instance if it gets a 503 No Responders

## Config

### Total timeout

The maximum amount of time to wait for responses. When the time is expired, the process is complete.
The wait for the first message is always made with the total timeout since at least one message must come in within the total time.

* Always used
* Defaults to the connection or system request timeout.

### Stall timer

The amount time to wait for messages other than the first (subsequent waits).
Considered "stalled" if this timeout is reached, indicating the request is complete.

* Optional
* Less than 1 or greater than or equal to the total timeout behaves the same as if not supplied.
* Defaults to not supplied.
* When supplied, subsequent waits are the lesser of the stall time or the calculated remaining time.
This allows the total timeout to be honored and for the stall to not extend the loop past the total timeout.

### Max messages

The maximum number of messages to wait for.
* Optional
* If this number of messages is received, the request is complete.
* If this number is supplied and total timeout is not set, total timeout defaults to the connection or system timeout.

### Sentinel

While processing the messages, the user should have the ability to indicate that it no longer wants to receive any more messages.
* Optional
* Language specific implementation
* If sentinel is supplied and total timeout is not set, total timeout defaults to the connection or system timeout.

## Notes

### Message Handling

Each client must determine how to give messages to the user.
* They could all be collected and given at once.
* They could be put in an iterator, queue, channel, etc.
* A callback could be made.

### End of Data

The developer should notify the user when the request has stopped processing and the receiving mechanism is not fixed like a list
or iterator that termination is obvious. A queue or a callback for instance, should get a termination message.
Implementation is language specific based on control flow.

### Status Messages / Server Errors

If a status (like a 503) or an error comes in place of a user message, this is terminal.
This is probably useful information for the user and can be conveyed as part of the end of data.

#### Callback timing

If callbacks are made in a blocking fashion,
the client must account for the time it takes for the user to process the message
and not consider that time against the timeouts.

### Sentinel

If the client supports a sentinel with a callback/predicate that accepts the message and returns a boolean,
a return of true would mean continue to process and false would mean stop processing.

If possible, the client should support the "standard sentinel", which is a message with a null/nil or empty payload.

### Cancelling

A client can offer other ways for the user to be able to cancel the request. This is another pathway besides sentinel
allowing that the dev can cancel the entire request-many arbitrarily.

## Disconnection

It's possible that there is a connectivity issue that prevents messages from reaching the requester,
It might be difficult to differentiate that timeout from a total or stall timeout.
If possible to know the difference, this could be conveyed as part of the end of data.

## Strategies
It's acceptable to make "strategies" via enum / api / helpers / builders / whatever.
Strategies are just pre-canned configurations, for example:

**Timeout or Wait** - this is the default strategy where only the total timeout is used.

**Stall** - the stall defaults to the lessor of 1/10th of the total wait time (if provided) or the default connection timeout.

**Max Responses** - accepts a max response number and uses the default timeout.

### Subscription Management
Since the client is in charge of the subscription, it should always unsubscribe upon completion of the request handling instead of leaving it up to the server to time it out.

#### Max Responses Optimization
On requests that specify max responses, and when not using mux inboxes, the client can unsubscribe with a count immediately after subscribing.
Theoretically this unsub could be processed after a reply has come in and out of the server, so you still must check the count manually.

#### Mux Inbox
If possible, the implementation can offer the use of the mux inbox.
Consider that the implementation of managing the subscription will differ from a non-mux inbox,
for instance not closing the subscription and not implementing a max response optimization.

0 comments on commit 182511d

Please sign in to comment.