Skip to content

Commit

Permalink
Doc: Vary support
Browse files Browse the repository at this point in the history
  • Loading branch information
hamishforbes committed Oct 23, 2017
1 parent a4f3812 commit daa0221
Showing 1 changed file with 60 additions and 13 deletions.
73 changes: 60 additions & 13 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -87,6 +87,7 @@ An `upstream` is the only thing which must be manually configured, and points to

Cache body data is handled by the `storage` system, and as mentioned, by default shares the same Redis instance as the `metadata`. However, `storage` is abstracted via a [driver system](#storage_driver) making it possible to store cache body data in a separate Redis instance, or a group of horizontally scalable Redis instances via a [proxy](https://github.com/twitter/twemproxy), or to roll your own `storage` driver, for example targeting PostreSQL or even simply a filesystem. It's perhaps important to consider that by default all cache storage uses Redis, and as such is bound by system memory.

[Back to TOC](#table-of-contents)

### Cache keys

Expand All @@ -102,8 +103,7 @@ URI arguments are sorted alphabetically by default, so `http://example.com?a=1&b

HTTP response sizes can be wildly different, sometimes tiny and sometimes huge, and it's not always possible to know the total size up front.

To guarantee predictable memory usage regardless of response sizes Ledge operates a streaming design, meaning it only ever operates on a single `buffer` per request at a time. This is equally true when fetching upstream to when reading from cache or
serving to the client request.
To guarantee predictable memory usage regardless of response sizes Ledge operates a streaming design, meaning it only ever operates on a single `buffer` per request at a time. This is equally true when fetching upstream to when reading from cache or serving to the client request.

It's also true (mostly) when processing [ESI](#edge-size-includes) instructions, except for in the case where an instruction is found to span multiple buffers. In this case, we continue buffering until a complete instruction can be understood, up to a [configurable limit](#esi_max_size).

Expand All @@ -123,7 +123,6 @@ This is particularly useful to reduce upstream load if a spike of traffic occurs

Beyond standard RFC compliant cache behaviours, Ledge has many features designed to maximise cache HIT rates and to reduce latency for requests. See the sections on [Edge Side Includes](#edge-side-includes), [serving stale](#serving-stale) and [revalidating on purge](#purging) for more information.


[Back to TOC](#table-of-contents)


Expand All @@ -135,14 +134,14 @@ Assuming you have Redis running on `localhost:6379`, and your upstream is at `lo
http {
if_modified_since Off;
lua_check_client_abort On;

init_by_lua_block {
require("ledge").configure({
redis_connector_params = {
url = "redis://127.0.0.1:6379/0",
},
})

require("ledge").set_handler_defaults({
upstream_host = "127.0.0.1",
upstream_port = 8080,
Expand All @@ -168,6 +167,7 @@ http {

[Back to TOC](#table-of-contents)


## Config systems

There are four different layers to the configuration system. Firstly there is the main [Redis config](#ledgeconfigure) and [handler defaults](#ledgeset_handler_defaults) config, which are global and must be set during the Nginx `init` phase.
Expand Down Expand Up @@ -221,11 +221,11 @@ More commonly, we just want to alter behaviour for a given Nginx `location`.
location /foo_location {
content_by_lua_block {
local handler = require("ledge").create_handler()

handler:bind("before_serve", function(res)
res.header["X-Foo"] = "bar" -- only set X-Foo for this location
end)

handler:run()
}
}
Expand Down Expand Up @@ -267,6 +267,8 @@ The goal is to be 100% RFC compliant, but with some extensions to allow more agr

To manually invalidate a cache item (or purge), we support the non-standard `PURGE` method familiar to users of Squid. Send a HTTP request to the URI with the method set, and Ledge will attempt to invalidate the item, returning status `200` on success and `404` if the URI was not found in cache, along with a JSON body for more details.

A purge request will affect all representations associated with the cache key, for example compressed and uncompressed responses separated by the `Vary: Accept-Encoding` response header will all be purged.

`$> curl -X PURGE -H "Host: example.com" http://cache.example.com/page1 | jq .`

```json
Expand Down Expand Up @@ -315,7 +317,6 @@ limit_except GET POST PUT DELETE {

[Back to TOC](#table-of-contents)


### JSON API

A JSON based API is also available for purging cache multiple cache items at once.
Expand Down Expand Up @@ -346,7 +347,6 @@ Returns a results hash keyed by URI or a JSON error response

[Back to TOC](#table-of-contents)


### Wildcard purging

Wildcard (\*) patterns are also supported in `PURGE` URIs, which will always return a status of `200` and a JSON body detailing a background job. Wildcard purges involve scanning the entire keyspace, and so can take a little while. See [keyspace\_scan\_count](#keyspace_scan_count) for tuning help.
Expand Down Expand Up @@ -397,7 +397,6 @@ In other words, set the TTL to the highest comfortable frequency of requests at

All stale behaviours are constrained by normal cache control semantics. For example, if the origin is down, and the response could be served stale due to the upstream error, but the request contains `Cache-Control: no-cache` or even `Cache-Control: max-age=60` where the content is older than 60 seconds, they will be served the error, rather than the stale content.


[Back to TOC](#table-of-contents)


Expand Down Expand Up @@ -559,6 +558,7 @@ init_by_lua_block {

Ledge uses [lua-resty-redis-connector](https://github.com/pintsized/lua-resty-redis-connector) to handle all Redis connections. It simply passes anything given in `redis_connector_params` straight to [lua-resty-redis-connector](https://github.com/pintsized/lua-resty-redis-connector), so review the documentation there for options, including how to use [Redis Sentinel](https://redis.io/topics/sentinel).


#### qless_db

`default: 1`
Expand Down Expand Up @@ -667,7 +667,6 @@ syntax: `handler:run()`

Must be called during the `init_worker` phase, otherwise background tasks will not be run, including garbage collection which is very importatnt.


[Back to TOC](#table-of-contents)


Expand Down Expand Up @@ -726,6 +725,7 @@ This is a `string` value, which will be used to attempt to load a storage driver

[Back to TOC](#handler-configuration-options)


#### storage_driver_config

`default: {}`
Expand All @@ -734,6 +734,7 @@ Storage configuration can vary based on the driver. Currently we only have a Red

[Back to TOC](#handler-configuration-options)


##### Redis storage driver config

* `redis_connector_params` Redis params table, as per [lua-resty-redis-connector](https://github.com/pintsized/lua-resty-redis-connector)
Expand All @@ -744,6 +745,7 @@ If `supports_transactions` is set to `false`, cache bodies are not written atomi

[Back to TOC](#handler-configuration-options)


#### upstream_connect_timeout

default: `1000 (ms)`
Expand All @@ -752,6 +754,7 @@ Maximum time to wait for an upstream connection (in milliseconds). If it is exce

[Back to TOC](#handler-configuration-options)


#### upstream_send_timeout

default: `2000 (ms)`
Expand All @@ -760,6 +763,7 @@ Maximum time to wait sending data on a connected upstream socket (in millisecond

[Back to TOC](#handler-configuration-options)


#### upstream_read_timeout

default: `10000 (ms)`
Expand All @@ -768,18 +772,21 @@ Maximum time to wait on a connected upstream socket (in milliseconds). If it is

[Back to TOC](#handler-configuration-options)


#### upstream_keepalive_timeout

default: `75000`

[Back to TOC](#handler-configuration-options)


#### upstream_keepalive_poolsize

default: `64`

[Back to TOC](#handler-configuration-options)


#### upstream_host

default: `""`
Expand All @@ -792,6 +799,7 @@ resolver 8.8.8.8;

[Back to TOC](#handler-configuration-options)


#### upstream_port

default: `80`
Expand All @@ -800,6 +808,7 @@ Specifies the port of the upstream host.

[Back to TOC](#handler-configuration-options)


#### upstream_use_ssl

default: `false`
Expand All @@ -808,6 +817,7 @@ Toggles the use of SSL on the upstream connection. Other `upstream_ssl_*` option

[Back to TOC](#handler-configuration-options)


#### upstream_ssl_server_name

default: `""`
Expand All @@ -816,6 +826,7 @@ Specifies the SSL server name used for Server Name Indication (SNI). See [sslhan

[Back to TOC](#handler-configuration-options)


#### upstream_ssl_verify

default: `false`
Expand All @@ -824,6 +835,7 @@ Toggles SSL verification. See [sslhandshake](https://github.com/openresty/lua-ng

[Back to TOC](#handler-configuration-options)


#### cache_key_spec

`default: cache_key_spec = { "scheme", "host", "uri", "args" },`
Expand Down Expand Up @@ -862,6 +874,8 @@ require("ledge").create_handler({
}):run()
```

Consider leveraging vary, via the [before_vary_selection](#before_vary_selection) event, for separating cache entries rather than modifying the main `cache_key_spec` directly.

[Back to TOC](#handler-configuration-options)


Expand Down Expand Up @@ -1042,14 +1056,15 @@ If set to false, disables advertising the software name and version, e.g. `(ledg
* [before_save](#before_save)
* [before_serve](#before_serve)
* [before_save_revalidation_data](#before_save_revalidation_data)
* [before_vary_selection](#before_vary_selection)

#### after_cache_read

syntax: `bind("after_cache_read", function(res) -- end)`

params: `res`. The cached response table.

Fires directly after the response was successfully loaded from cache.
Fires directly after the response was successfully loaded from cache.

The `res` table given contains:

Expand Down Expand Up @@ -1174,8 +1189,39 @@ The `reval_params` are values derived from the current running configuration for
[Back to TOC](#events)


## Administration
#### before_vary_selection

syntax: `bind("before_vary_selection", function(vary_key) -- end)`

params: `vary_key` A table of selecting headers

Fires when we're about to generate the vary key, used to select the correct cache representation.

The `vary_key` table is a hash of header field names (lowercase) to values.
A field name which exists in the Vary response header but does not exist in the current request header will have a value of `ngx.null`.

```
Request Headers:
Accept-Encoding: gzip
X-Test: abc
X-test: def
Response Headers:
Vary: Accept-Encoding, X-Test
Vary: X-Foo
vary_key table:
{
["accept-encoding"] = "gzip",
["x-test"] = "abc,def",
["x-foo"] = ngx.null
}
```

[Back to TOC](#events)


## Administration

### X-Cache

Expand Down Expand Up @@ -1231,6 +1277,7 @@ You may also wish to tweak the [qless job history](https://github.com/pintsized/

[Back to TOC](#table-of-contents)


## Author

James Hurst <james@pintsized.co.uk>
Expand Down

0 comments on commit daa0221

Please sign in to comment.