feat: allow for easily matching rules using path prefixes #1073

davidspek · 2023-03-01T13:50:16Z

Currently rules are matched to an incoming request based on regex or glob pattern matching. Since only a single rule is allowed to match a request, the regex or glob patterns must be very precise and become increasingly complex as different paths need to be matched. While the used regex library supports negative lookahead, creating rules where requests are matched based on path prefixes is still difficult since you'd need to provide negative matches for all other rule regexes with the same base path. As glob doesn't support negative lookahead doing path prefix based routing is not possible. Even when using regex patterns with negative lookahead, the issue that arises is that the end user ends up needing to manage system with the state of all rules to then be able to create rules with the appropriate regexes including negative lookahead patterns for all other rules. Since the most common type of routing people likely would want to implement is prefixed based routing, the fact that doing so with oathkeeper is difficult and error prone is in my views its biggest drawback.

This PR introduces a system that allows for matching rules based on the longest matching path prefix between a request URL and the paths of all the rules using a Trie. With this trie, oathkeeper does not need to range over each rule when performing the matching which I expect will decrease latency.

Note, this implementation is likely incomplete and requires some further work which I would like to implement after some further discussion here. Some things that come to mind are:

a way to be able to define multiple schemes in a single rule (currently only a single scheme per rule is possible since the URL needs to be parsable by url.Parse())
~~allow for regex or glob matching groups since these can be used downstream in authenticators, authorisers and mutators.~~

Update:
I've made the prefix matching a separate config from the matching strategy. This way, if multiple rules are found based on the path prefix those rules are then further filtered using the matching pattern. Matching patterns can only be used in the path of the URL and are not added into the Trie (since they would break the Trie). Thus, 2 rules with the same path prefix but different matching patterns will be added to the same node in the Trie. Then if a request comes in that matches those rules it will be matched using the pattern. Note that the intension for pattern matching combined with prefix matching is for use of the patterns in downstream handlers. It is not intended to be used for determining which rule an incoming request should match against, but it will fall back to doing so. In any case, it is much easier to create a negative lookahead for a single path section rather than all paths known to oathkeeper.

Related issue(s)

#1073
#441

Checklist

I have read the contributing guidelines.
I have referenced an issue containing the design document if my change
introduces a new feature.
I am following the
contributing code guidelines.
I have read the security policy.
I confirm that this pull request does not address a security
vulnerability. If this pull request addresses a security vulnerability, I
confirm that I got the approval (please contact
security@ory.sh) from the maintainers to push
the changes.
I have added tests that prove my fix is effective or that my feature
works.
I have added or changed the documentation.

Further Comments

To see the specific tasks where the Asana app for GitHub is being used, see below:
- https://app.asana.com/0/0/1204651423958875

codecov · 2023-03-01T18:05:00Z

Codecov Report

Merging #1073 (eb4a176) into master (3a716f2) will decrease coverage by 0.44%.
The diff coverage is 60.86%.

❗ Current head eb4a176 differs from pull request most recent head 5542209. Consider uploading reports for the commit 5542209 to get more accurate results

@@            Coverage Diff             @@
##           master    #1073      +/-   ##
==========================================
- Coverage   78.17%   77.73%   -0.44%     
==========================================
  Files          80       81       +1     
  Lines        3853     3979     +126     
==========================================
+ Hits         3012     3093      +81     
- Misses        566      603      +37     
- Partials      275      283       +8

Impacted Files	Coverage Δ
driver/configuration/provider_koanf.go	`88.13% <0.00%> (-0.76%)`	⬇️
rule/repository_memory.go	`75.92% <45.45%> (-9.61%)`	⬇️
rule/trie.go	`69.56% <69.56%> (ø)`

aeneasr · 2023-03-22T10:48:23Z

Hey David, thank you for this PR. We are also struggling ourselves with this problem (increasingly complex rules that are hard to read) but do not have a clear idea how to address these issues properly. Do you have some examples for the rule matching you're suggestion? I'm not quite sure if I understand the approach correctly.

In the future may I suggest to first create a design document to exchange ideas and follow up with an implementation afterwards? :) I know that we are not as responsive as in other repositories but design documents are a good way to work on topics such as this one.

Thanks!

davidspek · 2023-03-23T22:19:01Z

Hi Aeneasr. I can create an issue with a design document for what I’ve done in this PR tomorrow. However, I find that it usually helps to have a (brief) synchronous discussion to get on the same page before creating long write-ups. I think this would be particularly helpful since this codebase is mostly foreign to me. Do you think you’d have time to have a short chat some day soon?

Signed-off-by: David van der Spek <vanderspek.david@gmail.com>

davidspek requested a review from aeneasr as a code owner March 1, 2023 13:50

davidspek force-pushed the prefix-trie-engine branch from bfbf92f to 97192c9 Compare March 1, 2023 14:04

davidspek force-pushed the prefix-trie-engine branch 2 times, most recently from cc114e8 to bab7632 Compare March 20, 2023 12:39

davidspek mentioned this pull request Apr 11, 2023

Allow for easily matching rules using path prefixes #1089

Closed

6 tasks

davidspek force-pushed the prefix-trie-engine branch from bab7632 to fbcbeeb Compare April 11, 2023 13:13

davidspek mentioned this pull request Jun 21, 2023

Allow rule matching using path prefixes dadrus/heimdall#652

Closed

3 tasks

davidspek added 8 commits July 10, 2023 13:47

feat: find rules with prefix matching using trie

3a267c7

Signed-off-by: David van der Spek <vanderspek.david@gmail.com>

make prefix matching configurable

82f93e3

Signed-off-by: David van der Spek <vanderspek.david@gmail.com>

init add some unit tests for prefix matching

47094fd

Signed-off-by: David van der Spek <vanderspek.david@gmail.com>

fix unit test

df3363f

Signed-off-by: David van der Spek <vanderspek.david@gmail.com>

add protocol to trie

cb81e5e

Signed-off-by: David van der Spek <vanderspek.david@gmail.com>

fix tests

12d52e5

Signed-off-by: David van der Spek <vanderspek.david@gmail.com>

add separate config + fallback to pattern matching

00eb5ea

Signed-off-by: David van der Spek <vanderspek.david@gmail.com>

add prefix_matching_enabled to config schema

5542209

Signed-off-by: David van der Spek <vanderspek.david@gmail.com>

davidspek force-pushed the prefix-trie-engine branch from fbcbeeb to 5542209 Compare July 10, 2023 11:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: allow for easily matching rules using path prefixes #1073

feat: allow for easily matching rules using path prefixes #1073

davidspek commented Mar 1, 2023 •

edited

Loading

codecov bot commented Mar 1, 2023 •

edited

Loading

aeneasr commented Mar 22, 2023

davidspek commented Mar 23, 2023

feat: allow for easily matching rules using path prefixes #1073

Are you sure you want to change the base?

feat: allow for easily matching rules using path prefixes #1073

Conversation

davidspek commented Mar 1, 2023 • edited Loading

Related issue(s)

Checklist

Further Comments

codecov bot commented Mar 1, 2023 • edited Loading

Codecov Report

aeneasr commented Mar 22, 2023

davidspek commented Mar 23, 2023

davidspek commented Mar 1, 2023 •

edited

Loading

codecov bot commented Mar 1, 2023 •

edited

Loading