rec: dedup records #14617

omoerbeek · 2024-09-03T11:31:03Z

Short description

This deduplicaties records in two places:

Records received from authoritative servers
Records sent out to clients (only on non-packet cache hits).

When testing this, I encountered a few cases where auths sent duplicate records.

dig @ns4.dnsv5.com getui.com txt 
dig @ord.geons.ftw.jiveip.net api.geoip.jive.com txt
dig @dnsd10.chatango.com chatango.com
dig @ns.sotoon53.com yektanet.com txt
dig @ns.viettelidc.com.vn ns9.viettelidc.com.vn
dig @ns1.cdnetdns.net flypgs.com txt

Nothing actually breaks if we don't dedup afaik. So it is questionable of we want this part.

But on the client side, this fixes #14120 and maybe other cases I do not know.

The big question is if we want this. Dedupping is fundamentally not cheap, although I tried to optimize the dedup code.

Originally I played with the idea to change the data structure building the reply vector to avoid duplicates, but that requires changes in many places. Still the idea is not completely off the table.

On the receiving side the dedup internals could also be weaved into the sanitize code, at the cost of increasing complexity. That would avoid the separate dedup() call.

This will remain a draft until I have some some speed measurements and pondered the alternative approaches some more. This PR is mainly to share thoughts.

The test changes are needed as a few of them use duplicate records.

Checklist

I have:

read the CONTRIBUTING.md document
compiled this code
tested this code
included documentation (including possible behaviour changes)
documented the code
added or modified regression test(s)
added or modified unit test(s)

coveralls · 2024-09-03T12:07:26Z

Pull Request Test Coverage Report for Build 11912381008

Details

75 of 75 (100.0%) changed or added relevant lines in 8 files are covered.
889 unchanged lines in 18 files lost coverage.
Overall coverage increased (+0.009%) to 64.691%

Files with Coverage Reduction	New Missed Lines	%
pdns/recursordist/syncres.cc	1	79.48%
pdns/pollmplexer.cc	1	83.66%
pdns/auth-secondarycommunicator.cc	1	63.66%
pdns/query-local-address.cc	2	90.43%
pdns/recursordist/test-syncres_cc2.cc	3	88.91%
pdns/remote_logger.cc	3	54.71%
pdns/zoneparser-tng.cc	4	83.07%
pdns/recursordist/rec-tcp.cc	5	64.89%
pdns/recursordist/rec-tcpout.cc	6	50.79%
pdns/dnsbackend.hh	7	51.69%

Totals
Change from base Build 11891742819:	0.009%
Covered Lines:	125820
Relevant Lines:	163652

💛 - Coveralls

rgacogne

The logic looks good to me. I guess we could try a few more heuristics before actually serializing the content, like checking if we have several records with the same (qtype, qname), but it might not be worth it. Having real-world numbers would indeed be useful.

rgacogne · 2024-10-07T09:51:09Z

pdns/dnsparser.hh

+
+    string record;
+    packetWriter.getContentWireFormat(record); // needs to be called before commit()
+    return record;


If we go forward, we might want to consider a small refactoring to avoid duplicating code between serialize and wireFormatContent.

rgacogne · 2024-10-07T09:53:00Z

pdns/shuffle.hh

@@ -29,4 +29,5 @@ namespace pdns
 {
 void shuffle(std::vector<DNSZoneRecord>& rrs);
 void orderAndShuffle(std::vector<DNSRecord>& rrs, bool includingAdditionals);
+unsigned int dedup(std::vector<DNSRecord>& rrs);


Perhaps dedupRecords instead of dedup?

omoerbeek · 2024-10-25T06:50:29Z

Rebased to fix conflict.

omoerbeek · 2024-10-25T07:12:41Z

Some observations:
quad1 does dedup in some cases (only when the dups are adjacent?)
quad8 does not dedup
OpenDNS does dedup

Software tested in default config
unbound does not do dedup
bind does dedup
knot-resolver does dedup

omoerbeek · 2024-10-25T08:50:34Z

Speedtest results:

'2 DedupRecords (generate only)' 0.10 seconds: 1230169.1 runs/s, 0.81 us/run
'2 DedupRecords' 0.10 seconds: 504897.9 runs/s, 1.98 us/run
'2 DedupRecords (with dup)' 0.10 seconds: 628577.4 runs/s, 1.59 us/run
'256 DedupRecords (generate only)' 0.10 seconds: 9248.3 runs/s, 108.13 us/run
'256 DedupRecords' 0.10 seconds: 3867.5 runs/s, 258.57 us/run
'256 DedupRecords (with dup)' 0.10 seconds: 3823.4 runs/s, 261.55 us/run
'4096 DedupRecords (generate only)' 0.11 seconds: 561.7 runs/s, 1780.35 us/run
'4096 DedupRecords' 0.10 seconds: 241.7 runs/s, 4137.49 us/run
'4096 DedupRecords (with dup)' 0.10 seconds: 239.3 runs/s, 4178.26 us/run

The measured slowdown is about 2.5 and is uniform over the various test case sizes.

So the dedupping takes time as expected, but for the already pretty extreme case of 256, records, its absolute value is not a lot compared to the expected network latency. For the 4096 case we spent time that comes closer to the expected network latency.

paddg · 2024-10-25T08:56:19Z

Can we rule out that deduping is an attack vector?

omoerbeek · 2024-10-25T09:05:21Z

Can we rule out that deduping is an attack vector?

Not completely, in the extreme case spending even a few CPU ms on a single auth result is quite a lot.

paddg · 2024-10-25T10:00:48Z

Not completely, in the extreme case spending even a few CPU ms on a single auth result is quite a lot.

Are you considering an on/off switch for it?

omoerbeek · 2024-10-25T10:02:15Z

Not completely, in the extreme case spending even a few CPU ms on a single auth result is quite a lot.

Are you considering an on/off switch for it?

Yes, that would be one of the options. Another alternative would be to not do the dedupping on large answers as we already refuse to cache them anyway.

omoerbeek · 2024-11-04T08:23:40Z

The logic looks good to me. I guess we could try a few more heuristics before actually serializing the content, like checking if we have several records with the same (qtype, qname), but it might not be worth it. Having real-world numbers would indeed be useful.

I played a bit with a pre-scan on qtype and name only, but saw no speedup

…ows for an unordered_set as well.

omoerbeek added the rec label Sep 3, 2024

rgacogne reviewed Oct 7, 2024

View reviewed changes

omoerbeek self-assigned this Oct 25, 2024

omoerbeek force-pushed the rec-dedup-recs branch from 863fd47 to 8089aeb Compare October 25, 2024 06:50

omoerbeek force-pushed the rec-dedup-recs branch from 1de6a90 to 5a98b0f Compare October 25, 2024 08:34

omoerbeek force-pushed the rec-dedup-recs branch from 080d63b to 97795db Compare October 25, 2024 09:36

omoerbeek added 8 commits November 19, 2024 12:00

rec: dedup results from auths and results constructed ourselves

298efa5

No need to dedup the dns64 case seperately anymore

594719d

Adapt test to not use repeating records

068fe5a

Rework dedup code and add a test for pdsn::dedup

51b72ba

Faster dedup, not using zoneRepresentation but wire format, which all…

af610df

…ows for an unordered_set as well.

Add speedtest for shuffle, plus a speedup in shuffle itself

709cf06

rename pdns::shuffle to pdns::shufleRecords, as suggested by @rgacogne

97e3d01

Refactor serialize/wireFormatContent as suggested by @rgacogne

fb4e019

omoerbeek force-pushed the rec-dedup-recs branch from 97795db to 274b82b Compare November 19, 2024 11:33

Dedup only in specific places

e26c334

omoerbeek force-pushed the rec-dedup-recs branch from 274b82b to e26c334 Compare November 19, 2024 11:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rec: dedup records #14617

rec: dedup records #14617

omoerbeek commented Sep 3, 2024

coveralls commented Sep 3, 2024 •

edited

Loading

rgacogne left a comment

rgacogne Oct 7, 2024

rgacogne Oct 7, 2024

omoerbeek commented Oct 25, 2024

omoerbeek commented Oct 25, 2024

omoerbeek commented Oct 25, 2024

paddg commented Oct 25, 2024

omoerbeek commented Oct 25, 2024

paddg commented Oct 25, 2024

omoerbeek commented Oct 25, 2024

omoerbeek commented Nov 4, 2024

rec: dedup records #14617

Are you sure you want to change the base?

rec: dedup records #14617

Conversation

omoerbeek commented Sep 3, 2024

Short description

Checklist

coveralls commented Sep 3, 2024 • edited Loading

Pull Request Test Coverage Report for Build 11912381008

Details

💛 - Coveralls

rgacogne left a comment

Choose a reason for hiding this comment

rgacogne Oct 7, 2024

Choose a reason for hiding this comment

rgacogne Oct 7, 2024

Choose a reason for hiding this comment

omoerbeek commented Oct 25, 2024

omoerbeek commented Oct 25, 2024

omoerbeek commented Oct 25, 2024

paddg commented Oct 25, 2024

omoerbeek commented Oct 25, 2024

paddg commented Oct 25, 2024

omoerbeek commented Oct 25, 2024

omoerbeek commented Nov 4, 2024

coveralls commented Sep 3, 2024 •

edited

Loading