How much value normalization will objecthash do? #16

pedrocr · 2017-07-05T08:15:04Z

I see from the code that unicode normalization is done. Is the idea also to do NaN normalization in floats? There are uses where this is not ideal and you don't want normalization at all. Will this be an option in the future? I'd actually argue the default case should be that otherwise it's easy to generate matching hashes for objects that will then behave differently when passed to certain functions.

tarcieri · 2017-07-05T18:30:06Z

All other objecthash implementations make Unicode normalization a configurable toggle. The present ObjectHasher API does not carry around the context to provide such toggles, although it could.

I will go ahead and leave this issue open to discuss whether such a toggle should be added.

That said, there are two places objecthash performs the sort of normalization you might see in a canonicalization scheme:

Unicode normalization
Sorting object keys by their objecthashes

pedrocr · 2017-07-05T20:12:06Z

And how about float, any plan for that?

tarcieri · 2017-07-05T21:35:39Z

@pedrocr personally I hate floating points, and the nascent objecthash-inspired scheme I have been working on does not support them at all.

That said, NaN (along with Infinity) is not part of the JSON data model, and IMO is best avoided. Attempting to serialize those (or hash them) should be an error.

pedrocr · 2017-07-05T23:10:13Z

Ok, between this and the performance penalty of all those text conversions I'll have to roll my own then. Supporting float is essential and NaN's are just part of the data. serde->bincode->Sha256 seems to be working well but I may take a stab at just implementing some form of #[derive(CryptoHash)].

tarcieri · 2017-07-05T23:44:54Z

Note the scheme I'm working on avoids the text conversion

pedrocr · 2017-07-05T23:54:16Z

If it doesn't do floats it's a no-go for me. Good image processing is all about f32. I'd have to do a bunch of manual conversions to/from fixed-point or something of the sort and that just makes for extremely ugly code.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How much value normalization will objecthash do? #16

How much value normalization will objecthash do? #16

pedrocr commented Jul 5, 2017

tarcieri commented Jul 5, 2017

pedrocr commented Jul 5, 2017

tarcieri commented Jul 5, 2017

pedrocr commented Jul 5, 2017

tarcieri commented Jul 5, 2017

pedrocr commented Jul 5, 2017

How much value normalization will objecthash do? #16

How much value normalization will objecthash do? #16

Comments

pedrocr commented Jul 5, 2017

tarcieri commented Jul 5, 2017

pedrocr commented Jul 5, 2017

tarcieri commented Jul 5, 2017

pedrocr commented Jul 5, 2017

tarcieri commented Jul 5, 2017

pedrocr commented Jul 5, 2017