Validating distances against reference implementations #245

sdmccabe · 2019-08-23T16:33:27Z

sdmccabe · 2019-08-23T17:39:54Z

There are differences in output between our distance and the reference implementation of Portrait Divergence, but the differences are consistently small (the largest I've seen is 0.005, and it's usually more like 0.001). I'll keep investigating but I'd guess it's nothing.

sdmccabe · 2019-08-23T17:52:07Z

We should bump the PyPI version after finishing this.

sdmccabe · 2019-08-23T20:13:17Z

@leotrs I've checked off NBD because I assume the implementations are the same.

sdmccabe · 2019-08-25T17:57:47Z

HIM is producing different outputs from the R NetworkDistance implementation for RGGs (N=200, p=0.26, using the edgelists from the graphwend repo); will need to investigate further.

leotrs · 2019-08-26T14:31:32Z

@leotrs I've checked off NBD because I assume the implementations are the same.

At this point I wouldn't be surprised if netrd's implementation is more updated than mine. However, you can forget about NBD as I am the maintainer of the other one. If the outputs from the two different repos are different, then probably netrd's are correct...

leotrs · 2019-08-26T14:33:33Z

For NetSimile, I found this and this. Haven't compared them yet tho.

sdmccabe · 2019-08-26T14:55:00Z

NetSimile is a frustrating one since there isn't a reference implementation in the sense of author's code, so we're assuming the other independent implementations are correct. When I was debugging some NetSimile issues back in the spring I remember comparing the outputs to those from the netcomp library; I don't know if anything has changed since but I believe they were producing similar or identical outputs.

leotrs · 2019-08-26T15:00:27Z

We could use it as a touchstone only then. As long as we're in their ballpark, we're good.

Harrison pointed out in a comment on our paper that our Hamming implementation has an implicit $N^2$ instead of $N(N-1)$ normalization, so it's wrong for graphs without selfloops. This corrects that, similar to #242. A couple of notes: 1. I think this could be a little cleaner; the fact that `np.triu_indices()` et al return 2-tuples cramped my style a bit. 2. The fact that this and #242 exist raise concern that this normalization issue may be present elsewhere. Perhaps we should open a checklist issue, like we have for #245? 3. I have not applied the same correction to `HammingIpsenMikhailov`, on the grounds that: (i) it's sufficiently different from regular `Hamming` to consider separately, and (ii) it probably deserves a more thorough cleanup.

leotrs · 2019-10-09T17:49:10Z

Frobenius and Jaccard depend on row ordering, yes?

Unrelatedly, they both seem to be simple enough that we can just check them off?

sdmccabe · 2019-10-09T17:52:18Z

They should depend on row ordering, @jkbren would be able to confirm from his experiments.

They're probably simple enough to check off, but simplicity can be deceiving; see the issues we had with Jaccard before in #180.

sdmccabe mentioned this issue Sep 22, 2019

Use proper normalization in Hamming #250

Merged

sdmccabe mentioned this issue Oct 9, 2019

Validating reconstructors against reference implementations #254

Open

17 tasks

sdmccabe added this to the 1.0 milestone Oct 29, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Validating distances against reference implementations #245

Validating distances against reference implementations #245

sdmccabe commented Aug 23, 2019 •

edited

Loading

sdmccabe commented Aug 23, 2019

sdmccabe commented Aug 23, 2019

sdmccabe commented Aug 23, 2019

sdmccabe commented Aug 25, 2019

leotrs commented Aug 26, 2019

leotrs commented Aug 26, 2019

sdmccabe commented Aug 26, 2019

leotrs commented Aug 26, 2019

leotrs commented Oct 9, 2019

sdmccabe commented Oct 9, 2019

Validating distances against reference implementations #245

Validating distances against reference implementations #245

Comments

sdmccabe commented Aug 23, 2019 • edited Loading

sdmccabe commented Aug 23, 2019

sdmccabe commented Aug 23, 2019

sdmccabe commented Aug 23, 2019

sdmccabe commented Aug 25, 2019

leotrs commented Aug 26, 2019

leotrs commented Aug 26, 2019

sdmccabe commented Aug 26, 2019

leotrs commented Aug 26, 2019

leotrs commented Oct 9, 2019

sdmccabe commented Oct 9, 2019

sdmccabe commented Aug 23, 2019 •

edited

Loading