Resetting local connectivity and metrics #841
-
Somewhat related to #838 (and maybe #58 although the code has changed substantially since then), there are rather esoteric questions which are not likely to have huge practical significance currently, but may be of interest in terms of extending UMAP to mixed data sets.
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Let's see if I can provide at least some answers. The first question is at least relatively straightforward: mostly resetting local connectivity should be the right thing to do, but the For the second question: it's a rough approximation to For the third question: I think reading it as the fuzzy-set union is the right approach. The bipartite case is trickier. That leads into the fourth question. The transform, as written currently, is really just trying to attach new points without altering the existing structure (unlike, for example, the |
Beta Was this translation helpful? Give feedback.
Let's see if I can provide at least some answers. The first question is at least relatively straightforward: mostly resetting local connectivity should be the right thing to do, but the
__sub__
routine is pretty experimental (I added simply because it seemed like an obvious counterpart once I added the__add__
and__mul__
), and resetting local connectivity there ... pretty much eliminated anything__sub__
did, so I just didn't do it in that case. It is just an odd and quirky example really.For the second question: it's a rough approximation to
smooth_knn_dist
; the first step is equivalent to the local connectivity to a degree. Perhaps this could be done better/more accurately with more c…