Update from gudhi #20

hschreiber · 2024-09-17T10:54:52Z

As Multipers is starting to get integrated into Gudhi, I will start to update the C++ files such that they match those in Gudhi.

I will make sure that the unit tests (run with pytest) and all notebooks (in docs/notebooks) work before pushing. But as they are not exhaustive, it would be good to run your own notebooks too @DavidLapous to ensure that I forgot nothing (I did not went through all the python/cython code line by line...)

DavidLapous · 2024-09-17T17:00:18Z

Linked with GUDHI/gudhi-devel#976. Should we wait that this PR is merged before merging this one to avoid unnecessary work? I'll test this update soon. I'll also deal with the notebooks, as I'd like to deprecate/change some interface.

hschreiber · 2024-09-17T17:40:47Z

Linked with GUDHI/gudhi-devel#976. Should we wait that this PR is merged before merging this one to avoid unnecessary work?

Updating Multipers while working on GUDHI/gudhi-devel#976 helps me to find some bugs, so I will do both in parallel in any case. So you can either merge it step by step or all at once, whatever you think is less work (my guess is that if you don't do big changes in Multipers for the time being, it does not make a big difference. Otherwise, it would be better to merge things before to avoid having to update the new code again).

The most important thing is that the update is properly tested as some behaviour changes, so the update is not trivial. That is the main reason why this PR is a draft.

The good news is that the notebooks cover most methods. The problem is that they cover them usually with just one set of parameters, so a lot of combination and backend versions are not tested at all...

I'll test this update soon. I'll also deal with the notebooks, as I'd like to deprecate/change some interface.

Ok!

DavidLapous

Quick comments

_tempita_grid_gen.py

multipers/filtration_conversions.pxd.tp

multipers/filtrations.pxd

DavidLapous · 2024-09-18T07:24:19Z

multipers/mma_structures.pyx.tp

+    if len(births) == 1 and births[0].size == 1 and isinf(births[0][0]):
+        if len(deaths) == 1 and deaths[0].size == 1 and isinf(deaths[0][0]):
+            return []


These checks should be done in c++ (we have access to v). Each One_critical_filtration has the is inf method

So, should I add is_plus/minus_inf() to the Cython interface of One_critical_filtration? Is not there for now.

multipers/multiparameter_module_approximation/approximation.h

DavidLapous · 2024-09-18T07:37:33Z

multipers/multiparameter_module_approximation/approximation.h

+                            ? box.get_lower_corner()[i]
+                            : negInf;
+      value_type t_death = death.is_plus_inf() ? max_i : (death.is_minus_inf() ? -inf : std::min(death[i], max_i));
+      value_type t_birth = birth.is_plus_inf() ? inf : (birth.is_minus_inf() ? min_i : std::max(birth[i], min_i));
+      s = std::min(s, t_death - t_birth);
+    }
+  } else {
+    unsigned int dim = std::max(birth.size(), death.size());
+    for (unsigned int i = 0; i < dim; i++){
+      //if they don't have the same size, then one of them has to (+/-)infinite.
+      value_type t_death = death.size() > i ? death[i] : death[0];  //assumes death is never empty
+      value_type t_birth = birth.size() > i ? birth[i] : birth[0];  //assumes birth is never empty


I'm not a big fan of all of this.

Can you be more precise? Is it more or less the same than before with just an extra tests for infinity values...

This assumes inf is of the form {inf} + there is too many ..?..: .. imbriqué, which is not really readable. I haven't though about an alternative though

multipers/tests/test_mma.py

…nto update_from_gudhi

DavidLapous

ok

DavidLapous · 2024-09-19T15:53:07Z

multipers/filtration_conversions.pxd.tp

DavidLapous · 2024-09-19T16:24:03Z

From the time series notebook, if we take

import multipers as mp
from gudhi.point_cloud.timedelay import TimeDelayEmbedding
from os.path import expanduser
import pandas as pd
import numpy as np
import multipers.ml.point_clouds as mmp
DATASET_PATH=expanduser("~/Datasets/UCR/")
dataset_path = DATASET_PATH + "Coffee/Coffee"
xtrain = np.array(pd.read_csv(dataset_path+"_TRAIN.tsv", delimiter='\t', header=None, index_col=None))
TDE = TimeDelayEmbedding(dim=3, delay=1, skip=1)
xtrain = TDE.transform(xtrain)
sts = mmp.PointCloud2FilteredComplex(bandwidths=[-.1], num_collapses=-2, expand_dim=2).fit_transform(xtrain)

Then according to

mp.module_approximation(sts[0][0], threshold=True, box=[[0,1],[1,3]]).plot(box=[[0,1],[1,4]])

The summands whose death curve are [inf] are not thresholded properly.
Also, the verbose seems to lead to a segmentation fault

mp.module_approximation(sts[0][0], verbose=True)

DavidLapous

OK!

DavidLapous · 2024-09-20T12:44:23Z

multipers/gudhi/gudhi/Multi_persistence/Line.h

DavidLapous

Great! This fixes some nice stuff, I still have this segfaulting though:

import multipers as mp
st = mp.SimplexTreeMulti(num_parameters=2)
st.insert([0], [0, 1])
st.insert([1], [1, 0])
st.insert([0, 1], [1, 1])
mp.module_approximation(st,verbose=True)

I'll analyze the core dump when I've got time.

DavidLapous · 2024-09-20T17:09:39Z

multipers/multiparameter_module_approximation/approximation.h

+    bool allInf = true;
+    for (std::size_t i = 0U; i < birth_container.num_parameters(); i++) {
+      auto t = box_.get_lower_corner()[i];
+      if (birth_container[i] < t - 1e-10) birth_container[i] = threshold_in ? t : -filtration_type::T_inf;


Reminder for later, the 1e-10 is funny

hschreiber · 2024-09-23T12:59:50Z

Great! This fixes some nice stuff, I still have this segfaulting though:
import multipers as mp
st = mp.SimplexTreeMulti(num_parameters=2)
st.insert([0], [0, 1])
st.insert([1], [1, 0])
st.insert([0, 1], [1, 1])
mp.module_approximation(st,verbose=True)
I'll analyze the core dump when I've got time.

I can't reproduce the segfault...

DavidLapous · 2024-09-23T13:54:21Z

I can't reproduce the segfault...

I think it is because of my environment; I don't have this issue on my personal laptop

DavidLapous · 2024-09-23T14:17:57Z

LGTM

DavidLapous · 2024-09-23T14:23:55Z

Hmm this is not compiling on macOS, it seems that my "PR" workflow doesn't properly test the PR folder ?

hschreiber and others added 3 commits September 17, 2024 12:03

gudhi update

762a078

Merge branch 'DavidLapous:main' into update_from_gudhi

eb9b997

gudhi update

5928833

DavidLapous reviewed Sep 18, 2024

View reviewed changes

hschreiber and others added 4 commits September 18, 2024 14:22

minor changes

b64d74e

correct output

fb2daa0

Merge branch 'DavidLapous:main' into update_from_gudhi

077d280

Merge branch 'update_from_gudhi' of github.com:hschreiber/multipers i…

c8d64f4

…nto update_from_gudhi

DavidLapous reviewed Sep 19, 2024

View reviewed changes

multipers/filtration_conversions.pxd.tp Outdated

Copy link

Owner

DavidLapous Sep 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ok

fix

ca2eb5b

DavidLapous reviewed Sep 20, 2024

View reviewed changes

multipers/gudhi/gudhi/Multi_persistence/Line.h Outdated

Copy link

Owner

DavidLapous Sep 20, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ok

fix

3299966

DavidLapous requested changes Sep 20, 2024

View reviewed changes

upstream merge

c307a71

DavidLapous marked this pull request as ready for review September 23, 2024 14:18

DavidLapous approved these changes Sep 23, 2024

View reviewed changes

DavidLapous merged commit 28c25ad into DavidLapous:main Sep 23, 2024
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update from gudhi #20

Update from gudhi #20

hschreiber commented Sep 17, 2024

DavidLapous commented Sep 17, 2024 •

edited

Loading

hschreiber commented Sep 17, 2024

DavidLapous left a comment

DavidLapous Sep 18, 2024

hschreiber Sep 18, 2024

DavidLapous Sep 18, 2024

hschreiber Sep 18, 2024

DavidLapous Sep 18, 2024

DavidLapous left a comment

DavidLapous Sep 19, 2024

DavidLapous commented Sep 19, 2024 •

edited

Loading

DavidLapous left a comment

DavidLapous Sep 20, 2024

DavidLapous left a comment •

edited

Loading

DavidLapous Sep 20, 2024

hschreiber commented Sep 23, 2024

DavidLapous commented Sep 23, 2024 •

edited

Loading

DavidLapous commented Sep 23, 2024

DavidLapous commented Sep 23, 2024 •

edited

Loading

Update from gudhi #20

Update from gudhi #20

Conversation

hschreiber commented Sep 17, 2024

DavidLapous commented Sep 17, 2024 • edited Loading

hschreiber commented Sep 17, 2024

DavidLapous left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DavidLapous left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DavidLapous commented Sep 19, 2024 • edited Loading

DavidLapous left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DavidLapous left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hschreiber commented Sep 23, 2024

DavidLapous commented Sep 23, 2024 • edited Loading

DavidLapous commented Sep 23, 2024

DavidLapous commented Sep 23, 2024 • edited Loading

DavidLapous commented Sep 17, 2024 •

edited

Loading

DavidLapous commented Sep 19, 2024 •

edited

Loading

DavidLapous left a comment •

edited

Loading

DavidLapous commented Sep 23, 2024 •

edited

Loading

DavidLapous commented Sep 23, 2024 •

edited

Loading