PersistenceLengths and its unitary tests #1117

VincentRouvreau · 2024-08-12T15:15:26Z

No description provided.

mglisse · 2024-08-28T13:51:48Z

src/python/gudhi/representations/vector_methods.py

+        pers_lengths = []
+        for pd in X:
+            # Sort in reverse order persistence lengths (where length = death - birth)
+            lengths = np.flip(np.sort(pd[:,1] - pd[:,0]))


To get the top k elements (with k significantly smaller than n), a slightly more efficient algorithm is to use partition(*, -k)[-k:] (all the numbers I give are ±1, to be checked) and then you only need to sort those k elements. But that can come later if you prefer.

I did so on 768774a, but I had to assert on num_lengths <= 0 in the constructor. To be discussed, cf. this discussion in Ripser PR

…ntation

mglisse · 2024-09-03T11:31:53Z

src/python/gudhi/representations/vector_methods.py

@@ -912,28 +918,29 @@ def fit(self, X, y=None):

    def transform(self, X):
        """
-        Compute the persistence landscape for each persistence diagram individually and concatenate the results.
+        Compute the persistence lengths for each persistence diagram individually and concatenate the results.


When I read "concatenate", I expect the output to be a 1d array of length N*k.

I removed the "concatenate" on 3bbd3af

mglisse · 2024-09-03T11:33:10Z

src/python/gudhi/representations/vector_methods.py


        Parameters:
            X (list of n x 2 numpy arrays): input persistence diagrams.
-    
+
        Returns:
            numpy array with shape (number of diagrams) x (num_lengths): output persistence lengths.


That would be nice, but isn't the current output a list?

You are right, so I took the opportunity to rewrite it with a first result array filled with zeros, that is instantiated rows by rows on 3bbd3af

mglisse · 2024-09-03T14:12:33Z

src/python/gudhi/representations/vector_methods.py

+        idx = 0
+        for pd in X:


I think the usual way to do that in Python is

for idx, pd in enumerate(X):

(so you don't need idx = idx + 1)

Oh, nice ! I did it on af41754

mglisse · 2024-09-03T15:30:21Z

src/python/gudhi/representations/vector_methods.py

+        useful when PersistenceLengths is included in a scikit-learn Pipeline).
+
+        Parameters:
+            X (list of n x 2 or n x 1 numpy arrays): input persistence diagrams.


Thanks, I missed this copy /paste that has to be changed. I fixed it on e984496

mglisse · 2024-09-03T15:43:44Z

src/python/gudhi/representations/vector_methods.py

+
+        Parameters:
+            X (list of n x 2 or n x 1 numpy arrays): input persistence diagrams.
+            y (n x 1 array): persistence diagram lengths (unused).


Do we have to be that specific? Sklearn seems to say just

y (None): Ignored.

Sounds good to me. I changed it on 009748c

mglisse

Looks ok to me, give @MathieuCarriere a bit of time to comment.

VincentRouvreau added 2 commits August 12, 2024 17:14

PersistenceLengths and its unitary tests

01ea034

typo

d30299f

VincentRouvreau marked this pull request as ready for review August 14, 2024 14:31

VincentRouvreau mentioned this pull request Aug 28, 2024

Cech persistence sklearn #1126

Open

mglisse reviewed Aug 28, 2024

View reviewed changes

mglisse requested a review from MathieuCarriere August 28, 2024 13:53

code review: use numpy partition and bad/copy paste/replace of docume…

768774a

…ntation

mglisse reviewed Sep 3, 2024

View reviewed changes

code review: PersistenceLengths to return numpy array. Some doc also

3bbd3af

mglisse reviewed Sep 3, 2024

View reviewed changes

code review: enumerate instead of for loop + index increment

af41754

mglisse reviewed Sep 3, 2024

View reviewed changes

VincentRouvreau added 2 commits September 3, 2024 17:49

doc review: no need to be that specific for a fit that does nothing

009748c

doc review: bad input description for fit

e984496

mglisse approved these changes Sep 3, 2024

View reviewed changes

VincentRouvreau merged commit 1670055 into GUDHI:master Sep 23, 2024
7 checks passed

VincentRouvreau deleted the persistence_lengths_representation branch September 23, 2024 07:28

VincentRouvreau added the 3.11.0 GUDHI version 3.11.0 label Sep 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PersistenceLengths and its unitary tests #1117

PersistenceLengths and its unitary tests #1117

VincentRouvreau commented Aug 12, 2024

mglisse Aug 28, 2024

VincentRouvreau Sep 3, 2024

mglisse Sep 3, 2024

VincentRouvreau Sep 3, 2024

mglisse Sep 3, 2024

VincentRouvreau Sep 3, 2024

mglisse Sep 3, 2024

VincentRouvreau Sep 3, 2024

mglisse Sep 3, 2024

VincentRouvreau Sep 3, 2024

mglisse Sep 3, 2024

VincentRouvreau Sep 3, 2024

mglisse left a comment

PersistenceLengths and its unitary tests #1117

PersistenceLengths and its unitary tests #1117

Conversation

VincentRouvreau commented Aug 12, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mglisse left a comment

Choose a reason for hiding this comment