xcp ref implementation #2895

DhanusML · 2024-09-10T04:38:16Z

Notation

Data is a matrix $X\in\mathbb{R}^{p\times n}$. Each column is a $p$-dimensional vector sampled independently. The matrix $X$ is assumed to be stored in column-major fashion.

xcp

Matrix of cross products is a $p\times p$ matrix whose $(i,j)$th entry is given by
$$C_{ij} = \sum_{k=1}^n(x_{ik}-\mu_i)(x_{jk}-\mu_j).$$
Here $x_{ik}$ is the $i$th component of the $k$th random vector.

The implementation requires this to be computed in batches of $X$, where the raw sum is passed between the batch computations. This can be done using the following:

Let $\mu'$ be the mean of the data until the previous batch and let $\mu$ be the mean including the data from current batch. Let $n'$ be the total number of data points until the previous batch and $S'$ be their sum (i.e., $\mu' = S'/n'$ and $\mu = S/n$).

Then the $(i,j)$ entry of $C$ can be computed using
$$C_{ij}\leftarrow C_{ij}+(\mu_j'-\mu_j)S_i' + (\mu_i'-\mu_i)S_j' + (\mu_i\mu_j-\mu_i'\mu_j')n' + \sum_{k=n'+1}^n(x_{ik}-\mu_i)(x_{jk}-\mu_j).$$

This can be simplified to
$$C_{ij} \leftarrow C_{ij} + \frac{S_i' S_j'}{n'} - \frac{S_i S_j}{n} + \sum_{k=n'+1}^n x_{ik}x_{jk}$$
or
$$C \leftarrow C + \frac{S'S'^t}{n'} - \frac{SS^t}{n} + XX^t$$

The level3 BLAS routine GEMM is used to compute the matrix products.

This routine computes the matrix of cross product of data stored in column major format, in batches. For matrix X of dimensions p x n, the i,j th entry of the cross product matrix is C_ij = \sum_k (x_ik-\mu_i) (x_jk-\mu_k) where x_ij is the jth element of the ith row, of the matrix X. Implementation uses the BLAS routine GEMM. Signed-off-by: Dhanus M Lal <Dhanus.MLal@fujitsu.com>

Signed-off-by: Dhanus M Lal <Dhanus.MLal@fujitsu.com>

KulikovNikita · 2024-09-15T11:14:22Z

cpp/daal/src/externals/service_stat_ref.h

+        double alpha;
+        if (accWtOld != 0)
+        {
+            double * sumOld = daal::services::internal::service_malloc<double, cpu>(nFeatures, sizeof(double));


Suggested change

double * sumOld = daal::services::internal::service_malloc<double, cpu>(nFeatures, sizeof(double));

double* const sumOld = daal::services::internal::service_malloc<double, cpu>(nFeatures, sizeof(double));

KulikovNikita · 2024-09-15T11:14:41Z

cpp/daal/src/externals/service_stat_ref.h

+                sumOld[i] = sum[i];
+            }
+            // S_old S_old^t/accWtOld
+            alpha = 1.0 / accWtOld;


Any checks for overflow?

Does onedal have some macros to check floating point overflow?

KulikovNikita · 2024-09-15T11:16:22Z

cpp/daal/src/externals/service_stat_ref.h

+        transb = 'T';
+        alpha  = 1.0;
+        beta   = 1.0;


I would recommend not to use the same variables. It's confusing and also can be dangerous if someone will forgot to change them.

KulikovNikita · 2024-09-15T11:17:56Z

cpp/daal/src/externals/service_stat_ref.h

+        transb = 'T';
+        alpha  = 1.0;
+        beta   = 1.0;
+        blasInst.xgemm(&transa, &transb, &nFeatures, &nFeatures, &nVectors, &alpha, data, &nFeatures, data, &nFeatures, &beta, crossProduct,
+                       &nFeatures);


Suggested change

transb = 'T';

alpha = 1.0;

beta = 1.0;

blasInst.xgemm(&transa, &transb, &nFeatures, &nFeatures, &nVectors, &alpha, data, &nFeatures, data, &nFeatures, &beta, crossProduct,

&nFeatures);

{

constexpr char transb = 'T';

constexpr double alpha = 1.0;

constexpr double beta = 1.0;

blasInst.xgemm(&transa, &transb, &nFeatures, &nFeatures, &nVectors, &alpha, data, &nFeatures, data, &nFeatures, &beta, crossProduct,

&nFeatures);

}

Signed-off-by: Dhanus M Lal <Dhanus.MLal@fujitsu.com>

Alexsandruss · 2024-09-23T13:25:14Z

/intelci: run

Alexsandruss · 2024-09-25T14:46:54Z

/intelci: run

Alexsandruss · 2024-09-25T14:48:01Z

@KulikovNikita, were your comments addressed?

Alexandr-Solovev · 2024-10-10T10:40:31Z

/intelci: run

Alexsandruss · 2024-10-14T13:53:11Z

/intelci: run

DhanusML · 2024-10-16T11:52:26Z

Can you provide details of the check that is failing?

Alexsandruss · 2024-10-18T11:45:13Z

Can you provide details of the check that is failing?

I suppose these are sporadic failures. Restarting CI

Alexsandruss · 2024-10-18T11:45:17Z

/intelci: run

Alexsandruss · 2024-10-21T15:03:40Z

/intelci: run

Alexsandruss · 2024-10-21T19:24:05Z

/intelci: run

DhanusML requested review from Alexsandruss, samir-nasibli and Alexandr-Solovev as code owners September 10, 2024 04:38

refactor and check for malloc fail

0293b6f

Signed-off-by: Dhanus M Lal <Dhanus.MLal@fujitsu.com>

KulikovNikita reviewed Sep 15, 2024

View reviewed changes

review changes

9617b65

Signed-off-by: Dhanus M Lal <Dhanus.MLal@fujitsu.com>

DhanusML requested a review from KulikovNikita September 23, 2024 11:15

Alexsandruss added the enhancement label Sep 23, 2024

Merge branch 'oneapi-src:main' into dhanus/xcp

988389d

Merge branch 'oneapi-src:main' into dhanus/xcp

a7a74df

Merge branch 'oneapi-src:main' into dhanus/xcp

4a60b17

Merge branch 'oneapi-src:main' into dhanus/xcp

bd6aa89

Alexandr-Solovev approved these changes Oct 22, 2024

View reviewed changes

Alexsandruss approved these changes Oct 22, 2024

View reviewed changes

Alexsandruss merged commit 4145d40 into oneapi-src:main Oct 22, 2024
17 of 18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

xcp ref implementation #2895

xcp ref implementation #2895

DhanusML commented Sep 10, 2024

KulikovNikita Sep 15, 2024

KulikovNikita Sep 15, 2024

DhanusML Sep 23, 2024

KulikovNikita Sep 15, 2024

KulikovNikita Sep 15, 2024

Alexsandruss commented Sep 23, 2024

Alexsandruss commented Sep 25, 2024

Alexsandruss commented Sep 25, 2024

Alexandr-Solovev commented Oct 10, 2024

Alexsandruss commented Oct 14, 2024

DhanusML commented Oct 16, 2024

Alexsandruss commented Oct 18, 2024 •

edited

Loading

Alexsandruss commented Oct 18, 2024

Alexsandruss commented Oct 21, 2024

Alexsandruss commented Oct 21, 2024

	double * sumOld = daal::services::internal::service_malloc<double, cpu>(nFeatures, sizeof(double));
	double* const sumOld = daal::services::internal::service_malloc<double, cpu>(nFeatures, sizeof(double));

xcp ref implementation #2895

xcp ref implementation #2895

Conversation

DhanusML commented Sep 10, 2024

Notation

xcp

KulikovNikita Sep 15, 2024

Choose a reason for hiding this comment

KulikovNikita Sep 15, 2024

Choose a reason for hiding this comment

DhanusML Sep 23, 2024

Choose a reason for hiding this comment

KulikovNikita Sep 15, 2024

Choose a reason for hiding this comment

KulikovNikita Sep 15, 2024

Choose a reason for hiding this comment

Alexsandruss commented Sep 23, 2024

Alexsandruss commented Sep 25, 2024

Alexsandruss commented Sep 25, 2024

Alexandr-Solovev commented Oct 10, 2024

Alexsandruss commented Oct 14, 2024

DhanusML commented Oct 16, 2024

Alexsandruss commented Oct 18, 2024 • edited Loading

Alexsandruss commented Oct 18, 2024

Alexsandruss commented Oct 21, 2024

Alexsandruss commented Oct 21, 2024

Alexsandruss commented Oct 18, 2024 •

edited

Loading