Migrate from RAFT to CUVS #3549

Open · wants to merge 142 commits into base: main
Conversation

@tarang-jain (Contributor) commented on Jun 25, 2024:

Remove the dependency on raft::compiled and modify the GPU implementations to use the cuVS backend in place of RAFT.

A deeper look at the dependency:
FAISS gets ANN algorithm implementations such as IVF-Flat and IVF-PQ from cuVS. RAFT is a lightweight, header-only C++ template library that cuVS relies on for the more fundamental, low-level utilities: RAFT's device mdarray and mdspan objects, the RAFT resources object (raft::resources) that takes care of stream ordering for device functions, and linear algebra primitives such as map, reduce, and BLAS routines. Many cuVS functions take RAFT mdspan objects as arguments (for example raft::device_matrix_view), so FAISS relies on both cuVS and RAFT: it gets the RAFT headers through cuVS and uses them to construct the arguments that cuVS consumes. Note that FAISS is not explicitly linked against raft::raft or raft::compiled; only the required headers are included and compiled, rather than the whole RAFT shared library. This is why mentions of RAFT still appear in FAISS.
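
To make the division of labor concrete, here is a minimal sketch of the pattern: FAISS-side code wraps a raw device pointer in a RAFT mdspan view and hands it to a cuVS builder. This is illustrative only, not the actual FAISS code; the function and parameter names here are assumptions based on the public RAFT and cuVS C++ headers.

```cpp
// Illustrative sketch (not the actual FAISS code): a raw device pointer owned
// by the caller is wrapped in a non-owning RAFT mdspan view, which is then
// passed to a cuVS builder. Signatures assumed from the RAFT/cuVS headers.
#include <raft/core/device_mdspan.hpp>
#include <raft/core/device_resources.hpp>

#include <cuvs/neighbors/ivf_flat.hpp>

cuvs::neighbors::ivf_flat::index<float, int64_t> build_ivf_flat(
        raft::device_resources& res,  // RAFT handle: stream ordering, workspace
        const float* d_vectors,       // device pointer owned by the caller
        int64_t n_rows,
        int64_t dim) {
    // Non-owning view over the device data; no copy is made.
    auto dataset = raft::make_device_matrix_view<const float, int64_t>(
            d_vectors, n_rows, dim);

    cuvs::neighbors::ivf_flat::index_params params;
    params.n_lists = 1024;  // number of IVF cells (illustrative value)

    // cuVS consumes the RAFT view; nothing here links raft::compiled.
    return cuvs::neighbors::ivf_flat::build(res, params, dataset);
}
```

Only the RAFT headers needed to build these views are pulled in; the heavy lifting is compiled inside cuVS.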

@tarang-jain (Contributor, Author):

@asadoughi I was able to resolve the segfault. It was related to some bugs in the cuVS code paths of bfKnn.
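
For context, a minimal sketch of a brute-force kNN call that exercises the cuVS-backed path of bfKnn, which is where the segfault surfaced. Field names follow faiss/gpu/GpuDistance.h; the use_cuvs flag is assumed to be the switch this migration introduces for routing to the cuVS backend.

```cpp
// Minimal sketch: brute-force kNN through faiss::gpu::bfKnn on the cuVS path.
// Assumes the use_cuvs flag added in this migration; all pointers must be
// valid float/idx_t arrays of the stated shapes.
#include <faiss/MetricType.h>
#include <faiss/gpu/GpuDistance.h>
#include <faiss/gpu/StandardGpuResources.h>

void run_bfknn(
        const float* xb, int nb,      // database vectors, nb x d
        const float* xq, int nq,      // query vectors, nq x d
        int d, int k,
        float* out_distances,         // nq x k
        faiss::idx_t* out_indices) {  // nq x k
    faiss::gpu::StandardGpuResources res;

    faiss::gpu::GpuDistanceParams args;
    args.metric = faiss::METRIC_L2;
    args.k = k;
    args.dims = d;
    args.vectors = xb;
    args.numVectors = nb;
    args.queries = xq;
    args.numQueries = nq;
    args.outDistances = out_distances;
    args.outIndices = out_indices;
    args.use_cuvs = true;  // assumed flag name: route through the cuVS backend

    faiss::gpu::bfKnn(&res, args);
}
```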

@mfoerste4 left a comment:

@tarang-jain, as discussed offline, I added the changes required to get rid of the additional overhead and achieve performance for FAISS+cuVS comparable to plain cuVS.

Review threads (resolved) on:
faiss/gpu/GpuDistance.cu (outdated)
faiss/gpu/GpuIndex.cu (outdated)
faiss/gpu/StandardGpuResources.cpp
@facebook-github-bot (Contributor):

@asadoughi has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@@ -79,9 +79,9 @@ if(FAISS_ENABLE_GPU)
endif()
endif()

if(FAISS_ENABLE_RAFT AND NOT TARGET raft::raft)
find_package(raft COMPONENTS compiled distributed)
Contributor:

Is RAFT still needed for compilation, or do we need to install rmm explicitly?

https://github.com/facebookresearch/faiss/actions/runs/10889999572/job/30218385769?pr=3549

The link interface of target "cuvs::cuvs" contains:

    rmm::rmm

  but the target was not found.  Possible reasons include:

@tarang-jain (Contributor, Author):
rmm and RAFT should ideally be available transitively through cuVS, but I have also tried explicitly linking raft::raft.

@mfoerste4 mentioned this pull request on Sep 18, 2024.
@asadoughi (Contributor):

It looks like one test is still failing: TestTorchUtilsGPU.test_train_add_with_ids. Have you had a chance to troubleshoot?

@tarang-jain (Contributor, Author):

@asadoughi yes, I have been looking into it. It is quite strange. The following order of operations gives erroneous results:
create a GPU torch tensor --> train a GPU IVF-Flat index on it --> search the index --> reset the index --> train the index again with the same data, but this time as a CPU tensor --> search the index.

However, if both tensors are on the same device (either CPU or GPU), there is no failure and everything works fine.
