Provide support for RAFT-based indexes #377

lowener · 2023-05-15T16:43:55Z

Describe the changes in the pull request

This PR adds support for RAFT-based indexes. IVF-Flat and IVF-PQ indexes will be added.
The main API change needed is the introduction of batch vector addition (possibly through a VecSimIndex_AddVectorBatch function).
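
To make the batch-addition idea concrete, here is a minimal CPU-only sketch (C++17). `ToyIndex`, its storage, and the return convention (number of newly inserted labels) are assumptions made for illustration; this is not the VecSim API, and `VecSimIndex_AddVectorBatch` is only a proposed name at this point.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <vector>

using labelType = std::size_t;

// Hypothetical toy index: keeps one float vector per label.
class ToyIndex {
public:
    explicit ToyIndex(std::size_t dim) : dim_(dim) { assert(dim > 0); }

    // vector_data holds batch_size vectors stored contiguously (row-major).
    // Returns how many labels were new; an existing label is overwritten
    // and not counted (assumed convention).
    int addVectorBatch(const float *vector_data, const labelType *labels,
                       std::size_t batch_size) {
        int added = 0;
        for (std::size_t i = 0; i < batch_size; ++i) {
            const float *row = vector_data + i * dim_;
            const bool inserted =
                vectors_.insert_or_assign(labels[i], std::vector<float>(row, row + dim_))
                    .second;
            if (inserted) {
                ++added;
            }
        }
        return added;
    }

    std::size_t size() const { return vectors_.size(); }

    const std::vector<float> &get(labelType label) const { return vectors_.at(label); }

private:
    std::size_t dim_;
    std::unordered_map<labelType, std::vector<float>> vectors_;
};
```

A caller passes one contiguous buffer plus a parallel array of labels, which is the shape a GPU backend wants anyway since it allows a single host-to-device copy per batch.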

Mark if applicable

This PR introduces API changes
This PR introduces serialization changes

Work still in progress

CLAassistant · 2023-05-15T16:44:01Z

All committers have signed the CLA.

Spartee · 2023-05-18T00:15:21Z

@lowener Thanks so much for putting this up! Super excited. To run the tests, can you sign the CLA? Let me know if you have and it's just not appearing.

DvirDukhan · 2023-05-21T20:54:02Z

cmake/fetch_rapids.cmake

@@ -0,0 +1,18 @@
+# =============================================================================


redis license

DvirDukhan · 2023-05-21T20:54:35Z

cmake/fetch_rapids.cmake

+# or implied. See the License for the specific language governing permissions and limitations under
+# the License.
+
+if(NOT EXISTS ${CMAKE_CURRENT_BINARY_DIR}/RAPIDS.cmake)


can we use cmake fetch content?

cmake/libcutlass.cmake

DvirDukhan · 2023-05-21T20:58:31Z

CMakeLists.txt

+  include(rapids-cmake)
+  include(rapids-cpm)
+  include(rapids-cuda)
+  include(rapids-export)
+  include(rapids-find)


please add a comment about each specific include purpose

DvirDukhan · 2023-05-22T15:55:52Z

src/VecSim/algorithms/raft_ivf/ivf_index.cuh

+        if (!flat_index_) {
+            flat_index_ = std::make_unique<raftIvfFlatIndex>(
+                raft::neighbors::ivf_flat::build<DataType, std::int64_t>(
+                    res_, *build_params_flat_, raft::make_const_mdspan(vector_data_gpu.view())));


add comments please

DvirDukhan · 2023-05-22T15:58:10Z

src/VecSim/algorithms/raft_ivf/ivf_index.cuh

+int RaftIVFIndex::addVectorBatch(const void *vector_data, labelType *labels, size_t batch_size,
+                                 bool overwrite_allowed) {
+    auto vector_data_gpu =
+        raft::make_device_matrix<DataType, std::int64_t>(res_, batch_size, this->dim);


can you explain if std::int64_t is used for id or label?

DvirDukhan · 2023-05-22T16:12:39Z

src/VecSim/algorithms/raft_ivf/ivf_index.cuh

+        return result_list;
+    }
+    auto vector_data_gpu =
+        raft::make_device_matrix<DataType, std::int64_t>(res_, queryParams->batchSize, this->dim);


batchSize is misused here. It is used in our batch iterator for hybrid queries
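
One way to read this comment: the number of query vectors could be passed explicitly instead of being carried by `batchSize`. A minimal CPU-only sketch under that assumption follows; `searchBatch` and its `n_queries` parameter are hypothetical names, and the brute-force k = 1 search stands in for the real IVF search.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical: for each of n_queries row-major queries, return the position
// of the closest stored vector (squared L2 distance, k = 1).
// The query count is an explicit argument, so no unrelated query parameter
// has to carry it.
std::vector<std::size_t> searchBatch(const std::vector<float> &index_data,
                                     const float *queries, std::size_t n_queries,
                                     std::size_t dim) {
    assert(dim > 0 && !index_data.empty() && index_data.size() % dim == 0);
    const std::size_t n_vectors = index_data.size() / dim;
    std::vector<std::size_t> nearest(n_queries, 0);
    for (std::size_t q = 0; q < n_queries; ++q) {
        float best = std::numeric_limits<float>::max();
        for (std::size_t v = 0; v < n_vectors; ++v) {
            float dist = 0.f;
            for (std::size_t d = 0; d < dim; ++d) {
                const float diff = queries[q * dim + d] - index_data[v * dim + d];
                dist += diff * diff;
            }
            if (dist < best) {
                best = dist;
                nearest[q] = v;
            }
        }
    }
    return nearest;
}
```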

DvirDukhan · 2023-05-22T16:20:17Z

src/VecSim/algorithms/raft_ivf/ivf_tiered.cuh

+        this->ivf_index_->addVectorBatchGpuBuffer(vectorDataGpuBuffer.data_handle(),
+                                                  labels_gpu.data_handle(),
+                                                  nVectors);
+        updateIvfIndex = false;


Once the flat buffer is copied to the IVF it should be flushed
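
A minimal sketch of the flush-after-copy behavior requested here, on the CPU only. `ToyBackend`, `ToyTieredIndex`, and `flush_threshold` are hypothetical names used for illustration; the real tiered index copies into a GPU buffer and calls into RAFT instead.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

using labelType = std::size_t;

// Hypothetical stand-in for the backend (IVF) index.
struct ToyBackend {
    std::vector<float> data;
    std::vector<labelType> labels;

    void addVectorBatch(const float *vectors, const labelType *ids, std::size_t n,
                        std::size_t dim) {
        data.insert(data.end(), vectors, vectors + n * dim);
        labels.insert(labels.end(), ids, ids + n);
    }
};

// Hypothetical tiered index: vectors land in a flat buffer first and are
// moved to the backend in one batch once the buffer reaches a threshold.
class ToyTieredIndex {
public:
    ToyTieredIndex(std::size_t dim, std::size_t flush_threshold)
        : dim_(dim), flush_threshold_(flush_threshold) {
        assert(dim > 0 && flush_threshold > 0);
    }

    void addVector(const float *vector, labelType label) {
        buffer_data_.insert(buffer_data_.end(), vector, vector + dim_);
        buffer_labels_.push_back(label);
        if (buffer_labels_.size() >= flush_threshold_) {
            flush();
        }
    }

    // Copy the buffered vectors to the backend, then clear the buffer so
    // the same vectors are never transferred twice.
    void flush() {
        if (buffer_labels_.empty()) {
            return;
        }
        backend_.addVectorBatch(buffer_data_.data(), buffer_labels_.data(),
                                buffer_labels_.size(), dim_);
        buffer_data_.clear();
        buffer_labels_.clear();
    }

    std::size_t bufferedCount() const { return buffer_labels_.size(); }
    std::size_t backendCount() const { return backend_.labels.size(); }

private:
    std::size_t dim_;
    std::size_t flush_threshold_;
    std::vector<float> buffer_data_;
    std::vector<labelType> buffer_labels_;
    ToyBackend backend_;
};
```

Clearing the buffer inside `flush()` itself, rather than leaving it to the caller, keeps the copy and the reset in one place, so a later transfer cannot pick up stale entries.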

lowener added 18 commits March 23, 2023 11:33

Initialize CUDA build and raft architecture

f600202

Add initial ivf flat implementation

4353816

Use only floats for ivf flat

f265809

Add ivfpq structure

70f6766

Implement raft ivf pq

4b60b91

Use raft 23.04

ea6d36c

Initial batch search

1aaed43

Fix naming

21d9298

Add initial tests for raft ivf

567ab3c

Test addition and use common index interface

6f7344e

Use async copy

1dd890b

Init ivf tiered index

fa444ec

Initial benchmark creation

27246c6

Use updateIvf in tiered index and extend after build call

427da08

Unify raft ivf flat and pq

9792a9e

Fix style

e052eb7

Clean unused code

8bd6add

Remove benchmark code

a307a4a

Merge branch 'main' into fea-raft-clean

5b54804

lowener added 2 commits May 18, 2023 17:38

Add test for Tiered raft index

bf09526

Add IVF Flat benchmark

0160950

DvirDukhan reviewed May 23, 2023

lowener added 5 commits May 30, 2023 17:40

Add IVF Flat and PQ Tiered Index for benchmarking

11776d5

Add missing empty line

20cc261

Separate ivf flat and pq code

4194a9a

Fix Tiered index + add Flat index reset

bebc9bc

Remove counts_ class attribute

88a6b32

wphicks mentioned this pull request Aug 4, 2023

Provide GPU-accelerated vector indexes with RAFT #413

Draft

7 tasks

lowener closed this Nov 14, 2023
