Add support for PQ preprocessing API #1278

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

lowener wants to merge 19 commits into rapidsai:branch-25.12 from lowener:25.10-pq-preprocessing

Contributor

lowener commented Aug 23, 2025 •

edited

Loading

Related issue: #107

This PR adds support for a PQ preprocessing API. It gives access to train() and transform() function that can be used to transform a dataset into PQ codes. It is re-using the VPQ functions from CAGRA-Q.


          Initial commit for PQ preprocessing API

cfe4f92

Signed-off-by: Mickael Ide <mide@nvidia.com>

github-project-automation bot added this to Vector Search, ML, & Data Mining Release Board

github-project-automation bot moved this to Todo in Vector Search, ML, & Data Mining Release Board

copy-pr-bot bot commented Aug 23, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

lowener added feature request non-breaking C++ labels

cjnolet reviewed

View reviewed changes

cpp/include/cuvs/preprocessing/quantize/product.hpp Outdated

    
            @@ -0,0 +1,164 @@
          
              /*

               * Copyright (c) 2024-2025, NVIDIA CORPORATION.

Member

cjnolet Aug 25, 2025

Please only include the current year in new files

cjnolet reviewed

View reviewed changes

cpp/include/cuvs/preprocessing/quantize/product.hpp Outdated Show resolved Hide resolved

cjnolet reviewed

View reviewed changes

cpp/include/cuvs/preprocessing/quantize/product.hpp Outdated Show resolved Hide resolved

cjnolet reviewed

View reviewed changes

cpp/src/preprocessing/quantize/detail/product.cuh Outdated Show resolved Hide resolved

cjnolet reviewed

View reviewed changes

cpp/src/preprocessing/quantize/detail/product.cuh Outdated Show resolved Hide resolved

cjnolet reviewed

View reviewed changes

cpp/src/preprocessing/quantize/product.cu Outdated Show resolved Hide resolved

cjnolet reviewed

View reviewed changes

cpp/tests/preprocessing/product_quantization.cu Outdated Show resolved Hide resolved

cjnolet reviewed

View reviewed changes

cpp/include/cuvs/preprocessing/quantize/product.hpp Outdated Show resolved Hide resolved


          Support n_lists and cleanup code

096daa5

Signed-off-by: Mickael Ide <mide@nvidia.com>

cjnolet moved this from Todo to In Progress in Vector Search, ML, & Data Mining Release Board

cjnolet assigned lowener

gland1 reviewed

View reviewed changes

cpp/src/preprocessing/quantize/detail/product.cuh Outdated

    
                pq_params.add_data_on_build            = false;

                pq_params.max_train_points_per_pq_code = params.max_train_points_per_pq_code;

                auto pq_index = cuvs::neighbors::ivf_pq::build(res, pq_params, dataset);

gland1 Sep 14, 2025

I've tried using ivf build + pq-centeres api to get the generated centroids as part of generating pq for diskann. It appears I get significant lower recall values. I think it because the pq strategy used by ivf has 2 quantizers and also ivf uses balanced kmeans which can lower the recall.

Contributor

tarang-jain Sep 26, 2025

Based on my understanding, the current VPQ also has two quantizers. @lowener correct me if I am missing something, but based on what I had seen in DiskANN, the raw vectors are PQ quantized directly, there is no coarse quantizer (VQ centroids).

gland1 reviewed

View reviewed changes

cpp/include/cuvs/preprocessing/quantize/product.hpp Outdated Show resolved Hide resolved

lowener added 6 commits

September 22, 2025 09:10


          Switch to VPQ

244a9cd

Signed-off-by: Mickael Ide <mide@nvidia.com>


          Fix trainpq and train workflow

9537eb1

Signed-off-by: Mickael Ide <mide@nvidia.com>


          Remove timer

2883a25

Signed-off-by: Mickael Ide <mide@nvidia.com>


          Merge branch 'branch-25.10' into 25.10-pq-preprocessing

78dfd69


          Cleanup Code

9dd0cfe

Signed-off-by: Mickael Ide <mide@nvidia.com>


          Add double dtype

9c543fb

Signed-off-by: Mickael Ide <mide@nvidia.com>

lowener marked this pull request as ready for review

September 24, 2025 16:09

lowener requested review from a team as code owners

September 24, 2025 16:09

KyleFromNVIDIA approved these changes

View reviewed changes

Member

KyleFromNVIDIA left a comment

Approved trivial CMake changes


          Add C and python API

5471d9a

Signed-off-by: Mickael Ide <mide@nvidia.com>

lowener requested a review from a team as a code owner

September 26, 2025 13:52


          Merge branch 'branch-25.10' into 25.10-pq-preprocessing

716fa58

KyleFromNVIDIA approved these changes

View reviewed changes

cpp/tests/CMakeLists.txt Outdated Show resolved Hide resolved

tarang-jain reviewed

View reviewed changes

cpp/src/neighbors/detail/vpq_dataset.cuh Show resolved Hide resolved


          Merge branch 'branch-25.10' into 25.10-pq-preprocessing

6d6d4ca

gland1 reviewed

View reviewed changes

cpp/src/preprocessing/quantize/detail/product.cuh Show resolved Hide resolved

lowener added 2 commits

September 29, 2025 08:15


          Make VQ optional

746cac4

Signed-off-by: Mickael Ide <mide@nvidia.com>


          Add option for classical KMeans

1950da4

Signed-off-by: Mickael Ide <mide@nvidia.com>

lowener requested a review from a team as a code owner

September 29, 2025 17:08

cjnolet reviewed

View reviewed changes

cpp/include/cuvs/preprocessing/quantize/product.h Show resolved Hide resolved

lowener added 5 commits

September 29, 2025 17:00


          Add kmeans option to python

75629e5

Signed-off-by: Mickael Ide <mide@nvidia.com>


          Merge branch 'branch-25.10' into 25.10-pq-preprocessing

32c8912


          Add getter for pq codebooks

d774999

Signed-off-by: Mickael Ide <mide@nvidia.com>


          Fix doc

a55df82

Signed-off-by: Mickael Ide <mide@nvidia.com>


          Fix reconstruct kernel

5a151f9

Signed-off-by: Mickael Ide <mide@nvidia.com>

lowener changed the base branch from branch-25.10 to branch-25.12

October 6, 2025 13:48


          Merge branch 'branch-25.12' into 25.10-pq-preprocessing

a0c5071

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

C++ feature request non-breaking