FP16 API for CAGRA and IVF-PQ #264

tfeher · 2024-07-30T22:02:22Z

This PR adds public API to CAGRA and IVF-PQ ANN search using FP16 input data.

Note that the fp16 kernels are already compiled in libcuvs. This PR just adds the missing public API declarations into the C++ headers, and restores the instantiations of public API functions.

This PR partially fixes #144 (the IVF-Flat API is not yet added here).

cjnolet · 2024-07-30T23:11:08Z

We also need to measure the impact of binary size and compile time for adding these new types to the public API.

We can't keep increasing without figuring out ways we can consolidate what's there. These two thjngs are the number 1 complaint from users currently. (It's not just this PR. This is also holding up the half precision bfknn and RBC PRs).

cjnolet · 2024-07-30T23:43:18Z

Linking #110

tfeher · 2024-07-30T23:43:33Z

The kernels are already compiled and included in libcuvs.so. The additional instantiations of the API entry points shall have negligible size. I will confirm this once CI finishes.

It was a mistake during porting the code from RAFT, that the public API was not defined for fp16, therefore I labelled this PR as a bugfix.

achirkin

Thanks Tamas for the PR! The changes are straightforward and it looks good to me as-is, yet a few nitpicks below (if the time permits).

cpp/bench/ann/src/cuvs/cuvs_ivf_pq.cu

cpp/include/cuvs/neighbors/cagra.hpp

cpp/src/neighbors/cagra_search_half.cu

cpp/src/neighbors/ivf_pq/detail/ivf_pq_search_half_int64_t.cu

tfeher

Thanks Artem for the review, I have addressed the issues.

cpp/bench/ann/src/cuvs/cuvs_ivf_pq.cu

cpp/include/cuvs/neighbors/cagra.hpp

cpp/src/neighbors/ivf_pq/detail/ivf_pq_search_half_int64_t.cu

cjnolet · 2024-07-31T21:28:06Z

Sorry @tfeher, I understand that there were some bits which were not exposed in the prior port, but this still doesn't change the increase in binary size. We need to address this before we continue to merge changes that increase it.

I propose we look at things that can be consolidated and maybe try using JIT for some of these things. We have to take a step back and fix this at the source.

achirkin · 2024-09-27T11:54:52Z

Binary size: 679 -> 661 MB
The size is actually reduced, because I removed a few unused template instances and most of FP16-related instances were already in the binary (unused until this PR).

A quick check with bench-ann shows the FP16 benchmarks seem to work. I've checked that CAGRA is a little bit faster on FP16 vs FP32 on the glove dataset.

cpp/src/neighbors/detail/cagra/cagra_build.cuh

…his change breaking the code

docs/source/developer_guide.md

achirkin · 2024-09-27T19:07:42Z

/merge

tfeher added 5 commits July 30, 2024 01:44

Instantiate CAGRA for fp16 data type

9abf789

Merge remote-tracking branch 'origin/branch-24.08' into cagra_fp16

63323c9

Fix build instantiation

ef2d8c5

Merge remote-tracking branch 'origin/branch-24.08' into cagra_fp16

456ab6e

Remove debug comment

02faed9

tfeher requested review from a team as code owners July 30, 2024 22:02

github-actions bot added cpp CMake labels Jul 30, 2024

enable ANN benchmarks with FP16 input type for CAGRA and IVF-PQ

123558a

tfeher added bug Something isn't working non-breaking Introduces a non-breaking change labels Jul 30, 2024

tfeher requested a review from achirkin July 30, 2024 23:43

achirkin approved these changes Jul 31, 2024

View reviewed changes

tfeher added 2 commits July 31, 2024 14:40

Merge remote-tracking branch 'origin/branch-24.08' into cagra_fp16

d428ecd

Fix style

99eaaa1

cjnolet assigned tfeher Jul 31, 2024

tfeher mentioned this pull request Jul 31, 2024

Remove fp16 kernels that have no public entry point #268

Merged

Remove serialize_to_hnswlib for fp16 index

9eb8be5

tfeher mentioned this pull request Jul 31, 2024

Simplify template instantiations #269

Open

Merge remote-tracking branch 'origin/branch-24.08' into cagra_fp16

0519116

tfeher commented Jul 31, 2024

View reviewed changes

Remove implicit template instantiations during compilation of CAGRA

732aae6

tfeher requested a review from a team as a code owner August 1, 2024 00:44

tfeher added the DO NOT MERGE label Aug 1, 2024

Merge branch 'branch-24.08' into cagra_fp16

51102f3

achirkin changed the base branch from branch-24.08 to branch-24.10 September 27, 2024 08:03

achirkin added 4 commits September 27, 2024 10:58

Merge branch 'branch-24.10' into cagra_fp16

c6ae3b4

Merge branch 'branch-24.10' into cagra_fp16

ef3314c

Fix style

44a603a

Remove unused instances

d0841bd

achirkin removed the DO NOT MERGE label Sep 27, 2024

achirkin mentioned this pull request Sep 27, 2024

Persistent CAGRA kernel #215

Merged

achirkin reviewed Sep 27, 2024

View reviewed changes

cpp/src/neighbors/detail/cagra/cagra_build.cuh Outdated Show resolved Hide resolved

Undo switching nn_descent::build from internal to public API due to t…

243c618

…his change breaking the code

cjnolet reviewed Sep 27, 2024

View reviewed changes

docs/source/developer_guide.md Outdated Show resolved Hide resolved

docs/source/developer_guide.md Outdated Show resolved Hide resolved

Remove the changes to the docs, which are not relevant to the PR

ae3a96c

achirkin requested a review from cjnolet September 27, 2024 14:02

achirkin mentioned this pull request Sep 27, 2024

Reduce cagra build binary size #334

Open

Merge branch 'branch-24.10' into cagra_fp16

8cdd27f

cjnolet approved these changes Sep 27, 2024

View reviewed changes

rapids-bot bot merged commit c616a22 into rapidsai:branch-24.10 Sep 27, 2024
54 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FP16 API for CAGRA and IVF-PQ #264

FP16 API for CAGRA and IVF-PQ #264

tfeher commented Jul 30, 2024

cjnolet commented Jul 30, 2024

cjnolet commented Jul 30, 2024

tfeher commented Jul 30, 2024

achirkin left a comment

tfeher left a comment

cjnolet commented Jul 31, 2024

achirkin commented Sep 27, 2024 •

edited

Loading

achirkin commented Sep 27, 2024

FP16 API for CAGRA and IVF-PQ #264

FP16 API for CAGRA and IVF-PQ #264

Conversation

tfeher commented Jul 30, 2024

cjnolet commented Jul 30, 2024

cjnolet commented Jul 30, 2024

tfeher commented Jul 30, 2024

achirkin left a comment

Choose a reason for hiding this comment

tfeher left a comment

Choose a reason for hiding this comment

cjnolet commented Jul 31, 2024

achirkin commented Sep 27, 2024 • edited Loading

achirkin commented Sep 27, 2024

achirkin commented Sep 27, 2024 •

edited

Loading