
Enhance Support for Larger Datasets and Buckets in Encoding #11

Open
EladGabay wants to merge 1 commit into linvon:main from EladGabay:main


@EladGabay
Contributor

This commit extends the encoding to handle item and bucket counts that exceed max(uint32). Previously, the encoding stored counts as uint32, even though the filter structure already supported larger values through uint. Until now, the filter supported larger datasets only partially, because not all buckets were utilized; see the changes in `generateIndexTagHash`, `altIndex` and `indexHash`.
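To illustrate the bucket-utilization point, here is a minimal sketch of how a 32-bit truncation in an index computation can leave high buckets unreachable. The function names `indexHash32` and `indexHash64` are hypothetical and are not the library's actual code; the sketch also assumes the bucket count is a power of two.

```go
package main

import "fmt"

// indexHash32 is a hypothetical index function that truncates the hash to
// 32 bits before masking, so it can never return an index >= 1<<32.
func indexHash32(hv uint64, numBuckets uint64) uint64 {
	return uint64(uint32(hv)) & (numBuckets - 1)
}

// indexHash64 is the hypothetical fixed variant: it keeps all 64 bits of the
// hash, so every bucket is reachable (numBuckets assumed to be a power of two).
func indexHash64(hv uint64, numBuckets uint64) uint64 {
	return hv & (numBuckets - 1)
}

func main() {
	numBuckets := uint64(1) << 33 // more buckets than max(uint32)
	hv := uint64(1)<<32 | 5       // a hash with a bit set above bit 31

	fmt.Println(indexHash32(hv, numBuckets)) // 5
	fmt.Println(indexHash64(hv, numBuckets)) // 4294967301
}
```

With the truncating variant, every index falls below 1<<32, so roughly half of the 1<<33 buckets in this example would never be used.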

Now, all references to bucket indices and item counts explicitly use uint64. A new encoding format accommodates larger filters. A prefix marker distinguishes the new format from the legacy one (up to max(uint32) items).

Decoding seamlessly supports both formats.
The encode method takes a legacy boolean parameter for gradual adoption.
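As a rough sketch of the idea (not the exact byte layout used in this PR), a header encoder and decoder with a prefix marker could look like the following. The layout, the marker value `0xFF`, and the names `header`, `encodeHeader` and `decodeHeader` are all assumptions made for illustration. In particular, the sketch assumes the first legacy byte is a small configuration value that can never equal the marker.

```go
package main

import (
	"encoding/binary"
	"errors"
	"fmt"
	"math"
)

// Hypothetical layout, for illustration only:
//   legacy: [tagsPerBucket:1][numItems:4][numBuckets:4]
//   large:  [marker:1][tagsPerBucket:1][numItems:8][numBuckets:8]
// largeMarker is an assumed value that tagsPerBucket never takes.
const largeMarker byte = 0xFF

type header struct {
	TagsPerBucket uint8
	NumItems      uint64
	NumBuckets    uint64
}

// encodeHeader writes the legacy layout when legacy is true and the counts
// fit in uint32; otherwise it writes the marker-prefixed large layout.
func encodeHeader(h header, legacy bool) ([]byte, error) {
	if h.TagsPerBucket == largeMarker {
		return nil, errors.New("tagsPerBucket collides with the format marker")
	}
	if legacy {
		if h.NumItems > math.MaxUint32 || h.NumBuckets > math.MaxUint32 {
			return nil, errors.New("counts do not fit the legacy format")
		}
		buf := make([]byte, 9)
		buf[0] = h.TagsPerBucket
		binary.LittleEndian.PutUint32(buf[1:], uint32(h.NumItems))
		binary.LittleEndian.PutUint32(buf[5:], uint32(h.NumBuckets))
		return buf, nil
	}
	buf := make([]byte, 18)
	buf[0] = largeMarker
	buf[1] = h.TagsPerBucket
	binary.LittleEndian.PutUint64(buf[2:], h.NumItems)
	binary.LittleEndian.PutUint64(buf[10:], h.NumBuckets)
	return buf, nil
}

// decodeHeader accepts both layouts, using the first byte to tell them apart.
func decodeHeader(b []byte) (header, error) {
	if len(b) > 0 && b[0] == largeMarker {
		if len(b) < 18 {
			return header{}, errors.New("truncated large header")
		}
		return header{
			TagsPerBucket: b[1],
			NumItems:      binary.LittleEndian.Uint64(b[2:]),
			NumBuckets:    binary.LittleEndian.Uint64(b[10:]),
		}, nil
	}
	if len(b) < 9 {
		return header{}, errors.New("truncated legacy header")
	}
	return header{
		TagsPerBucket: b[0],
		NumItems:      uint64(binary.LittleEndian.Uint32(b[1:])),
		NumBuckets:    uint64(binary.LittleEndian.Uint32(b[5:])),
	}, nil
}

func main() {
	h := header{TagsPerBucket: 4, NumItems: 1 << 33, NumBuckets: 1 << 31}
	b, err := encodeHeader(h, false)
	if err != nil {
		fmt.Println("encode failed:", err)
		return
	}
	got, err := decodeHeader(b)
	fmt.Printf("%+v %v\n", got, err)
}
```

In this sketch, asking for the legacy layout with counts above max(uint32) returns an error rather than silently truncating, which is one possible way to keep gradual adoption safe.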

@EladGabay
Contributor Author

@linvon would you like to take a look? 😊

@linvon
Owner

linvon commented Aug 21, 2023

> @linvon would you like to take a look? 😊

Sorry, busy with work, but I will find some time to handle this

@EladGabay
Contributor Author

Hi, @linvon , let me know if you need any help :)

@EladGabay
Contributor Author

@linvon gentle ping

@EladGabay
Contributor Author

Hi @linvon do you think it's going to be merged soon? 🙏
