-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: My collection loses vectors for some IDs #2912
Comments
Hey @ChocoL0rd, thanks for reporting this. It seems like a severe problem, so I'll prioritize investigating. |
Closes #2922 Closes #2912 It might be related to #2905 ## Description of changes *Summarize the changes made by this PR.* - Improvements & Bug fixes - ... - New functionality - ... ## Test plan *How are these changes tested?* - [ ] Tests pass locally with `pytest` for python, `yarn test` for js, `cargo test` for rust ## Documentation Changes *Are all docstrings for user-facing APIs updated if required? Do we need to make documentation changes in the [docs repository](https://github.com/chroma-core/docs)?*
@ChocoL0rd, found the culprit of this, you can check #2922, if you want to dive deeper. Bottom line is we're cutting a new release that will fix this and possibly deprecating old releases (>0.5.5 <=0.5.12) on pypi. |
When new version is going to be released? 10 окт. 2024 г. 21:48 пользователь Trayan Azarov ***@***.***> написал:
@ChocoL0rd, found the culprit of this, you can check #2922, if you want to dive deeper. Bottom line is we're cutting a new release that will fix this and possibly deprecating old releases (>0.5.5 <=0.5.12) on pypi.
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you were mentioned.Message ID: ***@***.***>
|
@ChocoL0rd, shortly. Possibly in the next couple of hours. |
Ok, as i see in pypi new version is available, so i should just upgrade my chromadb, or recreate my db with this chromadb new version? 10.10.2024, 22:06, "Trayan Azarov" ***@***.***>***@***.***, shortly. Possibly in the next couple of hours.—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you were mentioned.Message ID: ***@***.***>
|
Hey @ChocoL0rd, yes. You don't necessarily need to recreate the whole DB. If you have lots of data and you don't want to re-embed then let me know I can help with recreating the embeddings that were missing. |
@tazarov, first of all, i want to know, which version is stable to use it. As i see 0.4.24 doesn't have this problem. Would you recommend use this version, or 0.5.13 and why? It's important question, because i also use 0.4.24, so i want to know what kind of problems can be there.
And yeah, i need to recreate my db. Thanks, for your answers |
What happened?
My collection loses vectors for some IDs after some time (or after a few queries, the cause of this is unknown to me). For example, if the collection contains 5202 elements, 202 embeddings disappear, leaving 5000 valid records, while 202 records no longer have embeddings. Similarly, for another collection with 70,634 elements, only 70,000 remain valid, and the others lose their embeddings.
I am certain that the data was written correctly into the collection, and I am sure that the code interacting with these collections does nothing other than queries and get operations. All embeddings are of the same size, and the collection functions normally until an unknown point in time.
Additionally, the command collection.get(include=["embeddings"]) stops working and throws an error:
However, if I run collection.get(include=[anything but embeddings]), it returns everything as expected.
Chroma version: chromadb==0.5.7
(If an update is necessary, how will this affect my collections? Will I need to recalculate them?)
Versions
chromadb==0.5.7
ubuntu linux 22.04
Relevant log output
No response
The text was updated successfully, but these errors were encountered: