Important
The V2 implementation of qdrant-haystack has been moved to deepset-ai/haystack-core-integrations.
Find the latest documentation here.
An integration of Qdrant vector database with Haystack by deepset.
The library finally allows using Qdrant as a document store, and provides an in-place replacement
for any other vector embeddings store. Thus, you should expect any kind of application to be working
smoothly just by changing the provider to QdrantDocumentStore.
qdrant-haystack might be installed as any other Python library, using pip or poetry:
pip install qdrant-haystackpoetry add qdrant-haystackOnce installed, you can already start using QdrantDocumentStore as any other store that supports
embeddings.
from qdrant_haystack import QdrantDocumentStore
document_store = QdrantDocumentStore(
"localhost",
index="Document",
embedding_dim=512,
recreate_index=True,
hnsw_config={"m": 16, "ef_construct": 64} # Optional
)The list of parameters accepted by QdrantDocumentStore is complementary to those used in the
official Python Qdrant client.
Qdrant Python client, from version 1.1.1, supports local in-memory/disk-persisted mode. That's a good choice for any test scenarios and quick experiments in which you do not plan to store lots of vectors. In such a case spinning a Docker container might be even not required.
The local mode was also implemented in qdrant-haystack integration.
In case you want to have a transient storage, for example in case of automated tests launched
during your CI/CD pipeline, using Qdrant Local mode with in-memory storage might be a preferred
option. It might be simply enabled by passing :memory: as first parameter, while creating an
instance of QdrantDocumentStore.
from qdrant_haystack import QdrantDocumentStore
document_store = QdrantDocumentStore(
":memory:",
index="Document",
embedding_dim=512,
recreate_index=True,
hnsw_config={"m": 16, "ef_construct": 64} # Optional
)However, if you prefer to keep the vectors between different runs of your application, it might be better to use on disk storage and pass the path that should be used to persist the data.
from qdrant_haystack import QdrantDocumentStore
document_store = QdrantDocumentStore(
path="/home/qdrant/storage_local",
index="Document",
embedding_dim=512,
recreate_index=True,
hnsw_config={"m": 16, "ef_construct": 64} # Optional
)If you prefer not to manage your own Qdrant instance, Qdrant Cloud might be a better option.
from qdrant_haystack import QdrantDocumentStore
document_store = QdrantDocumentStore(
"https://YOUR-CLUSTER-URL.aws.cloud.qdrant.io",
index="Document",
api_key="<< YOUR QDRANT CLOUD API KEY >>",
embedding_dim=512,
recreate_index=True,
)There is no difference in terms of functionality between local instances and cloud clusters.