
Question: Support for colbertv2.0 ? #355

Open
3 tasks
shatealaboxiaowang opened this issue Sep 10, 2024 · 8 comments

Comments

@shatealaboxiaowang

Model description

Hi,
Thanks for your source code. Can you add support for colbertv2.0 deployment?

Thank you!

Open source status

  • The model implementation is available on transformers
  • The model weights are available on huggingface-hub
  • I verified that the model is currently not running in the latest version (pip install infinity_emb[all] --upgrade)

Provide useful links for the implementation

No response

@michaelfeil
Owner

Colbert is a late-interaction model (stateful).

Please provide some example code using only torch and the transformers library. I think it requires some client-side computation (late interaction). Don't use any third-party packages such as the colbert package.
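For reference, a minimal sketch of the client-side late-interaction (MaxSim) scoring step in plain torch. The random tensors below are stand-ins for the per-token embeddings a ColBERT checkpoint would produce; loading the actual model via transformers is out of scope here.

```python
import torch
import torch.nn.functional as F

def maxsim_score(query_emb: torch.Tensor, doc_emb: torch.Tensor) -> float:
    """Late-interaction (MaxSim) score between one query and one document.

    query_emb: [num_query_tokens, dim] token embeddings
    doc_emb:   [num_doc_tokens, dim] token embeddings
    """
    q = F.normalize(query_emb, dim=-1)
    d = F.normalize(doc_emb, dim=-1)
    sim = q @ d.T  # [num_query_tokens, num_doc_tokens] cosine similarities
    # for each query token, take its best-matching document token, then sum
    return sim.max(dim=1).values.sum().item()

# stand-in embeddings; in practice these come from the ColBERT checkpoint
torch.manual_seed(0)
query = torch.randn(8, 128)
docs = [torch.randn(50, 128), torch.randn(60, 128)]
scores = [maxsim_score(query, d) for d in docs]
```

As a sanity check, scoring a query against its own embeddings yields exactly num_query_tokens, since each per-token max cosine similarity is 1.0.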

@wirthual
Collaborator

Hi @shatealaboxiaowang ,

You are able to run colbertv2 with infinity like so:

infinity_emb v2 --model-id colbert-ir/colbertv2.0
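Once the server is up (it listens on port 7997 by default), it exposes an OpenAI-compatible embeddings route; a request might look like the sketch below. Check infinity's docs for the exact response shape of late-interaction models.

```shell
# assumes the command above is running locally on the default port 7997
curl http://localhost:7997/embeddings \
  -H "Content-Type: application/json" \
  -d '{"model": "colbert-ir/colbertv2.0", "input": ["What is ColBERT?"]}'
```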

@simjak

simjak commented Jan 2, 2025

@wirthual can you run it on RunPod?

@michaelfeil
Owner

@simjak Same as Colleen: you can't use infinity serverless yet, but you can spin up your own serverful RunPod instance, etc.

@simjak

simjak commented Jan 2, 2025

@michaelfeil any plans to support serverless ColPali?
Is there an example of spinning up a ColPali pod?

@michaelfeil
Owner

port=7997
model1=michaelfeil/colqwen2-v0.1
model2=colbert-ir/colbertv2.0

# needs 16GB+
docker run -it --gpus all \
 -p $port:$port \
 michaelf34/infinity:latest \
 v2 \
 --model-id $model1 \
 --model-id $model2 \
 --port $port \
 --dtype bfloat16 \
 --batch-size 8 \
 --device cuda

@simjak

simjak commented Jan 3, 2025

@michaelfeil I tried to run on runpod, but got:

2025-01-03T13:14:45.429197971Z huggingface_hub.errors.EntryNotFoundError: 404 Client Error. (Request ID: Root=1-6777e2c5-19b02a382e0c73337abbfc1f;bd37d90f-3a39-4744-a1bd-c70bb380dfba)
2025-01-03T13:14:45.429203568Z Entry Not Found for url: https://huggingface.co/vidore/colqwen2-v1.0/resolve/main/config.json.
2025-01-03T13:14:45.429208543Z ERROR:    Application startup failed. Exiting.

Is there something wrong with this model? https://huggingface.co/vidore/colqwen2-v1.0


@simjak

simjak commented Jan 3, 2025

Oh, I needed to use the merged version https://huggingface.co/vidore/colqwen2-v1.0-merged (the non-merged repo apparently ships only adapter weights, hence the missing config.json).
