fix: pass missing revision arg for lora adapter when loading multiple adapters #2510

Merged: 1 commit into main on Sep 12, 2024

Conversation

drbh (Collaborator) commented on Sep 11, 2024

This PR passes the missing adapter.revision param to load_module_map, which should resolve a bug that occurs when loading multiple LoRA adapters and at least one of them has a revision.
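
For reference, the shape of the change: the adapter's revision is now forwarded into the module-map loading call instead of being dropped. A rough sketch, assuming the load_module_map signature and return values match TGI's adapter-loading code (treat both as approximations, not the exact diff):

# Sketch of the fix: pass adapter.revision through to load_module_map
# so adapters pinned to a non-default revision resolve correctly.
module_map, adapter_config, adapter_weight_names, adapter_tokenizer = load_module_map(
    model_id,
    adapter.revision,  # previously omitted
    adapter.id,
    adapter.path,
    weight_names,
    trust_remote_code,
)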

Narsil merged commit 628334d into main on Sep 12, 2024 (10 of 11 checks passed) and deleted the fix-missing-lora-adapter-revision branch on Sep 12, 2024 at 15:04.
Narsil pushed a commit referencing this pull request on Sep 14, 2024: fix: pass missing revision arg for lora adapter when loading multiple adapters (#2510)
nbroad1881 (Contributor) commented on Sep 17, 2024

A customer is still reporting that this doesn't work.

Here is what I tried on Inference Endpoints and the resulting error (TGI image pulled 1 hour ago):

from huggingface_hub import create_inference_endpoint

# Deploy Phi-3-mini-4k-instruct on a single A10G, loading one LoRA adapter
# pinned to a specific commit via the adapter_id@revision syntax.
endpoint = create_inference_endpoint(
    "phi-3-lora-revision-test",
    repository="microsoft/Phi-3-mini-4k-instruct",
    framework="pytorch",
    task="text-generation",
    accelerator="gpu",
    vendor="aws",
    region="us-east-1",
    type="protected",
    instance_size="x1",
    instance_type="nvidia-a10g",
    custom_image={
        "health_route": "/health",
        "env": {
            "MAX_BATCH_PREFILL_TOKENS": "2048",
            "MAX_INPUT_LENGTH": "1024",
            "MAX_TOTAL_TOKENS": "1512",
            "MODEL_ID": "/repository",
            # adapter pinned to a commit SHA with @revision
            "LORA_ADAPTERS": "grounded-ai/phi3-hallucination-judge@5f5f8c0483200db2ceb4db66adfae9ce77273bde",
        },
        "url": "ghcr.io/huggingface/text-generation-inference:sha-ce85efa",
    },
    token="token",  # redacted placeholder
)

Error:

Exit code: 1. Reason:
│ /opt/conda/lib/python3.11/site-packages/huggingface_hub/utils/_validators.py │
│ :160 in validate_repo_id                                                     │
│                                                                              │
│   157 │   │   )                                                              │
│   158 │                                                                      │
│   159 │   if not REPO_ID_REGEX.match(repo_id):                               │
│ ❱ 160 │   │   raise HFValidationError(                                       │
│   161 │   │   │   "Repo id must use alphanumeric chars or '-', '_', '.', '-- │
│   162 │   │   │   " forbidden, '-' and '.' cannot start or end the name, max │
│   163 │   │   │   f" '{repo_id}'."                                           │
│                                                                              │
│ ╭───────────────────────────────── locals ─────────────────────────────────╮ │
│ │ repo_id = 'grounded-ai/phi3-hallucination-judge@5f5f8c0483200db2ceb4db6… │ │
│ ╰──────────────────────────────────────────────────────────────────────────╯ │
╰──────────────────────────────────────────────────────────────────────────────╯
HFValidationError: Repo id must use alphanumeric chars or '-', '_', '.', '--' 
and '..' are forbidden, '-' and '.' cannot start or end the name, max length is 
96: 
'grounded-ai/phi3-hallucination-judge@5f5f8c0483200db2ceb4db66adfae9ce77273bde'.
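
The locals panel shows the full adapter_id@revision string reaching validate_repo_id untouched, which suggests this code path never splits the @ suffix off before calling into huggingface_hub. A minimal sketch of the parsing that has to happen first (split_adapter_spec is a hypothetical helper, not TGI's actual function):

def split_adapter_spec(spec: str) -> tuple[str, str | None]:
    # "org/adapter@sha" -> ("org/adapter", "sha"); no "@" -> (spec, None)
    adapter_id, _, revision = spec.partition("@")
    return adapter_id, revision or None

adapter_id, revision = split_adapter_spec(
    "grounded-ai/phi3-hallucination-judge@5f5f8c0483200db2ceb4db66adfae9ce77273bde"
)
# adapter_id == "grounded-ai/phi3-hallucination-judge"
# revision  == "5f5f8c0483200db2ceb4db66adfae9ce77273bde"
# Only the bare adapter_id should reach hub validation/download; the revision
# belongs in a separate argument, e.g. hf_hub_download(adapter_id, filename, revision=revision).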

yuanwu2017 pushed a commit to yuanwu2017/tgi-gaudi referencing this pull request on Sep 26, 2024: fix: pass missing revision arg for lora adapter when loading multiple adapters (huggingface#2510)