Best effort support for all Hugging Face transformers models #616

Closed
@dwyatte

Description

With https://github.com/huggingface/text-generation-inference adopting a less friendly license, this seems like a good opportunity to add best-effort support for all Hugging Face transformers models that generate text, e.g., those loadable via AutoModelForCausalLM and AutoModelForSeq2SeqLM. Such models could then take advantage of vLLM's other serving features, while specific models retain their optimized implementations or gain them as they are implemented.
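To make the proposal concrete, here is a rough sketch of how the dispatch could work. It is not vLLM's actual code. `OPTIMIZED_ARCHITECTURES` and `select_backend` are hypothetical names invented for illustration, and the registry contents are placeholders. The only things assumed from transformers are the `architectures` and `is_encoder_decoder` fields that appear in a model's `config.json`.

```python
from typing import AbstractSet, Mapping

# Hypothetical registry of architectures that already have a hand-optimized
# implementation. The entries are placeholders, not a real list.
OPTIMIZED_ARCHITECTURES = frozenset({"LlamaForCausalLM", "GPT2LMHeadModel"})


def select_backend(
    config: Mapping[str, object],
    optimized: AbstractSet[str] = OPTIMIZED_ARCHITECTURES,
) -> str:
    """Pick an optimized implementation if one exists, else a generic fallback.

    `config` is assumed to be the parsed contents of a model's config.json.
    """
    # Prefer an optimized implementation when any listed architecture has one.
    for arch in config.get("architectures") or []:
        if arch in optimized:
            return f"optimized:{arch}"

    # Otherwise fall back to the generic transformers auto class.
    # Encoder-decoder models go through the seq2seq class.
    if config.get("is_encoder_decoder", False):
        return "fallback:AutoModelForSeq2SeqLM"
    return "fallback:AutoModelForCausalLM"


if __name__ == "__main__":
    print(select_backend({"architectures": ["LlamaForCausalLM"]}))
    print(select_backend({"architectures": ["SomeNewModelForCausalLM"]}))
    print(select_backend({"architectures": ["T5ForConditionalGeneration"],
                          "is_encoder_decoder": True}))
```

With this shape, adding an optimized implementation later only means registering its architecture name; models without one keep working through the fallback path.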
