Closed as not planned
Description
Feature request
I have my own implementation of a LogitsWarper in this format https://huggingface.co/docs/transformers/internal/generation_utils#transformers.LogitsProcessor that I would like to add to TGI. There are already warpers for temperature, top_p, etc., but I want to add my own. Note: I don't want my LogitsWarper to be merged in to the repo, since I have a vey specific use case -- I just want to know how it can be done.
Motivation
I want to have the benefits of TGI like batching and tensor parallelism while also adding custom LogitsWarper functionality.
Your contribution
I've tried editing the source files at text-generation-inference/server/text_generation_server/utils/logits_process.py
etc., but I started going down a rabbit hole of files that I needed to edit.