Not able to use QLoRA models with vLLM #252
Comments
Thank you @zhuohan123 for the reply.
You just need to merge the LoRA adapter into the base model. vLLM doesn't support LoRA.
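A minimal sketch of what merging could look like with peft, assuming the QLoRA adapter was saved to /content/trained-model/ and the base model is tiiuae/falcon-7b (both are assumptions here; adjust them to your own checkpoints):

```python
# Hypothetical merge step: fold a QLoRA adapter back into the base model
# so the result is a plain HF checkpoint that vLLM can load.
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "tiiuae/falcon-7b"            # assumed base model the adapter was trained on
adapter_dir = "/content/trained-model"  # assumed directory containing the QLoRA adapter

# Merging cannot be done on 4-bit weights, so load the base model in fp16/fp32.
base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.float16)
model = PeftModel.from_pretrained(base, adapter_dir)

# Fold the LoRA weights into the base weights and save a standalone checkpoint.
merged = model.merge_and_unload()
merged.save_pretrained("/content/merged-model")
AutoTokenizer.from_pretrained(base_id).save_pretrained("/content/merged-model")
```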
@ehartford but merging has to be done in higher precision. Doesn't that defeat the purpose of keeping the base weights in low precision to speed up inference?
Closing in favour of the feature request #3225
I have trained a Falcon 7B model with QLoRA, but the inference time for outputs is too high, so I want to use vLLM to speed up inference. For that I used a code snippet to load the model path:
llm = LLM(model="/content/trained-model/")
But I am getting an error:
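For reference, a hedged sketch of the workflow the comments above suggest: point vLLM at the merged checkpoint (the path /content/merged-model is illustrative and carries over from the merge sketch), not at the raw adapter directory.

```python
# After merging the adapter into the base weights, the resulting plain
# checkpoint can be loaded directly with vLLM.
from vllm import LLM, SamplingParams

llm = LLM(model="/content/merged-model")  # merged checkpoint, no LoRA adapter
params = SamplingParams(temperature=0.8, max_tokens=128)

outputs = llm.generate(["Explain what QLoRA is in one sentence."], params)
for out in outputs:
    print(out.outputs[0].text)
```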