GPU consumption

when i load 13b llama in HF, GPU usage is about 26G. 

However, when load 13b llama in vllm, GPU usage is about 73G.

<img width="641" alt="image" src="https://github.com/vllm-project/vllm/assets/29653610/94d7848d-c5bc-481c-bf2a-a64aeeb1ca77">


Is this ususal?