Skip to content

FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs

License

Notifications You must be signed in to change notification settings

Said-Akbar/vllm-rocm

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CacheFlow

About

FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 83.5%
  • Cuda 11.2%
  • C++ 3.3%
  • C 0.8%
  • Shell 0.7%
  • CMake 0.4%
  • Dockerfile 0.1%